Multiple Liner Regression (2)
Multiple Liner Regression
(2)
|
|
Summaries
Dummy
Variables –
independent variables with binary features
- Independent variables represent
a complete set of classification
- Y^ = b0 + sum(bi*Xi^)
- b0 represents the average Y due
to the last classification not specified. (this last classification is not
specified to prevent linear dependency between the dummy variables as all
information are contained in Y and Xi)
- bi represent the different
between the effect of Xi and the last classification
- DOF = n (data point) –
(k+1) (number of classification, k is the number of independent variable)
Heteroskedasticity: error term variance is not constant across the
observations
- Unconditional Heteroskedasticity: does not depend on the independent
variable. Spread is random. Does not cause much problem
- Conditional Heteroskedasticity:
e.g. variance increases as the value of the independent variable increases
=> cause big problem
Problem:
standard error is too small, since the expected value is unchanged =>
increases type I error (reject what should not be rejected) (Power = 1- Type II
error)
Breusch-Pagan Test for conditional heteroskedasticity:
Chi-test
with nR2 with DOF = k (# of independent variable), n is the # of
sample
R2
is obtained by regressing the 1st regression
squared residual against the independent variable
Correction:
Use White-corrected standard error if there is heteroskedasticity
May 1st, 2008 in
CFA - LEVEL 2, Quantitative Posted by Editor