Stat 601, Fall 2000, Class 10




What you need to have learnt from Class 9

*
The partial F test, and why you would use it.
*
 The partial-F 
    \begin{verbatim}

        /    2                 2     \     /Number of 
       |    R        -        R      |    / variables
        \    BIG               SMALL /   / in the subset.
        __________________________________________________________________

         /                      2     \     / Number of observations
        |    1        -        R      |    / minus number of parameters
         \                      BIG   /   / in Big model. (inc. intercept).

Example

Page 152 of the Case Book. Testing

\begin{displaymath}H_0: \beta_2 \& \beta_3 = 0 \quad v. \quad H_1: \beta_2 \mbox{\rm or} \beta_3 \ne 0.\end{displaymath}

Regression for time series

*
Objective: model a time series.
*
Example: Model default rates on mortgages as a function of interest rates.
*
Problem: Time series often have autocorrelated error terms which violates the standard assumption of independence.
*
Definition: Autocorrelation - successive error terms are dependent (see p.41 and p.300 of the Bulk Pack).
*
Diagnostics.
*
Key graphic - residuals plotted against time. Tracking in the residual plots.
*
Look at the Durbin-Watson statistic. Less than 1.5 or over 2.5 suggests a problem.
*
Correlation of the residuals is roughly 1 - DW/2.
*
Consequences of positive autocorrelation:
*
Over-optimistic about the information content in the data.
*
Standard errors for slopes too small, confidence intervals too narrow.
*
Think variables are significant when really they are not.
*
False sense of precision.
*
Fix ups.
*
Use differences of both Y and X, not raw data (pp.326-327).
*
Include lagged residuals in the model (p. 309).
*
Include lag Y in the model (as an X-variable p.331).
*
Benefits of differencing.
*
Often reduces autocorrelation.
*
Can reduce collinearity between X-variables.

Two transforms

Linear


\begin{displaymath}Y = \beta_0 + \beta_1 X + \epsilon,\end{displaymath}

or, say X is in grams and we re-express as kilos:

\begin{displaymath}Y = \beta_0 + 1000 \times \beta_1 X/1000 + \epsilon.\end{displaymath}

Essentially the same equation but the original $\beta_1$ would be 1000 times larger in the second equation.

Say we transform Y but not X, Y is originally in cm, but we divide by 100 to get to meters:

\begin{displaymath}Y/100 = \beta_0/100 + (\beta_1/100) X + \epsilon/100.\end{displaymath}

Notice how the slope and intercept change. The RMSE would fall by a factor of 100 too.

Log transforms

Note: all percent change interpretations for log transforms are valid only if the percent change considered is small. The smaller it is the better the approximation.

Four cases:

*
$Av(Y\vert X) = \beta_0 + \beta_1 X.$
*
$Av(Y\vert X) = \beta_0 + \beta_1 ln(X).$
*
$Av(ln(Y)\vert X) = \beta_0 + \beta_1 X.$
*
$Av(ln(Y)\vert X) = \beta_0 + \beta_1 ln(X).$

Four respective interpretations for $\beta_1$:

*
For a 1 unit change in X, the average of Y changes by $\beta_1$.
*
For a 1 percent change in X, the average of Y changes by $\beta_1/100$.
*
For a 1 unit change in X, the average of Y changes by 100 $\beta_1$ percent.
*
For a 1 percent change in X, the average of Y changes by $\beta_1$ percent - the economist's elasticity definition.


*
Plug in numbers if in doubt: take $\beta_0 = 5$ and $\beta_1 = 0.5$.
*
Av(ln(Y)|X) = 5 + 0.5 ln(X).
*
Calculate Av(ln(Y)) at X = 100: $ln(Y) = 5 + 0.5 \times ln(100) = 7.3026$, so Y = 1484.13
*
Increase X by 1% (X = 101) and recalculate: $ln(Y) = 5 + 0.5 \times ln(101) = 7.3075$, so Y = 1491.53
*
Y has gone from 1484.13 to 1491.53, or in percent terms: (1491.53 - 1484.13)/1484.13 = 0.00498 = 0.498%, which is approximately 0.5%.


*
Plug in numbers if in doubt: take $\beta_0 = 5$ and $\beta_1 = 0.03$.
*
Av(ln(Y)|X) = 5 + 0.03 X.
*
Calculate Av(ln(Y)) at X = 50: $ln(Y) = 5 + 0.03 \times 50 = 6.5$, so Y = 665.14
*
Increase X by 1 (X = 51) and recalculate: $ln(Y) = 5 + 0.03 \times 51 = 6.53$, so Y = 685.40
*
Y has gone from 665.14 to 685.40, or in percent terms: (685.40 - 665.14)/665.14 = 0.0305 = 3.05%, which is approximately 3%.




2000-12-15