EXAMPLE: One-Variable Regression, With Intercept
From J. Johnston (1984) p19.

I. Data and Summary Stats
Observations          : n=5
Independent variables : k=1
With Intercept
Data Table
obs     | yi    | intcpt | xi,1
1       | 4     | 1      | 2
2       | 7     | 1      | 3
3       | 3     | 1      | 1
4       | 9     | 1      | 5
5       | 17    | 1      | 9
        |       |        |
sum     | 40    | 5      | 20
mean    | 8     | 1      | 4
StD ≡ σ | 5.568 | 0      | 3.162
Means and Standard Deviations
Mean : Mx = Σxi / n
Var  : Varx ≡ σx² = Σ(xi - Mx)² / (n-1)
StD  : StDx ≡ σx = Varx^(1/2)

    | Mean | Var   | StD
y   | 8    | 31.00 | 5.568
x1  | 4    | 10.00 | 3.162
Covariance Matrix -- Cov(xi,xj) = Σ[(xi - Mxi)(xj - Mxj)] / (n-1)
NOTE: be careful of MS Excel's COVAR() function, which divides by n instead of n-1.
    | y     | x1
y   | 31.00 | 17.50
x1  | 17.50 | 10.00

Correlation Matrix -- Corr(xi,xj) = Cov(xi,xj) / (σi σj)
    | y     | x1
y   | 1.000 | 0.994
x1  | 0.994 | 1.000
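As a cross-check, here is a minimal numpy sketch (our own illustration, not from Johnston; variable names are ours) that reproduces the summary statistics above. Note that numpy's cov(), unlike Excel's COVAR(), divides by n-1 by default.

    import numpy as np

    # Data from the table above.
    y = np.array([4.0, 7.0, 3.0, 9.0, 17.0])
    x = np.array([2.0, 3.0, 1.0, 5.0, 9.0])

    print(y.mean(), x.mean())            # 8.0 4.0
    print(y.var(ddof=1), x.var(ddof=1))  # 31.0 10.0  (ddof=1 divides by n-1)
    print(y.std(ddof=1), x.std(ddof=1))  # 5.568... 3.162...
    print(np.cov(y, x))                  # [[31.  17.5] [17.5 10. ]]  (n-1 by default)
    print(np.corrcoef(y, x))             # [[1.    0.994] [0.994 1.   ]]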
II. Regression Calculations
yi = alpha + β1·xi,1 + ui
The basic equation in matrix form is:
y = Xb + e
The basic input matrices are:
y (dependent variable)                               is (n x 1) or (5 x 1)
X (independent vars, including the intercept column) is (n x k) or (5 x 2)
b (betas)                                            is (k x 1) or (2 x 1)
e (errors)                                           is (n x 1) or (5 x 1)
(here k = 2 counts the intercept column, matching the σ² and F-stat formulas below)
Minimizing the sum of squared errors using calculus results in the OLS equation: b = (X'X)^-1 X'y
To minimize the sum of squared errors of a k-dimensional line that describes
the relationship between the independent variables and y, we find the set of
slopes (betas) that minimizes
    Σ(i=1 to n) ei²
Re-written in linear algebra, we seek to min e'e.
Rearranging the regression model equation, we get e = y - Xb.
So e'e = (y - Xb)'(y - Xb) = y'y - 2b'X'y + b'X'Xb (see Judge et al (1985) p14).
Differentiating with respect to b we get 0 = -2X'y + 2X'Xb -> X'Xb = X'y.
Rearranging -> b = (X'X)^-1 X'y
So to obtain the elements of the (kx1) vector b we need the elements
of the (kxk) matrix (X'X)^-1 and of the (kx1) matrix X'y.
Calculating X'y is easy (see (1) below), but (X'X)^-1 requires
first calculating X'X, then finding the cofactors -- see (4) -- and
the determinant -- see (3) -- in order to invert.
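A minimal Python/numpy sketch of this formula (our own illustration, not from the source); np.linalg.solve is applied to the normal equations X'Xb = X'y rather than forming the inverse explicitly:

    import numpy as np

    y = np.array([4.0, 7.0, 3.0, 9.0, 17.0])
    X = np.array([[1, 2], [1, 3], [1, 1], [1, 5], [1, 9]], dtype=float)

    # Solve (X'X) b = X'y  -- equivalent to b = (X'X)^-1 X'y.
    b = np.linalg.solve(X.T @ X, X.T @ y)
    print(b)  # [1.   1.75]  -> alpha = 1.0000, beta1 = 1.750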
(1) X'y Matrix (2x1)
X'y = [Σyi, Σxi·yi]' = [40, 230]'

(2) X'X Matrix (2x2)
X'X = [ n    Σxi  ] = [  5   20 ]
      [ Σxi  Σxi² ]   [ 20  120 ]

(3) Determinant Det(X'X) ≡ |X'X|, i.e. the determinant of the matrix X'X
Det(X'X) = 5*120 - 20*20 = 200

(4) Cofactors(X'X), i.e. the cofactor matrix of X'X (2x2)
[ 120  -20 ]
[ -20    5 ]

(5) Adj(X'X), i.e. the adjugate matrix of X'X: this is just the transpose of the cofactor matrix (2x2). For a symmetric matrix, it is the same as the cofactor matrix.

(6) Inverse Matrix, inv(X'X) ≡ (X'X)^-1 = adj(X'X)/|X'X| = adj(X'X)/200 (2x2)
[  0.6000  -0.1000  ]
[ -0.1000   0.02500 ]
(7) Beta Matrix (β)
Finally we can calculate b through matrix multiplication:
b = [(X'X)^-1]·[X'y], this is (2x1).

        (X'X)^-1              X'y
b = [  0.6000  -0.1000  ] [  40 ] = [ 1.0000 ]
    [ -0.1000   0.02500 ] [ 230 ]   [ 1.750  ]
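The same hand calculation as steps (1)-(7), sketched in numpy (our illustration): determinant, adjugate, inverse, and finally b.

    import numpy as np

    XtX = np.array([[5.0, 20.0],
                    [20.0, 120.0]])                  # (2) X'X
    Xty = np.array([40.0, 230.0])                    # (1) X'y

    det = XtX[0, 0]*XtX[1, 1] - XtX[0, 1]*XtX[1, 0]  # (3) = 200
    adj = np.array([[ XtX[1, 1], -XtX[0, 1]],
                    [-XtX[1, 0],  XtX[0, 0]]])       # (4)/(5) cofactors = adjugate here
    inv = adj / det                                  # (6) (X'X)^-1
    b = inv @ Xty                                    # (7) b = (X'X)^-1 X'y
    print(inv)  # [[ 0.6   -0.1  ] [-0.1    0.025]]
    print(b)    # [1.   1.75]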
REPORT
obs | yhatobs = Σβi·xi,obs | yhatobs | yobs | My | (yobs-yhatobs)² | (yhatobs-My)² | (yobs-My)² | eobs = yobs - yhatobs | eobs²
 1  | 1.0000*1 + 1.750*2 = |  4.500  |  4   |  8 | 0.2500          | 12.25         | 16         | 4 - 4.500 = -0.5000   | 0.2500
 2  | 1.0000*1 + 1.750*3 = |  6.250  |  7   |  8 | 0.5625          | 3.063         | 1          | 7 - 6.250 =  0.7500   | 0.5625
 3  | 1.0000*1 + 1.750*1 = |  2.750  |  3   |  8 | 0.06250         | 27.56         | 25         | 3 - 2.750 =  0.2500   | 0.06250
 4  | 1.0000*1 + 1.750*5 = |  9.750  |  9   |  8 | 0.5625          | 3.063         | 1          | 9 - 9.750 = -0.7500   | 0.5625
 5  | 1.0000*1 + 1.750*9 = | 16.75   | 17   |  8 | 0.06250         | 76.56         | 81         | 17 - 16.75 = 0.2500   | 0.06250
sum |                      |         |      |    | RSS = 1.500     | ESS = 122.5   | TSS = 124  |                       | e'e = Σeobs² = 1.500
where RSS = Σ(yobs - yhatobs)², ESS = Σ(yhatobs - My)², and TSS = Σ(yobs - My)².
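A short numpy sketch (ours, not from the source) reproducing the REPORT table's fitted values, residuals, and the RSS/ESS/TSS decomposition:

    import numpy as np

    y = np.array([4.0, 7.0, 3.0, 9.0, 17.0])
    X = np.array([[1, 2], [1, 3], [1, 1], [1, 5], [1, 9]], dtype=float)
    b = np.array([1.0, 1.75])

    yhat = X @ b                        # [ 4.5   6.25  2.75  9.75 16.75]
    e = y - yhat                        # residuals
    RSS = (e**2).sum()                  # 1.5
    ESS = ((yhat - y.mean())**2).sum()  # 122.5
    TSS = ((y - y.mean())**2).sum()     # 124.0 = ESS + RSS
    print(RSS, ESS, TSS)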
(11) Betas and their t-Stats
From the covariance matrix of b, Cov(b) = σ²(X'X)^-1:
Var(βi) = σ²·vii, where vii is the ith diagonal element of (X'X)^-1,
and σ² = e'e / (n-k)  (k = number of independent variables, plus 1 for the intercept if present).
Std(βi) = square root of Var(βi)
TStat(βi) = βi / Std(βi)
Estimate of σ² = 1.500 / 3 = 0.5000

      | Coef value                        | StD(β)                        | tStat(β)
alpha | 0.6000*40 + -0.1000*230 = 1.0000  | (0.5000*0.6000)^1/2 = 0.5477  | 1.0000 / 0.5477 = 1.826
β1    | -0.1000*40 + 0.02500*230 = 1.750  | (0.5000*0.02500)^1/2 = 0.1118 | 1.750 / 0.1118 = 15.65
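The standard errors and t-stats, sketched in numpy (our illustration, reusing the residuals from the REPORT table):

    import numpy as np

    X = np.array([[1, 2], [1, 3], [1, 1], [1, 5], [1, 9]], dtype=float)
    b = np.array([1.0, 1.75])
    e = np.array([-0.5, 0.75, 0.25, -0.75, 0.25])  # residuals from the REPORT table
    n, k = X.shape                                 # k = 2 counts the intercept

    sigma2 = (e @ e) / (n - k)                     # 0.5
    v = np.diag(np.linalg.inv(X.T @ X))            # [0.6  0.025]
    std_b = np.sqrt(sigma2 * v)                    # [0.5477  0.1118]
    print(b / std_b)                               # [ 1.826  15.65 ]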
(12) Table of Outputs:
yobs =  alpha  +  β1      Xobs,1 + eobs
        1.0000    1.750
       (1.826)   (15.65)            <- tstats
r² = 0.987903    adj r² = 0.983871
(13) RSS    = Σ(y - yhat)²  = 1.5
     TSS    = Σ(y - My)²    = 124
     ESS(a) = Σ(yhat - My)² = 122.5
We use ESS(b) below because the direct ESS(a) formula goes wrong when there is no intercept.
     ESS(b) = TSS - RSS     = 122.5
note: TSS = ESS + RSS
(14) r2 = ESS/TSS = 0.987903
(15) adjusted r2 = 1 - (1 - r2)(n-1)/(n-k) = 0.983871
(16) F-stat = [ESS/(k-1)] / [RSS/(n-k)] = [122.5/1] / [1.5/3] = 245
see Johnston (1984) p186
F measures the joint significance of all explanatory variables.
Alternatively: F-stat = [r²/(k-1)] / [(1-r²)/(n-k)]
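These three summary statistics in a few lines of Python (our illustration):

    RSS, ESS, TSS = 1.5, 122.5, 124.0
    n, k = 5, 2                            # k counts the intercept

    r2 = ESS / TSS                         # 0.987903...
    adj_r2 = 1 - (1 - r2)*(n - 1)/(n - k)  # 0.983871...
    F = (ESS/(k - 1)) / (RSS/(n - k))      # 245.0
    print(r2, adj_r2, F)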
(17) Durbin-Watson Statistic (DW or d), which measures autocorrelation of the residuals.
DW = 3.273
________________________________________________________
Note: RSS, ESS and TSS stand for
Residual Sum of Squares (RSS),
Explained Sum of Squares (ESS), and
Total Sum of Squares (TSS).
However, ESS is sometimes referred to as the Regression Sum of Squares,
and RSS is sometimes referred to as the Sum of Squared Residuals.
Note: an alternative way of calculating TSS and ESS is
TSS = y'Ay
ESS = bv'Xv'Ay, where A = I - (1/n)ii' is the deviations-from-means (centering) matrix,
and bv', Xv' are b' and X' without the intercept row/column.
RSS = TSS - ESS
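A numpy sketch (ours) of this alternative route, with A built explicitly as the centering matrix:

    import numpy as np

    y = np.array([4.0, 7.0, 3.0, 9.0, 17.0])
    xv = np.array([2.0, 3.0, 1.0, 5.0, 9.0])  # X without the intercept column
    bv = 1.75                                 # b without the intercept

    n = len(y)
    A = np.eye(n) - np.ones((n, n)) / n       # centering matrix: Ay = y - mean(y)
    TSS = y @ A @ y                           # 124.0
    ESS = bv * (xv @ A @ y)                   # 122.5
    print(TSS, ESS, TSS - ESS)                # RSS = 1.5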
Bibliography
J. Johnston (1984). Econometric Methods, 3rd ed. McGraw-Hill, New York.
Judge et al. (1985). The Theory and Practice of Econometrics, 2nd ed. Wiley, New York.
Donald F. Morrison (1990). Multivariate Statistical Methods, 3rd ed. McGraw-Hill, New York.
A. H. Studenmund (1997). Using Econometrics: A Practical Guide, 3rd ed. Addison-Wesley, Reading.