Case 9

CHAPTER FIVE

Case #9: NEAR MULTICOLLINEARITY

Goal: This assignment introduces multiple regression by examining the problem of model specification known as near multicollinearity. Specifically, it introduces:

Estimating the Demand for Money with Multiple Regression
How Near Multicollinearity Effects the Quality of OLS regression
How to change Model Specification to remove Near Multicollinearity.

Problem Spreadsheet

The spreadsheet for this problem is CH5_Case1.xls. It contains the following data:

Variable	Data Range
REALMONEY	1980M1-1989M12
INCOME	1980M1-1989M12
SHORTRATE	1980M1-1989M12
MEDRATE	1980M1-1989M12
LONGRATE	1980M1-1989M12

The series REALMONEY is the monthly demand for real money services. Monetary theory argues that people demand the real services (purchasing power) of money. Accordingly, our dependent variable must be expressed in real terms and transformed into logs to suppress any heteroscedasticity. Accordingly, we calculated monthly REALMONEY using a spreadsheet as the log of the ratio of the M1 money supply over the consumer price index (CPI):

REALMONEY = log(M1/CPI)

The series INCOME is the log of seasonally adjusted Personal Income.

Note: The demand for money and income series are log transformations of the original data. This adjustment is made to suppress any heteroskedasticity in these series.

The series SHORTRATE is the Federal Funds interest rate, which is the rate banks charge on short-term reserves.

The series MEDRATE is Municipal Bond yield and represents the average rate of a portfolio of municipal notes.

The series LONGRATE is the Mortgage interest rate representing the average fixed mortgage interest rate.

Note: To help in the interpretation of our multiple regression results, we have named the interest rate variables according to their relative average maturity.

Explaining the Demand for Money

Milton Friedman is considered by many as the father of the "Monetarist" school of economic thought, which arose as a challenge to modern Keynesian macroeconomics in the late 1960s. Friedman argued that the supply of money effectively determined all nominal prices, and monetary policy was an effective tool of stabilization policy. Central to Friedman’s arguments was the assumption that the demand for money was stable, or at least, predictable. This led empirical researchers to develop forecasting models of the demand for money for purposes of macroeconomic forecasting.

The conventional theory of the demand for money concentrates on two motives for holding money:

1)People demand money to facilitate transactions, termed the Transactions Motive. Accordingly, the demand for money should have a positive correlation with Personal Income (a measure of aggregate economic activity).

2)Since money is an asset with special properties, the demand for money should be inversely related to the yields on substitute assets. This approach, termed the Portfolio Motive suggests that the demand for money is negatively related to interest rates and yields on substitute assets.

We can summarize these factors in a function, explaining behavior of the demand for money:

Real Demand for Money = f(Income, Shortrate, Medrate, Longrate),

Where we expect the slope coefficient on the income variable to be positive, whereas the slope coefficients on the interest rate variables should all be negative.

Estimating the Demand for Money

The demand for money can be estimated by applying Ordinary Least Squares to the following multiple regression model:

REALMONEY = a + b1INCOME + b2SHORTRATE + b3MEDRATE + b4LONGRATE + e

According to theory we expect b1> 0 and b2, b3, and b4 to be negative.

The fitted regression line is:

Multiple Regression -- Result Formula

*realmoney = -0.85829 + ( (income) 0.357183 ) + ( (longrate) * -0.028359 ) + ( (medrate) * -0.001423 ) + ( (shortrate) * 0.004384 )**

Details on coefficient estimates and their significance levels are reported in the table below.

Audit Trail -- Coefficient Table (Multiple Regression Selected)
Series	Included		Standard			Overall
Description	in Model	Coefficient	Error	T-test	P-value	F-test
income	Yes	3.57E-01	2.87E-02	12.46	0.00
longrate	Yes	-2.84E-02	7.44E-03	-3.81	0.00
medrate	Yes	-1.42E-03	7.47E-03	-0.19	0.85
realmoney	Dependent	-8.58E-01	2.56E-01	-3.36	0.00	404.91
shortrate	Yes	4.38E-03	1.95E-03	2.25	0.03

Finally, some summary accuracy statistics are reported below.

Accuracy Measures	Value
AIC		-469.06
BIC		-466.27
Sum Squared Error (SSE)	0.14
R-Square		93.37%
Adjusted R-Square	93.14%

Question #1: Do the coefficients on the interest rate variables have the correct signs? Can we reject the null that their respective slopes are zero using a two-tailed test with significance level of 5 percent?

ANSWER:

Testing for Multicollinearity

Past researchers have noted that any multiple regression estimated with three different interest rate variables as independent variables is likely to be plagued by Near Multicollinearity. Specifically, since interest rates tend to be highly correlated, OLS regression should produce large errors, i.e., unreliable results.

This is because OLS has trouble isolating the partial effects among independent variables because one or more are highly correlated, i.e., react to the same influence.

To examine for possible Near Multicollinearity we estimate the correlation matrix of the interest-rate variables summarized in the following table.

Audit Trail -- Correlation Coefficient Table
Series	Included
Description	in Model	income	longrate	medrate	realmoney	shortrate
income	Yes	1.00	-0.79	-0.64	0.92	-0.70
longrate	Yes	-0.79	1.00	0.94	-0.90	0.82
medrate	Yes	-0.64	0.94	1.00	-0.80	0.66
realmoney	Dependent	0.92	-0.90	-0.80	1.00	-0.73
shortrate	Yes	-0.70	0.82	0.66	-0.73	1.00

Question #2: Based upon the regression results above and the correlation matrix of interest rates, is near multicollinearity a problem in the former regression? Explain

ANSWER:

In addition, while the F-test for the complete model shows the presence of a significant relationship among the set of explanatory variables with the dependent variable, some of the individual parameter estimates are insignificant as shown by their t-ratios. This, once again, suggests that multicollinearity may be a problem.

Dealing with Multicollinearity

A way to correct for multicollinearity is to change model specification. Specifically, since all three interest-rate variables measure the same effect (the cost of credit), we should try estimating the model with only one interest rate.

Using FORECASTX^TM we re-estimated the demand for money using only one interest rate variable, the short-term interest rate.

The fitted revised model is:

Multiple Regression -- Result Formula

*realmoney = -2.30 + ( (income) 0.502607 ) + ( (shortrate) * -0.006565 )**

Coefficient estimates and standard errors of the revised model are reported below.

Audit Trail -- Coefficient Table (Multiple Regression Selected)
Series	Included		Standard			Overall
Description	in Model	Coefficient	Error	T-test	P-value	F-test
income	Yes	0.50	0.03	16.75	0.00
realmoney	Dependent	-2.30	0.26	-8.97	0.00	364.00
shortrate	Yes	-0.01	0.00	-3.48	0.00

Finally, the following accuracy statistics were estimated for the revised model.

Accuracy Measures	Value
AIC		-380.68
BIC		-377.90
Sum Squared Error (SSE)	0.29
R-Square		86.15%
Adjusted R-Square	85.92%

Question #3: Does the coefficient on the SHORTRATE variable now have the correct sign? Can we reject the null that this coefficient is zero at the 95 percent level of confidence?

ANSWER:

Question #4: Explain the differences between the first and second model specifications? Which model specification is superior?

ANSWER:

Student Practice Questions

Question #1: Redo the assignment with the addition of an international variable to the list of explanatory variables used to predict the demand for money. Specifically, does the addition of the exchange rate or the trade balance increase our ability to forecast the demand for money? (Note: The fact that foreign interests influence the U.S. demand for money is a serious issue to monetary authorities).

Question #2: Select four macroeconomic variables at random and add to the explanatory variables used to predict the demand for money. Redo the regression with all the independent variables, including the four new ones. Contrast and compare the R² and AIC and SIC statistics between the two regressions. Comment.