MS Mod 19: Correlation – practice problems


MS Mod 19: Correlation – practice problems

Author
Message
NEAS
Supreme Being
Supreme Being (5.9K reputation)Supreme Being (5.9K reputation)Supreme Being (5.9K reputation)Supreme Being (5.9K reputation)Supreme Being (5.9K reputation)Supreme Being (5.9K reputation)Supreme Being (5.9K reputation)Supreme Being (5.9K reputation)Supreme Being (5.9K reputation)

Group: Administrators
Posts: 4.3K, Visits: 1.3K

MS Module 19: Correlation – practice problems

(The attached PDF file has better formatting.)

Exercise 19.1: Correlation

A regression model Yj = β0 + β1 Xj + εj has N = 11 observations. The sample correlation between X and Y is 0.60. We test the null hypothesis H0: ρ = 0 (the true correlation between the X and Y variables is zero).

A.    What is the t value to test the null hypothesis?
B.    What is the p value to test the null hypothesis?

Part A: The t value is R √(n-2) / √(1– R2) = 0.6 × (11 – 2)0.5 / (1 – 0.62)0.5 = 2.25000

Part B: The t distribution has n-2 degrees of freedom, so the p value for a two-tailed test is 0.051 (table look-up or spread-sheet function).

Question: The sample correlation is 0.60, which is much different from zero, yet the p value is 5.1%, which does not satisfy even a 5% significance level.

Answer: The scenario has only 11 observations. Even if the true correlation is zero, a sample with only a few observations often shows a high sample correlation.

Exercise 19.2: Correlation and β1

A linear regression with 11 data points has an estimated β1 of 4.5 and a sample correlation between the X and Y values of 0.60.

A.    What is the t value to test the null hypothesis that the correlation ρ is zero?
B.    What is the t value to test the null hypothesis that β1 is zero?
C.    What is the standard deviation of the estimate of β1?

Part A: The t value to test the null hypothesis that the correlation ρ is zero is r √(n-2) / √(1– r2) =

    0.6 × (11 – 2)0.5 / (1 – 0.62)0.5 = 2.25000

Part B: The t value to test the null hypothesis that β1 is zero is 1 / σ(1) = 4.5 / σ(1), where

●    1 is the estimate of β1.
●    s(1) is the standard deviation of the estimate of β1.

Part C: The two tests are the same: ρ = 0 implies β1 = 0 and vice versa.

    4.5 / s(1) = 2.25 ➾ s(1) = 4.5 / 2.25 = 2.


Exercise 19.3: Fisher transformation

X and Y are a bivariate normal distribution from which a sample of 40 observations is taken. The sample correlation between X and Y is 0.833. We test the null hypothesis H0: ρ = 0.750. The alternative hypothesis is Ha: ρo > 0.750.

A.    What is the Fisher transform V of the random variable R of the correlation R between X and Y?
B.    What is the Fisher transform of the observed correlation ρ?
C.    What is the distribution of V?
D.    What is the Fisher transform of the correlation ρ0 assumed in the null hypothesis?
E.    What is the z value to test this null hypothesis?
F.    What is the p value for this test of the null hypothesis?
G.    What is the 95% confidence interval for the true value of the Fisher transform of the correlation?
H.    What is the 95% confidence interval for the true value of the correlation?

Part A: The Fisher transform V = ½ ln( (1+R)/(1-R) )

Question: Why do we use a Fisher transform of the sample correlation?

Answer: The correlation is bounded by –1 and +1, and it is not normally distributed. If the sample correlation is from a bivariate normal distribution, the Fisher transform of the correlation has (approximately) a normal distribution with a mean of ½ ln( (1+R)/(1-R) ) and a variance of 1/(n-3). The textbook notes that “the rationale for the transformation is to obtain a function of R that has a variance independent of r; this would not be the case with R itself.”

Part B: The Fisher transform of ρ = 0.833 is 0.5 × ln( (1 + 0.833) / (1 – 0.833) ) = 1.197858

Part C: V has a normal distribution with mean μ = 1.19786 and variance σ2 = 1/(40-3) = 1/37.

Part D: The Fisher transform of ρ = 0.75 is 0.5 × ln( (1 + 0.75) / (1 – 0.75) ) = 0.972955

Part E: The z value is ( ρ – ρo ) / standard deviation of V = (1.19786 – 0.97296) / (1 / 370.5) = 1.36801

Part F: The p value for a one-sided test is 1 – CDF(1.368) = 0.08565. The p value for a two sided test is twice this value.

Part G: The z value for a two-sided 95% confidence interval is 1.96. The 95% confidence interval is

( 1.19786 – 1.96 / 370.5, 1.19786 + 1.96 / 370.5 ) = ( 0.87564, 1.52008 ).

Part H: The inverse of the Fisher transform is ( e2x – 1) / ( e2x + 1).

●    (exp(2 × 0.87564) – 1) / (exp(2 × 0.87564) + 1) = 0.70423
●    (exp(2 × 1.52008) – 1) / (exp(2 × 1.52008) + 1) = 0.90871

The 95% confidence interval for the correlation is (0.70423, 0.90871).

Question: The observed correlation 0.833 is not the midpoint of this confidence interval.

Answer: The correlation does not have a normal distribution. Suppose we had a sample of four points with a sample correlation of 0.980. With only four points, the observed correlation may be due to chance. The 95% confidence interval is wide, but the correlation can not be greater than 1.000. The confidence interval of the Fisher transform is symmetric, and the Fisher transform of the observed correlation is the mid-point of the confidence interval.

Question: Do exam problems give the sample correlation or must we derive it?

Answer: Exam problems give either the sample correlation or data from which it derive it. The data may be

●    Summary statistics (sums, sums of squares, and sum of cross products)
●    Sxx, Syy, and Sxy
●    The total sum of squares (SST) and the error sum of squares (SSE).


Attachments
GO
Merge Selected
Merge into selected topic...



Merge into merge target...



Merge into a specific topic ID...





Reading This Topic


Login
Existing Account
Email Address:


Password:


Social Logins

  • Login with twitter
  • Login with twitter
Select a Forum....











































































































































































































































Neas-Seminars

Search