Neas-Seminars

Fox Module 11: Statistical inference for simple linear regression HW


http://33771.hs2.instantasp.net/Topic8673.aspx

By NEAS - 12/3/2009 12:49:23 PM

Module 11: Statistical inference for simple linear regression: homework assignment

 

(The attached PDF file has better formatting.)

 

Homework assignment: Estimating regression parameters

 

Some final exam problems give a set of points and ask to compute ordinary least squares estimators, sums of squares, t values, confidence intervals, and other regression statistics.

The final exam may give one set of points and ask about several statistics and estimates, or separate points for each statistic. 

 

This homework assignment continues the exercise from Module 8.

 

An actuary fits a two-variable regression model ( )

to the relation between the incurred loss ratio (x) and the retrospective ratio (y):

 

Policy Year

(x)

(y)

(x – 0)

(x – 0)2

(y– )

(y– )2

(x–0)(y– )

20X1

66.00%

22.50%

0.00%

0.00%

0.60%

0.0036%

0.0000%

20X2

67.00%

19.50%

1.00%

0.01%

-2.40%

0.0576%

-0.024%

20X3

68.00%

21.00%

2.00%

0.04%

-0.90%

0.0081%

-0.018%

20X4

65.00%

22.50%

-1.00%

0.01%

0.60%

0.0036%

-0.006%

20X5

64.00%

24.00%

-2.00%

0.04%

2.10%

0.0441%

-0.042%

Average

66.00%

21.90%

0.00%

0.02%

0.00%

0.0234%

-0.0180%

 

~      The column captions in the table use lower case x and y for the variables; the deviations are shown explicitly as (x – 0) and (y– ).

~      The last line has averages, not totals.  Some formulas in the textbook use totals.

 

We regress retrospective premium ratios on reported loss ratios to estimate premium assets for retrospectively rated business. Actuarial issues of retrospectively rated business are not important; the homework deals with the regression analysis only.

 

The Module 8 homework assignment solves for the least squares estimators of á and â.

 


A.       What is the variance of the ordinary least squares estimator of â?

B.       What is the t statistic for testing the null hypothesis that â = 0?

C.       What is the 95% confidence interval for the true value of â?

D.       What is the p value for testing the null hypothesis that â = 0?

E.       What is the variance of the ordinary least squares estimator of á?

F.        What is the t statistic for testing the null hypothesis that á = 21.90%?

G.       What is the 95% confidence interval for the true value of á?

H.       What is the p value for testing the null hypothesis that á = 21.90%?

 

Show the computations for the homework assignment, not just the solution. You can check your solutions with Excel or other statistical software.

 

The null hypotheses for á and â depend on the scenario.

 


 

            A null hypothesis of â = 0 means that retrospective rating has no effect on the premium.

            A null hypothesis of â = 1 means that a dollar of loss causes a dollar of premium.


 

 

The null hypothesis for the value of á depends on the situation.

 


 

            A null hypothesis of á = 0 means that retrospective rating is the standard premium.

            This hypothesis is not realistic, since large insureds get premium discounts.

            A null hypothesis of á = 21.90% means retrospective rating has no effect on premium.

            The average observed discount is the expected discount regardless of losses.

 

These retrospective rating issues are not part of the regression analysis course.


 

 

The confidence intervals require a t distribution for the appropriate degrees of freedom. Use Excel to find the t values for a 95% confidence interval.

 

By Hooks5 - 11/28/2015 7:41:28 PM

When solving for SE(A) at the top of page 112 in Fox's textbook, there is a sqrt(329,731) in the numerator. Where did this number come from? I can't find the value in the original Module 8 material, and I can't find the original data to test different values out.

Also, when solving for test statistic with B sub 0, how did you calculate the B in the numerator of the formula t sub 0 = (B-B sub 0)/SE(B). The formula for m sub i isn't exactly clear. Is this even the correct test statistic that we are supposed to be calculating?