MS Mod 6: Inferences Based on Two Samples (overview)

Author	Message
NEAS	NEAS posted 6 Years Ago #15777 #
Supreme Being Group: Administrators Posts: 4.3K, Visits: 1.3K	MS Module 6: Inferences Based on Two Samples (overview) (The attached PDF file has better formatting.) Reading: §10.1: z Tests and Confidence Intervals for a Difference Between Two Population Means Much actuarial pricing and risk classification deals with means of two or more groups of policyholders, such as men vs women or rural residents vs urban residents. The Z statistic is the difference in the sample means divided by the standard deviation. The variance of the difference of two independent random variables is the sum of their variances, and the standard deviation is the square root of this variance. For the variance of the difference in means, each variance is divided by the size of its sample. The null hypothesis is that the difference in means is some pre-determined value, not necessarily zero. An actuary may test a hypothesis that the expected life is K years longer for people of Country S vs Country T. Causation is hard to infer from observational studies. Residents of a country differ many ways: income, health care, education, and diet are examples that may affect mortality rates. The textbook discusses inferences of causation several times; the modules on two-factor ANOVA and insurance risk classification explain how the inferences are interpreted. Causation is controversial for actuarial pricing and risk classification. Urban residents in developed countries have higher motor insurance accident frequencies and pay higher rates than rural residents pay. Residence does not cause accidents, and an urban resident who moves to a rural area is charged a lower rate. The two views to this controversy are ● Accident frequencies depend on characteristics of the policyholders, and residence is the best available predictor of these characteristics. ● Residence over-charges some policyholders and under-charges others, so actuaries should focus on the true characteristics affecting accident frequencies. A similar controversy affects mortality rates by country or region. Insurers in east Asia assume lower mortality than insurers in other countries, but a person of east Asian extraction pays the same rate as other consumers for insurance contracts bought in other countries. The textbook gives formulas for β (the probability of a Type II error) for three types of alternative hypothesis: more than, less than, and not equal to the hypothesized difference in means. For each formula, know the area under the normal curve to which it refers. Recalling the abstract formulas is difficult, since the parameters depend on the scenario in the section of the text, but the graphic remains the same. The textbook shows the graphics for several of the scenarios. If the sample is large enough, the population need not be normally distributed and the sample variance S2 may be used in place of the population variance σ2. This application of the central limit theorem is used repeatedly in the textbook. The confidence intervals for differences of means are similar to hypothesis testing for differences of means. Attachments MS Module 6 Inferences Based on Two Samples (overview).pdf (586 views, 38.00 KB) 0
	Reply