By NEAS - 6/30/2024 2:17:30 PM
MS Module 22: χ2 tests (overview second 2nd edition)
(The attached PDF file has better formatting.)
(Readings from the second 2nd edition of the Devore text.)
Reading §13.1 Goodness-of-Fit Tests When Category Probabilities Are Completely Specified and part of §13.2: Goodness-of-Fit Tests for Composite Hypotheses (until the end of example13.6 on human blood types)
This module covers χ2 tests, along with related p values.
The null hypotheses may specify the probabilities or a relation among the probabilities. If the study examines the probabilities of dominant and recessive alleles (versions of genes), we may test whether the dominant allele has a specified probability or whether the assumed allele structure follows Mendel’s laws.
Binomial experiments use the statistical tests for proportions in earlier modules. Multinomial experiments use χ2 tests, explained in this module.
The textbook uses illustrations from biology and from sports. Actuarial science also offers many illustrations.
● If mortality rates follow mathematical curves, the percentage of deaths by age and sex track the expected. ● If accident frequencies follow mathematical curves, the accidents by age, sex, and territory track the expected.
A χ2 test helps decide if differences of actual from expected are random fluctuations or if the expected values are incorrect.
Know the test statistic above example 13.1 and that it has approximately a χ2 distribution with k – 1 degrees of freedom. Chapter 6 of the text describes the χ2 distribution; if you are not familiar with this topic, review the relevant sections of that chapter. You will be tested on the use of the χ2 distribution for hypothesis testing, not on the mathematics of the χ2 distribution. Know example 13.1. Both this example and the final exam problems assume you are familiar with Mendel’s laws of inheritance at a high school level. The experiment crosses AaBb with AaBb.
p2 dominant alleles for both characteristics: AABB, AaBB, aABB, AABb, AAbB, AaBb, AabB, aABb, aAbB, p2 a dominant allele for the first characteristic, and no dominant allele for the second characteristic: AAbb, Aabb, aAbb p2 no dominant allele for the first characteristic, and a dominant allele for the second characteristic: aaBB, aaBb, aabB p4 no dominant allele for either characteristic: aabb
Of the 16 genotypes, the phenotypic probabilities are 9/16, 3/16, 3/16, 1/16
From the section “χ2 when the pi’s Are Functions of Other Parameters,” know example 13.2. Remember that p10 means p1 nought, not pten, and p50 means p5 nought, not pfifty.
Review end of chapter exercises 1, 2, 3, 4, 6, 7, and 8.
Read the section “Goodness-of-Fit Tests for Composite Hypotheses.” Know expression 13.1, example 13.5, and example 13.6. Skip from Example 13.6 through the end of §13.
This section covers χ2 tests when observed probabilities are functions of other parameters. The probabilities may be phenotypes, and the parameter may be the percentage of alleles of a certain type in the population. An actuary may model mortality for a class as a combination of high mortality and low mortality policyholders or accident frequencies for a class as a combination of high risk and low risk insureds. The relevant question is whether the model is reasonable, not whether the parameters are correctly specified. Example 13.5-6 uses a simple distribution so that the maximum likelihood fitting is easy. Final exam problems use the same type of distribution but with different observed values.
Final exam problems may give the expected values for (k-1) levels. The total probability is one, so you derive the probability of the last layer. As the textbook says: “The fact that df = k – 1 in the preceding theorem is a consequence of the restriction Ni = n: although there are k observed counts, once any k – 1 are known, the remaining one is uniquely determined. That is, there are only k – 1 “freely determined” cell counts, and thus k – 1 df.”
Pricing actuaries use χ2 tests to judge whether accident frequency has a Poisson distribution or whether risk classifications are homogeneous. The actuary is not testing a specific Poisson distribution, but only testing whether a Poisson distribution is a reasonable model for accident frequency. In general, Poisson distributions are reasonable only for homogeneous classes, so the actuary is testing whether the class is homogeneous.
The final exam problems uses allele probabilities, for which the arithmetic can be done without a computer. Some examples using Poisson distributions and professional sports playoff games cannot be done by pencil and paper.
Review end of chapter exercises 12a and 13.
|
|