Yates Correction
for Continuity
Lecture 39
Yates correction is a statistical
technique to improve the precision of the chi-square test of independence of two
variables classification presented in a contingency table. In chi-square
approximation, the smaller cell frequencies (less than 5) combine with the
larger one and reduce the chi-square degree of freedom. But in the case of 2x2
contingency, the smaller cell cannot combine with the larger because the chi-square table value is not available at zero degrees of freedom.
Facing such a situation, Frank Yates proposed the following continuity correction for the 2 x 2 table, which markedly enhanced the chi-square approximation.
The above modification in the chi-square approximation is known as Yates continuity correction and is applicable when there is a single degree
of freedom.
The Frank Yates continuity correction for a 2 x 2 contingency
table is given by:
|
|
B1 |
B2 |
Total |
|
A1 |
a |
b |
a+b |
|
A2 |
c |
d |
c+d |
|
Total |
a+c |
b+d |
n |
Example 9.20: A study examined the relationship
between blood group and disease severity. The results are displayed in the 2X2
contingency table that follows:
Is there an association between blood group and the severity of the condition?
Can you suggest applying Yates continuity correction?
Solution: The cell frequency is small (less than 5); it is suggested to apply Yates continuity correction.
i. The null and alternative hypotheses may be stated as:
H0: The blood group and the severity of the disease are not associated.
Vs.
H1: The blood group and the severity of the disease are associated.
ii. The significance level: α = 0.05
iii. The test statistic:
vi. Critical Region:Reject H₀ when χ² ≥ χ²₀.₀₅(1
In a 2x2 contingency table where the cell frequencies are small. The effectiveness of the chi-square approximation will be questioned to some extent. In response to these circumstances, R.A. Fisher, J.O. Irwin, and Frank Yates developed the Fisher exact test, which is a method for evaluating the hypothesis of independence in a contingency table with fairly small cell frequencies.
Procedure: First, identify the smaller cell frequency and then alter the cell
frequency with the restriction that marginal frequencies are fixed.
If it is desired to test the null hypothesis, there is no association between the two variables
classification.
|
A / B |
B1 |
B2 |
Total |
|
A1 |
a |
b |
(a+b) |
|
A2 |
c |
d |
(c+d) |
|
Total |
(a+c) |
(b+d) |
n |
Assuming that d is the least frequency, the other possible tables are obtained by reducing d by unity, altering the cell frequencies of the other cells, and repeating the procedure till d becomes zero. Then compute the probability of the observed and other possible tables.
Then the total probabilities, P = Pd + Pd-1 + Pd-2 + ⋯ + P0.
%20(1)%20(1).png)
%20(1)%20(1).png)
%20(1)%20(1).png)
%20(1)%20(1).png)
%20(1)%20(1).png)
No comments:
Post a Comment