Biserial Correlation
Lecture 58
The biserial correlation measures the strength of
association between an artificial dichotomous variable and a continuous variable.
The artificial dichotomous variable is defined by the researcher, expert or
investigator, like pass/fail, weak/strong, etc.
e.g., an examiner decides that below 40% is considered a fail. Thus, above 40% will be considered a pass.
A quantitative measure that measures the strength of association between an artificial dichotomous variable and a continuous variable is called biserial correlation, denoted by ρb (population) and rb (sample). The following is used to compute the biserial correlation between an artificial dichotomous and continuous variable.
X−p is the mean of the
interval variable’s values associated with the dichotomous variable’s first
category.
X−q is the mean of the
interval variable’s values associated with the dichotomous variable’s second
category.
s is the standard deviation of the variable on the
interval scale.
Pp is the proportion of the interval variable values
associated with the dichotomous variable’s first category.
Pq is the proportion of the interval variable values
associated with the dichotomous variable’s second category.
The mean and proportion of the dichotomous variable's first “p” category:
The mean and proportion of the dichotomous variable second “q” category:
The standard deviation “s” can be obtained as:
Z is the table value against Pp and Pq and is called the height of
the ordinate of the normal curve separating the proportions p and q.
Hypothesis Testing about the Association
of an Artificial dichotomous and Continuous Variable
Let rb be
That’s
H0:
If the sample size is large, the following test statistics are used to test the above null hypothesis.
|
Exam Result |
GPA |
|
|
1 |
F |
3.5 |
|
2 |
F |
3.4 |
|
3 |
F |
3.3 |
|
4 |
F |
3.2 |
|
5 |
F |
3.6 |
|
6 |
P |
4.0 |
|
7 |
P |
3.6 |
|
8 |
P |
4.0 |
|
9 |
P |
4.0 |
|
10 |
P |
3.8 |
|
11 |
P |
3.9 |
|
12 |
P |
3.9 |
|
13 |
P |
4.0 |
|
14 |
P |
3.8 |
|
15 |
P |
3.5 |
|
16 |
P |
3.6 |
Test the hypothesis that there is no association between exam results and GPAs at a 5% significance level.
Solution: First calculate the biserial correlation and next test the hypothesis.
|
Participants |
Exam
Result |
X |
X^2 |
|
1 |
F |
3.5 |
12.25 |
|
2 |
F |
3.4 |
11.56 |
|
3 |
F |
3.3 |
10.89 |
|
4 |
F |
3.2 |
10.24 |
|
5 |
F |
3.6 |
12.96 |
|
6 |
P |
4.0 |
16.00 |
|
7 |
P |
3.6 |
12.96 |
|
8 |
P |
4.0 |
16.00 |
|
9 |
P |
4.0 |
16.00 |
|
10 |
P |
3.8 |
14.44 |
|
11 |
P |
3.9 |
15.21 |
|
12 |
P |
3.9 |
15.21 |
|
13 |
P |
4.0 |
16.00 |
|
14 |
P |
3.8 |
14.44 |
|
15 |
P |
3.5 |
12.25 |
|
16 |
P |
3.6 |
12.96 |
|
|
|
59.1 |
219.37 |
Let p represent the fail category and q the pass category.
Now, compute the biserial correlation coefficient using
The biserial correlation shows a strong association between exam results and GPAs.The standard deviation “s” can be obtained as:
Z is the table value against Pp and Pq and is called height of
the ordinate of the normal curve separating the proportion p = 0.475 and q =
0.525, y = 0.3982
%20(1).png)
%20-%20Copy.png)
%20-%20Copy.png)
%20-%20Copy.png)
%20-%20Copy.png)
%20-%20Copy.png)


%20-%20Copy.png)
%20-%20Copy.png)
%20-%20Copy.png)
%20-%20Copy.png)
%20-%20Copy.png)
%20-%20Copy.png)
%20-%20Copy.png)
%20-%20Copy.png)
%20-%20Copy.png)
No comments:
Post a Comment