Point Biserial Correlation
Lecture 59
The point biserial correlation is a statistical measure
that assesses the association between a natural dichotomous variable and a continuous variable. The natural dichotomous variable has two natural categories, like 'male / female', 'yes / no', etc. The point biserial correlation is
a special case of correlation and is based on the following assumptions.
i. There will be no outliers in the continuous variable.
ii. The continuous variable follows normal distribution or
approximately follows normal distribution.
iii. The variance of the continuous variable is homogeneous
for both categories of the natural dichotomous variable.
e.g., suppose it is desired to study the association
between study hours (continuous variable) and gender (natural dichotomous
variable); then such a kind of association can be measured by point biserial
correlation.
Coefficient Point Biserial Correlation
A numerical quantity that measures the strength of linear
association between a natural dichotomous variable and a continuous variable. The
point biserial correlation coefficient is denoted by
The
point biserial correlation between dichotomous variables, categorised into natural categories “p” and “q”, and a continuous variable is denoted by “rb”
Where:
s is the standard deviation of the variable on the interval scale.
Pp is the proportion of the interval variable values associated with the
dichotomous variable’s first category.
Pq is the proportion of the interval variable values associated with the
dichotomous variable’s second category.
The mean and proportion of the dichotomous variable's first “p” category:
The mean and proportion of the dichotomous variable second “q” category:
The
standard deviation “s” can be obtained as:
|
Gender |
Score |
|
|
1 |
M |
7 |
|
2 |
M |
19 |
|
3 |
M |
8 |
|
4 |
M |
10 |
|
5 |
M |
7 |
|
6 |
M |
15 |
|
7 |
M |
6 |
|
8 |
M |
13 |
|
9 |
F |
14 |
|
10 |
F |
11 |
|
11 |
F |
18 |
|
12 |
F |
23 |
|
13 |
F |
17 |
|
14 |
F |
20 |
|
15 |
F |
14 |
|
16 |
F |
24 |
|
17 |
F |
22 |
The researcher wants to know the association between
gender and score. Test the hypothesis that there is no association between
gender and score is null.
Solution: First calculate the point biserial correlation and then test the hypothesis.
|
Participants |
Gender |
X |
X^2 |
|
1 |
M |
7 |
49 |
|
2 |
M |
19 |
361 |
|
3 |
M |
8 |
64 |
|
4 |
M |
10 |
100 |
|
5 |
M |
7 |
49 |
|
6 |
M |
15 |
225 |
|
7 |
M |
6 |
36 |
|
8 |
M |
13 |
169 |
|
F |
14 |
196 |
|
|
10 |
F |
11 |
121 |
|
11 |
F |
18 |
324 |
|
12 |
F |
23 |
529 |
|
13 |
F |
17 |
289 |
|
14 |
F |
20 |
400 |
|
15 |
F |
14 |
196 |
|
16 |
F |
24 |
576 |
|
17 |
F |
22 |
484 |
|
|
|
248 |
4168 |
Let p represent the male category and q the female category.
The standard deviation of the score:The coefficient of the point biserial correlation is given by;Now test the hypothesis:
- Read More: Introduction to A/B Test
%20(1).png)
.png)
%20-%20Copy.png)
%20-%20Copy.png)
%20-%20Copy.png)
%20-%20Copy.png)
%20-%20Copy.png)
%20-%20Copy.png)
%20-%20Copy.png)
%20-%20Copy.png)
%20-%20Copy.png)
%20-%20Copy.png)
%20-%20Copy.png)
%20-%20Copy.png)
%20-%20Copy.png)
%20-%20Copy.png)
%20-%20Copy.png)
%20-%20Copy.png)
%20-%20Copy.png)
%20-%20Copy.png)
No comments:
Post a Comment