Respuesta :
Answer:
a) H0: There is no association between level of education and TV station preference (Independence)
H1: There is association between level of education and TV station preference (No independence)
b) [tex]\chi^2 = \frac{(15-10)^2}{10}+\frac{(15-20)^2}{20}+\frac{(10-10)^2}{10}+\frac{(5-10)^2}{10}+\frac{(25-10)^2}{10}+\frac{(10-20)^2}{20} =33.75[/tex]
c) [tex]\chi^2_{crit}=5.991[/tex]
d) Since the p value is lower than the significance level we enough evidence to reject the null hypothesis at 5% of significance, and we can conclude that we have dependence between the two variables analyzed.
Step-by-step explanation:
A chi-square goodness of fit test "determines if a sample data matches a population".
A chi-square test for independence "compares two variables in a contingency table to see if they are related. In a more general sense, it tests to see whether distributions of categorical variables differ from each another".
Assume the following dataset:
High school Some College Bachelor or higher Total
Public Broadcasting 15 15 10 40
Commercial stations 5 25 10 40
Total 20 40 20 80
We need to conduct a chi square test in order to check the following hypothesis:
Part a
H0: There is no association between level of education and TV station preference (Independence)
H1: There is association between level of education and TV station preference (No independence)
The level os significance assumed for this case is [tex]\alpha=0.05[/tex]
The statistic to check the hypothesis is given by:
[tex]\sum_{i=1}^n \frac{(O_i -E_i)^2}{E_i}[/tex]
Part b
The table given represent the observed values, we just need to calculate the expected values with the following formula [tex]E_i = \frac{total col * total row}{grand total}[/tex]
And the calculations are given by:
[tex]E_{1} =\frac{20*40}{80}=10[/tex]
[tex]E_{2} =\frac{40*40}{80}=20[/tex]
[tex]E_{3} =\frac{20*40}{80}=10[/tex]
[tex]E_{4} =\frac{20*40}{80}=10[/tex]
[tex]E_{5} =\frac{40*40}{80}=20[/tex]
[tex]E_{6} =\frac{20*40}{80}=10[/tex]
And the expected values are given by:
High school Some College Bachelor or higher Total
Public Broadcasting 10 20 10 40
Commercial stations 10 10 20 40
Total 20 30 30 80
Part b
And now we can calculate the statistic:
[tex]\chi^2 = \frac{(15-10)^2}{10}+\frac{(15-20)^2}{20}+\frac{(10-10)^2}{10}+\frac{(5-10)^2}{10}+\frac{(25-10)^2}{10}+\frac{(10-20)^2}{20} =33.75[/tex]
Now we can calculate the degrees of freedom for the statistic given by:
[tex]df=(rows-1)(cols-1)=(2-1)(3-1)=2[/tex]
Part c
In order to find the critical value we need to look on the right tail of the chi square distribution with 2 degrees of freedom a value that accumulates 0.05 of the area. And this value is [tex]\chi^2_{crit}=5.991[/tex]
Part d
And we can calculate the p value given by:
[tex]p_v = P(\chi^2_{3} >33.75)=2.23x10^{-7}[/tex]
And we can find the p value using the following excel code:
"=1-CHISQ.DIST(33.75,2,TRUE)"
Since the p value is lower than the significance level we enough evidence to reject the null hypothesis at 5% of significance, and we can conclude that we have dependence between the two variables analyzed.