Respuesta :
Answer:
a) [tex]\chi^2 = \frac{(35-42)^2}{42}+\frac{(15-8)^2}{8}+\frac{(64-63)^2}{63}+\frac{(11-12)^2}{12}+\frac{(90-84)^2}{84}+\frac{(10-16)^2}{16}=10.07[/tex]
[tex]p_v = P(\chi^2_{2} >10.069)=0.0065[/tex]
Since the p value is lower than the significance level we can reject the null hypothesis at 5% of significance, and we can conclude that the two variables are dependent for this case.
b) Small
[tex]\% Yes =\frac{35}{50} x100=70\%[/tex]
[tex]\% No =\frac{15}{50} x100=30\%[/tex]
Medium
[tex]\% Yes =\frac{64}{75} x100=85.33\%[/tex]
[tex]\% No =\frac{11}{75} x100=14.67\%[/tex]
Large
[tex]\% Yes =\frac{84}{100} x100=84\%[/tex]
[tex]\% No =\frac{16}{100} x100=16\%[/tex]
On this case we see that the percentage of people with no health insurance in small companies (30%) is significantly higher than for the medium (14.67%) and large companies (16%)
Step-by-step explanation:
Previous concepts
A chi-square goodness of fit test "determines if a sample data matches a population".
A chi-square test for independence "compares two variables in a contingency table to see if they are related. In a more general sense, it tests to see whether distributions of categorical variables differ from each another".
Assume the following dataset:
Size if Company Yes No Total
Small 35 15 50
Medium 64 11 75
Large 90 10 100
Total 189 36 225
Part a
We need to conduct a chi square test in order to check the following hypothesis:
H0: There variable size of company is independent from the variable Health Insurance
H1: There variable size of company is dependent from the variable Health Insurance
The level of significance assumed for this case is [tex]\alpha=0.05[/tex]
The statistic to check the hypothesis is given by:
[tex]\chi^2=\sum_{i=1}^n \frac{(O_i -E_i)^2}{E_i}[/tex]
The table given represent the observed values, we just need to calculate the expected values with the following formula [tex]E_i = \frac{total col * total row}{grand total}[/tex]
And the calculations are given by:
[tex]E_{1} =\frac{189*50}{225}=42[/tex]
[tex]E_{2} =\frac{36*50}{225}=8[/tex]
[tex]E_{3} =\frac{189*75}{225}=63[/tex]
[tex]E_{4} =\frac{36*75}{225}=12[/tex]
[tex]E_{5} =\frac{189*100}{225}=84[/tex]
[tex]E_{6} =\frac{36*100}{225}=16[/tex]
And the expected values are given by:
Size if Company Yes No Total
Small 42 8 50
Medium 63 12 75
Large 84 16 100
Total 189 36 225
And now we can calculate the statistic:
[tex]\chi^2 = \frac{(35-42)^2}{42}+\frac{(15-8)^2}{8}+\frac{(64-63)^2}{63}+\frac{(11-12)^2}{12}+\frac{(90-84)^2}{84}+\frac{(10-16)^2}{16}=10.07[/tex]
Now we can calculate the degrees of freedom for the statistic given by:
[tex]df=(rows-1)(cols-1)=(3-1)(2-1)=2[/tex]
And we can calculate the p value given by:
[tex]p_v = P(\chi^2_{2} >10.069)=0.0065[/tex]
And we can find the p value using the following excel code:
"=1-CHISQ.DIST(10.069,2,TRUE)"
Since the p value is lower than the significance level we can reject the null hypothesis at 5% of significance, and we can conclude that the two variables are dependent for this case.
Part b
First for this case we have this:
[tex]\% Yes =\frac{189}{225} x100=84\%[/tex]
[tex]\% No =\frac{36}{225} x100=16\%[/tex]
So we see that the proportion of people with health insurance is clear higher than the proportion without it.
Now if we find the proportion for each type of company we have this:
Small
[tex]\% Yes =\frac{35}{50} x100=70\%[/tex]
[tex]\% No =\frac{15}{50} x100=30\%[/tex]
Medium
[tex]\% Yes =\frac{64}{75} x100=85.33\%[/tex]
[tex]\% No =\frac{11}{75} x100=14.67\%[/tex]
Large
[tex]\% Yes =\frac{84}{100} x100=84\%[/tex]
[tex]\% No =\frac{16}{100} x100=16\%[/tex]
On this case we see that the percentage of people with no health insurance in small companies (30%) is significantly higher than for the medium (14.67%) and large companies (16%)