2019-09-19

顯著性檢定

顯著性檢定實作By Tony

1.Hypotheses

虛無假說(null hypothesis: $H_0$ ) - 欲推翻的決策

對立假說(alternative hypothesis: $H_1$ ) - 欲證實的決策

目的：證明對立假說 $H_1$ 是事實

方法：利用數據資料證明虛無假說 $H_0$ 不成立，間接地證明對立假說 $H_1$ 成立

結論：

推翻虛無假說 $H_0$ ，證明 $H_1$ 成立
無法推翻虛無假說 $H_0$ ，不能證明虛無假說 $H_0$ 為真，而是資料無法提供足夠證據推翻 $H_0$

example 1 - Lady Testing Tea

https://en.wikipedia.org/wiki/Lady_tasting_tea

“The null hypothesis is that the subject has no ability to distinguish the teas.”

example 2

中秋節後逢甲某公司員工平均體重是否較過節前重

$H_0$ ：沒有變胖

$H_1$ ：變胖!

2.檢定統計量 Test Statistic

母體 $\sigma^2$ 已知，使用常態分配 $Z = \frac{\bar{X} - \mu}{\frac{\sigma}{\sqrt{n}}}$

母體 $\sigma^2$ 未知，當樣本數 $n \geq 30$ ，使用常態分配 $Z = \frac{\bar{X} - \mu}{\frac{s}{\sqrt{n}}}$ ( $\sigma$ 用 s 取代)

母體 $\sigma^2$ 未知，當樣本數 $n < 30$ ，使用 t 分配 $t = \frac{\bar{X} - \mu}{\frac{s}{\sqrt{n}}}$

example 3

假說：

The 公司宣稱員工中秋節後胖超過 500 公克

員工懷疑多出的肥肉，重量不足 500 公克

$H_0$ ：平均重量至少 500 公克

$H_1$ ：平均重量不足 500 公克

Q：今天隨機取樣 36 位員工，平均重量為 485 公克，標準差為 30 公克，較其宣稱少 15 公克是否可以證明多出的肥肉重量不足 500 公克。( $\alpha = 0.05$ )

檢定統計量 Test Statistic： $Z = \frac{分子}{分母} (\frac{\bar{X} - \mu}{\frac{\sigma}{\sqrt{n}}})$

分子：樣本估算值與虛無假說所假定母體參數之間的差異

分母：樣本估算值的抽樣誤差

$H_0$ ： $\mu \geq 500g\ \ v.s.\ H_1 < 500g$ (左尾檢定)

$Z = \frac{485-500}{\frac{30}{\sqrt{36}}} = -3$

P-value:在虛無假說為真時(平均重量至少 500 公克)觀測到 36 個樣本平均值為 485 公克或比485 公克更輕的機率。

$\text{P-value} = Pr (z < -3) = 0.0013$

有足夠的證據顯示多出的肥肉重量不足 500 公克。

3. midterm 資料

班級 A 平均分數是否超過 50 分

t.test(midterm$`班級A`, 
       alternative = "greater",
       mu = 50)

## 
##  One Sample t-test
## 
## data:  midterm$班級A
## t = 3.145, df = 39, p-value = 0.001587
## alternative hypothesis: true mean is greater than 50
## 95 percent confidence interval:
##  53.71415      Inf
## sample estimates:
## mean of x 
##        58

班級 A 平均分數是否超過 60 分

t.test(midterm$`班級A`, 
       alternative = "greater",
       mu = 60)

## 
##  One Sample t-test
## 
## data:  midterm$班級A
## t = -0.78625, df = 39, p-value = 0.7818
## alternative hypothesis: true mean is greater than 60
## 95 percent confidence interval:
##  53.71415      Inf
## sample estimates:
## mean of x 
##        58

班級 A 與班級 B

t.test(midterm$`班級A`, midterm$`班級B`)

## 
##  Welch Two Sample t-test
## 
## data:  midterm$班級A and midterm$班級B
## t = 0.57892, df = 77.878, p-value = 0.5643
## alternative hypothesis: true difference in means is not equal to 0
## 95 percent confidence interval:
##  -5.182786  9.432786
## sample estimates:
## mean of x mean of y 
##    58.000    55.875

班級 B 與班級 C

t.test(midterm$`班級B`, midterm$`班級C`)

## 
##  Welch Two Sample t-test
## 
## data:  midterm$班級B and midterm$班級C
## t = -11.264, df = 50.097, p-value = 2.452e-15
## alternative hypothesis: true difference in means is not equal to 0
## 95 percent confidence interval:
##  -37.58792 -26.21208
## sample estimates:
## mean of x mean of y 
##    55.875    87.775

班級 A 與班級 C

t.test(midterm$`班級A`, midterm$`班級C`)

## 
##  Welch Two Sample t-test
## 
## data:  midterm$班級A and midterm$班級C
## t = -10.881, df = 50.967, p-value = 6.758e-15
## alternative hypothesis: true difference in means is not equal to 0
## 95 percent confidence interval:
##  -35.26861 -24.28139
## sample estimates:
## mean of x mean of y 
##    58.000    87.775

參考資料