
Hypothesis Testing I: Introduction
Megan Ayers
Math 141 | Spring 2026
Wednesday, Week 7
\[ \text{P-value} = \text{Probability of what happened in reality (or something more extreme)} \]
\[ \text{P-value} = \text{Probability guessing correctly 10 times in a row} \]
Discuss with Neighbor(s): If the “p-value”,
\[ \text{P-value} = \text{Probability of what happened in reality assuming the hypothesis is true} \]
is small, why does it make sense to reject the hypothesis?
In life, we tend to believe things that happen are pretty “typical” or “likely”… or else they wouldn’t happen!
With a small p-value, the hypothesis holding means “You’re special! You observed something really rare!”
With a large p-value, the hypothesis holding means “Your observation meets expectations.”
Hypothesis Testing is a scientific experiment, and follows the general scientific method.
\[ \textbf{Null Hypothesis:} \quad p = 1/5 \qquad \qquad \textbf{Alternative Hypothesis:} \quad p > 1/5 \]
Present research question and identify hypotheses
Describe “Null” distribution - What should we expect to happen due to randomness if the null hypothesis is true?
Obtain data, calculate relevant “Test Statistic”
Calculate the “P-value”
Use the P-value to make a conclusion on the research question
To compare Null and Alternate Hypotheses, we need to quantify how likely it is to observe a particular sample statistic, if the null hypothesis were true.
To answer questions like these, we need to know the distribution of the statistic of interest, if the null hypothesis were true.
This distribution is called the Null Distribution and is the theoretical sampling distribution for the statistic if the null hypothesis were true.
We can approximate the Null Distribution using simulation or theory.
We can use R to simulate one experiment of 10 guesses by…
This gives us a \(\widehat{p} = 2/10 = 0.2\)
We can use R to simulate 2000 experiments of 10 guesses by putting in reps = 2000 (reps = 1 is the default).
# A tibble: 2,000 × 3
replicate n_correct p_hat
<int> <dbl> <dbl>
1 1 2 0.2
2 2 5 0.5
3 3 1 0.1
4 4 2 0.2
5 5 0 0
6 6 3 0.3
7 7 0 0
8 8 2 0.2
9 9 1 0.1
10 10 4 0.4
# ℹ 1,990 more rows

Present research question and identify hypotheses
Describe “Null” distribution
Obtain data, calculate relevant “Test Statistic”
Calculate the “P-value”
Use the P-value to make a conclusion on the research question
Present research question and identify hypotheses
Describe “Null” distribution
Obtain data, calculate relevant “Test Statistic”
Calculate the “P-value”
\[\text{P-value} = (1/5)^{10} \approx 0.0000001 \]
Present research question and identify hypotheses
Describe “Null” distribution
Obtain data, calculate relevant “Test Statistic”
Calculate the “P-value”
\[ \textbf{Null Hypothesis:} \quad p = 1/5 \qquad \qquad \textbf{Alternative Hypothesis:} \quad p > 1/5 \]
Discuss with Neighbor(s):
When \(\widehat{p} = 1\) (Megan guesses all 10 cards correctly), we found P-value \(\approx 0\). Do we reject the Null Hypothesis under this framework? Why?
Hypothetically, a \(\widehat{p} = 0.5\) gives P-value \(\approx 0.04\). Do we reject the Null now? Why or why not?
Answers:
Yes, we’re rejecting the Null
Probably, but we don’t feel good about it!