

Confidence Intervals II
Megan Ayers
Math 141 | Spring 2026
Wednesday, Week 7
Please be mindful of students finishing exam from the previous section when arriving for the midterm (section 2)
Lab 5 grades are posted
Final office hours before midterm: Megan’s office today from 3:30-5pm
Review the concept of a 95% confidence interval
Discuss (one way) of creating confidence intervals with different confidence levels.
Interpret confidence intervals and discuss common misconceptions
What are the differences between a sampling distribution and a bootstrap distribution?
Suppose we created confidence intervals based on distinct samples of size \(n=10\) and \(n=100\). How might they differ?
\[ \textrm{Statistic }\pm \textrm{ Margin of Error (ME)} \]
\[ \text{Mean} \pm 2*\text{Standard Error} \]


Implication: in the sampling distribution, 95% of sample statistics lie in the range: \[ \text{Parameter} \pm 2*\text{Standard Error} \] where the Standard Error is the standard deviation of the sampling distribution
So, 95% of sample statistics are within \(2*\mathrm{SE}\) from the parameter!
Thus, 95% confidence intervals usually take the form:
\[ \textrm{Statistic }\pm 2*\textrm{Standard Error (SE)} \]
Researchers are interested in the COVID-19 reproduction rate (the average number of individuals each infected person further infects)
Sample 50 infected individuals and perform contract tracing.
infected n
1 0 5
2 1 13
3 2 14
4 3 12
5 4 5
6 6 1
mean_infected
1 2.06

Goal: Create an interval of plausible values for the reproduction rate.
Q: What is the population? What is the parameter?
Q: What is the sample? What is the statistic?
We can use our sample to create a 95% confidence interval. What is each step doing, and why?

\[ \bar x \pm 2 \cdot SE \implies 2.06 \pm 2 \cdot 0.181 \]
We can use our sample to create a 95% confidence interval.

\[ \bar x \pm 2 \cdot SE \implies 2.06 \pm 2 \cdot 0.181 \]
General Confidence Intervals
The \(C\%\) confidence interval for a parameter is an interval estimate that is computed from sample data by a method that captures the parameter for \(C\%\) of all samples.

By definition, 2.5% of the data is less than the .025 quantile, and 2.5% of the data is greater than the .975 quantile

By definition, 2.5% of the data is less than the .025 quantile, and 2.5% of the data is greater than the .975 quantile

For sampling distributions that are bell-shaped, the .025 quantile is about \(2\cdot SE\) below the mean, and the .975 quantile is about \(2\cdot SE\) above the mean
So using the .025 and .975 quantiles is roughly equivalent to forming a 95% CI as: \(\text{Statistic} \pm 2*\text{SE}\)!

quantile function in R to calculate the .05 and .95 quantiles
With neighbor(s), name the quantiles of the bootstrap distribution you would need for:
80% Confidence Interval
99% Confidence Interval
2% Confidence Interval (You would never do this! This is just for fun (: )
Answers:
0.10 and 0.90 quantiles
0.005 and 0.995 quantiles
0.49 and 0.51 quantiles
Two factors determine the width of a confidence interval:
Discuss with Neighbor(s): Confidence Intervals get smaller with:
a larger sample size
a lower confidence level
Intuitively, why does this make sense?
Confidence Intervals get smaller with:
a larger sample size
a lower confidence level
Note: These reasons for getting smaller are competing in terms of certainty!
Reminder: While a lower confidence level gives you a smaller interval, there is a cost! (i.e., lower success rate)
Suppose we wish to estimate the number of hours a Reed student sleeps on a typical night. We obtain the following 95% confidence interval: \((7.86, 8.34)\)

Suppose we wish to estimate the number of hours a Reed student sleeps on a typical night. We obtain the following 95% confidence interval: \((7.86, 8.34)\)




Once we take a sample and calculate a confidence interval, there’s no more randomness!
This is may seem like arguing over semantics – but it’s an important distinction!
Instead, say either:
“If we were to take many samples and calculate a 95% confidence interval for each, then 95% of them would contain the true parameter”
“We are 95% confident that the true parameter is in our confidence interval”