Skip to main content
SingaporeFurther MathsSyllabus dot point

How do we estimate a population parameter from a sample, and what does a confidence interval mean?

Compute unbiased estimates of a population mean and variance and construct and interpret confidence intervals for a population mean

A focused answer to the H2 Further Mathematics outcome on estimation. Unbiased estimators of the population mean and variance, the sample variance with its n minus 1 divisor, and constructing and correctly interpreting confidence intervals for a mean.

Generated by Claude Opus 4.811 min answer

Reviewed by: AI editorial process; not yet individually human-reviewed

Have a quick question? Jump to the Q&A page

Jump to a section
  1. What this dot point is asking
  2. The answer
  3. Examples in context
  4. Try this

What this dot point is asking

SEAB wants you to estimate population parameters from sample data: to compute unbiased estimates of the population mean and variance (the latter using the n1n - 1 divisor), and to construct and interpret a confidence interval for a population mean. The interpretation of a confidence interval, in terms of the long-run capture rate, is examined as carefully as the calculation.

The answer

Estimators and unbiasedness

A statistic computed from a sample is an estimator of a population parameter. It is unbiased if its expected value equals the parameter it estimates. The sample mean is an unbiased estimator of the population mean:

xˉ=xn,E(Xˉ)=μ.\bar{x} = \frac{\sum x}{n}, \qquad \mathrm{E}(\bar{X}) = \mu.

The unbiased estimate of variance

Dividing the sum of squared deviations by nn underestimates the population variance, because the deviations are taken about the sample mean. The unbiased estimate uses n1n - 1:

s2=1n1(xxˉ)2=1n1(x2(x)2n).s^2 = \frac{1}{n - 1}\sum (x - \bar{x})^2 = \frac{1}{n - 1}\left(\sum x^2 - \frac{(\sum x)^2}{n}\right).

The divisor n1n - 1 (the degrees of freedom) is what makes s2s^2 unbiased.

The distribution of the sample mean

For a sample of size nn from a population with mean μ\mu and variance σ2\sigma^2, the sample mean has

E(Xˉ)=μ,Var(Xˉ)=σ2n.\mathrm{E}(\bar{X}) = \mu, \qquad \operatorname{Var}(\bar{X}) = \frac{\sigma^2}{n}.

By the Central Limit Theorem, for large nn the sample mean is approximately normally distributed, which underpins the confidence interval.

Constructing a confidence interval for the mean

A confidence interval gives a range of plausible values for μ\mu. With the population standard deviation σ\sigma known (or a large sample),

xˉ±zσn,\bar{x} \pm z^*\frac{\sigma}{\sqrt{n}},

where zz^* is the critical value for the chosen confidence level (1.961.96 for 95%95\%, 2.5762.576 for 99%99\%). The term σn\dfrac{\sigma}{\sqrt{n}} is the standard error and zσnz^*\dfrac{\sigma}{\sqrt{n}} the margin of error.

Interpreting a confidence interval

A 95%95\% confidence interval does not mean there is a 95%95\% probability the true mean lies in this particular interval. It means the procedure produces an interval that captures the true mean in 95%95\% of repeated samples. Wider confidence (say 99%99\%) gives a wider interval; a larger sample narrows it.

Examples in context

Example 1. Polling. A political poll reports a percentage with a margin of error; that margin is zσnz^*\dfrac{\sigma}{\sqrt{n}}, and the "95% confidence" caveat is exactly the long-run capture interpretation, which is why larger polls report tighter margins.

Example 2. Quality assurance. A factory estimates the mean fill of bottles from a sample and reports a confidence interval; if the target value lies outside the interval, the process is flagged, linking estimation directly to hypothesis testing.

Try this

Q1. Why does the unbiased estimate of variance divide by n1n - 1? [2 marks]

  • Cue. Deviations are measured about the sample mean, which understates spread; the n1n - 1 divisor (degrees of freedom) corrects the bias.

Q2. State the 95%95\% confidence interval formula for a mean with known σ\sigma. [1 mark]

  • Cue. xˉ±1.96σn\bar{x} \pm 1.96\dfrac{\sigma}{\sqrt{n}}.

Q3. Does a 95%95\% confidence interval mean a 95%95\% chance the true mean is inside it? [1 mark]

  • Cue. No; it means 95%95\% of intervals from repeated samples would contain the true mean.

Exam-style practice questions

Practice questions written in the style of SEAB exam questions on this dot point, with worked answer explainers. The year tag is the paper they imitate, not the source.

Original6 marksA sample of n=8n = 8 measurements gives x=240\sum x = 240 and x2=7300\sum x^2 = 7300. Find unbiased estimates of the population mean and variance.
Show worked answer →

The unbiased estimate of the population mean is the sample mean:

xˉ=xn=2408=30.\bar{x} = \frac{\sum x}{n} = \frac{240}{8} = 30.

The unbiased estimate of the population variance uses the divisor n1n - 1:
s2=1n1(x2(x)2n)=17(730024028)=17(73007200)=100714.29.s^2 = \frac{1}{n - 1}\left(\sum x^2 - \frac{(\sum x)^2}{n}\right) = \frac{1}{7}\left(7300 - \frac{240^2}{8}\right) = \frac{1}{7}\left(7300 - 7200\right) = \frac{100}{7} \approx 14.29.

Markers reward the sample mean 3030, the unbiased variance formula with the n1n - 1 divisor, the correct x2(x)2n=100\sum x^2 - \tfrac{(\sum x)^2}{n} = 100, and s214.29s^2 \approx 14.29.

Original7 marksA sample of 5050 light bulbs has mean lifetime xˉ=1200\bar{x} = 1200 hours. The population standard deviation is known to be σ=90\sigma = 90 hours. Construct a 95%95\% confidence interval for the population mean lifetime.
Show worked answer →

With σ\sigma known and a large sample, the confidence interval for the mean is xˉ±zσn\bar{x} \pm z^*\dfrac{\sigma}{\sqrt{n}}, where z=1.96z^* = 1.96 for 95%95\% confidence.

Standard error: σn=9050=907.071=12.728\dfrac{\sigma}{\sqrt{n}} = \dfrac{90}{\sqrt{50}} = \dfrac{90}{7.071} = 12.728.

Margin of error: 1.96×12.728=24.951.96\times 12.728 = 24.95.

Interval: 1200±24.951200 \pm 24.95, that is (1175.05, 1224.95)(1175.05,\ 1224.95) hours.

Interpretation: we are 95%95\% confident that the true mean lifetime lies between about 11751175 and 12251225 hours; the procedure captures the true mean in 95%95\% of such samples.

Markers reward the interval formula with z=1.96z^* = 1.96, the standard error 12.7312.73, the margin 24.9524.95, and the interval with a correct interpretation.

Related dot points