How do we test a claim about a population mean using sample evidence?
Carry out a hypothesis test for a population mean, stating hypotheses, computing a test statistic or p-value, and interpreting the conclusion in context
A focused answer to the H2 Mathematics outcome on hypothesis testing. Setting up null and alternative hypotheses, one- and two-tailed tests, the test statistic and p-value, the significance level, and interpreting the conclusion.
Reviewed by: AI editorial process; not yet individually human-reviewed
Have a quick question? Jump to the Q&A page
Jump to a section
What this dot point is asking
SEAB wants you to carry out a hypothesis test for a population mean: state the null and alternative hypotheses, decide one- or two-tailed, compute a test statistic and -value (or compare with a critical value), and interpret the conclusion in context at the stated significance level.
The answer
The hypotheses
- The null hypothesis states the value being tested, for example .
- The alternative hypothesis states what we suspect: or (one-tailed) or (two-tailed).
The direction of comes from the question's wording ("less than", "greater than", or just "different").
The test statistic
Assuming true, and using the CLT, . The standardised test statistic is
When the population variance is unknown, use the unbiased sample estimate (for large the normal model still applies via the CLT).
The p-value and the decision
The -value is the probability, under , of a sample result at least as extreme as the one observed. The decision rule:
- If -value significance level, reject .
- Otherwise, do not reject (there is insufficient evidence).
For a two-tailed test, compare the two-tailed -value (or split the significance level between the tails).
The significance level and errors
The significance level (such as ) is the probability of rejecting a true , a Type I error. Choosing a smaller level makes rejection harder, reducing false positives.
Examples in context
Example 1. Drug efficacy. A trial tests whether a new treatment lowers mean blood pressure: of no change against of a decrease, with a small significance level chosen to guard against falsely approving an ineffective drug.
Example 2. Manufacturing tolerance. A factory tests whether the mean fill volume differs from the target (two-tailed), rejecting when the sample evidence is extreme enough to act on, the routine statistical quality check.
Try this
Q1. State suitable hypotheses to test whether a mean has increased above . [2 marks]
- Cue. , (one-tailed, upper).
Q2. A test gives a -value of at the level. State the conclusion. [1 mark]
- Cue. Since , reject .
Q3. Explain what a Type I error is. [2 marks]
- Cue. Rejecting the null hypothesis when it is in fact true; its probability is the significance level.
Exam-style practice questions
Practice questions written in the style of SEAB exam questions on this dot point, with worked answer explainers. The year tag is the paper they imitate, not the source.
Original6 marksA manufacturer claims its bags have mean mass . A sample of bags has mean . The population standard deviation is . Test at the level whether the mean mass is less than claimed.Show worked answer →
: ; : (one-tailed, lower).
Under , , standard error .
Test statistic: .
-value . Since , reject .
There is significant evidence at the level that the mean mass is less than .
Markers reward correct hypotheses, the test statistic, the -value, comparison with , and a contextual conclusion.
Original5 marksExplain the meaning of a significance level and the difference between a one-tailed and a two-tailed test.Show worked answer →
A significance level means we reject if the observed result (or more extreme) would occur with probability less than when is true; it is the probability of wrongly rejecting a true (a Type I error).
A one-tailed test has specifying a direction (for example or ), placing all the rejection region in one tail. A two-tailed test has , splitting the rejection region between both tails ( each at the level).
Markers reward defining the significance level as the Type I error probability and correctly distinguishing the one- and two-tailed alternatives.
Related dot points
- Describe the distribution of the sample mean, use the Central Limit Theorem, and find unbiased estimates of the population mean and variance from a sample
A focused answer to the H2 Mathematics outcome on sampling. The distribution of the sample mean, the Central Limit Theorem, the standard error, and unbiased estimators of the population mean and variance.
- Model continuous data with the normal distribution, standardise to the Z-distribution to find probabilities, and find values from given probabilities
A focused answer to the H2 Mathematics outcome on the normal distribution. The bell curve and its parameters, standardising to Z, finding probabilities and inverse problems, and combining normal variables.
- Compute and interpret the product moment correlation coefficient, find the least squares regression line, and use it for prediction within the data range
A focused answer to the H2 Mathematics outcome on correlation and regression. The product moment correlation coefficient, the least squares regression line, choosing which line to use, and the limits of prediction.
- Approximate the binomial by the Poisson or the normal, and the Poisson by the normal, under stated conditions, applying a continuity correction where appropriate
A focused answer to the H2 Mathematics outcome on distribution approximations. The conditions for the Poisson and normal approximations to the binomial and the normal approximation to the Poisson, and the continuity correction.