How do Type I and Type II errors and the power of a test describe what a hypothesis test can get wrong?
Carry out hypothesis tests and analyse Type I and Type II errors and the power of a test
A focused answer to the H2 Further Mathematics outcome on errors in hypothesis testing. Type I and Type II errors, the significance level as the Type I error probability, computing the probability of a Type II error, and the power of a test.
Reviewed by: AI editorial process; not yet individually human-reviewed
Have a quick question? Jump to the Q&A page
Jump to a section
What this dot point is asking
SEAB wants you to carry out a hypothesis test and analyse the two ways it can be wrong: a Type I error (rejecting a true null hypothesis) and a Type II error (failing to reject a false one). You must relate the significance level to the Type I error, compute the probability of a Type II error for a specified alternative, and define the power of the test.
The answer
The structure of a test
A hypothesis test sets a null hypothesis and an alternative , chooses a significance level , defines a rejection region, and rejects if the test statistic falls in that region. The decision can be correct or can commit one of two errors.
Type I and Type II errors
The four outcomes are summarised by truth versus decision:
- Type I error: reject when is true (a false positive). Probability .
- Type II error: do not reject when is false (a false negative). Probability .
A correct decision is either rejecting a false or retaining a true .
The significance level is the Type I error probability
By construction, the significance level is the probability of a Type I error, because the rejection region is chosen so that, when is true, the test statistic lands there with probability . Choosing a smaller reduces Type I errors but, for fixed sample size, increases Type II errors.
Computing the Type II error probability
depends on the specific true value assumed under . To find it, work out the distribution of the test statistic at that true value and compute the probability the statistic falls in the acceptance region (so is not rejected):
The power of a test
The power is the probability of correctly rejecting a false :
A powerful test is good at detecting a real effect. Power rises with a larger sample size, a larger true effect, and a larger .
Examples in context
Example 1. Medical screening. A diagnostic test's Type I error is a false positive (alarming a healthy patient) and its Type II error a false negative (missing a real disease); the power is the test's sensitivity, which is why screening programmes are tuned to balance these two errors.
Example 2. Manufacturing acceptance sampling. Accepting a bad batch is one error and rejecting a good batch the other; the power curve of the sampling plan shows how reliably a genuinely defective batch is caught, the basis of quality-control standards.
Try this
Q1. Define a Type I error. [1 mark]
- Cue. Rejecting the null hypothesis when it is actually true (a false positive); its probability is .
Q2. What is the power of a test in terms of ? [1 mark]
- Cue. Power , the probability of correctly rejecting a false .
Q3. Why does reducing tend to increase for a fixed sample size? [2 marks]
- Cue. A smaller rejection region makes it harder to reject , so a false is more often retained, raising the Type II error rate.
Exam-style practice questions
Practice questions written in the style of SEAB exam questions on this dot point, with worked answer explainers. The year tag is the paper they imitate, not the source.
Original6 marksExplain what is meant by a Type I error and a Type II error in a hypothesis test, and state how the significance level relates to each.Show worked answer →
A Type I error is rejecting the null hypothesis when it is in fact true (a false positive). Its probability equals the significance level , because is precisely the probability of obtaining a result in the rejection region when holds.
A Type II error is failing to reject when it is in fact false (a false negative). Its probability is denoted and depends on the true value of the parameter, the sample size, and .
So the significance level is the Type I error probability, set by the experimenter; (the Type II error probability) is not directly set and must be computed for a specific alternative.
Markers reward the definitions of both errors (rejecting a true ; not rejecting a false ), and stating that is the Type I error probability while is the Type II error probability.
Original7 marksA test of against uses trials and rejects if the number of successes . Find the probability of a Type I error, and the probability of a Type II error if the true value is .Show worked answer →
Under , . The Type I error probability is rejecting when it is true:
A Type II error is failing to reject when holds. If the true , then , and we fail to reject when :
So the Type I error probability is about and the Type II error probability (at ) about .
Markers reward the binomial under each hypothesis, , and .
Related dot points
- Compute unbiased estimates of a population mean and variance and construct and interpret confidence intervals for a population mean
A focused answer to the H2 Further Mathematics outcome on estimation. Unbiased estimators of the population mean and variance, the sample variance with its n minus 1 divisor, and constructing and correctly interpreting confidence intervals for a mean.
- Apply non-parametric tests including the sign test and the Wilcoxon signed-rank test, and know when they are appropriate
A focused answer to the H2 Further Mathematics outcome on non-parametric tests. When to use distribution-free methods, the sign test for a median, the Wilcoxon signed-rank test, the test statistics and how to reach a conclusion.
- Work with discrete random variables, their probability distributions, expectation, variance, and the expectation and variance of linear functions
A focused answer to the H2 Further Mathematics outcome on discrete random variables. Probability distributions, expectation and variance, the computational formula for variance, and the rules for the expectation and variance of a linear function aX + b.
- Recognise and apply the geometric and negative binomial distributions, including their probabilities, expectations and variances
A focused answer to the H2 Further Mathematics outcome on the geometric and negative binomial distributions. Their probability formulae, when each applies, the expectation and variance of each, and the link to the binomial distribution.