Skip to main content
SingaporeMathsSyllabus dot point

How do we use a cumulative frequency curve to find the median, quartiles and spread?

Construct a cumulative frequency curve and use it to estimate the median, quartiles, interquartile range and percentiles

A focused answer to the O-Level E-Maths outcome on cumulative frequency. Building the cumulative frequency curve, reading the median and quartiles, the interquartile range, and estimating percentiles.

Generated by Claude Opus 4.89 min answer

Reviewed by: AI editorial process; not yet individually human-reviewed

Have a quick question? Jump to the Q&A page

Jump to a section
  1. What this dot point is asking
  2. The answer
  3. Examples in context
  4. Try this

What this dot point is asking

SEAB wants you to build a cumulative frequency curve from grouped data and read off the median, the lower and upper quartiles, the interquartile range, and percentiles. The curve turns grouped data into estimates of position and spread that a frequency table alone cannot give.

The answer

Cumulative frequency

Cumulative frequency is a running total of the frequencies, the number of values up to and including the upper boundary of each class. You build it by adding each class frequency to the sum of all the earlier ones, so the final cumulative frequency equals the total number of values.

The cumulative frequency curve

Plot the cumulative frequency against the upper class boundary and join the points with a smooth curve. The result is an S-shaped (ogive) curve that rises from zero to the total frequency, and reading across and down from it estimates positions in the data.

Median and quartiles

For nn values, read the curve at these cumulative frequency positions:

  • median (middle) at n2\dfrac{n}{2},
  • lower quartile at n4\dfrac{n}{4},
  • upper quartile at 3n4\dfrac{3n}{4}.

Read across from the position on the vertical axis to the curve, then down to the value on the horizontal axis.

Interquartile range and percentiles

The interquartile range measures spread using the middle half of the data:

IQR=upper quartilelower quartile\text{IQR} = \text{upper quartile} - \text{lower quartile}

A percentile is the value below which a given percentage of the data lies; the 9090th percentile is read at the cumulative frequency 0.9n0.9n. The IQR is less affected by outliers than the full range.

Examples in context

Example 1. Comparing two classes. Drawing two cumulative frequency curves on the same axes lets a teacher compare the medians and interquartile ranges of two classes' marks. A curve further to the right with a smaller IQR shows higher and more consistent scores.

Example 2. Setting a pass mark. To pass the top 30%30\% of candidates, an examiner reads the 7070th percentile off the curve, setting the boundary so that 70%70\% fall below it. Percentiles turn a target proportion into a cut-off value.

Try this

Q1. State the cumulative frequency position for the median of 160160 values. [1 mark]

  • Cue. 1602=80\dfrac{160}{2} = 80.

Q2. The lower quartile is 2222 and the upper quartile is 3535. Find the interquartile range. [1 mark]

  • Cue. 3522=1335 - 22 = 13.

Q3. For 240240 values, state the position used to read the upper quartile. [1 mark]

  • Cue. 3×2404=180\dfrac{3 \times 240}{4} = 180.

Exam-style practice questions

Practice questions written in the style of SEAB exam questions on this dot point, with worked answer explainers. The year tag is the paper they imitate, not the source.

Original4 marksThe times taken by 8080 runners are recorded. From the cumulative frequency curve, the lower quartile is 3434 minutes, the median is 4141 minutes and the upper quartile is 4747 minutes. (a) Find the interquartile range. (b) Estimate how many runners took longer than 4747 minutes.
Show worked answer →

(a) Interquartile range == upper quartile - lower quartile =4734=13= 47 - 34 = 13 minutes.

(b) The upper quartile is at the 34\dfrac{3}{4} position, so 75%75\% of runners took up to 4747 minutes, leaving 25%25\% above it.

25%25\% of 80=2080 = 20 runners took longer than 4747 minutes.

Markers reward the interquartile range of 1313 minutes, recognising the upper quartile as the 75%75\% point, and the 2020 runners above it.

Original3 marksA cumulative frequency curve is drawn for the marks of 200200 candidates. Explain how you would use it to estimate the median mark.
Show worked answer →

The median is the value at the middle of the data, the 12\dfrac{1}{2} position.

For 200200 candidates, find the cumulative frequency of 2002=100\dfrac{200}{2} = 100 on the vertical axis.

Draw a horizontal line from 100100 across to the curve, then drop a vertical line down to the horizontal axis; the mark where it meets is the estimated median.

Markers reward locating 100100 on the cumulative frequency axis, reading across to the curve and down to the mark, giving the median.

Related dot points