Fatskills
Practice. Master. Repeat.
Study Guide: How to Solve: Statistical Sampling & Data Presentation (Box Plots, Histograms, Skewness)
Source: https://www.fatskills.com/gcse-math/chapter/how-to-solve-statistical-sampling-data-presentation-box-plots-histograms-skewness

How to Solve: Statistical Sampling & Data Presentation (Box Plots, Histograms, Skewness)

By Fatskills Exam Guides Team — the exam nerds behind 28,500+ quizzes and 2.1M practice questions across 500+ global exams.

⏱️ ~5 min read

How to Solve: Statistical Sampling & Data Presentation (Box Plots, Histograms, Skewness)

GCSE / A-Level Maths


Introduction

"Mastering box plots, histograms, and skewness doesn’t just get you 5–10 marks on your GCSE/A-Level exam—it lets you spot misleading news graphs, compare exam results, and even analyse sports performance like a pro. One question on this topic can be the difference between a 6 and a 7, or a B and an A!


What You Need To Know First

  1. Basic statistical terms – Mean, median, mode, range, quartiles.
  2. Frequency tables – How to read and interpret grouped data.
  3. Number lines & scales – How to plot values accurately.

(If you’re shaky on these, pause and review them first—this guide assumes you’re solid.)


Key Vocabulary

Term Plain-English Definition Quick Example
Population The entire group you’re studying. All Year 11 students in the UK.
Sample A smaller group taken from the population. 100 Year 11 students from 5 schools.
Quartiles Values that split data into 4 equal parts. Q1 = 25th percentile, Q2 = median, Q3 = 75th percentile.
Interquartile Range (IQR) The range of the middle 50% of data. IQR = Q3 – Q1.
Skewness How asymmetrical the data is. Right-skewed = tail on the right.
Outlier A data point far from the rest. In a test, one student scores 10% when everyone else scores 70%+.

Formulas To Know

Formula What It Means Exam Note
IQR = Q3 – Q1 Measures spread of the middle 50% of data. MEMORISE THIS
Lower Outlier Boundary = Q1 – 1.5 × IQR Data below this is an outlier. MEMORISE THIS
Upper Outlier Boundary = Q3 + 1.5 × IQR Data above this is an outlier. MEMORISE THIS
Frequency Density = Frequency ÷ Class Width Used for histograms with unequal class widths. Given on exam sheet (but know how to use it!)

Step-by-Step Method

1. Drawing a Box Plot

Step 1: Order the data from smallest to largest. Step 2: Find the minimum, Q1, median (Q2), Q3, and maximum. Step 3: Draw a number line covering the full range. Step 4: Plot the 5 key values as vertical lines. Step 5: Draw a box from Q1 to Q3. Step 6: Draw a vertical line inside the box at the median. Step 7: Extend "whiskers" from Q1 to min and Q3 to max (unless outliers exist). Step 8: Mark outliers with crosses (×) if they exist.

2. Drawing a Histogram

Step 1: Check if class widths are equal. If not, calculate frequency density. Step 2: Label the x-axis with class boundaries. Step 3: Label the y-axis with frequency density (if unequal widths) or frequency (if equal). Step 4: Draw bars with heights matching frequency density (or frequency). Step 5: Ensure bars touch (no gaps unless data is discrete).

3. Describing Skewness

Step 1: Compare the median and mean. - If mean > medianRight-skewed (positive skew). - If mean < medianLeft-skewed (negative skew). - If mean ≈ medianSymmetrical. Step 2: Look at the box plot whiskers. - Longer right whisker → Right-skewed. - Longer left whisker → Left-skewed. Step 3: Look at the histogram shape. - Tail on the right → Right-skewed. - Tail on the left → Left-skewed.


Worked Examples

Example 1 – Basic Box Plot

Data: 3, 5, 7, 8, 9, 10, 12, 15, 18, 20 Step 1: Ordered data is already given. Step 2: Min = 3, Q1 = 7, Median (Q2) = 9.5, Q3 = 15, Max = 20. Step 3: Draw number line from 0 to 20. Step 4: Plot points at 3, 7, 9.5, 15, 20. Step 5: Draw box from 7 to 15. Step 6: Draw median line at 9.5. Step 7: Whiskers from 3 to 7 and 15 to 20. What we did and why: We followed the exact steps to visualise the spread of data. The box shows the middle 50%, and whiskers show the full range.


Example 2 – Medium Histogram (Unequal Class Widths)

Data: | Class (hours) | Frequency | |--------------|-----------| | 0 ≤ x < 2 | 5 | | 2 ≤ x < 5 | 12 | | 5 ≤ x < 10 | 18 | | 10 ≤ x < 20 | 10 |

Step 1: Class widths: 2, 3, 5, 10 → Unequal, so use frequency density. Step 2: Calculate frequency density: - 0–2: 5 ÷ 2 = 2.5 - 2–5: 12 ÷ 3 = 4 - 5–10: 18 ÷ 5 = 3.6 - 10–20: 10 ÷ 10 = 1 Step 3: Label x-axis with class boundaries (0, 2, 5, 10, 20). Step 4: Label y-axis "Frequency Density". Step 5: Draw bars with heights 2.5, 4, 3.6, 1. What we did and why: Unequal widths mean we can’t use frequency directly—frequency density ensures the area of each bar represents the true frequency.


Example 3 – Exam-Style Skewness Question

Question: A box plot shows: - Min = 10, Q1 = 20, Median = 30, Q3 = 45, Max = 80. - The mean is 35. Describe the skewness of the data.

Step 1: Compare mean (35) and median (30). - Mean > Median → Right-skewed. Step 2: Check whiskers. - Right whisker (35 units) is longer than left (10 units) → Right-skewed. Step 3: No histogram, but box plot confirms right skew. Answer: The data is right-skewed (positive skew) because the mean is greater than the median and the right whisker is longer. What we did and why: We used two methods (mean vs. median and box plot shape) to confirm skewness—examiners love this!


Common Mistakes

Mistake Why it Happens Correct Approach
Forgetting to order data before finding quartiles Students rush and pick wrong numbers. Always sort data first!
Drawing gaps in histograms Confusing histograms (continuous) with bar charts (discrete). Bars must touch unless data is discrete.
Using frequency instead of frequency density for unequal widths Misapplying histogram rules. Check class widths first! If unequal, use frequency density.
Misidentifying skewness Only looking at the box plot or only the mean/median. Use both methods to confirm.
Ignoring outliers in box plots Forgetting to mark them or extending whiskers to them. Calculate outlier boundaries and mark them with crosses.

Exam Traps

Trap How to Spot it How to Avoid it
Unequal class widths in histograms Question gives a table with varying widths (e.g., 0–5, 5–20). Always check class widths first! If unequal, use frequency density.
Box plot with hidden outliers Whiskers don’t reach min/max, but no crosses are shown. Calculate outlier boundaries (Q1 – 1.5×IQR, Q3 + 1.5×IQR).
"Describe the skewness" without context Question gives a box plot or histogram but no mean/median. Use the shape (whiskers, tail direction) if no numbers are given.

1-Minute Recap

"Right, listen up—this is your last-minute cheat sheet for box plots, histograms, and skewness. For box plots: order the data, find min/Q1/median/Q3/max, draw the box and whiskers, and mark outliers if they exist. For histograms: check if class widths are equal—if not, use frequency density. Skewness? Mean > median = right skew, mean < median = left skew, and always check the box plot whiskers or histogram tail. Common traps? Unequal widths in histograms, forgetting outliers, and misreading skewness. You’ve got this—go smash those marks!