By Fatskills Exam Guides Team — the exam nerds behind 28,500+ quizzes and 2.1M practice questions across 500+ global exams.
Describing distributions using Shape, Outliers, Center, and Spread (SOCS) is the foundation of exploratory data analysis in AP Statistics. The AP exam frequently tests your ability to analyze and interpret distributions, whether in multiple-choice questions or free-response (FRQ) problems. For example, a researcher might collect data on the lifespans of light bulbs to determine if a new manufacturing process improves durability. By describing the distribution (e.g., skewed right, mean vs. median, standard deviation), you can draw meaningful conclusions about the data’s behavior and make informed decisions.
Example: A right-skewed distribution has a long tail on the right (e.g., income data).
Outliers: Data points that fall outside the overall pattern. Use the 1.5 × IQR rule:
Any point outside these fences is an outlier.
Center: Measures of central tendency.
In skewed distributions, the median is a better measure of center than the mean.
Spread: Measures of variability.
1-Var Stats
Variance = s² (standard deviation squared).
Five-Number Summary: Min, Q1, Median, Q3, Max.
Use 1-Var Stats → Stats → Calc → 1-Var Stats on TI-84.
Stats
Calc
Boxplot (Box-and-Whisker Plot): Visual representation of the five-number summary.
Use STAT PLOT → Boxplot on TI-84 (adjust window for outliers).
STAT PLOT
Boxplot
Histogram: Bar graph showing frequency/relative frequency of data in bins.
Use STAT PLOT → Histogram on TI-84 (adjust bin width with Xscl).
Histogram
Xscl
Stem-and-Leaf Plot: Quick way to display small datasets while preserving individual values.
Resistant vs. Non-Resistant Measures:
How to Describe a Distribution (SOCS) in an FRQ:
Example: “The distribution represents the number of hours students spend on homework per night.”
Describe the Shape
Example: “The distribution is skewed right, with most students studying 1–3 hours but a few studying 6+ hours.”
Identify Outliers (if any)
Example: “There appears to be an outlier at 10 hours, which is above the upper fence of 7.5 hours.”
Describe the Center
Example: “The median is 2.5 hours, which is a better measure of center than the mean due to the right skew.”
Describe the Spread
Example: “The IQR is 2 hours (Q3 = 4, Q1 = 2), and the standard deviation is 1.8 hours.”
Summarize in Context
Why? The mean is pulled in the direction of the skew (e.g., right skew → mean > median).
Mistake: Forgetting to check for outliers before calculating spread.
Why? Outliers inflate the range and standard deviation, making them misleading.
Mistake: Confusing standard deviation (s) with variance (s²).
Why? The AP exam expects s, not s², in most contexts.
Mistake: Describing shape without context (e.g., “the graph is skewed”).
Why? The AP rubric awards points for specificity.
Mistake: Misinterpreting the IQR as “the middle 50% of the data.”
Calculator skills (e.g., 1-Var Stats, boxplots, histograms).
Tricky Distinctions:
Outliers vs. Extreme Values: Not all extreme values are outliers (must exceed 1.5 × IQR).
Common FRQ Setups:
“Identify any outliers and describe their effect on the mean and standard deviation.”
Calculator Pitfalls:
ZoomStat
Answer: (C) Skewed right Explanation: When the mean > median, the distribution is typically skewed right.
Answer: - Outliers: Any points outside the whiskers (e.g., scores below Q1 – 1.5(IQR) or above Q3 + 1.5(IQR)). - Effect: Removing outliers would decrease the mean (since they’re likely high scores) and decrease the standard deviation (less spread).
Answer: (D) IQR Explanation: The IQR is resistant to outliers because it only considers the middle 50% of data.
Join 4M+ learners. Unlock unlimited quizzes, wrong-answer tracking, flashcards + reminders, study guides, and 1-on-1 challenges.