By Fatskills Exam Guides Team — the exam nerds behind 28,500+ quizzes and 2.1M practice questions across 500+ global exams.
A scatterplot is a graphical display of the relationship between two quantitative variables. It helps us assess direction (positive/negative), form (linear/nonlinear), strength (weak/moderate/strong), and unusual features (outliers, clusters, influential points). On the AP exam, you’ll analyze scatterplots to describe associations, check conditions for regression, and interpret correlation. Real-world example: A real estate agent wants to predict home prices based on square footage—does a larger home generally cost more, and how strong is that relationship?
LinReg(ax+b)
STAT-CALC-4
DiagnosticOn
2nd-0-CATALOG
STAT PLOT
2nd-Y=
How to analyze a scatterplot on the AP exam (FRQ or MC):
Unusual features: Outliers, clusters, or influential points?
Check for linearity (if regression is involved):
Does the scatterplot look roughly linear? If not, linear regression may not be appropriate.
Interpret correlation (r):
Example: "There is a strong, positive, linear relationship between study time and test scores (r = 0.85)."
Identify unusual points:
Influential points: Points that change the regression line significantly if removed.
Avoid causation claims:
Mistake: Saying a relationship is "strong" just because it’s linear. Correction: Strength depends on how tightly points cluster around the line, not just the form. Use r to quantify strength.
Mistake: Ignoring nonlinear patterns and forcing a linear interpretation. Correction: If the scatterplot curves, don’t use linear regression—mention the nonlinear form instead.
Mistake: Confusing r with slope. Correction: r measures strength and direction, while slope (b) measures the rate of change in y per unit x.
Mistake: Claiming causation from correlation. Correction: Correlation-causation! Always mention lurking variables (e.g., "Ice cream sales and drowning deaths are correlated, but hot weather is a lurking variable.").
Mistake: Forgetting to check for influential points. Correction: Always look for points that could distort the regression line—especially if they’re far from the rest of the data in the x-direction.
"Interpret the correlation coefficient in context."
Tricky distinctions:
Outliers vs. influential points: All influential points are outliers, but not all outliers are influential.
Calculator pitfalls:
LinReg
Answer: (B) A linear model is not appropriate for these data. Explanation: The scatterplot is curved, so linear regression shouldn’t be used.
Answer: (a) There is a moderate, negative, linear relationship between hours of sleep and test scores. As sleep increases, test scores tend to decrease (or vice versa). (b) No, correlation does not imply causation. There could be lurking variables (e.g., stress, study habits) affecting both sleep and test scores.
Good luck—you’ve got this! ?
Join 4M+ learners. Unlock unlimited quizzes, wrong-answer tracking, flashcards + reminders, study guides, and 1-on-1 challenges.