By Fatskills Exam Guides Team — the exam nerds behind 28,500+ quizzes and 2.1M practice questions across 500+ global exams.
The p-value is the probability of obtaining a test statistic as extreme or more extreme than the one observed, assuming the null hypothesis (H?) is true. It helps determine whether results are statistically significant—meaning they’re unlikely to have occurred by random chance. On the AP exam, you’ll use p-values to make decisions about hypotheses (e.g., "Does a new teaching method improve test scores?" or "Is the proportion of defective smartphones in a factory higher than 5%?"). Misinterpreting p-values is a common mistake, so mastering this concept is critical for both multiple-choice and FRQs.
STAT-TESTS-5:1-PropZTest
STAT-TESTS-2:T-Test
2nd-VARS-2:normalcdf(lower, upper, ?, ?)
2nd-VARS-6:tcdf(lower, upper, df)
Follow these steps for any hypothesis test on the AP exam:
Example: Testing if a coin is fair (H?: p = 0.5, H?: p-0.5).
Check Conditions
Random: Data must come from a random sample/experiment.
Compute Test Statistic
Example: For a proportion, use 1-PropZTest; for a mean, use T-Test.
1-PropZTest
T-Test
Find p-value
normalcdf
tcdf
For two-sided tests, double the p-value from the calculator.
Make a Conclusion
If p-value > ?: "Fail to reject H?. There is not convincing evidence that [H? in context]."
Interpret in Context
Correction: The p-value is the probability of the data (or more extreme) assuming H? is true, not the probability of H? itself.
Mistake: Forgetting to double the p-value for two-sided tests.
Correction: For H?:-? , multiply the one-sided p-value by 2 (or use T-Test with ?:?).
?:?
Mistake: Using a z-test for means when the population SD is unknown.
Correction: Use a t-test when-is unknown (unless n-30, where z is acceptable).
Mistake: Ignoring the 10% condition for independence.
Correction: If sampling without replacement, check n-0.10N (where N = population size).
Mistake: Confusing "statistically significant" with "practically important."
p-value vs. ?: The p-value is calculated from data;-is set before the test.
Common FRQ Setups:
"Is the machine producing bags with a mean weight different from 50 lbs?" (Two-sided t-test for ?).
Calculator Pitfalls:
Answer: (A) The p-value (0.03) is less than? (0.05), so we reject H?.
Answer: - Step 1: H?: p = 0.05, H?: p > 0.05. - Step 2: BINS: Binary (defective/not), Independent (200-10% of all bulbs), np? = 10-10, n(1–p?) = 190-10. - Step 3: p? = 16/200 = 0.08; z = (0.08 – 0.05)/?(0.050.95/200)-1.95; p-value = normalcdf(1.95, 1E99, 0, 1)-0.0256. - Step 4:* Since 0.0256 < 0.05, reject H?. There is convincing evidence that the defect rate is higher than 5%.
normalcdf(1.95, 1E99, 0, 1)-0.0256
Join 4M+ learners. Unlock unlimited quizzes, wrong-answer tracking, flashcards + reminders, study guides, and 1-on-1 challenges.