Fatskills
Practice. Master. Repeat.
Study Guide: How P-Values Help Us Test Hypotheses (Statistics)
Source: https://www.fatskills.com/crash-course/chapter/how-p-values-help-us-test-hypotheses-statistics

How P-Values Help Us Test Hypotheses (Statistics)

By Fatskills Exam Guides Team — the exam nerds behind 28,500+ quizzes and 2.1M practice questions across 500+ global exams.

⏱️ ~5 min read

Crash Course: How P-Values Help Us Test Hypotheses (Statistics)

How P-Values Help Us Test Hypotheses (Statistics)

Opening Hook

Imagine you're a detective trying to solve a murder mystery, but instead of clues, you have a bunch of random data points. That's basically what we're dealing with in statistics, and p-values are the ultimate clue to figuring out if our hypothesis is correct or not.

The Core Idea

P-values are a way to measure the probability that our observed data would occur by chance, given that our hypothesis is false. In other words, they help us determine if our results are due to a real effect or just a fluke.

Key Facts & Figures

  • The concept of p-values was first introduced by Karl Pearson in 1900, but it wasn't until the 1920s that it became a standard tool in statistics.
  • The term "p-value" was coined by Ronald Fisher in 1925, who used it to describe the probability of observing a result at least as extreme as the one we got, assuming that the null hypothesis is true.
  • P-values are usually expressed as a decimal value between 0 and 1, where 0 means the result is extremely unlikely to occur by chance, and 1 means it's extremely likely.
  • A common threshold for significance is a p-value of 0.05, which means that if we observe a result with a p-value of 0.05 or lower, we can reject the null hypothesis and conclude that our results are statistically significant.
  • However, p-values don't tell us the size of the effect, only whether it's statistically significant or not.
  • P-values can be misleading if we don't consider the sample size, because a small sample size can lead to a high p-value even if the effect is real.
  • The p-value is not a measure of the probability that the null hypothesis is true, but rather a measure of the probability of observing the data we got, assuming that the null hypothesis is true.
  • P-values are not the only way to test hypotheses, but they're a popular and widely used method.
  • The p-value is not a measure of the importance of the result, but rather a measure of its statistical significance.
  • P-values can be affected by multiple testing, where we're testing multiple hypotheses and the p-value becomes inflated.
  • P-values are not a substitute for scientific judgment, but rather a tool to help us make informed decisions.

Thought Bubble

Imagine you're a scientist studying the effect of a new medication on blood pressure. You collect data from 100 patients and find that the average blood pressure decrease is 10 mmHg. You want to know if this result is due to the medication or just a fluke. You calculate the p-value and get 0.01, which means that the probability of observing a result at least as extreme as the one you got, assuming that the medication has no effect, is 1%. You reject the null hypothesis and conclude that the medication has a statistically significant effect on blood pressure.

Why This Matters

  • P-values have been widely used in medicine to test the effectiveness of new treatments, but they've also been criticized for their limitations.
  • P-values have been used in social sciences to test hypotheses about human behavior, but they've also been criticized for their cultural bias.
  • P-values have been used in business to test the effectiveness of marketing campaigns, but they've also been criticized for their lack of generalizability.
  • P-values are not a substitute for replication, because a single study with a low p-value doesn't necessarily mean that the result will be replicated in future studies.
  • P-values are not a substitute for scientific skepticism, because we should always question the results and consider alternative explanations.
  • P-values are not a substitute for critical thinking, because we should always consider the context and limitations of the study.

Crash Course Recap

  • P-values are a way to measure the probability that our observed data would occur by chance, given that our hypothesis is false.
  • P-values are usually expressed as a decimal value between 0 and 1.
  • A common threshold for significance is a p-value of 0.05.
  • P-values don't tell us the size of the effect, only whether it's statistically significant or not.
  • P-values can be misleading if we don't consider the sample size.
  • P-values are not a measure of the probability that the null hypothesis is true.
  • P-values are not the only way to test hypotheses.
  • P-values are not a substitute for scientific judgment.
  • P-values can be affected by multiple testing.
  • P-values are not a substitute for replication.
  • P-values are not a substitute for scientific skepticism.

Quiz Yourself

  1. What is the purpose of a p-value? a) To measure the size of the effect b) To measure the probability that the null hypothesis is true c) To measure the probability that our observed data would occur by chance, given that our hypothesis is false d) To measure the importance of the result

Answer: c

  1. What is the common threshold for significance? a) 0.01 b) 0.05 c) 0.1 d) 0.2

Answer: b

  1. What is the problem with using p-values in multiple testing? a) They become inflated b) They become deflated c) They become unchanged d) They become irrelevant

Answer: a

  1. What is the difference between a statistically significant result and a practically significant result? a) A statistically significant result is always practically significant b) A practically significant result is always statistically significant c) A statistically significant result is not necessarily practically significant d) A practically significant result is not necessarily statistically significant

Answer: c

  1. Why is it important to consider the sample size when interpreting p-values? a) Because a small sample size can lead to a high p-value even if the effect is real b) Because a large sample size can lead to a low p-value even if the effect is not real c) Because a small sample size can lead to a low p-value even if the effect is not real d) Because a large sample size can lead to a high p-value even if the effect is real

Answer: a