Fatskills
Practice. Master. Repeat.
Study Guide: Correlation Doesn't Equal Causation (Statistics)
Source: https://www.fatskills.com/crash-course/chapter/correlation-doesnt-equal-causation-statistics

Correlation Doesn't Equal Causation (Statistics)

By Fatskills Exam Guides Team — the exam nerds behind 28,500+ quizzes and 2.1M practice questions across 500+ global exams.

⏱️ ~6 min read

Crash Course: Correlation Doesn't Equal Causation (Statistics)

Crash Course: Correlation Doesn't Equal Causation

Introduction Did you know that a study found a strong correlation between the number of ice cream sales and the number of drownings in the US? Sounds crazy, right? But here's the thing: correlation doesn't equal causation. Let's dive into the world of statistics and figure out what's really going on.

The Core Idea Correlation doesn't equal causation is a fundamental concept in statistics that means just because two things are related, it doesn't mean one causes the other. Think of it like this: just because you wear your lucky socks and win at Mario Kart, it doesn't mean the socks are actually causing you to win. There might be other factors at play.

Key Facts & Figures

  • Ancient Greece: The concept of correlation was first discussed by the Greek philosopher Aristotle in the 4th century BCE.
  • 19th century: The term "correlation" was coined by Sir Francis Galton in 1888.
  • Galton's work: Galton studied the relationship between height and other physical characteristics, but he also realized that correlation doesn't equal causation.
  • The Ice Cream Study: In the 1990s, a study found a strong correlation between ice cream sales and drowning rates in the US. But, as we'll see, there's a more plausible explanation.
  • The Ice Cream Study's flaw: The study didn't account for the fact that both ice cream sales and drowning rates tend to increase during hot summer months.
  • The Ice Cream Study's lesson: This study shows how correlation can be misleading, but it's not a unique example. Many studies have found correlations that don't hold up to closer inspection.
  • The concept of confounding variables: A confounding variable is a factor that affects both variables in a study, making it difficult to determine causation.
  • The example of smoking and lung cancer: In the 1950s, studies found a strong correlation between smoking and lung cancer. But, it wasn't until later that the actual cause (tobacco smoke) was identified.
  • The example of the "French Paradox": In the 1990s, studies found a correlation between high levels of saturated fat consumption in France and low rates of heart disease. But, it turned out that the French were consuming a lot of antioxidants, which might have been the real reason for the low heart disease rates.
  • The concept of reverse causality: Reverse causality occurs when the effect is actually the cause. For example, if you're feeling tired, you might be more likely to eat a snack, but the snack isn't causing your tiredness.
  • The concept of selection bias: Selection bias occurs when the sample is not representative of the population. For example, if you only study people who have already developed a disease, you might find a correlation between a particular factor and the disease, but it's not a causal relationship.

Thought Bubble Imagine you're a detective trying to solve a mystery. You notice that every time it rains, there's a surge in sales of umbrellas. You might think, "Ah-ha! Rain causes umbrella sales!" But, what if the real reason is that people are more likely to buy umbrellas when they see other people buying them? Or what if the rain is just a coincidence, and the real reason for the umbrella sales is the fact that it's a weekend and people are out and about? You see, correlation doesn't equal causation, and it's up to us to dig deeper and find the real explanation.

Why This Matters

  • Understanding correlation doesn't equal causation is crucial in medicine: Many medical studies have found correlations that don't hold up to closer inspection, leading to incorrect conclusions and treatments.
  • Correlation doesn't equal causation is essential in business: Companies often use statistical analysis to make decisions, but if they don't account for correlation, they might make incorrect conclusions.
  • Correlation doesn't equal causation is vital in science: Scientists often use statistical analysis to test hypotheses, but if they don't account for correlation, they might draw incorrect conclusions.
  • Correlation doesn't equal causation is important in everyday life: We often make decisions based on correlations, but if we don't account for correlation, we might make incorrect conclusions.
  • Correlation doesn't equal causation is a fundamental concept in statistics: It's essential to understand this concept to avoid making incorrect conclusions based on statistical analysis.
  • Correlation doesn't equal causation is a key concept in data science: Data scientists often use statistical analysis to make decisions, but if they don't account for correlation, they might make incorrect conclusions.
  • Correlation doesn't equal causation is a crucial concept in economics: Economists often use statistical analysis to make decisions, but if they don't account for correlation, they might make incorrect conclusions.

Crash Course Recap

  • Correlation doesn't equal causation is a fundamental concept in statistics.
  • The concept was first discussed by Aristotle in ancient Greece.
  • Sir Francis Galton coined the term "correlation" in the 19th century.
  • The ice cream study is a classic example of how correlation can be misleading.
  • Confounding variables can affect the results of a study.
  • Reverse causality occurs when the effect is actually the cause.
  • Selection bias occurs when the sample is not representative of the population.
  • Understanding correlation doesn't equal causation is crucial in medicine, business, science, and everyday life.
  • Correlation doesn't equal causation is a fundamental concept in statistics, data science, and economics.

Quiz Yourself

  1. What is the concept of correlation, and why is it important to understand? a) Correlation is the relationship between two variables, and it's essential to understand it to avoid making incorrect conclusions. b) Correlation is the relationship between two variables, but it's not essential to understand it. c) Correlation is the relationship between two variables, and it's only important in certain fields.

Answer: a) Correlation is the relationship between two variables, and it's essential to understand it to avoid making incorrect conclusions.

  1. Who coined the term "correlation"? a) Aristotle b) Sir Francis Galton c) Galileo

Answer: b) Sir Francis Galton

  1. What is the concept of confounding variables? a) A factor that affects both variables in a study, making it difficult to determine causation. b) A factor that affects only one variable in a study. c) A factor that has no effect on either variable in a study.

Answer: a) A factor that affects both variables in a study, making it difficult to determine causation.

  1. What is the concept of reverse causality? a) When the effect is actually the cause. b) When the cause is actually the effect. c) When there is no causality.

Answer: a) When the effect is actually the cause.

  1. What is the concept of selection bias? a) When the sample is not representative of the population. b) When the sample is representative of the population. c) When there is no bias.

Answer: a) When the sample is not representative of the population.