Fatskills
Practice. Master. Repeat.
Study Guide: Why Small Samples Mislead You (Statistics)
Source: https://www.fatskills.com/crash-course/chapter/why-small-samples-mislead-you-statistics

Why Small Samples Mislead You (Statistics)

By Fatskills Exam Guides Team — the exam nerds behind 28,500+ quizzes and 2.1M practice questions across 500+ global exams.

⏱️ ~6 min read

Crash Course: Why Small Samples Mislead You (Statistics)

Why Small Samples Mislead You (Statistics)

Introduction Did you know that a single, seemingly insignificant event can change the course of history? Think of the butterfly effect, but in statistics. A small sample size can lead to a massive misinterpretation of data, and it's more common than you think.

The Core Idea In statistics, a small sample size can lead to biased or misleading results, which can have serious consequences in fields like medicine, politics, and business. This is because small samples often don't accurately represent the larger population, leading to incorrect conclusions.

Key Facts & Figures

  • The Law of Large Numbers: In 1713, mathematician Abraham de Moivre discovered that as the sample size increases, the average of the sample will converge to the population mean. ⚠️
  • The Central Limit Theorem: In 1733, mathematician de Moivre also discovered that the distribution of sample means will approach a normal distribution as the sample size increases.
  • The Dunning-Kruger Effect: In 1999, psychologists David Dunning and Justin Kruger found that people who are incompetent in a particular domain tend to overestimate their own performance and abilities.
  • The Monty Hall Problem: In 1966, game show host Monty Hall presented a probability puzzle that has since become a classic example of the dangers of small sample sizes.
  • The 2016 US Presidential Election: A small sample size of voters in key battleground states led to a misinterpretation of the election results, which was later corrected when more data became available.
  • The 2008 Financial Crisis: A small sample size of mortgage defaults led to a misinterpretation of the risk of subprime lending, which contributed to the crisis.
  • The Challenger Space Shuttle Disaster: In 1986, a small sample size of faulty O-rings led to a catastrophic failure of the space shuttle's solid rocket boosters.
  • The 2011 Japanese Earthquake: A small sample size of seismic data led to a misinterpretation of the earthquake's severity, which delayed the evacuation of the Fukushima Daiichi nuclear power plant.
  • The 1994 Rwandan Genocide: A small sample size of data on the number of Hutu and Tutsi populations led to a misinterpretation of the genocide's severity and scope.
  • The 1970s Watergate Scandal: A small sample size of evidence led to a misinterpretation of the extent of President Nixon's involvement in the scandal.

Thought Bubble Imagine you're a detective trying to solve a murder mystery. You collect a small sample of fingerprints from the crime scene and match them to a suspect. But what if the suspect is innocent, and the fingerprints are just a coincidence? Or what if the fingerprints are from a different crime scene altogether? You'd be making a huge mistake by relying on a small sample size. In statistics, this is called a Type I error.

Let's say you're a doctor trying to develop a new medicine. You collect a small sample of patients and find that the medicine works for 90% of them. But what if the sample size is too small to accurately represent the larger population? You might be making a huge mistake by approving the medicine for widespread use. In statistics, this is called a Type II error.

Why This Matters

  • Misleading results: Small sample sizes can lead to misleading results, which can have serious consequences in fields like medicine, politics, and business.
  • Biased conclusions: Small sample sizes can lead to biased conclusions, which can perpetuate stereotypes and reinforce existing power structures.
  • Lack of generalizability: Small sample sizes can lead to a lack of generalizability, which can make it difficult to apply findings to larger populations.
  • Increased risk of errors: Small sample sizes can increase the risk of errors, which can have serious consequences in fields like medicine and finance.
  • Decreased confidence: Small sample sizes can decrease confidence in results, which can make it difficult to make informed decisions.
  • Increased costs: Small sample sizes can increase costs, which can be a major burden for businesses and organizations.
  • Decreased accuracy: Small sample sizes can decrease accuracy, which can have serious consequences in fields like medicine and finance.

Crash Course Recap

  • Small sample sizes can lead to biased or misleading results.
  • The Law of Large Numbers states that as the sample size increases, the average of the sample will converge to the population mean.
  • The Central Limit Theorem states that the distribution of sample means will approach a normal distribution as the sample size increases.
  • The Dunning-Kruger Effect states that people who are incompetent in a particular domain tend to overestimate their own performance and abilities.
  • The Monty Hall Problem is a classic example of the dangers of small sample sizes.
  • A small sample size of voters in key battleground states led to a misinterpretation of the 2016 US Presidential Election results.
  • A small sample size of mortgage defaults led to a misinterpretation of the risk of subprime lending in the 2008 Financial Crisis.
  • A small sample size of faulty O-rings led to a catastrophic failure of the Challenger Space Shuttle's solid rocket boosters in 1986.
  • A small sample size of seismic data led to a misinterpretation of the severity of the 2011 Japanese Earthquake.
  • A small sample size of data on the number of Hutu and Tutsi populations led to a misinterpretation of the severity and scope of the 1994 Rwandan Genocide.
  • A small sample size of evidence led to a misinterpretation of the extent of President Nixon's involvement in the 1970s Watergate Scandal.

Quiz Yourself

  1. What is the Law of Large Numbers? a) The average of the sample will converge to the population mean as the sample size increases. b) The distribution of sample means will approach a normal distribution as the sample size increases. c) The sample size will always be too small to accurately represent the larger population.

Answer: a) The average of the sample will converge to the population mean as the sample size increases.

  1. What is the Dunning-Kruger Effect? a) People who are competent in a particular domain tend to overestimate their own performance and abilities. b) People who are incompetent in a particular domain tend to overestimate their own performance and abilities. c) People who are neither competent nor incompetent tend to overestimate their own performance and abilities.

Answer: b) People who are incompetent in a particular domain tend to overestimate their own performance and abilities.

  1. What is the Monty Hall Problem? a) A classic example of the dangers of small sample sizes. b) A classic example of the dangers of large sample sizes. c) A classic example of the dangers of biased conclusions.

Answer: a) A classic example of the dangers of small sample sizes.

  1. What is a Type I error? a) A false positive result. b) A false negative result. c) A Type II error.

Answer: a) A false positive result.

  1. What is a Type II error? a) A false positive result. b) A false negative result. c) A failure to detect a true effect.

Answer: c) A failure to detect a true effect.