Fatskills
Practice. Master. Repeat.
Study Guide: Statistical Thinking in Science (Statistics)
Source: https://www.fatskills.com/crash-course/chapter/statistical-thinking-in-science-statistics

Statistical Thinking in Science (Statistics)

By Fatskills Exam Guides Team — the exam nerds behind 28,500+ quizzes and 2.1M practice questions across 500+ global exams.

⏱️ ~5 min read

Crash Course: Statistical Thinking in Science (Statistics)

Crash Course: Statistical Thinking in Science (Statistics)

Introduction Did you know that the average American eats about 18 pounds of cheese per year? That's a whole lotta cheddar! But what if I told you that the way we collect and analyze data about cheese consumption is rooted in a fascinating history of statistical thinking?

The Core Idea Statistical thinking is the process of using data to make informed decisions and understand the world around us. It's like being a detective, but instead of solving crimes, you're trying to figure out why people eat so much cheese (or not).

Key Facts & Figures

  • Ancient Greece: The concept of statistical thinking dates back to the 5th century BCE, when the Greek philosopher Aristotle used data to understand the natural world.
  • 17th century: John Graunt, an English statistician, published the first statistical analysis of mortality rates in London, which helped him understand the impact of the plague.
  • 18th century: Adam Smith used statistical data to argue for free trade and the benefits of economic growth.
  • 19th century: Charles Babbage, the father of computers, developed the first mechanical calculator to help with statistical calculations.
  • 1900s: Ronald Fisher, a British statistician, developed the concept of null hypothesis testing, which is still widely used today.
  • 1920s: Karl Pearson, a British statistician, developed the chi-squared test, which is used to determine the significance of data.
  • 1950s: C. West Churchman, an American statistician, developed the concept of operational research, which applies statistical thinking to real-world problems.
  • 1960s: George Box, a British statistician, developed the concept of robust statistics, which helps to reduce the impact of outliers on data analysis.
  • 1980s: Edward Tufte, an American statistician, developed the concept of data visualization, which helps to communicate complex data in a clear and concise way.
  • 1990s: Brad Efron, an American statistician, developed the concept of bootstrap sampling, which helps to estimate the variability of data.
  • 2000s: Hadley Wickham, a British statistician, developed the R programming language, which is widely used for statistical analysis.

Thought Bubble Imagine you're a detective trying to solve a mystery. You have a bunch of clues, but you're not sure what they mean. You start by collecting more data, like interviewing witnesses and analyzing physical evidence. But then you realize that some of the clues are fake or misleading. That's where statistical thinking comes in. You use statistical methods to filter out the noise and get to the truth. For example, let's say you're trying to figure out if a new restaurant is popular. You collect data on the number of customers each day, but you also notice that the data is skewed by a group of rowdy teenagers who come in every Friday night. You use statistical methods to account for this bias and get a more accurate picture of the restaurant's popularity.

Why This Matters

  • Science: Statistical thinking is essential for scientific research, as it helps to ensure that data is collected and analyzed in a way that is accurate and reliable.
  • Business: Statistical thinking is used in business to make informed decisions about investments, marketing, and product development.
  • Politics: Statistical thinking is used in politics to understand public opinion and make informed decisions about policy.
  • Healthcare: Statistical thinking is used in healthcare to understand the effectiveness of treatments and make informed decisions about patient care.
  • Environment: Statistical thinking is used in environmental science to understand the impact of human activity on the environment.
  • Economics: Statistical thinking is used in economics to understand the impact of economic policies on the economy.
  • Social Justice: Statistical thinking is used in social justice to understand the impact of policies on marginalized communities.

Crash Course Recap

  • Statistical thinking is the process of using data to make informed decisions and understand the world around us.
  • The concept of statistical thinking dates back to ancient Greece.
  • John Graunt published the first statistical analysis of mortality rates in London in the 17th century.
  • Ronald Fisher developed the concept of null hypothesis testing in the 20th century.
  • Karl Pearson developed the chi-squared test in the 20th century.
  • C. West Churchman developed the concept of operational research in the 20th century.
  • George Box developed the concept of robust statistics in the 20th century.
  • Edward Tufte developed the concept of data visualization in the 20th century.
  • Brad Efron developed the concept of bootstrap sampling in the 20th century.
  • Hadley Wickham developed the R programming language in the 21st century.
  • Statistical thinking is used in science, business, politics, healthcare, environment, economics, and social justice.
  • Statistical thinking helps to ensure that data is collected and analyzed in a way that is accurate and reliable.
  • Statistical thinking is essential for making informed decisions and understanding the world around us.

Quiz Yourself

  1. Who developed the concept of null hypothesis testing? a) Ronald Fisher b) Karl Pearson c) C. West Churchman d) George Box

Answer: a) Ronald Fisher

  1. What is the name of the programming language developed by Hadley Wickham? a) R b) Python c) Java d) C++

Answer: a) R

  1. Who published the first statistical analysis of mortality rates in London? a) John Graunt b) Adam Smith c) Charles Babbage d) Karl Pearson

Answer: a) John Graunt

  1. What is the name of the test developed by Karl Pearson? a) Chi-squared test b) T-test c) ANOVA d) Regression analysis

Answer: a) Chi-squared test

  1. What is the concept of robust statistics developed by George Box? a) The use of outliers to estimate data variability b) The use of data visualization to communicate complex data c) The use of statistical methods to reduce the impact of outliers on data analysis d) The use of bootstrap sampling to estimate data variability

Answer: c) The use of statistical methods to reduce the impact of outliers on data analysis