By Fatskills Exam Guides Team — the exam nerds behind 28,500+ quizzes and 2.1M practice questions across 500+ global exams.
Descriptive statistics summarize and describe the features of a dataset using numbers, tables, or visualizations. You use it to quickly understand patterns, trends, and distributions in data—whether analyzing sales figures, survey responses, or sensor readings.
Without descriptive statistics, raw data is just noise. It helps businesses: - Spot trends (e.g., monthly revenue growth). - Compare groups (e.g., customer segments by age). - Detect outliers (e.g., fraudulent transactions). - Communicate insights clearly (e.g., dashboards for executives).
Tell you where the "center" of your data lies. - Mean (Average): Sum of values divided by count. Sensitive to outliers. python mean = sum(data) / len(data) - Median: Middle value when data is sorted. Robust to outliers. - Mode: Most frequent value. Works for categorical data.
python mean = sum(data) / len(data)
Show how spread out your data is. - Range: Max value – Min value. Simple but ignores distribution. - Variance: Average squared distance from the mean. Hard to interpret alone. python variance = sum((x - mean) 2 for x in data) / len(data) - Standard Deviation (?): Square root of variance. Same units as data. - Interquartile Range (IQR): Q3 (75th percentile) – Q1 (25th percentile). Ignores outliers.
python variance = sum((x - mean) 2 for x in data) / len(data)
import pandas as pd import matplotlib.pyplot as plt # Load data data = pd.read_csv("sales_data.csv") # Compute metrics print("Mean:", data["revenue"].mean()) print("Median:", data["revenue"].median()) print("Standard Deviation:", data["revenue"].std()) # Visualize data["revenue"].hist(bins=20) plt.title("Revenue Distribution") plt.show()
Expected Outcome: - Numeric summary (mean, median, standard deviation). - Histogram showing revenue distribution.
A dataset has a mean of 50 and a median of 45. What does this suggest? A) The data is symmetric. B) The data is left-skewed. C) The data is right-skewed. D) There are no outliers.
Correct Answer: C) The data is right-skewed. Explanation: When the mean > median, the distribution has a long tail on the right. Why the Distractors Are Tempting: - A) Symmetric data has mean-median. - B) Left-skewed data has mean < median. - D) Outliers can cause skewness, but skewness doesn’t always mean outliers.
You’re analyzing customer ages and find a standard deviation of 20 years. What does this tell you? A) Most customers are 20 years apart in age. B) The average distance from the mean age is 20 years. C) The age range is 20 years. D) The data is normally distributed.
Correct Answer: B) The average distance from the mean age is ~20 years. Explanation: Standard deviation measures spread around the mean, not range. Why the Distractors Are Tempting: - A) Misinterprets standard deviation as a fixed gap. - C) Confuses standard deviation with range. - D) Standard deviation alone doesn’t imply normality.
Which visualization is best for comparing distributions of two groups? A) Scatter plot B) Box plot C) Pie chart D) Line graph
Correct Answer: B) Box plot. Explanation: Box plots show median, quartiles, and outliers for multiple groups. Why the Distractors Are Tempting: - A) Scatter plots show relationships, not distributions. - C) Pie charts compare parts of a whole, not distributions. - D) Line graphs show trends over time, not group comparisons.
Join 4M+ learners. Unlock unlimited quizzes, wrong-answer tracking, flashcards + reminders, study guides, and 1-on-1 challenges.