By Fatskills Exam Guides Team — the exam nerds behind 28,500+ quizzes and 2.1M practice questions across 500+ global exams.
Simple Linear Regression is a statistical method that models the relationship between a dependent variable (y) and an independent variable (x) using a straight line. The equation is ŷ = b₀ + b₁x, where ŷ is the predicted value, b₀ is the y-intercept, and b₁ is the slope. This topic appears in exams because it tests your ability to understand and apply fundamental statistical concepts to real-world data. Questions typically involve calculating coefficients, interpreting their meaning, and applying the least squares method.
Simple linear regression is tested in various statistics and data analysis exams, including AP Statistics, GRE, and job interviews for data analyst roles. It frequently appears and can carry significant marks, typically around 10-15% of the total score. This topic tests your ability to perform statistical analysis, interpret data, and make predictions based on linear relationships.
The equation for simple linear regression is ŷ = b₀ + b₁x.
Imagine a scatter plot with a line of best fit. The line should pass through the center of the data points, minimizing the vertical distances (residuals) between the points and the line.
Intermediate
Question: Given the regression equation ŷ = 2 + 3x, what is the predicted value of y when x = 4?
Step-by-Step: 1. Substitute x = 4 into the equation: ŷ = 2 + 3(4) 2. Calculate: ŷ = 2 + 12 = 14
Answer: 14
Question: You have the following data points: (1, 2), (2, 3), (3, 4). Find the slope (b₁) of the regression line.
Step-by-Step: 1. Calculate the mean of x and y: μₓ = (1+2+3)/3 = 2, μᵧ = (2+3+4)/3 = 3 2. Use the least squares formula for b₁: b₁ = ∑(xᵢ - μₓ)(yᵢ - μᵧ) / ∑(xᵢ - μₓ)² 3. Calculate: b₁ = [(1-2)(2-3) + (2-2)(3-3) + (3-2)(4-3)] / [(1-2)² + (2-2)² + (3-2)²] 4. Simplify: b₁ = [(-1)(-1) + 0 + 1] / [1 + 0 + 1] = 2/2 = 1
Answer: 1
Question: Given the data points (1, 3), (2, 5), (3, 7), find the regression equation ŷ = b₀ + b₁x.
Step-by-Step: 1. Calculate the mean of x and y: μₓ = (1+2+3)/3 = 2, μᵧ = (3+5+7)/3 = 5 2. Use the least squares formula for b₁: b₁ = ∑(xᵢ - μₓ)(yᵢ - μᵧ) / ∑(xᵢ - μₓ)² 3. Calculate: b₁ = [(1-2)(3-5) + (2-2)(5-5) + (3-2)(7-5)] / [(1-2)² + (2-2)² + (3-2)²] 4. Simplify: b₁ = [(-1)(-2) + 0 + 1(2)] / [1 + 0 + 1] = 4/2 = 2 5. Use the formula for b₀: b₀ = μᵧ - b₁μₓ = 5 - 2(2) = 1
Answer: ŷ = 1 + 2x
Correct Approach: b₀ is the value of y when x=0.
Mistake: Not understanding the least squares method.
Correct Approach: b₁ = ∑(xᵢ - μₓ)(yᵢ - μᵧ) / ∑(xᵢ - μₓ)²
Mistake: Incorrectly interpreting the slope.
Correct Approach: b₁ indicates the change in y for a one-unit change in x.
Mistake: Ignoring the residuals.
Example: What is the slope of the regression line for the data points (1, 2), (2, 4), (3, 6)?
Short Answer: Often seen in university exams.
Example: Calculate the y-intercept for the regression line given the slope is 2 and the mean of x is 3, mean of y is 8.
Data Interpretation: Frequent in job interviews and practical exams.
Question: What is the slope (b₁) of the regression line for the data points (1, 3), (2, 5), (3, 7)? - A) 1- B) 2- C) 3- D) 4
Correct Answer: B) 2
Explanation: The slope b₁ is calculated using the least squares method, which gives b₁ = 2.
Why the Distractors Are Tempting: - A) 1: Might confuse with the change in y for each point.- C) 3: Might think it's the total change in y.- D) 4: Might miscalculate the sum of squares.
Question: If the regression equation is ŷ = 4 + 2x, what is the predicted value of y when x = 5? - A) 10- B) 12- C) 14- D) 16
Correct Answer: C) 14
Explanation: Substitute x = 5 into the equation: ŷ = 4 + 2(5) = 14.
Why the Distractors Are Tempting: - A) 10: Might miscalculate the multiplication.- B) 12: Might forget to add the y-intercept.- D) 16: Might add the y-intercept twice.
Question: Which of the following is NOT a characteristic of the least squares method? - A) Minimizes the sum of squared residuals- B) Always passes through the mean of x and y- C) The slope is always positive- D) The residuals are randomly distributed
Correct Answer: C) The slope is always positive
Explanation: The slope can be negative, positive, or zero depending on the data.
Why the Distractors Are Tempting: - A) Minimizes the sum of squared residuals: True characteristic.- B) Always passes through the mean of x and y: True characteristic.- D) The residuals are randomly distributed: True characteristic.
Question: If the data points are (1, 2), (2, 3), (3, 4), what is the y-intercept (b₀) of the regression line? - A) 0- B) 1- C) 2- D) 3
Correct Answer: B) 1
Explanation: Using the least squares method, b₀ is calculated to be 1.
Why the Distractors Are Tempting: - A) 0: Might think the line passes through the origin.- C) 2: Might confuse with the first y-value.- D) 3: Might miscalculate the mean of y.
Question: What does a slope of -2 in the regression equation ŷ = 3 - 2x indicate? - A) For every unit increase in x, y increases by 2- B) For every unit increase in x, y decreases by 2- C) The y-intercept is -2- D) The regression line is horizontal
Correct Answer: B) For every unit increase in x, y decreases by 2
Explanation: A negative slope indicates an inverse relationship.
Why the Distractors Are Tempting: - A) For every unit increase in x, y increases by 2: Incorrect interpretation of negative slope.- C) The y-intercept is -2: Confuses slope with y-intercept.- D) The regression line is horizontal: Incorrect understanding of slope.
Join 4M+ learners. Unlock unlimited quizzes, wrong-answer tracking, flashcards + reminders, study guides, and 1-on-1 challenges.