Fatskills
Practice. Master. Repeat.
Study Guide: AP Psychology – Operant Conditioning (Skinner, Reinforcement, Punishment, Schedules)
Source: https://www.fatskills.com/ap-psychology/chapter/ap-topic-guides-ap-psychology-operant-conditioning-skinner-reinforcement-punishment-schedules

AP Psychology – Operant Conditioning (Skinner, Reinforcement, Punishment, Schedules)

By Fatskills Exam Guides Team — the exam nerds behind 28,500+ quizzes and 2.1M practice questions across 500+ global exams.

⏱️ ~5 min read

AP Psychology – Operant Conditioning (Skinner, Reinforcement, Punishment, Schedules)

AP Psychology: Operant Conditioning Study Guide

(Skinner, Reinforcement, Punishment, Schedules)


What This Is

Operant conditioning is a type of learning where behavior is strengthened or weakened by reinforcement (rewards) or punishment. Unlike classical conditioning (which pairs stimuli), operant conditioning focuses on voluntary behaviors and their consequences. This is a major topic on the AP exam—expect multiple-choice questions and FRQs on reinforcement schedules, Skinner’s experiments, and real-world applications (e.g., parenting, education, therapy). Example: B.F. Skinner’s "Skinner box" trained rats to press a lever for food (reinforcement) or avoid a lever to escape a shock (punishment).


Key Terms & Concepts

  • Operant Conditioning: Learning where behavior is controlled by consequences (reinforcement or punishment).
  • Law of Effect (Thorndike): Behaviors followed by favorable consequences become more likely; behaviors followed by unfavorable consequences become less likely.
  • Reinforcement: Any consequence that increases the likelihood of a behavior.
  • Positive Reinforcement: Adding a desirable stimulus (e.g., giving a dog a treat for sitting).
  • Negative Reinforcement: Removing an aversive stimulus (e.g., fastening a seatbelt to stop the car’s beeping).
  • Punishment: Any consequence that decreases the likelihood of a behavior.
  • Positive Punishment: Adding an aversive stimulus (e.g., spanking a child for misbehaving).
  • Negative Punishment: Removing a desirable stimulus (e.g., taking away a teen’s phone for breaking curfew).
  • Shaping: Reinforcing successive approximations of a desired behavior (e.g., rewarding a rat for moving closer to a lever before pressing it).
  • Primary Reinforcer: Innately satisfying (e.g., food, water).
  • Secondary (Conditioned) Reinforcer: Learned value (e.g., money, grades).
  • Reinforcement Schedules: Rules for delivering reinforcement.
  • Continuous Reinforcement: Reinforcing every correct response (fast learning, fast extinction).
  • Partial (Intermittent) Reinforcement: Reinforcing some responses (slower learning, slower extinction).
    • Fixed-Ratio (FR): Reinforcement after a set number of responses (e.g., buy 10 coffees, get 1 free).
    • Variable-Ratio (VR): Reinforcement after an unpredictable number of responses (e.g., slot machines).
    • Fixed-Interval (FI): Reinforcement after a set time (e.g., weekly paycheck).
    • Variable-Interval (VI): Reinforcement after unpredictable time (e.g., pop quizzes).
  • Extinction (Operant): Behavior decreases when reinforcement stops (e.g., a child stops whining if ignored).
  • Spontaneous Recovery: A previously extinguished behavior briefly reappears after a rest period.

Step-by-Step: Applying Operant Conditioning

Scenario: You’re training a dog to "sit" on command. How would you use operant conditioning?

  1. Identify the Target Behavior: Clearly define what you want (e.g., "dog’s bottom touches the ground").
  2. Choose a Reinforcer: Pick something the dog values (e.g., treats, praise).
  3. Shape the Behavior:
  4. Reward small steps (e.g., dog looks at you-reward; dog lowers head-reward; dog sits-big reward).
  5. Select a Reinforcement Schedule:
  6. Start with continuous reinforcement (reward every sit) for fast learning.
  7. Switch to variable-ratio (reward randomly) to maintain the behavior long-term.
  8. Add a Cue: Pair the behavior with a command ("Sit!") and reward only when the cue is given.
  9. Generalize the Behavior: Practice in different locations to ensure the dog responds everywhere.

Common Mistakes

  • Mistake: Confusing negative reinforcement with punishment.
  • Correction: Negative reinforcement removes an aversive stimulus to increase behavior (e.g., taking aspirin to relieve a headache). Punishment decreases behavior.

  • Mistake: Assuming all reinforcement is "good" and all punishment is "bad."

  • Correction: Reinforcement/punishment are defined by their effect on behavior, not morality. A "time-out" (negative punishment) can be effective parenting.

  • Mistake: Forgetting that variable schedules resist extinction better than fixed schedules.

  • Correction: Slot machines (variable-ratio) keep people playing longer than a vending machine (fixed-ratio).

  • Mistake: Ignoring biological constraints (e.g., trying to reinforce a behavior an animal can’t physically do).

  • Correction: Pigeons can’t be reinforced to "sing" like a bird, but they can peck for food.

  • Mistake: Overusing punishment without reinforcement.

  • Correction: Punishment alone doesn’t teach what to do—pair it with reinforcement for desired behaviors (e.g., punish hitting but reward sharing).

AP Exam Insights

  1. FRQ Hot Topic: Expect a question like:
  2. "Explain how a teacher could use operant conditioning to increase student participation. Include two specific reinforcement schedules and one type of punishment."
  3. Key: Name specific schedules (e.g., variable-ratio for random praise) and types of punishment (e.g., negative punishment = losing recess).

  4. Multiple-Choice Traps:

  5. Tricky Distinction: "Negative reinforcement vs. punishment" (see Common Mistakes).
  6. Schedule Confusion: Variable-ratio (unpredictable responses) vs. variable-interval (unpredictable time).

  7. Real-World Applications:

  8. Parenting: Time-outs (negative punishment) vs. praise (positive reinforcement).
  9. Gambling: Variable-ratio schedules explain addiction.
  10. Workplace: Bonuses (fixed-ratio) vs. performance reviews (variable-interval).

  11. Skinner’s Legacy: Know his radical behaviorism (only observable behaviors matter) and how it contrasts with cognitive learning (e.g., latent learning, insight).


Quick Check Questions

  1. Multiple Choice: A parent takes away a teen’s video games for breaking curfew. This is an example of: a) Positive reinforcement b) Negative reinforcement c) Positive punishment d) Negative punishment Answer: d) Negative punishment (removing a desirable stimulus to decrease behavior).

  2. Short FRQ: "A rat in a Skinner box receives a food pellet every 5 minutes, regardless of its behavior. Identify the reinforcement schedule and explain why this schedule leads to slow, steady responding." Answer: Fixed-interval schedule. The rat learns that food comes after a set time, so it responds more as the interval nears (e.g., "scalloped" response pattern).

  3. Multiple Choice: Which reinforcement schedule is most resistant to extinction? a) Fixed-ratio b) Variable-ratio c) Fixed-interval d) Continuous Answer: b) Variable-ratio (unpredictable rewards create persistent behavior, like gambling).


Last-Minute Cram Sheet

  1. Operant conditioning = behavior + consequences (Skinner).
  2. Reinforcement increases behavior; punishment decreases it.
  3. Positive = add; Negative = remove.
  4. Primary reinforcers = innate (food); Secondary = learned (money).
  5. Schedules:
  6. Fixed-ratio: "Buy 10, get 1 free."
  7. Variable-ratio: Slot machines (most resistant to extinction).
  8. Fixed-interval: Weekly paycheck.
  9. Variable-interval: Pop quizzes.
  10. Shaping = reinforcing small steps toward a goal.
  11. Extinction = behavior stops when reinforcement stops.
  12. Negative reinforcement-punishment!
  13. Skinner’s box: rats/pigeons + levers/buttons.
  14. Real-world apps: parenting, education, therapy (e.g., token economies).