Why Your Brain Can’t Stop Scrolling
Feeds don’t hook you with constant pleasure—they hook you with uncertainty. Here’s the science of variable rewards and how to disrupt the loop.

Key Points
- 1Recognize intermittent reinforcement: unpredictable “wins” make scrolling persist, because “no reward yet” never means “no reward coming.”
- 2Understand rewards broadly: social feedback, information, emotion, and novelty reinforce checking—even when the experience feels negative or stressful.
- 3Disrupt the loop strategically: restore endings, reduce notifications and cues, track daily checks, and replace quick hits with chosen alternatives.
Your thumb makes a small, forgettable motion. Up the screen goes another post, another video, another argument, another joke. Most of it is mediocre. Some of it is magnetic.
The strange part is that the mediocre parts aren’t a failure of the system. They’re a feature. A feed that paid out every time would quickly become predictable, and predictable is easy to put down.
What keeps people scrolling isn’t constant pleasure. It’s uncertainty—an engineered maybe. One more swipe might bring something genuinely useful, socially flattering, or emotionally activating. The brain is built to learn from that kind of pattern, and platforms are built to supply it.
A feed doesn’t need to be good. It needs to be intermittently good.
— — TheMurrow Editorial
The psychology behind the scroll: variable rewards
Operant conditioning research describes reinforcement schedules along two axes: fixed vs. variable, and ratio vs. interval. A variable-ratio schedule provides reward after an unpredictable number of actions. Classic example: gambling. The payoff might come on the next pull, or the next, or the next. That uncertainty drives “very high and persistent responding,” as psychology texts put it.
Feeds aren’t slot machines in a literal, one-to-one way. No platform sets a clean “reward after N swipes” schedule. Yet the experience maps cleanly onto variable reinforcement: many swipes yield nothing special, and then—suddenly—something hits.
A key concept here is the intermittent reinforcement effect. Behaviors rewarded intermittently resist extinction better than behaviors rewarded continuously, partly because the absence of reward is ambiguous. When a behavior always pays, a dry spell signals the reward is gone. When a behavior pays unpredictably, “no reward yet” doesn’t mean “no reward coming.”
That ambiguity is sticky. It helps explain why quitting a feed feels less like closing a book and more like walking away from a conversation where something might happen any second.
Why unpredictability beats quality
- Uncertain
- Occasional
- Connected to the action (scroll, refresh, check)
A feed that is reliably decent can be less habit-forming than one that is wildly inconsistent. The latter keeps your brain guessing—and guessing keeps you moving.
Intermittent rewards don’t just encourage repetition. They make stopping feel premature.
— — TheMurrow Editorial
What counts as a “reward” on social media?
A rewarding post might be hilarious. It might also be enraging, affirming, or socially clarifying. Sometimes it’s simply new. Often it’s a signal: you are included, you are seen, you are informed.
On platforms, rewards commonly arrive in four overlapping forms:
- Social rewards: likes, comments, new followers, replies
- Informational rewards: breaking news, a useful link, an answer, a “how-to”
- Emotional rewards: humor, outrage, belonging, envy, tenderness
- Novelty rewards: the sensation of newness itself
A large computational study published in Nature Communications analyzed over one million posts from more than 4,000 individuals and found that social media behavior can conform to reinforcement-learning principles. In other words, users appear to adjust behavior in ways consistent with maximizing a “social reward rate.” The same paper reported an online experiment in a simplified social-media-like environment suggesting social rewards can causally influence behavior.
That’s a rigorous way to say something many people already suspect: feedback shapes posting, checking, and returning. Social reinforcement is not a metaphor. It is a measurable force.
Doomscrolling isn’t a contradiction
The answer is that reinforcement doesn’t require happiness. Relief from uncertainty can reinforce behavior. So can the sense of “staying on top of things,” even when the things are grim. So can social positioning—knowing what “everyone” is talking about so you aren’t left out.
Negative emotion can be energizing. Novelty can be rewarding even when the content is unsettling. A feed can hook you with agitation as easily as with delight.
Dopamine, without the myth
A defensible framing—supported by reward-learning research—is that dopamine is tightly tied to learning and motivation, including reward prediction error: the difference between what you expected and what you got. A review summarized on PubMed describes a classic pattern: dopamine neurons respond strongly to unpredicted rewards, respond less when rewards are fully predicted, and can show a drop in activity when a predicted reward is omitted.
That logic matters for feeds. Prediction error thrives under uncertainty. When you can’t predict what the next swipe will bring, the system that learns from surprises stays engaged.
Avoid the cartoon version—“dopamine spikes every swipe”—because the evidence doesn’t support that level of certainty. The more accurate claim is structural: environments that keep outcomes uncertain can keep reward-learning signals in play, and intermittent reinforcement produces persistent behavior.
The feed as a prediction machine
- Maybe there’s a message that matters.
- Maybe there’s a post that confirms your views.
- Maybe there’s something you didn’t know you needed.
When rewards become predictable, motivation often drops. When rewards are intermittent, the brain keeps trying to close the gap between expectation and outcome. The result is not constant pleasure. It is constant seeking.
The most powerful hook isn’t pleasure. It’s prediction.
— — TheMurrow Editorial
Product design: how platforms amplify variable rewards
The modern feed stacks several mechanics that reduce friction and remove the cues people use to stop. Each element is defensible on its own as “improving engagement” or “making content easier to find.” Together, they turn intermittent reinforcement into an always-available loop.
Infinite scroll: the missing stopping point
Endings matter. Pages, chapters, and “you’re all caught up” screens are quiet forms of self-control. They give the user a chance to leave without feeling like they’re interrupting something. Infinite scroll removes that pause.
The mechanic is often linked to designer Aza Raskin, who is widely credited with inventing or popularizing infinite scroll around 2006. Wired has described the origin story and Raskin’s later public regret, framed as an example of technology’s unintended consequences.
Algorithmic recommendation: personalized uncertainty
The next item is selected to be relevant, novel, or emotionally salient—based on your behavior and on the platform’s objectives. That makes the feed hard to predict. Not just in general, but specifically for you.
Unpredictability is not an accident here. It is the product. The algorithm becomes a dealer shuffling a deck that seems to know which cards you tend to keep.
Notifications: rewards that come looking for you
- Someone liked your post.
- Someone replied.
- A “memory” resurfaced.
- Breaking news arrived.
Notifications function as cues—signals that reinforcement might be available. Even if the payoff is small, the cue can trigger checking, and checking reopens the scroll.
Real-life metrics: why “checks” can matter more than “time”
Some reporting has highlighted research suggesting that frequency of checking may better capture problematic use than total time. A Washington Post interactive on phone-checking summarized findings indicating that around 110 checks per day may signal elevated risk.
That number shouldn’t be read as a moral verdict. It’s a clue about how reinforcement loops work. Variable rewards train short, repeated behaviors—tiny bets placed many times. A person can rack up a high number of pickups without ever having a single long session.
The hidden cost: attention as interruption
The real cost is often lived as:
- fractured focus
- reduced tolerance for boredom
- a persistent sense of being “on call”
- difficulty returning to deeper work after a glance
Many people describe the pattern with a familiar sentence: “I didn’t mean to.” That’s not mere denial. It’s a common feature of cue-driven behavior: the action begins before the intention is fully formed.
Are feeds “addictive”? A fairer way to argue about it
The clarifying part is structural. Variable rewards, unpredictable payoffs, and frictionless interfaces can produce persistent, hard-to-extinguish habits. That’s not moral panic; it’s established behavioral science.
The confusing part is clinical. “Addiction” carries medical meanings and implies diagnostic thresholds that the word often outruns in casual conversation. Many heavy users are not clinically addicted. Many are responding normally to systems designed to capture attention.
A better public debate asks two questions:
1. How intentionally are platforms building intermittent reinforcement loops?
2. What responsibilities follow from building them at scale?
The Nature Communications work strengthens the first question by showing behavior that fits reinforcement-learning principles at population scale—over one million posts, 4,000+ individuals—and experimental evidence that social rewards can cause changes in behavior in a simplified platform-like setting.
Still, multiple perspectives matter. Platforms can argue—sometimes legitimately—that:
- recommendations help people find relevant content
- notifications can be useful and user-controlled
- engagement reflects user choice, not coercion
Both realities can be true at once: users have agency, and design shapes behavior.
A practical standard: intent and control
Intermittent reinforcement reduces predictability. Infinite scroll reduces control. The result can feel like choice slipping into compulsion—not because users are weak, but because the environment is optimized for repetition.
Key Insight
Practical takeaways: how to disrupt the variable reward loop
Reintroduce “endings”
- Use built-in tools that create stopping points (where available).
- Prefer formats with natural endpoints (articles, newsletters, podcasts) when you want to relax without spiraling.
Reduce uncertainty in the environment
- Turn off nonessential notifications, especially those that advertise social rewards.
- Move tempting apps off your home screen to reduce cue-based pickups.
- Decide what you’re checking for before you open the app: message, event detail, specific news item.
Watch the metric that predicts the loop: checks
Try a one-week experiment: aim to reduce checks by 20–30%, without obsessing over total minutes. Fewer openings often create the biggest subjective relief.
One-week experiment to reduce checking
- 1.Aim to reduce checks by 20–30%.
- 2.Avoid obsessing over total minutes.
- 3.Notice subjective relief from fewer openings.
Replace the reward, don’t just remove it
- novelty
- social connection
- emotional stimulation
If you remove the feed without replacing the need, the loop returns. Replace it with a planned alternative: a group chat, a saved reading list, a hobby with visible progress. The brain still wants reward; give it one you chose.
Editor’s Note
Conclusion: the thumb, the maybe, and the future of attention
The uncomfortable truth is that a feed doesn’t have to make you happy to keep you there. It only has to make you curious, socially alert, or slightly unsure of what you might miss.
Once you see the mechanism, you gain a kind of leverage. You can stop treating your attention as a character flaw and start treating it as a resource—one that responds predictably to predictable pressures.
The question for the next decade isn’t whether people will keep scrolling. People will. The real question is whether we will demand products that respect the human learning system rather than quietly exploiting it.
Frequently Asked Questions
What are variable rewards, in plain English?
Variable rewards are payoffs that arrive unpredictably. You do the same action—scroll, refresh, check—and sometimes you get something rewarding, sometimes you don’t. Behavioral psychology shows that this kind of intermittent reinforcement can produce persistent habits because the next reward always feels possible, even after a long dry stretch.
Are social media feeds the same as slot machines?
Not literally. Slot machines often operate on a more formal variable-ratio schedule. Feeds mix many signals—social, informational, emotional—and use recommendations that make rewards unpredictable in a personalized way. The shared feature is uncertainty: the next action might pay off, which keeps the behavior going.
Is dopamine the reason I can’t stop scrolling?
Dopamine is part of the story, but not in the “pleasure chemical” way people often claim. Research emphasizes dopamine’s role in learning and motivation, especially reward prediction error—responding to surprises and mismatches between expectation and outcome. Uncertainty can keep that learning system engaged.
Why do I keep doomscrolling if it makes me feel worse?
Reinforcement doesn’t require joy. A feed can reward you with relief from uncertainty (“now I know”), a sense of social awareness, or emotional activation like outrage. Novelty itself can be rewarding. Those payoffs can reinforce checking and scrolling even when the overall experience feels negative.
What design feature makes scrolling hardest to stop?
Infinite scroll is a major culprit because it removes natural stopping points. Instead of reaching the end of a page or a clear boundary, content keeps loading. Wired has reported that designer Aza Raskin, linked to popularizing infinite scroll around 2006, later expressed regret about its compulsive effects.
Is “time spent” the best way to measure problematic use?
Not always. Some evidence highlighted in reporting suggests frequency of checking may be a better signal than total minutes. The Washington Post summarized findings suggesting around 110 checks per day may indicate higher risk. High check frequency reflects the reinforcement loop: many small bets rather than one long session.















