English Português

The Incrementality Measurement Cheat Sheet

📈 THE INCREMENTALITY MEASUREMENT FRAMEWORK

Measure what changed because of you — not what got credited to you
FROM “CLICK HERE” BY ALEX SCHULTZ
🧭

North Star Setup

🎯 Define One Outcome

Pick one business-linked metric for decisions (e.g., qualified trials, retained revenue), not a vanity proxy.

One metric beats five conflicting ones

⏱️ Set Read Windows

Define exactly when outcomes are counted so tests are comparable across channels and cycles.

🧮 Make It Decision-Usable

Your metric must tell you whether to scale, hold, or cut spend this week.

💼 Finance-Ready

If finance wouldn’t care when it moves, it isn’t your North Star metric.

🪜

Incrementality Ladder

LevelUse whenStrengthWeakness
1. AttributionNeed rapid directional steeringFast and cheapWeakest causal truth
2. Pre/PostControls limited, medium-stakes decisionQuick calibrationHigh confounding risk
3. Matched Markets/CohortsNeed better quasi-experimental confidenceStronger comparabilityMore setup and monitoring
4. Randomized HoldoutHigh-stakes budget or strategy callsBest causal evidenceHardest operationally
🧪

How to Run Tests

  1. Pre-register everything — hypothesis, KPI, duration, stop rules, and decision thresholds.
  2. Pick highest feasible rigor — don’t pretend attribution is experimentation.
  3. Protect test integrity — no mid-test KPI changes, no peeking and early stopping.
  4. Read business + statistical significance — tiny but significant effects usually don’t move the company.
  5. Reallocate by marginal lift — keep coverage where needed, cut weak marginal spend first.
📊

What Matters vs What Doesn’t

✅ This Matters

  • Sustained lift over pre-registered windows
  • Result survives replication across cohorts/regions
  • Magnitude changes budget decisions
  • Findings feed MMM/planning assumptions

🚫 This Doesn’t

  • Microscope-level gains with heavy interpretation
  • Winner-only reporting after many tests
  • Attribution-only proof for budget defense
  • One-day significance spikes
“If you need a data scientist and a microscope to prove impact, you didn’t move the business.”
⚠️

Failure Modes to Avoid

🧷 P-Hacking

Stopping tests once significance appears or changing metrics midstream inflates false wins.

🔎 Last-Click Overstatement

High-intent channels (e.g., branded search/app store search) often absorb credit for demand created elsewhere.

🪫 Underpowered Tests

Small samples create noise disguised as confidence.

🧠 No Learning Loop

If insights are not documented and reused, each quarter resets to zero learning.

🧭 Operating Rule

Use attribution for steering, incrementality for truth, and MMM for scale allocation. One system, three roles.

Scroll to Top