Course lexicon

Causal inference, in plain language

65 terms from the course, defined for development practitioners who design, read and commission impact evidence. Search, or filter by part of the toolkit.

Foundations

Potential outcomes

The pair of outcomes a unit would show with and without treatment, Y(1) and Y(0). Causal effects are contrasts between them (Splawa-Neyman 1923; Rubin 1974).

Foundations

Counterfactual

The outcome that did not happen: how a treated unit would have fared untreated, or vice versa. The thing every causal claim must reconstruct.

Foundations

Fundamental problem of causal inference

For any unit you observe only one potential outcome, never both, so an individual effect can never be measured directly (Holland 1986). A missing-data problem, not a sample-size one.

Foundations

Treatment effect

The difference a treatment makes to an outcome: τ = Y(1) − Y(0) for a unit, or an average over a group.

Foundations

ATE

Average treatment effect: the mean of Y(1) − Y(0) over the whole population — the effect of treating everyone versus no one.

Foundations

ATT

Average treatment effect on the treated: the mean effect among those who actually received treatment. Usually what “did the programme work?” means.

Foundations

ATU

Average treatment effect on the untreated: the effect treatment would have had on those who did not get it. Relevant to expansion decisions.

Foundations

Estimand

The quantity you are trying to estimate (e.g. the ATT), named before any method or data. Doing so first is the discipline of design-based inference.

Foundations

SUTVA

Stable Unit Treatment Value Assumption: one unit's outcome is unaffected by others' treatment (no interference) and treatment means one well-defined thing (no hidden versions) (Rubin 1980).

Foundations

Switching equation

Y = D·Y(1) + (1−D)·Y(0): the observed outcome is whichever potential outcome the treatment switch selects; the other half is the counterfactual.

Foundations

Selection bias

The gap between how the treated would have fared untreated and how the untreated actually fared. Why a naive treated-minus-untreated comparison is not the effect.

Identification

Whether an effect could be recovered with infinite data, given the assumptions. A question about design, prior to estimation.

Identification

Unconfoundedness

Also ignorability: treatment is as good as randomly assigned once you condition on observed covariates, {Y(1),Y(0)} ⊥ D | X. The key assumption behind matching and regression.

Identification

Overlap / common support

For every covariate value both treated and untreated units exist, so like can be compared with like. Without it, effects rest on extrapolation.

Identification

Exogeneity

A variable is uncorrelated with the error term, so its variation is unrelated to unobserved drivers of the outcome.

Identification

Exclusion restriction

An instrument affects the outcome only through the treatment, never directly. Untestable, and the heart of every IV argument.

Identification

Confounding

A common cause of both treatment and outcome that, left unadjusted, biases the comparison.

Identification

Identifying assumption

The specific condition a design needs to be true for its comparison to recover a causal effect — parallel trends for DiD, continuity for RDD, and so on.

Identification

Conditional independence (CIA)

Same as unconfoundedness: once X is held fixed, treatment is independent of the potential outcomes.

Identification

Ceteris paribus

“Other things equal” — the ideal a causal comparison approximates by holding confounders fixed.

Causal graphs

DAG

Directed acyclic graph: arrows encode assumed causal directions among variables, with no cycles. A tool for reasoning about what to control for (Pearl 2009).

Causal graphs

Confounder

A variable that causes both treatment and outcome; a backdoor path you must block by conditioning.

Causal graphs

Collider

A variable caused by two others (X → C ← Y). Conditioning on it opens a spurious path and creates bias.

Causal graphs

Mediator

A variable on the causal path from treatment to outcome. Controlling for it removes part of the very effect you want to measure.

Causal graphs

Backdoor path

A non-causal path from treatment to outcome through a common cause. Blocking all of them identifies the effect.

Causal graphs

Backdoor criterion

A graphical rule: condition on a set that blocks every backdoor path without opening a collider, and the effect is identified.

Causal graphs

Bad control

A variable that is a mediator or collider; controlling for it introduces bias rather than removing it (Angrist & Pischke, ch. 3).

Causal graphs

d-separation

The graph rule that determines whether two variables are independent given a conditioning set.

Designs & estimators

Randomised controlled trial (RCT)

Treatment assigned by chance, so treated and control groups are comparable in expectation on everything, observed and not. The benchmark design.

Designs & estimators

Randomisation

Assigning treatment by a chance device. It makes potential outcomes independent of treatment, neutralising selection bias by design.

Designs & estimators

Block / stratified randomisation

Randomising within groups to guarantee balance on key covariates and improve precision.

Designs & estimators

Matching

Estimating effects by pairing treated units with untreated units of similar covariates, then comparing outcomes.

Designs & estimators

Propensity score

The probability of treatment given covariates, P(D=1|X). Matching or weighting on it removes confounding under unconfoundedness (Rosenbaum & Rubin 1983).

Designs & estimators

Inverse-probability weighting (IPW)

Reweighting units by the inverse of their propensity score to build a synthetic sample in which treatment is unconfounded.

Designs & estimators

Regression adjustment

Regressing the outcome on treatment and covariates to net out observed confounders. Identifies the effect only under unconfoundedness.

Designs & estimators

Instrumental variables (IV)

Using a variable that shifts treatment but affects the outcome only through it, to recover effects when treatment is endogenous.

Designs & estimators

Two-stage least squares (2SLS)

The standard IV estimator: predict treatment from the instrument, then regress the outcome on the predicted treatment.

Designs & estimators

LATE

Local average treatment effect: the effect IV recovers, for compliers only — those whose treatment is moved by the instrument (Imbens & Angrist 1994).

Designs & estimators

Compliers

Units that take treatment if and only if the instrument assigns them to it. IV speaks only about them, not always-takers or never-takers.

Designs & estimators

Regression discontinuity (RDD)

Using a cutoff in a running variable that assigns treatment, comparing units just above and below as if randomised (Thistlethwaite & Campbell 1960).

Designs & estimators

Sharp vs fuzzy RDD

Sharp: treatment switches deterministically at the cutoff. Fuzzy: the cutoff only shifts the probability of treatment, so it is used as an instrument.

Designs & estimators

Running variable & bandwidth

The variable that determines treatment at a threshold, and the window around the cutoff used for the comparison.

Designs & estimators

Difference-in-differences (DiD)

Comparing the before-after change in a treated group to the change in a control group, differencing out fixed gaps and common trends.

Designs & estimators

Parallel trends

DiD's identifying assumption: absent treatment, treated and control groups would have moved in parallel. Supported, never proved, by pre-trends.

Designs & estimators

Two-way fixed effects

A regression with unit and time fixed effects, the common DiD estimator; biased under staggered timing with heterogeneous effects (Goodman-Bacon 2021).

Designs & estimators

Event study

Plotting effects by time relative to treatment, to inspect pre-trends and the dynamics around the intervention.

Designs & estimators

Synthetic control

Building a weighted combination of untreated units that tracks the treated unit before treatment, as a data-driven counterfactual (Abadie et al. 2010).

Designs & estimators

Double / debiased ML (DML)

Using machine learning for the nuisance models with orthogonalisation and sample splitting to estimate effects (Chernozhukov et al. 2018).

Threats & diagnostics

Attrition

Units dropping out before outcomes are measured. If it differs by treatment, it reintroduces selection bias even in an RCT.

Threats & diagnostics

Spillover / interference

Treatment affecting untreated units through markets or networks, violating SUTVA and contaminating the control group.

Threats & diagnostics

Non-compliance

Assigned units not taking treatment, or unassigned units taking it, breaking the link between assignment and exposure.

Threats & diagnostics

Intention-to-treat (ITT)

The effect of being assigned to treatment, regardless of take-up. Robust to non-compliance and the policy-relevant number for an offer.

Threats & diagnostics

Treatment-on-the-treated (ToT)

The effect among those who actually complied; recovered from the ITT by scaling for take-up under IV assumptions.

Threats & diagnostics

Weak instruments

Instruments only loosely related to treatment, which inflate variance and bias IV toward the naive estimate. Diagnosed via the first-stage F.

Threats & diagnostics

McCrary / manipulation test

Checking for bunching in the running variable at an RDD cutoff, which would signal units sorting across it and break the design.

Threats & diagnostics

Placebo test

Applying a design where no effect should exist (a fake cutoff, an untreated period) to check it returns a null.

Threats & diagnostics

Covariate balance

Whether treated and control groups have similar covariate distributions. A balance table is the first check on any RCT or matching design.

Practice & interpretation

Internal validity

Whether the estimate is unbiased for the population studied. The first thing a design must earn.

Practice & interpretation

External validity

Whether a result transports to other populations, places or scales. High internal validity does not guarantee it.

Practice & interpretation

Minimum detectable effect (MDE)

The smallest true effect a study has the power to detect, set before data collection in a power calculation.

Practice & interpretation

Pre-analysis plan

A registered specification of hypotheses and analyses filed before seeing outcomes, to limit specification search.

Practice & interpretation

Heterogeneous effects (CATE)

Effects that vary across subgroups; the conditional average treatment effect describes them, increasingly estimated with causal ML.

Practice & interpretation

Partial identification / bounds

When a point estimate needs assumptions you cannot defend, reporting the range the data support instead (Manski).

Practice & interpretation

Specification search

Trying many models and reporting the most favourable, which invalidates inference. Pre-registration and robustness checks guard against it.

Practice & interpretation

Clustered standard errors

Standard errors that allow correlation within groups (villages, schools), required when treatment is assigned at the cluster level.

No terms match. Try a broader search or a different filter.