| Form | Coefficient reads as | Common use |
|---|---|---|
| Y on X (levels) | ΔY in Y-units per 1-unit ΔX | Most variables |
| log Y on X | approx. % change in Y per 1-unit ΔX | Wages, income |
| log Y on log X | elasticity: % ΔY per 1% ΔX | Demand, output |
| Y on a dummy (0/1) | gap in mean Y between the two groups | Treated vs control |
| Z → X | Z → Y | Bias on β |
|---|---|---|
| + | + | Upward (too big) |
| − | − | Upward (too big) |
| + | − | Downward (too small) |
| − | + | Downward (too small) |
| Threat | What happens | Fix / response |
|---|---|---|
| Attrition | Treated & control drop out differently | Track everyone; bound effects |
| Spillovers | Control units affected by treatment | Randomise at cluster level |
| Non-compliance | Assigned but don't take treatment | Analyse by assignment (ITT) |
| Hawthorne effects | Being watched changes behaviour | Blinding where possible |
| Instrument (Z) | Treatment (X) | Exclusion argument |
|---|---|---|
| Distance to a school/college | Years of schooling | Distance affects wages only via schooling |
| Quarter of birth | Years of schooling | Birth-month is arbitrary, tied to school-start laws |
| Sex composition of first 2 kids | Having a 3rd child | Sex mix is random, shifts fertility |
| Threat | What it does | Watch for |
|---|---|---|
| Diverging trends | Groups were drifting apart anyway | Non-parallel pre-trends |
| Other shocks | A second event hits only one group | Concurrent policies |
| Composition change | Who is in each group shifts over time | Migration, attrition |
| Anticipation | Behaviour changes before the policy | Pre-period jumps |
| Fixed effects | Random effects | |
|---|---|---|
| Assumes | Unit term may correlate with X | Unit term uncorrelated with X |
| Uses | Within-unit variation only | Within + between variation |
| Robust to | Time-invariant confounding | More efficient if assumption holds |
| Safer when | You fear omitted unit traits | Strong, often unrealistic |
| Tool | Good for | Note |
|---|---|---|
| Stata | Applied micro-econometrics, panel, IV, RD | Industry standard; paid |
| R | Free, flexible, reproducible analysis & graphics | Rich causal-inference packages |
| Python (statsmodels, linearmodels) | Automation, large data, ML | Free, general-purpose |
| Excel / Sheets | Quick description, not inference | Fine to start; outgrow it |
| If you can… | Use | Key assumption |
|---|---|---|
| Randomise treatment | RCT | Successful randomisation |
| Find an as-good-as-random nudge | IV | Relevance + exclusion |
| Compare a treated & untreated group over time | DiD | Parallel trends |
| Exploit a cutoff rule | RD | No manipulation at cutoff |
| Follow units over time | Panel fixed effects | Confounders time-invariant |