| Term | Also called | Meaning |
|---|---|---|
| Outcome | Dependent variable, Y | What you are trying to explain |
| Predictor | Independent variable, X, covariate | What you use to explain it |
| Coefficient | Slope, β | Effect of a predictor on the outcome |
| Control | Adjust for, condition on | Hold a variable constant |
| Residual | Error, e | What the model fails to predict |
| Symbol | Name | What it is |
|---|---|---|
| Y | Outcome | What you predict (e.g. test score) |
| β0 | Intercept | Predicted Y when every X = 0 |
| β1…βk | Slope coefficients | Partial effect of each predictor |
| X1…Xk | Predictors | Your covariates |
| e | Error / residual | Everything the model misses |
| Unstandardised (b) | Standardised (β) | |
|---|---|---|
| Units | Original (rupees, years) | Standard deviations |
| Reads as | Effect of 1 real unit | Effect of a 1-SD change |
| Comparable across Xs? | No — different scales | Yes — common scale |
| Best for | Real-world meaning | Ranking relative importance |
| Assumption | Plain meaning | Check with |
|---|---|---|
| Linearity | The true relationship is a straight line | Residual vs fitted plot |
| Independence | Observations don't lean on each other | Study design; clustering |
| Homoscedasticity | Error spread is constant across X | Residual vs fitted plot |
| Normal residuals | Errors are roughly bell-shaped | Q–Q plot, histogram |
| Remedy | How | When |
|---|---|---|
| Drop one | Remove a redundant predictor | Two variables measure the same thing |
| Combine | Build one index (e.g. PCA) | Several proxies for one concept |
| Centre / rescale | Subtract the mean before squaring/interacting | Collinearity from interaction terms |
| Get more data | Larger / more varied sample | Overlap is mild, not structural |
| Accept it | Keep the model, widen the CIs | You only care about prediction |
| Model | Coefficient β reads as… |
|---|---|
| Y on X (level–level) | β-unit change in Y per 1-unit X |
| Y on log X (level–log) | β/100 change in Y per 1% rise in X |
| log Y on X (log–level) | ~100·β % change in Y per 1-unit X |
| log Y on log X (log–log) | β % change in Y per 1% change in X (elasticity) |
| Quantity | Definition | Range |
|---|---|---|
| Probability p | Chance of 'yes' | 0 to 1 |
| Odds | p ÷ (1 − p) | 0 to ∞ |
| Log-odds (logit) | ln(odds) | −∞ to +∞ |
| PCA | Factor analysis | |
|---|---|---|
| Goal | Summarise / compress variance | Find latent underlying factors |
| Direction | Variables → components | Factors → cause the variables |
| Model of error | None — pure re-expression | Separates shared vs unique variance |
| Typical use | Indices (wealth index) | Psychometrics, attitude scales |
| Pitfall | What goes wrong | Guard |
|---|---|---|
| Overfitting | Fits noise, fails out of sample | Simplicity, adjusted R², hold-out |
| Omitted-variable bias | Confounder contaminates coefficients | Include real confounders |
| Collinearity confusion | Drop a confounder to 'fix' VIF | Let the question decide |
| Controlling a mediator | Blocks the causal pathway | Don't control intermediates |
| Extrapolation | Predicts beyond the data | Stay within range |
| p-hacking | Chance result sold as finding | Pre-specify, replicate |
| Predictor | Coefficient | 95% CI | p |
|---|---|---|---|
| Income (per ₹1,000) | 0.03 | 0.01 – 0.05 | <0.01 |
| Mother's schooling (yr) | 0.06 | 0.02 – 0.10 | <0.01 |
| Improved toilet (vs none) | 0.18 | 0.07 – 0.29 | <0.01 |
| Intercept | −1.42 | — | — |
| Tool | Strengths | Note |
|---|---|---|
| R | Stats-first, superb diagnostics & graphics | Free; lm(), glm(), broom |
| Python (statsmodels) | Cleaning + modelling in one place | Free; pandas, scikit-learn |
| Stata | Survey data, clustered SEs, ubiquitous in econ | Paid; svy: prefix |
| SPSS | Menu-driven, common in academia | Paid; gentle on-ramp |