fullscreen
ImpactMojoData Visualization 101www.impactmojo.in
ImpactMojo 101 Series · Free Forever
Data
Visualization
101
How to turn numbers into honest, clear pictures — a foundational course for development practitioners in South Asia, and the companion to ImpactMojo’s The Long View.
PracticalSouth Asia Focus100 SlidesFree Access
ImpactMojoData Visualization 101www.impactmojo.in
What We Cover
01
What a Chart Is For
Slides 3–9
02
Marks & Channels
Slides 10–18
03
Choosing a Chart
Slides 19–29
04
Honest Charts
Slides 30–40
05
Colour
Slides 41–49
06
Words on Charts
Slides 50–57
07
Tables & Numbers
Slides 58–64
08
Spread & Uncertainty
Slides 65–73
09
Maps
Slides 74–81
10
Audience & Access
Slides 82–89
11
Workflow & Tools
Slides 90–98
ImpactMojoData Visualization 101www.impactmojo.in
01
Section One
What a Chart Is For
ImpactMojoData Visualization 101www.impactmojo.in
A chart is an argument, not decoration
Every chart you make is making a point: this went up, this group is worse off, these two things move together. A good chart helps a reader see that point faster than a paragraph could. A bad one hides it, or worse, makes a point the data does not support.
If you cannot say in one sentence what your chart is for, the reader will not be able to either. Write that sentence first — it usually becomes your title.
ImpactMojoData Visualization 101www.impactmojo.in
The eye finds patterns the table hides
The classic demonstration is Anscombe’s quartet: four datasets with almost identical means, variances and correlation. In a table they look the same. Plotted, they are obviously different — one is a clean line, one is curved, one has a single outlier dragging the trend.
Summary statistics can agree while the data disagree. Always look at the shape before you trust the number.
The point
We are very good at spotting lines, clusters, gaps and outliers by eye. We are bad at reading those same things out of a grid of numbers. Visualisation borrows the strength of the visual system to do statistics.
ImpactMojoData Visualization 101www.impactmojo.in
When a picture beats a sentence — and when it doesn’t
Reach for a chart when…Reach for a number when…
You are comparing many values at onceThere is only one value that matters
The shape or trend is the messageThe precise figure is the message
You want the reader to exploreYou want the reader to remember one fact
Patterns, gaps and outliers carry meaningThe audience needs to quote the exact value
“63% of rural households have piped water” is one number — a sentence is fine. The change across 28 states over 20 years is a chart.
ImpactMojoData Visualization 101www.impactmojo.in
When not to make a chart
  • Two or three numbers. A small table or a sentence is clearer than a tiny bar chart.
  • No real variation. If every bar is the same height, the chart says nothing.
  • The data is too thin. A trend drawn from three data points invites over-reading.
  • You are decorating a slide. A chart with no question behind it is just visual noise.
A chart is a cost — it asks the reader to learn its axes and colours. Only spend that cost when the payoff (a pattern they could not otherwise see) is real.
ImpactMojoData Visualization 101www.impactmojo.in
Who is the chart for?
You
Exploring. Rough, fast, many charts. Ugly is fine — you are looking for what is there.
A team
Explaining. One clear point per chart, labelled, in a report or deck.
The public
Persuading. Self-contained, titled with the finding, works without you in the room.
The same data needs three different charts for these three readers. Most mistakes come from showing an exploration chart to the public.
ImpactMojoData Visualization 101www.impactmojo.in
What makes a chart good
  • Honest — the visual proportions match the numbers. This is non-negotiable.
  • Clear — a reader gets the main point in a few seconds, without a manual.
  • Sourced — the data, the date and the source are on the chart, not lost.
  • Focused — one chart makes one point; if it makes three, split it into three.
Notice what is not on this list: “beautiful”. Beauty helps a chart get read, but it never rescues a chart that is dishonest or unclear.
ImpactMojoData Visualization 101www.impactmojo.in
02
Section Two
Marks & Channels
ImpactMojoData Visualization 101www.impactmojo.in
Marks & channels
A mark is the thing you draw — a dot, a line, a bar, an area. A channel is the property of that mark you use to carry data — its position, length, angle, area, colour or shape. Every chart is just data mapped onto marks through channels.
Once you see charts this way, “what chart should I use?” becomes a sharper question: which channel will carry my most important variable?
ImpactMojoData Visualization 101www.impactmojo.in
Some channels are read more accurately than others
From the work of Cleveland & McGill (1984), channels ranked by how accurately people read quantities from them — most accurate at the top:
  • 1. Position on a common scale (scatter, dot plot)
  • 2. Position on unaligned scales (small multiples)
  • 3. Length (bars)
  • 4. Angle / slope (pie slices, line steepness)
  • 5. Area (bubbles, treemaps)
  • 6. Colour intensity / shade (heatmaps, choropleths)
Put your most important variable on the most accurate channel you can afford. That single habit fixes most weak charts.
ImpactMojoData Visualization 101www.impactmojo.in
Why position wins
When two marks sit on the same axis, the reader compares them directly — no mental arithmetic, no guessing at areas. That is why a dot plot or scatter lets people read values to within a few percent, while a pie chart leaves them guessing.
Practical rule
If precise comparison matters, get your values onto a shared horizontal or vertical scale. Bars and dot plots do this; pies, bubbles and 3D do not.
ImpactMojoData Visualization 101www.impactmojo.in
Length needs a zero baseline
A bar encodes a value with its length. A reader judges a bar twice as long as carrying twice the value — so the bar must start at zero. Cut the baseline and a 2% difference can look like a doubling. This is the single most common way charts mislead.
Bars start at zero. Always. If zero is irrelevant to your story (say, a stock index that never goes near it), use a line, not a bar — lines encode change, not magnitude, and may omit zero.
ImpactMojoData Visualization 101www.impactmojo.in
Area is honest but hard to read
We badly underestimate area. Double a circle’s radius and its area quadruples — a common bubble-chart error makes big values look four times too big. Even done correctly, readers cannot compare areas precisely.
Use area when…
A rough sense of magnitude is enough (a treemap of budget shares), or area is a secondary channel (bubble size on a scatter). Never when readers must rank close values. And always scale by area, never by radius.
ImpactMojoData Visualization 101www.impactmojo.in
Colour: powerful, easily overused
Colour is great for showing categories (which line is which) and rough intensity (a heatmap). It is poor for precise quantities — nobody can read “47” off a shade of blue. Colour gets its own section later; for now, two rules:
  • Use colour to group or highlight, not to carry numbers people must read off.
  • Keep the number of colours small — the eye loses track past about seven.
ImpactMojoData Visualization 101www.impactmojo.in
Encode the important thing strongly
Suppose you are comparing female literacy across 12 states. The states are the categories; literacy is the number that matters. Put literacy on position or length (a sorted bar or dot plot), and use colour only to flag the one state you want to talk about.
A frequent mistake is to spend the strongest channel (position) on something unimportant (alphabetical order) and push the real variable onto a weak one (colour shade). Sort by the value instead.
ImpactMojoData Visualization 101www.impactmojo.in
Same data, three encodings
EncodingChannelHow well it reads
Pie chart of 6 sharesAngle / areaHard — can’t rank similar slices
Stacked bar of 6 sharesLength (unaligned)Better for the bottom segment only
Sorted bar / dot plotPosition + lengthBest — every value is comparable
Three ways to show the same six numbers. The data did not change — only how accurately the reader can recover it. That choice is yours to make on purpose.
ImpactMojoData Visualization 101www.impactmojo.in
03
Section Three
Choosing a Chart
ImpactMojoData Visualization 101www.impactmojo.in
Pick the question before the chart
Do not start from “I want a chart.” Start from the question your reader has. Almost every question is one of five kinds, and each kind has a natural chart family.
Q
What’s the question?
TYPE
Comparison? Trend? Part? Spread? Link?
CHART
The family that fits
ImpactMojoData Visualization 101www.impactmojo.in
“Who is bigger / more / worse off?” → bars
For comparing values across categories, a bar chart is the workhorse. Sort the bars by value (not alphabetically) so the ranking is instant. Horizontal bars when labels are long.
A dot plot does the same job with less ink and handles two series (e.g. men vs women) cleanly as a dumbbell.
Watch out
Bars must start at zero. Too many bars (40+) become a wall — switch to a dot plot or a sorted table.
ImpactMojoData Visualization 101www.impactmojo.in
“Is it rising or falling?” → lines
Time goes on the horizontal axis, left to right. A line connects points to show the trend; the slope is the message. Use a line for continuous time, a bar only when the periods are few and discrete.
  • One to four lines: label each line directly at its end — skip the legend.
  • Many series: use small multiples (one mini-chart each) instead of a tangle.
  • Lines may omit zero — they encode change, not magnitude.
ImpactMojoData Visualization 101www.impactmojo.in
“What share is each part?” → handle with care
OptionVerdict
Pie chartFine for 2–3 slices that are very different; poor beyond that
100% stacked barGood for comparing the same parts across a few groups
Sorted bar of the sharesUsually clearest — reader can rank every part
TreemapMany nested parts where rough size is enough
The infamous pie of 12 near-equal slices tells the reader nothing. When in doubt, a sorted bar of the percentages beats a pie.
ImpactMojoData Visualization 101www.impactmojo.in
“How is it spread out?” → histogram, box, strip
When you care about the range and shape of values — not a single average — show the distribution. A histogram bins the values; a box plot summarises quartiles; a strip / beeswarm shows every point.
Why it matters
“Average income ₹15,000” can hide a few rich households and many poor ones. The distribution reveals the inequality the mean conceals.
ImpactMojoData Visualization 101www.impactmojo.in
“Do these two move together?” → scatter
A scatter plot puts one variable on each axis and one dot per observation. Clusters, lines and outliers jump out. Add a trend line only if it genuinely helps — and never let it imply causation (more on that soon).
  • Add a third variable with dot colour (category) or size (quantity).
  • A connected scatter traces two variables over time as a path.
ImpactMojoData Visualization 101www.impactmojo.in
Linear or log? The axis is a choice too
Most charts use a linear scale, where equal distances mean equal amounts. A log scale makes equal distances mean equal multiples (10, 100, 1,000) — useful when values span many orders of magnitude, or when the rate of growth is the story.
  • Linear: most data, most audiences — the safe default.
  • Log: incomes from poor to billionaire, exponential growth, anything spanning 100× or more.
  • Always label a log axis clearly — many readers misread it as linear and underestimate the spread.
ImpactMojoData Visualization 101www.impactmojo.in
Two more families worth knowing
Ranking
When the order itself is the story (league tables, top-10 lists), use an ordered bar or dot plot. A slope chart shows how ranks change between two points in time.
Flow & networks
For quantities moving between categories — budget → sector, source → use — a Sankey shows volume as ribbon width. A chord diagram shows two-way flows (trade, migration between regions).
ImpactMojoData Visualization 101www.impactmojo.in
The chart chooser
Your questionFirst choiceAlso consider
Compare categoriesSorted barDot plot, dumbbell
Trend over timeLineArea, small multiples
Part of a wholeSorted bar of sharesStacked bar, treemap
DistributionHistogramBox, strip, beeswarm
RelationshipScatterBubble, connected scatter
Flow between groupsSankeyChord
ImpactMojoData Visualization 101www.impactmojo.in
From question to chart
Question: “Which of these countries has the highest under-five mortality?” That is a comparison — so a sorted bar, starting at zero, one country per bar, ordered worst to best.
Under-five mortality, deaths per 1,000 live births (latest)
Source: World Bank / UN IGME, indicator SH.DYN.MORT. Illustrative recent values.
ImpactMojoData Visualization 101www.impactmojo.in
04
Section Four
Honest Charts
ImpactMojoData Visualization 101www.impactmojo.in
A chart can lie while every number is true
You can plot accurate data and still mislead — through the axis, the scale, the time window or the framing. Because charts are read fast and trusted instinctively, a misleading chart does more damage than a misleading sentence. This section is the most important in the course.
The test: would a careful reader come away believing something the data does not actually support? If yes, the chart is dishonest — even if no single value is wrong.
ImpactMojoData Visualization 101www.impactmojo.in
The truncated axis
Misleading — axis starts at 90
Honest — axis starts at 0
Same three numbers (92, 94, 96%). On the left the change looks enormous; on the right, modest — which is the truth. Bars must start at zero.
ImpactMojoData Visualization 101www.impactmojo.in
When zero is not required
The zero rule is about bars, where length carries the value. Lines encode change, so they may start where the data lives — forcing a stock index or a temperature series to include zero can flatten a real, meaningful change.
  • Bar / area → must include zero.
  • Line / scatter → zero optional; choose a range that shows the real variation without exaggerating it.
  • When you omit zero on a line, make the axis range obvious so nobody is fooled.
ImpactMojoData Visualization 101www.impactmojo.in
The dual-axis trap
Two lines, two different y-axes, scaled so they appear to track each other — the reader infers a relationship that the analyst manufactured by choosing the scales. By sliding the axes you can make almost any two series look correlated.
Prefer two small charts side by side, or index both series to 100 at a common start year and plot them on one honest axis. Avoid the second y-axis unless the two units are genuinely linked.
ImpactMojoData Visualization 101www.impactmojo.in
The cherry-picked window
Start the time axis at an unusually low year and the trend looks like a boom; start at a peak and the same series looks like a collapse. The data is real; the window is the lie.
  • Show the longest honest period you have, not the slice that flatters your point.
  • If you must zoom in, say so, and show the full series somewhere nearby.
  • Beware comparisons to a single unusual base year (a drought, a pandemic, an election).
ImpactMojoData Visualization 101www.impactmojo.in
3D and perspective
3D bars and pies tilt the geometry so that nearer slices look bigger and the back of the chart shrinks. They add no information and distort the one channel that mattered. The same goes for drop shadows and “glossy” effects.
There is no honest reason to make a statistical chart 3D. Flat, plain and accurate beats impressive every time.
ImpactMojoData Visualization 101www.impactmojo.in
Sizing by radius, not area
When a value is shown as a circle or icon, the reader judges it by area. If you set the radius proportional to the value, a figure twice as large draws a circle four times as big. Pictographs (“one person = 1 million”) that scale a single stretched icon make the same error.
Scale circles by area (radius ∝ √value). Better still, for icon charts, repeat a fixed-size icon — a waffle or isotype grid — so each unit is equal.
ImpactMojoData Visualization 101www.impactmojo.in
Correlation drawn as causation
A scatter with a trend line, or two lines rising together, strongly suggests that one causes the other. The chart cannot show causation — only association. Ice-cream sales and drownings both rise in summer; neither causes the other.
  • Word the title as association (“moves with”), not cause (“drives”), unless you have evidence for cause.
  • Watch for a hidden third variable (here, heat) driving both.
ImpactMojoData Visualization 101www.impactmojo.in
Aggregation that hides the truth
Simpson’s paradox
A trend that holds in the overall data can reverse when you split it by group. A treatment can look worse on average yet be better for every subgroup, because the groups differ in size and baseline risk.
Always ask whether a headline average survives disaggregation by sex, caste, region or income. If the picture flips, the disaggregated chart is the honest one.
ImpactMojoData Visualization 101www.impactmojo.in
Before you publish: an integrity check
  • Do bars start at zero, and are areas scaled correctly?
  • Is the time window the full, fair period — not a flattering slice?
  • Does the title claim only what the data supports (association vs cause)?
  • Would the picture survive being split by the obvious subgroups?
  • Are the source, date and units on the chart?
If a figure is approximate or modelled, say so on the chart. Never present an estimate as if it were a measured fact.
ImpactMojoData Visualization 101www.impactmojo.in
05
Section Five
Colour
ImpactMojoData Visualization 101www.impactmojo.in
Colour does exactly three jobs
Qual.
Qualitative — distinct hues for unordered categories (regions, parties).
Seq.
Sequential — light-to-dark of one hue for low-to-high values.
Div.
Diverging — two hues from a meaningful middle (e.g. surplus vs deficit).
Most colour mistakes are using the wrong family — a rainbow (qualitative) for ordered data, or a sequential ramp for categories. Match the palette to the data type.
ImpactMojoData Visualization 101www.impactmojo.in
Light to dark for ordered values
For a quantity that runs low to high — poverty rate, temperature, density — use a single hue from pale to deep. Darker reads as “more” intuitively. Keep the steps evenly spaced in perceived lightness so equal jumps look equal.
Tested ramps (ColorBrewer’s Blues, Viridis) are designed to be perceptually even and colour-blind safe. Borrow them rather than inventing your own.
ImpactMojoData Visualization 101www.impactmojo.in
Two directions from a meaningful middle
When values spread both ways from a natural centre — above/below average, gain/loss, agree/disagree — a diverging scale puts a neutral colour at the midpoint and two contrasting hues at the ends. The midpoint must be the real zero or mean, not an arbitrary value.
Set the colour midpoint to where the meaning flips. Centring it on the data’s median instead can paint a struggling region as “average”.
ImpactMojoData Visualization 101www.impactmojo.in
Distinct hues — and not too many
For categories with no order, pick hues that are easy to tell apart and roughly equal in weight (so none shouts). Past about seven colours, readers lose track and the legend becomes a memory test.
  • Too many categories? Group the small ones into “other”, or use small multiples.
  • Give a fixed colour to a category that recurs across charts (e.g. always green for “rural”).
ImpactMojoData Visualization 101www.impactmojo.in
About 1 in 12 men can’t tell red from green
Roughly 8% of men and 0.5% of women have some colour-vision deficiency, most commonly red–green. A chart that relies on red-vs-green to make its point fails for millions of readers.
  • Use colour-blind-safe palettes (Viridis, Okabe–Ito, ColorBrewer “colorblind safe” sets).
  • Check your chart in a simulator before publishing.
  • Pair colour with a second cue — label, shape or pattern (next slide).
ImpactMojoData Visualization 101www.impactmojo.in
Retire the rainbow
The classic “jet” rainbow ramp (blue–green–yellow–red) looks lively but lies: it has bright bands that invent boundaries where the data is smooth, and dark ones that hide real differences. It is also poor for colour-blind readers.
Replace rainbow heatmaps and maps with a perceptually uniform ramp like Viridis. The pattern in the data will change — because the rainbow was distorting it.
ImpactMojoData Visualization 101www.impactmojo.in
Colour carries meaning — use it deliberately
  • Red reads as bad / hot / loss / stop; green as good / go. Do not colour a good outcome red by accident.
  • Convention matters: don’t recolour a party, a flag or a brand against what readers expect.
  • Meaning is cultural — white, red and saffron carry different associations across South Asia. Know your audience.
  • Reserve a bold colour for the one thing you want noticed; grey the rest.
ImpactMojoData Visualization 101www.impactmojo.in
Recolouring a noisy chart
BeforeAfter
12 bars, 12 different bright colours11 grey bars, 1 coloured — the one you discuss
Rainbow choroplethSingle-hue sequential ramp
Red = our product (the “good” one)Green for the favourable series
Legend with 12 swatchesDirect labels on the bars that matter
Less colour, more meaning. Colour should guide attention, not compete for it.
ImpactMojoData Visualization 101www.impactmojo.in
06
Section Six
Words on Charts
ImpactMojoData Visualization 101www.impactmojo.in
A chart without words is half a chart
Marks show the pattern; words tell the reader what it means and what to trust. Titles, labels, annotations, units and sources do as much work as the geometry. A beautiful chart with no words is a puzzle.
Budget your effort roughly half on the picture and half on the words around it. Most weak charts are under-written, not under-designed.
ImpactMojoData Visualization 101www.impactmojo.in
Make the title state the finding
Weak (topic)
“Under-five mortality by country, 2000–2022”
Strong (finding)
“Child deaths have more than halved in every country since 2000”
A topic title makes the reader do the work. A finding title hands them the point, then lets the chart prove it. Keep the descriptive version as a subtitle if you like.
ImpactMojoData Visualization 101www.impactmojo.in
Label directly; kill the legend
A separate legend forces the reader’s eye to bounce between the key and the chart, matching colours from memory. Where you can, write the label next to the thing it names — at the end of each line, beside each bar.
  • One to four series: direct labels almost always win.
  • Many series: a legend is unavoidable — order it to match the chart (top line = top legend entry).
  • Rotate axis labels as little as possible; switch to horizontal bars if names are long.
ImpactMojoData Visualization 101www.impactmojo.in
Point at the one thing that matters
An annotation — a short note with an arrow or a marked point — turns a chart from “here is some data” into “here is what happened.” Mark the spike, name the policy, flag the outlier. It is the cheapest way to make a chart memorable.
One or two annotations, not ten. Annotate the moment that carries your argument and leave the rest of the chart quiet.
ImpactMojoData Visualization 101www.impactmojo.in
Units, source, date — every time
  • Units on the axis: ₹, %, per 1,000, thousands — never make the reader guess.
  • Source line: who collected the data, which dataset, which year.
  • Date: when the data is from, and when the chart was made if it differs.
  • Notes: definitions, exclusions, “provisional” or “approximate” flags.
A sourced chart can travel on its own and be trusted. An unsourced one is just a claim.
ImpactMojoData Visualization 101www.impactmojo.in
Round to the precision that matters
“63.7%” implies you know the figure to a tenth of a percent; often you do not. Round to the precision your data and your reader actually need — usually whole numbers for a public audience.
  • Match precision to the margin of error: don’t quote decimals a survey can’t support.
  • Use thousands separators and consistent units across a chart.
  • Indian readers: lakh/crore for domestic figures, millions/billions for international comparison — pick one and stay consistent.
ImpactMojoData Visualization 101www.impactmojo.in
Annotating a line
A flat line of full-immunisation coverage suddenly climbs after 2014. The raw chart shows the rise; the annotated chart explains it:
1
Finding title: “Coverage rose after the 2014 mission”
2
Marker + note at the 2014 inflection
3
Source + “latest data 2021”
Same line, three small additions — now it argues a point instead of just plotting one.
ImpactMojoData Visualization 101www.impactmojo.in
07
Section Seven
Tables & Numbers
ImpactMojoData Visualization 101www.impactmojo.in
Sometimes the table is the answer
When readers need exact figures, when there are only a handful of numbers, or when they will look up specific rows, a well-set table beats a chart. Tufte’s rule of thumb: for a small dataset that will be read closely, a table often communicates better.
Don’t force a chart onto data that wants to be a table — and don’t bury 200 rows in a table that wants to be a chart.
ImpactMojoData Visualization 101www.impactmojo.in
A table is a designed object too
  • Order rows meaningfully — by value, not alphabetically, unless lookup is the point.
  • Light rules, not heavy grids — a line under the header and at the foot is usually enough.
  • Group and indent to show hierarchy instead of repeating labels.
  • One idea per column; put units in the header, not every cell.
ImpactMojoData Visualization 101www.impactmojo.in
Align numbers so the eye can compare
Do
Right-align numbers, fix the decimal places, use a mono/tabular figure so digits line up in columns. Then magnitudes are visible at a glance — longer number, bigger value.
Don’t
Centre numbers, mix “1,200” with “1200.0”, or vary decimal places down a column. The digits stop lining up and comparison breaks.
Left-align text, right-align numbers. This one habit makes any table more readable.
ImpactMojoData Visualization 101www.impactmojo.in
Sparklines: a chart the size of a word
A sparkline is a small, axis-free line drawn inline — enough to show a trend right next to the number it describes. Add a column of sparklines to a table and readers get the exact value and the shape of its history together.
Sparklines, in-cell bars and up/down arrows let a table carry pattern as well as precision — the best of both worlds for a dashboard or report.
ImpactMojoData Visualization 101www.impactmojo.in
The KPI card: one number, well dressed
Sometimes the most effective “visualisation” is a single large figure with a label and a tiny bit of context. Used on dashboards and report covers, these big-number cards make the headline impossible to miss.
63%
rural households with piped water
illustrative
−2.1pp
vs last year
28
states & UTs covered
ImpactMojoData Visualization 101www.impactmojo.in
A clean comparison table
Indicator20152021Change
Full immunisation (%)6276+14
Institutional births (%)7989+10
Stunting, under-5 (%)3836−2
Numbers right-aligned, units in the header, a change column to do the arithmetic for the reader. Source: NFHS-4 and NFHS-5, all-India (illustrative selection).
ImpactMojoData Visualization 101www.impactmojo.in
08
Section Eight
Spread & Uncertainty
ImpactMojoData Visualization 101www.impactmojo.in
The average hides the spread
Two districts can share an average income while one is broadly comfortable and the other has a few rich households among many poor ones. The mean is the same; the distribution is not. Development data is often skewed, so the average can mislead.
Same mean, different shape
Illustrative distributions with equal means.
ImpactMojoData Visualization 101www.impactmojo.in
Histograms, and the bin problem
A histogram groups values into bins and shows how many fall in each. It reveals shape — symmetric, skewed, bimodal. But the bin width changes the story: too wide hides structure, too narrow turns signal into noise.
Try a few bin widths before settling. If the shape changes wildly with small changes in bins, say so — your data may be thinner than it looks.
ImpactMojoData Visualization 101www.impactmojo.in
Box plots summarise — and conceal
A box plot compresses a distribution into median, quartiles and whiskers — great for comparing many groups at once. The cost: it hides the actual shape. Two very different distributions can produce identical boxes.
  • Use boxes to compare spread across many categories quickly.
  • When the shape matters (or n is small), overlay or switch to the points themselves.
ImpactMojoData Visualization 101www.impactmojo.in
Strip, jitter and beeswarm plots
When you have room and not too many observations, plot every point. A strip plot lines them up; jittering spreads overlaps apart; a beeswarm packs them into a shape that doubles as a density. Readers see the spread, the clusters and the outliers — nothing is hidden behind a summary.
For small datasets, showing the raw points is almost always more honest than summarising them.
ImpactMojoData Visualization 101www.impactmojo.in
Show what you don’t know
Most development figures are estimates from samples or models. A point drawn without its uncertainty looks more certain than it is. Error bars, confidence bands and ranges tell the reader how much to trust the dot.
  • Add 95% confidence intervals to survey estimates where you have them.
  • For two bars whose error bars overlap heavily, resist saying one is “higher”.
  • Label what the bar or band represents — SE, 95% CI, min–max are not the same.
ImpactMojoData Visualization 101www.impactmojo.in
Bigger samples, smaller error — with diminishing returns
Approximate margin of error vs. sample size (50/50 proportion)
±1.96·√(p(1−p)/n) at p=0.5; standard sampling formula.
Going from 400 to 1,000 respondents roughly halves the error; going from 2,400 to 4,000 barely moves it. This is why national surveys cluster around a few thousand.
ImpactMojoData Visualization 101www.impactmojo.in
Projections are fans, not lines
A forecast drawn as a single confident line invites the reader to believe a precision that does not exist. Show projections as a fan — a widening band — so the growing uncertainty is visible. Mark clearly where measured data ends and the projection begins.
A solid line into the future, with no band and no “projected” label, is one of the easiest ways to overstate what you know.
ImpactMojoData Visualization 101www.impactmojo.in
Showing an income distribution honestly
1
Report the median, not just the mean
2
Plot the histogram so the skew shows
3
Mark mean & median on it
When mean and median sit far apart on the chart, the reader sees the inequality directly — no statistics lecture required.
ImpactMojoData Visualization 101www.impactmojo.in
09
Section Nine
Maps
ImpactMojoData Visualization 101www.impactmojo.in
Maps are seductive — and often the wrong choice
A map feels authoritative and is genuinely useful when location is the story. But a lot of data plotted on a map would be clearer as a bar chart — the geography adds beauty while making values harder to compare.
Ask: does the spatial pattern matter, or am I mapping this because maps look impressive? If the latter, a sorted bar will serve the reader better.
ImpactMojoData Visualization 101www.impactmojo.in
The shaded-region map, and its trap
A choropleth shades each region by a value. Its biggest trap: shading by a count rather than a rate. A map of “total cases” mostly shows where the people are — big, populous states light up regardless of how bad things actually are there.
Map rates, shares and per-capita figures — not raw totals — unless “how many” really is the question.
ImpactMojoData Visualization 101www.impactmojo.in
Per capita, per area, per household
Normalising turns a map of population into a map of the thing you care about. Deaths → deaths per 100,000. Spending → spending per person. Schools → schools per 1,000 children. The denominator is a design decision — choose the one that matches the question.
State the denominator on the chart. “Per 100,000” and “per 100,000 women aged 15–49” can tell very different stories.
ImpactMojoData Visualization 101www.impactmojo.in
Big empty regions shout; dense small ones whisper
On a normal map, area equals visual weight. A huge, sparsely populated state (Rajasthan, Ladakh) dominates the eye, while a tiny, densely populated one (Delhi, Kerala’s coast) all but disappears — even when far more people live there. The map over-weights land and under-weights people.
If your subject is people, a land-area map systematically mis-weights them. That is the problem cartograms solve.
ImpactMojoData Visualization 101www.impactmojo.in
Resize geography to match the data
A cartogram distorts regions so their size reflects a value (population, electorate) instead of land area. A hex cartogram gives every region an equal tile — useful when you want each state read equally, regardless of size.
  • Population cartogram: each state’s area ∝ its population — honest weighting for people-based data.
  • Hex/tile grid: every state an equal cell — clear, if geographically loose.
  • Trade-off: cartograms are less recognisable, so label generously.
ImpactMojoData Visualization 101www.impactmojo.in
How you cut the colour scale changes the map
A choropleth groups values into colour bins, and the cut-points are a choice. Equal-interval bins split the range evenly; quantile bins put equal numbers of regions in each colour. The same data can look alarming or calm depending on the scheme.
Don’t tune the bins until the map says what you want. Pick a defensible scheme, state it, and keep it consistent across related maps.
ImpactMojoData Visualization 101www.impactmojo.in
A map of Indian states, done honestly
1
Use a rate (female literacy %), not a count
2
Hex tiles so small states read equally
3
Sequential ramp + stated bins
See this exact chart built from Census data on The Long View — ImpactMojo’s data-visualisation showcase.
ImpactMojoData Visualization 101www.impactmojo.in
10
Section Ten
Audience & Access
ImpactMojoData Visualization 101www.impactmojo.in
Who is this for, and where will they see it?
A chart for a statistics-literate panel can carry more complexity than one for the general public. A chart for a printed report is read slowly and closely; one on a projector has three seconds from the back row. Design for the actual reader and the actual medium.
The most common failure is showing an analyst’s working chart — dense, unlabelled, six series — to a public audience who needed one clear line.
ImpactMojoData Visualization 101www.impactmojo.in
Print, screen, mobile, projector
MediumDesign for…
Print reportHigh detail, small fonts OK, must work in greyscale
Web / screenInteraction possible, but make it readable static-first
MobileOne idea, large text, vertical, few categories
Projector / roomBig marks, high contrast, 3-second readability
A chart that works on a laptop can be unreadable on a phone or a projector. Test it where it will actually be seen.
ImpactMojoData Visualization 101www.impactmojo.in
Make charts readable by everyone
  • Contrast: dark text on light, light on dark — meet WCAG contrast ratios.
  • Font size: nothing smaller than the reader can comfortably read in the medium.
  • Alt text: describe the chart’s point in words for screen-reader users.
  • Data table: offer the underlying numbers for those who can’t use the visual.
Accessibility is not an add-on for a few users — high contrast and large type make a chart clearer for everyone.
ImpactMojoData Visualization 101www.impactmojo.in
Never rely on colour alone
Because some readers cannot distinguish certain colours — and because charts get printed in greyscale and photographed badly — pair every colour with a second cue. Then the chart still works when the colour is lost.
  • Lines: different dash patterns or direct labels as well as colour.
  • Points: different shapes (circle, triangle, square) as well as colour.
  • Categories on bars: direct labels so colour is a bonus, not the only key.
ImpactMojoData Visualization 101www.impactmojo.in
Data-ink ratio
Tufte’s idea: of all the ink on a chart, what share actually encodes data? Maximise it. Every gridline, border, shadow and background that isn’t carrying information is competing with the part that is.
Erase to improve: drop heavy gridlines, boxes and backgrounds, and the data stands out more. But don’t strip away helpful labels in the name of minimalism — words are data-ink too.
ImpactMojoData Visualization 101www.impactmojo.in
Cut the junk
Chartjunk is decoration that adds no information and often subtracts clarity: 3D effects, gradients, clip-art, busy backgrounds, needless gridlines, redundant legends. It makes a chart look “designed” while making it harder to read.
The test for any element: if removing it loses no information, remove it. What remains is the chart.
ImpactMojoData Visualization 101www.impactmojo.in
Animate and interact — only when it helps
Interaction (hover, filter, zoom) and animation (transitions between states) are powerful for exploration and for showing change. But a static reader sees none of it, and a gratuitous animation just delays the point. Always make the chart work as a still image first.
  • Good: animate a transition so the reader can follow what moved.
  • Good: let users filter a big dataset to their own region.
  • Bad: spinning, bouncing or fading that carries no meaning.
ImpactMojoData Visualization 101www.impactmojo.in
11
Section Eleven
Workflow & Tools
ImpactMojoData Visualization 101www.impactmojo.in
From data to finished chart
1
Question
2
Get & clean data
3
Explore (rough charts)
4
Choose & refine
5
Annotate & source
Most of the work is steps 2 and 3 — cleaning the data and looking at it. The pretty final chart is the last 10%.
ImpactMojoData Visualization 101www.impactmojo.in
Pencil before pixels
Before opening any tool, sketch the chart on paper. It costs seconds, forces you to decide what goes on each axis, and surfaces a better idea than the software’s default. The tool should execute your decision, not make it for you.
A rough sketch you can show a colleague in 30 seconds will save you an hour of polishing the wrong chart.
ImpactMojoData Visualization 101www.impactmojo.in
Tools: spreadsheets and chart builders
  • Spreadsheets (Excel, Google Sheets, LibreOffice) — fine for quick exploration; fight their ugly defaults.
  • Datawrapper — free, makes honest, clean, responsive charts and maps fast; the newsroom standard.
  • Flourish — interactive and animated charts and stories, no code.
  • RAWGraphs — free, good for less common chart types (Sankey, beeswarm).
For most practitioners, Datawrapper covers 80% of needs and is hard to make ugly. Start there.
ImpactMojoData Visualization 101www.impactmojo.in
Tools: when you want full control
ToolGood for
Python (matplotlib, seaborn, plotly)Analysis-to-chart in one place; reproducible
R (ggplot2)Statistical graphics; the grammar of graphics done well
D3.js / ObservableBespoke, interactive web visuals; full control
Vega-LiteDeclarative charts from a JSON spec
Code wins when you need reproducibility, automation, or a chart no tool offers. The charts on The Long View are hand-built in SVG for exactly this reason.
ImpactMojoData Visualization 101www.impactmojo.in
Where to find good data (South Asia)
  • India: NFHS, Census, NSS, PLFS, data.gov.in, RBI, budget documents.
  • Global: World Bank, UN agencies (WHO, UNICEF, UN DESA), Our World in Data.
  • Climate: IPCC, EDGAR, Global Carbon Project, national communications.
  • Always: read the methodology, note the year, check the definition before you plot.
ImpactMojo’s Dataverse collects vetted development datasets for South Asia in one place.
ImpactMojoData Visualization 101www.impactmojo.in
People and places to learn from
  • Our World in Data — a master-class in clear, sourced development charts.
  • The Financial Times & The Economist visual teams — and the FT’s public “Visual Vocabulary”.
  • VizChitra — India’s data-visualisation community and conference.
  • The Pudding, Reuters Graphics, IndiaSpend for data journalism in context.
The fastest way to improve: find a chart you admire and work out, element by element, why it works.
ImpactMojoData Visualization 101www.impactmojo.in
Further reading
  • Edward Tufte — The Visual Display of Quantitative Information
  • Alberto Cairo — The Truthful Art and How Charts Lie
  • Cole Nussbaumer Knaflic — Storytelling with Data
  • Tamara Munzner — Visualization Analysis & Design (the marks-and-channels theory)
  • Catherine D’Ignazio & Lauren Klein — Data Feminism (power and data)
ImpactMojoData Visualization 101www.impactmojo.in
Get better by redrawing
Theory only takes you so far. Pick one chart a week — from a newspaper, a report, your own work — and redraw it better. Decide what it is for, fix the encoding, cut the junk, write a finding title, add the source. Keep the before-and-after.
Then explore ImpactMojo’s The Long View — every chart there is built from real, cited data, with notes on why that chart type and what to look for. It is this course, made concrete.
ImpactMojoData Visualization 101www.impactmojo.in
The whole course as a checklist
Before you draw
• What is the one question?
• Comparison, trend, part, spread or link?
• Which channel carries the key variable?
• Who reads it, and where?
Before you publish
• Bars from zero; areas by area
• Full, fair time window
• Title states the finding, honestly
• Colour-blind safe, not colour-only
• Units, source, date on the chart
Honest first, clear second, beautiful third — in that order, always.
ImpactMojoData Visualization 101www.impactmojo.in
ImpactMojo 101 Series
Now go
draw it.
Free Forever·CC BY-NC-ND 4.0·www.impactmojo.in