When social scientists like me have lots of data, we tend to put it in tables. There’s nothing wrong with tables. They contain infinite details within a finite and teeny space, and they give the reader freedom to draw his or her own conclusions about which details are worth paying attention to.

In today’s post I’m teaching you:

  • how to explore your data tables with Excel’s conditional formatting;
  • how clustered bar charts are the worst charts ever;
  • how a small multiples chart can save the day by quickly visualizing an entire table’s worth of details in the same amount of space as the original table; and
  • how to create that small multiples chart in Excel (and it takes less than 5 minutes, I promise).

The Original: A Data Table

This example comes from a short and sweet Child Trends brief: www.childtrends.org/wp-content/uploads/2013/07/125_Flourishing_Measures.pdf. This is a pretty typical situation in research and evaluation projects in which we’ve got some interesting variables across the top and multiple demographic breakdowns along the side.

original

Option A: Conditional Formatting

I wanted to see the patterns and understand what was going on. For starters, I used conditional formatting to color-code any cells that were greater than the overall percentage. For example, in Column B, 93% of children “always” or “usually” were affectionate and tender with their parent. Girls were slightly more likely than boys to be affectionate and tender (95% of girls compared to 93% of children overall) so the 95% is highlighted.

Now what can I see? Something cool is going on with the kids above poverty and whose parents have higher levels of education.

This conditional formatting technique is a good start, but it didn’t seem to show me enough patterns, so I kept going.

option-A_conditional-formatting

Option B: Data Bars

Data bars–within-cell bar charts–are another underused conditional formatting technique for exploring preliminary patterns from the comfort of your spreadsheet. I teach you how to make them here.

What’s happening with column C? Why are those bars so short? Why are the kids less likely to bounce back? Now that I can see what’s going on, my brain’s full of questions.

 

option-B_data-bars

Option C: Clustered Bar Chart

I made one of these bad boys just for fun, and the result was even more incomprehensible than I could have imagined. The not-so-fun part is that I still see charts like these in research and evaluation reports. If you’re still using these, please, contact me and I’ll help you out. option-C_clustered-bar

Option D: Small Multiples Bar Chart

I could’ve visualized this data through a million different individual bar charts: a bar chart showing the overall percentages for each of the four indicators (the option selected by the authors in their original report), a dot plot showing the gender differences for one or two or three or four of the indicators, a bar chart focusing on parental education, etc.

Instead of highlighting one or two patterns, I wanted to see the entire landscape of what was going on. Same real estate as the original data table. The chart could fit on just one page within the body of a report or in an appendix. But now, my brain receives the information through both images and text, rather than just text alone like in the original data table.

And since my primary goal was to see the big picture, I opted for a generic title, generic subtitle, and emphasis-free color scheme. In other words, my title just states the graph’s content rather than putting forth a “so what?” (it’s modeled after a data table from an appendix, after all). My subtitle describes the data source but doesn’t give my interpretation of what’s going on. My colors are matched to Child Trends’ logo rather than Excel’s default color scheme, but the colors don’t emphasize one category or series of data over another. I’ll describe these distinctions in more detail in a future post.

option-D_small-multiples-1

(You can also view the high resolution image here.)