December 30, 2012

An introduction to data visualization

I've recently discovered something called Infographics which could be explained as the art of explaining something difficult in a picture. This something difficult could be a lot of numbers, or an event such as the sinking of the Titanic. The goal of the Infographic is that someone without any knowledge within the subject the picture is explaining, immediately should get what the picture is about. If you need an example, you should visit visual.ly which is an Internet service where you can show the finished Infographics you've made.

So, how do you create an Infographic? The reason I'm asking is because there's a competition going on at Kaggle where the goal is to visualize the school system in Colorado, and I'm participating in that competition. I've never made one before, so this will be a summary on how to do one. To learn how to make an Infographic, I've decided to read some books and watch some online tutorials on YouTube.

The best book I've found on the subject is Envisioning Information by Edward Tufte, and it was recommended to me by Tim Ferriss in one of the Random Show episodes. Edward Tufte is a professor at Yale University and has written several books on the same subject. Envisioning Information is a quite small book, around 130 pages with many pictures, and it explains how to represent a rich visual world on "flatland" where flatland is something flat such as paper, but could also be a memorial and similar structures. One example is how to represent a subway system on a map to make it fast and easy to understand if you have never entered the subway before. A tourist in London should immediately understand,without any confusion, how to travel from the hotel to Madame Tussauds.
Edward Tufte explains that to envision information, you should work in the intersection of image, word, number, and art, using visual principles that tells us how to put the right mark in the right place. Here are some of the visual principles from the book:
  • Avoid "chartjunk." Chartjunk is the art of decorating a chart with "fluff", such as unneeded pictures or dark grid lines, to make it look more interesting, but the chart will also be less credible to the spectators. Decorations are never needed, and if the numbers are boring, then you've got the wrong numbers. You can still use techniques such as colors, typography, layout, and similar, as long as you avoid unneeded junk.  
  • Respect the audience. Consumers of graphics are often more intelligent about the information at hand than those who fabricate the data decoration. The audience may be busy, but they are alert and caring - not stupid. 
  • To clarify, add detail. Thin data may lead to suspicions: "What are they leaving out? What are they hiding?"
  • Clutter and confusion are failures of design. It is not how much information there is, but rather how effectively it is arranged. 
  • Use a panorama. A panorama deliver to viewers the freedom of choice that derives from an overview, a capacity to compare and sort through detail. When appropriate, you can combine a panorama with a more 2-dimensional picture.  
  • 1 + 1 = 3. White space is something. Add more shapes, and thus spaces between the shapes, and the amount of noise will increase exponentially. On white backgrounds, a varying range of lighter colors on the shapes will minimize the clutter.   
  • Avoid color damage. Pure, bright or very strong colors should be used sparingly on or between dull background tones. Light, bright colors should not be mixed with white next to each other. Color spots against a light gray or muted field highlight and italicize data, and also help to achieve an overall harmony. Use colors found in nature to represent and illuminate information since these colors are familiar to the human eye. Gray is regarded in painting to be one of the prettiest, most important and most versatile of colors. 

To create an Infographic, you will need some kind of software, and I believe that Illustrator is the most common software used by the designers of Infographics. One good, and free, replacement software for Illustrator is Inkscape
The YouTube tutorials I've found on the subject are:

No comments:

Post a Comment