|
| |
|
Excel Chart Doctor |
|
This page presents examples of published charts that have been
improved/ enhanced with the use of Cleveland/ Tufte data
visualization principles and Excel charting techniques presented in
this site.
I got the idea for this page from Kaiser Fung's
Junk_Charts.com
site, one of my favorite data visualization sites. Kaiser clips
interesting charts/graphs from publications and critiques them from
a data visualization standpoint, often presenting one or more
alternative ways to display the data more effectively.
Borrowing Junk_charts.com's approach, from time to time, I will
present a published chart, diagnose it's data visualization health
and produce an Excel version. Readers are
encouraged to perform their own surgery on
my chart. I'll be happy to post any effective chart surgery results that readers
submit. |
| Case Study: Description, Diagnosis |
Before Chart Surgery |
After Chart Surgery |
Showing Categorical Data on Trend Chart - Another great New York
Times chart -
The New York Times graphics team (30 strong) is a great
source for ideas on good charting. I particularly like their
March 2,
2008 trend chart of global temperature anomalies and El Nino - La
Nina episodes. This chart does not need any surgery, rather, it offers a challenge to Excel charters, how can we do that in Excel?
I've put together a workbook and video to show how we can add change
point rectangles to a XY trend chart.
Take a look....
Links:
Flash Video
Workbook |
 |
 |
|
Environmental Economics Article - Global Warming: Man versus Sun?
Environmental Economics website published this global warming article on
February 29, 2008.
The paper demonstrates effective data analysis and the use of charts to
demonstrate analysis points.
From a data visualization standpoint, the charts can be improved by
removing the Excel default background color and heavy grid lines.
This workbook included Tim Haab's data sources and recreates the charts
presented in the article, removing the chartjunk, cleaning up axis
number formats, and adding calendar values instead of month numbers.
I have posted this workbook so that users can see an example of an
effective data oriented investigation and analyze this global warming
related data on their own.
(Workbook link) |
 |
 |
Escape from Spreadsheet Hell -
Dynamic Ranges Can Help - An Excel Chart User
forum visitor recently posted this request for help: "I am always
charting 2 years of data.
I have 2 spread sheets, one has multi charts and the other is all the
data for the charts.
I'm tracking about 6 different data points in a line chart. Each month
the data sheet gets the previous months production data. I'd like to
have it automatically adjust the charts. Drop the oldest month and add
the newest month to the chart.
That would save me the time of manually updating the charts."
Ravens Fan (Excel Chart User Forum, 12/3/07)
I invited Ravens Fan to send me his workbook. He sent it
to me. Here's my diagnosis, solution to his problem and Raven's Fans
report on how much time he saves each month using dynamic range names.
|
Ravens Fan,
like many Excel chart users, spends way to much time editing routine
charts because he wasn't using dynamic range names to automatically
update his charts as new data was entered.
Dynamic range names are critical to avoiding spreadsheet hell and
updating routine charts.
The Ravens Fan spent 1.5 days per month updating his
charts before surgery. He reports that his time has been reduced to 10
minutes per month after surgery.
|
E-mail from Ravens Fan:
"Just wanted to thank you for all your
help. I finally figured out the reasoning. ... What you gave me
works great! It took my work load down from 1 1/2 days to 10 minutes.
I greatly appreciate your help" ... Ravens Fan
Follow this
link to see how dynamic range names can save you time
by automatically updating your charts as new data is entered. |
|
Replace Legend with
Series Labels -
JuiceAnalytics.Com,
one of my favorite sites, has an interesting tool to help users select
the appropriate chart for their data. Juice's Line Chart example,
shown on right before surgery, includes a legend that tells the reader
what color represent which data series. Legends make it harder for the
reader to interpret the chart because he/she must move their eye back
and forth between the series and the legend.
Let's replace the legend with data labels for each series to make it
easier to interpret. We don't want our message to get lost in the
legend.
The labels are added with a simple VBA procedure included in this
file. |
 |
 |
|
Request for
Plotting a Line From Runway Name - WLMPilot, an Excel Chart User
Forum reader, asked for help to draw runways based on the runway number
(heading). This is an interesting question, because WLMPilot wants to
draw a line based on a name. The runway numbers drop the last digit of
the 3-digit heading, so 07 is actually 70 degrees..
We can use an XY chart to plot our lines. Since it only takes 2
points to define a line, we can use 0,0 to define one end and calculate
the X,Y coordinates for the 2nd point with simple trigonometry
functions. We can then plot our line with the start and end
points. This is the
link
to revision 2 of the workbook.
|
 |
 |
|
Request for Interactive Chart
- Multi Data Sheets - Joe, an Excel Charting User Forum reader,
asked if anyone had ideas on how to create an interactive chart tool
that lets the user select which data sheet and variable data column to
make the chart. The X axis is time, so Joe also wanted the tool to let
the user select the start and end time. The resulting workbook is
available here. It provides
several features:
1.Drop down menus for the data sheet
2. Drop down menu for variable
3. Scroll bar for minimum X axis value
4. Drop Down menu for X axis time increment
5. Macro to adjust X axis for min value & increment
6. Macro to prepare list of data sheets
|
 |
 |
|
One Trend Chart
Better Than Two Pie Charts - BP does a great service summarizing
world energy use and making this data available
here. BP's
Energy in Perspective includes a graphic that compares share of
world energy use by regions for 1991 and 2006. The graphic includes two
pie charts, showing the shares for 1991 and 2006. BP's chart shows
four factors: year, region, and share of world energy use and the
equivalent tons of oil equivalent (toe). This graphic has several
shortcomings:
- Reader must scan back and forth between pie charts to assess
trend in regions share of energy use
- Increase in global energy use from 8.2 to 10.9 billion toe shown
directly with text notes and indirectly by adjusting pie chart
sizes.
- Important trend information hidden in pie charts:
- 3 countries have decreasing shares, 3 countries have
increasing shares
- China and Other EMEs have fastest rate of increase in energy
share
- FSU has greatest rate of decrease in energy use
- US and Europe have decreasing shares of energy use
- Other OECD countries had slight increase in share of energy
use
The Chart Doctor's version is a simple trend chart with each region
labeled and the 1991 and 2006 data values displayed. The increasing
trend regions are noted with the black line, the decreasing share
regions noted with the gray lines.
Initials:
OECD - 30 developed economies, including USA and Europe
EME - Emerging Market Economy countries
FSU - Former Soviet Union countries
Other EMEs - EMEs less China and FSU countries |
 |
 |
Clustered Column Chart for Trends Not
Clear - Jon
Udell posted this clustered crime trend chart in an article on his
analysis of local crime trends in Keene, NH.
Jon's chart shows 3 factors in a 2 dimensional (X and Y), year,. distance
from 1 Central Square and % of annual crimes by location. He chose to
use a clustered column chart.
Clustered column charts are a poor choice for showing trends of
several series because it
is more difficult for the viewer to follow the trend pattern for a
particular location, the columns for the other series break up the
pattern, forcing the reader to work harder than necessary.
The Chart Doctor's Excel version uses a simple XY - scatter trend chart of the 3
series. It clearly shows the abrupt changes in 2006, the decrease in the <
05. mi series and the increase in the > 1 mi series. This version labels the last point so
that the legend can be removed and removes the data labels to help focus the
viewers attention on what is important, the pattern shift in 2006.
Selecting the right chart type is one of the most important steps in
making an effective chart. |
 |
 |
Where are the crimes?
Junk_Charts.com
has a discussion on charting crime trends by region of the US and type
of crime. The source data is presented in the data table.Junk_Charts
made a "profile chart" shown in the before column.
The Chart Doctor's Excel version uses a series of XY
charts with error bars to produce a panel like display so that readers
can compare regional changes by crime type.
|
 |
 |
|
Graphic Equity 2
JunkCharts.com has a great discussion on energy
use charting. Kaiser has shown network, mosaic, and flow diagrams
for energy source and sector use data.
Jon Peltier used this data to show how you can make a mosaic
chart in Excel.
Panel charts allow users to create
multivariate plots all in one Excel chart. My is to use a
Horizontal Panel Dot Plot.
|
 |
 |
| Operating Room Schedule - I received an e-mail
request from a real doctor who wanted help developing a chart to show
the schedule for an operating room. The doctor sent along a hand sketch.
I was happy very happy to develop the Excel chart on the right to
show the daily schedule for an operating room. |
 |
 |
|
Blood Pressure Trend
- On the Excel Chart user forum, Marc asked how to put connecting lines
between data points on an XY scatter chart of blood pressure readings.
Finding it easier to make the workbook example than explain it, I
created this downloadable example. It uses Y error bars to connect the
Systolic and diastolic readings. It also uses dynamic range names to
provide automatic updating of the chart.
File Link |
|
 |
Chart Should Tell the Full Story
JunkChart.com
shows an example of a junked up chart from the June, 2006 issue
of Scientific America. The problem chart tries to show the relationship
between watershed concrete/asphalt cover and tidal creek water quality.
This chart has many problems, including inappropriate chart type (bar
chart), unnecessary and distracting use of pie charts, using creek name
as X variable when real variable is % impervious cover, claiming "strong
correlation" without any information on correlation coefficient and
relationship.JunkChart solved most of the problems, except for
information on strength and type of relationship between impervious
cover and fecal coliform counts and labeling the creek names.
Here's my Excel version, building on
JunkChart.com's
initial surgery. |
 |
 |
| Stacked Trend Bar Charts Not
Clear - Stacked bar charts are often used to compare trends of
several factors over time.
BP does a great service by preparing the annual
BP Statistical Review of World Energy and the companion
Excel workbook. BP uses stacked/clustered bar charts in their 2006
review. I will be commenting on several of them. In this case, let's
look at BP's Oil Consumption Growth for 2003 - 2005 stacked bar chart.
Since the bars are cumulative, to follow the growth trends for China and
the USA in BP chart, the reader must do some quick mental arithmetic.
My version, uses a simple XY chart so that each area has a
separate series so that the reader can see the US, China and Rest of
World values directly without and mental calculation. this version uses
a custom Y Axis and includes attaches labels to the individual series
rather than use a legend.
The Excel workbook is here
if you'd like to create your own version. Be sure to drop a
comment if you have
suggestions, thoughts or an alternative version. |
 |
 |
| Dot
Plot with 2 Level Y Axis Labels Clearer than Column Chart -
Stephen Few
reports on two data websites that provide data charting and sharing
venues: Data360 and
Swivel. While these sites provide
access to interesting data, the data visualization quality of their
charts is limited. Let's look at Data360's chart on
automobile CO2 emissions. There are a number of concerns:
o Vertical alignment of vehicle labels makes them difficult to read
o Vehicle type (hybrid, SUV) hidden in vehicle name
o Data labels repeat manufacturer
o Emission comparison by manufacturer, not vehicle type
o Horizontal grid not necessary
My Excel chart version uses a dot plot with a 2 level Y axis data
label so that the vehicles can be grouped by vehicle type. This version
helps to uncover hidden information that was hidden in the Data360
version. |
 |
 |
|