August 07, 2006 — Baseball
In 1976 statistician Herman Chernoff had the idea of representing multivariate data in figures now known as Chernoff Faces. The theory is that since we are highly practiced in the art of facial recognition, and can discern minute variations in features and expression, perhaps encoding data in a likeness of a human face would reveal things that, say, a bar graph wouldn't. And reading the diagram would, ideally, be as natural as interpreting the face of your best friend. As an example, here are some team statistics from the 2005 baseball season:
| PCT | H | HR | BB | SB | |
|---|---|---|---|---|---|
| ARI | .475 | 1419 | 191 | 606 | 67 |
| ATL | .556 | 1453 | 184 | 534 | 92 |
| CHI | .488 | 1506 | 194 | 419 | 65 |
| CIN | .451 | 1453 | 222 | 611 | 72 |
| COL | .414 | 1477 | 150 | 509 | 65 |
| FLO | .512 | 1499 | 128 | 512 | 96 |
| HOU | .549 | 1400 | 161 | 481 | 115 |
| LAD | .438 | 1374 | 149 | 541 | 58 |
| MIL | .500 | 1413 | 175 | 531 | 79 |
| NYM | .512 | 1421 | 175 | 486 | 153 |
| PHI | .543 | 1494 | 167 | 639 | 116 |
| PIT | .414 | 1445 | 139 | 471 | 73 |
| SDP | .506 | 1416 | 130 | 600 | 99 |
| SFG | .463 | 1427 | 128 | 431 | 71 |
| STL | .617 | 1494 | 170 | 534 | 83 |
| WAS | .500 | 1367 | 117 | 491 | 45 |
Let's depict these numbers as a series of Chernoff Faces:
For me, the Chernoff Faces have never been particularly effective or easy to read. While I can effortlessly tell how Pedro Martinez is feeling on the mound by looking at his face (on the TV broadcast), when reading Chernoff's faces I have to stop and think carefully about each one. It's not natural or easy, and usually I'd prefer a good old fashioned bar graph.
Specifically, what's wrong with Chernoff Faces? Allow me to propose that the problem lies in the un-human-ness of the stick figure drawings. They look more like bits of a breakfast cereal than a person. What's needed here are realistic human faces. And so, I humbly submit, for your consideration, Reisner faces:
Since, clearly, a more readable expression for the Mets in 2005 would be the one at right it seems I haven't done any better than Chernoff.
Oh well.
More Baseball
Jan 04, 2006 How To Graph Pennant Races