[MINI] Anscombe's Quartet

Data Skeptic

Episode | Podcast

Date: Fri, 12 Jun 2015 08:00:00 +0000

<p style="margin: 0px 0px 10px; color: #224422; font-family: sans-serif; font-size: 14px; line-height: 24px;"> This mini-episode discusses <a href="http://en.wikipedia.org/wiki/Anscombe%27s_quartet" style="color: #337ab7; text-decoration: none; background-color: transparent;">Anscombe's Quartet</a>, a set of four datasets that look very different when plotted yet share nearly identical summary statistics. For example, each of the four datasets has almost exactly the same mean and variance on both axes, as well as the same correlation coefficient and the same linear regression line.</p> <p> The episode tries to add some context by imagining each of these datasets as data about a sports team, and explains why it can be important to look beyond basic summary statistics when exploring your dataset.</p>
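<p> As a rough illustration of the point above, the sketch below computes those summary statistics for the four datasets. The values are the quartet as published by Anscombe (1973); the names <code>QUARTET</code> and <code>summarize</code> are made up for this example and do not come from the episode.</p>

```python
from math import sqrt

# Anscombe's quartet as published by Anscombe (1973).
# Datasets I-III share the same x values; dataset IV has its own.
X123 = [10, 8, 13, 9, 11, 14, 6, 4, 12, 7, 5]
QUARTET = {
    "I": (X123, [8.04, 6.95, 7.58, 8.81, 8.33, 9.96,
                 7.24, 4.26, 10.84, 4.82, 5.68]),
    "II": (X123, [9.14, 8.14, 8.74, 8.77, 9.26, 8.10,
                  6.13, 3.10, 9.13, 7.26, 4.74]),
    "III": (X123, [7.46, 6.77, 12.74, 7.11, 7.81, 8.84,
                   6.08, 5.39, 8.15, 6.42, 5.73]),
    "IV": ([8, 8, 8, 8, 8, 8, 8, 19, 8, 8, 8],
           [6.58, 5.76, 7.71, 8.84, 8.47, 7.04,
            5.25, 12.50, 5.56, 7.91, 6.89]),
}


def summarize(xs, ys):
    """Return basic summary statistics for paired samples xs, ys."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Sums of squared deviations and cross-deviations from the means.
    sxx = sum((x - mean_x) ** 2 for x in xs)
    syy = sum((y - mean_y) ** 2 for y in ys)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx  # ordinary least-squares fit of y on x
    return {
        "mean_x": mean_x,
        "var_x": sxx / (n - 1),  # sample variance
        "mean_y": mean_y,
        "var_y": syy / (n - 1),
        "corr": sxy / sqrt(sxx * syy),  # Pearson correlation
        "slope": slope,
        "intercept": mean_y - slope * mean_x,
    }


if __name__ == "__main__":
    for name, (xs, ys) in QUARTET.items():
        stats = summarize(xs, ys)
        print(name, {key: round(value, 2) for key, value in stats.items()})
```

<p> Rounded to two decimal places, the four rows printed should be close to indistinguishable, which is why a scatter plot of each dataset is the step that actually reveals how different they are.</p>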