{"id":900,"date":"2016-08-23T12:36:46","date_gmt":"2016-08-23T16:36:46","guid":{"rendered":"http:\/\/www.chiplynch.com\/wordpress\/?p=900"},"modified":"2016-08-25T14:00:29","modified_gmt":"2016-08-25T18:00:29","slug":"horses-and-olympians-and-data-and-such","status":"publish","type":"post","link":"https:\/\/www.chiplynch.com\/wordpress\/?p=900","title":{"rendered":"Horses and Olympians and Data and Such"},"content":{"rendered":"<p>I pick on <a href=\"https:\/\/twitter.com\/aaronecarroll\">Aaron Carroll<\/a>\u00c2\u00a0a lot, and it&#8217;s really not that he deserves picking on, it&#8217;s in fact because he writes so much good stuff that I like that I am compelled to\u00c2\u00a0investigate\u00c2\u00a0at length. \u00c2\u00a0Fortunately this time he actually asked for comments, so here we are&#8230;<\/p>\n<p>He recently posted a simple question (itself based upon a <a href=\"http:\/\/www.benrosen.com\/2007\/11\/horses-vs-humans-improvement-of-which.html\">piece by Ben Rosen<\/a>): \u00c2\u00a0<a href=\"http:\/\/theincidentaleconomist.com\/wordpress\/why-are-people-getting-so-much-faster-but-not-horses\/\">&#8220;Why are people getting so much faster, but not horses?&#8221;<\/a>\u00c2\u00a0on <a href=\"http:\/\/theincidentaleconomist.com\/\">The Incidental Economist<\/a> blog, which you should read (the post, the site, all of it).<\/p>\n<p>It has two pretty charts , and otherwise it&#8217;s very short. \u00c2\u00a0But he asks for an answer to the question. \u00c2\u00a0First, the charts&#8230; one depicting the wining Kentucky Derby time over 60ish\u00c2\u00a0years and the next depicting the world record human\u00c2\u00a01-Mile running record:<\/p>\n<div id=\"attachment_901\" style=\"width: 1004px\" class=\"wp-caption aligncenter\"><a href=\"http:\/\/www.benrosen.com\/2007\/11\/horses-vs-humans-improvement-of-which.html\"><img loading=\"lazy\" decoding=\"async\" aria-describedby=\"caption-attachment-901\" class=\"wp-image-901 size-full\" src=\"http:\/\/www.chiplynch.com\/wordpress\/wp-content\/uploads\/2016\/08\/Derby-Times-vs-Running-Records.png\" alt=\"Charts from: http:\/\/www.benrosen.com\/2007\/11\/horses-vs-humans-improvement-of-which.html\" width=\"994\" height=\"345\" srcset=\"https:\/\/www.chiplynch.com\/wordpress\/wp-content\/uploads\/2016\/08\/Derby-Times-vs-Running-Records.png 994w, https:\/\/www.chiplynch.com\/wordpress\/wp-content\/uploads\/2016\/08\/Derby-Times-vs-Running-Records-300x104.png 300w, https:\/\/www.chiplynch.com\/wordpress\/wp-content\/uploads\/2016\/08\/Derby-Times-vs-Running-Records-150x52.png 150w, https:\/\/www.chiplynch.com\/wordpress\/wp-content\/uploads\/2016\/08\/Derby-Times-vs-Running-Records-768x267.png 768w\" sizes=\"auto, (max-width: 994px) 100vw, 994px\" \/><\/a><p id=\"caption-attachment-901\" class=\"wp-caption-text\">Charts from\u00c2\u00a0<a href=\"http:\/\/www.benrosen.com\/2007\/11\/horses-vs-humans-improvement-of-which.html\">Ben Rosen<\/a><\/p><\/div>\n<p>So, without the raw number or validating the trend lines, sure&#8230; the chart on the left shows a pretty even-keel trend (note the axis labels &#8212; the deviation\u00c2\u00a0is less than 5%, even though it looks spikey), while the one on the right shows a clearly, quickly, decreasing line.<\/p>\n<p>Of course, the error is easy to spot, and I&#8217;m sure Aaron was being coy about it&#8230; you can&#8217;t compare best times at one race against a world record history. \u00c2\u00a0The right hand chart is NECESSARILY going to go down all the time. \u00c2\u00a0It would be astounding if the left hand chart did&#8230; if every single race beat the best time of the previous race. \u00c2\u00a0And remember, the right hand chart is the aggregation of hundreds, possibly thousands of races competing for that best race. \u00c2\u00a0It&#8217;s apples to oranges, and that explanation fits in a tweet:<\/p>\n<blockquote class=\"twitter-tweet\" data-lang=\"en\">\n<p dir=\"ltr\" lang=\"en\"><a href=\"https:\/\/twitter.com\/aaronecarroll\">@aaronecarroll<\/a> I&#8217;m guessing breeding vs. training, but comparing a monotonically decreasing record to best times in one race is misleading.<\/p>\n<p>\u00e2\u20ac\u201d Chip Lynch (@chipmonkey75) <a href=\"https:\/\/twitter.com\/chipmonkey75\/status\/767751975949758464\">August 22, 2016<\/a><\/p><\/blockquote>\n<p><script async src=\"\/\/platform.twitter.com\/widgets.js\" charset=\"utf-8\"><\/script><\/p>\n<p>Right, right, &#8220;monotonically decreasing&#8221; because world record times vs. one race is the core issue&#8230; &#8220;But wait!&#8221; I can hear you all gasp, in the back of your minds, why throw in the bit about training and breeding?<\/p>\n<p>See, the reason this deserves more than a tweet, and thus the deep dive blog post, is that just because the charts used to ask a question were bad doesn&#8217;t mean the original question _itself_ was bad, and certainly we haven&#8217;t answered it. The answer to &#8220;why are humans improving faster than horses&#8221; is not &#8220;your charts don&#8217;t match&#8221;. \u00c2\u00a0It&#8217;s just the charts don&#8217;t prove the premise (that people ARE improving faster than horses), so we have to go back and look at it a bit first. \u00c2\u00a0(NB: I think they might be, thus the breeding vs. training comment, but we&#8217;ll come back to that).<\/p>\n<p>To check the underlying assumption, we have to look at world-record times for horses. And that takes a bit of digging. The <a href=\"http:\/\/www.guinnessworldrecords.com\/\">Guinness World Record<\/a> people keep track of the &#8220;<a href=\"http:\/\/www.guinnessworldrecords.com\/world-records\/fastest-speed-for-a-race-horse\/\">Fastest Race Horse<\/a>&#8220;, but there&#8217;s no easily accessible history. \u00c2\u00a0And the <a href=\"http:\/\/www.equibase.com\/\">horse-racing statistics site Equibase<\/a> has <a href=\"http:\/\/www.equibase.com\/about\/northamericanrecords.cfm\">a page of about 75 record times<\/a> (combinations of different length horce races on dirt, turf, and all-weather tracks), but again these are only the current record holders.<\/p>\n<p>But it&#8217;s a good starting point. \u00c2\u00a0The Kentucky Derby is a 1-1\/4 mile dirt track. \u00c2\u00a0It turns out that the world record for a 1-1\/4 mile horse run was NEVER set there (as far as I can tell)&#8230; the three records I can find are:<\/p>\n<ol>\n<li><a href=\"https:\/\/www.racingmuseum.org\/hall-of-fame\/spectacular-bid\">Spectacular Bid<\/a> ran 1:57.8 (117.8 seconds) on Feb. 3, 1980 at Santa Anita Park<\/li>\n<li><a href=\"https:\/\/www.racingmuseum.org\/hall-of-fame\/noor\">Noor<\/a> ran it in 1:58.1 (118.1 seconds) on June 20, 1950\u00c2\u00a0&#8220;beating the prior record by 1.6 seconds&#8221; meaning&#8230;<\/li>\n<li>someone ran it in 1:59.8 (119.8 seconds)&#8230; sometime (Coaltown matched\u00c2\u00a0this in 1949, but the record had already been set)<\/li>\n<\/ol>\n<p>Ok, so that&#8217;s not much data and it was hard to find. \u00c2\u00a0I&#8217;m searching for a better set of world record horse racing data, but for now at least we can compare a chart in Excel because why not!<\/p>\n<div id=\"attachment_905\" style=\"width: 994px\" class=\"wp-caption aligncenter\"><a href=\"http:\/\/www.chiplynch.com\/wordpress\/wp-content\/uploads\/2016\/08\/Horse-Records-vs-Human-Records-1.png\"><img loading=\"lazy\" decoding=\"async\" aria-describedby=\"caption-attachment-905\" class=\"size-full wp-image-905\" src=\"http:\/\/www.chiplynch.com\/wordpress\/wp-content\/uploads\/2016\/08\/Horse-Records-vs-Human-Records-1.png\" alt=\"Horse Records vs. People Records on a bad scale\" width=\"984\" height=\"359\" srcset=\"https:\/\/www.chiplynch.com\/wordpress\/wp-content\/uploads\/2016\/08\/Horse-Records-vs-Human-Records-1.png 984w, https:\/\/www.chiplynch.com\/wordpress\/wp-content\/uploads\/2016\/08\/Horse-Records-vs-Human-Records-1-300x109.png 300w, https:\/\/www.chiplynch.com\/wordpress\/wp-content\/uploads\/2016\/08\/Horse-Records-vs-Human-Records-1-150x55.png 150w, https:\/\/www.chiplynch.com\/wordpress\/wp-content\/uploads\/2016\/08\/Horse-Records-vs-Human-Records-1-768x280.png 768w\" sizes=\"auto, (max-width: 984px) 100vw, 984px\" \/><\/a><p id=\"caption-attachment-905\" class=\"wp-caption-text\">Horse Records vs. People Records (on a bad scale)<\/p><\/div>\n<p>Hey look! \u00c2\u00a0Our trendlines nearly match! \u00c2\u00a0Horses HAVE been improving just like people have, right?! \u00c2\u00a0Excellent, case closed, let&#8217;s go home.<\/p>\n<p>Except, of course, it&#8217;s not that easy either. \u00c2\u00a0These scales aren&#8217;t remotely similar (the left hand chart is eggregious, but both of them are misleading in order to fill in the space with an arbitrary y-axis) and of course the left hand side is based on exactly two points (note that the trendline is NOT the line between the two points&#8230; this is some of the oddity of evaluating a world record trend, since the data points for subsequent years are really imputed).<\/p>\n<p>UPDATE &#8211; TWEET FROM AARON:<\/p>\n<blockquote class=\"twitter-tweet\" data-lang=\"en\">\n<p dir=\"ltr\" lang=\"en\"><a href=\"https:\/\/twitter.com\/chipmonkey75\">@chipmonkey75<\/a> Come on. That&#8217;s lying with charts. Horse times improved less than a second once in fifty years.<\/p>\n<p>\u00e2\u20ac\u201d Aaron E. Carroll (@aaronecarroll) <a href=\"https:\/\/twitter.com\/aaronecarroll\/status\/768192887175524352\">August 23, 2016<\/a><\/p><\/blockquote>\n<p><script async src=\"\/\/platform.twitter.com\/widgets.js\" charset=\"utf-8\"><\/script><br \/>\nBut&#8230; but&#8230; right, but&#8230;\u00c2\u00a0the original chart&#8230; I mean&#8230; that was kind of my p&#8230;. oh, never mind.<\/p>\n<p>Anyway, at this point, we&#8217;re going to need more data and to come up with a method of comparing the base premise&#8230; it&#8217;s easy to say that since a horse record was broken once in 50 years and a human record 10 times or so (I&#8217;m counting inflection points), that the idea that humans are improving faster is valid, it&#8217;s far from a rigorous statistical analysis. \u00c2\u00a0Is one incident at one distance, or one-second in 120 vs 15 from 240 THAT different? \u00c2\u00a0As mentioned before, there are dozens of horse race distances and categories, and the same goes for human records &#8212; is the mile an outlier? \u00c2\u00a0Do records get beaten as frequently and to the same degree in the 100-meter? \u00c2\u00a0In the half- and full-marathons?<\/p>\n<p>I mean, it seems like it&#8230; from this limited data, I just think we should be more careful taking it for granted. \u00c2\u00a0Still, I stand by my point that breeding plus classic training, which would have shown benefits a hundred plus years ago in horses, beats out modern training which has only recently become the\u00c2\u00a0massively streamlined situation it is today. \u00c2\u00a0Sure people are physically changing too, and we may yet start engineering people for their speed, but for now, well&#8230;<\/p>\n<p>let me see if I can dig up some data and maybe we&#8217;ll write a part 2&#8230;<\/p>\n<p>Update update: I&#8217;m probably not going to get any more data, and since I think I wasn&#8217;t on the same page as the original post I&#8217;ll just bow out quietly without starting much more of a row, and point my reader (s?) to <a href=\"http:\/\/www.perceptualedge.com\/blog\/\">Stephen Few&#8217;s blog<\/a>, where we can discuss visualizing data best practices (like zero-axis bases and similar scales) a bit better (and who, similar to Mr. Carroll, I pick on sometimes mostly because I like his writing and agree with him so much of the time).<\/p>\n","protected":false},"excerpt":{"rendered":"<p>I pick on Aaron Carroll\u00c2\u00a0a lot, and it&#8217;s really not that he deserves picking on, it&#8217;s in fact because he writes so much good stuff that I like that I am compelled to\u00c2\u00a0investigate\u00c2\u00a0at length. \u00c2\u00a0Fortunately this time he&#8230; <a class=\"read-more\" href=\"https:\/\/www.chiplynch.com\/wordpress\/?p=900\">Read More<\/a><\/p>\n","protected":false},"author":2,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_exactmetrics_skip_tracking":false,"_exactmetrics_sitenote_active":false,"_exactmetrics_sitenote_note":"","_exactmetrics_sitenote_category":0,"footnotes":""},"categories":[29,24,10],"tags":[37,38,36],"class_list":["post-900","post","type-post","status-publish","format-standard","hentry","category-data","category-happytechnologist","category-random-musings","tag-horse-racing","tag-horses","tag-kentucky-derby"],"_links":{"self":[{"href":"https:\/\/www.chiplynch.com\/wordpress\/index.php?rest_route=\/wp\/v2\/posts\/900","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.chiplynch.com\/wordpress\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.chiplynch.com\/wordpress\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.chiplynch.com\/wordpress\/index.php?rest_route=\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/www.chiplynch.com\/wordpress\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=900"}],"version-history":[{"count":10,"href":"https:\/\/www.chiplynch.com\/wordpress\/index.php?rest_route=\/wp\/v2\/posts\/900\/revisions"}],"predecessor-version":[{"id":912,"href":"https:\/\/www.chiplynch.com\/wordpress\/index.php?rest_route=\/wp\/v2\/posts\/900\/revisions\/912"}],"wp:attachment":[{"href":"https:\/\/www.chiplynch.com\/wordpress\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=900"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.chiplynch.com\/wordpress\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=900"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.chiplynch.com\/wordpress\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=900"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}