{"id":533,"date":"2011-09-16T23:46:53","date_gmt":"2011-09-17T03:46:53","guid":{"rendered":"http:\/\/heavytopspin.com\/?p=533"},"modified":"2011-09-16T23:46:53","modified_gmt":"2011-09-17T03:46:53","slug":"win-probability-graphs-and-stats","status":"publish","type":"post","link":"https:\/\/www.tennisabstract.com\/blog\/2011\/09\/16\/win-probability-graphs-and-stats\/","title":{"rendered":"Win Probability Graphs and Stats"},"content":{"rendered":"<p>Win probability graphs and stats are now available for over 600 grand slam matches from 2011. \u00a0Thanks to IBM Pointstream from this year&#8217;s slams, there is a wealth of data available like never before.<\/p>\n<p><a href=\"http:\/\/www.jeffsackmann.com\/WinProb.html\">Here&#8217;s the main menu<\/a>.<\/p>\n<p><a href=\"http:\/\/jeffsackmann.com\/cgi-bin\/wpgraph.py?m=2011U1601\">Here&#8217;s a sample match<\/a>: The US Open semifinal between Federer and Djokovic.<\/p>\n<p>When I first started publishing tennis research, win probability was one of my focuses. \u00a0You can find earlier work <a href=\"http:\/\/summerofjeff.wordpress.com\/2010\/12\/23\/tennis-win-expectancy-graphs\/\">here<\/a>, which links to specific tables for games, sets, and tiebreaks. \u00a0I&#8217;ve also published much of the <a href=\"http:\/\/summerofjeff.wordpress.com\/2011\/01\/13\/python-code-for-tennis-markov\/\">relevant code<\/a>, which is written in Python.<\/p>\n<p>Win probability\u00a0represents the odds of each player winning after every point of the match, based on the score up to that point and which player is serving. It makes no assumptions about the specific skill levels of the players, but does assume that the server has an advantage, which varies based on surface and gender. 
\u00a0With every point, each player&#8217;s win probability goes up or down, and the degree to which it rises or falls is dependent on the importance of the point&#8211;at 4-1, 40-0, winning the point is nice, but losing the point just delays the inevitable; at 5-6 in a tiebreak, the potential change in win probability is huge.<\/p>\n<p>To quantify that in the graphs, I show another metric: Volatility, which\u00a0measures the importance of each point. It is equal to the difference in win probabilities between the server winning and losing the following point. 10 percent is exciting, 20 percent is crucial, and 30 percent is edge-of-your-seat stuff.<\/p>\n<p><strong>Assumptions<\/strong><\/p>\n<p>To produce these numbers, I needed to make several simplifying assumptions. \u00a0Some are more important than others; here are the big two:<\/p>\n<ul>\n<li>The players are equal.<\/li>\n<li>Each player&#8217;s ability does not vary from point to point.<\/li>\n<\/ul>\n<p>The first of these is almost always false, and the second is probably false as well. \u00a0The first, however, makes things more interesting. \u00a0In most matches Novak Djokovic plays these days, he goes in with an 80-percent-or-better chance of winning. \u00a0If we graphed one of his matches starting at 85 percent, we&#8217;d usually get a very slowly ascending line. 
\u00a0Instead, by starting at 50 percent, we can see where he and his opponent had their biggest openings, and who took advantage.<\/p>\n<p>(<a href=\"http:\/\/summerofjeff.wordpress.com\/2010\/12\/23\/tennis-win-expectancy-graphs\/\">In this long-ago post<\/a>, I showed a sample graph with an assumption similar to the 85 percent for Djokovic, and you can see some of what I mean.)<\/p>\n<p>Assuming that the players are equal also sidesteps the messy question of how to quantify each player&#8217;s skill level on that day, on that surface, against that opponent.<\/p>\n<p>The second big assumption ignores possible real-world attributes like clutch performance and streakiness, along with more pedestrian considerations like some players&#8217; stronger serving in the deuce or ad court.<\/p>\n<p><a href=\"http:\/\/summerofjeff.wordpress.com\/2010\/12\/05\/serving-against-markov\/\">Another long-ago article of mine<\/a> suggests that servers are not absolutely consistent, possibly because of natural rises and falls in performance, also possibly because of risk-taking (or lack of concentration) in low-pressure situations. \u00a0One of the most interesting directions for research with these stats is into this inconsistency: We need to figure out whether some players are more consistent than others, whether &#8220;clutch&#8221; exists in tennis, and much more.<\/p>\n<p>One more set of assumptions regards <a title=\"The Speed of Every\u00a0Surface\" href=\"http:\/\/tennisabstract.com\/blog\/2011\/09\/13\/the-speed-of-every-surface\/\">the server&#8217;s advantage<\/a>. \u00a0Since these graphs only encompass the four grand slams, I set the server&#8217;s win percentage for each tournament. \u00a0The numbers I used for men are: 63% in Australia, 61% at the French, 66% at Wimbledon, and 64% at the U.S. Open. 
\u00a0I used percentages two points lower for women at each event.<\/p>\n<p><strong>More on Win Probability<\/strong><\/p>\n<p>There&#8217;s very little out there on win probability and volatility in tennis. \u00a0I wasn&#8217;t the first person to work out the probability of winning a game, a set, or a match from a given score, but as far as I know, I&#8217;m the only person publishing graphs like this. \u00a0Much of the problem is the limited availability of play-by-play descriptions for professional tennis.<\/p>\n<p>That problem doesn&#8217;t apply to baseball, where win probability has thrived for years. \u00a0<a href=\"http:\/\/www.hardballtimes.com\/main\/article\/the-one-about-win-probability\/\">Here&#8217;s a good intro<\/a> to win probability stats in baseball, and <a href=\"http:\/\/www.fangraphs.com\/\">fangraphs.com<\/a> is known for its single-game graphs&#8211;for instance, here&#8217;s <a href=\"http:\/\/www.fangraphs.com\/livewins.aspx?date=2011-09-16&amp;team=Reds&amp;dh=0&amp;season=2011\">tonight&#8217;s Brewers game<\/a>. \u00a0In many ways, win probability is more interesting in baseball than in tennis. \u00a0In tennis, there are only two possible outcomes of each point, while in baseball, there are several possible outcomes of each at-bat.<\/p>\n<p>Enjoy the <a href=\"http:\/\/www.jeffsackmann.com\/WinProb.html\">graphs and stats<\/a>!<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Win probability graphs and stats are now available for over 600 grand slam matches from 2011. \u00a0Thanks to IBM Pointstream from this year&#8217;s slams, there is a wealth of data available like never before. Here&#8217;s the main menu. Here&#8217;s a sample match: The US Open semifinal between Federer and Djokovic. 
When I first started publishing &hellip; <a href=\"https:\/\/www.tennisabstract.com\/blog\/2011\/09\/16\/win-probability-graphs-and-stats\/\" class=\"more-link\">Continue reading <span class=\"screen-reader-text\">Win Probability Graphs and Stats<\/span><\/a><\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"jetpack_post_was_ever_published":false,"_jetpack_newsletter_access":"","_jetpack_dont_email_post_to_subs":false,"_jetpack_newsletter_tier_id":0,"_jetpack_memberships_contains_paywalled_content":false,"_jetpack_memberships_contains_paid_content":false,"footnotes":""},"categories":[96],"tags":[],"class_list":["post-533","post","type-post","status-publish","format-standard","hentry","category-research"],"jetpack_featured_media_url":"","jetpack_sharing_enabled":true,"jetpack-related-posts":[],"_links":{"self":[{"href":"https:\/\/www.tennisabstract.com\/blog\/wp-json\/wp\/v2\/posts\/533","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.tennisabstract.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.tennisabstract.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.tennisabstract.com\/blog\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.tennisabstract.com\/blog\/wp-json\/wp\/v2\/comments?post=533"}],"version-history":[{"count":0,"href":"https:\/\/www.tennisabstract.com\/blog\/wp-json\/wp\/v2\/posts\/533\/revisions"}],"wp:attachment":[{"href":"https:\/\/www.tennisabstract.com\/blog\/wp-json\/wp\/v2\/media?parent=533"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.tennisabstract.com\/blog\/wp-json\/wp\/v2\/categories?post=533"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.tennisabstract.com\/blog\/wp-json\/wp\/v2\/tags?post=533"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated"
:true}]}}