{"id":2718,"date":"2018-08-09T14:45:22","date_gmt":"2018-08-09T14:45:22","guid":{"rendered":"http:\/\/www.tennisabstract.com\/blog\/?p=2718"},"modified":"2018-08-09T14:45:22","modified_gmt":"2018-08-09T14:45:22","slug":"the-cost-of-a-double-fault","status":"publish","type":"post","link":"https:\/\/www.tennisabstract.com\/blog\/2018\/08\/09\/the-cost-of-a-double-fault\/","title":{"rendered":"The Cost of a Double Fault"},"content":{"rendered":"<p>We all know that double faults aren&#8217;t good, but it&#8217;s less clear just how bad they are. Over the course of an entire match, a single point here or there doesn&#8217;t seem to matter too much, especially when a double fault creeps in at a harmless moment, like 40-love. Yet many missed second serves are far more costly. Let&#8217;s try to quantify the impact of tennis&#8217;s most enervating outcome.<\/p>\n<p>To do this, we need to think in terms of <em><a href=\"https:\/\/github.com\/JeffSackmann\/tennis_misc\">win probability<\/a>.<\/em>\u00a0In each match, a player wins a certain percentage of service points and a certain percentage of return points. If those rates are sufficiently dominating&#8211;say, <a href=\"http:\/\/www.tennisabstract.com\/cgi-bin\/player.cgi?p=MihaelaBuzarnescu\" target=\"_blank\" rel=\"noopener\">Mihaela Buzarnescu<\/a>\u2019s 65% of service points won and 59% of return points won in last week&#8217;s San Jose final&#8211;the player&#8217;s chance of winning the match is 100%. No matter how unlucky or unclutch she was, those percentages result in a win. But in a close contest, in which both players win about 50% of points (<a href=\"http:\/\/www.tennisabstract.com\/blog\/2015\/09\/11\/a-new-way-of-looking-at-lottery-matches\/\">often referred to as &#8220;lottery matches&#8221;<\/a>), the result is heavily influenced by clutch play and luck. In Buzarnescu&#8217;s tour de force, flipping the result of a single point would be meaningless. But in a tight match, like the Wimbledon semifinal between <a href=\"http:\/\/www.tennisabstract.com\/cgi-bin\/player.cgi?p=JohnIsner\" target=\"_blank\" rel=\"noopener\">John Isner<\/a> and <a href=\"http:\/\/www.tennisabstract.com\/cgi-bin\/player.cgi?p=KevinAnderson\" target=\"_blank\" rel=\"noopener\">Kevin Anderson<\/a>, a single point could mean the difference between a spot in the championship match and an early flight home.<\/p>\n<p>My aim, then, is to measure the average win probability impact of a double fault. To take another example, consider last week&#8217;s Washington quarter-final between <a href=\"http:\/\/www.tennisabstract.com\/cgi-bin\/player.cgi?p=AndreaPetkovic\" target=\"_blank\" rel=\"noopener\">Andrea Petkovic<\/a> and <a href=\"http:\/\/www.tennisabstract.com\/cgi-bin\/player.cgi?p=BelindaBencic\" target=\"_blank\" rel=\"noopener\">Belinda Bencic<\/a>. Bencic won nearly 51% of total points&#8211;59% of her service points and 42% on return&#8211;but lost in a third-set tiebreak. Those serve and return components were enough to give her a 56.3% chance of winning the match: claiming more than half of total points usually results in victory, but so close to 50%, there&#8217;s plenty of room for things to go the other way.<\/p>\n<p>I refer to this match because double faults played a huge role. Bencic tallied 12 double faults in 105 service points, a rate of 11.4%, more than double the WTA tour average of 5.1%. Had she avoided those 12 double faults and won those points at the same rate as her other 93 service points, she would have ended up with a much more impressive service-points-won rate of 67%. Combined with her 42% rate of return points won, that implies an 87% chance of winning the match&#8211;more than 30 percentage points higher than her actual figure! Roughly speaking, each of her 12 double faults cost her a 2.5% chance (30% divided by 12) of winning the match.<\/p>\n<p>A double fault rate above 10% is unusual, but a cost of 2.5% per offense is not. When we run this algorithm across the breadth of the ATP and WTA tours, we find that the cost of double faults adds up fast.<\/p>\n<p><strong>Tour averages<\/strong><\/p>\n<p>Using the method I&#8217;ve described above&#8211;replacing double faults with average non-double-fault service points&#8211;and taking the average of all tour-level matches in 2017 and 2018 through last week&#8217;s tournaments, we find that the average WTA double fault costs a player 1.83% of a win. Put another way, every 55 additional double faults subtracts one match from the win column and adds one to the loss column.<\/p>\n<p>In the men&#8217;s game, the equivalent number is 1.99% of a win. The slightly bigger figure is due to the fact that men, on average, win more service points, so the difference between a double fault and a successful service offering is greater.<\/p>\n<p>There is, however, an alternative way we could approach this. By comparing double faults to all other service points, we&#8217;re trading a lot of the double faults for\u00a0<em>first\u00a0<\/em>serve outcomes. We might be more interested in knowing how a player would fare if his or her\u00a0<em>second<\/em> serve were bulletproof&#8211;still eliminating double faults, but replacing them specifically with second serves instead of a generic mix of service points.<\/p>\n<p>In that case, the algorithm remains very similar. Instead of replacing double faults with non-double-fault serve points, we replace them with non-double-fault <em>second<\/em> serve points. Then the cost of a double fault is a little bit less, because second serve points result in fewer points won than service points overall. The second-serve numbers are 1.61% per double fault in the women&#8217;s game and 1.70% per double fault in the men&#8217;s game. For the remainder of this post, I&#8217;ll stick with the generic service points, but one approach is not necessarily better than the other; they simply measure different things.<\/p>\n<p><strong>Building a player-specific stat<\/strong><\/p>\n<p>Odious as double faults are, they are not completely avoidable. Very few players are able to sustain a double fault rate below 2%, and tour averages are around twice that. Since the beginning of 2017, the ATP average has been about 3.9%, and the WTA average roughly 5.1%, as we saw above.<\/p>\n<p>We can measure players by considering their match-by-match double fault rates\u00a0<em>compared to tour average.\u00a0<\/em>In Bencic&#8217;s unfortunate case, her 12 double faults were 6.7 more than a typical player would&#8217;ve committed in the same number of service points. In contrast, in the same match, Petkovic recorded only 3 double faults in 102 service points, 2.2 double faults\u00a0<em>fewer<\/em> than an average player would have.<\/p>\n<p>We know that each WTA double fault affects a player&#8217;s chances of winning the match by 1.83%, so compared to an average service performance, Bencic&#8217;s excessive service errors cost her about a 17% chance of winning (6.7 times 1.83%), while Petkovic&#8217;s stinginess increased her own odds by about 6.6% (2.2 times 1.83%).<\/p>\n<p>Repeat the process for every one of a player&#8217;s matches, and you can assemble a longer-term statistic. Let&#8217;s start with the WTA players who, since the start of last season, have cost themselves the most matches (&#8220;DF Cost&#8221;&#8211;negative numbers are bad), along with those who have most improved their lot by avoiding double faults:<\/p>\n<pre>Player                   DF%  DF Cost  \nKristina Mladenovic     7.7%    -3.84  \nDaria Gavrilova         7.9%    -3.77  \nJelena Ostapenko        7.7%    -3.58  \nPetra Kvitova           8.1%    -3.01  \nCamila Giorgi           8.3%    -2.63  \nOceane Dodin           10.2%    -2.51  \nDonna Vekic             7.0%    -1.91  \nVenus Williams          6.7%    -1.71  \nCoco Vandeweghe         6.4%    -1.60  \nAliaksandra Sasnovich   6.7%    -1.55  \n\u2026                                      \nAgnieszka Radwanska     2.3%     1.27  \nSloane Stephens         2.1%     1.43  \nCaroline Wozniacki      3.2%     1.43  \nBarbora Strycova        3.5%     1.47  \nElina Svitolina         3.9%     1.48  \nSimona Halep            3.5%     1.53  \nQiang Wang              2.6%     1.54  \nAnastasija Sevastova    3.1%     1.57  \nCarla Suarez Navarro    2.1%     1.67  \nCaroline Garcia         3.6%     1.82<\/pre>\n<p>And the same for the men:<\/p>\n<pre>Player                  DF%  DF Cost  \nBenoit Paire           6.2%    -4.51  \nIvo Karlovic           5.8%    -3.63  \nFabio Fognini          5.0%    -2.38  \nDenis Shapovalov       6.3%    -2.26  \nGrigor Dimitrov        5.1%    -2.25  \nGael Monfils           5.0%    -2.22  \nDavid Ferrer           5.2%    -2.06  \nJeremy Chardy          5.3%    -2.00  \nFernando Verdasco      4.8%    -1.94  \nJack Sock              4.8%    -1.73  \n\u2026                                     \nRoger Federer          2.1%     0.88  \nTomas Berdych          2.9%     0.89  \nJuan Martin del Potro  2.8%     0.93  \nAlbert Ramos           3.1%     0.97  \nPablo Carreno Busta    2.2%     1.07  \nRichard Gasquet        2.6%     1.12  \nJohn Isner             2.6%     1.23  \nDusan Lajovic          1.9%     1.23  \nDenis Istomin          1.9%     1.23  \nPhilipp Kohlschreiber  2.5%     1.24<\/pre>\n<p><strong>Situational double faults<\/strong><\/p>\n<p>These aggregate numbers have the potential to hide a lot of information. They consider only two things about each match: how many double faults a player committed, and how close the match was. This statistic would treat Bencic the same whether she hit nine of her double faults at 40-love, or nine of her double faults in the third-set tiebreak. Yet the latter would have a colossally greater impact.<\/p>\n<p>While this is an important limitation to keep in mind, it appears that double faults are distributed relatively randomly. That is, most players do not hit a majority of their double faults in particularly high- or low-leverage situations. The player lists displayed above show both the most basic stat&#8211;double fault percentage&#8211;along with my more complex approach. For players with at least 20 matches since the beginning of last season, double fault rate is very highly correlated with the match-denominated cost of double faults. (For men, r^2 = 0.752, and for women, r^2 = 0.789.) In other words, most of the variance in double fault cost can be explained by the\u00a0<em>number\u00a0<\/em>of double faults, leaving little room for other factors, such as the importance of the situation when double faults are committed.<\/p>\n<p>That said, there&#8217;s plenty of room for additional analysis into those specific sitations. Instead of taking a match-level look at win probability, as I have here, one could identify the point score of every single one of a player&#8217;s double faults, and see how each event affected the win probability of that match. I suspect that, for most players, that would amount to a whole lot of extra complexity for not a lot of added insight, but perhaps there are some players who are uniquely able to land their second serve when it matters most, or particularly prone to double faults at key moments. This match-level look has made it clear how costly double faults can be, and it&#8217;s possible that for some players, missed serves are even more damaging than that.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>We all know that double faults aren&#8217;t good, but it&#8217;s less clear just how bad they are. Over the course of an entire match, a single point here or there doesn&#8217;t seem to matter too much, especially when a double fault creeps in at a harmless moment, like 40-love. Yet many missed second serves are &hellip; <a href=\"https:\/\/www.tennisabstract.com\/blog\/2018\/08\/09\/the-cost-of-a-double-fault\/\" class=\"more-link\">Continue reading <span class=\"screen-reader-text\">The Cost of a Double Fault<\/span><\/a><\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"jetpack_post_was_ever_published":false,"_jetpack_newsletter_access":"","_jetpack_dont_email_post_to_subs":false,"_jetpack_newsletter_tier_id":0,"_jetpack_memberships_contains_paywalled_content":false,"_jetpack_memberships_contains_paid_content":false,"footnotes":""},"categories":[30,105],"tags":[],"class_list":["post-2718","post","type-post","status-publish","format-standard","hentry","category-double-faults","category-serve-statistics"],"jetpack_featured_media_url":"","jetpack_sharing_enabled":true,"jetpack-related-posts":[],"_links":{"self":[{"href":"https:\/\/www.tennisabstract.com\/blog\/wp-json\/wp\/v2\/posts\/2718","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.tennisabstract.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.tennisabstract.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.tennisabstract.com\/blog\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.tennisabstract.com\/blog\/wp-json\/wp\/v2\/comments?post=2718"}],"version-history":[{"count":0,"href":"https:\/\/www.tennisabstract.com\/blog\/wp-json\/wp\/v2\/posts\/2718\/revisions"}],"wp:attachment":[{"href":"https:\/\/www.tennisabstract.com\/blog\/wp-json\/wp\/v2\/media?parent=2718"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.tennisabstract.com\/blog\/wp-json\/wp\/v2\/categories?post=2718"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.tennisabstract.com\/blog\/wp-json\/wp\/v2\/tags?post=2718"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}