{"id":6059,"date":"2023-03-12T18:48:48","date_gmt":"2023-03-12T18:48:48","guid":{"rendered":"https:\/\/www.tennisabstract.com\/blog\/?p=6059"},"modified":"2023-03-12T18:48:48","modified_gmt":"2023-03-12T18:48:48","slug":"aryna-sabalenka-at-one-hundred-percent","status":"publish","type":"post","link":"https:\/\/www.tennisabstract.com\/blog\/2023\/03\/12\/aryna-sabalenka-at-one-hundred-percent\/","title":{"rendered":"Aryna Sabalenka at One Hundred Percent"},"content":{"rendered":"\n<p class=\"wp-block-paragraph\">Aryna Sabalenka played her first match at Indian Wells on Friday, handily beating Evgeniya Rodina. Sabalenka won the first set 6-1, then took a 3-0 lead in the second. Commentator Mikey Perera noted that Sabalenka&#8217;s win probability had reached 100%, though he (correctly!) expressed skepticism with the number.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Win probability has steadily crept in to tennis broadcasts. Often we&#8217;re shown pre-match percentages along with the change up to the current moment in the match. The silliness of a 100% mid-match win probability has a pedestrian explanation: The numbers are usually given as integers. For most fans, there&#8217;s no important difference between 55.7% and 58%, but in extreme cases, another significant digit would come in handy.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">So, was the broadcast algorithm correct?<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">My Elo-based pre-match <a href=\"http:\/\/www.tennisabstract.com\/current\/2023WTAIndianWells.html\">forecast<\/a> set Sabalenka&#8217;s chances at 94.8%. To get mid-match predictions, we need more granular stats. Sabalenka has won 65.5% of serve points and 46.7% of return points <a href=\"http:\/\/www.tennisabstract.com\/cgi-bin\/wplayer.cgi?p=ArynaSabalenka#tour-years-h\">this year<\/a> (including the Rodina match), and if we nudge the RPW up to 47%, those components predict a 94.7% chance of a Sabalenka victory&#8211;virtually equivalent to the Elo forecast.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Plug those numbers into my win probability <a href=\"https:\/\/github.com\/JeffSackmann\/tennis_misc\/blob\/master\/tennisMatchProbability.py\">model<\/a> with Rodina serving at 1-6, 0-3, and Sabalenka&#8217;s chances of victory are 99.7%. Round to the nearest integer, and sure enough, you get a 100% chance of victory. It might have felt that way for Rodina.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">In fact, Sabalenka crossed the &#8220;100%&#8221; (99.5%) threshold in the previous game. She cleared 99.5% at 2-0, 15-0, slipped back under the line when she fell to deuce, then reclaimed it each of the two times she gained ad-in.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">So far, I&#8217;ve used a relatively simple model to forecast the remainder of the match. (And it&#8217;s certainly sufficient for these purposes.) But if we were putting money on the outcome&#8211;especially if the first ten games of the match had gone in a less predictable direction&#8211;we&#8217;d want to do something more sophisticated. I&#8217;ve assumed that from 6-1, 3-0, Sabalenka would play the way we could have predicted <em>before<\/em> the match. In this case, that&#8217;s a sound assumption. But a better method would take into account the results of the match itself up to that point.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Through ten games, Sabalenka was playing <em>better<\/em> than the initial forecast of 66.5% on serve and 47% on return. Her success rate on serve was a bit worse, at 64.4%, but she was destroying any service advantage of Rodina&#8217;s, winning nearly 55% of those points. Had we known before the match that she would play that way, our pre-match forecast would have given Sabalenka a whopping 99.4% chance of victory.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Using <em>that<\/em> pre-match forecast, our prediction at 6-1, 3-0 would have been an overwhelming 99.97% for the favorite.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">As the match progressed, then, we gained more and more information that the in-match performance&#8211;whether due to the conditions, the players&#8217; fitness or mood on the day, the matchup, or any number of other factors&#8211;would be even more lopsided. Had we taken everything into account at 6-1, 3-0, we would have calculated some mix of 99.7% (based on pre-match numbers) and 99.97% (based on in-match performance). The degree to which we should weight each of those numbers is the tough part. Determining the correct weights is a complicated questions; suffice it to say that the correct answer is somewhere in between the two.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">The broadcast algorithm jumped the gun with its 100% win probability, though only a bit. No matter how lopsided a match, anything <em><a href=\"https:\/\/www.youtube.com\/watch?v=Rx9zjUV9lVA\">can<\/a><\/em> happen&#8211;but it probably won&#8217;t.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Aryna Sabalenka played her first match at Indian Wells on Friday, handily beating Evgeniya Rodina. Sabalenka won the first set 6-1, then took a 3-0 lead in the second. Commentator Mikey Perera noted that Sabalenka&#8217;s win probability had reached 100%, though he (correctly!) expressed skepticism with the number. Win probability has steadily crept in to &hellip; <a href=\"https:\/\/www.tennisabstract.com\/blog\/2023\/03\/12\/aryna-sabalenka-at-one-hundred-percent\/\" class=\"more-link\">Continue reading <span class=\"screen-reader-text\">Aryna Sabalenka at One Hundred Percent<\/span><\/a><\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"jetpack_post_was_ever_published":false,"_jetpack_newsletter_access":"","_jetpack_dont_email_post_to_subs":false,"_jetpack_newsletter_tier_id":0,"_jetpack_memberships_contains_paywalled_content":false,"_jetpack_memberships_contains_paid_content":false,"footnotes":""},"categories":[123,127],"tags":[],"class_list":["post-6059","post","type-post","status-publish","format-standard","hentry","category-win-probability","category-wta"],"jetpack_featured_media_url":"","jetpack_sharing_enabled":true,"jetpack-related-posts":[],"_links":{"self":[{"href":"https:\/\/www.tennisabstract.com\/blog\/wp-json\/wp\/v2\/posts\/6059","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.tennisabstract.com\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.tennisabstract.com\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.tennisabstract.com\/blog\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.tennisabstract.com\/blog\/wp-json\/wp\/v2\/comments?post=6059"}],"version-history":[{"count":0,"href":"https:\/\/www.tennisabstract.com\/blog\/wp-json\/wp\/v2\/posts\/6059\/revisions"}],"wp:attachment":[{"href":"https:\/\/www.tennisabstract.com\/blog\/wp-json\/wp\/v2\/media?parent=6059"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.tennisabstract.com\/blog\/wp-json\/wp\/v2\/categories?post=6059"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.tennisabstract.com\/blog\/wp-json\/wp\/v2\/tags?post=6059"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}