Jelena Ostapenko In the Hands of Fate

Also today: Deciding tiebreaks, a MCP milestone, and assorted links.

Jelena Ostapenko in 2023. Credit: Hameltion

If you’ve ever spent five minutes watching Jelena Ostapenko play tennis, you know she’s as aggressive as it gets. She swings for the fences and sometimes knocks them over. Get her on a hot streak, and opponents can only hope its ends before the handshake. When she’s off her game, spectators in the first few rows duck for cover.

What you might not realize is just how aggressive she is. A few years ago I tuned Lowell West’s Aggression Score metric so that the numbers fell in a range between 0 and 100. In theory, 0 is maximally passive; 100 is go-for-broke, all the time. Ostapenko’s career Aggression Score in rallies is 175.

This sort of extreme style lends itself to all sorts of narratives. She can beat anybody, any time, as she showed when she won the 2017 French Open as an unseeded player, and again last year when she upset Iga Swiatek at the US Open–her fourth win in as many matches against the Pole. That makes her a perennial dark horse pick at majors. Even though she hasn’t reached a semi-final since 2018, neither Iga nor Coco Gauff–who exited the Australian Open after an Ostapenko barrage last year–would like her find her in their section.

(Sorry Iga: Guess who you might face in the quarters!)

Hyper-aggressive players also appear to be works in progress. Especially early in Ostapenko’s career, commentators would talk about her stratospheric potential if she could only improve her footwork, or play a bit more “within herself.” That is, not quite so many winners, not quite so soon, more point construction, fewer unforced errors. But players rarely change much, and as they age, they are more likely to become more aggressive, not less. The Latvian is now 26 years old, beginning her ninth year on tour. What you see is what you get.

What you get, it turns out, is a lot of close matches. Ostapenko played 30 three-setters last year, including four in a row to reach the Birmingham final and another four straight to start the US Open. Alona’s apotheosis came at Indian Wells, when she faced fellow super-aggressor Petra Kvitova in the third round. Both women tallied exactly 75 points; Kvitova won, 0-6, 6-0, 6-4. Tennis ball fuzz could be seen floating over the desert for days afterward.

That particular scoreline was an oddity, but the margin of victory was not. Ostapenko’s tight matches are not a result of streakiness, flightiness, or anything of the sort. They are an unavoidable function of her game style. It’s almost impossible to hit lots of winners without also committing piles of unforced errors. (We’ll come back to that.) When you do both in such numbers, you personally account for a substantial majority of point outcomes. The winners and errors (very approximately) balance each other out, and unless your opponent does something remarkable–or remarkably bad–with the limited influence you leave her, you end up winning about half the points played.

No one takes the racket out of an opponent’s hand like Ostapenko does. Once the return is in play, the Latvian ends nearly two-thirds of points herself, with a winner or unforced error, or by forcing an error. No one else comes close. Drawing on Match Charting Project data, I’ve listed the active players who end the most rallies:

Player                 RallyEnd%  
Jelena Ostapenko           65.9%  
Petra Kvitova              61.6%  
Madison Keys               60.8%  
Liudmila Samsonova         60.0%  
Camila Giorgi              59.7%  
Aryna Sabalenka            59.7%  
Veronika Kudermetova       57.5%  
Danielle Collins           57.5%  
Ekaterina Alexandrova      57.2%  
Ons Jabeur                 56.8%  
Peyton Stearns             56.5%  
Caroline Garcia            56.2%  
Naomi Osaka                56.2%  
Varvara Gracheva           55.0%  
Iga Swiatek                55.0%

Here’s another way to look at Alona’s extreme position on this list. The only other woman to grade out so far from 50% is Madison Brengle, who ends fewer than 34% of rallies. Ostapenko’s power turns the rest of the tour into Brengle.

Give and take

Ending even 57% of points on your own racket requires a lot of big swings. When you aim for a line, you might feel confidence about your chances, but you are taking a risk. A few players, like Swiatek, can generate winners without paying the unforced-error penalty, but that takes an unusual combination of patience and power that most players do not possess.

The 66% of points that Ostapenko ends on her own racket divides into roughly 37% winners (and forced errors) and 29% unforced errors. That’s worse than Aryna Sabalenka, who hits nearly as many winners with only a 23% error rate, but compared to the tour as a whole, the ratio is a solid one. For every unforced error she commits, she ends 1.25 points in her favor. Average among players represented in the Match Charting Project is 1.16, and the true mean is probably lower than that, since the MCP is more heavily weighted toward the best players.

The ratio varies among players, but there is a fairly strong relationship. Here are the winner/forced error and unforced error rates–each as a percentage of all points where the return came back in play–for 140 current and recent players:

The correlation between the two rates (r2 = 0.3) would be even stronger if it weren’t for net-rushers like Tatjana Maria–and to some extent Leylah Fernandez–who force their passive opponents into more aggression than they would otherwise produce.

As Sabalenka shows, it’s possible to seize as many points as Ostapenko does without giving quite so many away, but even that may be a mirage: Sabalenka racks up winners behind an overpowering serve that the Latvian can’t match. If the plot above is any indication, it would be difficult to bring her error rate down without also sacrificing some winners, not to mention the élan that she has ridden to seven tour-level titles.

So we’re left with something of a paradox. A hyper-aggressive player has more control over her fate than her peers do, but that control comes at a cost of a towering error rate, which keeps matches close. One result is a week like this one in Adelaide, where Ostapenko has reached the final by slipping through perilously tight battles with Sorana Cirstea (51.7% of points won) and Caroline Garcia (50.2%). Both matches could’ve gone the other way, something that is true so often when the Latvian steps on court. My tactical advice for Daria Kasatkina in tomorrow’s final: Cross your fingers.

* * *

Deciding-set tiebreak records

AbsurDB asks:

[A]m I right that Hurkacz’s 15 deciding sets going into tie-breaks in one calendar year is a historical record in ATP (10 such tie-breaks won is also probably a record?)?

Indeed, both are records. According to my data, the previous records came from Ivo Karlovic’s 2007 season, when he reached 11 deciding-set tiebreaks, winning eight of them. Here are all the player-seasons with nine or more.

Player              Season  Dec TB  Record  
Hubert Hurkacz        2023      15    10-5  
Ivo Karlovic          2007      11     8-3  
John Isner            2011      11     4-7  
John Isner            2018      11     6-5  
Ivo Karlovic          2014      10     7-3  
John Isner            2017      10     5-5  
Kevin Anderson        2018      10     6-4  
Mark Philippoussis    2000       9     5-4  
Marat Safin           2000       9     5-4  
Ivan Ljubicic         2002       9     2-7  
Ivan Ljubicic         2007       9     8-1  
Ivo Karlovic          2008       9     5-4  
Sam Querrey           2018       9     1-8  
Borna Coric           2019       9     6-3  
Hubert Hurkacz        2022       9     3-6

(Yes, I checked before 2000, as well, but no one reached nine until Philippousis did so that year. The first player-season with eight deciding-set tiebreaks was Tom Gullikson’s, in 1984.)

* * *

MCP Milestones

Earlier this week, the Match Charting Project recorded its two-millionth point:

The milestone match was the Auckland second-rounder between Ben Shelton and Fabian Marozsan, which I charted as a warm-up for my article on Wednesday. We’re not resting on our laurels, of course: We’ve added another five matches (and 800 or so points) in the 48 hours since.

Also worth mentioning is another round number we reached in the offseason: 1,000 different ATP players. Apart from the name syou’d expect, it’s a healthy mix of lower-ranked active players and former tour regulars. #1,000 was Martin Jaite, via his 1987 Rome final against Mats Wilander. We’ve also now charted 800 different WTAers.

We stand about 200 charts away from 13,000 matches overall: approximately 7,000 men’s and 6,000 women’s. 2023 was our most productive year yet, and 2024 would be a great time to start contributing.

* * *

Assorted links

  • Earlier this week I appeared on Alex Gruskin’s Mini-Break Podcast, in which he got overexcited about a number of week one trends, and I tried to talk him down from all the ledges.
  • I wrote about how GPT4 helped me make Tennis Abstract’s new navbar, because you had to know I didn’t do it myself.
  • The tours have introduced a new policy on late matches. I’m underwhelmed: There are an awful lot of exceptions, and there’s no acknowledgement of the underlying problem of longer and longer matches.
  • Two student projects worth a look: Pramukh’s Evaluating Tennis Player Styles in Relation to Tour Averages, based on MCP data, and Amrit’s Aces over Expected model.
  • If you can’t wait until Sunday for grand slam tennis, here’s the Clijsters-Henin 2003 US Open final.

* * *

Subscribe to the blog to receive each new post by email:

 

How Grigor Dimitrov Unbalanced Holger Rune in Brisbane

Grigor Dimitrov. Credit: Bradley Kanaris / Getty

Grigor Dimitrov was long known as “Baby Fed,” but yesterday, Holger Rune was the one trying to do a Roger Federer impression. Facing break point at 3-all in the second set, Rune kicked a second serve wide, got a cross-court slice reply, then ran around his backhand to smack an inside-in forehand: a high-risk, high-reward shot, especially if you aim for the line. Rune went big and he pulled it wide. That was the only break of the match.

The 20-year-old had already missed one of those in the same game: The first error dug him a 15-40 hole. Over the course of the match, he attempted seven inside-in forehands, a shot that usually wins him two out of three points. Against Dimitrov, he blew four of them.

The errors are a symptom of one of something separating Rune from the top of the game. In his eagerness to maintain an aggressive position at the baseline–a willingness that defines his style and, in fairness, often pays off–he tries a bit too hard. He swings to end points in three shots that probably need to go five. He keeps a toe on the baseline when he ought to be one step further back.

This isn’t a secret, and Dimitrov exploited it. The Bulgarian landed 82% of his returns behind the service line, compared to a tour average of 70%. 39% of Dimitrov’s returns fell in the back quarter of the court, beating the 28% that players typically face. In rallies, the veteran kept pummeling Rune’s feet, prioritizing depth over direction.

The strategy worked. Take the other pivotal juncture of the match, early in the first-set tiebreak. Serving at 0-1, Rune pushed Dimitrov off the court with an inside-out forehand, which came back as a deep slice. Nothing special, but as Rune stepped back to accommodate it, he hit an equally indifferent reply. Dimitrov came back with another middle-deep backhand and Rune hit the tape with as pedestrian an error as you’ll ever see. At 0-2, Rune’s plus-one forehand forced Dimitrov deep and set up the point for an easy finish–or so he thought. Dimitrov managed to get his defensive forehand deep enough that Rune stepped in–his back foot on the baseline–and the result was another miss that would leave a club player berating himself.

On both points, a slightly more conservative court position, or a better last-minute adjustment step, would have let Rune continue the rally with his opponent on the run. Most players tread more carefully in tiebreaks. Instead, he missed twice and fell to 0-3. He got one point back but couldn’t close the entire gap and lost the first set, 7-6(5).

Middle-deep mediocrity

Yesterday wasn’t the first time that Rune misreads a neutral opportunity as a chance to go big. His own-the-baseline strategy is a mixed bag, the best example of which is how he responds to service returns that land at his feet. The Match Charting Project codes every return by direction (cross-court, middle, or down-the-line) and by depth (shallow–in front of the service line, deep–behind it, or very deep–in the back quarter of the court). Dimitrov placed 13 of his returns in the middle-deep region, and Rune saved just 5 of those points.

When a return lands middle-deep, the point is fully up for grabs. Counting both first- and second-serve points, the server wins roughly 49% of the time from that position. (Once a deep return is in play, any lingering effect of a big serve is mostly erased.) A top player should do better, but Rune does not. Here are the career outcomes of those points for the current ATP top four, plus the two Brisbane finalists:

Player             W/FE%   UFE%  PtsWon%  
Novak Djokovic      6.8%   7.1%    53.8%  
Jannik Sinner       5.7%   6.0%    51.6%  
Daniil Medvedev     5.3%   5.9%    50.6%  
Carlos Alcaraz      8.0%   6.2%    50.1%  
Grigor Dimitrov     9.6%   7.9%    49.6%  
--Average--         7.4%   8.7%    48.9%  
Holger Rune        11.5%  10.9%    48.0% 

Rune is much more aggressive than his peers in these situations. It may feel like it pays off, since he ends more points with winners (or forced errors) than unforced errors. But the bottom line tells another story: He wins fewer points than average, and trails the best players in the game by a sizeable margin. As Djokovic, Sinner, and Medvedev can tell you, from a neutral position, immediate outcomes don’t matter as much as point construction.

It’s the same story later in the rally. Dimitrov won those two crucial tiebreak points by putting his second shot near the baseline. The serve return isn’t unique: Any stroke that lands in the middle-deep region turns the point into a 50-50 proposition. The above table showed how players fare from that position on the plus-one shot. Here are the numbers for everything after that:

Player           Winner%   UFE%  PtsWon%  
Carlos Alcaraz      8.2%  12.8%    55.3%  
Grigor Dimitrov     6.6%   6.3%    54.7%  
Novak Djokovic      6.2%   8.0%    54.6%  
Jannik Sinner       7.2%  10.5%    52.3%  
Daniil Medvedev     4.7%   6.8%    52.0%  
--Average--         7.1%  10.2%    49.3%  
Holger Rune         9.4%   9.7%    49.0%

The order changes, and Rune’s aggression doesn’t stand out like it does earlier in the rally. But the message is the same, only with a wider margin. Given the mix of players represented in the Match Charting Project, “average” is better than tour average, but it’s still a number Rune needs to surpass.

The second table, finally, brings us back to Dimitrov. If he hadn’t played yesterday, I wouldn’t have thought to include him on the list with the top four, but in this type of situation–one that demands both patience and tactical soundness–he rates with the best in the game.

Faced with an over-aggressive, slightly erratic opponent, the 32-year-old took advantage and turned in a workmanlike performance. That isn’t a dig: Dimitrov didn’t need fireworks, just steadiness. By my count, he racked up just 10 unforced errors to Rune’s 29, and just one of them–serving for 4-0 in the tiebreak–came a critical moment. It’s nothing so flashy as the “Baby Fed” moniker once promised, but Dimitrov’s mature game has gotten him up to 7th place on the Elo list, and a return to the official top ten is not far away.

* * *

Subscribe to the blog to receive each new post by email:

 

Angelique Kerber in the New World

Angelique Kerber in 2020. Credit: Rob Keating

Angelique Kerber’s return to the tour has, so far, been a rocky one. She began Germany’s United Cup campaign with a narrow defeat to Jasmine Paolini, in which the Italian earned 21 break points against the German’s serve. Kerber took a set from the free-swinging Caroline Garcia but lost in three. Today, Maria Sakkari blew her off the court, winning nine games in a row before Kerber got on the board and split the remaining six.

The United Cup, in its new design, is not an easy place to make a comeback: The German faced top-30 players all three rounds. (Compare that to the tour event in Brisbane, where fellow returnee Naomi Osaka scored an opening-round victory against a player ranked 83rd.) Kerber surely didn’t expect to dominate immediately. It’s hard to get rolling again after an 18-month layoff, and she hasn’t been a truly elite player since early 2019. She turns 36 years old this month, a tough age even for players with three majors to their credit.

The Garcia match, in particular, highlighted another dimension of the challenge. The tour that Kerber rejoins is different from the one where she collected so many laurels. Angie is the very definition of a counterpuncher, a clever defender who uses anticipation and racket control to convert her opponent’s pace into winners of her own. It’s tough to counterpunch against someone like Garcia, who aims to end the point with nearly every shot.

The reckless Frenchwoman is hardly alone. Based on data from the Match Charting Project, here is the average rally length on the WTA tour since 2013:

It looks a bit fluky, but it’s noteworthy to find a peak in 2016, Kerber’s best year. Rally length has been essentially flat since 2021, perhaps since 2019 if we set aside the Covid-affected 2020 season. The German is plenty familiar with the landscape, having competed on tour until Wimbledon in 2022, but she developed her game back when the power of Serena Williams was an outlier. Now, Serena’s late-career bashing is the model for a new generation.

There are a number of ways to illustrate the trend. While the year-to-year differences are minor, the arrows all point in the same direction. In 2016, 49.6% of points were decided in three shots or less. Last year, it was 53.0%. (In 2021 and 2022, it was a bit higher still.) At Kerber’s peak, nearly 24% of points lasted at least seven strokes. Last year that figure had declined to 20.8%.

This is probably worse news for someone like Caroline Wozniacki than it is for Kerber. Woz keeps points alive and waits for errors, skills that Garcia (or Aryna Sabalenka, or Elena Rybakina, or dozens more players she might draw in the first round of the Australian Open) render meaningless. While Angie isn’t going to pile up aces–she’s hit a grand total of two in three United Cup matches–she is fully capable of redirecting a serve for a return winner, as she did a couple of times against Sakkari. Still, the shorter the point, the less likely that Kerber finds an opportunity to work her magic.

Throughout her career, the German lefty has rarely had a problem picking spots to end points with winners or forced errors. Match Charting data shows that 6% of her groundstrokes go for winners, right in line with tour average.

The catch, though, is when she hits them. Kerber is one of 58 players for whom the Match Charting Project has recorded at least 2,000 winners and forced errors since 2013. Only four of those players unleash their winners later in the rally. The average shot number of Kerber’s point-enders is 4.9–bad news in an era when nearly two-thirds of points are finished in four shots or less.

Here are the twelve players in the dataset whose winners occur latest in the rally:

Player                Avg Winner Shot#  
Daria Kasatkina                    5.1  
Viktorija Golubic                  5.0  
Yulia Putintseva                   5.0  
Carla Suarez Navarro               4.9  
Angelique Kerber                   4.9  
Sloane Stephens                    4.9  
Agnieszka Radwanska                4.9  
Simona Halep                       4.8  
Svetlana Kuznetsova                4.7  
Anastasija Sevastova               4.6  
Caroline Wozniacki                 4.6  
Su Wei Hsieh                       4.6

This isn’t a table where you want to find your name north of Wozniacki’s. It’s possible to survive on today’s tour playing this way, as Daria Kasatkina has proven, but it is much less likely to translate into a major title. Wimbledon champ Marketa Vondrousova didn’t miss the list by much, coming in at 4.4, but her aggression varies wildly from one match to another. Iga Swiatek and Coco Gauff appear closer to the middle of the pack, at 4.2, and Aryna Sabalenka ranks as the fourth most aggressive of the 58, at 3.4.

At the risk of belaboring the point, here’s another way of seeing the difference between Angie’s style and the brands of tennis that currently top the rankings. The following chart shows what percent of Kerber’s winners (and forced errors) happen at each point in the rally, compared to the same figures for Swiatek and Sabalenka:

The “1st shot” and “6th+” columns are virtual mirror images of each other. Even that understates the difference between the veteran and the two youngsters, because a point-ending serve from Kerber is more likely to be at least partially the fault of the returner–those errors are conventionally scored as forced regardless of the strength of the serve.

I don’t want to say that Kerber can’t succeed on her return to the circuit, but it’s clear that she faces a challenge. The tennis world of the mid-2010s is long gone, and even if she regains the form that took her to number one in 2016, it may not give her the same results in 2024. A new era requires a new Angie; we’ll see if she can produce one.

* * *

Subscribe to the blog to receive each new post by email:

 

How Coco Gauff’s Defense Won the US Open Final

Defense, as they say, wins championships. Coco Gauff has a big serve, a strong backhand, and a high tennis IQ, not to mention a new guru in Brad Gilbert. All of that got her to the US Open final and gave her a shot against new No. 1 Aryna Sablenka. But defense was what won her the match.

If you watched the final, you already know this. Over and over again, Gauff rescued a sure winner, hanging in the point long enough for Sabalenka to miss. In a close contest, as this one was, a handful of points can determine the result.

It’s tough to say exactly how many points Gauff saved with her exemplary defense. Sometimes she made multiple digs in the same point; other times she averted disaster just to lose the point a couple of strokes later. Still, we should try to quantify the effect she had on the normally imperious Sabalenka game.

My stat of choice is something I’m going to call, simply, Defense. For any match with charting-based stats, it’s a simple calculation: The percentage of the opponent’s groundstrokes that resulted in winners or forced errors. (I introduced it in my Andy Murray essay as part of the Tennis 128 project last year.) In other words: How often does the player get herself in a position to put a groundstroke back in play?

Among tour regulars on hard and grass courts, the range of the Defense stat runs from about 7%–the backboards that are Lesia Tsurenko and Sloane Stephens–to 15%, where you’ll find the less nimble Evgeniya Rodina and Linda Noskova. Lower is better! Tour average is around 11%. Gauff, over the course of her young career, has averaged 10.8%.

Average doesn’t carry much weight, though, when it comes to Sabalenka. Aryna’s groundstrokes end the point in her favor–with a winner or forced error–17.3% of the time. Only Jelena Ostapenko, at 18.0%, scores higher, and just a few other women are as high as 15%. Turning in an “average” performance against Sabalenka–that is, keeping her to 11%–is a massive step toward victory.

On Saturday, Gauff held her to 9.8%.

Sabalenka hit 285 groundstrokes in the final. 15 went for winners; another 13 turned into forced errors. Had she converted at her usual rate, those numbers would’ve been nearly twice as high: 49 points won off the ground instead of 28.

Gauff’s actual margin of victory was a mere seven points. By the Defense measure, she saved 21 solely with her superlative handling of Aryna’s groundstrokes. Again, it doesn’t quite work that way; she dug out multiple would-be winners on some points, for instance. On the other hand, it isn’t the only way Coco salvaged desperate situations. This measure doesn’t take into account quick-footed service returns or defense against the smash.

It’s almost impossible to overemphasize the magnitude of Gauff’s achievement. In 48 hard and grass court matches since last year’s US Open, just two of Sabalenka’s opponents managed a Defense stat better than 11.6%. The only other exception was Veronika Kudermetova, against Aryna’s limp performance in Berlin. Sabalenka’s average over the last 52 weeks is 19.7%, probably one of the highest marks posted by any baseliner, ever.

Gauff simply cut it in half. She effectively turned one of the most imposing players in women’s tennis history into a frustrated journeywoman–or at least the statistical equivalent of one. Gilbert might call it Winning Ugly, but it looked awfully good to me.

* * *

Subscribe to the blog to receive each new post by email:

 

Surface Speed Convergence Revisited

Grass courts before the convergence

For more than a decade, players and pundits have complained that surface speeds are converging. To oversimplify their gripes: Everything is turning into clay. Hard courts have gotten slower, even many of the indoor ones. Grass courts, once a bastion of quick-fire attacking tennis, have slowed down as well.

I’ve attempted to confirm or refute the notion a couple of times. In 2013, I used break rate and ace rate to see whether hard and clay courts were getting closer to each other. The results said no. Many readers complained that I was using the wrong metrics: rally length is a better indicator. I agree, but rally length wasn’t widely available at the time.

In 2016, I looked at rally length for grand slam finals and found some evidence of surface speed convergence. The phenomenon was much clearer in men’s tennis than women’s, a hint that it wasn’t all about the surface, but that tactics had changed and that the mix of players in slam finals skewed the data.

Now, the Match Charting Project contains shot-by-shot logs of more than 12,000 matches. We can always dream of more and better data, but we’re well past the point where we can take a more detailed look at how rally length has changed over the years on different surfaces.

Forecasting rally length

Start with a simple model to forecast rally length for a single match. You don’t need much, just the average rally length for each player, plus the surface. Men who typically play short points have more influence on rally length than those who play long ones. (This is worthy of a blog post of its own–maybe another day.) Call the average rally length of the shorter-point guy X and the average rally length of the longer-point guy Y.

Using data from the last seven-plus seasons, you can predict the rally length of a hard court match as follows:

  • X + (0.7 * Y) – 2.6

The numbers change a bit depending on gender and time span, but the general idea is always the same. The short-point player usually has about half-again as much influence on rally length than his or her opponent.

For men since 2016, we can get the clay court rally length by adding 0.16 to the result above. For grass courts, subtract 0.45 instead.

For example, take a hypothetical matchup between Carlos Alcaraz and Alexander Bublik. In charted matches, Alcaraz’s average rally length is 4.0 and Bublik’s is 3.2. The formula above predicts the following number of shots per point:

  • Hard: 3.39
  • Clay: 3.55
  • Grass: 2.94

The error bars on the surface adjustments are fairly wide, for all sorts of reasons. Courts are not identical just because their surfaces are given the same names. Other factors, like balls, influence how a match goes on a given day. Players adapt differently to changing surfaces. The usual dose of randomness adds even more variance to rally-length numbers.

Changing coefficients

These surface adjustments aren’t very big. A difference of 0.16 shots per point is barely noticeable, unless you’re keeping score. Given the variation within each surface, it means that rallies would be longer on some hard courts than some clay courts, even for the same pair of players.

That brings us back to the issue of surface speed convergence. 0.16 shots per point is my best attempt at quantifying the difference between hard courts and clay courts now–or, more precisely, for men between 2016 and the present. If surfaces have indeed converged, we would find a more substantial gap in older data.

That’s exactly what we see. I ran the same analysis for three other time periods: 1959-95, 1996-2005, and 2006-2015. The following graph shows the rally-length gap between surfaces for each of the four spans:

For example, in the years up to 1995, a pair of players who averaged 4 shots per point on a hard court would be expected to last 5 shots per point (4 + 1) on clay. They’d tally just 3.25 shots per point (4 – 0.75) on grass.

By the years around the turn of the century, the gap between hard courts and grass courts had narrowed to its present level. But the difference between hard and clay continued to shrink. The current level of 0.16 additional shots per point is only about one-sixth as much as the equivalent in the 1980s and early 1990s.

The graph implies that hard courts are constant over time. That’s just an artifact of how I set up this analysis, and it may not be true. It could be that clay courts have been more consistent, something that my earlier analysis suggested and that many insiders seem to believe. In that case, rather than a downward-sloping clay line and an upward-sloping grass line, the graph would show two upward-sloping lines reflecting longer rallies on non-clay surfaces.

Women, too

The women’s game has evolved somewhat differently than the men’s has, but the trends are broadly similar. Here is the same graph for women’s rally lengths across surfaces:

For the last two decades, there has been essentially no difference in point length between hard courts and clay courts. A gap remains between hard and grass, though like in the men’s game, it is trending slightly downwards.

Why the convergence?

The obvious culprit here is the literal one: the surface. Depending on who you ask, tournament directors have chosen to slow down hard and grass surfaces because fans prefer longer rallies, because the monster servers of the turn of the century were boring, because slow surfaces favored the Big Four, or because they like seeing players puke on court after five hours of grueling tennis.

That’s probably part of it.

I would offer a complementary story. Racket technology and the related development of return skill essentially killed serve-and-volley tennis. Slower surfaces would have aided that process, but they weren’t necessary. In the 1980s, a top player like Ivan Lendl or Mats Wilander would use entirely different tactics depending on the surface, grinding on clay while serve-and-volleying indoors and on grass. Now, a Djokovic-Alcaraz match is roughly the same beast no matter the venue. If Alcaraz serve-and-volleyed on every point, Novak would have a far easier time competing on return points than the opponents of Lendl and Wilander ever did.

My best guess is that rally lengths have converged because of some combination of the two. I believe that conditions (surfaces, balls, etc) are the lesser of the two factors. But I don’t know how we could use the data we have to prove it either way.

In the end, it doesn’t particularly matter why. Much more than in my previous studies, we have enough rally-length data to see how players cope with different surfaces. The evidence is strong that, for whatever reason, hard-court tennis, clay-court tennis, and grass-court tennis are increasingly similar, a trend that began at least 25 to 30 years ago and shows no sign of reversing. Whether or not surfaces have converged, tactics have definitely done so.

* * *

Subscribe to the blog to receive each new post by email:

 

Are American Players Screwed Once You Drag Them Into a Rally?

Long after retiring from tennis, Marat Safin remains quotable. The Russian captain at the ATP Cup had this to say to his charge, Karen Khachanov, during a match against Taylor Fritz:

This isn’t exactly testable. I don’t know you’d quantify “shock-and-awe,” or how to identify–let alone measure–attempts to scare one’s opponent. Or screwed-ness, for that matter. But if we take “screwed” to mean the same as “not very likely to win,” we’ve got something we can check.

Many fans would agree with the general claim that American men tend to have big serves, aggressive game styles, and not a whole lot of subtlety. Certainly John Isner fits that mold, and Sam Querrey doesn’t deviate much from it. While Fritz is a big hitter who racks up his share of aces and second-shot putaways, his style isn’t so one-dimensional.

Taylor Fritz: not screwed

Using data from the Match Charting Project, I calculated some rally-length stats for the 70 men with at least 20 charted matches in the last decade. That includes five Americans (Fritz, Isner, Querrey, Steve Johnson, and Jack Sock) and most of the other guys we think of as ATP tour regulars.

Safin’s implied definition is that rallies of four shots or fewer are “shock-and-awe” territory, points that are won or lost within either player’s first two shots. Longer rallies are, supposedly, the points where the Americans lose the edge.

That is certainly the case for Isner. He wins only 40% of points when the rally reaches a fifth shot, by far the worst of these tour regulars. Compared to Isner, even Nick Kyrgios (44%) and Ivo Karlovic (45%) look respectable. The range of winning percentages extends as high as 56%, the mark held by Nikoloz Basilashvili. Rafael Nadal is, unsurprisingly, right behind him in second place at 54%, a whisker ahead of Novak Djokovic.

Fritz, at 50.2%, ranks 28th out of 70, roughly equal to the likes of Gael Monfils, Roberto Bautista Agut, and Dominic Thiem. Best of all–if you’re a contrarian like me, anyway–is that Fritz is almost 20 places higher on the list than Khachanov, who wins 48.5% of points that last five shots or more.

More data

Here are 20 of the 70 players, including some from the top and bottom of the list, along with all the Americans and some other characters of interest. I’ve calculated each player’s percentage of points won for 1- or 2-shot rallies (serve and return winners), 3- or 4-shot rallies (serve- and return-plus-one points), and 5- or more-shot rallies. They are ranked by the 5- or more-shot column:

Rank  Player                 1-2 W%  3-4 W%  5+ W%  
1     Nikoloz Basilashvili    43.7%   54.1%  55.8%  
2     Rafael Nadal            52.7%   51.6%  54.3%  
3     Novak Djokovic          51.8%   54.6%  54.0%  
4     Kei Nishikori           45.5%   51.2%  53.9%  
11    Roger Federer           52.9%   54.9%  52.1%  
22    Philipp Kohlschreiber   50.1%   50.1%  50.7%  
28    Taylor Fritz            51.1%   47.2%  50.2%  
30    Jack Sock               49.0%   46.5%  50.2%  
31    Alexander Zverev        52.8%   50.3%  50.0%  
32    Juan Martin del Potro   53.8%   49.1%  50.0%  
34    Andy Murray             54.3%   49.5%  49.4%  
39    Daniil Medvedev         53.9%   50.4%  49.0%  
43    Stefanos Tsitsipas      51.4%   50.5%  48.6%  
44    Karen Khachanov         53.7%   48.1%  48.5%  
48    Steve Johnson           49.2%   48.8%  48.3%  
61    Sam Querrey             53.5%   48.0%  46.2%  
62    Matteo Berrettini       53.6%   49.3%  46.1%  
66    Ivo Karlovic            51.8%   43.9%  44.9%  
68    Nick Kyrgios            54.6%   47.4%  44.2%  
70    John Isner              52.3%   48.3%  40.2%

Fritz is one of the few players who win more than half of the shortest rallies and more than half of the longest ones. The first category can be the result of a strong serve, as is probably the case with Fritz, and is definitely the case with Isner. But you don’t have to have a big serve to win more than half of the 1- or 2-shot points. Nadal and Djokovic do well in that category (like they do in virtually all categories) in large part because they negate the advantage of their opponents’ serves.

Shifting focus from the Americans for a moment, you might be surprised by the players with positive winning percentages in all three categories. Nadal, Djokovic, and Roger Federer all make the cut, each with plenty of room to spare. The remaining two are the unexpected ones. Philipp Kohlschreiber is just barely better than neutral in both classes of short points, and a bit better than that (50.7%) on long ones. And Alexander Zverev qualifies by the skin of his teeth, winning very slightly more than half of his long rallies. (Yes, that 50.0% is rounded down, not up.) Match Charting Project data is far from complete, so it’s possible that with a different sample, one or both of the Germans would fall below the 50% mark, but the numbers for both are based on sizable datasets.

Back to Fritz, Isner, and company. Safin may be right that the Americans want to scare you with a couple of big shots. Isner has certainly intimidated his share of opponents with the serve alone. Yet Fritz, the player who prompted the comment, is more well-rounded than the Russian captain gave him credit for. Khachanov won the match on Sunday, and at least at this stage in their careers, the Russian is the better player. But not on longer rallies. Based on our broader look at the data, it’s Khachanov who should try to avoid getting dragged into long exchanges, not Fritz.

Match Charting Project Rally Stats: Glossary

I’m in the process of rolling out more stats based on Match Charting Project data across Tennis Abstract. This is one of several glossaries intended to explain those stats and point interested visitors to further reading.

At the moment, the following rally stats can be seen at a variety of leaderboards.

  • RallyLen – Average rally length. Not everyone counts shots exactly the same way, so I try to follow the closest thing there is to a consensus. The serve counts as a shot, but errors do not. Thus, a double fault is 0 shots, and an ace or unreturned serve is 1. A rally with a serve, four additional shots, and an error on an attempted sixth shot counts as 5.
  • RLen-Serve – Average rally length on service points.
  • RLen-Return – Average rally length on return points.
  • 1-3 W% – Winning percentage on points between one and three shots, inclusive. On the match-specific pages for each charted match, you can see winning percentages broken down by server. Click on “Point outcomes by rally length.”
  • 4-6 W% – Winning percentage on points between four and six shots, inclusive.
  • 7-9 W% – Winning percentage on points between seven and nine shots, inclusive.
  • 10+ W% – Winning percentage on points of ten shots or more.
  • FH/GS – Forehands per groundstroke. This stat counts all baseline shots from the forehand side (including slices, lobs, and dropshots), and divides by all baseline shots, to give an idea of how much each player is favoring the forehand side (or, perhaps, is pushed to one side by his or her opponent’s tactics).
  • BH Slice% – Backhand slice percentage. Of backhand-side groundstrokes (topspin, slices, dropshots, lobs), the percentage that are slices, including dropshots.
  • FHP/Match – Forehand Potency per match. FHP and BHP (Backhand Potency) are stats I invented to measure the effectiveness of particular groundstrokes. It adds, roughly, one point for a winner and one half point for the shot before a winner, and subtracts one point for an unforced error. On a per-match basis, the stat is influenced by the length of the match and the number of shots hit. Because each point can be counted 1.5 times in FHP (one for a forehand winner, one-half for a forehand that set it up), divide by 1.5 for a number of points that the forehand contributed to the match, above or below average. For instance, a FHP of +6 suggests that the player won 4 more points than he or she would have with a neutral forehand.
  • FHP/100 – Forehand potency per 100 forehands. The rate-stat version of FHP allows us to compare stats from different match lengths.
  • BHP/Match – Backhand Potency per match. Same as FHP, but for topspin backhands. I’ve occasionally calculated backhand-slice potency as well, but slices are not included in BHP itself.
  • BHP/100 – Backhand potency per 100 backhands. The rate-stat version of BHP.

Do Rallies Get Longer as Matches Progress?

Italian translation at settesei.it

Yesterday at the New York Open, Paolo Lorenzi battled through three sets to defeat Ryan Harrison. It was a notable result for a number of reasons, starting with the fact that Lorenzi is rarely seen on a hard court when there’s any other option. The 37-year-old Italian is one of the many men defying the aging curve these days, and with the victory, he’ll play at least one tour-level quarter-final for the eighth year in a row, despite not reaching his first until he was 30.

The way in which Lorenzi won the match was almost as unique as his career trajectory. Take a look at the average rally length per set:

Set  Avg Rally  
1          3.2  
2          4.0  
3          4.9

You probably don’t need me to tell you which set Harrison won. The opening frame was serve-dominated, typical of American indoor hard court events. As the match progressed, the points increasingly resembled the clay-court sparring that Lorenzi surely would have preferred.

Theorizing

The Lorenzi-Harrison match was extreme, but it tracks with what I believe to be the conventional wisdom. Throughout a match, players get better at reading their opponents’ games, cutting down on unreturned serves and making it more likely that each point will turn into a more protracted exchange. That’s the theory, anyway. There are some countervailing forces, such as fatigue, which work in the other direction, but in general we expect points to get longer.

Yesterday’s contest didn’t exactly follow that script, though. The rallies might have gotten longer because the two men better predicted each other’s shots, but it doesn’t show up so neatly in aces–Harrison hit aces on between 18% of 21% of his points in each set–or the more inclusive category of unreturned serves:

Set  Points  Unret%  
1        47   42.6%  
2        65   32.3%  
3        73   37.0%

While serve recognition may explain the rally length jump from set 1 to set 2, it goes in the opposite direction from set 2 to set 3. Yes, these are small samples, and yes, unreturned serves don’t tell the whole story. But there are signs that our initial theory is missing something.

More matches

As interesting as Lorenzi is, we’re going to need more players, and more data, to better understand what happens to serve returns and rally length over the course of a match. Let’s start with the main draw singles matches from the 2019 Australian Open. Not only are there are a lot of them, but since they are best of five, we have an opportunity to see how these trends unfold over several sets per match.

For each match, I measured the average rally length and rate of unreturned serves for each set, and then made set-by-set comparisons for the length of the match. For instance, in Lorenzi-Harrison, rally length increased by 25% from set 1 to set 2. Then, for each set, I aggregated all the matches of sufficient length to figure out how much the tour as a whole was changing from one set to the next.

The results are considerably less eye-catching than those of the Lorenzi match. In the following table, the “Avg Rally” and “Unret%” columns show the change in ratio form: If the baseline rate in the first set is 1.0, the rally length in set 2 increases by 0.8% and the number of unreturned serves goes up by 2.4%. I’ve also included example columns, showing realistic rally lengths and unreturned-serve rates for each set based on tournament averages of 3.2 shots by point and 34% of serves unreturned:

Set  Avg Rally  Ex Rally  Unret%  Ex Unret  
1            1      3.20       1     34.0%  
2        1.008      3.23   1.024     34.8%  
3        1.019      3.26   1.033     35.1%  
4        0.987      3.16   1.155     39.3%  
5        1.021      3.27   1.144     38.9% 

The set-to-set differences in rally length are barely enough to qualify for the name. The shift in the rate of unreturned serves, however, is much more striking, all the more so because it moves in the opposite direction that we expected.* Perhaps fatigue–or strategic energy conservation–plays a bigger role than I thought, or servers gain more from familiarity with their opponent than returners do.

* You might wonder if the effect is an artifact of the data, that players who reach 4th and 5th sets are bigger servers. That may be true, but it’s not what we’re seeing here. I’m comparing the stats in each set to the previous set in the match itself, and then averaging the set-to-set changes, weighted by the number of points in the sets. A John Isner 5th set, then, is compared only to an Isner 4th set.

WTA to the rescue

The results are completely different for women. Here is the same data for the 127 main draw women’s singles matches at the Australian Open:

Set  Avg Rally  Ex Rally  Unret%  Ex Unret  
1            1      3.40       1     27.0%  
2        1.035      3.52   0.974     26.3%  
3        1.103      3.75   0.915     24.7%

Still not as dramatic as Harrison-Lorenzi, but the trends are more marked than for the men. The number of unreturned serves drops quite a bit, and rally length increases by an amoun that an attentive spectator might notice. Those two are related–if there are fewer unreturned serves, there are more shots per point, even if we only consider the second shot. Beyond that, there are more opportunities for longer exchanges. In any case, the set-by-set trends for women fit closer to the intial theory than the men’s results did.

As with every aggregate stat, I’m guessing that there is a huge amount of variation among players. Perhaps players who are particularly good in third sets really do return more serves or, as Lorenzi did, shift their tactics in the direction of a more favorable style of play. Looking at these types of numbers for individual competitors is a reasonable next step, but it’s one that will need to wait for another day.

Is Doubles As Entertaining As We Think?

For as long as I’ve been following tennis, there’s been a tension between the amount of doubles available to watch and the amount of doubles that fans say they want to watch. In-person spectators flock to doubles matches at grand slams and aficionados pass around GIFs of the most outrageous, acrobatic doubles points. Yet broadcasters almost always stick with singles, leaving would-be viewers chasing down online streams, often illegal ones.

There are some good reasons for that, foremost among them the marquee drawing power of the best singles players. Broadcasters are convinced that their audiences would rather watch a Fed/Rafa/Serena/Pova blowout than a potentially more entertaining one-on-one contest between unknowns, let alone a doubles match. And they’re probably right–at least, they’ve got ratings numbers to back them up. So we’re left with a small population of hipster doubles fans, confident that two-on-two is the good stuff, even if most of us rarely watch it.

It’s probably impossible to quantify entertainment value, but that doesn’t mean we shouldn’t try. What can the numbers tell us about the watchability of doubles?

Hip to be rectangular

There’s plenty of room for a diversity of preferences–one fan’s Monfils may be another fan’s Isner. But there are some general principles that seem to define entertaining tennis for most spectators. Winners are better than errors, for one. Long rallies are better than short ones, at least within reason. And you can never go wrong with more net play.

If net play were the only criterion, doubles would beat singles easily. But what about other factors? I started wondering about this while researching a recent post on gender differences in mixed doubles, when I came across a match in which every rally was four shots or fewer. For every brilliant reflex half-volley, doubles features a hefty dose of big serving and tactically high-risk returning. Especially in men’s doubles, that translates into a lot of team conferences and not very much shotmaking.

Let’s see some numbers. For each of the five main events at the 2019 Australian Open–men’s and women’s singles, men’s and women’s doubles, and mixed doubles–here is the average rally length, the percentage of points ended in three shots or less, and the percentage of points that required at least ten shots:

Event            Avg Rally  <3 Shots  10+ Shots  
Men's Singles          3.2     72.6%       5.1%  
Women's Singles        3.4     67.9%       5.4%  
Men's Doubles          2.5     81.6%       1.1%  
Women's Doubles        2.9     76.7%       2.4%  
Mixed Doubles          2.8     74.0%       1.8%

There's a family resemblance in these numbers, but it's clear that doubles points are shorter. Men's doubles is the most extreme, at 2.5 shots per point. By comparison, only 8% of the men's singles matches in the Match Charting Project database have an average rally length lower than that. More than four out of every five men's doubles points ends by the third shot, and with barely one in one hundred points lasting to ten shots, you'd be lucky to sit through an entire match and see more than one such exchange.

Quantity and quality

Shorter points are the nature of the format. Even recreational players can find it hard to keep the ball in play when half of each team is patrolling the net, looking for an easy putaway. Short-rally tennis can still be entertaining, as long as the quality of play offsets the unfavorable watching-to-waiting ratio.

I've mentioned my perception that men's doubles features a lot of unreturned serves. The numbers suggest that I spoke too soon. For the five events, here are the percentage of points in which the return doesn't come back in play:

Event            Unret%  
Men's Singles     31.7%  
Women's Singles   24.3%  
Men's Doubles     32.1%  
Women's Doubles   21.6%  
Mixed Doubles     29.3%

For men, singles and doubles are about the same. Perhaps the singles servers are a bit stronger, but the doubles returners are taking more chances, trying to avoid feeding weak returns to aggressive netmen. With women, you're more likely to see a return in play in a doubles match than in singles. Unless you're a connoisseur of powerful serves, you'll probably find higher rates of returns in play to be more enjoyable to watch.

The same applies to winners, compared to unforced errors. (Forced errors are a bit tricky--sometimes they are as exciting and indicative of quality as a winner; other times they're just an out-of-position unforced error.) Let's see what fraction of points end in various ways, for each of the five events:

Event            Unforced%  Forced%  Winner%  
Men's Singles        25.6%    16.2%    21.3%  
Women's Singles      28.9%    16.0%    23.4%  
Men's Doubles        12.8%    17.2%    29.9%  
Women's Doubles      20.9%    18.0%    32.1%  
Mixed Doubles        14.5%    17.0%    29.5%

Here, doubles is the clear winner. For both men and women, more doubles points than singles points end in winners, and fewer points end in unforced errors. Some of that reflects the much higher rate of net play, since it's easier to execute an unreturnable shot from just a few feet behind the net. There are a few more forced errors in doubles, perhaps representing failed attempts to handle volleys that almost went for winners, but no matter how we interpret them, the difference in forced errors is not enough to offset the differences in winners and unforced errors.

The hipsters weren't wrong

The numbers aren't as conclusive as I expected them to be. Yes, doubles points are shorter, but not so much so that the format is reduced to only serving and returning. (Though some men's matches are close.) As usual, our data has limitations, but the information available for each point suggests that there's plenty of high-quality, entertaining tennis to be seen on doubles courts, even if it's usually limited to four or five shots at a time.

Petra Kvitova’s Current Status: Low Risk, High Reward

Italian translation at settesei.it

For more a decade, Petra Kvitova has been one of the most aggressive women in tennis. She aims for the corners, hits hard, and lets the chips fall where they may. Sometimes the results are ugly, like a 6-4 6-0 loss to Monica Niculescu in the 2016 Luxembourg final, but when it works, the rewards–two Wimbledon titles, for starters–more than make up for it.

She’s currently riding another wave of winners. Her 11-match win streak–which has involved the loss of only a single set–puts her one more victory away from a third major championship. The 28-year-old Czech has gotten this far by persisting with her big-hitting style, but with a twist: In Melbourne, she’s not missing very often. While she’s ending as many points as ever on her own racket, she’s missing less often than many of her more conservative peers.

In her last five matches at the Australian Open, from the second round through the semi-finals, 7.9% of her shots (including serves) have resulted in unforced errors. In the 88 Petra matches logged by the Match Charting Project, that’s the stingiest five-match stretch of her career. In charted matches since 2010, the average WTA player hits unforced errors on 8.0% of their shots. So Kvitova, the third-most aggressive player on tour, is somehow making errors at a below-average rate. It’s high-risk, high-reward tennis … without the risk.

And it isn’t because her go-for-broke tactics have changed. In Thursday’s semi-final against Danielle Collins, her aggression score–an aggregate measure of point-ending shots including winners, induced forced errors, and unforced errors–was 30.5%, the third-highest of all of her charted matches since her 2017 return to the tour. Her overall aggression score in Melbourne, 28.2%, is also higher than her career average of 27.1%.

In other words, she’s making fewer errors, and the missing errors are turning into point-ending shots in her favor. The following graph shows five-match rolling averages of winners (and induced forced errors) per shot and unforced errors per shot for all charted matches in Kvitova’s career:

Even with the winner and error rates smoothed out by five-match rolling averages, these are still some noisy trend lines. Still, some stories are quite clear. This month, Kvitova is hitting winners at close to her best-ever rate. Her average since the second round in Melbourne has been 20.3%, as high as anything she’s posted before with the exception of her 2014 Wimbledon title. (I’ve never tried to adjust winner totals for surface; it’s possible that the difference can be explained entirely by the grass.)

And most strikingly, this is as big a gap between winner rate and error rate as she’s achieved since her 2014 Wimbledon title run. In fact, between the second round and semi-finals at that tournament, she averaged 8.1% errors and 20.0% winners. Both of her numbers in Australia this year have been a tiny bit better.

Best of all, the error rate has–for the most part–seen a steady downward trend since 2016. The recent error spike is largely due to her three losses in Singapore last October and a bumpy start to this season in Brisbane. We can’t write those off entirely–perhaps Kvitova will always suffer through weeks when her aim goes awry–but she appears to have put them solidly behind her.

None of this is a guarantee that Petra will continue to avoid errors in Saturday’s final against Naomi Osaka. I could’ve written something about her encouraging error rates before the tour finals in Singapore last fall, and she failed to win a round-robin match there. And Osaka is likely to offer a stiffer challenge than any of Kvitova’s previous six opponents in Melbourne, even if her second serve doesn’t. That said, a stingy Kvitova is a terrifying prospect, one with the potential to end the brief WTA depth era and dominate women’s tennis.