Measuring WTA Tactics With Aggression Score

Editor’s note: Please welcome guest author Lowell! He’s a prolific contributor to the Match Charting Project, and the author of the first guest post on this blog.

The Problem

Quantifying aggression in tennis presents a quandary for the outsider. An aggressive shot and a defensive shot can occur on the same stroke at the same place on the court at the same point in a rally. To know whether one occurred, we need information on court positioning and shot speed, not only of the current shot, but the shots beforehand.

Since this data only exists for a fraction of tennis matches (via Hawkeye) and is not publicly available, using aggressive shots as a metric is untenable for public consumption. In a different era, net points may have been a suitable metric, but almost all current tennis, especially women’s tennis, revolves around baseline play.

Net points also can take on a random quality and may not actually reflect aggression. Elina Svitolina, according to data from the Match Charting Project, had 41 net points in her match against Yulia Putintseva at Roland Garros this year. However, this was not an indicator of Svitolina’s aggressive play so much as Putintseva hitting 51 drop shots in the match.

The Match Charting Project does give some data to help with this problem however. We can use the data to get the length of rallies and whether a player finished the point, i.e. he/she hit a winner or unforced error or their opponent hit a forced error. If we assume an aggressive player would be more likely to finish the point and would be more likely to try to finish the point sooner rather than later in a rally, we can build a metric.

The Metric

To calculate aggression using these assumptions, we need to know how often a player finished the point and how many opportunities did they have to finish the point, i.e. the number of times they had the ball in play on their side of the net. To measure the number of times a player finished the point, we add up the points where they hit a winner or unforced error or their opponent hit a forced error. For short, I will refer to these as “Points on Racquet”.

To measure how many opportunities a player had to finish the point, we calculate the number of times the ball was in play on each player’s side of the net. For service points, we add 1 to the length of each rally and divide it by 2, rounding up if the result is not an integer. For return points, we divide each rally by 2, rounding up if the result is not an integer. These adjustments allow us to accurately count how often a player had the ball in play on their side of the net. For brevity, I will call these values “Shot Opportunities”.

If we divide Points on Racquet by Shot Opportunities we will get a value between 0 and 1. If a player has a value of 0, they never finish points when the ball is on their side of the net. If the player has a value of 1, they only hit shots that end the point. As the value increases, a player is considered more aggressive. For short, I will call this measure an “Aggression Score.”

The Data

Taking data from the latest upload of the Match Charting Project, I found women’s players with 2000 or more completed points in the database (i.e. all points that were not point penalties or missed points). Eighteen players fitted these criteria. Since the Match Charting Project is, unfortunately, a nonrandom sample of matches, I felt uncomfortable making assessments below a very large number of data points. Using 2000 or more data points, however, an overwhelming amount of data would be required to overcome these assessments, giving some confidence that, while bias exists, we get in the neighborhood of the true aggression values.

The Results

Below are the results from the analysis. Tables 1-3 provide the Aggression Scores for each player overall, broken down into serve and return scores and further broken down into first and second serves. They also provide differences between where we would expect the player to be more aggressive (Serve v. Return, First Serve v. Second Serve and Second Serve Return v. First Serve Return).

Table 1: Aggression Scores

Name         Overall  On Serve  On Return  S-R Spread  
S Williams     0.281    0.3114     0.2476      0.0638  
S Halep       0.1818    0.2058     0.1537      0.0521  
M Sharapova   0.2421    0.2471     0.2358      0.0113  
C Wozniacki   0.1526    0.1788     0.1185      0.0603  
P Kvitova     0.3306     0.347      0.309       0.038  
L Safarova    0.2475    0.2694     0.2182      0.0512  
A Ivanovic    0.2413     0.247     0.2335      0.0135  
Ka Pliskova    0.256    0.2898     0.2095      0.0803  
G Muguruza     0.231     0.238     0.2214      0.0166  
A Kerber      0.1766    0.2044     0.1433      0.0611  
B Bencic      0.1742    0.1784     0.1687      0.0097  
A Radwanska   0.1473    0.1688     0.1207      0.0481  
S Errani      0.1232    0.1184     0.1297     -0.0113  
E Svitolina   0.1654    0.1769     0.1511      0.0258  
M Keys        0.3017    0.3284     0.2677      0.0607  
V Azarenka    0.1892    0.1988     0.1762      0.0226  
V Williams    0.2251     0.247     0.1944      0.0526  
E Bouchard    0.2458    0.2695     0.2157      0.0538  
WTA Tour       0.209    0.2254     0.1877      0.0377

Table 2: Serve Aggression Scores

Name          Serve  First Serve  Second Serve  1-2 Spread  
S Williams   0.3114       0.3958        0.2048       0.191  
S Halep      0.2058       0.2298        0.1587      0.0711  
M Sharapova  0.2471       0.2715        0.1989      0.0726  
C Wozniacki  0.1788       0.2016         0.121      0.0806  
P Kvitova     0.347       0.3924        0.2705      0.1219  
L Safarova   0.2694       0.3079        0.1983      0.1096  
A Ivanovic    0.247       0.2961        0.1732      0.1229  
Ka Pliskova  0.2898       0.3552        0.1985      0.1567  
G Muguruza    0.238       0.2906        0.1676       0.123  
A Kerber     0.2044       0.2337        0.1384      0.0953  
B Bencic     0.1784       0.2118        0.1218        0.09  
A Radwanska  0.1688       0.2083        0.0931      0.1152  
S Errani     0.1184       0.1254        0.0819      0.0435  
E Svitolina  0.1769       0.2196         0.105      0.1146  
M Keys       0.3284       0.3958        0.2453      0.1505  
V Azarenka   0.1988       0.2257        0.1347       0.091  
V Williams    0.247       0.3033        0.1716      0.1317  
E Bouchard   0.2695       0.3043        0.2162      0.0881  
WTA Tour     0.2254       0.2578        0.1679      0.0899

Table 3: Return Aggression Scores

Name          Serve  1st Return  2nd Return  Spread  
S Williams   0.2476      0.2108      0.3116  0.1008  
S Halep      0.1537      0.1399      0.1778  0.0379  
M Sharapova  0.2358      0.2133      0.2774  0.0641  
C Wozniacki  0.1185      0.1098       0.132  0.0222  
P Kvitova     0.309      0.2676      0.3803  0.1127  
L Safarova   0.2182      0.1778      0.2725  0.0947  
A Ivanovic   0.2335      0.1952      0.3027  0.1075  
Ka Pliskova  0.2095      0.1731      0.2715  0.0984  
G Muguruza   0.2214      0.1888      0.2855  0.0967  
A Kerber     0.1433      0.1127       0.191  0.0783  
B Bencic     0.1687      0.1514       0.197  0.0456  
A Radwanska  0.1207      0.1049      0.1464  0.0415  
S Errani     0.1297      0.1131      0.1613  0.0482  
E Svitolina  0.1511      0.1175      0.1981  0.0806  
M Keys       0.2677      0.2322      0.3464  0.1142  
V Azarenka   0.1762      0.1499      0.2164  0.0665  
V Williams   0.1944      0.1586       0.255  0.0964  
E Bouchard   0.2157      0.1757      0.2837   0.108  
WTA Tour     0.1877      0.1609      0.2341  0.0732

The first plot shows the relationship between serve and return aggression scores as well as the regression line with a confidence interval (note: since there are only 18 players in the sample, treat this regression line and all of the others in this post with caution).

Figure2

The second and third plots show the relationships between players’ aggression scores on first serves and their aggression scores on second serves for serve and return points respectively as well as the regression lines with confidence intervals.

Figure3

Figure4

The fourth and fifth plots show the relationship between the spread of serve and return aggression scores between first and second serve and the more aggressive point for the player, i.e. first serve for service points and second serve for return points as well as the regression lines with confidence intervals.

Figure5

Figure6

 

We can take away five preliminary observations.

Sara Errani knows where her money is made. The WTA is notoriously terrible for providing statistics. However, they do provide leaderboards for particular statistics, including return points and games won. Errani leads the tour in both this year. She also uniquely holds a higher Aggression Score on return points than serve points. From this information, we can hypothesize that Errani may play more aggressive on return points because she has greater confidence she can win those points or because she relies on those points more to win.

Maria Sharapova is insensitive to context; Elina Svitolina is highly sensitive to context. She falls outside of the confidence interval in all five plots. More specifically, Sharapova consistently is more aggressive on return points, second serve service points and first serve return points than her scores for service points, first serve service points and second serve return points respectively would predict. She has also lower spreads on serve and return than her more aggressive points would predict.

This result suggests that Sharapova differentiates relatively little in how she approaches points according to whether she is serving or returning or whether it is first serve or second serve. Svitolina exhibits the opposite trend as Sharapova. Considering anecdotal thoughts from watching Sharapova and Svitolina, these results make sense. Sharapova’s serve does not seem to vary between first and second and we see a lot of double faults. Svitolina can vary between aggressive shot-making and big first serves and conservative play. Hot takes are not always wrong.

Lucie Safarova, meet Eugenie Bouchard; Ana Ivanovic, meet Garbine Muguruza. Looking at the plots, it is interesting to note how Safarova and Bouchard seem to follow each other across the various measures. The same is true for Ivanovic and Muguruza. A potential application of the aggression score is that it can point us to players that are comparable and may have similar results. Players with good results against Safarova and Ivanovic may have good results against Bouchard and Muguruza, two younger players whom they are much less likely to have played.

Serena Williams and Karolina Pliskova serve like Madison Keys and Petra Kvitova, but they are very different. Serena, Pliskova, Keys and Kvitova are all players that are known for their serves as their weapons. Serena and Pliskova have the third and fourth highest Aggression Scores respectively. However, they also have wide spreads on serve and return scores and they have much lower second serve service point scores than their first serve scores would predict, whereas Keys is about where the prediction places her and Kvitova is far more aggressive than her first serve points would predict.

While Serena is still a relatively aggressive returner, she rates lower on first serve return aggression than Maria Sharapova. Pliskova falls to the middle of the pack on return aggression. Kvitova and Keys, in contrast, are both very aggressive on return points. My hypothesis for the difference is that while Serena and Pliskova are aggressive players, their scores get inflated by using their first serve as a weapon and they are only somewhat more aggressive than the players that score below them. Kvitova and Keys, on the other had, are exceptionally aggressive players.

The WTA runs through Victoria Azarenka and Madison Keys. Oddly, the players who seemed to best capture the relationships between all of the aggression scores and spreads of aggression scores were Victoria Azarenka and Madison Keys. Neither strayed outside of the confidence interval and often ended up on the best-fit line from the regressions. They define average for the WTA top 20.

These thoughts are preliminary and any suggestions on how they could be used or improved would be helpful. I also must beseech you to help with the Match Charting Project to put more players over the 2,000 point mark and get more points for the players on this list to help their Aggression Scores a better part of reality.

Uncontrolled Aggression

Italian translation at settesei.it

Listen to tennis commentary–or a broadcast of any sport, really–and wait for the first mention of “consistency.” You won’t have to wait for long.

“Consistent” is good, and “inconsistent” is bad. Or so we’re told. At first blush, it makes sense. Consistency is a good thing when it comes to following through on your forehand or brushing your teeth every day. But unless you’re the very best player in the world, consistency doesn’t win you Grand Slam titles.

Think of it this way: Every player has an “average” level they are capable of playing. If average Rafael Nadal plays average anybody else on clay, average Nadal wins. If average Richard Gasquet plays average anybody-outside-the-top-fifty, average Gasquet wins. These situations, for the likes of Nadal and Gasquet, are when consistency is actually a good thing. Sure, Rafa might be able to raise him game to previously unheard-of heights, but what’s the point? It’s a matter of winning 6-1 6-0 instead of 6-3 6-2. Nadal’s main concern is avoiding an off day.

Consider the same example from the perspective of Rafa’s opponent. If you’re Tomas Berdych and you play at your usual level against Nadal, you’ll lose. That’s what consistency gets you: thirteen straight losses.

Uncontrolled aggression

Very aggressive players tend to get a bad rap. The guys who always go for their shots–think Lukas Rosol or Nikolay Davydenko–rack up huge winner and unforced error counts. Sometimes it works and often it doesn’t. When it doesn’t, the conventional wisdom always seems to be that these players need to rein in their aggression. They need to be more consistent.

But they don’t. If Rosol stopped unleashing huge shots in every direction, he’d make fewer unforced errors, but he’d hit far fewer winners. He might still hover around #50 in the world, but more likely, he’d still be lurking in the Challenger ranks, looking for the breakthrough that such a passive style might never earn for him. As it is, Rosol’s go-for-broke approach got him that career-defining upset over Nadal, not to mention an ATP title in Bucharest last spring, when he beat three higher-ranked players.

Rather than the pundit’s favored phrase of “controlled aggression,” players score big upsets and major breakthroughs with uncontrolled aggression. (It only looks controlled because it’s working that day.) If you rein in an aggressive player, he may win more of the matches he’s supposed to win, but he’s much less likely to score an upset.

The balance myth

The game of tennis has so much variety–surfaces, climates, playing styles–and so much alternation–deuce/ad, serve/return–that pundits are constantly endorsing balance. Andy Murray needs to get better on clay, they say. Jerzy Janowicz needs to improve his return game. Monica Niculescu needs to learn how to hit a forehand.

It’s a tempting argument to make, because the best players in the game do have that balance. Nadal and Djokovic and Serena and Li have a wide variety of devastating shots and tactics that are effective on every surface. If you want to play like them and reap the same rewards, you need to have that same balance.

Except that, for the vast majority of players–even top-tenners–that just isn’t going to happen. I don’t care if David Ferrer hires a coaching team of Pete Sampras and Mark Philippoussis, he’ll never be much more effective on serve. John Isner could work all offseason with Andre Agassi and remain among the game’s weakest returners.

What’s keeping these players from climbing any higher in the rankings isn’t the fact that they aren’t more balanced. It’s the simple fact that they aren’t better. By definition, most people will never be a once-in-a-generation talent.

Most players are not balanced. And that’s fine. Rather than chasing the impossible dream of out-Novaking Novak, they need to take more risks to outplay their betters in one or two areas. When it doesn’t work, it doesn’t matter–they would’ve lost anyway.

The cluster principle

Tennis rewards the streaky. If you only win four return points in a set, it’s much better to win them consecutively than to spread them out. It’s better to win five matches in one week and go winless for the next four weeks than win one match per week.

Whether it’s points, games, sets, matches, or even titles, it’s better to cluster your triumphs.

If you strive for a balanced game, the best players simply won’t let you go on a streak. Fabio Fognini or Sabine Lisicki might give you a few gifts, but Nadal never will. The only way to cluster your victories over Rafa is to play such aggressive tennis that even he can’t neutralize it. It usually won’t work, but for most players, it’s their only hope. There’s a reason the hyper-aggressive Davydenko is the only active player with a winning record against him.

Stan’s untold narrative

Stanislas Wawrinka probably wouldn’t have beaten a healthy Nadal over five sets on Sunday. But he was winning when Rafa’s back acted up, and he did so by unleashing every weapon in his arsenal.

Whatever the rankings say this week, Wawrinka isn’t one of the best three tennis players in the world. At least “average Stan” isn’t. But that’s the whole point. Tennis doesn’t reward players with ranking points and prize money for consistency. Consistency got Berdych into the top ten and has kept him there for so long … but it has prevented him from spending much time in the top five.

Wawrinka won’t always beat Nadal or Djokovic, and he’ll continue to suffer his share of defeats at the hands of the players ranked below him. The high-risk style of play that earned him a place in the history books won’t always pay off. That’s all part of the package. Stan didn’t get this far by being consistent.

The Questionable Wisdom of the Drop Shot

Italian translation at settesei.it

More than any other shot in tennis, the drop shot can make the player who hits it look either brilliant or idiotic.  The line separating the two is rarely so fine.

When we combine the brilliance and the idiocy, how does the drop shot measure up?  How much does a player gain or lose with frequent use of the dropper?

In the final match of last week’s Challenger Tour Finals between Alejandro Gonzalez and Filippo Volandri, Volandri hit a whopping 23 drop shots–almost one per game (click the “Shot Types” links).  Volandri is a seasoned pro with an excellent sense of clay court tactics, so he avoided the clunkiest drop shot misses–only three of the 23 were errors.  Yet despite facing an opponent who prefers to camp out well behind the baseline, the Italian won only 11 of the 23 points.  Almost half the time the drop shot landed in the court, Gonzalez chased it down, got a return in play, and went on to win the point.

Volandri’s performance in the final wasn’t an anomaly.  In the semifinal against Teymuraz Gabashvili, he attempted 17 drop shots and won only nine of those points.  The other aggressive drop-shotter at the CTF, Oleksandr Nedovyesov, hit 19 drop shots against Gabashvili in a round-robin match.  Even though eight of those 19 drop shots were winners, Nedovyesov lost ten of the ensuing points.

With my shot-by-shot analyses of five matches from last week’s event in Sao Paulo, we can take a somewhat broader look at drop shot tactics and their results.  While this subset may not be representative of all clay-court tennis (for one thing, the altitude makes it a bit easier to chase down a dropper), the aggregate numbers raise some questions about the wisdom of the drop-shot tactic.

As a whole, the six players who took part in these five matches hit 95 drop shots.  16 (16.8%) of them were unforced errors, compared to an overall rate of about 1 unforced error per 10 rallying shots.  29 (30.5%) were outright winners, while another five induced forced errors, immediately ending the point.  That leaves 45 points (47.4%) in which the opponent got the ball back in play.  Of those, the dropshotter won only 19 (42.2%).

Taken together, the results aren’t bad.  The player who hit the drop shot won 53 (55.8%) of the points, and 67.1% of the points when the drop shot landed in play.

There is a noticeable difference, however, in the success rates of the frequent dropshotters (Voladri and Nedovyesov) compared to those of the other four players, who averaged fewer than four drop shots per match.  While the players of what I’ll call the “infrequent group”–Gabashvili, Gonzalez, Guilherme Clezar, and Jesse Huta Galung–may not be as practiced in the art, it is likely that they chose their moments much more carefully, hitting drop shots when the tactic was obvious.

The infrequent group hit 22 drop shots, missing only two.  Not only did nine go for winners, but the overall results were positive as well, as they won 14 (63.6%) of those points.

Remove the infrequent group from the overall numbers, and the aggressive dropshotters won a mere 53.4% of points in which they used the tactic.

53.4% isn’t awful–if you win 53.4% of the points in a match, you almost always win.  However, the type of point in which the drop shot makes sense isn’t an average point.  Usually the dropshotter has better court position than his opponent, who may be off-balance or far behind the baseline.  This isn’t always the case, especially when the dropshotter is simply trying to end the point, or when his brain stops working.  But in the majority of cases, the dropshotter has such an advantage in court position, it seems likely that a more common tactic–such as an aggressive groundstroke, perhaps followed by a net approach–would do better.

Another consideration goes beyond the outcome of a specific point.  A player who fails to run down a drop shot will probably remember that lost point for a game or two and play a little closer to the baseline, maybe making himself less comfortable in the process.  It’s possible that the long-term effect gives an advantage to the player who regularly uses the tactic.

But somewhere between Gonzalez’s four drop shots on Sunday and Volandri’s 23, the marginal advantage of each additional dropper must wear off.  I find it hard to imagine that one drop shot per game has any more of a long-term strategic effect than one drop shot per three games.  If that’s true, Volandri hit 13 or 14 more drop shots than required.  Thus, in about 8% of Sunday’s 162 points, he took an advantageous court position and wasted it on an even-odds shot.

More evidence will surely give us a fuller picture of drop-shot tactics on clay courts.  We may be able to determine whether there is a post-dropper “hangover effect” and if so, how many drop shots are required to reap the benefits.  Until then, it’s worth considering whether drop shots are worth the risk, especially when there may be such a high-percentage alternative.