The Match Charting Project Hits 1,400!

Yesterday, the Match Charting Project hit another milestone: 1,400 matches!

For the last few months, we’ve been growing at the fastest pace yet, better than four new matches per day. We’re adding high-profile matches from the men’s and women’s tour, along with plenty of Challenger and historical matches, as well.

Recent milestones include 100 Rafael Nadal matches, 75 Novak Djokovic matches, 300 active ATPers, 40 Agnieszka Radwanska matches, and 50 Grand Slam finals.

Here’s the complete list, where you can find detailed shot-by-shot breakdowns of every one of these matches, sorted by player.

I’ve also updated the raw data, which is available here.

We’ll definitely reach 1,500 by the end of the year. We started 2015 with 548 matches, so if we keep up the current rate, we’ll cross the 1,000 mark for 2015 alone, including nearly 700 from the 2015 season. To those of you who have contributed: Thank you. You deserve the thanks of the entire tennis community.

If you haven’t contributed, now is a great time to start. Click here for my “quick start” guide to match charting. As we approach the offseason, it’s the perfect opportunity to dig up one of those old matches you’ve always meant to watch. Learn to chart, and you can make tennis analytics better at the same time.

All the Answers

At the end of Turing’s Cathedral, George Dyson suggests that while computers aren’t always able to usefully respond to our questions, they are able to generate a stunning, unprecedented array of answers–even if the corresponding questions have never been asked.

Think of a search engine: It has indexed every possible word and phrase, in many cases still waiting for the first user to search for it.

Tennis Abstract is no different. Using the menus on the left-hand side of Roger Federer’s page–even ignoring the filters for head-to-heads, tournaments, countries, matchstats, and custom settings like those for date and rank–you can run five trillion different queries. That’s twelve zeroes–and that’s just Federer. Judging by my traffic numbers, it will be a bit longer before all of those have been tried.

Every filter is there for a reason–an attempt to answer some meaningful question about a player. But the vast majority of those five trillion queries settle debates that no one in their right mind would ever have, like Roger’s 2010 hard-court Masters record when winning a set 6-1 against a player outside the top 10. (He was 2-0.)

The danger in having all these answers is that it can be tempting to pretend we were asking the questions–or worse, that we were asking the questions and suspected all along that the answers would turn out this way.

The Hawkeye data on tennis broadcasts is a great example. When a graphic shows us the trajectory of several serves, or the path of the ball over every shot of a rally, we’re looking at an enormous amount of raw data, more than most of us could comprehend if it weren’t presented against the familiar backdrop of a tennis court. Given all those answers, our first instinct is too often to seek evidence for something we were already pretty sure about–that Jack Sock’s topspin is doing the damage, or Rafael Nadal’s second serve is attackable.

It’s tough to argue with those kind of claims, especially when a high-tech graphic appears to serve as confirmation. But while those graphics (or those results of long-tail Tennis Abstract queries) are “answers,” they address only narrow questions, rarely proving the points we pretend they do.

These narrow answers are merely jumping-off points for meaningful questions. Instead of looking at a breakdown of Novak Djokovic’s backhands over the course of a match and declaring, “I knew it, his down-the-line backhand is the best in the game,” we should realize we’re looking at a small sample, devoid of context, and take the opportunity to ask, “Is his down-the-line backhand always this good?” or “How does his down-the-line backhand compare to others?” Or even, “How much does a down-the-line backhand increase a player’s odds of winning a rally?”

Unfortunately, the discussion usually stops before a meaningful question is ever asked. Even without publicly released Hawkeye data, we’re beginning to have the necessary data to research many of these questions.

As much as we love to complain about the dearth of tennis analytics, too many people draw conclusions from the pseudo-answers of fancy graphics. With more data available to us than ever before, it is a shame to mistake narrow, facile answers for broad, meaningful ones.

The Difficulty (and Importance) of Finding the Backhand

One disadvantage of some one-handed backhands is that they tend to sit up a little more when they’re hit crosscourt. That gives an opponent more time to prepare and, often, enough time to run around a crosscourt shot and hit a forehand, which opens up more tactical possibilities.

With the 700 men’s matches in the Match Charting Project database (please contribute!), we can start to quantify this disadvantage–if indeed it has a negative effect on one-handers. Once we’ve determined whether one-handers can find their opponents’ backhands, we can try to answer the more important question of how much it matters.

The scenario

Let’s take all baseline rallies between right-handers. Your opponent hits a shot to your backhand side, and you have three choices: drive (flat or topspin) backhand, slice backhand, or run around to hit a forehand. You’ll occasionally go for a winner down the line and you’ll sometimes be forced to hit a weak reply down the middle, but usually, your goal is to return the shot crosscourt, ideally finding your opponent’s backhand.

Considering all righty-righty matchups including at least one player among the last week’s ATP top 72 (I wanted to include Nicolas Almagro), here are the frequency and results of each of those choices:

ALL             9.9%   68.1%  10.8%    5.8%   43.1%  
SLICE  11.9%   34.1%   49.5%   7.1%    0.6%   40.2%  
FH     44.9%    2.8%   69.0%  13.0%    9.8%   42.1%  
BH     43.3%   10.7%   72.2%   9.5%    3.1%   45.0%  
1HBH   42.6%   12.0%   69.5%   9.3%    3.8%   44.2%  
2HBH   43.5%   10.0%   73.4%   9.6%    2.8%   45.4%

“FH REP” and “BH REP” refer to a forehand or backhand reply, and we can see just how much shot selection matters in keeping the ball away from your opponent’s forehand. A slice does a very poor job, while an inside-out forehand almost guarantees a backhand reply, though it comes with an increased risk of error.

The differences between one- and two-handed backhands aren’t as stark. One-handers don’t find the backhand quite as frequently, though they hit a few more winners. They hit drive backhands a bit less often, but that doesn’t necessarily mean they are hitting forehands instead. On average, two-handers hit a few more forehands from the backhand corner, while one-handers are forced to hit more slices.

One hand, many types

Not all one-handed backhands are created equal, and these numbers bear that out. Stanislas Wawrinka‘s backhand is as effective as the best two-handers, while Roger Federer‘s is typically the jumping-off point for discussions of why the one-hander is dying.

Here are the 28 players for whom we have at least 500 instances (excluding service returns) when the player responded to a shot hit to his backhand corner. For each, I’ve shown how often he chose a drive backhand or forehand, and the frequency with which he found the backhand–excluding his own errors and winners.

Player                 BH  BH FRQ  FIND BH%  FH FRQ  FIND BH%  
Alexandr Dolgopolov     2   45.7%     94.2%   43.3%     98.7%  
Kei Nishikori           2   51.1%     94.0%   38.9%     98.1%  
Andy Murray             2   41.0%     92.4%   46.5%     98.6%  
Stanislas Wawrinka      1   48.6%     92.1%   37.5%     98.0%  
Bernard Tomic           2   33.8%     91.7%   43.8%     97.9%  
Novak Djokovic          2   47.2%     91.7%   41.4%     98.5%  
Kevin Anderson          2   41.0%     91.5%   45.8%     96.6%  
Borna Coric             2   46.5%     90.7%   44.2%     96.9%  
Pablo Cuevas            1   41.9%     90.6%   54.5%     96.5%  
Marin Cilic             2   45.4%     89.7%   43.3%     97.2%  
Player                 BH  BH FRQ  FIND BH%  FH FRQ  FIND BH%  
Tomas Berdych           2   41.6%     89.3%   44.2%     97.5%  
Pablo Carreno Busta     2   55.4%     87.8%   41.1%     93.5%  
Fabio Fognini           2   46.0%     87.4%   47.0%     96.1%  
Richard Gasquet         1   57.2%     87.3%   32.1%     96.8%  
Andreas Seppi           2   40.3%     87.2%   50.0%     93.9%  
Nicolas Almagro         1   53.6%     86.5%   39.3%     98.0%  
Dominic Thiem           1   38.5%     86.2%   50.0%     96.5%  
Gael Monfils            2   48.0%     85.3%   46.3%     85.3%  
David Ferrer            2   48.2%     84.9%   40.4%     97.1%  
Roger Federer           1   42.7%     84.8%   43.6%     94.5%  
Player                 BH  BH FRQ  FIND BH%  FH FRQ  FIND BH%  
Gilles Simon            2   46.9%     84.6%   46.5%     94.6%  
David Goffin            2   45.4%     84.6%   45.7%     94.9%  
Roberto Bautista Agut   2   39.6%     83.3%   46.7%     98.4%  
Jo Wilfried Tsonga      2   43.5%     82.0%   44.5%     96.3%  
Grigor Dimitrov         1   41.4%     78.6%   39.4%     92.8%  
Milos Raonic            2   31.5%     63.5%   56.5%     94.3%  
Jack Sock               2   27.0%     62.5%   62.9%     96.3%  
Tommy Robredo           1   26.6%     56.1%   62.3%     88.4%

One-handers Wawrinka, Pablo Cuevas, and Richard Gasquet (barely) are among the top half of these players, in terms of finding the backhand with their own backhand. Federer and his would-be clone Grigor Dimitrov are at the other end of the spectrum.

Taking all 60 righties I included in this analysis (not just those shown above), there is a mild negative correlation (r^2 = -0.16) between a player’s likelihood of finding the opponent’s backhand with his own and the rate at which he chooses to hit a forehand from that corner. In other words, the worse he is at finding the backhand, the more inside-out forehands he hits. Tommy Robredo and Jack Sock are the one- and two-handed poster boys for this, struggling more than any other players to find the backhand, and compensating by hitting as many forehands as possible.

However, Federer–and, to an even greater extent, Dimitrov–don’t fit this mold. The average one-hander runs around balls in their backhand corner 44.6% of the time, while Fed is one percentage point under that and Dimitrov is below 40%. Federer is perceived to be particularly aggressive with his inside-out (and inside-in) forehands, but that may be because he chooses his moments wisely.

Ultimate outcomes

Let’s look at this from one more angle. In the end, what matters is whether you win the point, no matter how you get there. For each of the 28 players listed above, I calculated the rate at which they won points for each shot selection. For instance, when Novak Djokovic hits a drive backhand from his backhand corner, he wins the point 45.4% of the time, compared to 42.3% when he hits a slice and 42.4% when he hits a forehand.

Against his own average, Djokovic is about 3.6% better when he chooses (or to think of it another way, is able to choose) a drive backhand. For all of these players, here’s how each of the three shot choices compare to their average outcome:

Player                 BH   BH W   SL W   FH W  
Dominic Thiem           1  1.209  0.633  0.924  
David Goffin            2  1.111  0.656  0.956  
Grigor Dimitrov         1  1.104  0.730  1.022  
Gilles Simon            2  1.097  0.922  0.913  
Tomas Berdych           2  1.085  0.884  0.957  
Pablo Carreno Busta     2  1.081  0.982  0.892  
Kei Nishikori           2  1.070  0.777  0.965  
Roberto Bautista Agut   2  1.055  0.747  1.027  
Stanislas Wawrinka      1  1.050  0.995  0.936  
Borna Coric             2  1.049  1.033  0.941  
Player                 BH   BH W   SL W   FH W  
Bernard Tomic           2  1.049  1.037  0.943  
Jack Sock               2  1.049  0.811  1.010  
Gael Monfils            2  1.048  1.100  0.938  
Fabio Fognini           2  1.048  0.775  0.987  
Milos Raonic            2  1.048  0.996  0.974  
Nicolas Almagro         1  1.046  0.848  0.964  
Kevin Anderson          2  1.038  1.056  0.950  
Novak Djokovic          2  1.036  0.966  0.969  
Andy Murray             2  1.031  1.039  0.962  
Roger Federer           1  1.023  1.005  0.976  
Player                 BH   BH W   SL W   FH W  
Richard Gasquet         1  1.020  0.795  1.033  
Andreas Seppi           2  1.019  0.883  1.008  
David Ferrer            2  1.018  0.853  1.020  
Alexandr Dolgopolov     2  1.010  1.010  0.987  
Marin Cilic             2  1.006  1.009  0.991  
Pablo Cuevas            1  0.987  0.425  1.048  
Jo Wilfried Tsonga      2  0.956  0.805  1.095  
Tommy Robredo           1  0.845  0.930  1.079

In this view, Dimitrov–along with his fellow one-handed flame carrier Dominic Thiem–looks a lot better. His crosscourt backhand doesn’t find many backhands, but it is by far his most effective shot from his own backhand corner. We would expect him to win more points with a drive backhand than with a slice (since he probably opts for slices in more defensive positions), but it’s surprising to me that his backhand is so much better than the inside-out forehand.

While Dimitrov and Thiem are more extreme than most, almost all of these players have better results with crosscourt drive backhands than with inside-out (or inside-in forehands). Only five–including Robredo but, shockingly, not including Sock–win more points after hitting forehands from the backhand corner.

It’s clear that one-handers do, in fact, have a slightly more difficult time forcing their opponents to hit backhands. It’s much less clear how much it matters. Even Federer, with his famously dodgy backhand and even more famously dominant inside-out forehand, is slightly better off hitting a backhand from his backhand corner. We’ll never know what would happen if Fed had Djokovic’s backhand instead, but even though Federer’s one-hander isn’t finding as many backhands as Novak’s two-hander does, it’s getting the job done at a surprisingly high rate.

Are Two First Serves Ever Better Than One?

It’s one of those ideas that never really goes away. Some players have such strong first serves that we often wonder what would happen if they hit only first serves. That is, if a player went all-out on every serve, would his results be any better?

Last year, Carl Bialik answered that question: It’s a reasonably straightforward “no.”

Bialik showed that among ATP tour regulars in 2014, only Ivo Karlovic would benefit from what I’ll call the “double-first” strategy, and his gains would be minimal. When I ran the numbers for 2015–assuming for all players that their rates of making first serves and winning first-serve points would stay the same–I found that Karlovic only breaks even. Going back to 2010, 2014 Ivo was the only player-season with at least 40 matches for whom two first serves would be better than one.

Still, it’s not an open-and-shut case. What struck me is that the disadvantage of a double-first strategy would be so minimal. For Karlovic (and others, mainly big servers, such as Jerzy Janowicz, Milos Raonic,and John Isner), hitting two first serves would only slightly decrease their overall rate of service points won. For Rafael Nadal and Andy Murray, opting for double-first would reduce their rate of service points won by just under two percentage points.

Here’s a visual look at 2015 tour regulars (minimum 30 matches), showing the hypothetical disadvantage of two first serves. The diagonal line is the breakeven level; Ivo, Janowicz, and Isner are the three points nearly on the line.


Since some players are so close to breaking even, I started to wonder if some matchups make the double-first strategy a winning proposition. For example, Novak Djokovic is so dominant against second serves that, perhaps, opponents would be better off letting him see only first serves.

However, it remains a good idea–at least in general–to take the traditional approach against Djokovic. Hypothetically, two first serves would result in Novak raising his rate of return points won by 1.2 percentage points. Gilles Simon and Andy Murray are in similar territory, right around 1 percentage point.

Here’s the same plot, showing the disadvantage of double-first against tour-regular returners this season:


There just aren’t any returners who would cause the strategy to come as close to breaking even as some big servers do.

The match-level tactic

What happens if a nearly-breakeven server, like Karlovic, faces a not-far-from-breakeven returner, like Djokovic? If opting for double-first is almost a good idea for Ivo against the average returner, what happens when he faces someone particularly skilled at attacking second serves?

Sure enough, there are lots of matches in which two first serves would have been better than one. I found about 1300 matches between tour regulars (players with 30+ matches) this season, and for each one, I calculated each player’s actual service points won along with their estimated points won had they hit two first serves. About one-quarter of the time, double-first would have been an improvement.

This finding holds up in longer matches, too, avoiding some of the danger of tiny samples in short matches. In one-quarter of longer-than-average matches, a player would have still benefited from the double-first strategy. Here’s a look at how those matches are distributed:


Finally, some action on the left side of the line! One of those outliers in the far upper right of the graph is, in fact, Ivo’s upset of Djokovic in Doha this year. Karlovic won 85% of first-serve points but only 50% of second-serve points. Had he hit only first serves, he would’ve won about 79% of his service points instead of the 75% that he recorded that day.

Another standout example is Karlovic’s match against Simon in Cincinnati. Ivo won 81% of first-serve points and only 39% of second-serve points. He won the match anyway, but if he had pursued a double-first strategy, Simon could’ve caught an earlier flight home.

Predicting double-first opportunities

Armed with all this data, we would still have a very difficult time identifying opportunities for players to take advantage of the strategy.

For each player in every match, I multiplied his “double-first disadvantage” (the number of percentage points of serve points won he would lose by hitting two first serves) with the returner’s double-first disadvantage. Ranking all matches by the resulting product puts combinations like Karlovic-Djokovic and Murray-Isner together at one extreme. If we are to find instances where we could retroactively predict an advantage from hitting two first serves, they would be here.

When we divide all these matches into quintiles, there is a strong relationship between the double-first results we would predict using season-aggregate numbers and the double-first results we see in individual matches. However, even if the most double-first-friendly quintile–the one filled with Ivo serving and Novak returning–there’s still, on average, a one-percentage-point advantage to the traditional serving tactic.

It is only at the most extreme that we could even consider recommending two first serves. When we take the 2% of matches with the smallest products–that is, the ones we would most expect to benefit from double first–26 of those 50 matches are one in which the server would’ve done better to hit two first serves.

In other words, there’s a ton of variance at the individual match level, and since the margins are so slim, there are almost no situations where it would be sensible for a player to hit two first serves.

A brief coda in the real world

All of this analysis is based on some simplifying assumptions, namely that players would make their first serves at the same rate if they were hitting two instead of one, and that players would win the same number of points behind their first serves even if they were hitting them twice as often.

We can only speculate how much those assumptions mask. I suspect that if a player hit only first serves, he would be more likely to see streaks of both success and failure; without second serves to mix things up, it would be easier to find oneself repeating mechanics, whether perfect or flawed.

The second assumption is probably the more important one. If a server hit only first serves, his ability to mix things up and disguise serving patterns would be hampered. I have no idea how much that would affect the outcome of service points–but it would probably act to the advantage of the returner.

All that said, even if we can’t recommend that players hit two first serves in any but the extreme matchups, it is worth emphasizing that the margins we’re discussing are small. And since they are small, the risk of hitting big second serves isn’t that great. There may be room for players to profitably experiment with more aggressive second serving, especially when a returner starts crushing second serves.

Ceding the advantage on second-serve points to a player like Djokovic must be disheartening. If the risk of a few more double faults is tolerable, we may have stumbled on a way for servers to occasionally stop the bleeding.

Toward a Better Understanding of Return Effectiveness

The deeper the return, the better, right? That, at least, is the basis for many of the flashy graphics we see these days on tennis broadcasts, indicating the location of every return, often separated into “shallow,” “medium,” and “deep” zones.

In general, yes, deep returns are better than shallow ones. But return winners aren’t overwhelmingly deep, since returners can achieve sharper angles if they aim closer to the service line. There are plenty of other complicating factors as well: returns to the sides of the court are more effective than those down the middle, second-serve returns tend to be better than first-serve returns, and topspin returns result in more return points won than chip or slice returns.

While most of this is common sense, quantifying it is an arduous and mind-bending task. When we consider all these variables–first or second serve, deuce or ad court, serve direction, whether the returner is a righty or lefty, forehand or backhand return, topspin or slice, return direction, and return depth–we end up with more than 8,500 permutations. Many are useless (righties don’t hit a lot of forehand chip returns against deuce court serves down the T), but thousands reflect some common-enough scenario.

To get us started, let’s set aside all of the variables but one. When we analyze 600+ ATP matches in the Match Charting Project data, we have roughly 61,000 in-play returns coded in one of nine zones, including at least 2,000 in each.  Here is a look at the impact of return location, showing the server’s winning percentage when a return comes back in play to one of the nine zones:rzones1show

(“Shallow” is defined as anywhere inside the service boxes, while “Medium” and “Deep” each represent half of the area behind the service boxes. The left, center, and right zones are intended to indicate roughly where the return would cross the baseline, so for sharply angled shots, a return might bounce shallow near the middle of the court but be classified as a return to the forehand or backhand side.)

As we would expect, deeper returns work in favor of the returner, as do returns away from the center of the court. A bit surprisingly, returns to the server’s forehand side (if he’s a right-hander) are markedly more effective than those to the backhand. This is probably because right-handed returners are most dangerous when hitting crosscourt forehands, although left-handed returners are also more effective (if not as dramatically) when returning to that side of the court.

Let’s narrow things down just a little and see how the impact of return location differs on first and second serves. Here are the server’s chances of winning the point if a first-serve return comes back in each of the nine zones:


And the same for second-serve returns:


It’s worth emphasizing just how much impact a deep return can have. So many points are won with unreturnable serves–even seconds–that simply getting the ball back in play comes close to making the point a 50/50 proposition. A deep second-serve return, especially to a corner, puts the returner in a very favorable position. Consistently hitting returns like that is a big reason why Novak Djokovic essentially turns his opponents’ second serves against them.

The final map makes it clear how valuable it is to move the server away from the middle of the court. Think of it as a tactical first strike, forcing the server to play defensively instead of dictating play with his second shot. Among second-serve returns put in play, any ball placed away from the middle of the court–regardless of depth–gives the returner a better chance of winning the point than does a deep return down the middle.

For today, I’m going to stop here. This is just the tip of the iceberg, as there are so many variables that play some part in the effectiveness of various service returns. Ultimately, understanding the potency of each return location will give us additional insight into what players can achieve with different kinds of serve, which players are deadliest with certain types of returns, and how best to handle different returns with the server’s crucial second shot.

Teymuraz Gabashvili and ATP Quarterfinal Losing Streaks

Yesterday in Moscow, Teymuraz Gabashvili played his 16th career tour-level quarterfinal. Facing 118th-ranked Evgeny Donskoy, it was his best chance yet to reach an ATP semifinal, but just as in each of his previous 15 attempts, he lost.

No other player has contested so many tour-level quarterfinals without ever winning one. But while the streak of 16 consecutive quarterfinal losses is a rarity, it’s not a record. The all-time mark belongs to Gianluca Pozzi, who dropped 18 in a row between 1993 and 2000. Pozzi’s record, depressing as that streak is, might be an inspiration to Gabashvili: At age 35, Pozzi finally broke the streak, defeating Marat Safin, one of the best players he ever faced in a quarterfinal.

Gabashvili and Pozzi are among only twelve players who have strung together more than 10 quarterfinal losses at tour level. Here’s the complete list, including the dates of the first and last loss in each streak:

Player               QFs L Streak     Start       End  
Gianluca Pozzi                 18  19930104  20000501  
Teymuraz Gabashvili            16  20070219         *  
Paul Annacone                  14  19860127  19880704  
Ivan Molina                    12  19751110  19791105  
Mischa Zverev                  11  20060925  20090713  
Diego Perez                    11  19861124  19920810  
Anand Amritraj                 11  19750304  19810706  
Dennis Ralston                 11  19701101  19800602  
Bob Carmichael                 11  19720918  19751231  
Ricardas Berankis              10  20120917         *  
Yen Hsun Lu                    10  20070219  20130923  
Mikhail Youzhny                10  20041101  20060130

Ricardas Berankis is the only other player on this list to have an active streak, and since he’s five years younger than Gabashvili, another few years of mild success and quarterfinal futility could put him in the running for the all-time record. Alas, neither player is likely to repeat the post-streak success of Mikhail Youzhny, who went on to play 63 more tour-level quarterfinals, winning 33 of them.

If there’s a silver lining for Gabashvili, it’s that he’s reached all of those quarterfinals, sparing himself the fate of Rolf Thung, a Dutch player from the 1970s who reached the round of 16 at 18 tour events and lost them all.

Elo-Forecasting the WTA Tour Finals in Singapore

With the field of eight divided into two round-robin groups for the WTA Tour Finals in Singapore, we can play around with some forecasts for this event. I’ve updated my Elo ratings through last week’s tournaments, and the first thing that jumps out is how different they are from the official rankings.

Here’s the Singapore field:

EloRank  Player                Elo  Group  
2        Maria Sharapova      2296    RED  
4        Simona Halep         2181    RED  
6        Garbine Muguruza     2147  WHITE  
8        Petra Kvitova        2136  WHITE  
9        Angelique Kerber     2129  WHITE  
11       Agnieszka Radwanska  2100    RED  
15       Lucie Safarova       2051  WHITE  
21       Flavia Pennetta      2004    RED

Serena Williams (#1 in just about every imaginable ranking system) chose not to play, but if Elo ruled the day, Belinda Bencic, Venus Williams, and Victoria Azarenka would be playing this week in place of Agnieszka Radwanska, Lucie Safarova, and Flavia Pennetta.

Anyway, we’ll work with what we’ve got. Maria Sharapova is, according to Elo, a huge favorite here. The ratings translate into a forecast that looks like this:

Player                  SF  Final  Title  
Maria Sharapova      83.7%  61.1%  43.6%  
Simona Halep         60.8%  35.4%  15.9%  
Garbine Muguruza     59.4%  25.7%  11.3%  
Petra Kvitova        55.2%  23.0%   9.8%  
Angelique Kerber     53.1%  21.7%   8.8%  
Agnieszka Radwanska  37.4%  17.4%   6.1%  
Lucie Safarova       32.3%   9.7%   3.1%  
Flavia Pennetta      18.1%   6.0%   1.4%

If Sharapova is really that good, the loser in today’s draw was Simona Halep. The top seed would typically benefit from having the second seed in the other group, but because Garbine Muguruza recently took over the third spot in the rankings, Pova entered the draw as a dangerous floater.

However, these ratings don’t reflect the fact that Sharapova hasn’t completed a match since Wimbledon. They don’t decline with inactivity, so Pova’s rating is the same as it was the day after she lost to Serena back in July. (My algorithm also excludes retirements, so her attempted return in Wuhan isn’t considered.)

With as little as we know about Sharapova’s health, it’s tough to know how to tweak her rating. For lack of any better ideas, I revised her Elo rating to 2132, right between Petra Kvitova and Angelique Kerber. At her best, Sharapova is better than that, but consider this a way of factoring in the substantial possibility that she’ll play much, much worse–or that she’ll get injured and her matches will be played by Carla Suarez Navarro instead. The revised forecast:

Player                  SF  Final  Title  
Simona Halep         69.9%  40.9%  24.0%  
Garbine Muguruza     59.4%  31.5%  16.5%  
Maria Sharapova      57.6%  29.5%  14.5%  
Petra Kvitova        55.6%  28.4%  14.4%  
Angelique Kerber     52.5%  26.3%  13.2%  
Agnieszka Radwanska  47.9%  22.3%   9.9%  
Lucie Safarova       32.6%  12.9%   4.9%  
Flavia Pennetta      24.7%   8.3%   2.7%

If this is a reasonably accurate estimate of Sharapova’s current ability, the Red group suddenly looks like the right place to be. Because Elo doesn’t give any particular weight to Grand Slams, it suggests that the official rankings far overestimate the current level of Safarova and Pennetta. The weakness of those two makes Halep a very likely semifinalist and also means that, in this forecast, the winner of the tournament is more likely (54% to 46%) to come from the White group.

Without Serena, and with Sharapova’s health in question, there are simply no dominant players in the field this week. If nothing else, these forecasts illustrate that we’d be foolish to take any Singapore predictions too seriously.