The vast majority of my men’s and women’s tennis results and stats databases are now free for anyone who wants to use them.
ATP Results and Stats:
- Tour-level results back to 1968, with tons of data on both players in each match (age, handedness, country, rank), and matchstats from 1991-present.
- Almost a decade of tour-level qualifying matches, with matchstats for the last few years.
- Challenger results back to 1991, with matchstats for almost the last ten years.
- Futures (and Satellite) results back to 1991.
- Linked biographical and rankings data (introduced here).
- Tour-level results back to 1968, with the same player data as in the ATP files.
- Tour-level qualifying matches.
- Over 220,000 ITF main-draw matches.
Click the links to access the files. Enjoy!
Today I’ve made available my entire ATP and WTA ranking databases through the end of the 2014 season. In addition, you’ll find my complete player tables, which include birthdate, country, and handedness for every player who has ever been ranked or played a tour-level match. (Plus thousands more players, who are included in the database for other reasons.)
This is all the data you need to research all sorts of topics, like the rise and fall of certain countries in the rankings and the changing age of top 10s, 50s, and 100s.
This is the third major dataset I’ve published this week, and more is on the way.
ATP rankings are here, and WTA rankings are here. Enjoy!
In the last year and a half, dozens of contributors and I have amassed detailed shot-by-shot records of nearly 700 professional matches. You can see the full list here, or a menu sorted by player here.
I refer to this as The Match Charting Project, and I hope you’ll consider contributing as well. Using a straightforward text notation system, we record shot type, shot direction, return depth, error types, and more. The more matches, the more interesting the results. The project made up part of my presentation at the Sloan Sports Analytics Conference last month, which included some very preliminary findings on player tendencies.
Now, you can dig into the raw data yourself. I’ve posted all of the user-submitted match charts in one place, in a standardized format for anyone who wants to mess around with it.
I’ve been doing a lot of griping lately about the state of tennis data, so I figured now was a good time to start doing something about it.
I’ve just released point-by-point data for most Grand Slam singles matches back to 2011. Beyond the basic point sequence–which is valuable in and of itself–you’ll find serve speed, winner type, and for a few of the slams, rally length for each point.
More detailed notes on the data are available at that link. Enjoy, and if working with it turns up any interesting findings, please let me know.
Last weekend at the Sloan Sports Analytics Conference in Boston, I gave a talk, “First Service: The Advent of Actionable Tennis Analytics.” The presentation was in three parts:
- The sorry state of tennis data
- Schedule optimization (based in part on this blog post)
- The Match Charting Project (more about that in this post, among others)
The conference video-recorded all presentations, and I understand that video will be posted on the Sloan site. When it becomes available, I’ll post a link here.
In the meantime, many people have asked for my slide deck: First Service.
Also, Jim Pagels wrote a brief piece for Forbes drawing on my talk, which you can read here.