Chapter 5 Introduction

The dataset covers statistics of players registered on the WTA, the matches that happened on each tour by year, with results, as well some qualifying matches for the tours.

As a reminder, we may not find all data of the matches prior to 2006.

Several interesting insights from the analysis


Players domination


  • There is no complete domination in the years 2017 - 2003 except for 2014 and 2015, which was dominated by Serena Williams

  • 1984 to 1996 was dominated by Three players Martina Navratilova, Steffi Graf and Monica Seles.1997 to around 2002 was dominated by Lindsay Davenport and Martina Hingis

  • In the analysis of the years 2000 to 2017( part of it ), Serena has dominated the Grand Slams winning 7 Aus Opens , 3 French Opens , 7 Wimbledons and 5 US Opens


Winning after losing either the First Or Second Set


  • The probablity of winning a US Open Final after losing the first set in a Final is Zero.

  • The probablity of winning a French Open Final after losing the first set in a Final is 5 %. Only Once this has happened when Jennifer Capriati defeated Kim Clijsters in 2001

  • The probablity of winning a Australian Open Final after losing the first set in a Final is 22%.

  • The probablity of winning a Australian Open Final after losing the first set in a Final is 17%.

  • The probablity of winning after losing the First Set across all matches is 16%

  • The probablity of winning after losing the Second Set across all matches is 14%


Surface Analysis


During the period 2000 - 2017 ( part of the data of 2017 is present) , the following observations are made

  • Williams sisters occupy the 1st and 2nd spots in the winners list

  • Considering all surfaces , Russians , Americans and French have won the most matches

  • On the Hard surface, Russians, Americans and French occupy the top Three positions

  • On the Clay surface, Russians, Spanish and Americans occupy the top Three positions

  • On the Grass surface, Americans,Russians and Australians and occupy the top Three positions
  • On the Carpet surface, Russians, French and Americans occupy the top Three positions


Tournament Levels Analysis


We investigate the Top Winners in the different Tournament Levels

  • Williams sisters and Maria Sharapova occupy the Top Three spots in Grand Slams

  • Elena Dementieva , Justin Henin and Martina Hingis occupy the Top Three spots in T1 tournaments. The T1 tournaments with the most matches are Miami, Indian Wells and Rome

  • Amelie Mauresmo , Lindsay Davenport and Kim Clijsters occupy the Top Three spots in T2 tournaments.The T2 tournaments with the most matches are Amelia Island, Los Angeles and Sydney


Upset Analysis


We could consider an upset if the winner rank is greater than the loser rank by at least 10. We plot the ranking difference between winners and losers for the grand slams.

  • We observe that the upsets are few in Grand Slams Finals. But some of them are there and lets find out what they are.Venus Williams has created Two upsets in Grand Slam Finals.2017 saw an upset in the US Open when Sloane Stephens defeated Venus Williams in the semifinals and Madison Keys in the finals.

  • Kim Clijsters and Daniela Hantuchova has created 2 upsets in T1 tournament finals

  • Chanda Rubin has created 2 upsets in T2 tournament finals