It has happened to many football fans: with many games still to be played, you look at the standings and wonder, "How much are things going to change by the end of the season?" More generally, the question becomes: "How similar should I expect the rankings after n games to be to the rankings at the end of the season?"
In this project, I used match results from the 2004/2005 season through the 2020/2021 season for Serie A, the Premier League and Ligue 1 in an attempt to answer the above question.
If you are interested in my findings, read the article here.
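One way to make "how similar are two rankings" concrete is a rank correlation. The sketch below is only an illustration and not necessarily the measure used in this project: it assumes Spearman's rho as the similarity score, and the function name `spearman_rho` and the team labels are hypothetical.

```python
def spearman_rho(ranking_a, ranking_b):
    """Spearman rank correlation between two orderings of the same teams.

    Each argument lists the teams from first to last place, with no ties.
    Returns 1.0 for identical orderings and -1.0 for fully reversed ones.
    """
    n = len(ranking_a)
    if n < 2:
        raise ValueError("need at least two teams")
    if len(ranking_b) != n or len(set(ranking_a)) != n or set(ranking_a) != set(ranking_b):
        raise ValueError("both rankings must list the same teams exactly once")

    # Position of every team in the second ranking.
    position_b = {team: index for index, team in enumerate(ranking_b)}

    # Sum of squared differences between the two positions of each team.
    squared_diffs = sum(
        (index - position_b[team]) ** 2 for index, team in enumerate(ranking_a)
    )
    return 1 - 6 * squared_diffs / (n * (n * n - 1))


if __name__ == "__main__":
    # Placeholder labels, not real standings.
    after_n_games = ["A", "B", "C", "D"]
    end_of_season = ["B", "A", "C", "D"]
    print(spearman_rho(after_n_games, end_of_season))
```

Values close to 1 would mean that the table after n games already looks a lot like the final table.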
To run the analysis on your machine, download the entire project and run the following command from the directory where you placed the files:
python first_game_power.py
The data scraping starts immediately. When the process finishes, you will find plots in the "saved_plots" folder and spreadsheets in the "saved_spreadsheets" folder.
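To check what a run produced, something like the following sketch could be used. It assumes the two folders are created in the directory you ran the script from; the helper name `list_outputs` is hypothetical and not part of the project.

```python
from pathlib import Path


def list_outputs(base_dir=".", folders=("saved_plots", "saved_spreadsheets")):
    """Return {folder name: sorted file names}; a missing folder maps to []."""
    base = Path(base_dir)
    found = {}
    for folder in folders:
        path = base / folder
        if path.is_dir():
            found[folder] = sorted(p.name for p in path.iterdir() if p.is_file())
        else:
            # Assumption: the folder is absent if the run has not finished yet.
            found[folder] = []
    return found


if __name__ == "__main__":
    for folder, files in list_outputs().items():
        print(f"{folder}: {len(files)} file(s)")
```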
The project requires the following packages:
- beautifulsoup
- matplotlib
- pandas
- regex
- requests
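
Before running the script, it can help to confirm that these packages are importable. The snippet below is a minimal sketch: the mapping from package name to import name is an assumption (in particular, that "beautifulsoup" refers to the package imported as `bs4`), and `missing_packages` is a hypothetical helper.

```python
from importlib.util import find_spec

# Package name as listed above -> module name used in "import ..." (assumed).
REQUIRED = {
    "beautifulsoup": "bs4",
    "matplotlib": "matplotlib",
    "pandas": "pandas",
    "regex": "regex",
    "requests": "requests",
}


def missing_packages(modules=REQUIRED):
    """Return the listed package names whose module cannot be found."""
    return [name for name, module in modules.items() if find_spec(module) is None]


if __name__ == "__main__":
    missing = missing_packages()
    if missing:
        print("Missing packages:", ", ".join(missing))
    else:
        print("All required packages are available.")
```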