Football match prediction through the use of statistical analysis and machine learning

There is a large market for accurately predicting the result of football matches. Bookmakers thrive off of this by laying bets, created by their in-house models, however the information they use is public. Academics in this field have at many attempts tried to develop accurate models using machine l...

Full description

Bibliographic Details
Main Author: Duyile, Akintunde
Format: Dissertation (University of Nottingham only)
Language:English
Published: 2017
Subjects:
Online Access:https://eprints.nottingham.ac.uk/49105/
_version_ 1848797923097182208
author Duyile, Akintunde
author_facet Duyile, Akintunde
author_sort Duyile, Akintunde
building Nottingham Research Data Repository
collection Online Access
description There is a large market for accurately predicting the result of football matches. Bookmakers thrive off of this by laying bets, created by their in-house models, however the information they use is public. Academics in this field have at many attempts tried to develop accurate models using machine learning algorithms, however the majority of them have all focused on a specific range of features. Evidence suggests that football teams undergo similar trends at particular intervals during a season, therefore the aim of this study was to determine if it’s possible to produce more accurate models using a historical trend metric as a feature for machine learning algorithms. The problem was addressed by at first determining what historical trend metric to use. After different intervals, and different game sets, overall and monthly intervals was identified as the most suitable metric to use. Following this matches from a number of previous seasons and different English leagues were modified to include different form features (varying number of games considered for the form factor) and on separate datasets form features and the historic trend feature. After the datasets were prepared, they were used for K-Nearest Neighbours and Random Forest machine learning algorithms. Both the algorithms showed firstly that 7-game form dataset produced the most accurate results, and ultimately that the historical trend metric did in fact increase the accuracy of the models.
first_indexed 2025-11-14T20:11:35Z
format Dissertation (University of Nottingham only)
id nottingham-49105
institution University of Nottingham Malaysia Campus
institution_category Local University
language English
last_indexed 2025-11-14T20:11:35Z
publishDate 2017
recordtype eprints
repository_type Digital Repository
spelling nottingham-491052018-01-17T23:29:26Z https://eprints.nottingham.ac.uk/49105/ Football match prediction through the use of statistical analysis and machine learning Duyile, Akintunde There is a large market for accurately predicting the result of football matches. Bookmakers thrive off of this by laying bets, created by their in-house models, however the information they use is public. Academics in this field have at many attempts tried to develop accurate models using machine learning algorithms, however the majority of them have all focused on a specific range of features. Evidence suggests that football teams undergo similar trends at particular intervals during a season, therefore the aim of this study was to determine if it’s possible to produce more accurate models using a historical trend metric as a feature for machine learning algorithms. The problem was addressed by at first determining what historical trend metric to use. After different intervals, and different game sets, overall and monthly intervals was identified as the most suitable metric to use. Following this matches from a number of previous seasons and different English leagues were modified to include different form features (varying number of games considered for the form factor) and on separate datasets form features and the historic trend feature. After the datasets were prepared, they were used for K-Nearest Neighbours and Random Forest machine learning algorithms. Both the algorithms showed firstly that 7-game form dataset produced the most accurate results, and ultimately that the historical trend metric did in fact increase the accuracy of the models. 2017-12-14 Dissertation (University of Nottingham only) NonPeerReviewed application/pdf en https://eprints.nottingham.ac.uk/49105/1/Akin%2C%20Duyile_MScDissertation_4282308.pdf Duyile, Akintunde (2017) Football match prediction through the use of statistical analysis and machine learning. [Dissertation (University of Nottingham only)] Average Monthly Points per Game Comma Separate Value k-Nearest Neighbour classifier Multinomial Logistic Regression Naïve Bayes classifier Positive Predictive Value Random Forest Support Vector Machine classifier True Positive Rate
spellingShingle Average Monthly Points per Game
Comma Separate Value
k-Nearest Neighbour classifier
Multinomial Logistic
Regression
Naïve Bayes classifier
Positive Predictive Value
Random Forest
Support Vector Machine classifier
True Positive Rate
Duyile, Akintunde
Football match prediction through the use of statistical analysis and machine learning
title Football match prediction through the use of statistical analysis and machine learning
title_full Football match prediction through the use of statistical analysis and machine learning
title_fullStr Football match prediction through the use of statistical analysis and machine learning
title_full_unstemmed Football match prediction through the use of statistical analysis and machine learning
title_short Football match prediction through the use of statistical analysis and machine learning
title_sort football match prediction through the use of statistical analysis and machine learning
topic Average Monthly Points per Game
Comma Separate Value
k-Nearest Neighbour classifier
Multinomial Logistic
Regression
Naïve Bayes classifier
Positive Predictive Value
Random Forest
Support Vector Machine classifier
True Positive Rate
url https://eprints.nottingham.ac.uk/49105/