Research on Applying Machine Learning to Improve Player Valuation For Scouting in a Football Team

The Dashboard
The Dashboard

This project aim to explore how machine learning can enhance football scouting by improving the way players are valued and shortlisted for recruitment. Traditional scouting relies heavily on intuition and manual analysis, which becomes increasingly challenging as global talent pools and transfer spending continue to grow. My work proposes a data-driven recommendation system that supports scouts with objective, interpretable insights.

The system integrates season-level performance statistics with event-level match data. A Variational Autoencoder with a Gamma mixture prior (VAE-Gamma) is used for representation learning, transforming high-dimensional and sparse player statistics into meaningful latent profiles. These representations enable robust similarity searches, outperforming raw statistical comparisons when benchmarked against FBref similarity rankings.

To go beyond individual performance, the project involves simulation by incorporating the VAEP (Valuing Actions by Estimating Probabilities) framework. Building on VAEP, Joint Offensive Impact (JOI) and Joint Defensive Impact (JDI) metrics are used to model player chemistry and team fit. A two-stage pipeline first identifies similar players, then refines recommendations based on chemistry with an existing squad.


Key Results