Predicting Football Player Market Value

Overview

This project predicts the market value of football players using machine learning models. By exploring the data, engineering features, and comparing several algorithms, it aims to produce accurate predictions and to show which player attributes drive value.

Introduction

Understanding the market value of football players is crucial for clubs, agents, and analysts. This project applies machine learning techniques to historical player data in order to evaluate performance and predict each player's market value.

Setup and Libraries

To reproduce the analysis or contribute to the project, install the following R packages, which the code loads at startup:

library(Boruta)
library(caret)
library(corrplot)
library(cowplot)
library(doParallel)
library(dplyr)
library(dummies)
library(gam)
library(ggplot2)
library(gridExtra)
library(lubridate)
library(randomForest)

Additional libraries may be listed in the source code.
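
Before running the analysis, it can help to check which of these packages are missing locally. The snippet below is a minimal sketch and not part of the repository; the helper name `missing_packages` is hypothetical, and the package list is copied from the list above.

```r
# Packages loaded by the analysis (copied from the list above).
required <- c("Boruta", "caret", "corrplot", "cowplot", "doParallel", "dplyr",
              "dummies", "gam", "ggplot2", "gridExtra", "lubridate",
              "randomForest")

# Hypothetical helper: return the package names that cannot be loaded locally.
missing_packages <- function(pkgs) {
  installed <- vapply(pkgs, requireNamespace, logical(1), quietly = TRUE)
  pkgs[!installed]
}

to_install <- missing_packages(required)
if (length(to_install) > 0) {
  message("Missing packages: ", paste(to_install, collapse = ", "))
  # install.packages(to_install)  # uncomment to install from CRAN
}
```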

Exploratory Data Analysis

An initial analysis was conducted to understand the dataset's structure and identify key variables:

Basic statistics and distributions of player features.
Correlation analysis to explore relationships between variables.
Visualizations to highlight trends and anomalies.
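
The three steps above can be sketched in base R as follows. This is an illustration on synthetic data, not the repository's code: the data frame `players` and the columns `age` and `value_eur_m` are assumptions, and only `overall` and `potential` are names taken from this README.

```r
set.seed(42)

# Synthetic stand-in for the player data (assumed structure).
n <- 200
players <- data.frame(
  age = sample(17:38, n, replace = TRUE),
  overall = round(rnorm(n, mean = 68, sd = 7))
)
players$potential <- players$overall + sample(0:10, n, replace = TRUE)
players$value_eur_m <- exp(0.12 * players$overall - 6 + rnorm(n, sd = 0.3))

# Basic statistics for each feature.
print(summary(players))

# Pairwise Pearson correlations between variables.
cor_matrix <- cor(players)
print(round(cor_matrix, 2))

# A mean well above the median indicates a right-skewed target.
cat("mean:", mean(players$value_eur_m),
    "median:", median(players$value_eur_m), "\n")

# Flag candidate anomalies with the 1.5 * IQR rule used by boxplots.
outliers <- boxplot.stats(players$value_eur_m)$out
cat("Outlying values flagged:", length(outliers), "\n")
```

For the visual part, `hist(log(players$value_eur_m))` or `corrplot::corrplot(cor_matrix)` would plot the distribution and the correlation matrix computed here.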

Data Preprocessing and Feature Engineering

Key steps included:

Handling missing values using imputation strategies.
Scaling and normalizing numerical features for consistency.
Creating dummy variables for categorical data.
Engineering new features such as years_remaining and player categories.
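
The steps above could look roughly like this in base R. It is a sketch on toy data under stated assumptions: the column names `wage_eur_k`, `club_position`, and `contract_valid_until`, the choice of median imputation, and the reference year 2024 are illustrative and not confirmed by the source; only `years_remaining` and `overall` appear in this README.

```r
# Toy data with assumed column names.
players <- data.frame(
  overall = c(80, 75, NA, 68),
  wage_eur_k = c(120, NA, 40, 15),
  club_position = factor(c("ST", "GK", "CB", "ST")),
  contract_valid_until = c(2026, 2025, 2027, 2024)
)

# 1. Median imputation for numeric columns (one possible strategy).
num_cols <- c("overall", "wage_eur_k")
for (col in num_cols) {
  med <- median(players[[col]], na.rm = TRUE)
  players[[col]][is.na(players[[col]])] <- med
}

# 2. Engineered feature: contract years left, relative to an assumed 2024 season.
players$years_remaining <- players$contract_valid_until - 2024

# 3. Standardize numeric features to zero mean and unit variance.
scaled <- as.data.frame(scale(players[num_cols]))

# 4. Dummy variables; "- 1" drops the intercept so every level gets a column.
dummies <- model.matrix(~ club_position - 1, data = players)

model_data <- cbind(scaled, dummies, years_remaining = players$years_remaining)
print(model_data)
```

In a real pipeline the imputation medians and scaling parameters should be computed on the training split only and then applied to the test split, so that no test information leaks into training.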

Feature Selection

Various methods were explored to select the most relevant features for modeling, including:

Stepwise Selection (AIC and BIC)
Boruta Algorithm
Recursive Feature Elimination (RFE)
Random Forest Feature Importance

The final feature set differs by modeling technique: each model uses the subset that gave it the best predictive performance.
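
As an illustration of the first method, stepwise selection with AIC and BIC can be run with the base `step()` function. The data below is synthetic and the column names `noise_a`, `noise_b`, and `log_value` are hypothetical; this is a sketch of the technique, not the repository's selection code.

```r
set.seed(1)
n <- 300

# Synthetic data: only overall and potential drive the target.
d <- data.frame(
  overall = rnorm(n), potential = rnorm(n),
  noise_a = rnorm(n), noise_b = rnorm(n)
)
d$log_value <- 1.5 * d$overall + 0.8 * d$potential + rnorm(n, sd = 0.5)

full <- lm(log_value ~ ., data = d)   # all candidate predictors
null <- lm(log_value ~ 1, data = d)   # intercept only

# Backward elimination with the AIC penalty (k = 2, the default).
step_aic <- step(full, direction = "backward", trace = 0)

# Forward selection with the BIC penalty (k = log(n)).
step_bic <- step(null, scope = formula(full), direction = "forward",
                 k = log(n), trace = 0)

print(formula(step_aic))
print(formula(step_bic))
```

BIC penalizes each extra term more heavily than AIC once n exceeds about 8, so it tends to keep fewer features. The other methods listed have their own entry points, such as `Boruta::Boruta()` and `caret::rfe()`.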

Modeling and Evaluation

Several machine learning models were trained and tuned, including:

Linear Regression: For baseline performance.
Random Forest: To capture non-linear relationships.
Artificial Neural Networks (ANNs): To model complex interactions.

Evaluation metrics:

RMSE (Root Mean Square Error)
R² (Coefficient of Determination)
MAE (Mean Absolute Error)
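
For reference, the three metrics can be written as short R functions. The function names and the toy vectors are illustrative; R² is computed here as 1 - SSE/SST.

```r
# Root mean square error: penalizes large errors more than small ones.
rmse <- function(actual, predicted) sqrt(mean((actual - predicted)^2))

# Mean absolute error: average size of the error, in the target's units.
mae <- function(actual, predicted) mean(abs(actual - predicted))

# Coefficient of determination: share of variance explained by the predictions.
r_squared <- function(actual, predicted) {
  1 - sum((actual - predicted)^2) / sum((actual - mean(actual))^2)
}

actual <- c(2, 4, 6, 8)
predicted <- c(3, 4, 5, 10)

rmse(actual, predicted)       # sqrt(1.5), about 1.2247
mae(actual, predicted)        # 1
r_squared(actual, predicted)  # 0.7
```

Note that `caret::postResample()` reports R² as the squared correlation between predictions and observations by default, which can differ from the 1 - SSE/SST definition used here.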

Hyperparameter tuning was performed to optimize model performance.
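
A minimal sketch of what such tuning can look like is a grid search scored by k-fold cross-validation. The example below tunes a small neural network from the `nnet` package on synthetic data; the choice of `nnet`, the grid values, and the variable names are assumptions for illustration, since the README does not state which implementation or grid was used.

```r
library(nnet)  # a recommended package shipped with standard R installations

set.seed(7)
n <- 200

# Synthetic, non-linear regression problem.
d <- data.frame(overall = runif(n), potential = runif(n))
d$y <- sin(3 * d$overall) + d$potential^2 + rnorm(n, sd = 0.1)

# Assign each row to one of k folds at random.
k <- 5
folds <- sample(rep(1:k, length.out = n))

# Candidate hyperparameters: hidden units and weight decay.
grid <- expand.grid(size = c(2, 5), decay = c(0.001, 0.1))

# Mean RMSE over the k held-out folds for one hyperparameter pair.
cv_rmse <- function(size, decay) {
  fold_rmse <- sapply(1:k, function(i) {
    train <- d[folds != i, ]
    test <- d[folds == i, ]
    fit <- nnet(y ~ overall + potential, data = train, size = size,
                decay = decay, linout = TRUE, maxit = 200, trace = FALSE)
    sqrt(mean((test$y - predict(fit, test))^2))
  })
  mean(fold_rmse)
}

grid$rmse <- mapply(cv_rmse, grid$size, grid$decay)
best <- grid[which.min(grid$rmse), ]
print(grid)
print(best)
```

With `caret`, which is among the loaded libraries, the same idea is usually expressed through `train()` with a `tuneGrid` and `trainControl(method = "cv")` rather than a hand-written loop.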

Results and Conclusions

Key findings:

Models that included features such as release_clause_eur_m, potential, and overall performed particularly well.
Neural networks gave the best results after hyperparameter tuning, reaching a cross-validated RMSE as low as 0.5115.

Future Work

Potential improvements include:

Incorporating additional datasets, such as player injuries or transfer market trends.
Experimenting with ensemble methods.
Deploying the model as an API for real-time market value predictions.

