This project encompasses comprehensive analyses to optimize the performance of NFL player Justin Hillard. Utilizing Random Forest Regression in Python, the project aims to provide insights into various performance metrics.
- Performance Analysis: Calculate the relationship between general features and overall performance.
- Sleep and Performance Correlation: Examine the relationship between sleep patterns and next-day performance.
- Training Effectiveness: Assess whether the targeted performance from training sessions is being achieved.
The dataset used in this project is the NFL Daily Performance Dataset by Justin Hilliard, which includes comprehensive information on NFL athlete's daily performances.
To get started with this project, follow these steps:
- Clone the repository:
git clone https://github.com/hootbu/NFL-Athlete-Performance.git
- Navigate to the project directory:
cd NFL-Athlete-Performance
- Install the necessary dependencies:
pip install -r requirements.txt
- Start Jupyter Notebook:
Jupiter notebook
- Open the
Performance.ipynb
file and run.
- Average RPE vs. Scripted Intensity
This graph shows the average RPE (Rate of Perceived Exertion) for different training intensities: Light, Medium, and High.

-
When the training is Light, the average RPE is low.
-
For Medium intensity, the RPE is higher.
-
For High intensity, the RPE is the highest.
This means the athlete feels more tired when the training is harder. The athlete's feelings match the training plan.
- Sleep’s Effect on Overall Performance
This graph shows whether sleep affects the athlete’s performance the next day. It compares average sleep hours with the overall performance score.

If the athlete sleeps more, the performance score is usually a little higher.
However, the correlation is not very strong (correlation between totalSleep and Overall = 0.27). This means sleep helps, but it is not the only important factor.
Other things like nutrition, training, and recovery are also important for good performance.
A big thank you to Justin Hilliard for providing the dataset on Kaggle.