NFL Game Predictor

Welcome to the NFL Game Predictor repository! This project uses a Random Forest Classifier to predict the outcomes of NFL games based on team statistics specifically for only the 2022-2023 Season. The predictions include confidence scores and take into account home-field advantage. This repository contains the backend machine learning model and all necessary datasets. Our front-end is hosted on Streamlit at NFL Game Predictor App.

Repository Structure

Files and Directories

Random_forest_FINAL.ipynb: The main Jupyter Notebook containing the machine learning pipeline, from data preprocessing to model training and evaluation.
Datasets:
- Individual team data files (e.g., 49ers_data.csv, bears_data.csv): Contain team-specific stats used to train and test the model.
- total_schedule.csv: Consolidated schedule of games.
- football_stats.csv: Includes aggregated and additional metrics for analysis.
Model Files:
- random_forest_model.joblib: Trained Random Forest model.
- scaler.pkl: Scaler used for feature normalization.
model_webpage.py: The Python script for deploying the model via Streamlit.
requirements.txt: Contains all the Python dependencies required to run the project.
README.md: This readme file.

Installation and Setup

Clone the Repository:

git clone https://github.com/your-username/nfl-game-predictor.git
cd nfl-game-predictor

Install Dependencies: Use the requirements.txt file to install all necessary libraries:
```
pip install -r requirements.txt
```
Run the Streamlit App: Start the Streamlit front end to visualize predictions:
```
streamlit run model_webpage.py
```
Data and Model Files: Ensure all dataset files and model files (random_forest_model.joblib and scaler.pkl) are in the correct directory.

Features

Game Prediction: Predict the winner of a matchup with a confidence score. Includes home-field advantage as a feature.
Data Visualization: Includes exploratory data analysis (EDA) visualizations, such as feature distributions, correlation matrices, and boxplots.
Streamlit Integration: A user-friendly interface for entering team matchups and viewing predictions.

How It Works

Data Preprocessing:
- Team-specific data is preprocessed, with features like TotalYards, PassYards, RushYards, and Turnovers used for training.
- Home-field advantage is included as a binary feature.
Model Training:
- A Random Forest Classifier is trained using these features, and hyperparameters are optimized for better accuracy.
Prediction:
- Users input two teams and specify the home team.
- The model predicts the winner and provides a confidence score for the prediction.

Example

To illustrate the model’s performance, we predicted the matchup between the Lions and Commanders. The model correctly predicted the Lions to win with a confidence score of 69%, aligning with the Lions’ strong performance this season and the Commanders’ recent struggles.

Future Work

Incorporating player-level data and recent performance metrics.
Expanding model features to include weather conditions, injuries, and advanced analytics.
Enhancing the user interface for a more interactive experience.

Authors

Akhil Sharma
Bhavesha Sasikumar
Jackson Cmelak
Denil Neil

Name		Name	Last commit message	Last commit date
Latest commit History 77 Commits
49ers_data.csv		49ers_data.csv
README.md		README.md
Random_forest_FINAL.ipynb		Random_forest_FINAL.ipynb
bears_data.csv		bears_data.csv
bengals_data.csv		bengals_data.csv
bills_data.csv		bills_data.csv
broncos_data.csv		broncos_data.csv
browns_data.csv		browns_data.csv
buccaneers_data.csv		buccaneers_data.csv
cardinals_data.csv		cardinals_data.csv
chargers_data.csv		chargers_data.csv
chiefs_data.csv		chiefs_data.csv
colts_data.csv		colts_data.csv
commanders_data.csv		commanders_data.csv
cowboys_data.csv		cowboys_data.csv
dolphins_data.csv		dolphins_data.csv
eagles_data.csv		eagles_data.csv
falcons_data.csv		falcons_data.csv
football_stats.csv		football_stats.csv
giants_data.csv		giants_data.csv
jaguars_data.csv		jaguars_data.csv
jets_data.csv		jets_data.csv
lions_data.csv		lions_data.csv
model_webpage.py		model_webpage.py
packers_data.csv		packers_data.csv
panthers_data.csv		panthers_data.csv
patriots_data.csv		patriots_data.csv
raiders_data.csv		raiders_data.csv
rams_data.csv		rams_data.csv
random_forest_model.joblib		random_forest_model.joblib
ravens_data.csv		ravens_data.csv
requirements.txt		requirements.txt
saints_data.csv		saints_data.csv
scaler.pkl		scaler.pkl
seahawks_data.csv		seahawks_data.csv
steelers_data.csv		steelers_data.csv
texans_data.csv		texans_data.csv
titans_data.csv		titans_data.csv
total_schedule.csv		total_schedule.csv
vikings_data.csv		vikings_data.csv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

NFL Game Predictor

Repository Structure

Files and Directories

Installation and Setup

Features

How It Works

Example

Future Work

Authors

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

NFL Game Predictor

Repository Structure

Files and Directories

Installation and Setup

Features

How It Works

Example

Future Work

Authors

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages