Bank Marketing Lead Conversion Prediction

This project applies Machine Learning Engineering techniques to the Bank Marketing dataset from the UCI repository to predict whether a customer will subscribe to a term deposit after a marketing campaign.

The primary goal is to build a classification model and serve it through a scalable API, complemented by an interactive web interface for real-time predictions.

Quick Start

This script automates the setup and runs the data processing and model training pipelines.

# 1. Clone the repository
git clone https://github.com/luuisotorres/FIAP-Tech-Challenge-3.git
cd FIAP-Tech-Challenge-3

# 2. Install dependencies and set up the environment
uv sync
cp .env.example .env

# 3. Run the data and training pipelines
uv run -m scripts.make_dataset
uv run -m scripts.train_model

# 4. Launch the API and Streamlit app in separate terminals
# Terminal 1:
uv run uvicorn src.app.api:app --reload

# Terminal 2:
uv run -m streamlit run frontend/streamlit_app.py

Features

FastAPI Backend: A high-performance API to serve the machine learning model.
Streamlit Frontend: An interactive web application for making predictions without technical knowledge.
Automated ML Pipeline: Scripts to process data and train the model, ensuring reproducibility.
Environment Management: Uses uv for fast and reliable dependency and environment management.

Tech Stack

Python: Core programming language.
FastAPI: For building the prediction API.
Streamlit: For the interactive user interface.
Scikit-learn & LightGBM: For model training and building pipelines.
Pandas: For data manipulation and processing.
Uvicorn: ASGI server for the FastAPI application.
uv: For environment and package management.

Project Structure

The project is organized to separate concerns, making it modular and scalable:

.
├── assets/              # Screenshots and images for documentation
├── data/                # Datasets (raw, interim, processed)
├── frontend/            # Streamlit application source code
├── models/              # Trained model artifacts
├── notebooks/           # Jupyter notebooks for EDA, model training, and pipeline building
├── scripts/             # Standalone scripts for data processing and model training
├── src/                 # Main source code
│   ├── app/             # FastAPI application (API endpoints, schemas)
│   ├── data/            # Data handling modules
│   └── models/          # Model-related modules (training, prediction)
├── .github/             # GitHub Actions workflows
├── pyproject.toml       # Project metadata and dependencies
└── README.md            # This file

Getting Started

1. Environment Setup

This project uses uv for dependency and virtual environment management.

Clone the repository:

git clone https://github.com/luuisotorres/FIAP-Tech-Challenge-3.git
cd FIAP-Tech-Challenge-3

Install dependencies: This command creates a virtual environment (.venv) and installs all required packages from pyproject.toml.

uv sync

Activate the virtual environment:

source .venv/bin/activate   # macOS/Linux
.venv\Scripts\activate      # Windows

2. Environment Variables

The project requires a .env file for configuration. Copy the example file to create your own:

cp .env.example .env

The .env file contains:

FASTAPI_URL: The URL where the FastAPI backend is running (default: http://127.0.0.1:8000).

How to Run

1. Run the ML Pipeline

First, process the data and train the model using the pipeline scripts.

Generate the dataset: This script downloads the raw data and creates a clean version.

uv run -m scripts.make_dataset

Train the model: This script trains the LightGBM model and saves the artifact.

uv run -m scripts.train_model

2. Launch the API

Run the FastAPI application using Uvicorn. The --reload flag enables hot-reloading for development.

uv run uvicorn src.app.api:app --reload

The API will be available at http://127.0.0.1:8000.

3. Launch the Streamlit App

In a new terminal, run the Streamlit frontend application.

uv run -m streamlit run frontend/streamlit_app.py

The application will open in your browser, ready to make predictions.

API Usage

The API provides several endpoints to interact with the model and data.

API Documentation

Interactive documentation is available at:

Swagger UI: http://127.0.0.1:8000/docs
ReDoc: http://127.0.0.1:8000/redoc

Endpoints

GET /: Welcome page.
GET /download: Downloads and processes the raw dataset.
GET /dataset: Shows a preview of the dataset.
POST /predict: Makes a prediction based on input features.

Prediction Example:

Here's an example of a POST request to the /predict endpoint and the corresponding response.

Data Preview:

Here's a screenshot of dataset preview.

Streamlit Application

The Streamlit app provides an intuitive interface to predict lead conversion. Fill in the customer details in the sidebar and click "Predict Lead Outcome."

Main Interface:

Prediction Result: The result shows whether the lead is likely to subscribe, along with the prediction probability.

Authors

This project was developed for the Machine Learning Engineering Postgraduate Program by:

Izabelly de Oliveira Menezes | Github
Larissa Diniz da Silva | Github
Luis Fernando Torres | Github
Rafael dos Santos Callegari | Github
Renato Massamitsu Zama Inomata | Github

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Bank Marketing Lead Conversion Prediction

Quick Start

Table of Contents

Features

Tech Stack

Project Structure

Getting Started

1. Environment Setup

2. Environment Variables

How to Run

1. Run the ML Pipeline

2. Launch the API

3. Launch the Streamlit App

API Usage

API Documentation

Endpoints

Streamlit Application

Authors

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 40 Commits
assets		assets
data		data
frontend		frontend
models		models
notebooks		notebooks
scripts		scripts
src		src
.env.example		.env.example
.gitattributes		.gitattributes
.gitignore		.gitignore
.python-version		.python-version
LICENSE		LICENSE
README.md		README.md
main.py		main.py
pyproject.toml		pyproject.toml
uv.lock		uv.lock

Folders and files

Latest commit

History

Repository files navigation

Bank Marketing Lead Conversion Prediction

Quick Start

Table of Contents

Features

Tech Stack

Project Structure

Getting Started

1. Environment Setup

2. Environment Variables

How to Run

1. Run the ML Pipeline

2. Launch the API

3. Launch the Streamlit App

API Usage

API Documentation

Endpoints

Streamlit Application

Authors

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages