A production-grade machine learning pipeline to predict EUR/USD forex rates. This project demonstrates a complete End-to-End MLOps workflow, featuring automated daily retraining, cloud-native infrastructure, and a robust Continuous Deployment (CD) pipeline on AWS.
The system operates on a Hybrid Cloud/Local architecture designed for cost-efficiency and scalability. It uses a Cloud-First, Local-Mirror data strategy powered by the custom `DataManager`.
```mermaid
graph TD
    subgraph "AWS Infrastructure"
        S3[(S3 Bucket)]
        RDS[(RDS Postgres)]
        subgraph "Ingestion Layer"
            Lambda[AWS Lambda Data Ingest] -->|Daily Rates| S3
        end
        subgraph "Training Layer"
            EC2_Train[EC2 Retraining Worker]
            EC2_Train -->|Read Data| S3
            EC2_Train -->|Log Metrics| RDS
            EC2_Train -->|Save Artifacts| S3
        end
        subgraph "Serving Layer"
            EC2_API[EC2 Flask API]
            EC2_API -->|Load Champion Model| S3
            EC2_API -->|Predict| EndUser((User))
        end
    end
    Lambda -->|"Trigger (EventBridge)"| EC2_Train
    EC2_Train -.->|Register Model| S3
    EC2_Train -.->|Track Experiment| RDS
```
- Flask API Application: Hosted on a persistent EC2 instance. It loads the "Champion" model from S3 and serves real-time predictions.
- Data Ingestion Lambda: A serverless function triggered Mon-Fri to fetch the latest EUR/USD rates and update the "Raw" data in S3.
- Retraining Worker: An EC2 instance that spins up automatically (triggered by EventBridge after ingestion), executes the retraining pipeline, updates the champion model if performance improves, and then shuts down to save costs.
- 🧬 Unified Data Layer: The `DataManager` handles seamless synchronization between local development and S3.
- 🔄 Automated Daily Retraining: Models (Linear Regression, ARIMA, LSTM) are retrained daily on the freshest data.
- 📊 Experiment Tracking: Full MLflow integration. RDS stores parameters/metrics, while S3 stores model artifacts.
- 🐳 Containerized Deployment: All components (API, Retraining, Ingestion) are dockerized. The Retraining image is stored in AWS ECR.
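The RDS/S3 split in the MLflow setup comes down to two settings: a Postgres URI as the backend store (runs, params, metrics) and an S3 path as the artifact root (serialized models). A minimal sketch, with placeholder host, database, and bucket names:

```python
def pg_tracking_uri(user: str, password: str, host: str, db: str,
                    port: int = 5432) -> str:
    """Build the Postgres URI MLflow uses as its backend store."""
    return f"postgresql://{user}:{password}@{host}:{port}/{db}"

# Placeholders, not real credentials:
uri = pg_tracking_uri("mlflow_user", "<password>",
                      "<rds-endpoint>.rds.amazonaws.com", "mlflow_db")

# In the training code this URI would be handed to MLflow, with S3 holding
# the artifacts, e.g.:
#   mlflow.set_tracking_uri(uri)
# or when running a tracking server:
#   mlflow server --backend-store-uri <uri> \
#                 --default-artifact-root s3://<bucket>/mlflow
```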
- Setup Environment

  ```bash
  git clone <repo-url>
  cd eurusd-capstone
  python3.11 -m venv venv
  source venv/bin/activate
  pip install -r requirements.txt
  ```
- Run Data Pipeline

  The `DataManager` will check for missing data and sync from S3 if credentials are present.

  ```bash
  python src/01_ingest_data.py
  python src/02_preprocess.py
  ```
- Train Models Locally

  Changes to features or models are tracked by MLflow.

  ```bash
  python ml_pipeline.py
  ```
Prerequisite: You need an active AWS account and `aws-cli` configured locally with administrative permissions.
Use the scripts in scripts/infra_setup and scripts/mlops_utils to provision the base environment.
A. Storage & MLflow
Configures RDS (tracking backend) and S3 (artifact store).

```bash
sh scripts/mlops_utils/setup_mlflow_aws.sh
```

B. ECR Repository
Creates the registry for the retraining Docker image.

```bash
sh scripts/infra_setup/setup_ecr_retrain.sh
```

A. Data Ingestion (Lambda)

```bash
sh scripts/deployment/deploy_lambda_ingest.sh
```

B. Retraining Worker (EC2)
Builds the Docker image, pushes it to ECR, and configures the EC2 launch template.

```bash
sh scripts/deployment/deploy_retrain_ec2.sh
```

C. Inference API (EC2)
Deploys the Flask app to a persistent EC2 instance.

```bash
sh scripts/deployment/deploy_flask_api.sh
```

📖 Full Deployment Details: See the AWS Deployment Guide for step-by-step instructions.
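The Mon-Fri schedule these scripts wire up could resemble the following AWS CLI calls; the rule name, cron time, and ARN placeholders are illustrative assumptions, not values from the project:

```shell
# Illustrative only: trigger the ingestion Lambda Mon-Fri at 06:00 UTC.
aws events put-rule \
  --name eurusd-daily-ingest \
  --schedule-expression "cron(0 6 ? * MON-FRI *)"

aws events put-targets \
  --rule eurusd-daily-ingest \
  --targets "Id=ingest-lambda,Arn=arn:aws:lambda:<region>:<account-id>:function:<ingest-fn>"
```

A second rule (or an event emitted by the Lambda itself) would then start the retraining worker after ingestion completes.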
```
eurusd-capstone/
├── api/                      # Serving API (Flask) source code
├── data/                     # Local data cache (mirrors S3 structure)
├── docs/                     # 📚 Detailed documentation
│   ├── architecture/         # System design & data flows
│   ├── deployment/           # AWS & Docker deployment guides
│   ├── guides/               # Service manuals
│   ├── debug/                # Debugging logs & notes
│   └── DOCUMENTATION_MAP.md  # Index of all docs
├── notebooks/                # Jupyter notebooks for EDA & prototyping
├── scripts/                  # Automation & DevOps scripts
├── src/                      # Core ML source code
│   ├── 01_ingest_data.py     # Data fetching script
│   ├── 02_preprocess.py      # Feature engineering script
│   ├── 03_train_models.py    # Model training script
│   ├── 04_evaluate_select.py # Model evaluation & promotion
│   └── ml_pipeline.py        # Main pipeline orchestrator
├── utils/                    # Shared utilities (DataManager, Logger)
└── tests/                    # Unit & integration tests
```
- Maintainer: [Your Name]
