VeriSafe is a hybrid fraud detection system designed to identify suspicious financial transactions using a combination of machine learning models and rule-based validation techniques. The system analyzes transactional data to detect anomalous patterns and classify transactions as fraudulent or legitimate.
The project demonstrates how predictive models can be integrated into a modular backend architecture to support real-time fraud detection workflows.
- Hybrid fraud detection pipeline combining Machine Learning + Rule-Based Validation
- Implementation of Random Forest and XGBoost classifiers
- Feature engineering and anomaly detection on transaction datasets
- Flask ML microservice for fraud prediction
- Spring Boot backend for API communication
- Modular architecture enabling scalable fraud detection systems
- Transaction data is submitted through the frontend.
- The backend processes the request and sends transaction data to the ML microservice.
- The machine learning model analyzes the transaction features.
- Fraud probability is calculated.
- The result is returned to the backend and displayed to the user.
- Handling missing values
- Data normalization
- Cleaning transaction records
Key behavioral features are extracted from transaction data to improve model performance.
Two machine learning models are trained for fraud classification:
- Random Forest
- XGBoost
Models are evaluated using:
- Accuracy
- Precision
- Recall
- F1 Score
The hybrid pipeline improved fraud detection performance by approximately 12% compared to baseline models.
- Python
- Scikit-learn
- Random Forest
- XGBoost
- Pandas
- NumPy
- Java
- Spring Boot
- REST APIs
- Flask
- React
- JavaScript
- Git
- Postman
- Linux
The fraud detection models are trained using transactional datasets containing financial transaction records.
Typical dataset features include:
- Transaction amount
- Transaction timestamp
- Customer identifier
- Merchant information
- Transaction location
- Fraud label (fraudulent / legitimate)
- Removal of missing or corrupted records
- Normalization of numerical features
- Encoding categorical variables
- Feature extraction for model training
These preprocessing steps help improve the accuracy and reliability of the fraud detection models.
git clone https://github.com/Zamishi/FraudDetection.git
cd FraudDetectioncd ml-service
pip install -r requirements.txt
python app.pymvn spring-boot:runThe backend handles API requests and communicates with the ML service.
cd frontend
npm install
npm startPotential improvements for the system include:
- Graph-based fraud detection techniques
- Explainable AI methods such as SHAP and LIME
- Real-time fraud detection for streaming transaction data
- Cloud deployment for scalable inference
This project forms the implementation base for the research paper:
Hybrid AI Approach for Fraud Detection: Integrating Rule-Based Systems with Machine Learning and Graph Analysis
Accepted at ICAICCIT 2025.