GitHub - Shashank911/Train-Test-Split-Evaluation-Metrics: he objective of this task is to train a machine learning model on the Heart Disease dataset and evaluate its performance using standard evaluation metrics. The dataset contains medical attributes of patients and a target variable indicating the presence or absence of heart disease.

Task 5 – Train-Test Split & Model Evaluation 📌 Overview This repository contains the implementation of model training and evaluation on the Heart Disease dataset as part of an AI & ML Internship task. The goal of this task is to understand how machine learning models are evaluated using proper data splitting and performance metrics.

❤️ Dataset Information Dataset: Heart Disease Dataset

Problem Type: Binary Classification

Target Variable: Indicates presence (1) or absence (0) of heart disease

Features: Medical attributes such as age, cholesterol, blood pressure, etc.

🎯 Objective The objective of this task is to:

Split the dataset into training and testing sets

Train a classification model

Evaluate performance using accuracy, precision, recall, and confusion matrix

🛠 Tools & Libraries Used Python

Pandas

NumPy

Scikit-learn

⚙️ Steps Performed Loaded the dataset using Pandas

Separated features (X) and target (y)

Split data into 80% training and 20% testing

Trained a Logistic Regression model

Made predictions on test data

Evaluated model using:

Accuracy

Precision

Recall

Confusion Matrix

Classification Report

📊 Evaluation Metrics Metric Meaning Accuracy Overall correctness of predictions Precision How many predicted positive cases were actually positive Recall How many actual positive cases were correctly identified Confusion Matrix Shows TP, TN, FP, FN values F1-score Balance between precision and recall

📈 Key Insights Logistic Regression performed well on the dataset

Model shows balanced precision and recall

Confusion matrix helps understand prediction errors

Train-test split ensured the model generalizes to unseen data

📁 Repository Structure arduino Copy code Task-5-Model-Evaluation/ │ ├── heart.csv ├── Heart_Model.ipynb └── README.md 🧠 Concepts Learned Importance of train-test split

Model evaluation techniques

Understanding classification metrics

Avoiding overfitting

✅ Conclusion The Logistic Regression model was successfully trained and evaluated on the Heart Disease dataset. The evaluation metrics indicate that the model can reliably predict heart disease presence based on patient data.

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
Train-Test Split & Evaluation Metrics		Train-Test Split & Evaluation Metrics
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages