This project focuses on Exploratory Data Analysis (EDA) of the Engineering Education sector in Maharashtra.
It is a mixed academic, internship, and portfolio project conducted during my internship at Clustor Computing.
The goal of this project is to analyze, clean, visualize, and interpret education-related data to understand trends in engineering colleges, universities, student intake, and related parameters using Python and Power BI.
- Collect education sector data from multiple sources
- Clean and preprocess raw datasets
- Perform Exploratory Data Analysis (EDA)
- Extract meaningful features and patterns
- Create visualizations and dashboards for insights
- Present findings in a structured and understandable format
- Government education portals (e.g. data.gov.in)
- University and college websites
- AICTE / MSBTE related sources
- Publicly available education datasets
(Data used is non-confidential and for academic analysis purposes.)
- Python
- Pandas, NumPy
- Matplotlib, Seaborn
- Jupyter Notebook
- Power BI
- Excel
- Git & GitHub
- Project planning and scope definition
- Data collection from multiple sources
- Data cleaning (missing values, duplicates, standardization)
- Exploratory Data Analysis (EDA)
- Feature extraction and correlation analysis
- Data visualization using Python
- Dashboard creation using Power BI
- Documentation and presentation
education-sector-eda/
├── README.md
│
├── code/
│ ├── eda_analysis.ipynb
│ └── data_cleaning.py
│
├── outputs/
│ └── data_cleaning_visualization.html
│
├── dashboards/
│ └── power_bi_dashboard.pdf
│
├── reports/
│ ├── eda_feature_extraction.pdf
│ └── sprint_execution_report.pdf
│
└── presentation/
└── education_eda_presentation.pptx
- Engineering colleges and student intake vary significantly across cities
- University-wise pass-out trends show noticeable differences
- Data quality issues required careful preprocessing
- Visualization helped identify patterns not obvious in raw data
- Data collection and validation
- Data cleaning and preprocessing
- Exploratory Data Analysis using Python
- Feature extraction
- Visualization and dashboard creation
- Documentation and presentation
- Internship Project (Clustor Computing)
- Academic Project
- Data Analytics Portfolio Project
Vaibhav P Adikane
This project is created for learning, academic, and portfolio purposes and demonstrates practical application of data analytics and visualization techniques in the education domain.