This repository contains an exploratory data analysis of delivery performance using the Brazilian Olist e-commerce dataset.
The main focus of the project is to understand delivery time patterns, variability, and potential sources of delay from a customer perspective.
The analysis emphasizes clear reasoning, data validation, and distribution-based interpretation rather than relying solely on single summary metrics.
The project uses the Brazilian E-Commerce Public Dataset by Olist, which includes information about orders, delivery timestamps, sellers, and customer locations.
Only delivered orders are used for delivery time analysis. Records with missing or logically inconsistent timestamps are separated and reported instead of being silently removed.
The analysis is structured into multiple focused objectives:
-
Delivery Time Distribution
- Understand the typical delivery duration and its variability
- Define relative fast and slow deliveries using median and interquartile range (IQR)
-
On-Time vs Late Delivery Analysis
- Compare actual delivery times with estimated delivery dates
-
Review Score between Late Deliveries vs Early Deliveries
- Explore whether delivery performance between Late Deliveries and Early Deliveries has same average review score or not
Each objective is implemented as a separate notebook to keep the analysis clear and self-contained.
data/
raw/
Original datasets
reports/
Data quality reports (missing or invalid records)
notebooks/
00_project_overview.ipynb
01_delivery_time_analysis.ipynb
02_on_time_vs_late.ipynb
03_delivery_review_score_analysis.ipynb
- Data validation is treated as part of the analysis rather than a preprocessing shortcut.
- Missing or logically inconsistent records are separated for transparency.
- Robust statistics such as the median and interquartile range are used to reduce distortion from extreme values.
- Delivery performance is described relative to the observed data distribution rather than external benchmarks.
If this is your first time viewing the project:
- Start with notebooks/00_project_overview.ipynb
- Continue to notebooks/01_delivery_time_analysis.ipynb for the first completed objective
- Additional notebooks build on this foundation
Kaggle Notebook
Objective 1: https://www.kaggle.com/code/kevinh28/olist-delivery-time-analysis
Objective 2: https://www.kaggle.com/code/kevinh28/02-on-time-vs-late
Objective 3 : https://www.kaggle.com/code/kevinh28/olist-delivery-review-score-analysis