I specialize in building robust data pipelines and analytics systems.
- 🛠️ Core Stack: Databricks • BigQuery • Snowflake • dbt • Python • SQL
- ☁️ Cloud Platforms: Azure • GCP • AWS
- 🧬 Domain Focus: Data Engineering & Clinical Trials Databases
End-to-end batch analytics pipeline using MongoDB Atlas, MinIO, PostgreSQL, Apache Airflow, and GitHub Actions. Covers synthetic data generation, object storage partitioning, raw-to-staging warehou…
Python
A centralized data platform and star-schema warehouse for AcmeMart, consolidating siloed retail, e-commerce, and customer data from Google Drive into clean, automated analytics pipelines.
REDCap SMS API middleware that lets REDCap post text messages to an SMS provider and receive responses through one-way or two-way messaging.
PHP
Automates participant randomization and data transfers by syncing values from OpenClinica to REDCap and returning the resulting allocations to OpenClinica 4.
This project builds a scalable, end-to-end data pipeline to ingest, process, and visualise stock market data in real time through live dashboards.
Python
Data Engineering Capstone Project. Specialization: Data Engineering.
Python