👋 Hi, I’m Sunil! | Data & Analytics Engineer
🔧 What I’m Building
I bridge the gap between raw data and actionable insights. My focus is on Medallion Architecture, modular data modeling with dbt, and building the infrastructure that powers AI/LLM applications. While I am a fan of the Modern Data Stack and primarily work with batch pipelines, I love getting my hands dirty with real-time and near real-time systems, from high-frequency pollers to event-driven streaming.
💼 Background
-
Recent Applied Data Engineering grad from WeCloudData (June 2025).
-
Experience building production-grade pipelines within Snowflake-centric architectures, as well as AWS, Azure, and local OLAP environments.
🚀 Core Stack
-
Data: Snowflake, dbt, DuckDB, PostgreSQL, Delta Lake, Airflow.
-
Data Modeling: Medallion Architecture, Kimball Star Schema, Fact/Dimension Modeling.
-
Engineering: Python, Kafka, PySpark, FastAPI, Docker, systemd, Airflow, Celery, SQLAlchemy, Redis.
-
AI: RAG Systems, Vector DBs (Qdrant), LLM APIs (OpenAI/Gemini), LangChain.
📂 Featured Work
-
🎧 Spotify Data Platform: A hybrid Kafka-S3-Snowflake pipeline + a real-time playback poller.
-
🇧🇷 Olist E-commerce: End-to-end dbt transformation layer using Kimball Star Schema.
-
🤖 YouTube Sentiment: Async FastAPI/Celery pipeline processing 10k+ comments.
-
🗳️ Election Monte Carlo: Distributed Spark simulation of the 2016 Electoral College.
📝 Let's Connect I’m always down to chat about data modeling, the future of AI infra, or the best way to optimize an Airflow DAG.
🎯 Currently focused on Analytics Engineering & Data Platform opportunities.

