Hello, folks! 👋
🌱 I’m a Senior Data & Analytics Engineer with a strong foundation in Computer Science Engineering and 6.5+ years of industry experience building scalable, production-grade data systems.
I specialize in designing and implementing robust data platforms, ETL/ELT pipelines, and analytics systems using Python, PySpark, SQL, GCP, databricks, GCP.
Currently, I work as a Senior Data Engineer at Thoughtworks, contributing to large-scale enterprise data solutions.
🌱 Core Expertise
- Python, PySpark, SQL, OOPS
- Data Engineering System Design
- Data Modeling (Dimensional & Analytical Models)
🌱 Data Engineering
- Batch & Streaming Data Pipelines
- ETL / ELT Design & Optimization
- Kafka-based event streaming
- Workflow orchestration using Airflow
- Databricks (Jobs, Delta Lake, Pipelines)
🌱 Cloud Platforms
- Google Cloud Platform (GCP): BigQuery, GCS, Dataflow, Looker Studio
- AWS: S3, Lambda, SNS, Glue, Redshift, RDS, Athena
🌱 Databases & Platforms
- BigQuery, Snowflake
- Relational and analytical data stores
- Structured, semi-structured, and streaming data
🌱 Data Analysis
- Exploratory Data Analysis (EDA) using Python & SQL
- Business-driven case studies
- Data visualization with Tableau
🌱 Tools & Development
- Git, Bitbucket
- VS Code, PyCharm, RStudio
Outside the world of data and laptops:
- 🏋️♀️ I’m passionate about fitness and health
- 🍳 I enjoy cooking
- 🎻 I play the violin
- 📚 I’m deeply interested in learning, reading, and continuous growth
- Successfully earned the Google Cloud Professional Data Engineer and Digital Leader certifications.
- Bagged 𝗧𝗲𝗮𝗺 𝗔𝘄𝗮𝗿𝗱 for exceptional performance in 2024.
- Awarded the 𝗦𝗵𝗶𝗻𝗶𝗻𝗴 𝗦𝘁𝗮𝗿 𝗔𝘄𝗮𝗿𝗱 for outstanding performance in 2023 and 2024.
- Honored with the 𝗦𝘁𝗿𝗶𝗱𝗲 𝘁𝗵𝗲 𝗧𝗿𝗶𝗱𝗲 𝗔𝘄𝗮𝗿𝗱 for independently managing and developing a key tool during 2021–2022.
