scd2

Advanced Healthcare Claims Pipeline using Snowflake, Snowpipe, Streams, Tasks, SCD Type 2, and AWS S3. Automates ingestion, CDC, dimensional modeling, and data quality checks for healthcare patient and claims data.

aws cloud sql analytics tasks snowflake streams data-engineering healthcare cdc data-pipeline scd2 snowpipe

Updated Nov 10, 2025

shukla2015 / Travel_Booking_SCD2_Project

Production-grade parameterized ETL pipeline implementing SCD Type 2 for travel booking data using Databricks, Delta Lake, and ADLS — includes data quality checks, incremental fact table build, Z-Order optimization, and SQL reporting.

etl pyspark databricks scd2 delta-lake azure-data-engineering pydeequ

Updated Apr 6, 2026
Jupyter Notebook

Aayushi-Anand / SCD2_Implementation

Implementation of SCD2 for employee relocation data

etl-pipeline scd2

Updated Feb 28, 2022

ZuhairBhati / travel_bookings_pipeline

This is a data engineering pipeline built on Databricks + Delta Lake + PySpark that ingests travel booking and customer master data, applies SCD Type 2 logic, and delivers analytics-ready tables. It includes data quality enforcement, dimension versioning, fact aggregation, and performance tuning.

python analytics travel pyspark data-engineering hospitality notebooks databricks bookings etl-pipeline scd2

Updated Oct 8, 2025
Jupyter Notebook

ViinayKumaarMamidi / Databricks_Travel_Booking_SCD2_Project

This repo contains details about a travel booking project executed on Databricks.

databricks-notebooks scd2 pyspark-python databricks-workspace dataqualitycheck databricks-workflows pydeequ medallion-architecture

Updated Jan 18, 2026
Python

sushmakl95 / aws-glue-cdc-framework

Production-grade CDC pipeline: MySQL → Debezium → Kinesis → S3 → AWS Glue (PySpark) → Redshift + Postgres + OpenSearch. Multi-sink fanout with SCD2, idempotency tracking, and 13 modular Terraform modules.

Updated Apr 21, 2026
Python

sushmakl95 / lakehouse-iot-telemetry-pipeline

Multi-tenant IoT telemetry Lakehouse on Databricks + Delta Lake. PySpark, Auto Loader, DLT, medallion architecture, Terraform IaC.

Updated Apr 21, 2026
Python

sushmakl95 / dbt-bigquery-analytics-platform

Modern data stack reference: dbt + BigQuery + Airflow (Cloud Composer) with medallion layering, SCD2 snapshots, exposures, freshness SLAs, and 45× cost reduction via partition + cluster + incremental tuning.

Updated Apr 21, 2026
Python

Ewambura / zagimore-etl-pipeline

End-to-end ETL and data warehouse pipeline implementing star schema design, SCD Type 2 dimensions, and fact tables for analytical reporting. Built with SQL and structured for scalable analytics.

sql etl analytics data-modeling dimension-tables star-schema scd2 fact-table

Updated Dec 12, 2025

OsamaMustafa32 / Enterprise_Retail_Data_Lakehouse

Batch retail data lakehouse on Databricks: Delta Live Tables (bronze → silver → gold), Unity Catalog, synthetic data generator, and an executive analytics dashboard.

python sql pyspark databricks data-quality-checks etl-pipeline scd2 delta-lake data-lakehouse delta-live-tables unity-catalog medallion-architecture

Updated Apr 2, 2026
Python

ctriz / DataBricks_DLT_Pipeline

Implements a data pipeline using DLT in Databricks, with medallion layering in Delta Lake.

delta databricks scd2 delta-lake

Updated Sep 2, 2025
Python

rushal-futurense / Informatica_SCD_2

Vijay has worked at an IT company for the last 5 years. He always needs extra money for his monthly expenses, so he decided to apply for a credit card at ICICI Bank. The bank runs a background check on Vijay to determine whether he is eligible for the credit card.

informatica scd2

Updated Mar 1, 2022

chacha64 / snowflake-healthcare-pipeline

🏥 Streamline healthcare claims processing with this Snowflake pipeline, featuring auto-ingestion, CDC, SCD Type 2, and data quality checks.

aws cloud analytics azure tasks power-bi snowflake elt cdc data-pipeline azure-data-factory scd2 streaming-pipeline ml-pipelines batch-pipeline snowpipe bfsi audit-compliance

Updated Apr 23, 2026

scd2

Here are 18 public repositories matching this topic...

KaterynaD / dbt_scd2_plus

spatil6 / ETL-SCD2

ai-tech-karthik / banking-data-pipeline

akshayush / SCD2-Implementation--using-pyspark

Mairondc21 / pipeline_delta_s3

shivaranjanka / snowflake-healthcare-pipeline

shukla2015 / Travel_Booking_SCD2_Project

Aayushi-Anand / SCD2_Implementation

ZuhairBhati / travel_bookings_pipeline

ViinayKumaarMamidi / Databricks_Travel_Booking_SCD2_Project

sushmakl95 / aws-glue-cdc-framework

sushmakl95 / lakehouse-iot-telemetry-pipeline

sushmakl95 / dbt-bigquery-analytics-platform

Ewambura / zagimore-etl-pipeline

OsamaMustafa32 / Enterprise_Retail_Data_Lakehouse

ctriz / DataBricks_DLT_Pipeline

rushal-futurense / Informatica_SCD_2

chacha64 / snowflake-healthcare-pipeline
