Atlas 🔍

Data Change Intelligence Agent

Built for the Google Cloud Rapid Agent Hackathon 2026 (Fivetran Track)

🌐 Live Demo: https://atlas-fivetran.streamlit.app/

⚡ The Problem

Data pipelines break constantly because upstream schema changes (like dropping a column or changing a data type) are made without understanding the downstream impact. Data engineers waste hours manually tracing lineage across dbt, Looker, and machine learning platforms, often discovering breakages only after executive dashboards fail.

🚀 The Solution

Atlas is an AI agent powered by Gemini 3 and Fivetran's Model Context Protocol (MCP). It proactively analyzes the impact of proposed schema changes before they happen.

Instead of just answering questions, Atlas takes action:

It validates the current schema state directly via Fivetran.
It traces the lineage of the specific column across all downstream assets (dbt, Tableau, Looker, ML features).
It determines the business criticality of the change.
It formulates a deprecation plan and drafts custom communications for affected stakeholders.
Upon user approval, it uses Fivetran to automatically soft-deprecate the column and triggers a verification sync.

🧠 Architecture & Multi-Step Reasoning

Atlas isn't a chatbot; it's an agentic workflow. When given a complex prompt (e.g., "Drop forecast_category from Salesforce and lead_source_legacy from HubSpot"), Gemini 3's advanced reasoning dynamically parallelizes its tasks:

Verify connection health and schema status via Fivetran MCP (salesforce, hubspot).
Retrieve lineage maps for both targets.
Synthesize a combined impact report across multiple data domains.
Execute multiple modify_connection_column_config tool calls.
Trigger multiple sync_connection commands to push changes to production.

flowchart TD
    User(["👤 User: Drop customer_segment from stripe.customers"])
    UI["🖥️ Streamlit UI"]
    Gemini{{"🧠 Gemini 3 — reasoning &amp; planning"}}
    subgraph Tools["Agent Tools"]
        Fivetran["🔌 Fivetran MCP<br/>connections · schema · sync"]
        Lineage["🕸️ Lineage Engine<br/>downstream impact · owners"]
        Ranker["⚙️ Semantic Ranker<br/>deterministic severity"]
    end
    Gate{"🛡️ Approval Gate<br/>human in the loop"}
    Exec["✅ Execute: soft-deprecate column<br/>+ trigger verification sync"]
    Log["📋 Change Log"]

    User --> UI --> Gemini
    Gemini -->|tool calls| Tools
    Tools -->|results| Gemini
    Gemini -->|impact report + severity| Gate
    Gate -->|reject| UI
    Gate -->|approve| Exec --> Fivetran
    Exec --> Log --> UI

🔌 Fivetran MCP Integration

Atlas talks to Fivetran through a tool layer (fivetran_tools.py) that implements the same tool names and response shapes as the official fivetran/fivetran-mcp server. It exposes six tools:

Tool	Real Fivetran endpoint
`list_connections`	`GET /v1/connections`
`get_connection_details`	`GET /v1/connections/{id}`
`get_connection_state`	`GET /v1/connections/{id}/state`
`get_connection_schema_config`	`GET /v1/connections/{id}/schemas`
`modify_connection_column_config`	`PATCH /v1/connections/{id}/schemas/{schema}/tables/{table}/columns/{column}`
`sync_connection`	`POST /v1/connections/{id}/sync`

For the demo these run against a curated in-memory fixture so judges can exercise the full lifecycle without live credentials. Because the interface matches the official MCP server exactly, it's a drop-in replacement: point the tool layer at a real Fivetran account (with API credentials) and Atlas operates on production connections unchanged.

🛟 Reliability: Smart Model Fallback + Demo Cache

Two layers keep the demo alive even under free-tier API limits:

Smart model fallback (gemini_client.py) — every Gemini call routes through smart_generate(), which walks an ordered model chain and transparently falls back on 429 (rate limit), 503 (overload), or 404 (unavailable model). The chain ends with high-quota models (gemini-1.5-flash, 1500 RPD) as a safety net.
Demo cache (demo_cache.py) — the three rehearsed scenarios (drop customer_segment, drop lead_source_legacy, the not-found discount_code) have pre-baked reports. If API quota is fully exhausted, Atlas serves the cached analysis and execution with zero API calls, so the live demo never fails. The severity badge, PII flag, and stakeholder cards are still rebuilt deterministically from the lineage layer, so a cached run renders identically to a live one.

📸 Screenshots

Atlas welcome screen with Fivetran MCP and multi-channel integrations

Impact analysis with CRITICAL severity badge and downstream asset mapping

Stakeholder notifications — send directly to Slack, Telegram, or Email

Real-time Telegram notification delivered by Atlas

Human-in-the-loop approval gate, execution result, and change log

🛠️ Built With

Gemini 3 - Advanced reasoning, planning, and multi-tool orchestration
Fivetran MCP Server - Direct integration with Fivetran's configuration API
Streamlit - Custom glassmorphic UI with dynamic state management
Python - Core logic and API integration

💻 Running Locally

Clone the repository
Install dependencies: pip install -r requirements.txt
Add your Gemini API key to a .env file: GEMINI_API_KEY="your-key"
Run the app: streamlit run app.py

📄 License

This project is licensed under the MIT License - see the LICENSE file for details.

Name		Name	Last commit message	Last commit date
Latest commit History 30 Commits
.streamlit		.streamlit
screenshots		screenshots
.gitignore		.gitignore
CLAUDE.md		CLAUDE.md
LICENSE		LICENSE
README.md		README.md
app.py		app.py
atlas.py		atlas.py
create_demo_db.py		create_demo_db.py
db_scanner.py		db_scanner.py
demo_cache.py		demo_cache.py
demo_warehouse.db		demo_warehouse.db
fivetran_tools.py		fivetran_tools.py
gemini_client.py		gemini_client.py
lineage.json		lineage.json
lineage.py		lineage.py
lineage_inference.py		lineage_inference.py
lineage_viz.py		lineage_viz.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Atlas 🔍

⚡ The Problem

🚀 The Solution

🧠 Architecture & Multi-Step Reasoning

🔌 Fivetran MCP Integration

🛟 Reliability: Smart Model Fallback + Demo Cache

📸 Screenshots

🛠️ Built With

💻 Running Locally

📄 License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Atlas 🔍

⚡ The Problem

🚀 The Solution

🧠 Architecture & Multi-Step Reasoning

🔌 Fivetran MCP Integration

🛟 Reliability: Smart Model Fallback + Demo Cache

📸 Screenshots

🛠️ Built With

💻 Running Locally

📄 License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages