flowx

ADF to Databricks Lakeflow Jobs translator via Declarative Automation Bundles.

flowx is a Claude Code plugin that converts Azure Data Factory (ADF) pipeline definitions into Databricks Lakeflow Jobs packaged as Declarative Automation Bundles (DABs). It deterministically translates known activity types and falls back to agentic LLM-assisted translation for complex or rare types.

Architecture

                         flowx Pipeline
                         ==================

  ADF JSON (UC Volumes)
        |
        v
  +------------------+
  |  1. PROFILE      |    Parse ADF ARM/JSON exports
  |  adf_loader.py   | -> Typed AST -> metadata/inventory.json
  +------------------+
        |
        v
  +------------------+
  |  2. TRANSLATE     |    Registry dispatch + topological sort
  |  engine.py       | -> Pipeline IR (deterministic + agentic gaps)
  +------------------+
        |
        v
  +------------------+
  |  3. PREPARE       |    IR -> DAB YAML + notebooks + setup scripts
  |  dab_writer.py   | -> Deployable DABs project
  +------------------+
        |
        v
  databricks bundle validate / deploy

Quick Start

Add the flowx marketplace and install the plugin in Claude Code:
```
/plugin marketplace add databricks-solutions/flowx
/plugin install flowx@flowx
```
Then run /reload-plugins to activate it.
Set up the runtime (run once):
```
/flowx:flowx-setup
```
This auto-detects your environment and prepares the right execution path — a local Python virtual environment for Claude Code, or a deployed MCP server for Databricks Genie Code. See Setup for details.

Run the end-to-end migration:

/flowx:flowx-migrate

Or run individual phases:

/flowx:flowx-discover    # Parse ADF JSON, produce inventory + complexity report
/flowx:flowx-convert     # Deterministic + agentic translation
/flowx:flowx-package     # Generate DABs project

Setup

/flowx:flowx-setup keys off the DATABRICKS_RUNTIME_VERSION environment variable (the same signal the rest of the plugin uses to detect Databricks) and prepares one of two execution paths:

Local / Claude Code (virtual environment). The phases run from the plugin's CLI. Setup runs scripts/bootstrap.sh, which creates a .venv, installs requirements.txt, and writes the resolved interpreter path to a .migration-venv marker file that the phase skills read. Optionally, a local (stdio) MCP server can be registered to drive the phases through MCP tools instead of the CLI.
Databricks Genie Code (MCP server, no virtual environment). The phases run as a single flowx MCP tool hosted on a Databricks App. Setup runs app/deploy.sh, which stages a self-contained bundle, syncs it to /Workspace/Shared/mcp-flowx, and deploys the mcp-flowx app. You then grant app/data access and register the app under Genie Code Settings → MCP Servers. No venv is created on this path.

Run setup once before any other flowx skill, or again whenever the environment is missing.

Supported ADF Activity Types

Deterministic (16 types)

ADF Activity	Databricks Task	Category
Copy	Notebook task	Data movement
DatabricksNotebook	Notebook task	Compute
DatabricksSparkJar	Spark JAR task	Compute
DatabricksSparkPython	Spark Python task	Compute
ForEach	for_each_task	Control flow
IfCondition	if_else_task	Control flow
Switch	if_else_task chain	Control flow
SetVariable	run_job_task	Control flow
AppendVariable	run_job_task	Control flow
Filter	Notebook task	Control flow
Wait	Notebook task (sleep)	Control flow
Lookup	Notebook task	Data access
WebActivity	Notebook task	External
Delete	Notebook task	Data management
ExecutePipeline	run_job_task	Orchestration
DatabricksJob	run_job_task	Compute

Agentic Fallback (12 types)

ADF Activity	Strategy
ExecuteDataFlow	LLM-assisted via adf-to-databricks-plugin
SqlServerStoredProcedure	LLM-assisted via adf-to-databricks-plugin
AzureFunction	LLM-assisted via adf-to-databricks-plugin
WebHook	LLM-assisted via adf-to-databricks-plugin
Custom	LLM-assisted via adf-to-databricks-plugin
ExecuteSSISPackage	LLM-assisted via adf-to-databricks-plugin
AzureMLExecutePipeline	LLM-assisted via adf-to-databricks-plugin
GetMetadata	LLM-assisted via adf-to-databricks-plugin
Validation	LLM-assisted via adf-to-databricks-plugin
Fail	LLM-assisted via adf-to-databricks-plugin
Script	LLM-assisted via adf-to-databricks-plugin
Until	LLM-assisted via adf-to-databricks-plugin

How It Works

Phase 1: Discover

Reads ADF JSON definitions from Unity Catalog volumes, normalizes ARM template format, parses into typed AST nodes, and classifies each activity as deterministic, agentic, or unsupported. Produces metadata/inventory.json and a per-pipeline complexity report at metadata/profile_report.csv.

Phase 2: Convert

Applies deterministic translators via registry dispatch, resolves dependencies through topological sort, and threads immutable TranslationContext through control-flow visitors. Agentic gaps are flagged for LLM-assisted translation. Produces Pipeline IR.

Phase 3: Package

Converts Pipeline IR into a deployable DABs project: databricks.yml, per-job YAML resource files, generated Python notebooks, and setup scripts for UC volumes, secrets, and connections.

Output Format

All three phases write into one shared output directory (default ./flowx_output):

flowx_output/
  databricks.yml              # Bundle configuration (package)
  resources/
    jobs/
      <pipeline_name>.yml     # One job per ADF pipeline
  src/
    notebooks/
      <pipeline_name>/
        <activity_name>.py    # Generated notebooks per activity
    setup/
      create_volumes.py       # UC volume setup
      create_secrets.py       # Secret scope setup
      create_connections.py   # Connection setup
  SETUP.md                    # Setup instructions (package)
  metadata/
    inventory.json            # discover: activity inventory
    profile_report.csv        # profile: per-pipeline complexity report
    <pipeline>.arm.json       # discover: verbatim original ADF/ARM source
    configuration.json        # modify: collected configuration answers
  .work/                      # transient intermediates (translation report, IR, gaps.json); pruned by prepare

Development

make dev          # Install dependencies
make test         # Run unit tests
make integration  # Run integration tests
make fmt          # Format + lint (ruff + mypy)
make clean        # Remove build artifacts

Prerequisites

Python 3.12+
uv package manager

These prerequisites are for contributing to the flowx project. Plugin users do not need uv — /flowx:flowx-setup provisions the runtime (a pip-based .venv locally, or the MCP server on Databricks).

Contributing

Fork the repository
Create a feature branch (git checkout -b feature/my-feature)
Follow the adding a new translator guide
Run make fmt && make test before committing
Open a pull request

Name		Name	Last commit message	Last commit date
Latest commit History 18 Commits
.claude-plugin		.claude-plugin
.github		.github
app		app
docs		docs
scripts		scripts
skills		skills
src		src
tests		tests
.build-constraints.txt		.build-constraints.txt
.gitignore		.gitignore
AGENTS.md		AGENTS.md
CHANGELOG.md		CHANGELOG.md
CLAUDE.md		CLAUDE.md
CODEOWNERS.txt		CODEOWNERS.txt
LICENSE		LICENSE
LICENSE.md		LICENSE.md
Makefile		Makefile
NOTICE.md		NOTICE.md
README.md		README.md
SECURITY.md		SECURITY.md
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

flowx

Architecture

Quick Start

Setup

Supported ADF Activity Types

Deterministic (16 types)

Agentic Fallback (12 types)

How It Works

Phase 1: Discover

Phase 2: Convert

Phase 3: Package

Output Format

Development

Prerequisites

Contributing

About

Licenses found

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Uh oh!

Folders and files

Latest commit

History

Repository files navigation

flowx

Architecture

Quick Start

Setup

Supported ADF Activity Types

Deterministic (16 types)

Agentic Fallback (12 types)

How It Works

Phase 1: Discover

Phase 2: Convert

Phase 3: Package

Output Format

Development

Prerequisites

Contributing

About

Resources

License

Licenses found

Security policy

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages