Add DataFlow to the list of AI frameworks #18

Open: Jununn wants to merge 1 commit into pracdata:main from Jununn:patch-1

Jununn commented Jun 1, 2026

Adds [DataFlow](https://github.com/OpenDCAI/DataFlow) to the **LLMOps** section.

## What it is

DataFlow is an open-source data-centric AI platform for LLM data preparation, synthetic data generation, and AI/data pipelines. It provides reusable skills, operator-based pipelines, and a WebUI for constructing and executing data workflows for AI tasks.
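For reviewers unfamiliar with the term, here is a minimal sketch of what an "operator-based pipeline" means in general. It is not DataFlow's API: every name in it (`strip_whitespace`, `drop_short`, `dedupe`, `run_pipeline`) is hypothetical and chosen only for illustration. It assumes each operator is a plain function that takes a list of records and returns a new list.

```python
from typing import Callable

# Illustrative only: these names are hypothetical and are NOT DataFlow's API.
Record = dict[str, str]
Operator = Callable[[list[Record]], list[Record]]


def strip_whitespace(records: list[Record]) -> list[Record]:
    """Refinement operator: trim leading/trailing whitespace from each text."""
    return [{**r, "text": r["text"].strip()} for r in records]


def drop_short(min_chars: int) -> Operator:
    """Build a filtering operator that drops texts shorter than min_chars."""
    def op(records: list[Record]) -> list[Record]:
        return [r for r in records if len(r["text"]) >= min_chars]
    return op


def dedupe(records: list[Record]) -> list[Record]:
    """Filtering operator: keep only the first occurrence of each text."""
    seen: set[str] = set()
    unique: list[Record] = []
    for r in records:
        if r["text"] not in seen:
            seen.add(r["text"])
            unique.append(r)
    return unique


def run_pipeline(records: list[Record], operators: list[Operator]) -> list[Record]:
    """Apply each operator in order, feeding its output to the next one."""
    for op in operators:
        records = op(records)
    return records


raw = [{"text": "  What is RAG?  "}, {"text": "ok"}, {"text": "What is RAG?"}]
cleaned = run_pipeline(raw, [strip_whitespace, drop_short(5), dedupe])
print(cleaned)
```

Because each step shares the same input and output shape, operators can be reordered or reused across pipelines, which is the property the description above refers to.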

## Position in the list

I placed it in the LLMOps section because DataFlow focuses on data preparation, generation, filtering, refinement, and reusable pipelines for LLM training, fine-tuning, and RAG workflows.

## Project status

- Licensed under Apache-2.0.
- Keywords: data-centric AI, LLM data preparation, synthetic data generation, AI/data pipelines, reusable skills, operator-based workflows.
- Can be used as a Python package or via Docker.
- Includes a WebUI for visual pipeline construction via `dataflow webui`.
- Includes DataFlow-Skills for operator development, pipeline construction, and data-centric AI workflows.