@kreuzberg-dev

Kreuzberg

Polyglot document intelligence with a Rust core — extract structured data from 97+ formats

Pinned

  1. kreuzberg Public

    A polyglot document intelligence framework with a Rust core. Extract text, metadata, images, and structured information from PDFs, Office documents, images, and 97+ formats. Available for Rust, Pyt…

    Rust 7.6k 380

  2. html-to-markdown Public

    High performance and CommonMark compliant HTML to Markdown converter. Maintained by the Kreuzberg team. Kreuzberg is a fast, polyglot document intelligence engine with a Rust core. It extracts stru…

    HTML 658 55

  3. tree-sitter-language-pack Public

    Comprehensive tree-sitter grammar compilation with polyglot bindings — Rust, Python, Node.js, Go, Java, Ruby, Elixir, PHP, C#, WASM, and CLI. 305+ languages.

    Rust 334 53

  4. liter-llm Public

    Universal LLM API client: 142+ providers, 11 native language bindings, powered by a Rust core

    Rust 148 9

  5. alef Public

    Generate fully-typed, lint-clean language bindings for Rust libraries across 11 languages

    Rust 9

  6. kreuzcrawl Public

    Rust 5

Repositories

Showing 10 of 18 repositories
  • alef Public

    Generate fully-typed, lint-clean language bindings for Rust libraries across 11 languages

    Rust 9 MIT 0 1 1 Updated Apr 19, 2026
  • homebrew-tap Public
    Ruby 0 1 1 0 Updated Apr 19, 2026
  • kreuzcrawl Public
    Rust 5 0 0 0 Updated Apr 19, 2026
  • kreuzberg Public

    A polyglot document intelligence framework with a Rust core. Extract text, metadata, images, and structured information from PDFs, Office documents, images, and 97+ formats. Available for Rust, Python, Ruby, Java, Go, PHP, Elixir, C#, R, C, and TypeScript (Node/Bun/Wasm/Deno), or use it via the CLI, REST API, or MCP server.

    Rust 7,596 380 11 (2 issues need help) 2 Updated Apr 19, 2026
  • liter-llm Public

    Universal LLM API client: 142+ providers, 11 native language bindings, powered by a Rust core

    Rust 148 MIT 9 2 1 Updated Apr 19, 2026
  • html-to-markdown Public

    High performance and CommonMark compliant HTML to Markdown converter. Maintained by the Kreuzberg team. Kreuzberg is a fast, polyglot document intelligence engine with a Rust core. It extracts structured data from 56+ document formats using streaming parsers and built-in OCR.

    HTML 658 MIT 55 2 3 Updated Apr 19, 2026
  • kreuzberg-txtai Public

    Kreuzberg integration for txtai — drop-in Textractor replacement and custom pipeline

    Python 1 MIT 0 0 0 Updated Apr 19, 2026
  • kreuzberg-spring-ai Public

    Spring AI DocumentReader integration for Kreuzberg document extraction engine

    Java 1 MIT 0 1 0 Updated Apr 19, 2026
  • kreuzberg-crewai Public

    Extract text and metadata from 88+ document formats — PDF, DOCX, XLSX, HTML, images with OCR, and more — directly from your CrewAI agents.

    Python 1 MIT 0 0 1 Updated Apr 19, 2026
  • kreuzberg-surrealdb Public

    Extract, chunk, and embed documents from 88+ formats directly into SurrealDB.

    Python 11 MIT 1 0 0 Updated Apr 19, 2026
