Skip to content
Change the repository type filter

All

    Repositories list

    • starVLA

      Public
      StarVLA: A Lego-like Codebase for Vision-Language-Action Model Developing
      Python
      Other
      263000Updated May 5, 2026May 5, 2026
    • AutoGEO

      Public
      [ICLR'26] AutoGEO: a Generative Engine Optimization framework to automatically learn generative engine preferences, and rewrite web contents for more traction.
      Python
      MIT License
      1413800Updated May 1, 2026May 1, 2026
    • 0000Updated Apr 27, 2026Apr 27, 2026
    • SkillLearnBench is the first benchmark for evaluating continual learning methods that automatically generate agent skills.
      Python
      MIT License
      01410Updated Apr 24, 2026Apr 24, 2026
    • Benchmark Test-Time Scaling of General LLM Agents
      Python
      MIT License
      11800Updated Apr 14, 2026Apr 14, 2026
    • AgentWebBench: Benchmarking Multi-Agent Coordination in Agentic Web
      MIT License
      0210Updated Apr 13, 2026Apr 13, 2026
    • Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
      Python
      Apache License 2.0
      8.7k000Updated Apr 6, 2026Apr 6, 2026
    • ai-search

      Public
      0000Updated Mar 31, 2026Mar 31, 2026
    • OLMo

      Public
      Modeling, training, eval, and inference code for OLMo
      Python
      Apache License 2.0
      753000Updated Mar 12, 2026Mar 12, 2026
    • Python
      0200Updated Mar 10, 2026Mar 10, 2026
    • Python
      12800Updated Mar 10, 2026Mar 10, 2026
    • Python
      11500Updated Feb 18, 2026Feb 18, 2026
    • RePro

      Public
      Official repository for RePro: Training Language Models to Faithfully Recycle the Web for Pretraining
      Python
      2810Updated Feb 18, 2026Feb 18, 2026
    • repo for the 11787 course
      HTML
      MIT License
      258000Updated Jan 15, 2026Jan 15, 2026
    • The deep research agent RL framework for paper https://arxiv.org/abs/2510.06534
      Python
      Apache License 2.0
      177400Updated Dec 20, 2025Dec 20, 2025
    • Ressys benchmark code repo
      Python
      21111Updated Dec 5, 2025Dec 5, 2025
    • Website: https://cxcscmu.github.io/behavior-priming-agentic-search.ai/
      Python
      1310Updated Nov 27, 2025Nov 27, 2025
    • HTML
      0000Updated Nov 26, 2025Nov 26, 2025
    • Official repository for Deep Research Comparator: A Platform For Fine-grained Human Annotations of Deep Research Agents
      Python
      MIT License
      0800Updated Nov 16, 2025Nov 16, 2025
    • Organize the Web: Constructing Domains Enhances Pre-Training Data Curation
      Jupyter Notebook
      Apache License 2.0
      7000Updated Oct 27, 2025Oct 27, 2025
    • AutoRule

      Public
      Official repository for AutoRule: Reasoning Chain-of-thought Extracted Rule-based Rewards Improve Preference Learning
      Python
      11510Updated Jul 24, 2025Jul 24, 2025
    • Official repository for FactMM-RAG: Fact-Aware Multimodal Retrieval Augmentation for Accurate Medical Radiology Report Generation [NAACL 2025]
      Python
      MIT License
      32930Updated Jul 12, 2025Jul 12, 2025
    • Python
      MIT License
      0010Updated May 30, 2025May 30, 2025
    • Python
      0300Updated Apr 2, 2025Apr 2, 2025
    • Interpret and control dense embedding via sparse autoencoder.
      Python
      MIT License
      01000Updated Mar 5, 2025Mar 5, 2025
    • Craw4LLM

      Public
      Official repository for "Craw4LLM: Efficient Web Crawling for LLM Pretraining"
      Python
      MIT License
      6065240Updated Feb 24, 2025Feb 24, 2025
    • Official repository for Montessori-Instruct: Generate Influential Training Data Tailored for Student Learning [ICLR 2025]
      Python
      MIT License
      55010Updated Jan 24, 2025Jan 24, 2025
    • RAGViz

      Public
      Official repository for RAGViz: Diagnose and Visualize Retrieval-Augmented Generation [EMNLP 2024]
      TypeScript
      MIT License
      138910Updated Jan 18, 2025Jan 18, 2025
    • MATES

      Public
      Official repository for MATES: Model-Aware Data Selection for Efficient Pretraining with Data Influence Models [NeurIPS 2024]
      Python
      MIT License
      97940Updated Nov 14, 2024Nov 14, 2024
    • esae

      Public
      Python
      0000Updated Oct 29, 2024Oct 29, 2024
    ProTip! When viewing an organization's repositories, you can use the props. filter to filter by custom property.