OpenHelix Robotics

Building Next-Generation Embodied Intelligence

We are a research group focused on vision-language-action (VLA) models, and we aim to share insights with the community through our research.


Introduction

OpenHelix-Team develops a family of fully open-source Vision-Language-Action models (VLAs) that achieve state-of-the-art performance at substantially lower cost.

Awesome VLAs

Multimodal Large Language Models

  • Cobra (AAAI 2025): Extending Mamba to Multi-modal Large Language Model for Efficient Inference

General Foundation Models

  • VLA-Adapter (AAAI 2026 (Oral)): An Effective Paradigm for Tiny-Scale Vision-Language-Action Model
  • LLaVA-VLA (ICRA 2026): A Simple Yet Powerful Vision-Language-Action Model

Visual Feature Alignment for VLAs

  • ReconVLA (AAAI 2026 Best Paper Award): Reconstructive Vision-Language-Action Model as Effective Robot Perceiver
  • Spatial Forcing (ICLR 2026): Implicit Spatial Representation Alignment for Vision-Language-Action Model

World-modeling VLAs

  • Unified Diffusion VLA (ICLR 2026): The first open-source diffusion Vision-Language-Action model
  • HiF-VLA (CVPR 2026): An efficient, bidirectional spatiotemporal expansion Vision-Language-Action Model
  • frappe: Infusing World Modeling into Generalist Policies via Multiple Future Representation Alignment
  • VLA-RFT: Vision-Language-Action Models with Reinforcement Fine-Tuning

Visual Enhanced Frameworks

  • VLA-2: Empowering Vision-Language-Action Models with an Agentic Framework for Unseen Concept Manipulation
  • LongVLA (CoRL 2025): Unleashing Long-Horizon Capability of Vision-Language-Action Models for Robot Manipulation

Efficient VLAs

  • CEED-VLA: Consistency Vision-Language-Action Model with Early-Exit Decoding
  • OpenHelix: An Open-Source Dual-System Vision-Language-Action Model for Robotic Manipulation

Quadruped VLAs

  • GeRM (IROS 2024): A Generalist Robotic Model with Mixture-of-Experts for Quadruped Robot

Humanoid VLAs

Collaborating Institutions

This initiative is jointly established and co-developed with the following research institutions:

  • Westlake University
  • The Hong Kong University of Science and Technology (Guangzhou)
  • Zhejiang University
  • Tsinghua University
  • Beijing Academy of Artificial Intelligence (BAAI)
  • Xi’an Jiaotong University
  • Beijing University of Posts and Telecommunications

Contact

If you are interested in discussing our work or joining us, please email songwenxuan0115@gmail.com.

Pinned Repositories

  • Awesome-Force-Tactile-VLA: A paper list of multimodal VLAs
  • ReconVLA: Official implementation of ReconVLA: Reconstructive Vision-Language-Action Model as Effective Robot Perceiver
  • VLA-Adapter: An Effective Paradigm for Tiny-Scale Vision-Language-Action Model
  • OpenHelix: An Open-Source Dual-System VLA Model for Robotic Manipulation
  • cobra: [AAAI-25] Cobra: Extending Mamba to Multi-modal Large Language Model for Efficient Inference

Repositories

Showing 10 of 16 repositories:

  • VLA-Adapter: An Effective Paradigm for Tiny-Scale Vision-Language-Action Model
  • .github
  • Unified-Diffusion-VLA: The first open-source diffusion Vision-Language-Action model [ICLR 2026]
  • LLaVA-VLA: A Simple Yet Powerful Vision-Language-Action Model [ICRA 2026]
  • Spatial-Forcing: Official implementation of Spatial Forcing: Implicit Spatial Representation Alignment for Vision-Language-Action Model [ICLR 2026]
  • HiF-VLA: An efficient, bidirectional spatiotemporal expansion Vision-Language-Action Model [CVPR 2026]
  • ReconVLA: Official implementation of ReconVLA: Reconstructive Vision-Language-Action Model as Effective Robot Perceiver
  • frappe: Official implementation of FRAPPE: Infusing World Modeling into Generalist Policies via Multiple Future Representation Alignment
  • OpenTrajBooster: Official implementation of TrajBooster
  • VLA-2: VLA^2: Empowering Vision-Language-Action Models with an Agentic Framework for Unseen Concept Manipulation
