GitHub - OpenImagingLab/AnyRecon: AnyRecon: Arbitrary-View 3D Reconstruction with Video Diffusion Model

AnyRecon: Arbitrary-View 3D Reconstruction
with Video Diffusion Model

Your star means a lot to us as we develop this project! ✨

TODO List

Upload sparse attention weights.

🛠️ Environment Setup

1. Clone Repository and Setup Environment

The point-cloud rendering pipeline depends on π³, which is included as a git submodule. Make sure to clone recursively so that Pi3/ is fetched at the same time:

git clone --recursive https://github.com/OpenImagingLab/AnyRecon.git
# If you already cloned without --recursive, run:
#   git submodule update --init --recursive
cd AnyRecon
conda create -n anyrecon python=3.10 -y
conda activate anyrecon
pip install torch==2.4.1 torchvision==0.19.1 torchaudio==2.4.1 --index-url https://download.pytorch.org/whl/cu118
pip install -r requirements.txt
pip install -r Pi3/requirements.txt
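
If you are unsure whether the submodule was fetched, a small check like the one below may help. It is only a heuristic sketch: it assumes that an unfetched submodule shows up as a missing or empty Pi3/ directory, and the function name `submodule_is_populated` is made up for this illustration and is not part of the repository.

```python
from pathlib import Path


def submodule_is_populated(repo_root: str, submodule: str = "Pi3") -> bool:
    """Return True if the submodule directory exists and is not empty.

    Heuristic only: a submodule that was never fetched is usually an
    empty (or missing) directory inside the parent repository.
    """
    path = Path(repo_root) / submodule
    return path.is_dir() and any(path.iterdir())


if __name__ == "__main__":
    if submodule_is_populated("."):
        print("Pi3/ is present.")
    else:
        print("Pi3/ looks empty; run: git submodule update --init --recursive")
```

Run it from the AnyRecon repository root before installing Pi3/requirements.txt.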

2. Download Models

AnyRecon relies on specific pre-trained weights. Please download them and place them in the ./checkpoints folder.

Base Video Diffusion Model (Wan2.1 I2V 14B 720P) [download]
AnyRecon LoRA weights [download]
π³ checkpoint (for point-cloud rendering) [download] → place at Pi3/model.safetensors
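
Before running inference, it can be useful to confirm that the downloads landed where the scripts expect them. The sketch below checks only the two locations named above (./checkpoints and Pi3/model.safetensors); the file names inside ./checkpoints depend on the downloads and are deliberately not assumed. The helper name `missing_paths` is hypothetical.

```python
from pathlib import Path
from typing import Iterable, List


def missing_paths(repo_root: str, expected: Iterable[str]) -> List[str]:
    """Return the entries of `expected` that do not exist under repo_root."""
    root = Path(repo_root)
    return [rel for rel in expected if not (root / rel).exists()]


# Only these two locations are named in the README text; extend the list
# with the actual checkpoint file names once you have downloaded them.
EXPECTED = ["checkpoints", "Pi3/model.safetensors"]

if __name__ == "__main__":
    for rel in missing_paths(".", EXPECTED):
        print(f"missing: {rel}")
```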

🚀 Quick Start

To reproduce the provided example, run:

bash test.sh

Or directly:

python run_AnyRecon.py \
    --root_dir example/valley \
    --output_dir example/valley \
    --lora_path full_attention.ckpt
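
To run the same command over several scenes, one option is to build the argument list in Python. This is a minimal sketch that uses only the three flags shown above; `build_command` is a hypothetical helper, and writing outputs into the scene folder simply mirrors the provided example.

```python
import shlex
from typing import List


def build_command(scene_dir: str, lora_path: str) -> List[str]:
    """Build the argv for one scene.

    Mirrors the example above, where --output_dir equals --root_dir.
    """
    return [
        "python", "run_AnyRecon.py",
        "--root_dir", scene_dir,
        "--output_dir", scene_dir,
        "--lora_path", lora_path,
    ]


if __name__ == "__main__":
    # "example/valley" comes from the README; add your own scene folders here.
    for scene in ["example/valley"]:
        print(shlex.join(build_command(scene, "full_attention.ckpt")))
```

Each list can be passed to `subprocess.run(cmd, check=True)` from the repository root to launch the run.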

🌟 Run on Your Own Data

run_AnyRecon.py expects point-cloud rendered condition videos as input. To prepare them from a raw video, we provide a helper script built on top of π³:

bash run_pi3.sh

Input video format. Your input video must be organized so that:

the first --num_cond_frames frames are the capture views — these provide the 3D point cloud,
the remaining frames are the test views — they are only used to estimate the camera poses at which the point cloud is rendered, and do not contribute any points to the reconstruction.
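
The frame layout above amounts to a simple index split. The sketch below illustrates it; `split_views` is a hypothetical name and is not a function from run_pi3.py.

```python
from typing import List, Tuple


def split_views(num_frames: int, num_cond_frames: int) -> Tuple[List[int], List[int]]:
    """Split frame indices into (capture views, test views).

    The first num_cond_frames frames are capture views; every later
    frame is a test view used only for pose estimation.
    """
    if not 0 < num_cond_frames <= num_frames:
        raise ValueError("num_cond_frames must be in [1, num_frames]")
    capture = list(range(num_cond_frames))
    test = list(range(num_cond_frames, num_frames))
    return capture, test


if __name__ == "__main__":
    capture, test = split_views(num_frames=5, num_cond_frames=2)
    print("capture views:", capture)
    print("test views:", test)
```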

Custom test-view trajectory (no test frames needed). If you'd rather specify a custom rendering trajectory instead of estimating poses from real test-view images, you can replace the test-view portion of the video with any placeholder frames and override target_extrinsics[num_cond_frames:] inside process_scene with your desired sequence of world→camera 4×4 matrices. The capture views (the first num_cond_frames frames) will still be used to build the point cloud, and rendering proceeds along your chosen trajectory.
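
As an illustration of such a trajectory, the sketch below builds world→camera 4×4 matrices for a circular orbit around a point. Several things here are assumptions rather than facts about this repository: the OpenCV-style camera frame (x right, y down, z forward), the choice of up axis, and the helper names `look_at_w2c` and `orbit_w2c`. The world frame and scale are whatever π³ produces for your scene, so check process_scene for the expected convention, array type, and units before substituting the result for target_extrinsics[num_cond_frames:].

```python
import numpy as np


def look_at_w2c(eye, target, up=(0.0, 1.0, 0.0)):
    """World->camera 4x4 for a camera at `eye` looking at `target`.

    Assumes an OpenCV-style camera frame (x right, y down, z forward).
    Undefined when the viewing direction is parallel to `up`.
    """
    eye = np.asarray(eye, dtype=np.float64)
    target = np.asarray(target, dtype=np.float64)
    forward = target - eye
    forward /= np.linalg.norm(forward)
    right = np.cross(forward, np.asarray(up, dtype=np.float64))
    right /= np.linalg.norm(right)
    down = np.cross(forward, right)  # completes a right-handed frame

    w2c = np.eye(4)
    w2c[:3, :3] = np.stack([right, down, forward])  # rows are camera axes
    w2c[:3, 3] = -w2c[:3, :3] @ eye                 # maps eye to the origin
    return w2c


def orbit_w2c(center, radius, height, num_views, up=(0.0, 1.0, 0.0)):
    """Stack of world->camera matrices on a circle around `center`.

    The circle lies in the x-z plane, offset by `height` along y, which
    matches the default up axis; adapt both for a different world frame.
    """
    center = np.asarray(center, dtype=np.float64)
    angles = np.linspace(0.0, 2.0 * np.pi, num_views, endpoint=False)
    mats = []
    for a in angles:
        offset = np.array([radius * np.cos(a), height, radius * np.sin(a)])
        mats.append(look_at_w2c(center + offset, center, up))
    return np.stack(mats)  # shape: (num_views, 4, 4)


if __name__ == "__main__":
    trajectory = orbit_w2c(center=(0.0, 0.0, 0.0), radius=2.0, height=0.5, num_views=8)
    print(trajectory.shape)
```

If the world frame follows the first camera in an OpenCV-style convention, the up direction would typically be (0, -1, 0) rather than the default used here.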

Once run_pi3.py has written the condition videos to its --output_dir, pass that directory to run_AnyRecon.py as --root_dir and run inference as shown above.

💗 Acknowledgments

Thanks to these great repositories: Wan2.1, DiffSynth-Studio, and π³.

🔗 Citation

If you find our work helpful, please cite it:

@article{chen2026anyrecon,
  title={AnyRecon: Arbitrary-View 3D Reconstruction with Video Diffusion Model},
  author={Chen, Yutian and Guo, Shi and Jin, Renbiao and Yang, Tianshuo and Cai, Xin and Luo, Yawen and Yang, Mingxin and Yu, Mulin and Xu, Linning and Xue, Tianfan},
  journal={arXiv preprint arXiv:2604.19747},
  year={2026}
}

