Selected Publications
✱: Both authors contributed equally.
|
|
DA2: Depth Anything in Any Direction
Haodong Li,
Wangguangdong Zheng,
Jing He,
Yuhao Liu,
Xin Lin,
Xin Yang,
Ying-Cong Chen
Chunchao Guo
arXiv 2025
arXiv (Soon)
/
Paper
/
Project Page
/
Github (Soon)
/
Data (Soon)
/
Demo (Soon)
/
Slides
Powered by large-scale training data curated from our panoramic data curation engine, and the SphereViT for addressing the spherical distortions in panoramas,
DA2 is able to predict dense, scale-invariant distance from a single 360° panorama in an end-to-end manner,
with remarkable geometric fidelity and strong zero-shot generalization.
|
|
LOTUS: Diffusion-based Visual Foundation Model for High-quality Dense Prediction
Jing He✱ ,
Haodong Li✱ ,
Wei Yin,
Yixun Liang,
Leheng Li,
Kaiqiang Zhou,
Hongbo Zhang,
Bingbing Liu,
Ying-Cong Chen
ICLR 2025
arXiv
/
Paper
/
Project Page
/
Github
/
Demo (D)
/
Demo (N)
/
ComfyUI
Lotus is a diffusion-based visual foundation model with a simple yet effective adaptation protocol,
aiming to fully leverage the pre-trained diffusion's powerful visual priors for dense prediction.
With minimal training data, Lotus achieves SoTA performance in two key geometry perception tasks, i.e., zero-shot monocular depth and normal estimation.
|
|