Presentation
Schedule
The team has a total of 20 papers accepted to CVPR 2025.
Paper |
---|
Editing
MatAnyone: Stable Video Matting with Consistent Memory Propagation
P. Yang, S. Zhou, J. Zhao, Q. Tao, C. C. Loy in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 2025 (CVPR) [arXiv] [Project Page] [Demo] |
EditingSegmentation
EdgeTAM: On-Device Track Anything Model
C. Zhou, C. Zhu, Y. Xiong, S. Suri, F. Xiao, L. Wu, R. Krishnamoorthi, B. Dai, C. C. Loy, V. Chandra, B. Soran in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 2025 (CVPR) [arXiv] [Project Page] |
Restoration
SeedVR: Seeding Infinity in Diffusion Transformer Towards Generic Video Restoration
J. Wang, Z. Lin, M. Wei, Y. Zhao, C. Yang, F. Xiao, C. C. Loy, L. Jiang in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 2025 (CVPR) [arXiv] [Project Page] |
Restoration
Arbitrary-steps Image Super-resolution via Diffusion Inversion
Z. Yue, K. Liao, C. C. Loy in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 2025 (CVPR) [arXiv] [Project Page] [Demo] |
Generation
Alias-free Latent Diffusion Models: Improving Fractional Shift Equivariance of Diffusion Latent Space
Y. Zhou, Z. Xiao, S. Yang, X. Pan in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 2025 (CVPR) [Coming Soon] |
Generation
AudCast: Audio-Driven Human Video Generation by Cascaded Diffusion Transformers
J. Guan, K. Wang, Z. Xu, Q. Yang, Y. Sun, S. He, B. Liang, Y. Cao, Y. Li, H. Feng, E. Ding, J. Wang, Y. Zhao, H. Zhou, Z. Liu in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 2025 (CVPR) [Coming Soon] |
3DRestoration
3DEnhancer: Consistent Multi-View Diffusion for 3D Enhancement
Y. Luo, S. Zhou, Y. Lan, X. Pan, C. C. Loy in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 2025 (CVPR) [arXiv] [Project Page] |
3DEditing
DoF-Gaussian: Controllable Depth-of-Field for 3D Gaussian Splatting
L. Shen, T. Liu, H. Sun, J. Li, Z. Cao, W. Li, C. C. Loy in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 2025 (CVPR) [Coming Soon] |
3D
MEAT: Multiview Diffusion Model for Human Generation on Megapixels with Mesh Attention
Y. Wang, F. Hong, S. Yang, L. Jiang, W. Wu, C. C. Loy in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 2025 (CVPR) [Coming Soon] |
3D
SAR3D: Autoregressive 3D Object Generation and Understanding via Multi-scale 3D VQVAE
Y. Chen, Y. Lan, S. Zhou, T. Wang, X. Pan in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 2025 (CVPR) [arXiv] [Project Page] |
3D
WildAvatar: Learning In-the-wild 3D Avatars from the Web
Z. Huang, S. Hu, G. Wang, T. Liu, Y. Zang, Z. Cao, W. Li, Z. Liu in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 2025 (CVPR) [arXiv] [Project Page] |
3D
GaussianCity: Generative Gaussian Splatting for Unbounded 3D City Generation
H. Xie, Z. Chen, F. Hong, Z. Liu in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 2025 (CVPR) [arXiv] [Project Page] |
3D
3DTopia-XL: Scaling High-quality 3D Asset Generation via Primitive Diffusion
Z. Chen, J. Tang, Y. Dong, Z. Cao, F. Hong, Y. Lan, T. Wang, H. Xie, T. Wu, S. Saito, L. Pan, D. Lin, Z. Liu in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 2025 (CVPR) [arXiv] [Project Page] |
3D
Disco4D: Disentangled 4D Human Generation and Animation from a Single Image
H. E. Pang, S. Liu, Z. Cai, L. Yang, T. Zhang, Z. Liu in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 2025 (CVPR) [arXiv] [Project Page] |
3D
Neural LightRig: Unlocking Accurate Object Normal and Material Estimation with Multi-Light Diffusion
Z. He, T. Wang, X. Huang, X. Pan, Z. Liu in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 2025 (CVPR) [arXiv] [Project Page] |
Multimodal3D
SOLAMI: Social Vision-Language-Action Modeling for Immersive Interaction with 3D Autonomous Characters
J. Jiang, W. Xiao, Z. Lin, H. Zhang, T. Ren, Y. Gao, Z. Lin, Z. Cai, L. Yang, Z. Liu in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 2025 (CVPR) [arXiv] [Project Page] |
MultimodalSegmentation
F-LMM: Grounding Frozen Large Multimodal Models
S. Wu, S. Jin, W. Zhang, L. Xu, W. Liu, W. Li, C. C. Loy in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 2025 (CVPR) [arXiv] [Project Page] |
Multimodal
EgoLife: Towards Egocentric Life Assistant
J. Yang, S. Liu, H. Guo, Y. Dong, X. Zhang, S. Zhang, P. Wang, Z. Zhou, B. Xie, Z. Wang, B. Ouyang, Z. Lin, M. Cominelli, Z. Cai, B. Li, Y. Zhang, P. Zhang, F. Hong, J. Widmer, F. Gringoli, L. Yang, Z. Liu in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 2025 (CVPR) [Coming Soon] |
Multimodal
Insight-V: Exploring Long-Chain Visual Reasoning with Multimodal Large Language Models
Y. Dong, Z. Liu, H.-L. Sun, J. Yang, W. Hu, Y. Rao, Z. Liu in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 2025 (CVPR) [arXiv] [Project Page] |
Multimodal
EgoLM: Multi-Modal Language Model of Egocentric Motions
F. Hong, V. Guzov, H. J. Kim, Y. Ye, R. Newcombe, Z. Liu, L. Ma in Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 2025 (CVPR) [arXiv] [Project Page] |