3D Generation

Scaling Mesh Generation via Compressive Tokenization

Haohan Weng, Zibo Zhao, Biwen Lei, Xianghui Yang, Jian Liu, Zeqiang Lai, Zhuo Chen, Yuhong Liu, Jie Jiang, Chunchao Guo, Tong Zhang, Shenghua Gao, C. L. Philip Chen

Hunyuan3D-2.0: Scaling Diffusion Models for High Resolution Textured 3D Assets Generation

Hunyuan3D 2.0 is an advanced large-scale 3D synthesis system designed to generate high-resolution textured 3D assets. It consists of two core components Hunyuan3D-DiT, a scalable flow-based diffusion transformer for shape generation that ensures alignment with input conditions, and Hunyuan3D-Paint, a texture synthesis model that produces high-quality, vibrant textures using geometric and diffusion priors. Additionally, Hunyuan3D-Studio provides a user-friendly platform for asset creation, manipulation, and animation, catering to both professionals and amateurs. Evaluations demonstrate that Hunyuan3D 2.0 surpasses previous state-of-the-art models (both open- and closed-source) in geometry detail, condition alignment, and texture quality. The system is publicly released to advance the open-source 3D generative modeling community.

Zibo Zhao, Zeqiang Lai, Qingxiang Lin, Yunfei Zhao, Haolin Liu, Shuhui Yang, Yifei Feng, Mingxin Yang, Sheng Zhang, Xianghui Yang, Huiwen Shi*, Sicong Liu,, Junta Wu, Yihang Lian, Fan Yang, Ruining Tang, Zebin He, Xinzhou Wang, Jian Liu, Xuhui Zuo, Zhuo Chen, Biwen Lei, Haohan Weng, Jing Xu, Yiling Zhu, Xinhai Liu,, Lixin Xu, Changrong Hu, Shaoxiong Yang, Song Zhang, Yang Liu, Tianyu Huang, Lifu Wang, Jihong Zhang, Meng Chen, Liang Dong, Yiwen Jia, Yulin Cai, Jiaao Yu, Yixuan Tang, Hao Zhang, Zheng Ye, Peng He, Runzhou Wu, Chao Zhang, Yonghao Tan, Jie Xiao, Yangyu Tao, Jianchen Zhu, Jinbao Xue, Kai Liu, Chongqing Zhao, Xinming Wu, Zhichao Hu, Lei Qin, Jianbing Peng, Zhan Li, Minghui Chen, Xipeng Zhang, Lin Niu, Paige Wang, Yingkai Wang, Haozhao Kuang, Zhongyi Fan, Xu Zheng, Weihao Zhuang, YingPing He, Tian Liu, Yong Yang, Di Wang, Yuhong Liu, Jie Jiang, Jingwei Huang, Chunchao Guo

Hunyuan3D-2.0: Scaling Diffusion Models for High Resolution Textured 3D Assets Generation

Hunyuan3D-1.0: A Unified Framework for Text-to-3D and Image-to-3D Generation

We propose a two-stage approach named Hunyuan3D-1.0 including a lite version and a standard version, that both support text- and image-conditioned generation. In the first stage, we employ a multi-view diffusion model that efficiently generates multiview RGB in approximately 4 seconds. In the second stage, we introduce a feedforward reconstruction model that rapidly and faithfully reconstructs the 3D asset given the generated multi-view images in approximately 7 seconds.

Xianghui Yang, Huiwen Shi*, Bowen Zhang*, Fan Yang, Jiacheng Wang, Hongxu Zhao, Xinhai Liu,, Xinzhou Wang, Qingxiang Lin, Jiaao Yu, Lifu Wang, Zhuo Chen, Sicong Liu,, Yuhong Liu, Yong Yang, Di Wang, Jie Jiang, Chunchao Guo

Hunyuan3D-1.0: A Unified Framework for Text-to-3D and Image-to-3D Generation

ViewFusion: Towards Multi-View Consistency via Interpolated Denoising

Novel-view Generation, 3D Generation, Diffusion Model

Xianghui Yang, Yan Zuo, Sameera Ramasinghe, Loris Bazzani, Gil Avraham, Anton van den Hengel

ViewFusion: Towards Multi-View Consistency via Interpolated Denoising