Connecting Consistency Distillation to Score Distillation for Text-to-3D Generation

1Nanyang Technological University, 2Zhejiang University
ECCV 2024

Abstract

Although recent advancements in text-to-3D generation have significantly improved generation quality, issues like limited level of detail and low fidelity still persist, which requires further improvement. To understand the essence of those issues, we thoroughly analyze current score distillation methods by connecting theories of consistency distillation to score distillation. Based on the insights acquired through analysis, we propose an optimization framework, Guided Consistency Sampling (GCS), integrated with 3D Gaussian Splatting (3DGS) to alleviate those issues. Additionally, we have observed the persistent oversaturation in the rendered views of generated 3D assets. From experiments, we find that it is caused by unwanted accumulated brightness in 3DGS during optimization. To mitigate this issue, we introduce a Brightness Equalized Generation (BEG) scheme in 3DGS rendering. Experimental results demonstrate that our approach generates 3D assets with more details and higher fidelity than state-of-the-art methods.

Guided Consistency Sampling

We proposed a Guided Consistency Sampling (GCS) to reduce the distillation error and improve the generation quality. GCS is composed of a Compact Consistency (CC) Loss, a Conditional Guidance (CG) Loss, and a Constraint on Pixel (CP) Domain.

Teaser

Brightness Equalized Sampling

We proposed a Brightness Generation (BEG) to alleviate the brightness accumulated problem in 3D Gaussian-based generation.

Teaser

Generated Examples

BibTeX

@article{li2024connecting,
        title={Connecting Consistency Distillation to Score Distillation for Text-to-3D Generation},
        author={Li, Zongrui and Hu, Minghui and Zheng, Qian and Jiang, Xudong},
        journal={arXiv preprint arXiv:2407.13584},
        year={2024}
      }