AToM: Amortized Text-to-Mesh using 2D Diffusion
Abstract
We introduce Amortized Text-to-Mesh (AToM), a feed-forward text-to-mesh framework optimized across multiple text prompts simultaneously. In contrast to existing text-to-3D methods that often entail time-consuming per-prompt optimization and commonly output representations other than polygonal meshes, AToM directly generates high-quality textured meshes in less than 1 second with around 10 times reduction in the training cost, and generalizes to unseen prompts. Our key idea is a novel triplane-based text-to-mesh architecture with a two-stage amortized optimization strategy that ensures stable training and enables scalability. Through extensive experiments on various prompt benchmarks, AToM significantly outperforms state-of-the-art amortized approaches with over 4 times higher accuracy (in DF415 dataset) and produces more distinguishable and higher-quality 3D outputs. AToM demonstrates strong generalizability, offering finegrained 3D assets for unseen interpolated prompts without further optimization during inference, unlike per-prompt solutions.
Community
This is an automated message from the Librarian Bot. I found the following papers similar to this paper.
The following papers were recommended by the Semantic Scholar API
- HexaGen3D: StableDiffusion is just one step away from Fast and Diverse Text-to-3D Generation (2024)
- Text-to-3D Generation with Bidirectional Diffusion using both 2D and 3D priors (2023)
- PI3D: Efficient Text-to-3D Generation with Pseudo-Image Diffusion (2023)
- 4DGen: Grounded 4D Content Generation with Spatial-temporal Consistency (2023)
- TPA3D: Triplane Attention for Fast Text-to-3D Generation (2023)
Please give a thumbs up to this comment if you found it helpful!
If you want recommendations for any Paper on Hugging Face checkout this Space
You can directly ask Librarian Bot for paper recommendations by tagging it in a comment:
@librarian-bot
recommend
Are there any links to the relevant datasets or the code used to get the results the paper shows? The linked GitHub from the paper just goes to a GitHub hosted synopsis of the paper.
Hi Thanks for your interest. Our code will be released upon acceptance.
Models citing this paper 0
No model linking this paper
Datasets citing this paper 0
No dataset linking this paper
Spaces citing this paper 0
No Space linking this paper