Project page template is borrowed from DreamFusion.
While image diffusion models have made significant progress in text-driven 3D content creation, they often fail to accurately capture the intended meaning of text prompts, especially for view information. This limitation leads to the Janus problem, where multi-faced 3D models are generated under the guidance of such diffusion models. In this paper, we propose a robust high-quality 3D content generation pipeline by exploiting orthogonal-view image guidance. First, we introduce a novel 2D diffusion model that generates an image consisting of four orthogonal-view sub-images based on the given text prompt. Then, the 3D content is created using this diffusion model. Notably, the generated orthogonal-view image provides strong geometric structure priors and thus improves 3D consistency. As a result, it effectively resolves the Janus problem and significantly enhances the quality of 3D content creation. Additionally, we present a 3D synthesis fusion network that can further improve the details of the generated 3D contents. Both quantitative and qualitative evaluations demonstrate that our method surpasses previous text-to-3D techniques.
EfficientDreamer generates objects and scenes from diverse captions.
A squirrel, animated movie character, high detail 3D model.
A pig wearing a backpack, high quality.
Mr Bean Cartoon.
A motorcycle, scifi.
A squirrel playing guitar.
A ghost eating a hamburger.
A crab, low poly.
Katana, high detail, high quality.
Darth Vader helmet.
A DSLR photo of a yellow duck.
A 3D model of a road bike.
A 3D model of a corgi taking a selfie, high detail.
A 3D model of a fox holding a videogame controller.
An astronaut is riding a horse, high detail 3d model.
A peacock on a surfboard.
A 3D model of a German Shepherd.
A 3D model of an adorable cottage with a thatched roof.
A 3D model of a toy robot.
A 3D model of an exercise bike.
A 3D model of a white rabbit.
A blue poison-dart frog sitting on a water lily, high detail 3D model.
A ladybug, high detail, high quality.
A DSLR photo of a chow chow puppy, high detail, high quality.
A lion reading the newspaper.
A panda rowing a boat in a pond, high detail, high quality.
A product photo of a toy tank, high detail 3D model.
A recliner chair.
Viking axe fantasy, weapon, blender.
Dragon armor, 3D asset.
A photo of a horse walking.
TRUMP figure.
Pikachu, high quality.
Army Jacket.
A statue of angel.
A bulldog wearing a black pirate hat.
A 3D scan of AK47, weapon.
Project page template is borrowed from DreamFusion.