Commit 96249b7

Update README.md (#78)
## 🔥 Summary
Updating typos in Readme and arxiv link to new v3 report.
1 parent f691bc1 commit 96249b7

1 file changed: +2 -2 lines changed

README.md

Lines changed: 2 additions & 2 deletions
@@ -27,12 +27,12 @@ The example gif below shows the model's capacity to generate 3D shapes capturing
 <img src="./resources/3d_composition.gif" width="100%" style="margin: 5px;">
 </p>
 
-With bounding box conditioning, we observe novel 3D generations where the model balances between the two conditioning inputs -- text prompt and global aspect ratio. In the gif below, notice how the model creatively interprets the the *seashell* or *tall pagoda* prompts into distinct 3D shapes. The model sometimes struggles when the bounding box is too extreme for a given prompt, for example the *cat*, where it can produce disconnected components or generates it along a diagonal to fit the bounding box constraints.
+With bounding box conditioning, we observe novel 3D generations where the model balances between the two conditioning inputs -- text prompt and global aspect ratio. In the gif below, notice how the model creatively interprets the *seashell* or *tall pagoda* prompts into distinct 3D shapes. The model sometimes struggles when the bounding box is too extreme for a given prompt, for example the *cat*, where it can produce disconnected components or generates it along a diagonal to fit the bounding box constraints.
 <p align="center">
 <img src="./resources/bbox_conditioning.gif" width="100%" style="margin: 5px;">
 </p>
 
-For a technical overview of the methods behind these two improvements, please refer to our latest v3 report on arXiv that will be updated soon. The latest model was trained on an additional ~2.8 million synthetic 3D assets. We introduced several refinements to VQ-VAE architecture and training procedures, and increased the VQ-VAE latent length from 512 to 1024 to increase generation fidelity.
+For a technical overview of the methods behind these two improvements, please refer to our latest <a href=https://arxiv.org/abs/2503.15475 target="_blank">v3 report on arXiv</a>. The latest model was trained on an additional ~2.8 million synthetic 3D assets. We introduced several refinements to VQ-VAE architecture and training procedures, and increased the VQ-VAE latent length from 512 to 1024 to increase generation fidelity.
 
 ### Try it out on
 - [Hugging Face Interactive Demo](https://huggingface.co/spaces/Roblox/cube3d-interactive)
