Ahmed Abdelreheem²*, Filippo Aleotti¹, Jamie Watson¹, Zawar Qureshi¹, Abdelrahman Eldesokey², Peter Wonka², Gabriel Brostow¹³, Sara Vicente¹, Guillermo Garcia-Hernando¹
¹Niantic Spatial, ²KAUST, ³UCL
We introduce PlaceIt3D, a novel task and benchmark for language-guided object placement in real 3D scenes. Given a 3D scene point cloud, a 3D asset, and a natural language prompt, the goal is to predict a valid placement that respects semantic and geometric constraints, including object relationships, free-space reasoning, and occlusions. We propose a new evaluation protocol, a dataset for training 3D LLMs on this task, and a baseline method that predicts placement location, anchor localization, and valid rotations.
Our code is tested on the following setup:
- CUDA 11.8
- PyTorch 2.0.0
- transformers 4.33.2
- NVIDIA Container Runtime version 1.17.8 (needed for running the PlaceIt3D-Benchmark, see instructions below)
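After completing the installation below, you can optionally verify that these versions are picked up with a minimal check like the following sketch (it only relies on standard torch and transformers attributes):

# sanity_check_env.py -- optional check that the tested versions are installed.
import torch
import transformers

print(f"PyTorch version:      {torch.__version__}")         # expected: 2.0.0 (+cu118)
print(f"CUDA runtime (torch): {torch.version.cuda}")         # expected: 11.8
print(f"CUDA available:       {torch.cuda.is_available()}")
print(f"transformers version: {transformers.__version__}")   # expected: 4.33.2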
conda create -n placeit3d python=3.9 pip=25.0 -y
conda activate placeit3d
pip install torch==2.0.0 torchvision==0.15.1 torchaudio==2.0.1 --index-url https://download.pytorch.org/whl/cu118
pip install spconv-cu118
conda install ninja google-sparsehash -c bioconda
pip install trimesh==4.0.1 loguru torch-scatter ipdb timm gorilla-core transformers==4.33.2 peft==0.9.0
pip install -U "ray[default]"
git clone https://github.com/salesforce/LAVIS.git SalesForce-LAVIS
cd SalesForce-LAVIS
# Ignore open3D as the pinned version in requirements.txt causes problems later and it is not needed in our codebase.
pip install -r <(grep -v "open3d" requirements.txt) numpy==1.26
git clone https://github.com/nianticlabs/placeit3d.git
cd lavis/models/placewizard_model/lib
python setup.py develop
The final structure of the data folder should be as follows:
data/
    PlaceIt3D/
        placeit3d_train.h5
        placeit3d_val.h5
        placeit3d_test.h5
    scannetv2/
        train/
        val/
    superpoints/
        *.npy
    pointbert_embeddings/
        *.pt
To prepare ScanNet, please follow the data preparation instructions found in the Reason3D repository.
Download the train/val/test sets from here.
Download the superpoints from here and extract the zip file inside the data folder. Ensure you maintain the same folder structure for the data folder as mentioned above.
Download the precomputed PointBert embeddings from here. Move the downloaded file to the data folder, then run the following:
cd data
unzip pointbert_PartObjaverseTiny_embeddings.zip
mv pointbert_PartObjaverseTiny_embeddings pointbert_embeddings
Download our resized and orientation-aligned GLBs from the TinyObjaverse-Part dataset using this link.
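After downloading and extracting everything above, you can optionally verify the layout with a short script like the sketch below. It only assumes the data/ structure shown earlier (the paths are taken from that listing) and does not check the contents of the individual files.

# check_data_layout.py -- optional sketch that verifies the data/ layout described above.
from pathlib import Path

data_root = Path("data")  # adjust if your data folder lives elsewhere
expected = [
    "PlaceIt3D/placeit3d_train.h5",
    "PlaceIt3D/placeit3d_val.h5",
    "PlaceIt3D/placeit3d_test.h5",
    "scannetv2/train",
    "scannetv2/val",
    "superpoints",
    "pointbert_embeddings",
]

for rel in expected:
    path = data_root / rel
    status = "ok" if path.exists() else "MISSING"
    print(f"{status:8s} {path}")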
The final structure of the checkpoints folder should be as follows:
checkpoints/
    spformer_encoder_uniform_superpoints.pth
Download the pretrained scene encoder from here and place it in the checkpoints directory.
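If you want to confirm the download, a quick, hedged peek at the checkpoint with torch is enough; this sketch makes no assumptions about the internal key names:

# inspect_checkpoint.py -- optional peek at the downloaded scene encoder weights.
import torch

ckpt = torch.load("checkpoints/spformer_encoder_uniform_superpoints.pth", map_location="cpu")
print(type(ckpt))
if isinstance(ckpt, dict):
    # Print only the first few keys; the exact layout depends on how the checkpoint was saved.
    for key in list(ckpt.keys())[:10]:
        print(key)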
To train PlaceWizard on the PlaceIt3D dataset from scratch, use the following command. We train for 50 epochs:
export REPO_DIR="$(pwd)"
TOKENIZERS_PARALLELISM=false python train_ray.py --cfg-path configs/placewizard.yaml
Feel free to change the number of GPUs via the ray_num_gpus parameter in configs/placewizard.yaml (but you then have to adjust the batch size and the learning rate accordingly, as described in this issue).
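As a rough guide, a common convention is to scale both linearly with the number of GPUs; the sketch below illustrates that assumption with placeholder values, so please verify the exact recipe against the linked issue and configs/placewizard.yaml before relying on it.

# scale_hparams.py -- hypothetical helper illustrating linear scaling of batch size and
# learning rate when ray_num_gpus changes (an assumption, not the official recipe).
def scale_hyperparams(base_lr, base_batch_size, base_num_gpus, new_num_gpus):
    factor = new_num_gpus / base_num_gpus
    return base_lr * factor, int(base_batch_size * factor)

# Placeholder base values -- not the ones shipped in configs/placewizard.yaml.
new_lr, new_bs = scale_hyperparams(base_lr=1e-4, base_batch_size=8, base_num_gpus=4, new_num_gpus=2)
print(new_lr, new_bs)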
Our pretrained checkpoint can be downloaded from here.
First, generate the predicted masks for each validation or test example. The predictions will be saved as {val/test}_predictions.h5 and the save path will be printed after the evaluation ends.
TOKENIZERS_PARALLELISM=false python evaluate.py --cfg-path configs/placewizard.yaml --options model.pretrained={CHECKPOINT_PATH} --h5_data .h5
Note: This repository currently only supports batch size = 1 for inference.
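To confirm that inference produced output before moving on, you can inspect the HDF5 file with h5py; the sketch below only assumes a valid HDF5 file and does not rely on any particular key layout:

# inspect_predictions.py -- optional peek at the generated {val/test}_predictions.h5.
import h5py

# Replace with the path printed by evaluate.py after it finishes.
with h5py.File("val_predictions.h5", "r") as f:
    print(f"{len(f.keys())} top-level keys")
    for key in list(f.keys())[:5]:
        print(key, f[key])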
Extract the predicted translation vectors and rotation angles using the following command:
python eval_pred_to_json.py \
--pred_path path_to_{val/test}_predictions.h5 \
--superpoint_path ./data/superpoints \
--pcd_path ./data/scannetv2/val \
--mesh_path SCANNET_PATH/scans/ \
--test_h5 ./data/PlaceIt3D/placeit3d_{val/test}.h5
Once the predictions are ready, run the rule-based evaluation executable.
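Before launching the benchmark, you can optionally confirm that eval_pred_to_json.py produced a parseable predictions.json; the sketch below only assumes valid JSON and does not rely on its internal schema:

# check_predictions_json.py -- optional sanity check on the exported predictions file.
import json

# Use the path produced by eval_pred_to_json.py.
with open("predictions.json", "r") as f:
    predictions = json.load(f)

print(type(predictions))
print(f"{len(predictions)} entries")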
First, pull the Docker image:
docker pull aabdelreheem/placeit3d-benchmark:latest
Then run the evaluation on the test set:
docker run --gpus all \
-v "path_to_scannet/scans:/app/mounted_scannet/scans:ro" \
-v "path_to_prediction/predictions.json:/app/test_input/predictions.json:ro" \
-v "path_to_save_output/:/app/test_output" \
aabdelreheem/placeit3d-benchmark:latest test
The benchmark metrics will be printed after running the above command and saved to a .json file.
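The directory mounted at /app/test_output will then contain that .json, which you can load directly; the sketch below assumes only that each output file is a flat JSON dictionary of metric names to values:

# read_metrics.py -- print the metrics written by the benchmark container.
import json
from pathlib import Path

output_dir = Path("path_to_save_output")  # the directory mounted as /app/test_output
for metrics_file in output_dir.glob("*.json"):
    with open(metrics_file) as f:
        metrics = json.load(f)  # assumed: flat dict of metric name -> value
    print(metrics_file.name)
    for name, value in metrics.items():
        print(f"  {name}: {value}")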
Similarly, to run the benchmark on the validation set, run the following command:
docker run --gpus all \
-v "path_to_scannet/scans:/app/mounted_scannet/scans:ro" \
-v "path_to_prediction/predictions.json:/app/val_input/predictions.json:ro" \
-v "path_to_save_output/:/app/val_output" \
aabdelreheem/placeit3d-benchmark:latest val
Our code is mainly based on Reason3D, ReferIt3D, VLA-3D, Cap3D, Shap-e, 3D-LLM, and 3D-STMN. Thanks to the authors for their amazing contributions!
If you find our work useful for your project, please consider citing our paper:
@inproceedings{abdelreheem2025Placeit3d,
author = {Abdelreheem, Ahmed and Aleotti, Filippo and Watson, Jamie and Qureshi, Zawar and Eldesokey, Abdelrahman and Wonka, Peter and Brostow, Gabriel and Vicente, Sara and Garcia-Hernando, Guillermo},
title = {PlaceIt3D: Language-Guided Object Placement in Real 3D Scenes},
booktitle = {ICCV},
year = {2025}
}
Copyright © Niantic Spatial 2025. Patent Pending. All rights reserved.


