To get started, clone this project, create a conda virtual environment using Python 3.10+, and install the requirements:
```bash
git clone https://github.com/CUHK-AIM-Group/MonoSplat.git
cd MonoSplat
conda create -n monosplat python=3.10
conda activate monosplat
pip install torch==2.1.2 torchvision==0.16.2 torchaudio==2.1.2 --index-url https://download.pytorch.org/whl/cu118
pip install -r requirements.txt
```
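Before installing the heavier dependencies, it can be worth confirming that the interpreter inside the activated environment actually meets the 3.10+ requirement. A minimal sketch (the helper name is ours, not part of the repo):

```python
import sys

def meets_requirement(version_info, minimum=(3, 10)):
    """Return True if the (major, minor, ...) version tuple is at least `minimum`."""
    return tuple(version_info[:2]) >= minimum

# Run inside the activated conda environment.
print("Python OK:", meets_requirement(sys.version_info))
```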
Our MonoSplat uses the same training datasets as pixelSplat and MVSplat. Below, we quote pixelSplat's detailed instructions for obtaining the datasets.
- Download the preprocessed DTU data `dtu_training.rar`.
- Convert DTU to chunks by running:

```bash
python src/scripts/convert_dtu.py --input_dir PATH_TO_DTU --output_dir datasets/dtu
```
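After conversion, it is easy to sanity-check that chunks were actually written. A small sketch, assuming the converter emits one `.torch` file per chunk as in pixelSplat's preprocessed-dataset layout (an assumption; check the converter's output if yours differs):

```python
from pathlib import Path

def list_chunks(chunk_dir):
    """Sorted names of the .torch chunk files under chunk_dir (assumed layout)."""
    return sorted(p.name for p in Path(chunk_dir).glob("*.torch"))

# Prints an empty list if the directory is missing or has no chunks yet.
print(list_chunks("datasets/dtu"))
```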
To render novel views and compute evaluation metrics from a pretrained model:

- get the pretrained models, and save them to `/checkpoints`
- run the following:

```bash
# re10k
python -m src.main +experiment=re10k \
mode=test \
dataset/view_sampler=evaluation \
checkpointing.load=/path/to/checkpoint \
dataset.view_sampler.index_path=assets/evaluation_index_re10k_nctx10.json \
test.compute_scores=true \
wandb.mode=disabled
```

- the rendered novel views will be stored under `outputs/test`
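`test.compute_scores=true` reports standard image-quality metrics. As a reference point, PSNR (one of the commonly reported metrics) reduces to a one-liner for images normalized to [0, 1]; the helper below is an illustrative sketch, not the repo's own implementation:

```python
import math

def psnr_from_mse(mse, peak=1.0):
    """PSNR in dB from mean squared error; higher is better."""
    return 10.0 * math.log10(peak * peak / mse)

# An MSE of 0.01 on [0, 1] images corresponds to 20 dB.
print(psnr_from_mse(0.01))
```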
To train the model, run the following:

```bash
# download the backbone pretrained weight from unimatch and save to 'checkpoints/'
wget 'https://s3.eu-central-1.amazonaws.com/avg-projects/unimatch/pretrained/gmdepth-scale1-resumeflowthings-scannet-5d9d7964.pth' -P checkpoints

# train monosplat
python -m src.main +experiment=re10k data_loader.train.batch_size=14
```
Our models are trained with a single A100 (80GB) GPU. They can also be trained on multiple GPUs with less memory by setting a smaller `data_loader.train.batch_size` per GPU.
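The single-GPU setting above uses a total batch size of 14, so a multi-GPU run only needs a per-GPU value that preserves that total. A trivial helper to make the arithmetic explicit (our sketch; it assumes the usual data-parallel convention where the effective batch size is per-GPU size times GPU count):

```python
def per_gpu_batch_size(total_batch_size, num_gpus):
    """Per-GPU batch size keeping the effective (total) batch size fixed.

    Assumes data-parallel training where the effective batch size is
    per_gpu * num_gpus.
    """
    if total_batch_size % num_gpus != 0:
        raise ValueError("total batch size must divide evenly across GPUs")
    return total_batch_size // num_gpus

# e.g. the single-A100 total of 14 split across two smaller GPUs:
print(per_gpu_batch_size(14, 2))  # 7 per GPU
```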
We use our base model, trained on the RealEstate10K dataset, for extensive cross-dataset evaluation. For instance, to evaluate on the DTU benchmark, run:
```bash
# RealEstate10K -> DTU
python -m src.main +experiment=dtu \
mode=test \
checkpointing.load=/path/to/checkpoint \
dataset/view_sampler=evaluation \
dataset.view_sampler.index_path=assets/evaluation_index_dtu_nctx2.json \
test.compute_scores=true \
wandb.mode=disabled
```
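When scripting several such cross-dataset runs, it can help to assemble the Hydra-style overrides programmatically before handing them to `subprocess.run`. A sketch mirroring the flags used in the commands above (the helper itself is not part of the repo):

```python
def build_eval_command(experiment, checkpoint, index_path):
    """Command-line tokens matching the evaluation runs in this README."""
    return [
        "python", "-m", "src.main",
        f"+experiment={experiment}",
        "mode=test",
        f"checkpointing.load={checkpoint}",
        "dataset/view_sampler=evaluation",
        f"dataset.view_sampler.index_path={index_path}",
        "test.compute_scores=true",
        "wandb.mode=disabled",
    ]

cmd = build_eval_command(
    "dtu", "/path/to/checkpoint", "assets/evaluation_index_dtu_nctx2.json"
)
print(" ".join(cmd))
```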
```bibtex
@article{liu2025monosplat,
  title={MonoSplat: Generalizable 3D Gaussian Splatting from Monocular Depth Foundation Models},
  author={Liu, Yifan and Fan, Keyu and Yu, Weihao and Li, Chenxin and Lu, Hao and Yuan, Yixuan},
  journal={arXiv preprint arXiv:2505.15185},
  year={2025}
}
```
The project is largely based on pixelSplat and MVSplat. Many thanks to these two projects for their excellent contributions!