Skip to content

[SIGGRAPH'25 (ACM TOG)] TransparentGS: Fast Inverse Rendering of Transparent Objects with Gaussians

License

MIT, Unknown licenses found

Licenses found

MIT
LICENSE
Unknown
LICENSE.md
Notifications You must be signed in to change notification settings

LetianHuang/transparentgs

Repository files navigation

TransparentGS: Fast Inverse Rendering of Transparent Objects with Gaussians

SIGGRAPH 2025
(ACM Transactions on Graphics)

Letian Huang1      Dongwei Ye1      Jialin Dan1      Chengzhi Tao1      Huiwen Liu2     
Kun Zhou3,4      Bo Ren2      Yuanqi Li1      Yanwen Guo1      Jie Guo* 1     
1State Key Lab for Novel Software Technology, Nanjing University
2TMCC, College of Computer Science, Nankai University
3State Key Lab of CAD&CG, Zhejiang University
4Institute of Hangzhou Holographic Intelligent Technology

teaser

News

[2025.08.04] 🎈 We release the code.

[2025.07.23] :smile: Birthday of the repository.

TL;DR

We propose TransparentGS, a fast inverse rendering pipeline for transparent objects based on 3D-GS. The main contributions are three-fold: efficient transparent Gaussian primitives for specular refraction, GaussProbe to encode ambient light and nearby contents, and the IterQuery algorithm to reduce parallax errors in our probe-based framework.

Overview

The overview of our TransparentGS pipeline. Each 3D scene is firstly separated into transparent objects and opaque environment using SAM2 [Ravi et al. 2024] guided by GroundingDINO [Liu et al. 2024]. For transparent objects, we propose transparent Gaussian primitives, which explicitly encode both geometric and material properties within 3D Gaussians. And the properties are rasterized into maps for subsequent deferred shading. For the opaque environment, we recover it with the original 3D-GS, and bake it into GaussProbe surrounding the transparent object. The GaussProbe are then queried through our IterQuery algorithm to compute reflection and refraction.

pipeline

Citation

If you find this work useful in your research, please cite:

@article{transparentgs,
    author = {Huang, Letian and Ye, Dongwei and Dan, Jialin and Tao, Chengzhi and Liu, Huiwen and Zhou, Kun and Ren, Bo and Li, Yuanqi and Guo, Yanwen and Guo, Jie},
    title = {TransparentGS: Fast Inverse Rendering of Transparent Objects with Gaussians},
    journal = {ACM Transactions on Graphics (TOG)},
    number = {4},
    volume = {44},
    month = {July},
    year = {2025},
    pages = {1--17},
    url = {https://doi.org/10.1145/3730892},
    publisher = {ACM New York, NY, USA}
}

TransparentGS Viewer (Renderer)

TransparentGS Renderer

Utility

  • Real-time rendering and navigation of scenes that integrate traditional 3DGS, triangle meshes and reconstructed meshes (Highly robust to complex occlusions).
  • Secondary light effects (e.g., reflection and refraction).
  • Rendering with non-pinhole camera models (e.g., fisheye or panorama).
  • Material editing (e.g., IOR and base color).

Cloning the Repository and Setup

Clone the repository and create an anaconda environment using

git clone git@github.com:LetianHuang/transparentgs.git --recursive
cd transparentgs

SET DISTUTILS_USE_SDK=1 # Windows only
conda env create --file environment.yml
conda activate transparentgs

The repository contains several submodules, thus please check it out with

pip install . # Thanks to https://github.com/ashawkey/raytracing
pip install submodules/diff-gaussian-rasterization
pip install submodules/simple-knn
pip install submodules/diff-gaussian-rasterization-fisheye 
pip install submodules/diff-gaussian-rasterization-panorama
pip install submodules/nvdiffrast

or choose a faster version (1. integrated with Speedy-Splat, using SnugBox and AccuTile; 2. Employ CUDA scripting for computational acceleration of 64 probes).

pip install . 
pip install submodules-speedy/diff-gaussian-rasterization
pip install submodules/simple-knn
pip install submodules-speedy/diff-gaussian-rasterization-fisheye
pip install submodules-speedy/diff-gaussian-rasterization-panorama
pip install submodules-speedy/compute-trilinear-weights
pip install submodules/nvdiffrast

Scene Assets

First, create a models folder inside the project path by

mkdir models

The data structure will be organised as follows:

transparentgs/
│── models/
│   ├── 3dgs/
│   │   ├── drjohnson.ply
│   │   ├── playroom_lego_hotdog_mouse.ply
│   │   ├── Matterport3D_h1zeeAwLh9Z_3.ply
│   │   ├── ...
│   ├── mesh/
│   │   ├── ball.ply
│   │   ├── mouse.ply
│   │   ├── bunny.ply
│   │   ├── ...
│   ├── probes/
│   │   ├── playroom_lego_hotdog_mouse/
│   │   │   ├── probes/
│   │   │   │   ├── 000_depth.exr
│   │   │   │   ├── 000.exr
│   │   │   │   ├── 333_depth.exr
│   │   │   │   ├── 333.exr
│   │   │   │   ├── ...
│   │   │   ├── probe.json
│   │   ├── ...
|   ├── meshgs_proxy/
│   │   ├── mouse.ply
│   │   ├── ...

Public scene

We release several ready-to-use scenes. Please download the assets from Google Drive and move the 3dgs and mesh folders into models/ folder.

Custom scene

To create a custom scene, simply follow the provided instructions to set it up. Instructions on the above data structure are as follows:

  1. Scenes in the 3dgs folder should be in .ply format and reconstructed using traditional 3DGS, op43dgs (for reconstruction from non-pinhole cameras) or Mip-Splatting (for anti-alias).
  2. Objects in the mesh folder could be in any triangle mesh format (e.g, .obj, .ply or .glb), including both traditional and reconstructed ones.
  3. Probes in the probes folder could be baked using Step I: Bake GaussProbe or similar formats. The probes.json file specifies the positions of the probes, while the probes/ directory stores the corresponding RGB panorama and depth panorama in EXR format.
  4. The meshgs_proxy folder is a byproduct of Step I: Bake GaussProbe. It contains the object converted into 3DGS format and can be used as a proxy of the mesh in mesh to assemble a new scene (mesh + 3DGS). Note: modifying the files in meshgs_proxy does not affect the final rendering results (i.e., Step II: Boot up the renderer). To change the proxy configuration, you can adjust the scene’s position under the 3dgs directory and rerun Step I: Bake GaussProbe.

Step I: Bake GaussProbe

The first step is to bake probes for the scene that has already been set up:

python probes_bake.py --W 800 --H 800 --gs_path ./models/3dgs/playroom_lego_hotdog_mouse.ply --probes_path ./models/probes/playroom_lego_hotdog_mouse --mesh ./models/mesh/mouse.ply --begin_id 0
Command Line Arguments for probes_bake.py

--gs_path

path to the trained 3D Gaussians directory as the environment (used to bake GaussProbe).

--probes_path

output path of GaussProbe to be baked

--mesh

path to the mesh

--W

width of the RGBD panorama

--H

height of the RGBD panorama

--numProbes

number of probes (1/8/64). In theory, any positive integer is allowed, but the released code only supports these three fixed values.

--begin_id

only to prevent OOM (Out of Memory); when GPU memory is insufficient, the process can exit and resume baking from the specified ID.

--scale_ratio

bounding box scale ratio for the mesh

--meshproxy_pitch

the voxel size (pitch), which determines the resolution of the mesh voxelization.

Step II: Boot up the renderer

Next, boot the renderer to start rendering:

python renderer.py --W 960 --H 540 --gs_path ./models/3dgs/playroom_lego_hotdog_mouse.ply --probes_path ./models/probes/playroom_lego_hotdog_mouse --mesh ./models/mesh/mouse.ply --meshproxy_pitch 0.1
Command Line Arguments for renderer.py

--mesh

path to the mesh

--light_type

the original design supports either environment map or GaussProbe. However, since a single probe with zero iteration is equivalent to the environment map, this design has been deprecated.

--gs_path

path to the trained 3D Gaussians directory as the environment (used to bake GaussProbe).

--W

GUI width

--H

GUI height

--radius

default GUI camera radius from center

--fovy

default GUI camera fovy (can be modified in the GUI)

--probes_path

path of the baked GaussProbe

--numProbes

number of probes (1/8/64). In theory, any positive integer is allowed, but the released code only supports these three fixed values. (can be modified in the GUI)

--iters

count of iterations (0-10). In theory, any non-negative integer is allowed, but the released code only supports these eleven fixed values. (can be modified in the GUI)

--meshproxy_pitch

the voxel size (pitch), which determines the resolution of the mesh voxelization.

Full Pipeline (optional)

Additionally, we offer an optional all-in-one pipeline script that produces the same effect as executing Step I and Step II independently:

python full_render_pipeline.py --W 960 --H 540 --probesW 800 --probesH 800 --gs_path ./models/3dgs/playroom_lego_hotdog_mouse.ply --probes_path ./models/probes/playroom_lego_hotdog_mouse --mesh ./models/mesh/mouse.ply --meshproxy_pitch 0.1
# equal to
# 1. python probes_bake.py --W 800 --H 800 --gs_path ./models/3dgs/playroom_lego_hotdog_mouse.ply --probes_path ./models/probes/playroom_lego_hotdog_mouse --mesh ./models/mesh/mouse.ply --begin_id 0 --meshproxy_pitch 0.1
# 2. python renderer.py --W 960 --H 540 --gs_path ./models/3dgs/playroom_lego_hotdog_mouse.ply --probes_path ./models/probes/playroom_lego_hotdog_mouse --mesh ./models/mesh/mouse.ply --meshproxy_pitch 0.1
Command Line Arguments for renderer.py

--mesh

path to the mesh

--light_type

the original design supports either environment map or GaussProbe. However, since a single probe with zero iteration is equivalent to the environment map, this design has been deprecated.

--gs_path

path to the trained 3D Gaussians directory as the environment (used to bake GaussProbe).

--W

GUI width

--H

GUI height

--radius

default GUI camera radius from center

--fovy

default GUI camera fovy (can be modified in the GUI)

--probes_path

path of the baked GaussProbe

--numProbes

number of probes (1/8/64). In theory, any positive integer is allowed, but the released code only supports these three fixed values. (can be modified in the GUI)

--iters

count of iterations (0-10). In theory, any non-negative integer is allowed, but the released code only supports these eleven fixed values. (can be modified in the GUI)

--meshproxy_pitch

the voxel size (pitch), which determines the resolution of the mesh voxelization.

--probesW

width of the RGBD panorama

--probesH

height of the RGBD panorama

--begin_id

only to prevent OOM (Out of Memory); when GPU memory is insufficient, the process can exit and resume baking from the specified ID.

--scale_ratio

bounding box scale ratio for the mesh

--just_render

if using this argument, it will be equivalent to running renderer.py.

Renderer GUI Tutorial

The following GUI usage tutorial is provided based on the current release. It is recommended to watch this in conjunction with the video available on the project homepage.

Alt text

Move the camera

Bear resemblance to raytracing.

  1. drag rotate: move with the left mouse button.
  2. drag translation: move with the middle mouse button
  3. move closer: move with the wheel

Options and Debug

  1. Options: the main ways to control, aside from moving the camera.
  2. Debug: display the camera pose.

gbuffers in Options

Common G-buffers in typical renderers (depth, mask, normal, position), with special attention to:

  1. reflect: the reflection component (mesh). Alt text
  2. refract: the refraction component (mesh). Alt text
  3. render: the weighted sum of the reflection and refraction components using the Fresnel term (mesh). Alt text
  4. gs_render: the rendering result obtained using only traditional Gaussian primitives (3DGS). Alt text
  5. semantic: the result of hybrid rendering with Gaussians and meshes (mesh + 3DGS). Pixels belonging to the mesh are replaced with a uniform color that represents the same semantic label (e.g., purple). Alt text

camera

Select the camera model.

  1. pinhole: the regular camera model which 3DGS also supports.
  2. fisheye: It can support a field of view (FOV) of up to 180°. Alt text
  3. panorama: It can support a field of view (FOV) of 360°. Alt text

bkg

Select the background of the mesh.

  1. black: black color as the background
  2. white: white color as the background
  3. 3DGS: Hybrid rendering of 3DGS and transparent objects (reflect, refract, render, normal).

normal mode

Select whether to apply normal smoothing.

  1. raw: no Alt text
  2. smooth: yes Alt text

num probes

As changing the number of probes involves I/O overhead, it is not recommended to modify it through the GUI. It is advisable to configure it beforehand using terminal arguments. Additionally, increasing the number of probes demands more GPU memory.

num iters

Modify the count of iterations of IterQuery (0-10). In theory, any non-negative integer is allowed, but the released code only supports these eleven fixed values. Setting it to zero clearly demonstrates the superiority of the IterQuery.

FoV (y)

Modifying the field of view (FOV), particularly for fisheye cameras, allows reaching up to a 180° viewing angle.

IOR

Alt text

This mainly affects gbuffers with refract or render properties. When the IOR is approximately 1, the result is almost identical to the background, demonstrating the high quality of IterQuery (especially with 64 probes).

GS scale

Note that only the scale of the 3dgs primitives is modified, not the overall scene scaling. Therefore, reducing the scale allows us to observe the gaps between Gaussians.

spp

Control the sampling rate of mesh ray tracing.

Pick a color

Modify the color of the mesh.

Standalone demo : Segmentation

  • To release.

TODO List

  • Release the code.
  • Release the code of Standalone demo : segmentation.
  • Release the dataset of transparent objects that we captured ourselves.
  • Code optimization.

Acknowledgements

This project is built upon 3DGS, GaussianShader, GlossyGS, op43dgs, raytracing, nvdiffrast, instant-ngp, SAM2, GroundingDINO, SAM, GroundedSAM, and so on. Please follow the licenses. We thank all the authors for their great work and repos. We sincerely thank our colleagues for their valuable contributions to this project.

About

[SIGGRAPH'25 (ACM TOG)] TransparentGS: Fast Inverse Rendering of Transparent Objects with Gaussians

Topics

Resources

License

MIT, Unknown licenses found

Licenses found

MIT
LICENSE
Unknown
LICENSE.md

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 2

  •  
  •