Spherical Frustum Sparse Convolution Network for LiDAR Point Cloud Semantic Segmentation

NeurIPS 2024

Yu Zheng* · Guangming Wang* · Jiuming Liu · Marc Pollefeys · Hesheng Wang#

TL;DR: We propose the spherical frustum structure to avoid quantized information loss in conventional 2D spherical projection for LiDAR point cloud semantic segmentation.

📰 News

[26/Sept/2024] Our paper has been accepted as a poster at NeurIPS 2024.

📄 Abstract

LiDAR point cloud semantic segmentation enables robots to obtain fine-grained semantic information about the surrounding environment. Recently, many works project the point cloud onto a 2D image and adopt 2D Convolutional Neural Networks (CNNs) or vision transformers for LiDAR point cloud semantic segmentation. However, since more than one point can be projected onto the same 2D position but only one point can be preserved, previous 2D projection-based segmentation methods suffer from inevitable quantized information loss, which results in an incomplete geometric structure, especially for small objects. To avoid quantized information loss, in this paper, we propose a novel spherical frustum structure, which preserves all points projected onto the same 2D position. Additionally, a hash-based representation is proposed for memory-efficient spherical frustum storage. Based on the spherical frustum structure, the Spherical Frustum sparse Convolution (SFC) and Frustum Farthest Point Sampling (F2PS) are proposed to convolve and sample the points stored in spherical frustums, respectively. Finally, we present the Spherical Frustum sparse Convolution Network (SFCNet) to adopt 2D CNNs for LiDAR point cloud semantic segmentation without quantized information loss. Extensive experiments on the SemanticKITTI and nuScenes datasets demonstrate that our SFCNet outperforms previous 2D projection-based semantic segmentation methods based on conventional spherical projection and shows better performance on small object segmentation by preserving the complete geometric structure.
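To make the quantized information loss described in the abstract concrete, here is a minimal pure-Python sketch. It is not the SFCNet implementation: the image size (64 x 2048), the vertical field of view (+3° to -25°), and the helper names `spherical_cell`, `conventional_projection`, and `build_frustums` are all assumptions chosen for illustration, and a plain dictionary stands in for the hash-based representation.

```python
import math
from collections import defaultdict

def spherical_cell(point, height=64, width=2048, fov_up_deg=3.0, fov_down_deg=-25.0):
    """Return the (row, col) cell of the 2D spherical image for one 3D point, plus its depth.

    The image size and field of view are assumed values, not taken from the SFCNet config.
    """
    x, y, z = point
    depth = math.sqrt(x * x + y * y + z * z)
    yaw = math.atan2(y, x)
    pitch = math.asin(z / max(depth, 1e-8))
    fov_up = math.radians(fov_up_deg)
    fov_down = math.radians(fov_down_deg)
    # Normalize the angles to image coordinates, truncate to integers, then clamp.
    col = int(0.5 * (1.0 - yaw / math.pi) * width)
    row = int((1.0 - (pitch - fov_down) / (fov_up - fov_down)) * height)
    col = min(max(col, 0), width - 1)
    row = min(max(row, 0), height - 1)
    return (row, col), depth

def conventional_projection(points):
    """Conventional projection: each cell keeps only the index of its closest point."""
    kept = {}
    for i, p in enumerate(points):
        cell, depth = spherical_cell(p)
        if cell not in kept or depth < kept[cell][1]:
            kept[cell] = (i, depth)
    return {cell: idx for cell, (idx, _) in kept.items()}

def build_frustums(points):
    """Frustum-style storage: each cell keeps the indices of all points that fall in it."""
    frustums = defaultdict(list)
    for i, p in enumerate(points):
        cell, _ = spherical_cell(p)
        frustums[cell].append(i)
    return dict(frustums)

# Two points on the same ray at different depths, plus one point elsewhere.
points = [(1.0, 0.0, 0.0), (2.0, 0.0, 0.0), (0.0, 1.0, 0.0)]
kept = conventional_projection(points)
frustums = build_frustums(points)
print(len(kept), sum(len(v) for v in frustums.values()))
```

In this toy input the first two points land in the same cell, so the conventional projection keeps 2 of the 3 points while the frustum-style storage keeps all 3.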

📃 Results & Pretrained SFCNet models

Dataset	Val mIoU	Download
SemanticKITTI	62.9	Model Weight
nuScenes	75.9	Model Weight
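For readers unfamiliar with the metric, mIoU (mean intersection over union) averages the per-class IoU = TP / (TP + FP + FN). The sketch below computes it from a confusion matrix. The function name `mean_iou` and the choice to skip classes that never appear are assumptions for illustration; the repository's evaluation script may treat ignored labels differently.

```python
def mean_iou(confusion):
    """Mean IoU from a square confusion matrix.

    confusion[i][j] is the number of points with ground-truth class i
    that were predicted as class j. Classes that appear in neither the
    ground truth nor the predictions are skipped (an assumed convention).
    """
    n = len(confusion)
    ious = []
    for c in range(n):
        tp = confusion[c][c]
        fn = sum(confusion[c]) - tp                        # missed points of class c
        fp = sum(confusion[r][c] for r in range(n)) - tp   # other classes predicted as c
        denom = tp + fp + fn
        if denom > 0:
            ious.append(tp / denom)
    return sum(ious) / len(ious) if ious else 0.0

# Two-class toy example: IoU is 8/11 for class 0 and 9/12 for class 1.
print(mean_iou([[8, 2], [1, 9]]))
```

Multiplying the result by 100 gives a score on the same 0 to 100 scale as the table.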

🚗 Dataset Preparation

SemanticKITTI

Download the SemanticKITTI dataset from the official website and change the dataset path here.

nuScenes

Install the nuScenes devkit with:

pip install nuscenes-devkit

Use SFCNet/pp_dataset/generate_nuscenes_datas.py to generate the file list of the nuScenes dataset. First, modify the nuScenes dataset path, then run:

cd SFCNet/pp_dataset/
python generate_nuscenes_datas.py

The generated file list will be saved in SFCNet/pp_dataset/nuscenes_data.

⚙️ Environment Setup

We recommend training and testing the model on Linux (e.g., Ubuntu 20.04) with an NVIDIA GPU. The CUDA compiler tools, version 11.3, should be installed beforehand.

First, create the Python environment:

conda create -n spconv python=3.8
conda activate spconv

Then install the dependencies through pip:

pip install -r requirements.txt

Then compile the spconv operator:

bash build.sh
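If the build fails, a mismatch between the installed toolchain and the versions recommended above is a common cause. The following standard-library sketch reports the Python version and the CUDA release printed by `nvcc --version`. The helper names `parse_nvcc_release` and `check_environment` are hypothetical and not part of this repository.

```python
import re
import shutil
import subprocess
import sys

def parse_nvcc_release(text):
    """Extract (major, minor) from `nvcc --version` output, or None if not found."""
    match = re.search(r"release (\d+)\.(\d+)", text)
    return (int(match.group(1)), int(match.group(2))) if match else None

def check_environment(expected_cuda=(11, 3), expected_python=(3, 8)):
    """Compare the local Python and CUDA compiler versions with the recommended ones."""
    report = {
        "python_ok": sys.version_info[:2] == expected_python,
        "cuda_release": None,
        "cuda_ok": False,
    }
    nvcc = shutil.which("nvcc")  # None when nvcc is not on PATH
    if nvcc is not None:
        result = subprocess.run([nvcc, "--version"], capture_output=True, text=True)
        report["cuda_release"] = parse_nvcc_release(result.stdout)
        report["cuda_ok"] = report["cuda_release"] == expected_cuda
    return report

print(check_environment())
```

A `cuda_ok` value of `False` only means the detected release differs from 11.3 or that `nvcc` was not found on `PATH`; whether other CUDA versions work with this project is not stated here.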

💪 Training

Train the model on SemanticKITTI:

cd SFCNet
python train_SemanticKITTI.py --log_dir <LOG>

Train the model on nuScenes:

cd SFCNet
python train_SemanticKITTI.py --log_dir <LOG> --dataset nuscenes_trainset_spp --config config_frust_nuscenes

♐ Evaluation

Evaluate the model on SemanticKITTI (assuming the model has been placed at SFCNet/logs/log_kitti/checkpoints/best.pt):

cd SFCNet
python val_SemanticKITTI.py --log_dir logs/log_kitti

Evaluate the model on nuScenes (assuming the model has been placed at SFCNet/logs/log_nuscenes/checkpoints/best.pt):

cd SFCNet
python val_SemanticKITTI_nus.py --log_dir logs/log_nuscenes
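Both evaluation commands expect the weights at `<log_dir>/checkpoints/best.pt`. Before launching a run, a small check like the one below can confirm the files are where the commands will look. The helpers `checkpoint_path` and `missing_checkpoints` are hypothetical and written only for this example; run the snippet from inside the SFCNet directory so the relative paths resolve.

```python
from pathlib import Path

def checkpoint_path(log_dir, name="best.pt"):
    """Build the checkpoint location used in the evaluation commands above."""
    return Path(log_dir) / "checkpoints" / name

def missing_checkpoints(log_dirs):
    """Return the expected checkpoint paths that do not exist on disk."""
    return [str(checkpoint_path(d)) for d in log_dirs if not checkpoint_path(d).is_file()]

# Log directories taken from the commands above.
for path in missing_checkpoints(["logs/log_kitti", "logs/log_nuscenes"]):
    print("missing:", path)
```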

🖇️ Reference

If you find our work useful, please cite us:

@inproceedings{
    zheng2024spherical,
    title={Spherical Frustum Sparse Convolution Network for LiDAR Point Cloud Semantic Segmentation},
    author={Zheng, Yu and Wang, Guangming and Liu, Jiuming and Pollefeys, Marc and Wang, Hesheng},
    booktitle={The Thirty-eighth Annual Conference on Neural Information Processing Systems},
    year={2024},
}

📖 License

Our project is licensed under the MIT License, and the spherical frustum library is licensed under the Apache License 2.0; see the LICENSE and LICENSE_LIB files for details.
The spconv project is licensed under the Apache License 2.0. Our project is modified from the v1.1 branch of spconv.
The CUDPP hash code is licensed under the BSD License.
The SemanticKITTI dataset is licensed under the Creative Commons Attribution-NonCommercial-ShareAlike License.
The nuScenes dataset is licensed under the Creative Commons Attribution-ShareAlike 4.0 International Public License (CC BY-SA 4.0).

🤝 Acknowledgement

Our model training and testing architecture is mainly built on RandLA-Net-pytorch. The spherical frustum library is built on spconv. Many thanks to these open-source projects.
