This repo is the official implementation of the extended version of our CVPR 2025 paper "Implicit Correspondence Learning for Image-to-Point Cloud Registration"
by Xinjun Li, Wenfei Yang, Jiacheng Deng, Zhixin Cheng, Xu Zhou, and Tianzhu Zhang.
The extended version primarily includes the following additions:
- We design a coarse-to-fine strategy to refine the image-to-point cloud correspondences and the camera pose, which improves performance at a lower computational cost.
- We conduct more experiments to clarify the effectiveness and limitations of the proposed method.
We will soon release a preprint of the extended paper, where you can find more details.
Please use the following command for installation.
# 1. Create a new environment (recommended)
conda create -n ICLI2P python==3.8
conda activate ICLI2P
# 2. Install vision3d following https://github.com/qinzheng93/vision3d
Since we made some modifications to the vision3d codebase — for example, the original vision3d does not support the nuScenes dataset — we provide the modified version used in our experiments. The code has been tested on Python 3.8, PyTorch 1.13.1, Ubuntu 22.04, GCC 11.3 and CUDA 11.7, but it should work with other configurations.
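For reference, a full setup following these steps looks roughly like the commands below. The torch/torchvision versions match the tested configuration above; the requirements.txt file and the setup.py develop step follow the upstream vision3d layout and are assumptions here, so adapt them if the bundled copy differs.
# Install PyTorch 1.13.1 built for CUDA 11.7 (the tested configuration)
pip install torch==1.13.1+cu117 torchvision==0.14.1+cu117 --extra-index-url https://download.pytorch.org/whl/cu117
# Install the remaining dependencies and build the modified vision3d
# (requirements.txt and "setup.py develop" are assumed from the upstream vision3d repo)
pip install -r requirements.txt
python setup.py develop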
We provide pre-trained weights from BaiduYun (extraction code: 54s4). Please download the latest weights and place them in the appropriate directory:
kitti/ (or nuscenes/)
└── stage_1/ (or stage_2/)
    └── workspace/
        └── vision3d-output/
            └── stage_1/ (or stage_2/)
                └── checkpoints/
Make sure to choose the correct dataset (kitti or nuscenes) and stage (stage_1 or stage_2) accordingly.
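For example, for the KITTI Stage 1 weights (the downloaded file name below is hypothetical):
# Place the downloaded Stage 1 weights for KITTI (the file name "kitti_stage_1.pth" is illustrative)
mkdir -p kitti/stage_1/workspace/vision3d-output/stage_1/checkpoints
mv kitti_stage_1.pth kitti/stage_1/workspace/vision3d-output/stage_1/checkpoints/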
You can download both the prepared KITTI and nuScenes datasets from the link provided by CorrI2P.
Our training process consists of two stages. In stage 1, we train only the GPDM (Geometric Prior-guided Overlapping Region Detection Module) using a classification loss and a frustum-pose loss for 20 epochs. In stage 2, we train the entire network for another 20 epochs, while keeping the parameters of the GPDM frozen.
The code is in `kitti/stage_1`. Use the following command for training.
CUDA_VISIBLE_DEVICES=0 python trainval.py
The code is in `kitti/stage_2`.
Save the checkpoint from Stage 1 to:
kitti/stage_2/workspace/vision3d-output/stage_2/checkpoints/
Note: Make sure to save the checkpoint under a name of the form epoch-xx.pth instead of checkpoint.pth, so that the training in Stage 2 can properly resume from the beginning.
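For example (both file names below are illustrative; use whichever Stage 1 epoch you want to initialize from):
# Copy the chosen Stage 1 checkpoint into the Stage 2 workspace; make sure the target
# name follows the epoch-xx.pth pattern (epoch 20 below is illustrative)
mkdir -p kitti/stage_2/workspace/vision3d-output/stage_2/checkpoints
cp kitti/stage_1/workspace/vision3d-output/stage_1/checkpoints/epoch-20.pth \
   kitti/stage_2/workspace/vision3d-output/stage_2/checkpoints/epoch-20.pth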
Use the following command for training.
CUDA_VISIBLE_DEVICES=0 python trainval.py --resume
The code is in `nuscenes/stage_1`. Use the following command for training.
CUDA_VISIBLE_DEVICES=0 python trainval.py
The code is in `nuscenes/stage_2`.
Save the checkpoint from Stage 1 to:
nuscenes/stage_2/workspace/vision3d-output/stage_2/checkpoints/
Note: Make sure to save the checkpoint under a name of the form epoch-xx.pth instead of checkpoint.pth (as in the KITTI example above), so that the training in Stage 2 can properly resume from the beginning.
Use the following command for training.
CUDA_VISIBLE_DEVICES=0 python trainval.py --resume
To evaluate the results of Stage 1, you can run the following command:
bash eval.sh
To evaluate the results of Stage 2, you can run the following command:
bash eval.sh
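In both cases, eval.sh is assumed to live in the corresponding stage directory alongside trainval.py; for example, for Stage 2 on KITTI:
# Evaluate Stage 2 on KITTI; pick the dataset/stage directory as in the training sections above
cd kitti/stage_2
CUDA_VISIBLE_DEVICES=0 bash eval.sh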
Our code is based on 2D3D-MATR, vision3d and CorrI2P. We thank the authors for their excellent work!
If you find our work useful, please cite:
@inproceedings{li2025implicit,
  title={Implicit Correspondence Learning for Image-to-Point Cloud Registration},
  author={Li, Xinjun and Yang, Wenfei and Deng, Jiacheng and Cheng, Zhixin and Zhou, Xu and Zhang, Tianzhu},
  booktitle={Proceedings of the Computer Vision and Pattern Recognition Conference},
  pages={16922--16931},
  year={2025}
}