Enhanced Monocular Depth Estimation Network Based on Uncertainty Edge Weighting Mask.

Publication: Tsinghua Science and Technology (in press at March 2025)

Ye-Ji Kim and Byung-Gyu Kim

Intelligent Vision Processing Lab. (IVPL), Sookmyung Women's University, Seoul, Republic of Korea

Summary of paper

Abstract

Depth estimation acts as an important factor for understanding spatial relations in the field of computer vision. In particular, it plays a key role in detecting obstacles and planning safe routes in autonomous driving technology. To ensure stability, improving the accuracy of depth estimation is emerging as an important issue, and research direction in the field of computer vision is also developing with the aim of improving accuracy. Based on the analysis of an outdoor dataset in road driving, we investigate that depth network is difficult to predict accurate depth in distant regions. Accordingly, this thesis aims to improve the overall performance by improving depth prediction accuracy in far-distance areas. For this purpose, we devise new modules in the encoder and decoder, respectively. In the process of feature extraction, we propose an encoder modeling that can obtain information about distant regions where has a large loss compared to the near region. In order to obtain information about distant areas, we employ an auxiliary network. By combining the features that are aware of this information by each layer, the reinforced features are extracted for the information about the uncertainty area. In addition, we propose a methodology for generating a weighted edge map from a distance to allow the decoder generating depth to learn the edge information of the distant region. To verify this, the performance evaluation on the distant region is taken for showing the improvement of 0.006, 0.073, 0.879, and 0.303 for Abs Rel, Sq Rel, RMSE, and RMSE log, respectively, on the KITTI dataset. For the DDAD dataset, improvement of 0.417, 0.242, 2.678, and 0.333 is observed respectively. From these results, the proposed scheme can improve the overall performance by emphasizing the feature of the far-distant region.

Network Architecture

Experimental Results

KITTI

Model	Abs Rel	Sq Rel	RMSE	Checkpoint
DepthFormer	0.052	0.156	2.133	[Link]
DepthFormer+UEWM	0.050	0.144	2.105	[Google Drive]
GEDepth	0.048	0.142	2.050	[Link]
GEDepth+UEWM	0.047	0.138	2.033	[Google Drive]
MDEUncertainty	0.052	0.156	2.133	[Link]
MDEUncertainty+UEWM	0.049	0.142	2.097	[Google Drive]

DDAD

Model	Abs Rel	Sq Rel	RMSE	Checkpoint
DepthFormer	0.052	0.156	2.133	[Link]
DepthFormer+UEWM	0.049	0.144	2.061	[Google Drive]
GEDepth	0.052	0.156	2.133	[Link]
GEDepth+UEWM	0.049	0.144	2.061	[Google Drive]

Training

DepthFormer+proposed mehtod run

cd depthformer_exp
bash tools/dist_train.sh configs/depthformer/depthformer_swinl_22k_w7_kitti.py 4

GEDepth+proposed mehtod run

cd gedepth_exp
bash tools/dist_train.sh configs/depthformer/depthformer_a.py 4

MDEUncertainty+proposed mehtod run

cd mdeuncertainty_exp
python train.py --dataset kitti --encoder swin --reg_mode lin_cls --reg_supervision regression_l1 --prob_supervision soft_label --uncert_supervision error_uncertainty_ranking

Testing

DepthFormer+proposed mehtod run

cd depthformer_exp
python tools/test.py configs/depthformer/depthformer_swinl_22k_w7_kitti.py ${CHECKPOINT_FILE}

GEDepth+proposed mehtod run

cd gedepth_exp
bash tools/dist_test.sh  configs/depthformer/depthformer_a.py ${CHECKPOINT_FILE} 1

MDEUncertainty+proposed mehtod run

cd mdeuncertainty_exp
python eval.py ${CHECKPOINT_FILE}

CHECKPOINT_FILE means pretrained weight file.

Acknowledgements

The codes are heavily based on DepthFormer, GEDepth, and MDEUncertainty. I appreciate their outstanding work.

Name		Name	Last commit message	Last commit date
Latest commit History 51 Commits
depthestimation		depthestimation
images		images
temp		temp
README.md		README.md
temp.txt		temp.txt
temp2.txt		temp2.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Enhanced Monocular Depth Estimation Network Based on Uncertainty Edge Weighting Mask.

Publication: Tsinghua Science and Technology (in press at March 2025)

Ye-Ji Kim and Byung-Gyu Kim

Intelligent Vision Processing Lab. (IVPL), Sookmyung Women's University, Seoul, Republic of Korea

Summary of paper

Abstract

Network Architecture

Experimental Results

Training

Testing

Acknowledgements

About

Uh oh!

Releases

Packages

Contributors 2

Uh oh!

Languages

smu-ivpl/Monocular_Depth_UEWM

Folders and files

Latest commit

History

Repository files navigation

Enhanced Monocular Depth Estimation Network Based on Uncertainty Edge Weighting Mask.

Publication: Tsinghua Science and Technology (in press at March 2025)

Ye-Ji Kim and Byung-Gyu Kim

Intelligent Vision Processing Lab. (IVPL), Sookmyung Women's University, Seoul, Republic of Korea

Summary of paper

Abstract

Network Architecture

Experimental Results

Training

Testing

Acknowledgements

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Uh oh!

Languages

Packages