Official implementation of the standalone semantic segmentation network (M4Semantic) presented in the paper "Co-SemDepth: Fast Joint Semantic Segmentation and Depth Estimation on Aerial Images".
M4Semantic is a lightweight deep architecture for semantic segmentation of RGB images captured in outdoor environments by a camera moving with 6 degrees of freedom (6 DoF).
Please refer to Co-SemDepth for the implementation of the joint architecture.
If you use this code, please cite our paper:
@misc{alaaeldin2025cosemdepthfastjointsemantic,
title={Co-SemDepth: Fast Joint Semantic Segmentation and Depth Estimation on Aerial Images},
author={Yara AlaaEldin and Francesca Odone},
year={2025},
eprint={2503.17982},
archivePrefix={arXiv},
primaryClass={cs.CV},
url={https://arxiv.org/abs/2503.17982},
}
Starting from a fresh Anaconda environment with python=3.8, first install TensorFlow 2.7:
pip install tensorflow-gpu==2.7.1
Then, install the other required libraries:
pip install pandas pillow
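To verify the setup, you can run a quick sanity check in Python (generic TensorFlow calls, not part of this repository):
import tensorflow as tf

print(tf.__version__)                          # expected: 2.7.1
print(tf.config.list_physical_devices('GPU'))  # should list at least one GPU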
Mid-Air [1]
To download the Mid-Air dataset necessary for training and testing our architecture, do the following:
- Go to the download page of the Mid-Air dataset
- Select the "Left RGB", "Semantic seg." and "Stereo Disparity" image types
- Scroll to the end of the page and press "Get download links"
Once you have the download_config.txt file, execute this script to download and extract the dataset:
bash scripts/0a-get_midair.sh path/to/desired/dataset/location path/to/download_config.txt
Apply the semantic class mapping on Mid-Air by running the following script:
python scripts/data_class_mapping.py
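Conceptually, this step remaps the raw Mid-Air semantic IDs to the reduced class set used for training. Below is a minimal sketch of that kind of remapping, with placeholder IDs rather than the actual table used in scripts/data_class_mapping.py:
import numpy as np
from PIL import Image

RAW_TO_TRAIN = {0: 0, 1: 1, 2: 1, 3: 2}  # placeholder raw_id -> train_id pairs

def remap_label_map(path_in, path_out):
    labels = np.array(Image.open(path_in))   # per-pixel class IDs
    lut = np.zeros(256, dtype=np.uint8)      # lookup table covering all byte IDs
    for raw_id, train_id in RAW_TO_TRAIN.items():
        lut[raw_id] = train_id
    Image.fromarray(lut[labels]).save(path_out)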
Aeroscapes [2]
Download Aeroscapes from the link and place it inside the "datasets" folder.
To train on MidAir:
bash scripts/1a-train-midair.sh path/to/desired/weights/location
To train on Aeroscapes:
bash scripts/1a-train-aeroscapes.sh path/to/desired/weights/location
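For example, to train on Mid-Air and store the checkpoints under weights/midair/ (the directory the prediction commands below read from):
bash scripts/1a-train-midair.sh weights/midair/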
The pre-trained weights can be downloaded from here and extracted into the weights folder.
For evaluation:
bash scripts/2-evaluate.sh dataset path/to/weights/location
where dataset can be midair or aeroscapes.
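For example, to evaluate on Mid-Air with weights placed in weights/midair/:
bash scripts/2-evaluate.sh midair weights/midair/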
Please note that, because of differences between environments, you may need to train the network from scratch in your own conda environment to obtain the same results reported in the paper.
For prediction and saving the output semantic segmentation maps, run the following for Mid-Air:
python main.py --mode=predict --dataset="midair" --arch_depth=5 --ckpt_dir="weights/midair/" --records="data/midair/test_data/"
For Aeroscapes:
python main.py --mode=predict --dataset="aeroscapes" --arch_depth=5 --ckpt_dir="weights/aeroscapes/" --records="data/aeroscapes/test_data/"
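Assuming the predicted segmentation maps are saved as single-channel PNG label maps (check the prediction code for the actual output format and paths, which may differ), a quick way to colorize one for visual inspection:
import numpy as np
from PIL import Image

PALETTE = np.array([[0, 0, 0], [34, 139, 34], [70, 130, 180], [128, 128, 128]],
                   dtype=np.uint8)   # placeholder: one RGB color per class; extend to cover all class IDs

pred = np.array(Image.open("predictions/0001.png"))     # hypothetical output file
Image.fromarray(PALETTE[pred]).save("predictions/0001_color.png")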
To run the network on your own dataset, you need to write a dataloader for it similar to dataloaders/midair.py. You also need to generate the data files by writing a data generator script similar to scripts/midair-split-generator.py. For depth training and prediction, your dataset should additionally have per-frame camera location information to generate the data files.
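As a rough illustration only (the exact columns and file format expected by the dataloaders may differ, and all paths below are hypothetical), such a generator could pair image and label paths into a data file:
import os
import pandas as pd

def make_split(image_dir, label_dir, out_csv):
    names = sorted(os.listdir(image_dir))          # assumes matching file names
    rows = [{"image": os.path.join(image_dir, n),
             "label": os.path.join(label_dir, n)} for n in names]
    pd.DataFrame(rows).to_csv(out_csv, index=False)

make_split("datasets/mydata/images", "datasets/mydata/labels",
           "data/mydata/test_data/split.csv")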
[TO BE ADDED]
[1]
@inproceedings{Fonder2019MidAir,
author = {Fonder, Michael and Van Droogenbroeck, Marc},
title = {Mid-Air: A multi-modal dataset for extremely low altitude drone flights},
booktitle = {IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)},
year = {2019},
month = {June}
}
[2]
@inproceedings{aeroscapes,
title={Ensemble knowledge transfer for semantic segmentation},
author={Nigam, Ishan and Huang, Chen and Ramanan, Deva},
booktitle={2018 IEEE Winter Conference on Applications of Computer Vision (WACV)},
pages={1499--1508},
year={2018},
organization={IEEE}
}