Central Asian Food Scenes Dataset

In this work, we propose the first Central Asia Food Scenes Dataset that contains 21,306 images with 69,856 instances across 239 food classes. To make sure that the dataset contains various food items, we took as a benchmark the ontology of Global Individual Food Tool developed by Food and Agriculture Organization (FAO) together with the World Health Organization (WHO) [1]. The dataset contains food items across 18 coarse classes: 🍅 Vegetables • 🥖 Baked goods • 🍲 Cooked dishes • 🍎 Fruits • 🌿 Herbs • 🍖 Meat dishes • 🍰 Desserts • 🥗 Salads • 🥫 Sauces • 🥤 Drinks • 🧀 Dairy • 🍔 Fast food • 🍜 Soups • 🍟 Sides • 🥜 Nuts • 🥒 Pickled & fermented • 🥚 Egg products • 🌾 Cereals

The images come from:

🤖 15,939 web-scraped images (i.e., Google, YouTube, Yandex)
📸 2,324 everyday-life photos
🎥 3,043 video frames (1 fps)

The dataset has been checked and cleaned for duplicates using the Python Hash Image library. Furthermore, we have also filtered out images less than 30 kB in size and replaced them by performing additional iterative data scraping and duplicate check to make sure the high quality of the dataset.

📊 Class Balance

The dataset is unbalaced. The statistics across high-level 18 classes is shown on Figure below.

🖼️ Annotation Examples

Sample Food scenes annotated images are shown below. Figures illustrate annotated food scenes samples based on our annotation rules that we have followed to create the dataset: the liquid objects such as beverages and soups are annotated together with the dish itself (see upper image), solid food items are annotated without the plate, in case one class is located on top of another class the annotations are made as shown on lower image; in case one class is obscured by another class and the rest of the background class is not visible we highlight only the visible part (see `Salad leaves' class annotation on the lower image).

🔀 Dataset Splits

Fold	Train	Valid	Test
images	17,046	2,084	2,176
instances	55,422	7,062	7,381

📥 Download the Dataset

You can get the data in two ways:

Hugging Face Hub
Browse or fetch individual files from our HF repo:
🔗 Central Asian Food Scenes Dataset
Direct download
Grab the full dataset archive here:
▶️ Download ZIP

📈 Training Results

Training results of different versions of the YOLOv8 model on the Central Asian Food Scenes Dataset, model sizes, and inference times.

🛠️ Pre-trained Models

Pre-trained weights of the YOLOv8 model can be downloaded using these links:

YOLOv8n: https://issai.nu.edu.kz/wp-content/themes/issai-new/data/models/CAFSD/yolov8n.pt
YOLOv8s: https://issai.nu.edu.kz/wp-content/themes/issai-new/data/models/CAFSD/yolov8s.pt
YOLOv8m: https://issai.nu.edu.kz/wp-content/themes/issai-new/data/models/CAFSD/yolov8m.pt
YOLOv8l: https://issai.nu.edu.kz/wp-content/themes/issai-new/data/models/CAFSD/yolov8l.pt
YOLOv8x: https://issai.nu.edu.kz/wp-content/themes/issai-new/data/models/CAFSD/yolov8x.pt

📖 References

[1] European Commission: Impact assessment on measures addressing food waste to complete swd (2014) 207 regarding the review of EU waste management target (2014). Accessed 5-22-2024.

📝 Citation

In case of using our dataset and/or pre-trained models, please cite our work:

@article{Karabay2025,
  author    = {Karabay, Aknur and Varol, Huseyin Atakan and Chan, Mei Yen},
  title     = {Improved food image recognition by leveraging deep learning and data-driven methods with an application to Central Asian Food Scene},
  journal   = {Scientific Reports},
  volume    = {15},
  number    = {1},
  pages     = {14043},
  year      = {2025},
  doi       = {10.1038/s41598-025-95770-9},
  url       = {https://doi.org/10.1038/s41598-025-95770-9},
  issn      = {2045-2322}
}

📚 View the publication

Name		Name	Last commit message	Last commit date
Latest commit History 41 Commits
figures		figures
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Central Asian Food Scenes Dataset

📊 Class Balance

🖼️ Annotation Examples

🔀 Dataset Splits

📥 Download the Dataset

📈 Training Results

🛠️ Pre-trained Models

📖 References

📝 Citation

About

Uh oh!

Releases

Packages

Uh oh!

License

IS2AI/Central_Asian_Food_Scenes_Dataset

Folders and files

Latest commit

History

Repository files navigation

Central Asian Food Scenes Dataset

📊 Class Balance

🖼️ Annotation Examples

🔀 Dataset Splits

📥 Download the Dataset

📈 Training Results

🛠️ Pre-trained Models

📖 References

📝 Citation

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Packages