Xinhuo_competition

This repo generates multi-language captions for images. We provide training code and two versions of the demo:

DTU version demo: This demo runs on a DTU.
GUI version demo: This demo has a graphical user interface. You can run it on GPU or CPU with a pt or onnx model. Note that it cannot run on a DTU, because the existing DTU environment does not support a GUI.
Training code: Code and dataset for training the image captioning model.

DTU version

Environment

Please make sure that the following packages are installed in your environment:

pip install --upgrade torch torchvision
pip install onnxruntime

pip install transformers
pip install datasets
pip install sacrebleu
pip install sentencepiece

pip install scikit-image
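
Before running the demo, it can help to confirm that the packages above are actually visible to the Python interpreter you plan to use. The following is a minimal sketch, not part of this repo: the helper name find_missing is hypothetical, and the list uses import names rather than pip names (for example, scikit-image is imported as skimage).

```python
import importlib.util

# Import names (not pip names) for the packages listed above.
# scikit-image is imported as "skimage"; PyTorch is imported as "torch".
REQUIRED_MODULES = [
    "torch",
    "torchvision",
    "onnxruntime",
    "transformers",
    "datasets",
    "sacrebleu",
    "sentencepiece",
    "skimage",
]


def find_missing(modules):
    """Return the top-level module names that cannot be located."""
    return [name for name in modules if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    missing = find_missing(REQUIRED_MODULES)
    if missing:
        print("Missing modules:", ", ".join(missing))
    else:
        print("All required modules were found.")
```

The check only locates each module without importing it, so it stays fast even for large packages. It does not verify versions.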

Inference

You can run this demo with the following command; the result appears in the last line of the log:

python -m inference_DTU_total --language zh --type DTU

Note that you can change the following parameters:

language: The caption language. Supported values are en, zh, de, fr, and ro.
type: Set this to torch or onnx to run inference with that model type instead of on a DTU.
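
To try several languages in one go, the command above can be assembled programmatically. This is a rough sketch under stated assumptions: the helper names build_command, last_log_line, and run_all_languages are hypothetical, the script is assumed to be run from the repository root, and the log is assumed to be written to standard output.

```python
import subprocess
import sys

# Language codes listed in the parameter description above.
LANGUAGES = ["en", "zh", "de", "fr", "ro"]


def build_command(language, backend="DTU"):
    """Return the argument list for one inference run."""
    if language not in LANGUAGES:
        raise ValueError(f"unsupported language: {language}")
    return [sys.executable, "-m", "inference_DTU_total",
            "--language", language, "--type", backend]


def last_log_line(output):
    """Return the last non-empty line of the log text."""
    lines = [line for line in output.splitlines() if line.strip()]
    return lines[-1] if lines else ""


def run_all_languages(backend="onnx"):
    """Run inference once per language and collect the final log lines.

    Assumes the current directory is the repository root and that the
    log goes to standard output (an assumption, not confirmed here).
    """
    results = {}
    for language in LANGUAGES:
        completed = subprocess.run(build_command(language, backend),
                                   capture_output=True, text=True, check=True)
        results[language] = last_log_line(completed.stdout)
    return results
```

Calling run_all_languages("torch") would then return a dictionary that maps each language code to the last line of its log.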

GUI version

You can run this demo with the following command and interact with the GUI:

python gui_demo.py
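
Since the GUI demo works with either a pt or an onnx model, one simple way to decide which runtime to use is to look at the model file extension. The sketch below only illustrates that idea; backend_for_model is a hypothetical helper, and how gui_demo.py actually selects its backend is not described here.

```python
from pathlib import Path


def backend_for_model(model_path):
    """Map a model file to a backend name based on its extension.

    Hypothetical helper: ".pt" files map to "torch", ".onnx" files to "onnx".
    """
    suffix = Path(model_path).suffix.lower()
    if suffix == ".pt":
        return "torch"
    if suffix == ".onnx":
        return "onnx"
    raise ValueError(f"unrecognized model file: {model_path}")
```

For example, backend_for_model("caption.onnx") returns "onnx", and a file with any other extension raises ValueError.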

Training code

Instructions for downloading the dataset and training the model are in PyTorch-Tutorial-to-Image-Captioning/README.md.


