GitHub - vladman-25/Image-Captioning-Experiments

In this work I propose a set of experiments on the Image Captioning for Romanian Language task, starting with an automatically translated and humanly revised "Flickr30k" dataset, on which I worked among two other students. The architectures used for these experiments include Convolutional Networks, Recurrent Networks and both Visual and Text Transformers. The results are not comparable to other state-of-the-art works, but one of the models proposed is generating pretty decent and accurate captions.

Dataset available here: https://github.com/dima331453/Flickr30k-Romanian

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
CustomEmbeddings		CustomEmbeddings
paper		paper
weights/EfficientNetB4-v4-10.tf		weights/EfficientNetB4-v4-10.tf
Image_Captioning_Framework_Inference.ipynb		Image_Captioning_Framework_Inference.ipynb
Image_Captioning_Framework_RNN.ipynb		Image_Captioning_Framework_RNN.ipynb
Image_Captioning_Framework_Transformers.ipynb		Image_Captioning_Framework_Transformers.ipynb
README.MD		README.MD

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

About

Uh oh!

Releases

Packages

Languages

vladman-25/Image-Captioning-Experiments

Folders and files

Latest commit

History

Repository files navigation

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages