Liked our work? Give us a ⭐!
This repository contains easy-to-use, easy-to-understand code for fine-tuning VLMs (Vision-Language Models).
With VLMs, you can ask questions about an image and receive answers. Here we work with images containing visual data representations, such as graphs and charts, using HuggingFaceM4/ChartQA as the dataset.
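For reference, loading the dataset looks roughly like this. A minimal sketch: the split and column names (`query`, `label`) follow the dataset card on the Hub, so double-check them against the notebook.

```python
from datasets import load_dataset

# Load ChartQA from the Hugging Face Hub.
dataset = load_dataset("HuggingFaceM4/ChartQA")

# Each example pairs a chart image with a question ("query")
# and its answer ("label") -- column names per the dataset card.
print(dataset["train"][0])
```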
In this case, we fine-tune Qwen/Qwen2-VL-7B-Instruct using a LoRA adapter and 4-bit quantization. Refer to the fine-tune-vlms-qwen.ipynb file.
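The setup looks roughly like the sketch below. This is an illustration, not the notebook's exact code: the LoRA hyperparameters (`r`, `lora_alpha`, `target_modules`) are assumptions chosen for the example, so see the notebook for the values actually used.

```python
import torch
from transformers import Qwen2VLForConditionalGeneration, AutoProcessor, BitsAndBytesConfig
from peft import LoraConfig, get_peft_model

# 4-bit quantization config (NF4 weights, bfloat16 compute).
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.bfloat16,
)

# Load the quantized base model and its processor.
model = Qwen2VLForConditionalGeneration.from_pretrained(
    "Qwen/Qwen2-VL-7B-Instruct",
    quantization_config=bnb_config,
    device_map="auto",
)
processor = AutoProcessor.from_pretrained("Qwen/Qwen2-VL-7B-Instruct")

# Attach a LoRA adapter to the attention projections
# (hyperparameters here are illustrative, not the notebook's).
lora_config = LoraConfig(
    r=8,
    lora_alpha=16,
    lora_dropout=0.05,
    target_modules=["q_proj", "v_proj"],
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora_config)
model.print_trainable_parameters()
```

With this setup, only the small LoRA matrices are trained while the 4-bit base weights stay frozen, which is what makes fine-tuning a 7B model feasible on a single consumer GPU.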