Skip to content

Commit 8273c06

Browse files
authored
Merge pull request #279 from davidberenstein1957/add-structure-extraction-vlm
Add structure extraction vlm
2 parents 0eec604 + 0f3d86b commit 8273c06

File tree

3 files changed

+405
-1
lines changed

3 files changed

+405
-1
lines changed

notebooks/en/_toctree.yml

Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -108,6 +108,8 @@
108108
title: Smol Multimodal RAG, Building with ColSmolVLM and SmolVLM on Colab's Free-Tier GPU
109109
- local: fine_tuning_vlm_dpo_smolvlm_instruct
110110
title: Fine-tuning SmolVLM using direct preference optimization (DPO) with TRL on a consumer GPU
111+
- local: structured_generation_vision_language_models
112+
title: Structured Generation from Images or Documents Using Vision Language Models
111113

112114
- title: Search Recipes
113115
isExpanded: false

notebooks/en/index.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -7,11 +7,11 @@ applications and solving various machine learning tasks using open-source tools
77

88
Check out the recently added notebooks:
99

10+
- [Structured Generation from Images or Documents Using Vision Language Models](structured_generation_vision_language_models)
1011
- [Vector Search on Hugging Face with the Hub as Backend](vector_search_with_hub_as_backend)
1112
- [Multi-Agent Order Management System with MongoDB](mongodb_smolagents_multi_micro_agents)
1213
- [Scaling Test-Time Compute for Longer Thinking in LLMs](search_and_learn)
1314
- [Signature-Aware Model Serving from MLflow with Ray Serve](mlflow_ray_serve)
14-
- [Fine-tuning SmolVLM using direct preference optimization (DPO) with TRL on a consumer GPU](fine_tuning_vlm_dpo_smolvlm_instruct)
1515

1616
You can also check out the notebooks in the cookbook's [GitHub repo](https://github.com/huggingface/cookbook).
1717

0 commit comments

Comments
 (0)