Commit 9cb4d5f

Fix typo in notebook filename and update index
- Corrected the filename from `structured_generation_vision_languag_models.ipynb` to `structured_generation_vision_language_models.ipynb`
- Updated the index.md to reflect the corrected notebook title and link
- Updated the _toctree.yml to use the corrected notebook filename
1 parent 1a4a376 commit 9cb4d5f

3 files changed: +24 -15 lines changed

notebooks/en/_toctree.yml

Lines changed: 1 addition & 1 deletion
@@ -108,7 +108,7 @@
  title: Smol Multimodal RAG, Building with ColSmolVLM and SmolVLM on Colab's Free-Tier GPU
  - local: fine_tuning_vlm_dpo_smolvlm_instruct
  title: Fine-tuning SmolVLM using direct preference optimization (DPO) with TRL on a consumer GPU
- - local: structured_generation_vision_languag_models
+ - local: structured_generation_vision_language_models
  title: Structured Generation from Documents Using Vision Language Models

  - title: Search Recipes

notebooks/en/index.md

Lines changed: 1 addition & 2 deletions
@@ -7,12 +7,11 @@ applications and solving various machine learning tasks using open-source tools

  Check out the recently added notebooks:

+ - [Structured Generation from Images or Documents Using Vision Language Models](structured_generation_vision_language_models)
  - [Multi-Agent Order Management System with MongoDB](mongodb_smolagents_multi_micro_agents)
  - [Scaling Test-Time Compute for Longer Thinking in LLMs](search_and_learn)
  - [Signature-Aware Model Serving from MLflow with Ray Serve](mlflow_ray_serve)
  - [Fine-tuning SmolVLM using direct preference optimization (DPO) with TRL on a consumer GPU](fine_tuning_vlm_dpo_smolvlm_instruct)
- - [Smol Multimodal RAG: Building with ColSmolVLM and SmolVLM on Colab's Free-Tier GPU](multimodal_rag_using_document_retrieval_and_smol_vlm)
-

  You can also check out the notebooks in the cookbook's [GitHub repo](https://github.com/huggingface/cookbook).

notebooks/en/structured_generation_vision_languag_models.ipynb renamed to notebooks/en/structured_generation_vision_language_models.ipynb

Lines changed: 22 additions & 12 deletions
@@ -4,7 +4,7 @@
  "cell_type": "markdown",
  "metadata": {},
  "source": [
- "# Structured Generation from Documents Using Vision Language Models\n",
+ "# Structured Generation from Images or Documents Using Vision Language Models\n",
  "\n",
  "We will be using the SmolVLM-Instruct model from HuggingFaceTB to extract structured information from documents We will run the VLM using the HuggingFace Transformers library and the [Outlines library](https://github.com/dottxt-ai/outlines), which facilitates structured generation based on limiting token sampling probabilities. \n",
  "\n",
@@ -132,7 +132,7 @@
  },
  {
  "cell_type": "code",
- "execution_count": 16,
+ "execution_count": 19,
  "metadata": {},
  "outputs": [],
  "source": [
@@ -188,13 +188,23 @@
  },
  {
  "cell_type": "code",
- "execution_count": 17,
+ "execution_count": 20,
  "metadata": {},
  "outputs": [
+ {
+ "name": "stderr",
+ "output_type": "stream",
+ "text": [
+ "/Users/davidberenstein/Documents/programming/huggingface/cookbook/.venv/lib/python3.11/site-packages/dill/_dill.py:414: PicklingWarning: Cannot locate reference to <class '__main__.ImageData'>.\n",
+ " StockPickler.save(self, obj, save_persistent_id)\n",
+ "/Users/davidberenstein/Documents/programming/huggingface/cookbook/.venv/lib/python3.11/site-packages/dill/_dill.py:414: PicklingWarning: Cannot pickle <class '__main__.ImageData'>: __main__.ImageData has recursive self-references that trigger a RecursionError.\n",
+ " StockPickler.save(self, obj, save_persistent_id)\n"
+ ]
+ },
  {
  "data": {
  "application/vnd.jupyter.widget-view+json": {
- "model_id": "1caa96c32bc7416ea43c192c0cd88c20",
+ "model_id": "e1d431b922334b0297195415a11cf68a",
  "version_major": 2,
  "version_minor": 0
  },
@@ -214,7 +224,7 @@
  "})"
  ]
  },
- "execution_count": 17,
+ "execution_count": 20,
  "metadata": {},
  "output_type": "execute_result"
  }
@@ -252,13 +262,13 @@
  },
  {
  "cell_type": "code",
- "execution_count": 18,
+ "execution_count": 21,
  "metadata": {},
  "outputs": [
  {
  "data": {
  "application/vnd.jupyter.widget-view+json": {
- "model_id": "843b9c88cab54402812f1b936a2dc6e0",
+ "model_id": "ab88b1b3bb1441498788bdc2c2b4cf30",
  "version_major": 2,
  "version_minor": 0
  },
@@ -272,7 +282,7 @@
  {
  "data": {
  "application/vnd.jupyter.widget-view+json": {
- "model_id": "57e5dea4ae504866b2d93863bcfa4408",
+ "model_id": "e5e359d02ede43959e92a9e5626f9ffd",
  "version_major": 2,
  "version_minor": 0
  },
@@ -286,7 +296,7 @@
  {
  "data": {
  "application/vnd.jupyter.widget-view+json": {
- "model_id": "b811febb7c044100bb74bf67016f0d0d",
+ "model_id": "9f7f07dad09f47c5a8dfdeba403845f6",
  "version_major": 2,
  "version_minor": 0
  },
@@ -300,7 +310,7 @@
  {
  "data": {
  "application/vnd.jupyter.widget-view+json": {
- "model_id": "1fa44296ea00459b8cbb22e56739117c",
+ "model_id": "e47600c765b64b55aa6f93e9cf5d077e",
  "version_major": 2,
  "version_minor": 0
  },
@@ -314,10 +324,10 @@
  {
  "data": {
  "text/plain": [
- "CommitInfo(commit_url='https://huggingface.co/datasets/davidberenstein1957/structured-generation-information-extraction-vlms-openbmb-RLAIF-V-Dataset/commit/f72002df2d9aef403afeaf6e27f4407ddd82c89c', commit_message='Upload dataset', commit_description='', oid='f72002df2d9aef403afeaf6e27f4407ddd82c89c', pr_url=None, repo_url=RepoUrl('https://huggingface.co/datasets/davidberenstein1957/structured-generation-information-extraction-vlms-openbmb-RLAIF-V-Dataset', endpoint='https://huggingface.co', repo_type='dataset', repo_id='davidberenstein1957/structured-generation-information-extraction-vlms-openbmb-RLAIF-V-Dataset'), pr_revision=None, pr_num=None)"
+ "CommitInfo(commit_url='https://huggingface.co/datasets/davidberenstein1957/structured-generation-information-extraction-vlms-openbmb-RLAIF-V-Dataset/commit/373d6a25e8301077773fc6a37899b1598cf6f8cd', commit_message='Upload dataset', commit_description='', oid='373d6a25e8301077773fc6a37899b1598cf6f8cd', pr_url=None, repo_url=RepoUrl('https://huggingface.co/datasets/davidberenstein1957/structured-generation-information-extraction-vlms-openbmb-RLAIF-V-Dataset', endpoint='https://huggingface.co', repo_type='dataset', repo_id='davidberenstein1957/structured-generation-information-extraction-vlms-openbmb-RLAIF-V-Dataset'), pr_revision=None, pr_num=None)"
  ]
  },
- "execution_count": 18,
+ "execution_count": 21,
  "metadata": {},
  "output_type": "execute_result"
  }
