docling-serve: Why smovlm and not smoldocling? #1508

thistleknot · 2025-05-02T01:13:23Z

thistleknot
May 2, 2025

isn't smodocling made for docling?

INFO: 127.0.0.1:51899 - "GET /v1alpha/status/poll/32e34cdd-7385-4bf4-bb57-2b3bec316137?wait=5 HTTP/1.1" 200 OK
ERROR:docling_serve.gradio_ui:Error processing file(s): Task failed with status 'failure'
Error while downloading from https://huggingface.co/HuggingFaceTB/SmolVLM-256M-Instruct/resolve/7e3e67edbbed1bf9888184d9df282b700a323964/model.safetensors: [SSL: DECRYPTION_FAILED_OR_BAD_RECORD_MAC] decryption failed or bad record mac (_ssl.c:2536)

I think I can solve the error, but curious why SmolVLM-256M vs SmolDocling?

dolfim-ibm · 2025-05-02T07:13:18Z

dolfim-ibm
May 2, 2025
Maintainer

SmolDocling is actually a fine-tuning of SmolVLM, which introduces the concept of DocTags for converting documents. You can read more in the respective paper.

I think you are referring to the picture description feature in Docling. This functionality doesn't convert the document (i.e. the SmolDocling task) but it is only making a textual description of the figures. For this task you can leverage almost any vision-language model (SmolVLM, Granite Vision, etc)

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

docling-serve: Why smovlm and not smoldocling? #1508

Uh oh!

{{title}}

Uh oh!

Replies: 1 comment

Uh oh!

{{title}}

Uh oh!

Select a reply

Uh oh!

docling-serve: Why smovlm and not smoldocling? #1508

Uh oh!

thistleknot May 2, 2025

Replies: 1 comment

Uh oh!

dolfim-ibm May 2, 2025 Maintainer

thistleknot
May 2, 2025

dolfim-ibm
May 2, 2025
Maintainer