Images within PDFs, Docx, or Powerpoints could be passed through a vision model! #92

Andresshamis · 2025-02-11T10:08:42Z

Andresshamis
Feb 11, 2025

Some files contain images with relevant information that I and many others also need to extract from within the files, not just plain text and tables. If we are able to detect which images are actually relevant, pass them through a vision LLM with a nice extraction prompt, and place the description of the image in the same order as it was in the file compared with the rest of the data, it would make this the absolute best file parser out there for LLMs!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Images within PDFs, Docx, or Powerpoints could be passed through a vision model! #92

Uh oh!

{{title}}

Uh oh!

Replies: 0 comments

Select a reply

Uh oh!

Images within PDFs, Docx, or Powerpoints could be passed through a vision model! #92

Uh oh!

Andresshamis Feb 11, 2025

Replies: 0 comments

Andresshamis
Feb 11, 2025