Hi there,

I'm currently having a lot of success with the qwen2.5vl:7b model running on an NVIDIA RTX 4070 Super. It's much better than minicpm-v.

I also tried granite3.2-vision but found it unsatisfactory. Mistral-small3.1 was also very good, but I could only run it on the CPU, which made it unreasonably slow for processing multiple documents.

I'm also using mistral-nemo for the suggestions. It follows instructions well as long as the prompts are prescriptive. The OCR part is pretty important though: garbage in, garbage out if the OCR is not great.
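
For anyone who wants to test the OCR step in isolation, here is a minimal sketch. It assumes the vision model is served by a local Ollama instance on its default port; the model names above look like Ollama tags, but that is an assumption. The helper names `build_ocr_request` and `run_ocr` and the prompt wording are made up for illustration and are not taken from any tool mentioned here.

```python
import base64
import json
import urllib.request

# Assumption: Ollama is running locally on its default port.
OLLAMA_URL = "http://localhost:11434/api/generate"


def build_ocr_request(image_bytes, model="qwen2.5vl:7b"):
    """Build the JSON payload for a single vision-model OCR call.

    The prompt is deliberately prescriptive: it says exactly what to
    output and what to leave out.
    """
    return {
        "model": model,
        "prompt": (
            "Transcribe all text in this document image exactly as written. "
            "Output only the transcribed text, with no commentary."
        ),
        # Ollama expects images as a list of base64-encoded strings.
        "images": [base64.b64encode(image_bytes).decode("ascii")],
        # Ask for one complete JSON response instead of a token stream.
        "stream": False,
    }


def run_ocr(image_bytes, url=OLLAMA_URL):
    """Send the request to the server and return the transcribed text."""
    payload = json.dumps(build_ocr_request(image_bytes)).encode("utf-8")
    request = urllib.request.Request(
        url, data=payload, headers={"Content-Type": "application/json"}
    )
    with urllib.request.urlopen(request) as response:
        return json.loads(response.read())["response"]
```

Calling `run_ocr(open("scan.png", "rb").read())` (with a hypothetical `scan.png`) should return the raw transcription, which makes it easy to compare vision models on the same page before feeding the text to the suggestion model.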

Answer selected by icereed