-
Hi! |
Beta Was this translation helpful? Give feedback.
Replies: 1 comment 2 replies
-
Hi there, I'm currently having a lot of success with the qwen2.5vl:7b model running on a Nvidia 4070Super. It's much better than the minicpm-v. I also tried granite3.2Vision but found it unsatisfactory. Mistral-small3.1 was also very good, but I could only run it via CPU which made it unreasonably slow to process multiple documents. I'm also using mistral-nemo for the suggestions - it follows instructions well as long as the prompts are prescriptive. The OCR part is pretty important though - garbage in garbage out if the OCR is not great. |
Beta Was this translation helpful? Give feedback.
Hi there,
I'm currently having a lot of success with the qwen2.5vl:7b model running on a Nvidia 4070Super. It's much better than the minicpm-v.
I also tried granite3.2Vision but found it unsatisfactory. Mistral-small3.1 was also very good, but I could only run it via CPU which made it unreasonably slow to process multiple documents.
I'm also using mistral-nemo for the suggestions - it follows instructions well as long as the prompts are prescriptive. The OCR part is pretty important though - garbage in garbage out if the OCR is not great.