Hi everyone,
I'm looking for recommendations on the open-source vision model best suited to analyzing screenshots.
I plan to run it locally with either Ollama or LM Studio, mainly because the screenshots often contain highly sensitive information such as passwords, bank account numbers, and other personal data.
To be honest, I'm not comfortable using cloud-based services like Gemini for this task due to major privacy concerns. I need a solution that can run completely offline on my own machine.
Could anyone recommend a powerful and reliable vision model that is well-suited for this kind of screenshot analysis and works well with Ollama or LM Studio?
Thanks in advance for your help!