Skip to content

exploringweirdmachines/describe-image

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

12 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Description

Image-to-text. A tool that can be used to describe an image.

Usage

usage: main.py [-h] [-v] -i some image [-p prompt]

Describe content of an image

options:
  -h, --help            show this help message and exit
  -v, --version         show program's version number and exit
  -i some image, --input some image
                        path to the image
  -p prompt, --prompt prompt
                        optional custom prompt

Bye

Alt text

The target image

'give_thanks.png' image can be found in the folder named 'resources'.

Alt text

Requirements

My setup is:

WIN 10 + nvidia drivers 522 + cuda 11.8 + WSL2 with Ubuntu 22.04 with cuda tools + python >=3.10

see "pyproject.toml"

poetry install

About

Image-to-text tool. Describe content of an image. based on moondream model. (LLava-like but smaller model)

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published