This project uses OpenAI's CLIP model to find images in a folder that match a user provided text description. It can analyze a folder of images, compute their similarity to a given description, and display or save the best matches.
- Streamlit: For the web application interface.
- PyTorch: For leveraging the CLIP model.
- Hugging Face Transformers: To load the pretrained CLIP model and processor.
- Pillow (PIL): For image processing.
shutil
: For file copying and management.
-
Add support for more image formats (e.g., GIFs, BMP).
-
Enhance the user interface with drag-and-drop functionality for folder selection.
-
Integrate GPU acceleration for faster image processing.
-
Extend the application to process videos by extracting frames and performing similarity checks.
For any inquiries or feedback, please contact:
Email: vishnumkpn@gmail.com
GitHub: VishnuMuraliKr