Skip to content

Photo Semantic Finder: Replicate Google Photos' functionality of searching images with natural language queries. Users upload images and search using descriptive captions like 'car', 'child playing', powered by Salesforce's BLIP Image Captioning Large model.

Notifications You must be signed in to change notification settings

Mihaillo29/Photo-Semantic-Finder-Google-photos-replica

Repository files navigation

Photo-Semantic-Finder---Google-photos-replica

"Photo Semantic Finder: Replicate Google Photos' aiming to replicate its functionality of searching images using natural language queries. With this application, users can upload their images and search for them using descriptive captions such as "car", "child playing", or "birthdays".

How it works

  1. Upload Images: Users can upload their images through the web interface.

  2. Caption Generation: The uploaded images are passed through the BLIP Image Captioning model to generate descriptive captions.

  3. Search by Caption: Users can search for images using natural language queries. The application retrieves images whose captions match the query.

Screenshot 2024-06-09 221845


Feature included

  • Semantic Image Search: Utilizes state-of-the-art natural language processing models like Salesforce's BLIP Image Captioning Large to generate descriptive captions for uploaded images.

  • Streamlit Web Application: The frontend of the application is built using Streamlit, providing an intuitive user interface for uploading images and querying them using natural language.

  • Uploading and Preview

    Preview image Screenshot 2024-06-09 210626

  • Grid Interface

    Preview image Screenshot 2024-06-09 210724

  • Searching using description

    Preview image Screenshot 2024-06-09 205943 sol:Screenshot 2024-06-09 210543

  • others

    Preview image
    Screenshot 2024-06-09 205540

Technologies Used

  • Streamlit: Used for building the web application frontend.

  • Hugging Face Transformers: Leveraged the Salesforce/blip-image-captioning-large model for generating image captions.

  • Python: The backend and scripting language used for the application development.

Usage

Download the model from Hugging Face or else from here and then extract it in 'Models' folder

  1. Clone the repository:

    git clone https://github.com/your-username/photo-semantic-finder.git
  2. Install dependencies:

    pip install -r requirements.txt
  3. Run the Streamlit app:

    streamlit run photo_Semantics.py

Contributing

Contributions are welcome! Feel free to open issues for feature requests, bug fixes, or general improvements. Pull requests are also appreciated.

License

This project is licensed under the MIT License - see the LICENSE file for details.


About

Photo Semantic Finder: Replicate Google Photos' functionality of searching images with natural language queries. Users upload images and search using descriptive captions like 'car', 'child playing', powered by Salesforce's BLIP Image Captioning Large model.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages