vision-encoder

Star

Here are 9 public repositories matching this topic...

sathishkumar67 / PaliGemma

Star

Implementation of PaliGemma

deeplearning vlm llm siglip vision-encoder

Updated Nov 29, 2024
Python

PRITHIVSAKTHIUR / Multilabel-GeoSceneNet

Star

Multilabel-GeoSceneNet is a vision-language encoder model fine-tuned from google/siglip2-base-patch16-224 for multi-label image classification. It is designed to recognize and label multiple geographic or environmental elements in a single image using the SiglipForImageClassification architecture.

map geospatial landscape spaces gradio huggingface-transformers hugging-face siglip vision-encoder siglip2 geoscenenet

Updated Apr 23, 2025
Python

PRITHIVSAKTHIUR / Fashion-Product-Usage

Star

Fashion-Product-Usage is a vision-language model fine-tuned from google/siglip2-base-patch16-224 using the SiglipForImageClassification architecture. It classifies fashion product images based on their intended usage context.

google image-classification season gradio clothing huggingface-transformers vision-transformer vision-encoder wearing-time siglip2

Updated Apr 18, 2025
Python

PRITHIVSAKTHIUR / Multilabel-Portrait-SigLIP2

Star

Multilabel-Portrait-SigLIP2 is a vision-language model fine-tuned from google/siglip2-base-patch16-224 using the SiglipForImageClassification architecture. It classifies portrait-style images into one of the following visual portrait categories:

python google autoencoder image-classification gradio multilabel-classification portraits huggingface-transformers vision-transformer vision-encoder siglip2

Updated Apr 16, 2025
Python

PRITHIVSAKTHIUR / Coral-Health

Star

Coral-Health is an image classification vision-language encoder model fine-tuned from google/siglip2-base-patch16-224 for a single-label classification task. It is designed to classify coral reef images into two health conditions using the SiglipForImageClassification architecture.

health healthy coral coral-reefs huggingface-transformers vision-encoder siglip2 bleached

Updated Apr 28, 2025
Python

PRITHIVSAKTHIUR / shoe-type-detection

Star

shoe-type-detection is a vision-language encoder model fine-tuned from google/siglip2-base-patch16-512 for multi-class image classification. It is trained to detect different types of shoes such as Ballet Flats, Boat Shoes, Brogues, Clogs, and Sneakers. The model uses the SiglipForImageClassification architecture.

google type gradio multiclass-classification shoe huggingface-transformers huggingface-models vision-encoder siglip2

Updated Jun 7, 2025
Python

PRITHIVSAKTHIUR / PussyCat-vs-Doggie-SigLIP2

Star

PussyCat-vs-Doggie-SigLIP2 is an image classification vision-language encoder model fine-tuned from google/siglip2-base-patch16-224 for a single-label classification task. It is designed to classify images as either a cat or a dog using the SiglipForImageClassification architecture.

cat dog prediction classification image-classification gradio huggingface-transformers vision-encoder siglip2

Updated Apr 19, 2025
Python

PRITHIVSAKTHIUR / Flood-Image-Detection

Star

Flood-Image-Detection is a vision-language encoder model fine-tuned from google/siglip2-base-patch16-512 for binary image classification. It is trained to detect whether an image contains a flooded scene or non-flooded environment. The model uses the SiglipForImageClassification architecture.

google disaster flood gradio flooding huggingface-transformers vision-transformer vision-encoder siglip2

Updated May 27, 2025
Python

sitamgithub-MSIT / siglip2-litserve

Star

Leverage SigLIP 2's capabilities using LitServe.

python deep-learning transformers artificial-intelligence fastapi lightning-ai zero-shot-image-classification siglip litserve vision-encoder

Updated Feb 28, 2025
Python

Improve this page

Add a description, image, and links to the vision-encoder topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the vision-encoder topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

vision-encoder

Here are 9 public repositories matching this topic...

sathishkumar67 / PaliGemma

PRITHIVSAKTHIUR / Multilabel-GeoSceneNet

PRITHIVSAKTHIUR / Fashion-Product-Usage

PRITHIVSAKTHIUR / Multilabel-Portrait-SigLIP2

PRITHIVSAKTHIUR / Coral-Health

PRITHIVSAKTHIUR / shoe-type-detection

PRITHIVSAKTHIUR / PussyCat-vs-Doggie-SigLIP2

PRITHIVSAKTHIUR / Flood-Image-Detection

sitamgithub-MSIT / siglip2-litserve

Improve this page

Add this topic to your repo