M-Fannilla/multilabel_vision

Variable Size Image Classifier

This project implements a variable-size image classifier using Vision Transformers (ViT) and attention mechanisms.
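The README does not say how images of different sizes are batched. As a rough illustration only, one common approach with ViT-style models is to split each image into fixed-size patches, pad every patch sequence to the longest one in the batch, and pass a mask so attention ignores the padding. The sketch below shows just the bookkeeping for that idea. The patch size of 16 and the helper names `num_patches` and `pad_batch` are assumptions made for this example, not names taken from the repository.

```python
from typing import List, Tuple

PATCH = 16  # assumed patch size; not taken from the repository


def num_patches(height: int, width: int, patch: int = PATCH) -> int:
    """Count the patches needed to cover an image, rounding partial patches up."""
    rows = -(-height // patch)  # ceiling division
    cols = -(-width // patch)
    return rows * cols


def pad_batch(lengths: List[int]) -> Tuple[int, List[List[bool]]]:
    """Return the padded sequence length and one mask per image (True = real patch)."""
    max_len = max(lengths)
    masks = [[i < n for i in range(max_len)] for n in lengths]
    return max_len, masks


# Three hypothetical image sizes given as (height, width).
sizes = [(224, 224), (160, 320), (100, 50)]
lengths = [num_patches(h, w) for h, w in sizes]
max_len, masks = pad_batch(lengths)
print(lengths, max_len)
```

In a real model, the mask would typically be handed to the attention layers so that padded positions contribute nothing to the output.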

Setup

  1. Clone the repository:

    git clone https://github.com/M-Fannilla/multilabel_vision.git
    cd multilabel_vision
    
  2. Create a virtual environment and activate it:

    python -m venv venv
    source venv/bin/activate  # On Windows, use `venv\Scripts\activate`
    
  3. Install the required packages:

    pip install -r requirements.txt
    
  4. Set up Weights & Biases:

    wandb login
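After the steps above, a quick sanity check can confirm that the interpreter is running inside the virtual environment and whether a Weights & Biases API key is exposed through the `WANDB_API_KEY` environment variable. This is a minimal sketch and not part of the repository; the helper names are hypothetical. Note that `wandb login` normally stores credentials in a local file, so an unset variable does not by itself mean the login failed.

```python
import os
import sys
from typing import Mapping


def in_virtualenv() -> bool:
    """True when the interpreter runs inside a venv created with `python -m venv`."""
    return sys.prefix != sys.base_prefix


def wandb_key_in_env(env: Mapping[str, str] = os.environ) -> bool:
    """True when a non-empty WANDB_API_KEY is present in the given environment."""
    return bool(env.get("WANDB_API_KEY"))


print("virtual environment active:", in_virtualenv())
print("WANDB_API_KEY set:", wandb_key_in_env())
```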
    

Usage

Training

To train the model:
