This project implements a variable-size image classifier using Vision Transformers (ViT) and attention mechanisms.
- Clone the repository:
git clone https://github.com/your-username/variable-size-image-classifier.git
cd variable-size-image-classifier
- Create a virtual environment and activate it:
python -m venv venv
source venv/bin/activate  # On Windows, use `venv\Scripts\activate`
- Install the required packages:
pip install -r requirements.txt
- Set up Weights & Biases:
wandb login
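
Once the steps above are done, a quick sanity check can confirm that the virtual environment is active and that the installed packages are importable. The script below is a minimal sketch and not part of the repository: the file name `check_env.py`, the helper names `in_virtualenv` and `missing_packages`, and the contents of the `required` list are all assumptions for illustration. Only `wandb` is known from the setup steps; extend the list to match whatever `requirements.txt` actually contains.

```python
# check_env.py (hypothetical helper, not shipped with the project)
import importlib.util
import sys


def in_virtualenv() -> bool:
    """Return True when the interpreter is running inside a venv."""
    # Inside a venv, sys.prefix points at the venv directory while
    # sys.base_prefix still points at the base Python installation.
    return sys.prefix != sys.base_prefix


def missing_packages(names):
    """Return the top-level module names from `names` that cannot be found."""
    # find_spec returns None when a top-level module is not installed.
    return [name for name in names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    # Assumption: "wandb" is the only package named in the setup steps;
    # add the other entries from requirements.txt here.
    required = ["wandb"]
    print("virtual environment active:", in_virtualenv())
    print("missing packages:", missing_packages(required) or "none")
```

Run it with `python check_env.py` from the activated environment. If it reports missing packages, rerunning `pip install -r requirements.txt` inside the environment is the likely fix.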
To train the model: