- LVLM: ARC Prize, Chatbot, LLM Benchmark, OmniSpatial, SuperCLUE
- Detection: Zero-Shot COCO, Open-Voc COCO, Open-Voc LVIS
- Other: KITTI, Mip-NeRF 360
Install the basic requirements:
pip install -r requirements.txt
Select your preferences on the PyTorch installation page and run the generated command to install PyTorch >= 2.5.1 locally.
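To confirm that the installed PyTorch meets the version requirement above, a small check like the following may help. This is a minimal sketch using only the standard library; the helper names `version_tuple` and `meets_minimum` are hypothetical and not part of this repository. It compares only the numeric part of the version, so local build suffixes such as `+cu121` and pre-release tags are ignored.

```python
import re
from importlib import metadata


def version_tuple(version: str) -> tuple:
    """Turn a version string such as '2.5.1+cu121' into (2, 5, 1)."""
    core = re.match(r"\d+(\.\d+)*", version)
    if core is None:
        raise ValueError(f"unrecognized version string: {version!r}")
    return tuple(int(part) for part in core.group().split("."))


def meets_minimum(installed: str, minimum: str = "2.5.1") -> bool:
    """Return True if the numeric part of `installed` is at least `minimum`."""
    return version_tuple(installed) >= version_tuple(minimum)


if __name__ == "__main__":
    try:
        # Reads the installed package metadata without importing torch.
        installed = metadata.version("torch")
    except metadata.PackageNotFoundError:
        print("torch is not installed")
    else:
        status = "OK" if meets_minimum(installed) else "too old"
        print(f"torch {installed}: {status}")
```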
pip install open_clip_torch
pip install mss git+https://github.com/facebookresearch/detectron2.git git+https://github.com/openai/CLIP.git
pip install provider pyrender trimesh
pip install gradio_imageslider gradio==4.29.0
pip install git+https://github.com/IDEA-Research/GroundingDINO.git
pip install accelerate "huggingface_hub[hf_xet]" "qwen-vl-utils[decord]" transformers==4.50.3
pip install git+https://github.com/facebookresearch/sam2.git
pip install fairscale git+https://github.com/xinyu1205/recognize-anything.git
mkdir -p checkpoints
wget -O checkpoints/tag2text_swin_14m.pth "https://huggingface.co/spaces/xinyu1205/recognize-anything/resolve/main/tag2text_swin_14m.pth?download=true"
pip install git+https://github.com/facebookresearch/vggt.git
pip install diffusers
Install the following packages to run the web server:
pip install fastapi requests uvicorn
Create a Python file that stores the API as a dictionary in a variable named `FUNCTIONS`, and set the path to that file in `server.py`.
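As an illustration, such a file could look like the sketch below. The exact schema that `server.py` expects is not specified here, so this assumes the dictionary maps an API name to a Python callable; check `server.py` before relying on it. The file name `my_functions.py` and the functions `add` and `echo` are hypothetical.

```python
# my_functions.py (hypothetical file name)
# Assumption: FUNCTIONS maps an API name to the callable that implements it.


def add(a: float, b: float) -> float:
    """Return the sum of two numbers."""
    return a + b


def echo(text: str) -> str:
    """Return the input text unchanged."""
    return text


# The variable name FUNCTIONS is the one the server looks for.
FUNCTIONS = {
    "add": add,
    "echo": echo,
}
```

Under this assumption, each entry can be looked up by name and called directly, for example `FUNCTIONS["add"](1, 2)`.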
Then, run the server using the command below:
uvicorn server:app