Bobomurod KoyilbekV

Popular repositories

Vision-Transformer-papers Public

Forked from NielsRogge/Vision-Transformer-papers

This repository contains an overview of important follow-up works based on the original Vision Transformer (ViT) by Google.
Awesome-Audio-LLM Public

Forked from AudioLLMs/Awesome-Audio-LLM

Audio Large Language Models

Python
Video-LLaMA Public

Forked from DAMO-NLP-SG/Video-LLaMA

[EMNLP 2023 Demo] Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding

Python
Awesome-CLIP Public

Forked from yzhuoning/Awesome-CLIP

Awesome list for research on CLIP (Contrastive Language-Image Pre-Training).
Mamba-in-Computer-Vision Public

Forked from maklachur/Mamba-in-Computer-Vision

Mamba in Vision: A Comprehensive Survey of Techniques and Applications
pdf-to-podcast Public

Forked from NVIDIA-AI-Blueprints/pdf-to-podcast

Transform PDFs into AI podcasts for engaging on-the-go audio content.

Python