Change the repository type filter
All
Repositories list
46 repositories
verl-tool
PublicVLM2Vec
PublicThis repo contains the code for "VLM2Vec: Training Vision-Language Models for Massive Multimodal Embedding Tasks" [ICLR 2025]Vamba
PublicVideoEval-Pro
PublicMore reliable Video Understanding EvaluationTheoremExplainAgent
PublicOfficial Repo for "TheoremExplainAgent: Towards Video-based Multimodal Explanations for LLM Theorem Understanding" [ACL 2025 oral]ABC
PublicABC: Achieving Better Control of Multimodal Embeddings using VLMs [TMLR2025]CritiqueFineTuning
PublicCode for "Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate" [COLM 2025]ScholarCopilot
PublicMEGA-Bench
PublicThis repo contains the code for "MEGA-Bench Scaling Multimodal Evaluation to over 500 Real-World Tasks" [ICLR2025]Pixel-Reasoner
PublicDisProtEdit
PublicQuickVideo
PublicQuick Long Video UnderstandingGeneral-Reasoner
PublicVisCoder
PublicOne-Shot-CFT
PublicVL-Rethinker
PublicStructEval
PublicQuickCodec
PublicVisualWebInstruct
PublicImagenHub
PublicA one-stop library to standardize the inference and evaluation of all the conditional image generation models. [ICLR 2024]AceCoder
PublicMantis
PublicMMLU-Pro
PublicVISTA
PublicLongICLBench
PublicCode and Data for "Long-context LLMs Struggle with Long In-context Learning" [TMLR2025]VideoScore
PublicVideoGenHub
PublicA one-stop library to standardize the inference and evaluation of all the conditional video generation models.PixelWorld
PublicKB-BINDER
PublicOmniEdit
PublicOfficial Repo for Paper "OmniEdit: Building Image Editing Generalist Models Through Specialist Supervision" [ICLR2025]