Change the repository type filter
All
Repositories list
34 repositories
- Unified Multimodal Model for image generation/editing/understanding
- Generate large-scale explorable 3D scenes with high-quality panorama videos from a single image or text prompt.
- Matrix-Game 2.0: An Open-Source, Real-Time, and Streaming Interactive World Model
- DeepResearchAgent is a hierarchical multi-agent system designed not only for deep research tasks but also for general-purpose task solving. The framework leverages a top-level planning agent to coordinate multiple specialized lower-level agents, enabling automated task decomposition and efficient execution across diverse and complex domains.
- Skywork-R1V is an advanced multimodal AI model series developed by Skywork AI (Kunlun Inc.), specializing in vision-language reasoning.
CSVQA
PublicA Multimodal Benchmark for Evaluating Scientific Reasoning Capabilities of VLMs- SkyReels-A1: Expressive Portrait Animation in Video Diffusion Transformers
- SkyReels-A2: Compose anything in video diffusion transformers
Mureka-mcp
Publicskyreels-a2.github.io
PublicMoH
PublicVitron
PublicNeurIPS 2024 Paper: A Unified Pixel-level Vision LLM for Understanding, Generating, Segmenting, Editing- [ICLR 2025] MoE++: Accelerating Mixture-of-Experts Methods with Zero-Computation Experts