✨✨Latest Papers and Benchmarks in Reasoning with Foundation Models
-
Updated
Apr 25, 2025
✨✨Latest Papers and Benchmarks in Reasoning with Foundation Models
TxAgent: An AI Agent for Therapeutic Reasoning Across a Universe of Tools
Project Page For "Seg-Zero: Reasoning-Chain Guided Segmentation via Cognitive Reinforcement"
Latest Advances on Long Chain-of-Thought Reasoning
Deep Reasoning Translation (DRT) Project
The official implement of "VisionReasoner: Unified Visual Perception and Reasoning via Reinforcement Learning"
a-m-team's exploration in large language modeling
ToolUniverse is a collection of biomedical tools designed for AI agents
Pivotal Token Search
OpenVLThinker: An Early Exploration to Vision-Language Reasoning via Iterative Self-Improvement
A Comprehensive Survey on Evaluating Reasoning Capabilities in Multimodal Large Language Models.
Official Implementation of "Reasoning Language Models: A Blueprint"
Designing Multi-Agent Systems with Zero Supervision
This is the repo of developing reasoning models in the specific domain of financial, aim to enhance models capabilities in handling financial reasoning tasks.
Lightweight replication study of DeepSeek-R1-Zero. Interesting findings include "No Aha Moment", "Longer CoT ≠ Accuracy", and "Language Mixing in Instruct Models".
Pure RL to post-train base models for social reasoning capabilities. Lightweight replication of DeepSeek-R1-Zero with Social IQa dataset.
[arxiv: 2505.02156] Adaptive Thinking via Mode Policy Optimization for Social Language Agents
To Think or Not to Think: Exploring the Unthinking Vulnerability in Large Reasoning Models
A preview-version of one novel multimodal reasoning benchmark CharmBench.
☁️ KUMO: Generative Evaluation of Complex Reasoning in Large Language Models
Add a description, image, and links to the reasoning-language-models topic page so that developers can more easily learn about it.
To associate your repository with the reasoning-language-models topic, visit your repo's landing page and select "manage topics."