GitHub - XuzhaoLi/ro-arxiv-daily: Automatically Update Arxiv Papers about Path Planning, LLM and Autonomous Driving using Github Actions since 2024.2.

Updated on 2025.06.11

Table of Contents

Path Planning
Large Language Model
Autonomous Driving

Path Planning

Publish Date	Title	Authors	PDF	Code
2025-06-10	Reinforce LLM Reasoning through Multi-Agent Reflection	Yurun Yuan et.al.	2506.08379	null
2025-06-10	Dynamical System Optimization	Emo Todorov et.al.	2506.08340	null
2025-06-09	Modelling Nonstationary Time Series using Trend-Stationary Hypothesis	Zhandos Abdikhadir et.al.	2506.07987	null
2025-06-08	Stochastic Quadratic Dynamic Programming	Vincent Guigues et.al.	2506.07314	null
2025-06-05	Resilient Pattern Mining	Pengxin Bian et.al.	2506.04935	null
2025-06-05	Composing Agents to Minimize Worst-case Risk	Guruprerana Shabadi et.al.	2506.04632	null
2025-06-04	Struct2D: A Perception-Guided Framework for Spatial Reasoning in Large Multimodal Models	Fangrui Zhu et.al.	2506.04220	null
2025-05-28	Large Neighborhood and Hybrid Genetic Search for Inventory Routing Problems	Jingyi Zhao et.al.	2506.03172	null
2025-06-03	Dynamic Programming Techniques for Enhancing Cognitive Representation in Knowledge Tracing	Lixiang Xu et.al.	2506.02949	null
2025-06-03	Reachability Weighted Offline Goal-conditioned Resampling	Wenyan Yang et.al.	2506.02577	null
2025-06-03	Multi-agent Markov Entanglement	Shuze Chen et.al.	2506.02385	null
2025-06-02	Scalable In-Context Q-Learning	Jinmei Liu et.al.	2506.01299	null
2025-06-01	Trilevel Memetic Algorithm for the Electric Vehicle Routing Problem	Ivan Milinović et.al.	2506.01065	null
2025-06-01	Q-learning with Posterior Sampling	Priyank Agrawal et.al.	2506.00917	null
2025-05-30	GridRoute: A Benchmark for LLM-Based Route Planning with Cardinal Movement in Grid Environments	Kechen Li et.al.	2505.24306	null
2025-05-30	Winners vs. Losers: Momentum-based Strategies with Intertemporal Choice for ESG Portfolios	Ayush Jha et.al.	2505.24250	null
2025-05-30	CLaSp: In-Context Layer Skip for Self-Speculative Decoding	Longze Chen et.al.	2505.24196	null
2025-05-29	Spoken Language Modeling with Duration-Penalized Self-Supervised Units	Nicol Visser et.al.	2505.23494	link
2025-05-29	Offline Map Matching Based on Localization Error Distribution Modeling	Ruilin Xu et.al.	2505.23123	null
2025-05-29	DINGO: Constrained Inference for Diffusion LLMs	Tarun Suresh et.al.	2505.23061	null
2025-05-27	Learning-Based Tracking Perimeter Control for Two-region Macroscopic Traffic Dynamics	Can Chen et.al.	2505.21818	null
2025-05-27	When to Deceive: A Cross-Layer Stackelberg Game Framework for Strategic Timing of Cyber Deception	Ya-Ting Yang et.al.	2505.21244	null
2025-05-23	Evaluating the Energy-Efficiency of the Code Generated by LLMs	Md Arman Islam et.al.	2505.20324	null
2025-05-23	URB -- Urban Routing Benchmark for RL-equipped Connected Autonomous Vehicles	Ahmet Onur Akman et.al.	2505.17734	null
2025-05-23	Distance Estimation in Outdoor Driving Environments Using Phase-only Correlation Method with Event Cameras	Masataka Kobayashi et.al.	2505.17582	null
2025-05-22	Tropical Attention: Neural Algorithmic Reasoning for Combinatorial Algorithms	Baran Hashemi et.al.	2505.17190	null
2025-05-22	Quantum Routing and Entanglement Dynamics Through Bottlenecks	Dhruv Devulapalli et.al.	2505.16948	null
2025-05-22	Reward-Aware Proto-Representations in Reinforcement Learning	Hon Tik Tse et.al.	2505.16217	null
2025-05-21	Toward Theoretical Insights into Diffusion Trajectory Distillation via Operator Merging	Weiguo Gao et.al.	2505.16024	null
2025-05-21	Families of tractable problems with respect to vertex-interval-membership width and its generalisations	Jessica Enright et.al.	2505.15699	null
2025-05-21	Deep Learning for Continuous-time Stochastic Control with Jumps	Patrick Cheridito et.al.	2505.15602	null
2025-05-19	Finding Maximum Independent Sets in Dynamic Graphs using Unsupervised Learning	Devendra Parkar et.al.	2505.13754	null
2025-05-24	Learning to Program Quantum Measurements for Machine Learning	Samuel Yen-Chi Chen et.al.	2505.13525	null
2025-05-19	Dynamic programming and dimensionality in convex stochastic optimization and control	Teemu Pennanen et.al.	2505.12787	null
2025-05-18	Resolving Latency and Inventory Risk in Market Making with Reinforcement Learning	Junzhe Jiang et.al.	2505.12465	null
2025-05-16	Co-Evolutionary Defence of Active Directory Attack Graphs via GNN-Approximated Dynamic Programming	Diksha Goel et.al.	2505.11710	null
2025-05-15	Multi-Objective Memory Bandwidth Regulation and Cache Partitioning for Multicore Real-Time Systems	Binqi Sun et.al.	2505.11554	null
2025-05-16	Sobolev Training of End-to-End Optimization Proxies	Andrew W. Rosemberg et.al.	2505.11342	null
2025-05-16	Beyond KL-divergence: Risk Aware Control Through Cross Entropy and Adversarial Entropy Regularization	Menno van Zutphen et.al.	2505.11068	null
2025-05-15	Scalable Approximate Biclique Counting over Large Bipartite Graphs	Jingbang Chen et.al.	2505.10471	null
2025-05-14	Reflected stochastic recursive control problems with jumps: dynamic programming and stochastic verification theorems	Lu Liu et.al.	2505.09070	null
2025-05-13	Optimal Trajectory Planning with Collision Avoidance for Autonomous Vehicle Maneuvering	Jason Zalev et.al.	2505.08724	null
2025-05-13	Distributionally Robust LQG with Kullback-Leibler Ambiguity Sets	Marta Fochesato et.al.	2505.08370	null
2025-05-11	Optimal control of convective Brinkman-Forchheimer equations: Dynamic programming equation and Viscosity solutions	Sagar Gautam et.al.	2505.07095	null
2025-05-10	Optimizing Railcar Movements to Create Outbound Trains in a Freight Railyard	Ruonan Zhao et.al.	2505.06510	null
2025-05-09	Scheduled Jacobian Chaining	Simon Märtens et.al.	2505.06056	link
2025-05-09	Universal Approximation Theorem for Deep Q-Learning via FBSDE System	Qian Qi et.al.	2505.06023	null
2025-05-09	Data-driven pressure field prediction for ships in regular sea states	Malte Loft et.al.	2505.06014	null
2025-05-09	Multi-armed Bandit for Stochastic Shortest Path in Mixed Autonomy	Yu Bai et.al.	2505.05878	null
2025-05-10	Driving with Context: Online Map Matching for Complex Roads Using Lane Markings and Scenario Recognition	Xin Bi et.al.	2505.05007	link
2025-05-08	Chain-of-Thought Tokens are Computer Program Variables	Fangwei Zhu et.al.	2505.04955	link
2025-05-08	Network Digital Twin for Route Optimization in 5G/B5G Transport Slicing with What-If Analysis	Rebecca Aben-Athar et.al.	2505.04879	null
2025-05-06	Stochastic scheduling with Bernoulli-type jobs through policy stratification	Antonios Antoniadis et.al.	2505.03349	null
2025-05-05	A Fully Data-Driven Value Iteration for Stochastic LQR: Convergence, Robustness and Stability	Leilei Cui et.al.	2505.02970	null
2025-05-03	Multistage stochastic optimization for drayage procurement in container logistics using stochastic dual dynamic programming	Georgios Vassos et.al.	2505.01813	null
2025-05-03	Integrated optimization of operations and capacity planning under uncertainty for drayage procurement in container logistics	Georgios Vassos et.al.	2505.01808	link
2025-05-03	Evaluating Input Modalities for Pilot-Centered Taxiway Navigation: Insights from a Wizard-of-Oz Simulation	Chan Chea Mean et.al.	2505.01679	null
2025-05-03	Morello: Compiling Fast Neural Networks with Dynamic Programming and Spatial Compression	Samuel J. Kaufman et.al.	2505.01637	link
2025-05-02	Global Collinearity-aware Polygonizer for Polygonal Building Mapping in Remote Sensing	Fahong Zhang et.al.	2505.01385	null
2025-05-02	Power System Transition Planning: An Industry-Aligned Framework for Long-Term Optimization	Ahmed Al-Shafei et.al.	2505.01331	null
2025-05-02	A stochastic Gordon-Loeb model for optimal cybersecurity investment under clustered attacks	Giorgia Callegaro et.al.	2505.01221	null
2025-05-02	Remote Estimation over Packet-Dropping Wireless Channels with Partial State Information	Ioannis Tzortzis et.al.	2505.01132	null
2025-05-01	Quantum Computing in Industrial Environments: Where Do We Stand and Where Are We Headed?	Eneko Osaba et.al.	2505.00891	null
2025-05-01	Platoon Coordination and Leader Selection in Mixed Transportation Systems via Dynamic Programming	Ying Wang et.al.	2505.00847	null
2025-04-24	Optimal Blackjack Betting Strategies Through Dynamic Programming and Expected Utility Theory	Lucas Bordeu et.al.	2505.00724	null
2025-04-30	Galvatron: An Automatic Distributed System for Efficient Foundation Model Training	Xinyi Liu et.al.	2504.21411	link
2025-04-29	DeeP-Mod: Deep Dynamic Programming based Environment Modelling using Feature Extraction	Chris Child et.al.	2504.20535	null
2025-04-28	Warm-Starting QAOA with XY Mixers: A Novel Approach for Quantum-Enhanced Vehicle Routing Optimization	Rafael S. do Carmo et.al.	2504.19934	null
2025-04-30	The frequency $K_i$ s for symmetrical traveling salesman problem	Yong Wang et.al.	2504.19608	null
2025-04-28	Symmetric Policy Design for Multi-Agent Dispatch Coordination in Supply Chains	Sagar Sudhakara et.al.	2504.19397	null
2025-04-24	Efficient Tree Generation for Globally Optimal Decisions under Probabilistic Outcomes	Berk Ozturk et.al.	2504.17983	null
2025-04-24	Ergodic control of McKean-Vlasov systems on the Wasserstein space	Marco Fuhrman et.al.	2504.17958	null
2025-04-24	Fréchet Distance in Unweighted Planar Graphs	Ivor van der Hoog et.al.	2504.17342	null
2025-04-24	Advancing Frontiers of Path Integral Theory for Stochastic Optimal Control	Apurva Patil et.al.	2504.17154	null
2025-04-22	Distributed model predictive control without terminal cost under inexact distributed optimization	Xiaoyu Liu et.al.	2504.15768	null
2025-04-22	Stochastic Programming for Dynamic Temperature Control of Refrigerated Road Transport	Francesco Giliberto et.al.	2504.15741	null
2025-04-22	Exploring Inevitable Waypoints for Unsolvability Explanation in Hybrid Planning Problems	Mir Md Sajid Sarwar et.al.	2504.15668	null
2025-04-24	A Quadratic Control Framework for Dynamic Systems	Igor Ladnik et.al.	2504.15396	null
2025-04-21	The Iterative Chainlet Partitioning Algorithm for the Traveling Salesman Problem with Drone and Neural Acceleration	Jae Hyeok Lee et.al.	2504.15147	null
2025-04-23	Feedback Stackelberg-Nash equilibria in difference games with quasi-hierarchical interactions and inequality constraints	Partha Sarathi Mohapatra et.al.	2504.15019	null
2025-04-19	Optimal Operation and Valuation of Electricity Storages	Jean-Philippe Chancelier et.al.	2504.14292	null
2025-04-18	Code generation for solving and differentiating through convex optimization problems	Maximilian Schaller et.al.	2504.14099	null
2025-04-16	Beyond ISAC: Toward Integrated Heterogeneous Service Provisioning via Elastic Multi-Dimensional Multiple Access	Jie Chen et.al.	2504.11692	null
2025-04-18	Traffic Adaptive Moving-window Service Patrolling for Real-time Incident Management during High-impact Events	Haozhe Lei et.al.	2504.11570	null
2025-04-15	TransitReID: Transit OD Data Collection with Occlusion-Resistant Dynamic Passenger Re-Identification	Kaicong Huang et.al.	2504.11500	null
2025-04-15	Integration of a high-fidelity model of quantum sensors with a map-matching filter for quantum-enhanced navigation	Samuel Lellouch et.al.	2504.11119	null
2025-04-22	Breaking the Dimensional Barrier: A Pontryagin-Guided Direct Policy Optimization for Continuous-Time Multi-Asset Portfolio	Jeonggyu Huh et.al.	2504.11116	null
2025-04-15	Hallucination-Aware Generative Pretrained Transformer for Cooperative Aerial Mobility Control	Hyojun Ahn et.al.	2504.10831	null
2025-04-11	A Nonlinear Hash-based Optimization Method for SpMV on GPUs	Chen Yan et.al.	2504.08860	null
2025-04-07	A Constraint Programming Model For Serial Batch Scheduling With Minimum Batch Size	Jorge A. Huertas et.al.	2504.08793	null
2025-04-05	SLOs-Serve: Optimized Serving of Multi-SLO LLMs	Siyuan Chen et.al.	2504.08784	null
2025-04-11	Interior Point Differential Dynamic Programming, Redux	Ming Xu et.al.	2504.08278	link
2025-04-10	Quantum-assured magnetic navigation achieves positioning accuracy better than a strategic-grade INS in airborne and ground-based field trials	Murat Muradoglu et.al.	2504.08167	null
2025-04-10	Low-Thrust Many-Revolution Transfer between Near Rectilinear Halo Orbit and Low Lunar Orbit Using Hybrid Differential Dynamic Programming	Kohei Oue et.al.	2504.07723	null
2025-04-10	Joint Travel Route Optimization Framework for Platooning	Akif Adas et.al.	2504.07623	null
2025-04-09	Rounding the Lovász Theta Function with a Value Function Approximation	Rui Gong et.al.	2504.07204	null
2025-04-09	Evaluating Retrieval Augmented Generative Models for Document Queries in Transportation Safety	Chad Melton et.al.	2504.07022	null
2025-04-17	Maximizing Battery Storage Profits via High-Frequency Intraday Trading	David Schaurecker et.al.	2504.06932	null
2025-04-08	Linear-space LCS enumeration with quadratic-time delay for two strings	Yoshifumi Sakai et.al.	2504.05742	null
2025-04-09	DDT: Decoupled Diffusion Transformer	Shuai Wang et.al.	2504.05741	null
2025-04-08	Hamilton-Jacobi-Bellman equation and Viscosity solutions for an optimal control problem for stochastic convective Brinkman-Forchheimer equations	Sagar Gautam et.al.	2504.05707	null
2025-04-06	Optimized Path Planning for Logistics Robots Using Ant Colony Algorithm under Multiple Constraints	Haopeng Zhao et.al.	2504.05339	null
2025-04-07	Maximum Shortest Path Interdiction Problem by Upgrading Nodes on Trees under Unit Cost	Qiao Zhang et.al.	2504.05190	null
2025-04-06	Memetic Search for Green Vehicle Routing Problem with Private Capacitated Refueling Stations	Rui Xu et.al.	2504.04527	null
2025-04-05	Improving Question Embeddings with Cognitiv Representation Optimization for Knowledge Tracing	Lixiang Xu et.al.	2504.04121	null
2025-04-04	NeRFlex: Resource-aware Real-time High-quality Rendering of Complex Scenes on Mobile Devices	Zhe Wang et.al.	2504.03415	null
2025-04-04	Block Toeplitz Sparse Precision Matrix Estimation for Large-Scale Interval-Valued Time Series Forecasting	Wan Tian et.al.	2504.03322	null
2025-04-04	Quantum Optimization-Based Route Compression for Efficient Navigation Systems	Shunsuke Sotobayashi et.al.	2504.03227	null
2025-04-11	Dynamic Treewidth in Logarithmic Time	Tuukka Korhonen et.al.	2504.02790	null
2025-04-04	Controlled Social Learning: Altruism vs. Bias	Raghu Arghal et.al.	2504.02648	null
2025-04-03	Reinforcement Learning for Solving the Pricing Problem in Column Generation: Applications to Vehicle Routing	Abdo Abouelrous et.al.	2504.02383	null
2025-04-03	AI-Driven Framework for Multi-Service Multi-Modal Devices in NextG ORAN Systems	Mrityunjoy Gain et.al.	2504.01730	null
2025-04-01	A Parametric Model for Near-Optimal Online Synthesis with Robust Reach-Avoid Guarantees	Mario Gleirscher et.al.	2504.01006	null
2025-04-01	Linear models of dynamic optimization with linear constraints	Somdeb Lahiri et.al.	2504.00630	null
2025-03-31	QUADRO: A Hybrid Quantum Optimization Framework for Drone Delivery	James B. Holliday et.al.	2503.24301	null
2025-04-02	Unraveling tensor structures in correct-by-design controller synthesis	Ruohan Wang et.al.	2503.24085	null
2025-03-31	Bi-Level Route Optimization and Path Planning with Hazard Exploration	Jimin Choi et.al.	2503.24044	null
2025-03-31	Tree-Guided $L_1$ -Convex Clustering	Bingyuan Zhang et.al.	2503.24012	link
2025-03-30	A Systematic Decade Review of Trip Route Planning with Travel Time Estimation based on User Preferences and Behavior	Nikil Jayasuriya et.al.	2503.23486	null
2025-03-29	A convergence technique for the game i-Mark	Gabriel Nivasch et.al.	2503.23196	null
2025-03-29	PartialLoading: User Scheduling and Bandwidth Allocation for Parameter-sharing Edge Inference	Guanqiao Qu et.al.	2503.22982	null
2025-03-28	Policy Optimization and Multi-agent Reinforcement Learning for Mean-variance Team Stochastic Games	Junkai Hu et.al.	2503.22779	null
2025-04-04	The Price of Simplicity: Analyzing Decoupled Policies for Multi-Location Inventory Control	Yohan John et.al.	2503.22639	null
2025-03-28	Scheduling problem of aircrafts on a same runway and dual runways	Peng Lin et.al.	2503.22124	null
2025-03-27	Optimal Stepsize for Diffusion Sampling	Jianning Pei et.al.	2503.21774	link
2025-03-26	A Hopf-Lax Type Formula for Multi-Agent Path Planning with Pattern Coordination	Christian Parkinson et.al.	2503.20974	link
2025-03-26	Infinite Time Horizon Optimal Control of McKean-Vlasov SDEs	Silvia Rudà et.al.	2503.20572	null
2025-03-26	Optimal reinsurance in a competitive market	Lea Enzi et.al.	2503.20555	null
2025-03-26	Beyond Worst-Case Subset Sum: An Adaptive, Structure-Aware Solver with Sub- $2^{n/2}$ Enumeration	Jesus Salas et.al.	2503.20162	null
2025-03-31	Graph neural networks extrapolate out-of-distribution for shortest paths	Robert R. Nerem et.al.	2503.19173	null
2025-03-29	An Efficient Frequency-Based Approach for Maximal Square Detection in Binary Matrices	Swastik Bhandari et.al.	2503.18974	null
2025-03-23	Agent-Based Models for Two Stocks with Superhedging	Dario Crisci et.al.	2503.18165	null
2025-03-21	A New Segment Routing method with Swap Node Selection Strategy Based on Deep Reinforcement Learning for Software Defined Network	Miao Ye et.al.	2503.16914	null
2025-03-20	Reinforcement Learning-based Heuristics to Guide Domain-Independent Dynamic Programming	Minori Narita et.al.	2503.16371	link
2025-03-19	On the Functoriality of Belief Propagation Algorithms on finite Partially Ordered Sets	Grégoire Sergeant-Perthuis et.al.	2503.15705	null
2025-03-24	Distribution and Purification of Entanglement States in Quantum Networks	Xiaojie Fan et.al.	2503.14712	null
2025-03-18	Designing and Deploying AI Models for Sustainable Logistics Optimization: A Case Study on Eco-Efficient Supply Chains in the USA	Reza E Rabbi Shawon et.al.	2503.14556	null
2025-03-17	Local-Global Learning of Interpretable Control Policies: The Interface between MPC and Reinforcement Learning	Thomas Banker et.al.	2503.13289	null
2025-03-17	Exploring 3D Activity Reasoning and Planning: From Implicit Human Intentions to Route-Aware Planning	Xueying Jiang et.al.	2503.12974	null
2025-03-17	Navigating Heat Exposure: Simulation of Route Planning Based on Visual Language Model Agents	Haoran Ma et.al.	2503.12731	null
2025-03-16	Routing Guidance for Emerging Transportation Systems with Improved Dynamic Trip Equity	Ting Bai et.al.	2503.12601	null
2025-03-14	Discrete Effort Distribution via Regrettable Greedy Algorithm	Song Cao et.al.	2503.11107	null
2025-03-13	Dynamic Programming Algorithms for Finding Cost-Optimal Trajectory on the Terrain	Majid E. Abbasov et.al.	2503.10922	null
2025-03-13	Enhanced Route Planning with Calibrated Uncertainty Set	Lingxuan Tang et.al.	2503.10088	null
2025-03-12	PairVDN - Pair-wise Decomposed Value Functions	Zak Buzzard et.al.	2503.09521	link
2025-03-11	Large Neighborhood Search and Bitmask Dynamic Programming for Wireless Mobile Charging Electric Vehicle Routing Problems in Medical Transportation	Jingyi Zhao et.al.	2503.08752	null
2025-03-11	DISTINGUISH Workflow: A New Paradigm of Dynamic Well Placement Using Generative Machine Learning	Sergey Alyaev et.al.	2503.08509	link
2025-03-10	Multi-Objective Routing Optimization Using Coherent Ising Machine in Wireless Multihop Networks	Yu-Xuan Lin et.al.	2503.07924	null
2025-03-10	Co-Optimizing Distributed Energy Resources under Demand Charges and Bi-Directional Power Flow	Ruixiao Yang et.al.	2503.07907	null
2025-03-10	Operational route planning under uncertainty for Demand Adaptive Systems	Benedikt Lienkamp et.al.	2503.07812	link
2025-03-09	Pull-Based Query Scheduling for Goal-Oriented Semantic Communication	Pouya Agheli et.al.	2503.06725	null
2025-03-08	A Neural Score Follower for Computer Accompaniment of Polyphonic Musical Instruments	Ashwin Pillay et.al.	2503.06348	null
2025-03-11	Optimal Output Feedback Learning Control for Discrete-Time Linear Quadratic Regulation	Kedi Xie et.al.	2503.06226	null
2025-03-08	Dynamic Programming in Ordered Vector Space	Nisha Peng et.al.	2503.06055	null
2025-03-04	Establishment and Solution of a Multi-Stage Decision Model Based on Hypothesis Testing and Dynamic Programming Algorithm	Ziyang Liu et.al.	2503.05807	null
2025-03-07	On Almost Fair and Equitable Allocations of Indivisible Items for Non-monotone Valuations	Vittorio Bilò et.al.	2503.05695	null
2025-03-06	Efficient Algorithms for Verifying Kruskal Rank in Sparse Linear Regression and Related Applications	Fengqin Zhou et.al.	2503.04986	null
2025-03-06	Mean field optimal stopping with uncontrolled state	Andrea Cosso et.al.	2503.04269	null
2025-03-05	Endpoint-Explicit Differential Dynamic Programming via Exact Resolution	Maria Parilli et.al.	2503.03897	null
2025-03-05	Composite Nonlinear Trajectory Tracking Control of Co-Driving Vehicles Using Self-Triggered Adaptive Dynamic Programming	Chuan Hu et.al.	2503.03348	null
2025-03-04	Optimal power procurement for green cellular wireless networks under uncertainty and chance constraints	Nadhir Ben Rached et.al.	2503.03051	null
2025-03-04	On the optimal stopping problem for diffusions and an approximation result for stopping times	Andrea Cosso et.al.	2503.02514	null
2025-03-04	JPDS-NN: Reinforcement Learning-Based Dynamic Task Allocation for Agricultural Vehicle Routing Optimization	Yixuan Fan et.al.	2503.02369	null
2025-03-04	Optimal Control for Remote Patient Monitoring with Multidimensional Health States	Siddharth Chandak et.al.	2503.02292	null
2025-03-03	CorrA: Leveraging Large Language Models for Dynamic Obstacle Avoidance of Autonomous Vehicles	Shanting Wang et.al.	2503.02076	null
2025-03-03	Mapping Spiking Neural Networks to Heterogeneous Crossbar Architectures using Integer Linear Programming	Devin Pohl et.al.	2503.02033	null
2025-02-25	Tracking Control of Euler-Lagrangian Systems with Prescribed State, Input, and Temporal Constraints	Chidre Shravista Kashyap et.al.	2503.01866	null
2025-03-03	CacheQuant: Comprehensively Accelerated Diffusion Models	Xuewen Liu et.al.	2503.01323	null
2025-03-03	Parameter-free Video Segmentation for Vision and Language Understanding	Louis Mahon et.al.	2503.01201	null
2025-03-02	Efficient End-to-end Visual Localization for Autonomous Driving with Decoupled BEV Neural Matching	Jinyu Miao et.al.	2503.00862	null
2025-03-07	Llamarine: Open-source Maritime Industry-specific Large Language Model	William Nguyen et.al.	2503.00203	null
2025-02-28	Time-optimal problem in the space of probabilities measures	Yurii Averboukh et.al.	2502.20871	null
2025-02-27	Dynamic Program Slices Change How Developers Diagnose Gradual Run-Time Type Errors	Felipe Bañados Schwerter et.al.	2502.20533	null
2025-02-27	Efficient Risk-sensitive Planning via Entropic Risk Measures	Alexandre Marthe et.al.	2502.20423	null
2025-02-27	Pontryagin-Bellman Differential Dynamic Programming for Low-Thrust Trajectory Optimization with Path Constraints	Yanis Sidhoum et.al.	2502.20291	null
2025-02-27	SSD: A State-based Stealthy Backdoor Attack For Navigation System in UAV Route Planning	Zhaoxuan Wang et.al.	2502.20178	null
2025-02-27	GraphSparseNet: a Novel Method for Large Scale Trafffic Flow Prediction	Weiyang Kong et.al.	2502.19823	null
2025-03-04	Off-Policy Temporal Difference Learning for Perturbed Markov Decision Processes: Theoretical Insights and Extensive Simulations	Ali Forootani et.al.	2502.18415	null
2025-02-25	Dynamic Factor Model-Based Multiperiod Mean-Variance Portfolio Selection with Portfolio Constraints	Jianjun Gao et.al.	2502.17915	link
2025-02-24	A Deterministic and Linear Model of Dynamic Optimization	Somdeb Lahiri et.al.	2502.17012	null
2025-02-24	Be CIM or Be Memory: A Dual-mode-aware DNN Compiler for CIM Accelerators	Shixin Zhao et.al.	2502.17006	null
2025-02-23	Volume Optimality in Conformal Prediction with Structured Prediction Sets	Chao Gao et.al.	2502.16658	null
2025-02-21	Near Optimal Decision Trees in a SPLIT Second	Varun Babbar et.al.	2502.15988	null
2025-02-21	Zweistein: A Dynamic Programming Evaluation Function for Einstein Würfelt Nicht!	Wei Lin. Hsueh et.al.	2502.15547	null
2025-02-21	Learning Maritime Inventory Routing Optimization	Rui Chen et.al.	2502.15244	null
2025-02-19	Optimistically Optimistic Exploration for Provably Efficient Infinite-Horizon Reinforcement and Imitation Learning	Antoine Moulin et.al.	2502.13900	null
2025-02-19	FPT algorithms over linear delta-matroids with applications	Eduard Eiben et.al.	2502.13654	null
2025-03-01	Value Gradient Sampler: Sampling as Sequential Decision Making	Sangwoong Yoon et.al.	2502.13280	link
2025-02-18	Autonomous Vehicles Using Multi-Agent Reinforcement Learning for Routing Decisions Can Harm Urban Traffic	Anastasia Psarou et.al.	2502.13188	null
2025-02-18	GPU Memory Usage Optimization for Backward Propagation in Deep Network Training	Ding-Yong Hong et.al.	2502.12499	null
2025-02-17	Logarithmic Approximation for Road Pricing on Grids	Andrei Constantinescu et.al.	2502.11979	null
2025-02-17	Proactive Depot Discovery: A Generative Framework for Flexible Location-Routing	Site Qu et.al.	2502.11715	null
2025-02-16	The Q-Spellbook: Crafting Surface Code Layouts and Magic State Protocols for Large-Scale Quantum Computing	Avimita Chatterjee et.al.	2502.11253	null
2025-02-14	Customizable Contraction Hierarchies -- A Survey	Thomas Bläsius et.al.	2502.10519	null
2025-02-14	Scheduling Strategies for Partially-Replicable Task Chains on Two Types of Resources	Diane Orhan et.al.	2502.10000	null
2025-02-14	Thompson Sampling for Repeated Newsvendor	Weizhou Zhang et.al.	2502.09900	null
2025-02-26	A quantum speedup algorithm for TSP based on quantum dynamic programming with very few qubits	Bai Xujun et.al.	2502.08853	null
2025-02-12	Self-Evaluation for Job-Shop Scheduling	Imanol Echeverria et.al.	2502.08684	null
2025-02-11	TRAVEL: Training-Free Retrieval and Alignment for Vision-and-Language Navigation	Navid Rajabi et.al.	2502.07306	null
2025-02-05	RLOMM: An Efficient and Robust Online Map Matching Framework with Reinforcement Learning	Minxiao Chen et.al.	2502.06825	null
2025-02-08	Counting Tree-Like Multigraphs with a Given Number of Vertices and Multiple Edges	Muhammad Ilyas et.al.	2502.05529	null
2025-02-06	Aligner-Encoders: Self-Attention Transformers Can Be Self-Transducers	Adam Stooke et.al.	2502.05232	null
2025-02-07	Stochastic internal habit formation and optimality	Michele Aleandri et.al.	2502.05081	null
2025-02-07	Preference-aware compensation policies for crowdsourced on-demand services	Georgina Nouli et.al.	2502.05060	null
2025-02-07	A non-zero-sum game with reinforcement learning under mean-variance framework	Junyi Guo et.al.	2502.04788	null
2025-02-06	Unifying and Optimizing Data Values for Selection via Sequential-Decision-Making	Hongliang Chi et.al.	2502.04554	null
2025-02-06	Solvability of Approximate Reach-Avoid Games	Mario Gleirscher et.al.	2502.04544	null
2025-02-06	On the Number of Control Nodes in Boolean Networks with Degree Constraints	Liangjie Sun et.al.	2502.03839	null
2025-02-06	Iterate to Accelerate: A Unified Framework for Iterative Reasoning and Feedback Convergence	Jacob Fein-Ashley et.al.	2502.03787	null
2025-02-06	Cascaded Learned Bloom Filter for Optimal Model-Filter Size Balance and Fast Rejection	Atsuki Sato et.al.	2502.03696	null
2025-02-06	Improving polynomial bounds for the Graphical Traveling Salesman Problem with release dates on paths	Thailsson Clementino et.al.	2502.02680	null
2025-02-04	Optimal Routing in the Presence of Hooks: Three Case Studies	Tarun Chitra et.al.	2502.02059	link
2025-02-03	Trajectory Map-Matching in Urban Road Networks Based on RSS Measurements	Zheng Xing et.al.	2502.01280	null
2025-02-08	Minimum Riesz s-Energy Subset Selection in Ordered Point Sets via Dynamic Programming	Michael Emmerich et.al.	2502.01163	null
2025-02-01	Model-Free Predictive Control: Introductory Algebraic Calculations, and a Comparison with HEOL and ANNs	Cédric Join et.al.	2502.00443	null
2025-02-01	A polynomial-based constrained solver for fuel-optimal low-thrust trajectory optimization	Thomas Caleb et.al.	2502.00398	null
2025-02-01	Left-Deep Join Order Selection with Higher-Order Unconstrained Binary Optimization on Quantum Computers	Valter Uotila et.al.	2502.00362	null
2025-01-31	Epi-Consistent Approximation of Stochastic Dynamic Programs	Dominic S. T. Keehan et.al.	2501.19028	null
2025-01-30	Model-Adaptive Approach to Dynamic Discrete Choice Models with Large State Spaces	Ertian Chen et.al.	2501.18746	null
2025-02-05	Solving Drone Routing Problems with Quantum Computing: A Hybrid Approach Combining Quantum Annealing and Gate-Based Paradigms	Eneko Osaba et.al.	2501.18432	null
2025-01-29	Stochastic scattering control of spider diffusion governed by an optimal diffraction probability measure selected from its own local-time	Isaac Ohavi et.al.	2501.18057	null
2025-01-15	Low-Thrust Many-Revolution Trajectory Design Under Operational Uncertainties for DESTINY+ Mission	Naoya Ozaki et.al.	2501.17867	null
2025-02-06	On characterizing optimal learning trajectories in a class of learning problems	Getachew K Befekadu et.al.	2501.16521	null
2025-01-22	Modified Patankar Semi-Lagrangian Scheme for the Optimal Control of Production-Destruction systems	Simone Cacace et.al.	2501.13085	null
2025-01-22	Optimizing Return Distributions with Distributional Dynamic Programming	Bernardo Ávila Pires et.al.	2501.13028	null
2025-01-30	Pontryagin-Guided Deep Learning for Large-Scale Constrained Dynamic Portfolio Choice	Jeonggyu Huh et.al.	2501.12600	null
2025-01-23	Treefix: Enabling Execution with a Tree of Prefixes	Beatriz Souza et.al.	2501.12339	null
2025-01-21	A Dynamic Programming Framework for Generating Approximately Diverse and Optimal Solutions	Waldo Gálvez et.al.	2501.12261	null
2025-01-21	Dissecting the NVIDIA Hopper Architecture through Microbenchmarking and Multiple Level Analysis	Weile Luo et.al.	2501.12084	null
2025-01-20	Routing Optimization Based on Distributed Intelligent Network Softwarization for the Internet of Things	Mohamed Ali Zormati et.al.	2501.11484	null
2025-02-01	OpenLiDARMap: Zero-Drift Point Cloud Mapping using Map Priors	Dominik Kulmer et.al.	2501.11111	link
2025-01-25	BOOST: Microgrid Sizing using Ordinal Optimization	Mohamad Fares El Hajj Chehade et.al.	2501.10842	null
2025-01-17	Multiclass Queue Scheduling Under Slowdown: An Approximate Dynamic Programming Approach	Jing Dong et.al.	2501.10523	null
2025-01-17	Complexity of the Virtual Network Embedding with uniform demands	Amal Benhamiche et.al.	2501.10154	null
2025-01-16	A Dynamic Unmanned Aerial Vehicle Routing Framework for Urban Traffic Monitoring	Yumeng Bai et.al.	2501.09249	null
2025-01-15	Stochastic Optimal Control of Prosumers in a District Heating System	Maalvladédon Ganet Somé et.al.	2501.09088	null
2025-01-15	Family-wise Error Rate Control with E-values	Will Hartog et.al.	2501.09015	null
2025-01-31	Monte Carlo Tree Search for Comprehensive Exploration in LLM-Based Automatic Heuristic Design	Zhi Zheng et.al.	2501.08603	link
2025-01-14	Cooperative Patrol Routing: Optimizing Urban Crime Surveillance through Multi-Agent Reinforcement Learning	Juan Palma-Borda et.al.	2501.08020	link
2025-01-14	Optimal Classification Trees for Continuous Feature Data Using Dynamic Programming with Branch-and-Bound	Catalin E. Brita et.al.	2501.07903	link
2025-01-09	A Multi-Layer CNN-GRUSKIP model based on transformer for spatial TEMPORAL traffic flow prediction	Karimeh Ibrahim Mohammad Ata et.al.	2501.07593	null
2025-01-13	An Alternating Approach to Approximate Dynamic Programming	Di Zhang et.al.	2501.06983	null
2025-01-11	A Linear Complexity Algorithm for Optimal Transport Problem with Log-type Cost	Ziyuan Lyu et.al.	2501.06578	null
2025-01-10	Exploratory Randomization for Discrete-Time Linear Exponential Quadratic Gaussian (LEQG) Problem	Sebastien Lleo et.al.	2501.06275	null
2025-01-09	Linear Algebraic Truncation Algorithm with A Posteriori Error Bounds for Computing Markov Chain Equilibrium Gradients	Saied Mahdian et.al.	2501.06266	null
2025-01-09	ThriftLLM: On Cost-Effective Selection of Large Language Models for Classification Queries	Keke Huang et.al.	2501.04901	null
2025-01-08	Semilinear Dynamic Programming: Analysis, Algorithms, and Certainty Equivalence Properties	Yuchao Li et.al.	2501.04668	null
2025-01-08	HypeRL: Parameter-Informed Reinforcement Learning for Parametric PDEs	Nicolò Botteghi et.al.	2501.04538	null
2025-01-08	Probabilistic Greedy Algorithm Solver Using Magnetic Tunneling Junctions for Traveling Salesman Problem	Ran Zhang et.al.	2501.04447	null
2025-01-07	Exploring the Potential of Large Language Models in Public Transportation: San Antonio Case Study	Ramya Jonnala et.al.	2501.03904	null
2025-01-07	Young domination on Hamming rectangles	Janko Gravner et.al.	2501.03788	null
2025-01-06	Distributionally Robust Control Synthesis for Stochastic Systems with Safety and Reach-Avoid Specifications	Yu Chen et.al.	2501.03137	null
2025-01-06	MDP3: A Training-free Approach for List-wise Frame Selection in Video-LLMs	Hui Sun et.al.	2501.02885	null
2025-01-06	Local Reactive Control for Mobile Manipulators with Whole-Body Safety in Complex Environments	Chunxin Zheng et.al.	2501.02815	null
2025-01-06	Enhancing Robot Route Optimization in Smart Logistics with Transformer and GNN Integration	Hao Luo et.al.	2501.02749	null
2025-01-05	Approximate Dynamic Programming for a Remanufacture-to-Order System	Amirreza Pashapour et.al.	2501.02656	null
2025-01-05	Neural Error Covariance Estimation for Precise LiDAR Localization	Minoo Dolatabadi et.al.	2501.02558	null
2025-01-01	Large Language Models Are Read/Write Policy-Makers for Simultaneous Generation	Shoutao Guo et.al.	2501.00868	link
2024-12-30	A randomisation method for mean-field control problems with common noise	Robert Denkert et.al.	2412.20782	null
2024-12-28	RFPPO: Motion Dynamic RRT based Fluid Field - PPO for Dynamic TF/TA Routing Planning	Rongkun Xue et.al.	2412.20098	null
2024-12-27	Game theoretical asymptotic mean value properties for non-homogeneous $p$ -Laplace problems	Félix del Teso et.al.	2412.19410	null
2024-12-24	Hybrid Many-Objective Optimization in Probabilistic Mission Design for Compliant and Effective UAV Routing	Simon Kohaut et.al.	2412.18514	null
2024-12-23	AI-Driven Control of Chaos: A Transformer-Based Approach for Dynamical Systems	David Valle et.al.	2412.17357	link
2024-12-21	A Bayesian Composite Risk Approach for Stochastic Optimal Control and Markov Decision Processes	Wentao Ma et.al.	2412.16488	null
2024-12-20	Battery valuation on electricity intraday markets with liquidity costs	Enzo Cognéville et.al.	2412.15959	null
2024-12-19	Robustness Evaluation of a Physical Internet-based Intermodal Logistic Network	Federico Gallo et.al.	2412.14658	null
2024-12-17	A Scalable Method for Optimal Path Planning on Manifolds via a Hopf-Lax Type Formula	Edward Huynh et.al.	2412.13346	link
2024-12-16	Using machine learning to inform harvest control rule design in complex fishery settings	Felipe Montealegre-Mora et.al.	2412.12400	link
2024-12-12	SprayCraft: Graph-Based Route Optimization for Variable Rate Precision Spraying	Kiran K. Kethineni et.al.	2412.12176	null
2024-12-16	Witty: An Efficient Solver for Computing Minimum-Size Decision Trees	Luca Pascal Staus et.al.	2412.11954	null
2024-12-16	LLM-DaaS: LLM-driven Drone-as-a-Service Operations from Text User Requests	Lillian Wassim et.al.	2412.11672	null
2024-12-14	An Active Parameter Learning Approach to The Identification of Safe Regions	Aneesh Raghavan et.al.	2412.10627	null
2024-12-12	On Round-Off Errors and Gaussian Blur in Superresolution and in Image Registration	Serap A. Savari et.al.	2412.09741	null
2024-12-20	MAPLE: A Framework for Active Preference Learning Guided by Large Language Models	Saaduddin Mahmud et.al.	2412.07207	null
2024-12-09	Phaedrus: Exploring Dynamic Application Behavior with Lightweight Generative Models and Large-Language Models	Bodhisatwa Chatterjee et.al.	2412.06994	null
2024-12-07	Timely reliable Bayesian decision-making enabled using memristors	Lekai Song et.al.	2412.06838	null
2024-12-08	DiTer++: Diverse Terrain and Multi-modal Dataset for Multi-Robot SLAM in Multi-session Environments	Juwon Kim et.al.	2412.05839	null
2024-12-08	SizeGS: Size-aware Compression of 3D Gaussians with Hierarchical Mixed Precision Quantization	Shuzhao Xie et.al.	2412.05808	null
2024-12-07	Controlled rough SDEs, pathwise stochastic control and dynamic programming principles	Peter K. Friz et.al.	2412.05698	null
2024-12-07	Quantum Annealing and Tensor Networks: a Powerful Combination to Solve Optimization Problems	Miquel Albertí Binimelis et.al.	2412.05595	link
2024-12-07	Optimizing Returns from Experimentation Programs	Timothy Sudijono et.al.	2412.05508	null
2024-12-06	Nonmyopic Global Optimisation via Approximate Dynamic Programming	Filippo Airaldi et.al.	2412.04882	link
2024-12-05	Generating graph states with a single quantum emitter and the minimum number of fusions	Matthias C. Löbl et.al.	2412.04587	null
2024-12-04	Summa Summarum: Moessner's Theorem without Dynamic Programming	Olivier Danvy et.al.	2412.03127	null
2024-11-21	Quantum Annealing based Hybrid Strategies for Real Time Route Optimization	Sushil Mario et.al.	2412.02720	null
2024-11-30	A Second Soul: Celebrating the Many Languages of Programming -- Festschrift in Honor of Peter Thiemann's Sixtieth Birthday	Annette Bieniusa et.al.	2412.01856	null
2024-12-01	Optimization of Delivery Routes for Fresh E-commerce in Pre-warehouse Mode	Alice Harward et.al.	2412.00634	null
2024-11-29	An Optimal Switching Approach for Bird Migration	Jiawei Chu et.al.	2411.19467	null
2024-11-28	SPAgent: Adaptive Task Decomposition and Model Selection for General Video Generation and Editing	Rong-Cheng Tu et.al.	2411.18983	null
2024-11-27	SCoTT: Wireless-Aware Path Planning with Vision Language Models and Strategic Chains-of-Thought	Aladin Djuhera et.al.	2411.18212	null
2024-11-26	Structural Parameterization of Locating-Dominating Set and Test Cover	Dipayan Chakraborty et.al.	2411.17948	null
2024-11-26	Pushing the Limits of Large Language Model Quantization via the Linearity Theorem	Vladimir Malinovskii et.al.	2411.17525	null
2024-11-26	Weakly acyclic diagrams: A data structure for infinite-state symbolic verification	Michael Blondin et.al.	2411.17250	null
2024-11-26	Dynamic Programming-Based Offline Redundancy Resolution of Redundant Manipulators Along Prescribed Paths with Real-Time Adjustment	Zhihang Yin et.al.	2411.17052	null
2024-11-26	Dynamic Programming-Based Redundancy Resolution for Path Planning of Redundant Manipulators Considering Breakpoints	Zhihang Yin et.al.	2411.17034	null
2024-11-26	Entropy-Based Dynamic Programming for Efficient Vehicle Parking	Jean-Luc Lupien et.al.	2411.17014	null
2024-11-25	Any3DIS: Class-Agnostic 3D Instance Segmentation by 2D Mask Tracking	Phuc Nguyen et.al.	2411.16183	null
2024-11-25	Using Drone Swarm to Stop Wildfire: A Predict-then-optimize Approach	Shijie Pan et.al.	2411.16144	null
2024-11-24	Hiding Communication Cost in Distributed LLM Training via Micro-batch Co-execution	Haiquan Wang et.al.	2411.15871	null
2024-11-24	Revenue Maximization in Choice-Based Matching Markets	Dan Nissim et.al.	2411.15727	null
2024-11-22	Jovis: A Visualization Tool for PostgreSQL Query Optimizer	Yoojin Choi et.al.	2411.14788	null
2024-11-22	Construction and Preliminary Validation of a Dynamic Programming Concept Inventory	Matthew Ferland et.al.	2411.14655	null
2024-11-18	Controlled Occupied Processes and Viscosity Solutions	H. Mete Soner et.al.	2411.12080	null
2024-11-18	A New Finite-Horizon Dynamic Programming Analysis of Nonanticipative Rate-Distortion Function for Markov Sources	Zixuan He et.al.	2411.11698	null
2024-11-18	gpuPairHMM: High-speed Pair-HMM Forward Algorithm for DNA Variant Calling on GPUs	Bertil Schmidt et.al.	2411.11547	link
2024-11-17	Dynamic Programming: Optimality at a Point Implies Optimality Everywhere	John Stachurski et.al.	2411.11062	null
2024-11-15	AmoebaLLM: Constructing Any-Shape Large Language Models for Efficient and Instant Deployment	Yonggan Fu et.al.	2411.10606	link
2024-11-14	Latency Optimization in LEO Satellite Communications with Hybrid Beam Pattern and Interference Control	Qianqian Zhang et.al.	2411.09600	null
2024-11-13	On the numerical integration of the Fokker-Planck equation driven by a mechanical force and the Bismut-Elworthy-Li formula	Julia Sanders et.al.	2411.08518	link
2024-11-13	Tractable Robust Markov Decision Processes	Julien Grand-Clément et.al.	2411.08435	null
2024-11-12	dpvis: A Visual and Interactive Learning Tool for Dynamic Programming	David H. Lee et.al.	2411.07705	link
2024-11-11	DP and QP Based Decision-making and Planning for Autonomous Vehicle	Zhicheng Zhang et.al.	2411.06751	null
2024-11-11	Resilient control under denial-of-service and uncertainty: An adaptive dynamic programming approach	Weinan Gao et.al.	2411.06689	null
2024-11-11	Two Kinds of Learning Algorithms for Continuous-Time VWAP Targeting Execution	Xingyu Zhou et.al.	2411.06645	null
2024-11-10	Robust optimal stopping with regime switching	Siyu Lv et.al.	2411.06522	null
2024-11-07	Optimal control under unknown intensity with Bayesian learning	Nicolas Baradel et.al.	2411.04917	null
2024-11-07	Structure Matters: Dynamic Policy Gradient	Sara Klein et.al.	2411.04913	null
2024-11-07	Minimax Linear Regulator Problems for Positive Systems	Alba Gurpegui et.al.	2411.04809	null
2024-11-07	Optimal Execution under Incomplete Information	Etienne Chevalier et.al.	2411.04616	null
2024-11-07	Convergence and Robustness of Value and Policy Iteration for the Linear Quadratic Regulator	Bowen Song et.al.	2411.04548	link
2024-11-05	DP-HLS: A High-Level Synthesis Framework for Accelerating Dynamic Programming Algorithms in Bioinformatics	Yingqi Cao et.al.	2411.03398	link
2024-11-04	Stochastic Optimal Control of an Industrial Power-to-Heat System with High-Temperature Heat Pump and Thermal Energy Storage	Eric Pilling et.al.	2411.02211	null
2024-11-03	ParseCaps: An Interpretable Parsing Capsule Network for Medical Image Diagnosis	Xinyu Geng et.al.	2411.01564	null
2024-10-31	EARL-BO: Reinforcement Learning for Multi-Step Lookahead, High-Dimensional Bayesian Optimization	Mujin Cheon et.al.	2411.00171	null
2024-10-31	Q-learning for Quantile MDPs: A Decomposition, Performance, and Convergence Analysis	Jia Lin Hau et.al.	2410.24128	link
2024-10-31	A dynamic programming principle for multiperiod control problems with bicausal constraints	Ruslan Mirmominov et.al.	2410.23927	null
2024-10-30	Return Augmented Decision Transformer for Off-Dynamics Reinforcement Learning	Ruhan Wang et.al.	2410.23450	null
2024-10-29	Approximately Counting Knapsack Solutions in Subquadratic Time	Weiming Feng et.al.	2410.22267	null
2024-10-29	Beating Bellman's Algorithm for Subset Sum	Karl Bringmann et.al.	2410.21942	null
2024-10-28	Analysis of Different Algorithmic Design Techniques for Seam Carving	Owais Aijaz et.al.	2410.21207	null
2024-10-27	A New Method for Inserting Train Paths into a Timetable	David Dekker et.al.	2410.20561	link
2024-10-27	On the I/O Complexity of the CYK Algorithm and of a Family of Related DP Algorithms	Lorenzo De Stefani et.al.	2410.20337	null
2024-10-25	An Enhanced Hierarchical Planning Framework for Multi-Robot Autonomous Exploration	Gengyuan Cai et.al.	2410.19373	null
2024-10-24	Stochastic dynamic programming under recursive Epstein-Zin preferences	Anna Jaśkiewicz et.al.	2410.19181	null
2024-10-24	A Counterexample in Cross-Correlation Template Matching	Serap A. Savari et.al.	2410.19085	null
2024-10-23	Trajectory Optimization for Spatial Microstructure Control in Electron Beam Metal Additive Manufacturing	Mikhail Khrenov et.al.	2410.18207	null
2024-10-24	Estimating the Spectral Moments of the Kernel Integral Operator from Finite Sample Matrices	Chanwoo Chun et.al.	2410.17998	null
2024-10-21	Policies with Sparse Inter-Agent Dependencies in Dynamic Games: A Dynamic Programming Approach	Xinjie Liu et.al.	2410.16441	null
2024-10-21	All You Need is an Improving Column: Enhancing Column Generation for Parallel Machine Scheduling via Transformers	Amira Hijazi et.al.	2410.15601	null
2024-10-21	How to Find the Exact Pareto Front for Multi-Objective MDPs?	Yining Li et.al.	2410.15557	null
2024-10-20	CASET: Complexity Analysis using Simple Execution Traces for CS submissions*	Aaryen Mehta et.al.	2410.15419	null
2024-10-19	The Constrained Layer Tree Problem and Applications to Solar Farm Cabling	Thomas Bläsius et.al.	2410.15031	null
2024-10-18	On picking operations in e-commerce warehouses: Insights from the complete-information counterpart	Catherine Lorenz et.al.	2410.14316	null
2024-10-17	Quasi-quantum states and the quasi-quantum PCP theorem	Itai Arad et.al.	2410.13549	null
2024-10-17	Joint Antenna Selection and Covariance Matrix Optimization for ISAC Systems	Michail Palaiologos et.al.	2410.13446	null
2024-10-17	Membership Testing for Semantic Regular Expressions	Yifei Huang et.al.	2410.13262	null
2024-10-22	Research on Travel Route Planing Problems Based on Greedy Algorithm	Yiquan Wang et.al.	2410.13226	link
2024-10-17	Algorithmic Content Selection and the Impact of User Disengagement	Emilio Calvano et.al.	2410.13108	null
2024-10-16	Learning Representations for Reasoning: Generalizing Across Diverse Structures	Zhaocheng Zhu et.al.	2410.13018	null
2024-10-16	Vehicle Localization in GPS-Denied Scenarios Using Arc-Length-Based Map Matching	Nur Uddin Javed et.al.	2410.12208	null
2024-10-15	Incremental computation of the set of period sets	Eric Rivals et.al.	2410.12077	null
2024-10-15	Routing and Scheduling Optimization for Urban Air Mobility Fleet Management using Quantum Annealing	Renichiro Haba et.al.	2410.11231	null
2024-10-16	SplitLLM: Collaborative Inference of LLMs for Model Placement and Throughput Optimization	Akrit Mudvari et.al.	2410.10759	null
2024-10-14	Learning Sub-Second Routing Optimization in Computer Networks requires Packet-Level Dynamics	Andreas Boltres et.al.	2410.10377	null
2024-10-09	Rapid Computation of the Assembly Index of Molecular Graphs	Ian Seet et.al.	2410.09100	null
2024-10-11	Deep Learning Algorithms for Mean Field Optimal Stopping in Finite Space and Discrete Time	Lorenzo Magnino et.al.	2410.08850	null
2024-10-11	Hybrid Filtering Heuristic for the Sensor-Placement Problem to Discretize 2D Continuous Environments	Jan Mikula et.al.	2410.08784	link
2024-10-10	Dynamic Programming based Local Search approaches for Multi-Agent Path Finding problems on Directed Graphs	Irene Saccani et.al.	2410.07954	null
2024-10-10	Partitioning Trillion Edge Graphs on Edge Devices	Adil Chhabra et.al.	2410.07732	null
2024-10-11	Q-WSL:Leveraging Dynamic Programming for Weighted Supervised Learning in Goal-conditioned RL	Xing Lei et.al.	2410.06648	null
2024-10-08	Solvability of Equilibrium Riccati Equations: A Direct Approach	Bowen Ma et.al.	2410.06090	null
2024-10-07	Dynamic HumTrans: Humming Transcription Using CNNs and Dynamic Programming	Shubham Gupta et.al.	2410.05455	link
2024-10-07	A Predictive and Optimization Approach for Enhanced Urban Mobility Using Spatiotemporal Data	Shambhavi Mishra et.al.	2410.05358	null
2024-10-05	AI as Humanity's Salieri: Quantifying Linguistic Creativity of Language Models via Systematic Attribution of Machine Text against Web Text	Ximing Lu et.al.	2410.04265	link
2024-10-05	A branch-&-price approach to the unrooted maximum agreement forest problem	Martin Frohn et.al.	2410.04122	null
2024-10-02	Electrification of Transportation: A Hybrid Benders/SDDP Algorithm for Optimal Charging Station Trading	Farnaz Sohrabi et.al.	2410.03763	null
2024-10-02	Effects of eco-driving on energy consumption and battery degradation for electric vehicles at signalized intersections	Yongqiang Wang et.al.	2410.01685	null
2024-10-02	Krylov-Safonov theory for Pucci-type extremal inequalities on random data clouds	Ángel Arroyo et.al.	2410.01642	null
2024-10-02	Automated Curvy Waveguide Routing for Large-Scale Photonic Integrated Circuits	Hongjian Zhou et.al.	2410.01260	link
2024-09-30	Generalised mixed effects models for changepoint analysis of biomedical time series data	Mark B. Fiecas et.al.	2410.00183	null
2024-09-30	Opt2Skill: Imitating Dynamically-feasible Whole-Body Trajectories for Versatile Humanoid Loco-Manipulation	Fukang Liu et.al.	2409.20514	null
2024-09-28	On Computing Elastic Shape Distances between Curves in d-dimensional Space	Javier Bernal et.al.	2409.19380	null
2024-09-25	MaViLS, a Benchmark Dataset for Video-to-Slide Alignment, Assessing Baseline Accuracy with a Multimodal Alignment Algorithm Leveraging Speech, OCR, and Visual Features	Katharina Anderer et.al.	2409.16765	link
2024-09-25	DeformStream: Deformation-based Adaptive Volumetric Video Streaming	Boyan Li et.al.	2409.16615	null
2024-09-24	Partial Elastic Shape Registration of 3D Surfaces using Dynamic Programming	Javier Bernal et.al.	2409.16462	null
2024-09-25	Efficient Nearest Neighbor Search Using Dynamic Programming	Pengfei Wang et.al.	2409.15023	null
2024-09-22	Unsupervised Word Discovery: Boundary Detection with Clustering vs. Dynamic Programming	Simon Malan et.al.	2409.14486	null
2024-09-24	Batch Predictive Inference	Yonghoon Lee et.al.	2409.13990	link
2024-09-20	A Modified Algorithm for Optimal Picker Routing in a Single Block Warehouse	George Dunn et.al.	2409.13219	null
2024-09-19	Program Slicing in the Era of Large Language Models	Kimya Khakzad Shahandashti et.al.	2409.12369	null
2024-09-18	Differential dynamic programming with stagewise equality and inequality constraints using interior point method	Siddharth Prabhu et.al.	2409.12048	link
2024-09-20	Second-Order Constrained Dynamic Optimization	Yuichiro Aoyama et.al.	2409.11649	null
2024-09-18	Multi-stage stochastic linear programming for shared autonomous vehicle system operation and design with on-demand and pre-booked requests	Riki Kawase et.al.	2409.11611	null
2024-09-17	Optimal Investment with Costly Expert Opinions	Christoph Knochenhauer et.al.	2409.11569	null
2024-09-20	Exact Wavefront Propagation for Globally Optimal One-to-All Path Planning on 2D Cartesian Grids	Ibrahim Ibrahim et.al.	2409.11545	link
2024-09-17	Neural Networks for Vehicle Routing Problem	László Kovács et.al.	2409.11290	null
2024-09-17	Selective algorithm processing of subset sum distributions	Nick Dawes et.al.	2409.11076	null
2024-09-17	Local discontinuous Galerkin method for nonlinear BSPDEs of Neumann boundary conditions with deep backward dynamic programming time-marching	Yixiang Dai et.al.	2409.11004	null
2024-09-17	Relationship between stochastic maximum principle and dynamic programming principle under convex expectation	Xiaojuan Li et.al.	2409.10987	null
2024-09-16	Direct Data-Driven Discounted Infinite Horizon Linear Quadratic Regulator with Robustness Guarantees	Ramin Esmzad et.al.	2409.10703	null
2024-09-20	Motion Forecasting via Model-Based Risk Minimization	Aron Distelzweig et.al.	2409.10585	null
2024-09-16	Estimates for Optimal Multistage Group Partition Testing	Guojiang Shao et.al.	2409.10410	null
2024-09-16	Pareto Sums of Pareto Sets: Lower Bounds and Algorithms	Daniel Funke et.al.	2409.10232	null
2024-09-12	Q-value Regularized Decision ConvFormer for Offline Reinforcement Learning	Teng Yan et.al.	2409.08062	null
2024-09-12	Super Monotonic Alignment Search	Junhyeok Lee et.al.	2409.07704	link
2024-09-10	Design of Threshold-Constrained Indirect Quantizers	Ariel Doubchak et.al.	2409.06839	null
2024-09-10	Cooptimizing Safety and Performance with a Control-Constrained Formulation	Hao Wang et.al.	2409.06696	link
2024-09-12	Valuation Model of Chinese Convertible Bonds Based on Monte Carlo Simulation	Yu Liu et.al.	2409.06496	null
2024-09-09	OTFS-MDMA: An Elastic Multi-Domain Resource Utilization Mechanism for High Mobility Scenarios	Jie Chen et.al.	2409.05724	null
2024-09-09	Enhancing Empathic Accuracy: Penalized Functional Alignment Method to Correct Misalignment in Emotional Perception	Linh H Nghiem et.al.	2409.05343	null
2024-09-08	Cooperative Learning-Based Framework for VNF Caching and Placement Optimization over Low Earth Orbit Satellite Networks	Khai Doan et.al.	2409.05025	null
2024-09-08	Fast Deep Predictive Coding Networks for Videos Feature Extraction without Labels	Wenqian Xue et.al.	2409.04945	null
2024-09-17	Second-Order Stein Variational Dynamic Optimization	Yuichiro Aoyama et.al.	2409.04644	null
2024-09-06	Refined Bounds on Near Optimality Finite Window Policies in POMDPs and Their Reinforcement Learning	Yunus Emre Demirci et.al.	2409.04351	null
2024-09-05	Space-Efficient Algorithm for Integer Programming with Few Constraints	Lars Rohwedder et.al.	2409.03681	null
2024-09-05	Fine-Grained Equivalence for Problems Related to Integer Linear Programming	Lars Rohwedder et.al.	2409.03675	null
2024-09-06	Revenue Management with Calendar-Aware and Dependent Demands: Asymptotically Tight Fluid Approximations	Weiyuan Li et.al.	2409.02637	null
2024-09-03	FuzzCoder: Byte-level Fuzzing Test via Large Language Model	Liqun Yang et.al.	2409.01944	link
2024-09-03	Quantum Algorithms for One-Sided Crossing Minimization	Susanna Caroppo et.al.	2409.01942	null
2024-09-02	Solving Integrated Process Planning and Scheduling Problem via Graph Neural Network Based Deep Reinforcement Learning	Hongpei Li et.al.	2409.00968	link
2024-09-02	Multistage Robust Average Randomized Spectral Risk Optimization	Qiong Wu et.al.	2409.00892	null
2024-09-01	An Optimized Binning and Probabilistic Slice Sharing Algorithm for Motion Correction in Abdominal DW-MRI	Michelle Su et.al.	2409.00798	null
2024-09-01	Cooperative Path Planning with Asynchronous Multiagent Reinforcement Learning	Jiaming Yin et.al.	2409.00754	null
2024-09-01	The landscape of deterministic and stochastic optimal control problems: One-shot Optimization versus Dynamic Programming	Jihun Kim et.al.	2409.00655	null
2024-08-31	Foundations of Multivariate Distributional Reinforcement Learning	Harley Wiltzer et.al.	2409.00328	null
2024-08-30	Approximation Algorithms for Anchored Multiwatchman Routes	Joseph S. B. Mitchell et.al.	2408.17343	null
2024-08-30	Stationary Policies are Optimal in Risk-averse Total-reward MDPs with EVaR	Xihong Su et.al.	2408.17286	link
2024-08-30	A Two-Timescale Decision-Hazard-Decision Formulation for Storage Usage Values Calculation	Camila Martinez Parra et.al.	2408.17113	null
2024-08-29	Optimization Models for the Quadratic Traveling Salesperson Problem	Yuxiao Chen et.al.	2408.16680	null
2024-08-27	On the parameterized complexity of computing good edge-labelings	Davi de Andrade et.al.	2408.15181	null
2024-08-26	Achieving designed texture and flows in bulk active nematics using optimal control theory	Saptorshi Ghosh et.al.	2408.14596	null
2024-08-25	Decentralized Stochastic Control in Standard Borel Spaces: Centralized MDP Reductions, Near Optimality of Finite Window Local Information, and Q-Learning	Omar Mrani-Zentar et.al.	2408.13828	null
2024-08-23	The Ultimate Guide to Fine-Tuning LLMs from Basics to Breakthroughs: An Exhaustive Review of Technologies, Research, Best Practices, Applied Research Challenges and Opportunities	Venkatesh Balavadhani Parthasarathy et.al.	2408.13296	null
2024-08-18	An Introduction to Cognidynamics	Marco Gori et.al.	2408.13112	null
2024-08-20	Optimal Guarantees for Online Selection Over Time	Sebastian Perez-Salazar et.al.	2408.11224	null
2024-08-20	Fault Tolerant Dynamic Task Assignment for UAV-based Search Teams	Ali Nasir et.al.	2408.10564	null
2024-08-19	Efficient Exploration in Deep Reinforcement Learning: A Novel Bayesian Actor-Critic Algorithm	Nikolai Rozanov et.al.	2408.10055	null
2024-08-19	Continuous-Time Dynamic Decision Making with Costly Information	Christoph Knochenhauer et.al.	2408.09693	null
2024-08-19	Solving stochastic climate-economy models: A deep least-squares Monte Carlo approach	Aleksandar Arandjelović et.al.	2408.09642	null
2024-08-18	Exploratory Optimal Stopping: A Singular Control Formulation	Jodi Dianetti et.al.	2408.09335	null
2024-08-17	Optimal Strip Attitude Command of Earth Observation Satellite using Differential Dynamic Programming	Seungyeop Han et.al.	2408.09244	null
2024-08-17	Twin Sorting Dynamic Programming Assisted User Association and Wireless Bandwidth Allocation for Hierarchical Federated Learning	Rung-Hung Gau et.al.	2408.09076	null
2024-08-17	Atlas: Hierarchical Partitioning for Quantum Circuit Simulation on GPUs (Extended Version)	Mingkuan Xu et.al.	2408.09055	null
2024-08-15	Optimal control problems with generalized mean-field dynamics and viscosity solution to Master Bellman equation	Rainer Buckdahn et.al.	2408.08046	null
2024-08-14	Differentiating Policies for Non-Myopic Bayesian Optimization	Darian Nwankwo et.al.	2408.07812	null
2024-08-11	Moderate Exponential-time Quantum Dynamic Programming Across the Subsets for Scheduling Problems	Camille Grange et.al.	2408.05741	null
2024-08-10	Convergence Guarantee of Dynamic Programming for LTL Surrogate Reward	Zetong Xuan et.al.	2408.05438	null
2024-08-09	MIDI-to-Tab: Guitar Tablature Inference via Masked Language Modeling	Drew Edwards et.al.	2408.05024	null
2024-08-09	A Comprehensive System Architecture using Field Programmable Gate Arrays Technology, Dijkstra's Algorithm, and Edge Computing for Emergency Response in Smart Cities	Mahamat Abdel Aziz Assoul et.al.	2408.04924	null
2024-08-08	Mathematical Programming For Adaptive Experiments	Ethan Che et.al.	2408.04570	null
2024-08-08	Non-maximizing policies that fulfill multi-criterion aspirations in expectation	Simon Dima et.al.	2408.04385	null
2024-08-08	Enhanced Traffic Flow Prediction with Multi-Segment Fusion Tensor Graph Convolutional Networks	Wei Zhang et.al.	2408.04232	null
2024-08-06	A Course in Dynamic Optimization	Bar Light et.al.	2408.03034	null
2024-08-05	Positive Dynamic Programming: A Critique	Aaqib Peerzada et.al.	2408.02809	null
2024-08-05	Multi-level Traffic-Responsive Tilt Camera Surveillance through Predictive Correlated Online Learning	Tao Li et.al.	2408.02208	null
2024-08-04	Non-local Hamilton-Jacobi-Bellman equations for the stochastic optimal control of path-dependent piecewise deterministic processes	Elena Bandini et.al.	2408.02147	null
2024-08-03	Leveraging GNSS and Onboard Visual Data from Consumer Vehicles for Robust Road Network Estimation	Balázs Opra et.al.	2408.01640	null
2024-08-02	Occasionally Observed Piecewise-deterministic Markov Processes	Marissa Gee et.al.	2408.01335	null
2024-08-02	The Impact of Program Reduction on Automated Program Repair	Linas Vidziunas et.al.	2408.01134	null
2024-08-11	Deep Learning Approach for Changepoint Detection: Penalty Parameter Optimization	Tung L Nguyen et.al.	2408.00856	link
2024-07-31	Tractable and Provably Efficient Distributional Reinforcement Learning with General Value Function Approximation	Taehyun Cho et.al.	2407.21260	null
2024-07-30	A Machine Learning Approach to Boost the Vehicle-2-Grid Scheduling	Gabriele Agliardi et.al.	2407.20802	null
2024-07-30	Generalized replicator dynamics based on mean-field pairwise comparison dynamic	Hidekazu Yoshioka et.al.	2407.20751	null
2024-08-10	A UAV-Enabled Time-Sensitive Data Collection Scheme for Grassland Monitoring Edge Networks	Dongbin Jiao et.al.	2407.20585	null
2024-07-29	A Differential Dynamic Programming Framework for Inverse Reinforcement Learning	Kun Cao et.al.	2407.19902	null
2024-07-27	Map-Matching Queries under Fréchet Distance on Low-Density Spanners	Kevin Buchin et.al.	2407.19304	null
2024-07-26	RRO: A Regularized Routing Optimization Algorithm for Enhanced Throughput and Low Latency with Efficient Complexity	David Zenati et.al.	2407.18683	null
2024-07-26	Mean-field control of non exchangeable systems	Anna De Crescenzo et.al.	2407.18635	null
2024-08-01	Stochastic Games with Minimally Bounded Action Costs	David Mguni et.al.	2407.18010	null
2024-07-25	Personalized and Context-aware Route Planning for Edge-assisted Vehicles	Dinesh Cyril Selvaraj et.al.	2407.17980	null
2024-07-23	Data-Driven Optimal Feedback Laws via Kernel Mean Embeddings	Petar Bevanda et.al.	2407.16407	null
2024-07-23	Data-driven Multistage Distributionally Robust Linear Optimization with Nested Distance	Rui Gao et.al.	2407.16346	null
2024-07-22	Faster Optimal Coalition Structure Generation via Offline Coalition Selection and Graph-Based Search	Redha Taguelmimt et.al.	2407.16092	null
2024-07-22	Scheduling on a Stochastic Number of Machines	Moritz Buchem et.al.	2407.15737	null
2024-07-20	Interdiction of minimum spanning trees and other matroid bases	Noah Weninger et.al.	2407.14906	link
2024-07-20	A Tale of Two Scales: Reconciling Horizontal and Vertical Scaling for Inference Serving Systems	Kamran Razavi et.al.	2407.14843	null
2024-07-19	Dynamic Programming Techniques for Planar Orbital Transfer of Low Earth Orbit Satellites	C. Ciancarelli et.al.	2407.14675	null
2024-07-19	Generalization Error Analysis of Deep Backward Dynamic Programming for Solving Nonlinear PDEs	Du Ouyang et.al.	2407.14566	null
2024-07-19	On Policy Evaluation Algorithms in Distributional Reinforcement Learning	Julian Gerstenberg et.al.	2407.14175	null
2024-07-18	Shaded Route Planning Using Active Segmentation and Identification of Satellite Images	Longchao Da et.al.	2407.13689	null
2024-07-18	The Madness of Multiple Entries in March Madness	Jeff Decary et.al.	2407.13438	null
2024-07-18	Double interdiction problem on trees on the sum of root-leaf distances by upgrading edges	Xiao Li et.al.	2407.13391	null
2024-07-18	Deterministic Trajectory Optimization through Probabilistic Optimal Control	Mohammad Mahmoudi Filabadi et.al.	2407.13316	null
2024-07-18	Integrated Hardware Architecture and Device Placement Search	Irene Wang et.al.	2407.13143	link
2024-07-18	Multiobjective Vehicle Routing Optimization with Time Windows: A Hybrid Approach Using Deep Reinforcement Learning and NSGA-II	Rixin Wu et.al.	2407.13113	null
2024-07-17	Dynamic Programming Principle and Hamilton-Jacobi-Bellman Equation for Optimal Control Problems with Uncertainty	M. Soledad Aronna et.al.	2407.13045	null
2024-07-17	Estimating the Potential Impact of Combined Race and Ethnicity Reporting on Long-Term Earnings Statistics	Kevin L. McKinney et.al.	2407.12775	null
2024-07-16	Enabling MCTS Explainability for Sequential Planning Through Computation Tree Logic	Ziyan An et.al.	2407.10820	null
2024-07-14	Fine Grained Lower Bounds for Multidimensional Knapsack	Ilan Doron-Arad et.al.	2407.10146	null
2024-07-12	Investigating the Interplay of Prioritized Replay and Generalization	Parham Mohammad Panahi et.al.	2407.09702	null
2024-07-12	An efficient algorithm to compute the minimum free energy of interacting nucleic acid strands	Ahmed Shalaby et.al.	2407.09676	null
2024-07-12	Hamilton-Jacobi Reachability in Reinforcement Learning: A Survey	Milan Ganai et.al.	2407.09645	null
2024-07-12	Integer programs with nearly totally unimodular matrices: the cographic case	Manuel Aprile et.al.	2407.09477	null
2024-07-12	A new approach to principal-agent problems with volatility control	Alessandro Chiusolo et.al.	2407.09471	null
2024-07-12	CAACS: A Carbon Aware Ant Colony System	Marina Lin et.al.	2407.09404	null
2024-07-12	Structure and Independence in Hyperbolic Uniform Disk Graphs	Thomas Bläsius et.al.	2407.09362	null
2024-07-12	KUNPENG: An Embodied Large Model for Intelligent Maritime	Naiyao Wang et.al.	2407.09048	link
2024-07-09	Trajectory Data Mining and Trip Travel Time Prediction on Specific Roads	Muhammad Awais Amin et.al.	2407.07030	null
2024-07-08	Solving Multi-Model MDPs by Coordinate Ascent and Dynamic Programming	Xihong Su et.al.	2407.06329	link
2024-07-08	Narrowing the Gap between Adversarial and Stochastic MDPs via Policy Optimization	Daniil Tiapkin et.al.	2407.05704	null
2024-07-06	Advancing Algorithmic Approaches to Probabilistic Argumentation under the Constellation Approach	Andrei Popescu et.al.	2407.05058	null
2024-07-05	Re-Tuning: Overcoming the Compositionality Limits of Large Language Models with Recursive Tuning	Eric Pasewark et.al.	2407.04787	link
2024-07-05	GOALPlace: Begin with the End in Mind	Anthony Agnesina et.al.	2407.04579	null
2024-07-04	Advanced Artificial Intelligence Strategy for Optimizing Urban Rail Network Design using Nature-Inspired Algorithms	Hariram Sampath Kumar et.al.	2407.04087	null
2024-07-04	Multi-Time Scale Service Caching and Pricing in MEC Systems with Dynamic Program Popularity	Yiming Chen et.al.	2407.03804	null
2024-07-03	Reconsidering utility: unveiling the limitations of synthetic mobility data generation algorithms in real-life scenarios	Alexandra Kapp et.al.	2407.03237	null
2024-07-12	A Two-stage Identification Method for Switched Linear Systems	Zheng Wenju et.al.	2407.02743	null
2024-07-02	DM3D: Distortion-Minimized Weight Pruning for Lossless 3D Object Detection	Kaixin Xu et.al.	2407.02098	null
2024-06-28	Edge-DIRECT: A Deep Reinforcement Learning-based Method for Solving Heterogeneous Electric Vehicle Routing Problem with Time Window Constraints	Arash Mozhdehi et.al.	2407.01615	null
2024-07-02	Contractual Reinforcement Learning: Pulling Arms with Invisible Hands	Jibang Wu et.al.	2407.01458	null
2024-07-01	Exact statistical analysis for response-adaptive clinical trials: a general and computationally tractable approach	Stef Baas et.al.	2407.01055	null
2024-06-30	Maximum Entropy Inverse Reinforcement Learning of Diffusion Models with Energy-Based Models	Sangwoong Yoon et.al.	2407.00626	link
2024-06-30	Your Car Tells Me Where You Drove: A Novel Path Inference Attack via CAN Bus and OBD-II Data	Tommaso Bianchi et.al.	2407.00585	null
2024-06-29	A Two-stage Reinforcement Learning-based Approach for Multi-entity Task Allocation	Aicheng Gong et.al.	2407.00496	link
2024-06-29	Vector-valued robust stochastic control	Igor Cialenco et.al.	2407.00266	null
2024-06-28	Leveraging Fixed-Parameter Tractability for Robot Inspection Planning	Yosuke Mizutani et.al.	2407.00251	null
2024-06-28	Approximate Solutions for Multi-Trip Route Planning in Time-Sensitive Situations	Bahar Cavdar et.al.	2407.00173	null
2024-06-28	Online Optimization of DNN Inference Network Utility in Collaborative Edge Computing	Rui Li et.al.	2406.19613	null
2024-06-27	Efficient and Distributed Large-Scale 3D Map Registration using Tomographic Features	Halil Utku Unlu et.al.	2406.19461	link
2024-06-27	Cuts in Graphs with Matroid Constraints	Aritra Banik et.al.	2406.19134	null
2024-06-27	State and Input Constrained Output-Feedback Adaptive Optimal Control of Affine Nonlinear Systems	Tochukwu Elijah Ogri et.al.	2406.18804	null
2024-06-26	Markov Decision Process and Approximate Dynamic Programming for a Patient Assignment Scheduling problem	Malgorzata M. O'Reilly et.al.	2406.18618	null
2024-06-26	Tiered Service Architecture for Remote Patient Monitoring	Siddharth Chandak et.al.	2406.18000	null
2024-06-25	Splitting Guarantees for Prophet Inequalities via Nonlinear Systems	Johannes Brustle et.al.	2406.17767	null
2024-06-25	Using iterated local alignment to aggregate GPS trajectories into a traffic flow map	Tarn Duong et.al.	2406.17500	null
2024-06-24	A multiplicative surface signature through its Magnus expansion	Ilya Chevyrev et.al.	2406.16856	null
2024-06-24	Stochastic Path-Dependent Volatility Models for Price-Storage Dynamics in Natural Gas Markets and Discrete-Time Swing Option Pricing	Jinniao Qiu et.al.	2406.16400	null
2024-06-21	Exact discovery is polynomial for sparse causal Bayesian networks	Felix L. Rios et.al.	2406.15012	link
2024-06-19	A programmable wafer-scale chiroptical heterostructure of twisted aligned carbon nanotubes and phase change materials	Jichao Fan et.al.	2406.13190	null
2024-06-14	Interpretable Cascading Mixture-of-Experts for Urban Traffic Congestion Prediction	Wenzhao Jiang et.al.	2406.12923	null
2024-06-26	LayerMerge: Neural Network Depth Compression through Layer Pruning and Merging	Jinuk Kim et.al.	2406.12837	link
2024-06-17	LibProf: A Python Profiler for Improving Cold Start Performance in Serverless Applications	Syed Salauddin Mohammad Tariq et.al.	2406.11734	null
2024-06-17	Statistical Learning of Distributionally Robust Stochastic Control in Continuous State Spaces	Shengbo Wang et.al.	2406.11281	null
2024-06-16	WeShap: Weak Supervision Source Evaluation with Shapley Values	Naiqing Guan et.al.	2406.11010	null
2024-06-16	Solving Co-Path/Cycle Packing Faster than $3^k$	Yuxi Liu et.al.	2406.10829	null
2024-06-15	Scheduling two types of jobs with minimum makespan	Song Cao et.al.	2406.10467	null
2024-06-14	CycleTrajectory: An End-to-End Pipeline for Enriching and Analyzing GPS Trajectories to Understand Cycling Behavior and Environment	Meihui Wang et.al.	2406.10069	link
2024-06-13	Optimal Control of Agent-Based Dynamics under Deep Galerkin Feedback Laws	Frederik Kelbel et.al.	2406.09141	link
2024-06-13	Coordinated Trading Strategies for Battery Storage in Reserve and Spot Markets	Paul E. Seifert et.al.	2406.08390	null
2024-06-11	Flow Map Matching	Nicholas M. Boffi et.al.	2406.07507	null
2024-06-11	Variational inequalities and smooth-fit principle for singular stochastic control problems in Hilbert spaces	Salvatore Federico et.al.	2406.07242	null
2024-06-10	Stochastic Guidance of Buoyancy Controlled Vehicles under Ice Shelves using Ocean Currents	Federico Rossi et.al.	2406.06724	null
2024-06-10	Leveraging Hyperscanning EEG and VR Omnidirectional Treadmill to Explore Inter-Brain Synchrony in Collaborative Spatial Navigation	Chun-Hsiang Chuang et.al.	2406.06327	null
2024-06-09	Production and distribution planning, scheduling, and routing optimization in a yogurt supply chain under demand uncertainty: A case study	Babak Javadi et.al.	2406.05803	null
2024-06-09	Heart Sound Segmentation Using Deep Learning Techniques	Manas Madine et.al.	2406.05653	null
2024-06-11	Bisimulation Metrics are Optimal Transport Distances, and Can be Computed Efficiently	Sergio Calo et.al.	2406.04056	null
2024-06-04	GrootVL: Tree Topology is All You Need in State Space Model	Yicheng Xiao et.al.	2406.02395	link
2024-06-21	Branches: A Fast Dynamic Programming and Branch & Bound Algorithm for Optimal Decision Trees	Ayman Chaouki et.al.	2406.02175	link
2024-06-03	An efficient solution to Hidden Markov Models on trees with coupled branches	Farzan Vafa et.al.	2406.01663	null
2024-06-03	A New View on Planning in Online Reinforcement Learning	Kevin Roice et.al.	2406.01562	null
2024-06-02	Dual Policy Reinforcement Learning for Real-time Rebalancing in Bike-sharing Systems	Jiaqi Liang et.al.	2406.00868	null
2024-06-02	Computing Optimal Equilibria in Repeated Games with Restarts	Ratip Emin Berker et.al.	2406.00851	null
2024-06-02	A Lazy Abstraction Algorithm for Markov Decision Processes: Theory and Initial Evaluation	Dániel Szekeres et.al.	2406.00824	null
2024-06-10	Model Predictive Control and Reinforcement Learning: A Unified Framework Based on Dynamic Programming	Dimitri P. Bertsekas et.al.	2406.00592	null
2024-06-01	Optimal Transmission Power Scheduling for Networked Control System under DoS Attack	Siyi Wang et.al.	2406.00540	null
2024-06-01	A Single-Loop Robust Policy Gradient Method for Robust Markov Decision Processes	Zhenwei Lin et.al.	2406.00274	link
2024-05-31	Finding Diverse Solutions Parameterized by Cliquewidth	Karolina Drabik et.al.	2405.20931	null
2024-05-29	A numerical algorithm with linear complexity for Multi-marginal Optimal Transport with $L^1$ Cost	Chunhui Chen et.al.	2405.19246	null
2024-05-28	A Pontryagin Perspective on Reinforcement Learning	Onno Eberhard et.al.	2405.18100	null
2024-05-27	Q-value Regularized Transformer for Offline Reinforcement Learning	Shengchao Hu et.al.	2405.17098	null
2024-05-25	A Bi-Objective Approach to Last-Mile Delivery Routing Considering Driver Preferences	Juan Pablo Mesa et.al.	2405.16051	null
2024-06-03	Inference of Utilities and Time Preference in Sequential Decision-Making	Haoyang Cao et.al.	2405.15975	null
2024-05-31	Stability and Performance Analysis of Model Predictive Control of Uncertain Linear Systems	Changrui Liu et.al.	2405.15552	link
2024-05-24	An Approximate Dynamic Programming Framework for Occlusion-Robust Multi-Object Tracking	Pratyusha Musunuru et.al.	2405.15137	null
2024-05-23	Two-Stage ML-Guided Decision Rules for Sequential Decision Making under Uncertainty	Andrew Rosemberg et.al.	2405.14973	null
2024-05-23	A rolling horizon heuristic approach for a multi-stage stochastic waste collection problem	Andrea Spinelli et.al.	2405.14499	link
2024-05-23	EdgeShard: Efficient LLM Inference via Collaborative Edge Computing	Mingjin Zhang et.al.	2405.14371	null
2024-05-23	Optimal Whole Body Trajectory Planning for Mobile Manipulators in Planetary Exploration and Construction	Federica Storiale et.al.	2405.14363	null
2024-05-23	Deterministic Policies for Constrained Reinforcement Learning in Polynomial-Time	Jeremy McMahan et.al.	2405.14183	null
2024-05-22	Tackling Decision Processes with Non-Cumulative Objectives using Reinforcement Learning	Maximilian Nägele et.al.	2405.13609	link
2024-05-21	Parallel Algorithm for Optimal Threshold Labeling of Ordinal Regression Methods	Ryoya Yamasaki et.al.	2405.12756	link
2024-05-21	Short and simple introduction to Bellman filtering and smoothing	Rutger-Jan Lange et.al.	2405.12668	null
2024-05-21	Data-driven Coordinated AC/DC Control Strategy for Frequency Safety	Qianni Cao et.al.	2405.12546	null
2024-05-20	Semantic Trajectory Data Mining with LLM-Informed POI Classification	Yifan Liu et.al.	2405.11715	null
2024-05-18	On the Trajectory Regularity of ODE-based Diffusion Sampling	Defang Chen et.al.	2405.11326	link
2024-05-15	Harmonizing Human Insights and AI Precision: Hand in Hand for Advancing Knowledge Graph Task	Shurong Wang et.al.	2405.09477	null
2024-05-14	Treatment Effect Estimation for User Interest Exploration on Recommender Systems	Jiaju Chen et.al.	2405.08582	link
2024-05-27	Dynamic Programming for Symbolic Boolean Realizability and Synthesis	Yi Lin et.al.	2405.07975	null
2024-05-13	Space Domain based Ecological Cooperative and Adaptive Cruise Control on Rolling Terrain	Mingyue Lei et.al.	2405.07553	null
2024-05-12	Deciding regular games: a playground for exponential time algorithms	Zihui Liang et.al.	2405.07188	null
2024-05-12	Trade execution games in a Markovian environment	Masamitsu Ohnishi et.al.	2405.07184	null
2024-05-10	Dynamic programming principle and computable prices in financial market models with transaction costs	Emmanuel Lepinette et.al.	2405.06623	null
2024-05-09	Change point localisation and inference in fragmented functional data	Gengyu Xue et.al.	2405.05730	link
2024-05-09	Infinite horizon stochastic recursive control problems with jumps: dynamic programming and stochastic verification theorems	Sheng Luo et.al.	2405.05561	null
2024-05-14	Robust Reward Placement under Uncertainty	Petros Petsinis et.al.	2405.05433	null
2024-05-06	Novel Tour Construction Heuristic for Pick-Up and Delivery Routing Problems	Mithun Goutham et.al.	2405.03774	null
2024-05-05	TSP Escapes the $O(2^n n^2)$ Curse	Mihail Stoian et.al.	2405.03018	link
2024-05-02	DiffusionPipe: Training Large Diffusion Models with Efficient Pipelines	Ye Tian et.al.	2405.01248	null
2024-05-02	Lipschitz constant estimation for general neural network architectures using control tools	Patricia Pauli et.al.	2405.01125	link
2024-05-01	A biased random-key genetic algorithm with variable mutants to solve a vehicle routing problem	Paola Festa et.al.	2405.00268	null
2024-04-28	Bi-objective optimization of a VRP problem applied to urban solid waste collection through a model that includes the visual attraction of routes	Diego Rossit et.al.	2405.00068	null
2024-04-26	Energy Storage Arbitrage in Two-settlement Markets: A Transformer-Based Approach	Saud Alghumayjan et.al.	2404.17683	null
2024-04-25	Path integral control under McKean-Vlasov dynamics	Timothy Bennett et.al.	2404.17006	null
2024-04-25	Parallel and (Nearly) Work-Efficient Dynamic Programming	Xiangyun Ding et.al.	2404.16314	link
2024-04-23	Prediction from compression for models with infinite memory, with applications to hidden Markov and renewal processes	Yanjun Han et.al.	2404.15454	null
2024-04-26	Variational Dynamic Programming for Stochastic Optimal Control	Marc Lambert et.al.	2404.14806	link
2024-04-22	Tile-Weighted Rate-Distortion Optimized Packet Scheduling for 360 $^\circ$ VR Video Streaming	Haopeng Wang et.al.	2404.14573	null
2024-04-21	Stochastic Multi-round Submodular Optimization with Budget	Vincenzo Auletta et.al.	2404.13737	null
2024-04-21	Planning of Truck Platooning for Road-Network Capacitated Vehicle Routing Problem	Yilang Hao et.al.	2404.13512	null
2024-04-20	Liquidity Pool Design on Automated Market Makers	Xue Dong He et.al.	2404.13291	null
2024-04-19	Decentralized Coordination of Distributed Energy Resources through Local Energy Markets and Deep Reinforcement Learning	Daniel May et.al.	2404.13142	null
2024-04-18	NLP-enabled trajectory map-matching in urban road networks using transformer sequence-to-sequence model	Sevin Mohammadi et.al.	2404.12460	null
2024-04-18	Recursive stochastic differential games with non-Lipschitzian generators and viscosity solutions of Hamilton-Jacobi-Bellman-Isaacs equation	Guangchen Wang et.al.	2404.12129	null
2024-04-18	Actor-Critic Reinforcement Learning with Phased Actor	Ruofan Wu et.al.	2404.11834	null
2024-04-18	Itō and Itō-Wentzell chain rule for flows of conditional laws of continuous semimartingales: an easy approach	Assil Fadle et.al.	2404.11010	null
2024-04-16	Zero-Sum Games for Volterra Integral Equations and Viscosity Solutions of Path-Dependent Hamilton-Jacobi Equations	Mikhail I. Gomoyunov et.al.	2404.10428	null
2024-04-16	Urban Water Sprinkler Routing: A Multi-Depot Mixed Capacitated Arc Routing Problem Incorporating Real-Time Demands	Hongtai Yang et.al.	2404.10230	null
2024-04-13	Fast Gradient Computation for Gromov-Wasserstein Distance	Wei Zhang et.al.	2404.08970	null
2024-04-12	A Parametric Approach for Solving Convex Quadratic Optimization with Indicators Over Trees	Aaresh Bhathena et.al.	2404.08178	link
2024-04-06	Viscosity solutions for mean field optimal switching with a two-time-scale Markov chain	Tian Chen et.al.	2404.07998	null
2024-04-11	Parameterized Fast and Safe Tracking (FaSTrack) using Deepreach	Hyun Joe Jeong et.al.	2404.07431	null
2024-04-09	Inexact Policy Iteration Methods for Large-Scale Markov Decision Processes	Matilde Gargiani et.al.	2404.06136	null
2024-04-09	fastcpd: Fast Change Point Detection in R	Xingchi Li et.al.	2404.05933	link
2024-04-08	Non-concave distributionally robust stochastic control in a discrete time finite horizon setting	Ariel Neufeld et.al.	2404.05230	link
2024-04-07	Percentile Criterion Optimization in Offline Reinforcement Learning	Elita A. Lobo et.al.	2404.05055	link
2024-04-05	A Ground Mobile Robot for Autonomous Terrestrial Laser Scanning-Based Field Phenotyping	Javier Rodriguez-Sanchez et.al.	2404.04404	null
2024-04-04	Forecasting with Neuro-Dynamic Programming	Pedro Afonso Fernandes et.al.	2404.03737	null
2024-04-03	Reinforcement Learning in Categorical Cybernetics	Jules Hedges et.al.	2404.02688	null
2024-04-03	Transformer-based Stagewise Decomposition for Large-Scale Multistage Stochastic Optimization	Chanyeong Kim et.al.	2404.02583	null
2024-04-01	Versatile Navigation under Partial Observability via Value-guided Diffusion Policy	Gengyu Zhang et.al.	2404.02176	null
2024-03-31	Adversarially-Robust Inference on Trees via Belief Propagation	Samuel B. Hopkins et.al.	2404.00768	null
2024-03-28	A Faster Algorithm for Pigeonhole Equal Sums	Ce Jin et.al.	2403.19117	null
2024-03-27	Policy iteration for discrete-time systems with discounted costs: stability and near-optimality guarantees	Jonathan de Brusse et.al.	2403.19007	null
2024-03-27	A Dynamic Programming Approach for Road Traffic Estimation	Mattia Laurini et.al.	2403.18561	null
2024-03-26	Generalized Maximum Entropy Differential Dynamic Programming	Yuichiro Aoyama et.al.	2403.18130	null
2024-03-26	Accuracy enhancement method for speech emotion recognition from spectrogram using temporal frequency correlation and positional information learning through knowledge transfer	Jeong-Yoon Kim et.al.	2403.17327	link
2024-03-25	State-Augmented Linear Games with Antagonistic Error for High-Dimensional, Nonlinear Hamilton-Jacobi Reachability	Will Sharpless et.al.	2403.16982	link
2024-03-25	Semantic-Aware Remote Estimation of Multiple Markov Sources Under Constraints	Jiping Luo et.al.	2403.16855	null
2024-03-24	On the Navier-Stokes equations and the Hamilton-Jacobi-Bellman equation on the group of volume preserving diffeomorphisms	Xiang-Dong Li et.al.	2403.15997	null
2024-03-23	On Merton's Optimal Portfolio Problem under Sporadic Bankruptcy	Yaacov Kopeliovich et.al.	2403.15923	link
2024-03-22	Transactive Local Energy Markets Enable Community-Level Resource Coordination Using Individual Rewards	Daniel C. May et.al.	2403.15617	null
2024-03-19	Most Likely Sequence Generation for $n$ -Grams, Transformers, HMMs, and Markov Chains, by Using Rollout Algorithms	Yuchao Li et.al.	2403.15465	null
2024-03-21	Conservative Linear Envelopes for High-Dimensional, Hamilton-Jacobi Reachability for Nonlinear Systems via the Hopf Formula	Will Sharpless et.al.	2403.14184	null
2024-03-20	Optimal control of continuous-time symmetric systems with unknown dynamics and noisy measurements	Hamed Taghavian et.al.	2403.13605	null
2024-03-19	Solving Combinatorial Pricing Problems using Embedded Dynamic Programming Models	Quang Minh Bui et.al.	2403.12923	null
2024-03-18	AdaMER-CTC: Connectionist Temporal Classification with Adaptive Maximum Entropy Regularization for Automatic Speech Recognition	SooHwan Eom et.al.	2403.11578	null
2024-03-17	Multiscale Quantile Regression with Local Error Control	Zhi Liu et.al.	2403.11356	link
2024-03-15	Fast Generation of Feasible Trajectories in Direct Optimal Control	David Kiessling et.al.	2403.10115	link
2024-03-14	Is Data All That Matters? The Role of Control Frequency for Learning-Based Sampled-Data Control of Uncertain Systems	Ralf Römer et.al.	2403.09504	link
2024-03-14	Quantum Dynamic Programming	Jeongrak Son et.al.	2403.09187	null
2024-03-15	Relationship between General MP and DPP for the Stochastic Recursive Optimal Control Problem With Jumps: Viscosity Solution Framework	Bin Wang et.al.	2403.09044	null
2024-03-13	Model-free Resilient Controller Design based on Incentive Feedback Stackelberg Game and Q-learning	Jiajun Shen et.al.	2403.08948	null
2024-03-13	Online Multi-Contact Feedback Model Predictive Control for Interactive Robotic Tasks	Seo Wook Han et.al.	2403.08302	null
2024-03-12	Optimal Design and Implementation of an Open-source Emulation Platform for User-Centric Shared E-mobility Services	Maqsood Hussain Shah et.al.	2403.07964	null
2024-03-12	The Primal Pathwidth SETH	Michael Lampis et.al.	2403.07239	null
2024-03-10	A Unified Model for Spatio-Temporal Prediction Queries with Arbitrary Modifiable Areal Units	Liyue Chen et.al.	2403.07022	link
2024-03-11	Domain-Independent Dynamic Programming and Constraint Programming Approaches for Assembly Line Balancing Problems with Setups	Jiachen Zhang et.al.	2403.06780	null
2024-03-11	Balanced Substructures in Bicolored Graphs	P. S. Ardra et.al.	2403.06608	null
2024-03-11	An Efficient Solution to the 2D Visibility Problem in Cartesian Grid Maps and its Application in Heuristic Path Planning	Ibrahim Ibrahim et.al.	2403.06494	link
2024-03-11	AGAThA: Fast and Efficient GPU Acceleration of Guided Sequence Alignment for Long Read Mapping	Seongyeon Park et.al.	2403.06478	link
2024-03-09	Spatial Clustering Approach for Vessel Path Identification	Mohamed Abuella et.al.	2403.05778	link
2024-03-07	On $[1,2]$ -Domination in Interval and Circle Graphs	Mohsen Alambardar Meybodi et.al.	2403.04694	null
2024-03-07	Fill-and-Spill: Deep Reinforcement Learning Policy Gradient Methods for Reservoir Operation Decision and Control	Sadegh Sadeghi Tabas et.al.	2403.04195	null
2024-03-06	Global Geolocated Realtime Data of Interfleet Urban Transit Bus Idling	Nicholas Kunz et.al.	2403.03489	link
2024-03-06	SalienTime: User-driven Selection of Salient Time Steps for Large-Scale Geospatial Data Visualization	Juntong Chen et.al.	2403.03449	link
2024-03-06	Leveraging The Finite States of Emotion Processing to Study Late-Life Mental Health	Yuanzhe Huang et.al.	2403.03414	null
2024-03-04	Dynamic programming principle in cost-efficient sequential design: application to switching measurements	Jeongmin Han et.al.	2403.02245	null
2024-03-04	Cooperative and Interaction-aware Driver Model for Lane Change Maneuver	Jemin Woo et.al.	2403.01752	null
2024-03-01	DyPyBench: A Benchmark of Executable Python Software	Islem Bouzenia et.al.	2403.00539	link
2024-03-01	Graph Construction with Flexible Nodes for Traffic Demand Prediction	Jinyan Hou et.al.	2403.00276	link
2024-02-29	Lifelong Benchmarks: Efficient Model Evaluation in an Era of Rapid Progress	Ameya Prabhu et.al.	2402.19472	link
2024-02-27	Globally Convergent Distributed Sequential Quadratic Programming with Overlapping Decomposition and Exact Augmented Lagrangian Merit Function	Runxin Ni et.al.	2402.17170	null
2024-02-24	Selective Task offloading for Maximum Inference Accuracy and Energy efficient Real-Time IoT Sensing Systems	Abdelkarim Ben Sada et.al.	2402.16904	null
2024-02-25	IKLink: End-Effector Trajectory Tracking with Minimal Reconfigurations	Yeping Wang et.al.	2402.16154	link
2024-02-25	Evolving E-commerce Logistics Planning- Integrating Embedded Technology and Ant Colony Algorithm for Enhanced Efficiency	Lynn Huang et.al.	2402.15965	null
2024-02-25	Budget-Constrained Tool Learning with Planning	Yuanhang Zheng et.al.	2402.15960	link
2024-02-23	Neural optimal controller for stochastic systems via pathwise HJB operator	Zhe Jiao et.al.	2402.15592	null
2024-02-23	Curve fitting on a quantum annealer for an advanced navigation method	Philipp Isserstedt et.al.	2402.15308	null
2024-02-22	Quantum Markov Decision Processes Part II: Optimal Solutions and Algorithms	Naci Saldi et.al.	2402.14651	null
2024-02-22	Quantum Markov Decision Processes Part I: General Theory, Approximations, and Classes of Policies	Naci Saldi et.al.	2402.14649	null
2024-02-21	Quantum Annealing and Graph Neural Networks for Solving TSP with QUBO	Haoqi He et.al.	2402.14036	null
2024-02-21	Do Efficient Transformers Really Save Computation?	Kai Yang et.al.	2402.13934	null
2024-02-21	Benchmarking and Dissecting the Nvidia Hopper GPU Architecture	Weile Luo et.al.	2402.13499	null
2024-02-20	An Improved Lower Bound on the Number of Pseudoline Arrangements	Fernando Cortés Kühnast et.al.	2402.13107	null
2024-02-20	Smart Mobility Digital Twin Based Automated Vehicle Navigation System: A Proof of Concept	Kui Wang et.al.	2402.12682	null
2024-02-19	An algorithm for counting number of all (normal) fuzzy subgroups in $U_{6n}$	Marek Hyčko et.al.	2402.12543	null
2024-02-29	Sequoia: Scalable, Robust, and Hardware-aware Speculative Decoding	Zhuoming Chen et.al.	2402.12374	link
2024-02-19	Scalable Virtual Valuations Combinatorial Auction Design by Combining Zeroth-Order and First-Order Optimization Method	Zhijian Duan et.al.	2402.11904	null
2024-02-19	Two Online Map Matching Algorithms Based on Analytic Hierarchy Process and Fuzzy Logic	Jeremy J. Lin et.al.	2402.11866	null
2024-02-18	A Fisher Information based Receding Horizon Control Method for Signal Strength Model Estimation	Yancheng Zhu et.al.	2402.11483	null
2024-02-16	Optimal Savings and Value of Population in A Stochastic Environment: Transient Behavior	Hao Liu et.al.	2402.10768	null
2024-02-15	Engraving Oriented Joint Estimation of Pitch Spelling and Local and Global Keys	Augustin Bouquillard et.al.	2402.10247	null
2024-02-14	Analyzing the Impact of Computation in Adaptive Dynamic Programming for Stochastic LQR Problem	Wenhan Cao et.al.	2402.09575	null
2024-02-13	Approximate Sequential Optimization for Informative Path Planning	Joshua Ott et.al.	2402.08841	link
2024-02-13	Sequence graphs realizations and ambiguity in language models	Sammy Khalife et.al.	2402.08830	null
2024-02-11	GenSTL: General Sparse Trajectory Learning via Auto-regressive Generation of Feature Domains	Yan Lin et.al.	2402.07232	link
2024-02-09	High-Precision Geosteering via Reinforcement Learning and Particle Filters	Ressi Bonti Muhammad et.al.	2402.06377	null
2024-02-09	Bellman Conformal Inference: Calibrating Prediction Intervals For Time Series	Zitong Yang et.al.	2402.05203	link
2024-02-04	Empowering Computing and Networks Convergence System with Distributed Cooperative Routing	Yujiao Hu et.al.	2402.02381	null
2024-02-03	Multiple sequences Prophet Inequality Under Observation Constraints	Aristomenis Tsopelakos et.al.	2402.02059	null
2024-02-02	Capturing waste collection planning expert knowledge in a fitness function through preference learning	Laura Fernández Díaz et.al.	2402.01849	null
2024-02-02	Dynamic programming for the stochastic matching model on general graphs: the case of the `N-graph'	Loïc Jean et.al.	2402.01803	null
2024-02-01	AlphaRank: An Artificial Intelligence Approach for Ranking and Selection Problems	Ruihan Zhou et.al.	2402.00907	null
2024-02-01	Cocco: Hardware-Mapping Co-Exploration towards Memory Capacity-Communication Optimization	Zhanhong Tan et.al.	2402.00629	null
2024-02-02	Branch and Price for the Length-Constrained Cycle Partition Problem	Mohammed Ghannam et.al.	2401.17937	link
2024-01-31	Revisiting speech segmentation and lexicon learning with better features	Herman Kamper et.al.	2401.17902	null
2024-02-16	The computation of approximate feedback Stackelberg equilibria in multi-player nonlinear constrained dynamic games	Jingqi Li et.al.	2401.15745	link
2024-01-28	HappyRouting: Learning Emotion-Aware Route Trajectories for Scalable In-The-Wild Navigation	David Bethge et.al.	2401.15695	null
2024-01-28	Constrained Markov decision processes for response-adaptive procedures in clinical trials with binary outcomes	Stef Baas et.al.	2401.15694	null
2024-01-27	Fair and Efficient Ridesharing: A Dynamic Programming-based Relocation Approach	Aqsa Ashraf Makhdomi et.al.	2401.15363	null
2024-01-27	Optimal Sparse Survival Trees	Rui Zhang et.al.	2401.15330	link
2024-01-25	Domain-Independent Dynamic Programming	Ryo Kuroiwa et.al.	2401.13883	link
2024-01-27	Deep multitask neural networks for solving some stochastic optimal control problems	Christian Yeo et.al.	2401.12923	link
2024-01-23	Optimal Stopping of Branching Diffusion Processes	Idris Kharroubi et.al.	2401.12811	null
2024-01-22	On a class of interdiction problems with partition matroids: complexity and polynomial-time algorithms	Sergey S. Ketkov et.al.	2401.12010	null
2024-01-22	Finite horizon optimal control of reaction-diffusion SIV epidemic system with stochastic environment	Zong Wang et.al.	2401.11744	null
2024-01-20	Closing the Gap between TD Learning and Supervised Learning -- A Generalisation Point of View	Raj Ghugare et.al.	2401.11237	link

(back to top)

Large Language Model

Publish Date	Title	Authors	PDF	Code
2025-06-10	VIKI-R: Coordinating Embodied Multi-Agent Cooperation via Reinforcement Learning	Li Kang et.al.	2506.09049	null
2025-06-10	Same Task, Different Circuits: Disentangling Modality-Specific Mechanisms in VLMs	Yaniv Nikankin et.al.	2506.09047	null
2025-06-10	Agentic Neural Networks: Self-Evolving Multi-Agent Systems via Textual Backpropagation	Xiaowen Ma et.al.	2506.09046	null
2025-06-10	Cosmos-Drive-Dreams: Scalable Synthetic Driving Data Generation with World Foundation Models	Xuanchi Ren et.al.	2506.09042	null
2025-06-10	Autoregressive Semantic Visual Reconstruction Helps VLMs Understand Better	Dianyi Wang et.al.	2506.09040	null
2025-06-10	AbstentionBench: Reasoning LLMs Fail on Unanswerable Questions	Polina Kirichenko et.al.	2506.09038	null
2025-06-10	FZOO: Fast Zeroth-Order Optimizer for Fine-Tuning Large Language Models towards Adam-Scale Speed	Sizhe Dang et.al.	2506.09034	null
2025-06-10	Router-R1: Teaching LLMs Multi-Round Routing and Aggregation via Reinforcement Learning	Haozhen Zhang et.al.	2506.09033	null
2025-06-10	Do MIL Models Transfer?	Daniel Shao et.al.	2506.09022	null
2025-06-10	SPEED-RL: Faster Training of Reasoning Models via Online Curriculum Learning	Ruiqi Zhang et.al.	2506.09016	null
2025-06-10	Learning to Reason Across Parallel Samples for LLM Reasoning	Jianing Qi et.al.	2506.09014	null
2025-06-10	Boosting Rust Unit Test Coverage through Hybrid Program Analysis and Large Language Models	Bei Chu et.al.	2506.09002	null
2025-06-10	Efficient Medical Vision-Language Alignment Through Adapting Masked Vision Models	Chenyu Lian et.al.	2506.08990	null
2025-06-10	SwS: Self-aware Weakness-driven Problem Synthesis in Reinforcement Learning for LLM Reasoning	Xiao Liang et.al.	2506.08989	null
2025-06-10	On Finetuning Tabular Foundation Models	Ivan Rubachev et.al.	2506.08982	null
2025-06-10	AdaDec: Uncertainty-Guided Adaptive Decoding for LLM-based Code Generation	Kaifeng He et.al.	2506.08980	null
2025-06-10	Propositional Logic for Probing Generalization in Neural Networks	Anna Langedijk et.al.	2506.08978	null
2025-06-10	Atomic-to-Compositional Generalization for Mobile Agents with A New Benchmark and Scheduling System	Yuan Guo et.al.	2506.08972	null
2025-06-10	ADAM: Autonomous Discovery and Annotation Model using LLMs for Context-Aware Annotations	Amirreza Rouhi et.al.	2506.08968	null
2025-06-10	Step-Audio-AQAA: a Fully End-to-End Expressive Large Audio Language Model	Ailin Huang et.al.	2506.08967	null
2025-06-09	GUI-Reflection: Empowering Multimodal GUI Models with Self-Reflection Behavior	Penghao Wu et.al.	2506.08012	null
2025-06-09	Play to Generalize: Learning to Reason Through Game Play	Yunfei Xie et.al.	2506.08011	null
2025-06-09	Vision Transformers Don't Need Trained Registers	Nick Jiang et.al.	2506.08010	null
2025-06-09	Hidden in plain sight: VLMs overlook their visual representations	Stephanie Fu et.al.	2506.08008	null
2025-06-09	Reinforcement Pre-Training	Qingxiu Dong et.al.	2506.08007	null
2025-06-09	Reparameterized LLM Training via Orthogonal Equivalence Transformation	Zeju Qiu et.al.	2506.08001	null
2025-06-09	Supporting Construction Worker Well-Being with a Multi-Agent Conversational AI System	Fan Yang et.al.	2506.07997	null
2025-06-09	HeuriGym: An Agentic Benchmark for LLM-Crafted Heuristics in Combinatorial Optimization	Hongzheng Chen et.al.	2506.07972	null
2025-06-09	CyberV: Cybernetics for Test-time Scaling in Video Understanding	Jiahao Meng et.al.	2506.07971	null
2025-06-09	SpaCE-10: A Comprehensive Benchmark for Multimodal Large Language Models in Compositional Spatial Intelligence	Ziyang Gong et.al.	2506.07966	null
2025-06-09	Reinforcing Multimodal Understanding and Generation with Dual Self-rewards	Jixiang Hong et.al.	2506.07963	null
2025-06-09	Correlated Errors in Large Language Models	Elliot Kim et.al.	2506.07962	null
2025-06-09	BridgeVLA: Input-Output Alignment for Efficient 3D Manipulation Learning with Vision-Language Models	Peiyan Li et.al.	2506.07961	null
2025-06-09	Language Models over Canonical Byte-Pair Encodings	Tim Vieira et.al.	2506.07956	null
2025-06-09	TokenBreak: Bypassing Text Classification Models Through Token Manipulation	Kasimir Schulz et.al.	2506.07948	null
2025-06-09	Statistical Hypothesis Testing for Auditing Robustness in Language Models	Paulius Rauba et.al.	2506.07947	null
2025-06-09	ProtocolLLM: RTL Benchmark for SystemVerilog Generation of Communication Protocols	Arnav Sheth et.al.	2506.07945	null
2025-06-09	Decoupling the Image Perception and Multimodal Reasoning for Reasoning Segmentation with Digital Twin Representations	Yizhen Li et.al.	2506.07943	null
2025-06-09	Adversarial Attack Classification and Robustness Testing for Large Language Models for Code	Yang Liu et.al.	2506.07942	null
2025-06-09	Gradients: When Markets Meet Fine-tuning -- A Distributed Approach to Model Optimisation	Christopher Subia-Waud et.al.	2506.07940	null
2025-06-06	TerraFM: A Scalable Foundation Model for Unified Multisensor Earth Observation	Muhammad Sohail Danish et.al.	2506.06281	null
2025-06-06	Eigenspectrum Analysis of Neural Networks without Aspect Ratio Bias	Yuanzhe Hu et.al.	2506.06280	null
2025-06-06	CoMemo: LVLMs Need Image Context with Image Memory	Shi Liu et.al.	2506.06279	null
2025-06-06	Movie Facts and Fibs (MF $^2$ ): A Benchmark for Long Movie Understanding	Emmanouil Zaranis et.al.	2506.06275	null
2025-06-06	AdvSumm: Adversarial Training for Bias Mitigation in Text Summarization	Mukur Gupta et.al.	2506.06273	null
2025-06-06	RecGPT: A Foundation Model for Sequential Recommendation	Yangqin Jiang et.al.	2506.06270	null
2025-06-09	Cartridges: Lightweight and general-purpose long context representations via self-study	Sabri Eyuboglu et.al.	2506.06266	null
2025-06-06	PersonaAgent: When Large Language Model Agents Meet Personalization at Test Time	Weizhi Zhang et.al.	2506.06254	null
2025-06-06	DesignBench: A Comprehensive Benchmark for MLLM-based Front-end Code Generation	Jingyu Xiao et.al.	2506.06251	null
2025-06-06	Visual Graph Arena: Evaluating Visual Conceptualization of Vision and Multimodal Large Language Models	Zahra Babaiee et.al.	2506.06242	null
2025-06-06	Bridging External and Parametric Knowledge: Mitigating Hallucination of LLMs with Shared-Private Semantic Synergy in Dual-Stream Knowledge	Yi Sui et.al.	2506.06240	null
2025-06-06	Explaining Matters: Leveraging Definitions and Semantic Expansion for Sexism Detection	Sahrish Khan et.al.	2506.06238	null
2025-06-06	Challenging Vision-Language Models with Surgical Data: A New Dataset and Broad Benchmarking Study	Leon Mayer et.al.	2506.06232	null
2025-06-06	CompilerGPT: Leveraging Large Language Models for Analyzing and Acting on Compiler Optimization Reports	Peter Pirkelbauer et.al.	2506.06227	null
2025-06-06	PROVSYN: Synthesizing Provenance Graphs for Data Augmentation in Intrusion Detection Systems	Yi Huang et.al.	2506.06226	null
2025-06-06	GenIR: Generative Visual Feedback for Mental Image Retrieval	Diji Yang et.al.	2506.06220	null
2025-06-06	STSBench: A Spatio-temporal Scenario Benchmark for Multi-modal Large Language Models in Autonomous Driving	Christian Fruhwirth-Reisinger et.al.	2506.06218	null
2025-06-06	Corrector Sampling in Language Models	Itai Gat et.al.	2506.06215	null
2025-06-06	Can Theoretical Physics Research Benefit from Language Agents?	Sirui Lu et.al.	2506.06214	null
2025-06-06	PuzzleWorld: A Benchmark for Multimodal, Open-Ended Reasoning in Puzzlehunts	Hengzhi Li et.al.	2506.06211	null
2025-06-05	Why LLM Safety Guardrails Collapse After Fine-tuning: A Similarity Analysis Between Alignment and Fine-tuning Datasets	Lei Hsiung et.al.	2506.05346	null
2025-06-05	SparseMM: Head Sparsity Emerges from Visual Concept Responses in MLLMs	Jiahui Wang et.al.	2506.05344	null
2025-06-05	Direct Numerical Layout Generation for 3D Indoor Scene Synthesis via Spatial Reasoning	Xingjian Ran et.al.	2506.05341	null
2025-06-05	Flattery, Fluff, and Fog: Diagnosing and Mitigating Idiosyncratic Biases in Preference Models	Anirudh Bharadwaj et.al.	2506.05339	null
2025-06-05	VideoMolmo: Spatio-Temporal Grounding Meets Pointing	Ghazi Shazan Ahmad et.al.	2506.05336	null
2025-06-05	Search Arena: Analyzing Search-Augmented LLMs	Mihran Miroyan et.al.	2506.05334	null
2025-06-05	Unleashing Hour-Scale Video Training for Long Video-Language Understanding	Jingyang Lin et.al.	2506.05332	null
2025-06-05	MINT-CoT: Enabling Interleaved Visual Tokens in Mathematical Chain-of-Thought Reasoning	Xinyan Chen et.al.	2506.05331	null
2025-06-05	LSM-2: Learning from Incomplete Wearable Sensor Data	Maxwell A. Xu et.al.	2506.05321	null
2025-06-06	Does Your 3D Encoder Really Work? When Pretrain-SFT from 2D VLMs Meets 3D VLMs	Haoyuan Li et.al.	2506.05318	null
2025-06-05	Improving Data Efficiency for LLM Reinforcement Fine-tuning Through Difficulty-targeted Online Data Selection and Rollout Replay	Yifan Sun et.al.	2506.05316	null
2025-06-05	Constrained Entropic Unlearning: A Primal-Dual Framework for Large Language Models	Taha Entesari et.al.	2506.05314	null
2025-06-05	ProRefine: Inference-time Prompt Refinement with Textual Feedback	Deepak Pandita et.al.	2506.05305	null
2025-06-05	Perceive Anything: Recognize, Explain, Caption, and Segment Anything in Images and Videos	Weifeng Lin et.al.	2506.05302	null
2025-06-05	Power Law Guided Dynamic Sifting for Efficient Attention	Nirav Koley et.al.	2506.05300	null
2025-06-05	Control Tax: The Price of Keeping AI in Check	Mikhail Terekhov et.al.	2506.05296	null
2025-06-05	Sample Complexity and Representation Ability of Test-time Scaling Paradigms	Baihe Huang et.al.	2506.05295	null
2025-06-05	EOC-Bench: Can MLLMs Identify, Recall, and Forecast Objects in an Egocentric World?	Yuqian Yuan et.al.	2506.05287	null
2025-06-05	Micro-Act: Mitigate Knowledge Conflict in Question Answering via Actionable Self-Reasoning	Nan Huo et.al.	2506.05278	null
2025-06-06	Teaming in the AI Era: AI-Augmented Frameworks for Forming, Simulating, and Optimizing Human Teams	Mohammed Almutairi et.al.	2506.05265	null
2025-06-04	OWMM-Agent: Open World Mobile Manipulation With Multi-modal Agentic Data Synthesis	Junting Chen et.al.	2506.04217	null
2025-06-04	Language-Image Alignment with Fixed Text Encoders	Jingfeng Yang et.al.	2506.04209	null
2025-06-04	Advancing Multimodal Reasoning: From Optimized Cold Start to Staged Reinforcement Learning	Shuang Chen et.al.	2506.04207	null
2025-06-04	EPiC: Towards Lossless Speedup for Reasoning Training through Edge-Preserving CoT Condensation	Jinghan Jia et.al.	2506.04205	null
2025-06-04	Cascadia: A Cascade Serving System for Large Language Models	Youhe Jiang et.al.	2506.04203	null
2025-06-04	TracLLM: A Generic Framework for Attributing Long Context LLMs	Yanting Wang et.al.	2506.04202	null
2025-06-04	R-Search: Empowering LLM Reasoning with Search via Multi-Reward Reinforcement Learning	Qingfei Zhao et.al.	2506.04185	null
2025-06-04	SuperWriter: Reflection-Driven Long-Form Generation with Large Language Models	Yuhao Wu et.al.	2506.04180	null
2025-06-04	SkipGPT: Dynamic Layer Pruning Reinvented with Token Awareness and Module Decoupling	Anhao Zhao et.al.	2506.04179	null
2025-06-04	Does Prompt Design Impact Quality of Data Imputation by LLMs?	Shreenidhi Srinivasan et.al.	2506.04172	null
2025-06-04	VISCA: Inferring Component Abstractions for Automated End-to-End Testing	Parsa Alian et.al.	2506.04161	null
2025-06-04	Image Editing As Programs with Diffusion Models	Yujia Hu et.al.	2506.04158	null
2025-06-04	A Dataset for Addressing Patient's Information Needs related to Clinical Course of Hospitalization	Sarvesh Soni et.al.	2506.04156	null
2025-06-04	Establishing Trustworthy LLM Evaluation via Shortcut Neuron Analysis	Kejian Zhu et.al.	2506.04142	null
2025-06-04	MMR-V: What's Left Unsaid? A Benchmark for Multimodal Deep Reasoning in Videos	Kejian Zhu et.al.	2506.04141	null
2025-06-04	TRiSM for Agentic AI: A Review of Trust, Risk, and Security Management in LLM-based Agentic Multi-Agent Systems	Shaina Raza et.al.	2506.04133	null
2025-06-04	Recent Advances in Medical Image Classification	Loan Dao et.al.	2506.04129	null
2025-06-04	Guided Speculative Inference for Efficient Test-Time Alignment of LLMs	Jonathan Geuter et.al.	2506.04118	null
2025-06-05	Rectified Sparse Attention	Yutao Sun et.al.	2506.04108	null
2025-06-04	TextAtari: 100K Frames Game Playing with Language Agents	Wenhao Li et.al.	2506.04098	link
2025-06-03	Causal Estimation of Tokenisation Bias	Pietro Lesci et.al.	2506.03149	null
2025-06-04	UniWorld: High-Resolution Semantic Encoders for Unified Visual Understanding and Generation	Bin Lin et.al.	2506.03147	null
2025-06-03	Entity-Augmented Neuroscience Knowledge Retrieval Using Ontology and Semantic Understanding Capability of LLM	Pralaypati Ta et.al.	2506.03145	null
2025-06-03	Not All Tokens Are Meant to Be Forgotten	Xiangyu Zhou et.al.	2506.03142	null
2025-06-03	SVGenius: Benchmarking LLMs in SVG Understanding, Editing and Generation	Siqi Chen et.al.	2506.03139	null
2025-06-03	OmniSpatial: Towards Comprehensive Spatial Reasoning Benchmark for Vision Language Models	Mengdi Jia et.al.	2506.03135	null
2025-06-03	Native-Resolution Image Synthesis	Zidong Wang et.al.	2506.03131	null
2025-06-03	AnimeShooter: A Multi-Shot Animation Dataset for Reference-Guided Video Generation	Lu Qiu et.al.	2506.03126	null
2025-06-03	AUTOCIRCUIT-RL: Reinforcement Learning-Driven LLM for Automated Circuit Topology Generation	Prashanth Vijayaraghavan et.al.	2506.03122	null
2025-06-03	Targeted Forgetting of Image Subgroups in CLIP Models	Zeliang Zhang et.al.	2506.03117	null
2025-06-04	Critique-GRPO: Advancing LLM Reasoning with Natural Language and Numerical Feedback	Xiaoying Zhang et.al.	2506.03106	null
2025-06-03	Beyond Text Compression: Evaluating Tokenizers Across Scales	Jonas F. Lotz et.al.	2506.03101	null
2025-06-03	TalkingMachines: Real-Time Audio-Driven FaceTime-Style Video via Autoregressive Diffusion Models	Chetwin Low et.al.	2506.03099	null
2025-06-03	EgoVLM: Policy Optimization for Egocentric Video Understanding	Ashwin Vinod et.al.	2506.03097	null
2025-06-03	DPO Learning with LLMs-Judge Signal for Computer Use Agents	Man Luo et.al.	2506.03095	null
2025-06-03	From Flat to Hierarchical: Extracting Sparse Representations with Matching Pursuit	Valérie Costa et.al.	2506.03093	null
2025-06-03	Literary Evidence Retrieval via Long-Context Language Models	Katherine Thai et.al.	2506.03090	null
2025-06-03	StreamBP: Memory-Efficient Exact Backpropagation for Long Sequence Training of LLMs	Qijun Luo et.al.	2506.03077	null
2025-06-03	LEG-SLAM: Real-Time Language-Enhanced Gaussian Splatting for SLAM	Roman Titkov et.al.	2506.03073	null
2025-06-03	EDITOR: Effective and Interpretable Prompt Inversion for Text-to-Image Diffusion Models	Mingzhe Li et.al.	2506.03067	null
2025-05-30	ReasonGen-R1: CoT for Autoregressive Image generation models through SFT and RL	Yu Zhang et.al.	2505.24875	null
2025-05-30	The Road to Generalizable Neuro-Symbolic Learning Should be Paved with Foundation Models	Adam Stein et.al.	2505.24874	null
2025-05-30	ProxyThinker: Test-Time Guidance through Small Visual Reasoners	Zilin Xiao et.al.	2505.24872	null
2025-05-30	MoDoMoDo: Multi-Domain Data Mixtures for Multimodal LLM Reinforcement Learning	Yiqing Liang et.al.	2505.24871	null
2025-05-30	GenSpace: Benchmarking Spatially-Aware Image Generation	Zehan Wang et.al.	2505.24870	null
2025-05-30	SiLVR: A Simple Language-based Video Reasoning Framework	Ce Zhang et.al.	2505.24869	link
2025-05-30	Time Blindness: Why Video-Language Models Can't See What Humans Can?	Ujjwal Upadhyay et.al.	2505.24867	null
2025-05-30	ProRL: Prolonged Reinforcement Learning Expands Reasoning Boundaries in Large Language Models	Mingjie Liu et.al.	2505.24864	link
2025-05-30	Beyond Multiple Choice: Evaluating Steering Vectors for Adaptive Free-Form Summarization	Joschka Braun et.al.	2505.24859	null
2025-05-30	Accelerated Sampling from Masked Diffusion Models via Entropy Bounded Unmasking	Heli Ben-Hamu et.al.	2505.24857	null
2025-05-30	MiCRo: Mixture Modeling and Context-aware Routing for Personalized Preference Learning	Jingyan Shen et.al.	2505.24846	null
2025-05-30	Chameleon: A Flexible Data-mixing Framework for Language Model Pretraining and Finetuning	Wanyun Xie et.al.	2505.24844	null
2025-05-30	Cascading Adversarial Bias from Injection to Distillation in Language Models	Harsh Chaudhari et.al.	2505.24842	null
2025-05-30	Vision LLMs Are Bad at Hierarchical Visual Understanding, and LLMs Are the Bottleneck	Yuwen Tan et.al.	2505.24840	null
2025-05-30	VideoCAD: A Large-Scale Video Dataset for Learning UI Interactions and 3D Reasoning from CAD Software	Brandon Man et.al.	2505.24838	null
2025-06-02	How much do language models memorize?	John X. Morris et.al.	2505.24832	null
2025-05-30	Improving Reliability and Explainability of Medical Question Answering through Atomic Fact Checking in Retrieval-Augmented LLMs	Juraj Vladika et.al.	2505.24830	null
2025-05-30	LegalEval-Q: A New Benchmark for The Quality Evaluation of LLM-Generated Legal Text	Li yunhan et.al.	2505.24826	null
2025-05-30	PhySense: Principle-Based Physics Reasoning Benchmarking for Large Language Models	Yinggan Xu et.al.	2505.24823	null
2025-05-30	Bi-Manual Joint Camera Calibration and Scene Representation	Haozhan Tang et.al.	2505.24819	null
2025-05-29	TextRegion: Text-Aligned Region Tokens from Frozen Image-Text Models	Yao Xiao et.al.	2505.23769	link
2025-05-29	Argus: Vision-Centric Reasoning with Grounded Chain-of-Thought	Yunze Man et.al.	2505.23766	null
2025-05-29	From Chat Logs to Collective Insights: Aggregative Question Answering	Wentao Zhang et.al.	2505.23765	null
2025-05-29	MMSI-Bench: A Benchmark for Multi-Image Spatial Intelligence	Sihan Yang et.al.	2505.23764	null
2025-05-29	ZeroGUI: Automating Online GUI Learning at Zero Human Cost	Chenyu Yang et.al.	2505.23762	link
2025-05-29	Differential Information: An Information-Theoretic Perspective on Preference Optimization	Yunjae Won et.al.	2505.23761	null
2025-05-29	Puzzled by Puzzles: When Vision-Language Models Can't Take a Hint	Heekyung Lee et.al.	2505.23759	link
2025-05-29	DeepTheorem: Advancing LLM Reasoning for Theorem Proving Through Natural Language and Reinforcement Learning	Ziyin Zhang et.al.	2505.23754	link
2025-05-29	ThinkGeo: Evaluating Tool-Augmented Agents for Remote Sensing Tasks	Akashah Shabbir et.al.	2505.23752	link
2025-05-29	Distortion of AI Alignment: Does Preference Optimization Optimize for Preferences?	Paul Gölz et.al.	2505.23749	null
2025-05-29	Spatial-MLLM: Boosting MLLM Capabilities in Visual-based Spatial Intelligence	Diankun Wu et.al.	2505.23747	null
2025-05-29	To Trust Or Not To Trust Your Vision-Language Model's Prediction	Hao Dong et.al.	2505.23745	link
2025-05-29	LayerPeeler: Autoregressive Peeling for Layer-wise Image Vectorization	Ronghuan Wu et.al.	2505.23740	null
2025-05-29	ATLAS: Learning to Optimally Memorize the Context at Test Time	Ali Behrouz et.al.	2505.23735	null
2025-05-29	Bounded Rationality for LLMs: Satisficing Alignment at Inference-Time	Mohamad Chehade et.al.	2505.23729	null
2025-05-29	PixelThink: Towards Efficient Chain-of-Pixel Reasoning	Song Wang et.al.	2505.23727	null
2025-05-29	FMG-Det: Foundation Model Guided Robust Object Detection	Darryl Hannan et.al.	2505.23726	null
2025-05-29	MuLoCo: Muon is a practical inner optimizer for DiLoCo	Benjamin Thérien et.al.	2505.23725	null
2025-05-29	SC-LoRA: Balancing Efficient Fine-tuning and Knowledge Preservation via Subspace-Constrained LoRA	Minrui Luo et.al.	2505.23724	null
2025-05-29	ML-Agent: Reinforcing LLM Agents for Autonomous Machine Learning Engineering	Zexi Liu et.al.	2505.23723	link
2025-05-28	Zero-Shot Vision Encoder Grafting via LLM Surrogates	Kaiyu Yue et.al.	2505.22664	link
2025-05-28	Training Free Stylized Abstraction	Aimon Rahman et.al.	2505.22663	null
2025-05-28	AutoL2S: Auto Long-Short Reasoning for Efficient Large Language Models	Feng Luo et.al.	2505.22662	null
2025-05-28	GuessArena: Guess Who I Am? A Self-Adaptive Framework for Evaluating LLMs in Domain-Specific Knowledge and Reasoning	Qingchen Yu et.al.	2505.22661	null
2025-05-29	Maximizing Confidence Alone Improves Reasoning	Mihir Prabhudesai et.al.	2505.22660	null
2025-05-28	3DLLM-Mem: Long-Term Spatial-Temporal Memory for Embodied 3D Large Language Model	Wenbo Hu et.al.	2505.22657	null
2025-05-28	Position: Uncertainty Quantification Needs Reassessment for Large-language Model Agents	Michael Kirchhof et.al.	2505.22655	null
2025-05-28	VScan: Rethinking Visual Token Reduction for Efficient Large Vision-Language Models	Ce Zhang et.al.	2505.22654	null
2025-05-28	The Climb Carves Wisdom Deeper Than the Summit: On the Noisy Rewards in Learning to Reason	Ang Lv et.al.	2505.22653	null
2025-05-28	Sherlock: Self-Correcting Reasoning in Vision-Language Models	Yi Ding et.al.	2505.22651	null
2025-05-28	Characterizing Bias: Benchmarking Large Language Models in Simplified versus Traditional Chinese	Hanjia Lyu et.al.	2505.22645	link
2025-05-28	Understanding (Un)Reliability of Steering Vectors in Language Models	Joschka Braun et.al.	2505.22637	null
2025-05-28	Learning Composable Chains-of-Thought	Fangcong Yin et.al.	2505.22635	null
2025-05-28	Spatial Knowledge Graph-Guided Multimodal Synthesis	Yida Xue et.al.	2505.22633	null
2025-05-28	Stochastic Chameleons: Irrelevant Context Hallucinations Reveal Class-Based (Mis)Generalization in LLMs	Ziling Cheng et.al.	2505.22630	null
2025-05-28	Principled Out-of-Distribution Generalization via Simplicity	Jiawei Ge et.al.	2505.22622	null
2025-05-28	Fast-dLLM: Training-free Acceleration of Diffusion LLM by Enabling KV Cache and Parallel Decoding	Chengyue Wu et.al.	2505.22618	null
2025-05-28	The Entropy Mechanism of Reinforcement Learning for Reasoning Language Models	Ganqu Cui et.al.	2505.22617	null
2025-05-28	RICO: Improving Accuracy and Completeness in Image Recaptioning via Visual Reconstruction	Yuchi Wang et.al.	2505.22613	null
2025-05-28	Effective and Efficient One-pass Compression of Speech Foundation Models Using Sparsity-aware Self-pinching Gates	Haoning Xu et.al.	2505.22608	null
2025-05-27	Silence is Not Consensus: Disrupting Agreement Bias in Multi-Agent LLMs via Catfish Agent for Clinical Decision Making	Yihan Wang et.al.	2505.21503	null
2025-05-27	ViewSpatial-Bench: Evaluating Multi-perspective Spatial Localization in Vision-Language Models	Dingming Li et.al.	2505.21500	null
2025-05-27	AdInject: Real-World Black-Box Attacks on Web Agents via Advertising Delivery	Haowei Wang et.al.	2505.21499	link
2025-05-27	Adversarial Attacks against Closed-Source MLLMs via Feature Optimal Alignment	Xiaojun Jia et.al.	2505.21494	link
2025-05-27	Reinforcing General Reasoning without Verifiers	Xiangxin Zhou et.al.	2505.21493	null
2025-05-27	Robust Hypothesis Generation: LLM-Automated Language Bias for Inductive Logic Programming	Yang Yang et.al.	2505.21486	null
2025-05-27	Are Language Models Consequentialist or Deontological Moral Reasoners?	Keenan Samway et.al.	2505.21479	null
2025-05-27	Policy Optimized Text-to-Image Pipeline Design	Uri Gadot et.al.	2505.21478	null
2025-05-27	Mitigating Hallucination in Large Vision-Language Models via Adaptive Attention Calibration	Mehrdad Fazli et.al.	2505.21472	null
2025-05-27	Scaling External Knowledge Input Beyond Context Windows of LLMs via Multi-Agent Collaboration	Zijun Liu et.al.	2505.21471	link
2025-05-27	Accelerating Diffusion Language Model Inference via Efficient KV Caching and Guided Diffusion	Zhanqiu Hu et.al.	2505.21467	null
2025-05-27	ID-Align: RoPE-Conscious Position Remapping for Dynamic High-Resolution Adaptation in Vision-Language Models	Bozhou Li et.al.	2505.21465	null
2025-05-27	LazyVLM: Neuro-Symbolic Approach to Video Analytics	Xiangru Jian et.al.	2505.21459	null
2025-05-27	Do LLMs Need to Think in One Language? Correlation between Latent Language and Task Performance	Shintaro Ozaki et.al.	2505.21458	null
2025-05-27	Active-O3: Empowering Multimodal Large Language Models with Active Perception via GRPO	Muzhi Zhu et.al.	2505.21457	null
2025-05-27	Can Large Reasoning Models Self-Train?	Sheikh Shafayat et.al.	2505.21444	null
2025-05-27	Towards Better Instruction Following Retrieval Models	Yuchen Zhuang et.al.	2505.21439	null
2025-05-27	Hume: Introducing System-2 Thinking in Visual-Language-Action Model	Haoming Song et.al.	2505.21432	null
2025-05-27	Policy Induction: Predicting Startup Success via Explainable Memory-Augmented In-Context Learning	Xianling Mu et.al.	2505.21427	null
2025-05-27	GUARD:Dual-Agent based Backdoor Defense on Chain-of-Thought in Neural Code Generation	Naizhu Jin et.al.	2505.21425	null
2025-05-26	On Path to Multimodal Historical Reasoning: HistBench and HistAgent	Jiahao Qiu et.al.	2505.20246	link
2025-05-26	KnowTrace: Bootstrapping Iterative Retrieval-Augmented Generation with Structured Knowledge Tracing	Rui Li et.al.	2505.20245	link
2025-05-26	It's High Time: A Survey of Temporal Information Retrieval and Question Answering	Bhawna Piryani et.al.	2505.20243	null
2025-05-26	RedAHD: Reduction-Based End-to-End Automatic Heuristic Design with Large Language Models	Nguyen Thach et.al.	2505.20242	null
2025-05-26	DreamPRM: Domain-Reweighted Process Reward Model for Multimodal Reasoning	Qi Cao et.al.	2505.20241	null
2025-05-26	Efficient Speech Translation through Model Compression and Knowledge Distillation	Yasmin Moslem et.al.	2505.20237	link
2025-05-26	Seeing is Believing, but How Much? A Comprehensive Analysis of Verbalized Calibration in Vision-Language Models	Weihao Xuan et.al.	2505.20236	null
2025-05-26	FLAME-MoE: A Transparent End-to-End Research Platform for Mixture-of-Experts Language Models	Hao Kang et.al.	2505.20225	link
2025-05-26	Chain-of-Thought for Autonomous Driving: A Comprehensive Survey and Future Prospects	Yixin Cui et.al.	2505.20223	link
2025-05-26	Fine-grained List-wise Alignment for Generative Medication Recommendation	Chenxiao Fan et.al.	2505.20218	link
2025-05-26	Parameter-Efficient Fine-Tuning with Column Space Projection	Junseo Hwang et.al.	2505.20211	null
2025-05-26	How to Improve the Robustness of Closed-Source Models on NLI	Joe Stacey et.al.	2505.20209	null
2025-05-26	Evaluating Large Language Models for Code Review	Umut Cihan et.al.	2505.20206	null
2025-05-26	PathBench: A comprehensive comparison benchmark for pathology foundation models towards precision oncology	Jiabo Ma et.al.	2505.20202	null
2025-05-26	Reasoning Is Not All You Need: Examining LLMs for Multi-Turn Mental Health Conversations	Mohit Chandra et.al.	2505.20201	null
2025-05-26	Adaptive Classifier-Free Guidance via Dynamic Low-Confidence Masking	Pengxiang Li et.al.	2505.20199	link
2025-05-26	Temporal Sampling for Forgotten Reasoning in LLMs	Yuetai Li et.al.	2505.20196	link
2025-05-26	FunReason: Enhancing Large Language Models' Function Calling via Self-Refinement Multiscale Loss and Automated Data Refinement	Bingguang Hao et.al.	2505.20192	link
2025-05-26	THiNK: Can Large Language Models Think-aloud?	Yongan Yu et.al.	2505.20184	link
2025-05-26	An Empirical Study on Strong-Weak Model Collaboration for Repo-level Code Generation	Shubham Gandhi et.al.	2505.20182	link
2025-05-26	Pangu Light: Weight Re-Initialization for Pruning and Accelerating LLMs	Hanting Chen et.al.	2505.20155	null
2025-05-26	UORA: Uniform Orthogonal Reinitialization Adaptation in Parameter-Efficient Fine-Tuning of Large Models	Xueyan Zhang et.al.	2505.20154	null
2025-05-26	MineAnyBuild: Benchmarking Spatial Planning for Open-world AI Agents	Ziming Wei et.al.	2505.20148	link
2025-05-26	FUDOKI: Discrete Flow-based Unified Understanding and Generation via Kinetic-Optimal Velocities	Jin Wang et.al.	2505.20147	null
2025-05-26	SeMe: Training-Free Language Model Merging via Semantic Alignment	Jian Gu et.al.	2505.20144	null
2025-05-26	StructEval: Benchmarking LLMs' Capabilities to Generate Structural Outputs	Jialin Yang et.al.	2505.20139	null
2025-05-26	AweDist: Attention-aware Embedding Distillation for New Input Token Embeddings	Konstantin Dobler et.al.	2505.20133	null
2025-05-26	Agentic 3D Scene Generation with Spatially Contextualized VLMs	Xinhang Liu et.al.	2505.20129	null
2025-05-26	Iterative Self-Incentivization Empowers Large Language Models as Agentic Searchers	Zhengliang Shi et.al.	2505.20128	link
2025-05-26	Agentic AI Process Observability: Discovering Behavioral Variability	Fabiana Fournier et.al.	2505.20127	null
2025-05-26	MEBench: A Novel Benchmark for Understanding Mutual Exclusivity Bias in Vision-Language Models	Anh Thai et.al.	2505.20122	null
2025-05-26	TrojanStego: Your Language Model Can Secretly Be A Steganographic Privacy Leaking Agent	Dominik Meier et.al.	2505.20118	link
2025-05-26	Named Entity Recognition in Historical Italian: The Case of Giacomo Leopardi's Zibaldone	Cristian Santini et.al.	2505.20113	null
2025-05-26	ResSVD: Residual Compensated SVD for Large Language Model Compression	Haolei Bai et.al.	2505.20112	null
2025-05-26	Language-Agnostic Suicidal Risk Detection Using Large Language Models	June-Woo Kim et.al.	2505.20109	null
2025-05-26	Adaptive Deep Reasoning: Triggering Deep Thinking When Needed	Yunhao Wang et.al.	2505.20101	null
2025-05-26	AdaTP: Attention-Debiased Token Pruning for Video Large Language Models	Fengyuan Sun et.al.	2505.20100	null
2025-05-26	Large Language Models Meet Knowledge Graphs for Question Answering: Synthesis and Opportunities	Chuangtao Ma et.al.	2505.20099	link
2025-05-26	S2LPP: Small-to-Large Prompt Prediction across LLMs	Liang Cheng et.al.	2505.20097	null
2025-05-26	Multi-Domain Explainability of Preferences	Nitay Calderon et.al.	2505.20088	null
2025-05-23	Fann or Flop: A Multigenre, Multiera Benchmark for Arabic Poetry Understanding in LLMs	Wafa Alghallabi et.al.	2505.18152	link
2025-05-23	First Finish Search: Efficient Test-Time Scaling in Large Language Models	Aradhye Agarwal et.al.	2505.18149	null
2025-05-23	Lost in the Haystack: Smaller Needles are More Difficult for LLMs to Find	Owen Bianchi et.al.	2505.18148	null
2025-05-23	Graph-Linguistic Fusion: Using Language Models for Wikidata Vandalism Detection	Mykola Trokhymovych et.al.	2505.18136	null
2025-05-23	Gaming Tool Preferences in Agentic LLMs	Kazem Faghih et.al.	2505.18135	link
2025-05-23	VideoGameBench: Can Vision-Language Models complete popular video games?	Alex L. Zhang et.al.	2505.18134	null
2025-05-23	One RL to See Them All: Visual Triple Unified Reinforcement Learning	Yan Ma et.al.	2505.18129	null
2025-05-23	Reward Model Overoptimisation in Iterated RLHF	Lorenz Wolf et.al.	2505.18126	null
2025-05-23	TabSTAR: A Foundation Tabular Model With Semantically Target-Aware Representations	Alan Arazi et.al.	2505.18125	null
2025-05-23	UNJOIN: Enhancing Multi-Table Text-to-SQL Generation via Schema Simplification	Poojah Ganesan et.al.	2505.18122	null
2025-05-23	ProgRM: Build Better GUI Agents with Progress Rewards	Danyang Zhang et.al.	2505.18121	null
2025-05-23	Bidirectional Knowledge Distillation for Enhancing Sequential Recommendation with Large Language Models	Jiongran Wu et.al.	2505.18120	null
2025-05-23	Watch and Listen: Understanding Audio-Visual-Speech Moments with Multimodal LLM	Zinuo Li et.al.	2505.18110	null
2025-05-23	ManuSearch: Democratizing Deep Search in Large Language Models with a Transparent and Open Multi-Agent Framework	Lisheng Huang et.al.	2505.18105	null
2025-05-23	How Can I Publish My LLM Benchmark Without Giving the True Answers Away?	Takashi Ishida et.al.	2505.18102	null
2025-05-23	Planning without Search: Refining Frontier LLMs with Offline Goal-Conditioned RL	Joey Hong et.al.	2505.18098	null
2025-05-23	QwenLong-CPRS: Towards $\infty$ -LLMs with Dynamic Context Optimization	Weizhou Shen et.al.	2505.18092	null
2025-05-23	Data Mixing Can Induce Phase Transitions in Knowledge Acquisition	Xinran Gu et.al.	2505.18091	null
2025-05-23	CXReasonBench: A Benchmark for Evaluating Structured Diagnostic Reasoning in Chest X-rays	Hyungyung Lee et.al.	2505.18087	null
2025-05-23	Deep Video Discovery: Agentic Search with Tool Use for Long-form Video Understanding	Xiaoyi Zhang et.al.	2505.18079	null
2025-05-22	CrossLMM: Decoupling Long Video Sequences from LMMs via Dual Cross-Attention Mechanisms	Shilin Yan et.al.	2505.17020	link
2025-05-22	Let Androids Dream of Electric Sheep: A Human-like Image Implication Understanding and Reasoning Framework	Chenhao Zhang et.al.	2505.17019	link
2025-05-22	SophiaVL-R1: Reinforcing MLLMs Reasoning with Thinking Reward	Kaixuan Fan et.al.	2505.17018	link
2025-05-22	Delving into RL for Image Generation with CoT: A Study on DPO vs. GRPO	Chengzhuo Tong et.al.	2505.17017	link
2025-05-22	Multi-SpatialMLLM: Multi-Frame Spatial Understanding with Multi-Modal Large Language Models	Runsen Xu et.al.	2505.17015	null
2025-05-22	SpatialScore: Towards Unified Evaluation for Multimodal Spatial Understanding	Haoning Wu et.al.	2505.17012	link
2025-05-22	R1-Searcher++: Incentivizing the Dynamic Knowledge Acquisition of LLMs via Reinforcement Learning	Huatong Song et.al.	2505.17005	link
2025-05-22	Do Large Language Models Excel in Complex Logical Reasoning with Formal Language?	Jin Jiang et.al.	2505.16998	link
2025-05-22	DecoupledESC: Enhancing Emotional Support Generation via Strategy-Response Decoupled Preference Optimization	Chao Zhang et.al.	2505.16995	null
2025-05-22	Dimple: Discrete Diffusion Multimodal Large Language Model with Parallel Decoding	Runpeng Yu et.al.	2505.16990	link
2025-05-22	T1: A Tool-Oriented Conversational Dataset for Multi-Turn Agentic Planning	Amartya Chakraborty et.al.	2505.16986	null
2025-05-22	UFT: Unifying Supervised and Reinforcement Fine-Tuning	Mingyang Liu et.al.	2505.16984	link
2025-05-22	LLM as Effective Streaming Processor: Bridging Streaming-Batch Mismatches with Group Position Encoding	Junlong Tong et.al.	2505.16983	link
2025-05-22	Beyond Correlation: Towards Causal Large Language Model Agents in Biomedicine	Adib Bazgir et.al.	2505.16982	null
2025-05-22	HyGenar: An LLM-Driven Hybrid Genetic Algorithm for Few-Shot Grammar Generation	Weizhi Tang et.al.	2505.16978	link
2025-05-22	SWE-Dev: Evaluating and Training Autonomous Feature-Driven Software Development	Yaxin Du et.al.	2505.16975	link
2025-05-22	CASS: Nvidia to AMD Transpilation with Data, Models, and Benchmark	Ahmed Heakl et.al.	2505.16968	link
2025-05-22	Invisible Prompts, Visible Threats: Malicious Font Injection in External Resources for Large Language Models	Junjie Xiong et.al.	2505.16957	null
2025-05-22	On Multilingual Encoder Language Model Compression for Low-Resource Languages	Daniil Gurgurov et.al.	2505.16956	null
2025-05-22	A Comprehensive Evaluation of Contemporary ML-Based Solvers for Combinatorial Optimization	Shengyu Feng et.al.	2505.16952	null
2025-05-21	InstructSAM: A Training-Free Framework for Instruction-Oriented Remote Sensing Object Recognition	Yijie Zheng et.al.	2505.15818	link
2025-05-21	On the creation of narrow AI: hierarchy and nonlocality of neural network skills	Eric J. Michaud et.al.	2505.15811	link
2025-05-21	MMaDA: Multimodal Large Diffusion Language Models	Ling Yang et.al.	2505.15809	link
2025-05-21	The Atlas of In-Context Learning: How Attention Heads Shape In-Context Retrieval Augmentation	Patrick Kahardipraja et.al.	2505.15807	link
2025-05-21	Keep Security! Benchmarking Security Policy Preservation in Large Language Model Contexts Against Indirect Attacks in Question Answering	Hwan Chang et.al.	2505.15805	link
2025-05-21	STAR-R1: Spacial TrAnsformation Reasoning by Reinforcing Multimodal LLMs	Zongzhao Li et.al.	2505.15804	null
2025-05-21	VerifyBench: Benchmarking Reference-based Reward Systems for Large Language Models	Yuchen Yan et.al.	2505.15801	null
2025-05-21	Model Merging is Secretly Certifiable: Non-Vacuous Generalisation Bounds for Low-Shot Learning	Taehoon Kim et.al.	2505.15798	null
2025-05-21	Reverse Engineering Human Preferences with Reinforcement Learning	Lisa Alazraki et.al.	2505.15795	null
2025-05-21	HCRMP: A LLM-Hinted Contextual Reinforcement Learning Framework for Autonomous Driving	Zhiwen Chen et.al.	2505.15793	null
2025-05-21	Large Language Models as Computable Approximations to Solomonoff Induction	Jun Wan et.al.	2505.15784	null
2025-05-21	dKV-Cache: The Cache for Diffusion Language Models	Xinyin Ma et.al.	2505.15781	link
2025-05-21	ConvSearch-R1: Enhancing Query Reformulation for Conversational Search with Reasoning via Reinforcement Learning	Changtai Zhu et.al.	2505.15776	link
2025-05-21	Beyond Hard and Soft: Hybrid Context Compression for Balancing Local and Global Information Retention	Huanxuan Liao et.al.	2505.15774	link
2025-05-21	MIKU-PAL: An Automated and Standardized Multi-Modal Method for Speech Paralinguistic and Affect Labeling	Cheng Yifan et.al.	2505.15772	null
2025-05-21	An Empirical Analysis of Vulnerability Detection Tools for Solidity Smart Contracts Using Line Level Manually Annotated Vulnerabilities	Francesco Salzano et.al.	2505.15756	null
2025-05-21	Exploring The Visual Feature Space for Multimodal Neural Decoding	Weihao Xia et.al.	2505.15755	null
2025-05-21	Scalable Defense against In-the-wild Jailbreaking Attacks with Safety Context Retrieval	Taiye Chen et.al.	2505.15753	null
2025-05-21	Multi-modal Integration Analysis of Alzheimer's Disease Using Large Language Models and Knowledge Graphs	Kanan Kiguchi et.al.	2505.15747	null
2025-05-21	Evolutionary Computation and Large Language Models: A Survey of Methods, Synergies, and Applications	Dikshit Chauhan et.al.	2505.15741	null
2025-05-20	Language Models use Lookbacks to Track Beliefs	Nikhil Prakash et.al.	2505.14685	null
2025-05-21	Mind the Gap: Bridging Thought Leap for Improved Chain-of-Thought Tuning	Haolei Xu et.al.	2505.14684	null
2025-05-20	Emerging Properties in Unified Multimodal Pretraining	Chaorui Deng et.al.	2505.14683	null
2025-05-20	UniGen: Enhanced Training & Test-Time Strategies for Unified Multimodal Understanding and Generation	Rui Tian et.al.	2505.14682	null
2025-05-20	UltraEdit: Training-, Subject-, and Memory-Free Lifelong Editing in Large Language Models	Xiaojie Gu et.al.	2505.14679	link
2025-05-20	Visionary-R1: Mitigating Shortcuts in Visual Reasoning with Reinforcement Learning	Jiaer Xia et.al.	2505.14677	null
2025-05-20	Reward Reasoning Model	Jiaxin Guo et.al.	2505.14674	null
2025-05-20	UniCTokens: Boosting Personalized Understanding and Generation via Unified Concept Tokens	Ruichuan An et.al.	2505.14671	null
2025-05-20	Quartet: Native FP4 Training Can Be Optimal for Large Language Models	Roberto L. Castro et.al.	2505.14669	link
2025-05-20	ContextAgent: Context-Aware Proactive LLM Agents with Open-World Sensory Perceptions	Bufang Yang et.al.	2505.14668	null
2025-05-20	Beyond Words: Multimodal LLM Knows When to Speak	Zikai Liao et.al.	2505.14654	null
2025-05-21	General-Reasoner: Advancing LLM Reasoning Across All Domains	Xueguang Ma et.al.	2505.14652	null
2025-05-20	Vox-Profile: A Speech Foundation Model Benchmark for Characterizing Diverse Speaker and Speech Traits	Tiantian Feng et.al.	2505.14648	link
2025-05-20	CAD-Coder: An Open-Source Vision-Language Model for Computer-Aided Design Code Generation	Anna C. Doris et.al.	2505.14646	link
2025-05-21	Think Only When You Need with Large Hybrid-Reasoning Models	Lingjie Jiang et.al.	2505.14631	null
2025-05-20	KERL: Knowledge-Enhanced Personalized Recipe Recommendation using Large Language Models	Fnu Mohbat et.al.	2505.14629	link
2025-05-20	Debating for Better Reasoning: An Unsupervised Multimodal Approach	Ashutosh Adhikari et.al.	2505.14627	null
2025-05-20	TinyV: Reducing False Negatives in Verification Improves RL for LLM Reasoning	Zhangchen Xu et.al.	2505.14625	link
2025-05-20	Enhancing Learned Knowledge in LoRA Adapters Through Efficient Contrastive Decoding on Ascend NPUs	Morgan Lindsay Heisler et.al.	2505.14620	null
2025-05-20	Linear Control of Test Awareness Reveals Differential Compliance in Reasoning Models	Sahar Abdelnabi et.al.	2505.14617	link
2025-05-19	CIE: Controlling Language Model Text Generations Using Continuous Signals	Vinay Samuel et.al.	2505.13448	link
2025-05-19	Trust, But Verify: A Self-Verification Approach to Reinforcement Learning with Verifiable Rewards	Xiaoyuan Liu et.al.	2505.13445	link
2025-05-19	ChartMuseum: Testing Visual Reasoning Capabilities of Large Vision-Language Models	Liyan Tang et.al.	2505.13444	null
2025-05-19	GraspMolmo: Generalizable Task-Oriented Grasping via Large-Scale Synthetic Data Generation	Abhay Deshpande et.al.	2505.13441	null
2025-05-19	Optimizing Anytime Reasoning via Budget Relative Policy Optimization	Penghui Qi et.al.	2505.13438	link
2025-05-19	SMOTExT: SMOTE meets Large Language Models	Mateusz Bystroński et.al.	2505.13434	null
2025-05-19	Fine-tuning Quantized Neural Networks with Zeroth-order Optimization	Sifeng Shang et.al.	2505.13430	null
2025-05-19	MM-PRM: Enhancing Multimodal Mathematical Reasoning with Scalable Step-Level Supervision	Lingxiao Du et.al.	2505.13427	link
2025-05-19	G1: Bootstrapping Perception and Reasoning Abilities of Vision-Language Model via Reinforcement Learning	Liang Chen et.al.	2505.13426	link
2025-05-19	Learnware of Language Models: Specialized Small Language Models Can Do Big	Zhi-Hao Tan et.al.	2505.13425	link
2025-05-19	Make Still Further Progress: Chain of Thoughts for Tabular Data Leaderboard	Si-Yang Liu et.al.	2505.13421	null
2025-05-19	FEALLM: Advancing Facial Emotion Analysis in Multimodal Large Language Models with Emotional Synergy and Reasoning	Zhuozhao Hu et.al.	2505.13419	link
2025-05-19	CoT-Kinetics: A Theoretical Modeling Assessing LRM Reasoning Process	Jinhe Bi et.al.	2505.13408	null
2025-05-19	AutoMathKG: The automated mathematical knowledge graph based on LLM and vector database	Rong Bian et.al.	2505.13406	null
2025-05-19	MR. Judge: Multimodal Reasoner as a Judge	Renjie Pi et.al.	2505.13403	null
2025-05-19	R3: Robust Rubric-Agnostic Reward Models	David Anugraha et.al.	2505.13388	link
2025-05-19	CompeteSMoE -- Statistically Guaranteed Mixture of Experts Training via Competition	Nam V. Nguyen et.al.	2505.13380	link
2025-05-19	Thinkless: LLM Learns When to Think	Gongfan Fang et.al.	2505.13379	link
2025-05-19	Seeing, Saying, Solving: An LLM-to-TL Framework for Cooperative Robots	Dan BW Choe et.al.	2505.13376	null
2025-05-19	Multi-Armed Bandits Meet Large Language Models	Djallel Bouneffouf et.al.	2505.13355	null
2025-05-16	Modeling cognitive processes of natural reading with transformer-based Language Models	Bruno Bianchi et.al.	2505.11485	null
2025-05-16	msf-CNN: Patch-based Multi-Stage Fusion with Convolutional Neural Networks for TinyML	Zhaolan Huang et.al.	2505.11483	link
2025-05-16	Improving Assembly Code Performance with Large Language Models via Reinforcement Learning	Anjiang Wei et.al.	2505.11480	null
2025-05-16	HelpSteer3-Preference: Open Human-Annotated Preference Data across Diverse Tasks and Languages	Zhilin Wang et.al.	2505.11475	null
2025-05-16	Disentangling Reasoning and Knowledge in Medical Large Language Models	Rahul Thapa et.al.	2505.11462	null
2025-05-16	ProxyPrompt: Securing System Prompts against Prompt Extraction Attacks	Zhixiong Zhuang et.al.	2505.11459	null
2025-05-16	LLMs unlock new paths to monetizing exploits	Nicholas Carlini et.al.	2505.11449	null
2025-05-16	Is Compression Really Linear with Code Intelligence?	Xianzhen Luo et.al.	2505.11441	null
2025-05-16	GODBench: A Benchmark for Multimodal Large Language Models in Video Comment Art	Chenkai Zhang et.al.	2505.11436	link
2025-05-16	MegaScale-MoE: Large-Scale Communication-Efficient Training of Mixture-of-Experts Models in Production	Chao Jin et.al.	2505.11432	null
2025-05-16	Mergenetic: a Simple Evolutionary Model Merging Library	Adrian Robert Minut et.al.	2505.11427	link
2025-05-16	When Thinking Fails: The Pitfalls of Reasoning for Instruction-Following in LLMs	Xiaomin Li et.al.	2505.11423	null
2025-05-16	Towards Cultural Bridge by Bahnaric-Vietnamese Translation Using Transfer Learning of Sequence-To-Sequence Pre-training Language Model	Phan Tran Minh Dat et.al.	2505.11421	null
2025-05-16	EdgeWisePersona: A Dataset for On-Device User Profiling from Natural Language Interactions	Patryk Bartkowiak et.al.	2505.11417	link
2025-05-16	MoE-CAP: Benchmarking Cost, Accuracy and Performance of Sparse Mixture-of-Experts Systems	Yinsicheng Jiang et.al.	2505.11415	null
2025-05-16	CARES: Comprehensive Evaluation of Safety and Adversarial Robustness in Medical LLMs	Sijia Chen et.al.	2505.11413	null
2025-05-16	Visual Planning: Let's Think Only with Images	Yi Xu et.al.	2505.11409	link
2025-05-16	Large Language Model Use Impact Locus of Control	Jenny Xiyu Fu et.al.	2505.11406	null
2025-05-16	EmotionHallucer: Evaluating Emotion Hallucinations in Multimodal Large Language Models	Bohao Xing et.al.	2505.11405	link
2025-05-16	Patho-R1: A Multimodal Reinforcement Learning-Based Pathology Expert Reasoner	Wenchuan Zhang et.al.	2505.11404	link
2025-05-15	End-to-End Vision Tokenizer Tuning	Wenxuan Wang et.al.	2505.10562	null
2025-05-15	Neural Thermodynamic Laws for Large Language Model Training	Ziming Liu et.al.	2505.10559	null
2025-05-15	Does Feasibility Matter? Understanding the Impact of Feasibility on Synthetic Training Data	Yiwen Liu et.al.	2505.10551	link
2025-05-15	Real-Time Out-of-Distribution Failure Prevention via Multi-Modal Reasoning	Milan Ganai et.al.	2505.10547	null
2025-05-15	Towards a Deeper Understanding of Reasoning Capabilities in Large Language Models	Annie Wong et.al.	2505.10543	link
2025-05-15	Exploring Implicit Visual Misunderstandings in Multimodal Large Language Models through Attention Analysis	Pengfei Wang et.al.	2505.10541	link
2025-05-15	S3C2 Summit 2024-09: Industry Secure Software Supply Chain Summit	Imranur Rahman et.al.	2505.10538	null
2025-05-15	WorldPM: Scaling Human Preference Modeling	Binghai Wang et.al.	2505.10527	link
2025-05-15	MASSV: Multimodal Adaptation and Self-Data Distillation for Speculative Decoding of Vision-Language Models	Mugilan Ganesan et.al.	2505.10526	null
2025-05-15	Multi-Token Prediction Needs Registers	Anastasios Gerontopoulos et.al.	2505.10518	link
2025-05-15	RouteNator: A Router-Based Multi-Modal Architecture for Generating Synthetic Training Data for Function Calling LLMs	Vibha Belavadi et.al.	2505.10495	null
2025-05-15	Can You Really Trust Code Copilots? Evaluating Large Language Models from a Code Security Perspective	Yutao Mou et.al.	2505.10494	link
2025-05-15	CL-RAG: Bridging the Gap in Retrieval-Augmented Generation with Curriculum Learning	Shaohan Wang et.al.	2505.10493	null
2025-05-15	Campus AI vs Commercial AI: A Late-Breaking Study on How LLM As-A-Service Customizations Shape Trust and Usage Patterns	Leon Hannig et.al.	2505.10490	null
2025-05-15	Parallel Scaling Law for Language Models	Mouxiang Chen et.al.	2505.10475	link
2025-05-15	Large Language Models for Cancer Communication: Evaluating Linguistic Quality, Safety, and Accessibility in Generative AI	Agnik Saha et.al.	2505.10472	null
2025-05-15	AI Agents vs. Agentic AI: A Conceptual Taxonomy, Applications and Challenge	Ranjan Sapkota et.al.	2505.10468	null
2025-05-15	Superposition Yields Robust Neural Scaling	Yizhou liu et.al.	2505.10465	link
2025-05-15	Vision language models have difficulty recognizing virtual objects	Tyler Tran et.al.	2505.10453	null
2025-05-15	Reinforcing the Diffusion Chain of Lateral Thought with Diffusion Language Models	Zemin Huang et.al.	2505.10446	null
2025-05-14	Language Agents Mirror Human Causal Reasoning Biases. How Can We Help Them Think Like Scientists?	Anthony GX-Chen et.al.	2505.09614	null
2025-05-14	Customizing a Large Language Model for VHDL Design of High-Performance Microprocessors	Nicolas Dupuis et.al.	2505.09610	null
2025-05-14	Adversarial Suffix Filtering: a Defense Pipeline for LLMs	David Khachaturov et.al.	2505.09602	null
2025-05-14	How Hungry is AI? Benchmarking Energy, Water, and Carbon Footprint of LLM Inference	Nidhal Jegham et.al.	2505.09598	null
2025-05-14	WorldView-Bench: A Benchmark for Evaluating Global Cultural Perspectives in Large Language Models	Abdullah Mushtaq et.al.	2505.09595	null
2025-05-14	Variational Visual Question Answering	Tobias Jan Wieczorek et.al.	2505.09591	null
2025-05-15	Beyond Likes: How Normative Feedback Complements Engagement Signals on Social Media	Yuchen Wu et.al.	2505.09583	null
2025-05-14	VTLA: Vision-Tactile-Language-Action Model with Preference Learning for Insertion Manipulation	Chaofan Zhang et.al.	2505.09577	null
2025-05-14	Ethics and Persuasion in Reinforcement Learning from Human Feedback: A Procedural Rhetorical Approach	Shannon Lodoen et.al.	2505.09576	null
2025-05-14	MIGRATION-BENCH: Repository-Level Code Migration Benchmark from Java 8	Linbo Liu et.al.	2505.09569	link
2025-05-14	Using Foundation Models as Pseudo-Label Generators for Pre-Clinical 4D Cardiac CT Segmentation	Anne-Marie Rickmann et.al.	2505.09564	null
2025-05-14	WavReward: Spoken Dialogue Models With Generalist Reward Evaluators	Shengpeng Ji et.al.	2505.09558	link
2025-05-14	PT-MoE: An Efficient Finetuning Framework for Integrating Mixture-of-Experts into Prompt Tuning	Zongqian Li et.al.	2505.09519	link
2025-05-15	Towards Fair In-Context Learning with Tabular Foundation Models	Patrik Kenfack et.al.	2505.09503	null
2025-05-14	Layered Unlearning for Adversarial Relearning	Timothy Qian et.al.	2505.09500	link
2025-05-14	Flash-VL 2B: Optimizing Vision-Language Model Performance for Ultra-Low Latency and High Throughput	Bo Zhang et.al.	2505.09498	null
2025-05-14	Card Sorting Simulator: Augmenting Design of Logical Information Architectures with Large Language Models	Eduard Kuric et.al.	2505.09478	null
2025-05-14	Deploying Foundation Model-Enabled Air and Ground Robots in the Field: Challenges and Opportunities	Zachary Ravichandran et.al.	2505.09477	null
2025-05-14	Evaluating GPT- and Reasoning-based Large Language Models on Physics Olympiad Problems: Surpassing Human Performance and Implications for Educational Assessment	Paul Tschisgale et.al.	2505.09438	null
2025-05-14	CXMArena: Unified Dataset to benchmark performance in realistic CXM Scenarios	Raghav Garg et.al.	2505.09436	link
2025-05-13	CodePDE: An Inference Framework for LLM-driven PDE Solver Generation	Shanda Li et.al.	2505.08783	link
2025-05-13	HealthBench: Evaluating Large Language Models Towards Improved Human Health	Rahul K. Arora et.al.	2505.08775	link
2025-05-14	Towards Autonomous UAV Visual Object Search in City Space: Benchmark and Agentic Methodology	Yatai Ji et.al.	2505.08765	null
2025-05-13	Aya Vision: Advancing the Frontier of Multilingual Multimodality	Saurabh Dash et.al.	2505.08751	null
2025-05-13	AC-Reason: Towards Theory-Guided Actual Causality Reasoning with Large Language Models	Yanxi Zhang et.al.	2505.08750	link
2025-05-13	DeepMath-Creative: A Benchmark for Evaluating Mathematical Creativity of Large Language Models	Xiaoyang Chen et.al.	2505.08744	link
2025-05-13	Probability Consistency in Large Language Models: Theoretical Foundations Meet Empirical Discrepancies	Xiaoliang Luo et.al.	2505.08739	link
2025-05-13	Towards Foundation Models for Experimental Readout Systems Combining Discrete and Continuous Data	James Giroux et.al.	2505.08736	link
2025-05-13	NurValues: Real-World Nursing Values Evaluation for Large Language Models in Clinical Context	Ben Yao et.al.	2505.08734	null
2025-05-13	Securing RAG: A Risk Assessment and Mitigation Framework	Lukas Ammann et.al.	2505.08728	null
2025-05-13	Memorization-Compression Cycles Improve Generalization	Fangyuan Yu et.al.	2505.08727	null
2025-05-13	Extending Large Vision-Language Model for Diverse Interactive Tasks in Autonomous Driving	Zongchuang Zhao et.al.	2505.08725	link
2025-05-13	TiMo: Spatiotemporal Foundation Model for Satellite Image Time Series	Xiaolei Qin et.al.	2505.08723	link
2025-05-13	PWC-MoE: Privacy-Aware Wireless Collaborative Mixture of Experts	Yang Su et.al.	2505.08719	null
2025-05-13	Controllable Image Colorization with Instance-aware Texts and Masks	Yanru An et.al.	2505.08705	null
2025-05-13	LLM-based Prompt Ensemble for Reliable Medical Entity Recognition from EHRs	K M Sajjadul Islam et.al.	2505.08704	null
2025-05-14	Granite-speech: open-source speech-aware LLMs with strong English ASR capabilities	George Saon et.al.	2505.08699	null
2025-05-13	VizCV: AI-assisted visualization of researchers' publications tracks	Vladimír Lazárik et.al.	2505.08691	null
2025-05-13	Adaptive Schema-aware Event Extraction with Retrieval-Augmented Generation	Sheng Liang et.al.	2505.08690	null
2025-05-13	A Social Robot with Inner Speech for Dietary Guidance	Valerio Belcamino et.al.	2505.08664	link
2025-05-12	DanceGRPO: Unleashing GRPO on Visual Generation	Zeyue Xue et.al.	2505.07818	null
2025-05-12	Imagine, Verify, Execute: Memory-Guided Agentic Exploration with Vision-Language Models	Seungjae Lee et.al.	2505.07815	null
2025-05-12	Learning Dynamics in Continual Pre-Training for Large Language Models	Xingjin Wang et.al.	2505.07796	null
2025-05-12	Domain Regeneration: How well do LLMs match syntactic properties of text domains?	Da Ju et.al.	2505.07784	null
2025-05-12	Relative Overfitting and Accept-Reject Framework	Yanxin Liu et.al.	2505.07783	null
2025-05-12	MLE-Dojo: Interactive Environments for Empowering LLM Agents in Machine Learning Engineering	Rushi Qiang et.al.	2505.07782	link
2025-05-12	Must Read: A Systematic Survey of Computational Persuasion	Nimet Beyza Bozdag et.al.	2505.07775	link
2025-05-12	Agent RL Scaling Law: Agent RL with Spontaneous Code Execution for Mathematical Problem Solving	Xinji Mai et.al.	2505.07773	link
2025-05-12	Enhancing Code Generation via Bidirectional Comment-Level Mutual Grounding	Yifeng Di et.al.	2505.07768	link
2025-05-12	BodyGPS: Anatomical Positioning System	Halid Ziya Yerebakan et.al.	2505.07744	null
2025-05-12	Assessing the Chemical Intelligence of Large Language Models	Nicholas T. Runcie et.al.	2505.07735	link
2025-05-12	Spoken Language Understanding on Unseen Tasks With In-Context Learning	Neeraj Agrawal et.al.	2505.07731	null
2025-05-12	Reproducibility, Replicability, and Insights into Visual Document Retrieval with Late Interaction	Jingfen Qiao et.al.	2505.07730	link
2025-05-12	Circuit Partitioning Using Large Language Models for Quantum Compilation and Simulations	Pranav Sinha et.al.	2505.07711	null
2025-05-12	Through the Looking Glass: Common Sense Consistency Evaluation of Weird Images	Elisei Rykov et.al.	2505.07704	null
2025-05-12	PatchTrack: A Comprehensive Analysis of ChatGPT's Influence on Pull Request Outcomes	Daniel Ogenrwot et.al.	2505.07700	null
2025-05-12	Beyond CLIP Generalization: Against Forward&Backward Forgetting Adapter for Continual Learning of Vision-Language Models	Songlin Dong et.al.	2505.07690	null
2025-05-12	S-GRPO: Early Exit via Reinforcement Learning in Reasoning Models	Muzhi Dai et.al.	2505.07686	null
2025-05-12	Multimodal Survival Modeling in the Age of Foundation Models	Steven Song et.al.	2505.07683	link
2025-05-12	SpecRouter: Adaptive Routing for Multi-Level Speculative Decoding in Large Language Models	Hang Wu et.al.	2505.07680	null
2025-05-09	Towards a Unified Representation Evaluation Framework Beyond Downstream Tasks	Christos Plachouras et.al.	2505.06224	link
2025-05-09	Adapting a Segmentation Foundation Model for Medical Image Classification	Pengfei Gu et.al.	2505.06217	null
2025-05-09	From Millions of Tweets to Actionable Insights: Leveraging LLMs for User Profiling	Vahid Rahimzadeh et.al.	2505.06184	null
2025-05-09	A Large Language Model-Enhanced Q-learning for Capacitated Vehicle Routing Problem with Time Windows	Linjiang Cao et.al.	2505.06178	null
2025-05-09	MonetGPT: Solving Puzzles Enhances MLLMs' Image Retouching Skills	Niladri Shekhar Dutt et.al.	2505.06176	null
2025-05-09	Turbo-ICL: In-Context Learning-Based Turbo Equalization	Zihang Song et.al.	2505.06175	null
2025-05-09	MM-Skin: Enhancing Dermatology Vision-Language Model with an Image-Text Dataset Derived from Textbooks	Wenqi Zeng et.al.	2505.06152	link
2025-05-09	A Scaling Law for Token Efficiency in LLM Fine-Tuning Under Fixed Compute Budgets	Ryan Lagasse et.al.	2505.06150	null
2025-05-09	Can Prompting LLMs Unlock Hate Speech Detection across Languages? A Zero-shot and Few-shot Study	Faeze Ghorbanpour et.al.	2505.06149	null
2025-05-09	LLMs Get Lost In Multi-Turn Conversation	Philippe Laban et.al.	2505.06120	link
2025-05-09	LLMs Outperform Experts on Challenging Biology Benchmarks	Lennart Justen et.al.	2505.06108	null
2025-05-09	Free and Fair Hardware: A Pathway to Copyright Infringement-Free Verilog Generation using LLMs	Sam Bush et.al.	2505.06096	null
2025-05-09	Assessing Tenstorrent's RISC-V MatMul Acceleration Capabilities	Hiari Pizzini Cavagna et.al.	2505.06085	null
2025-05-09	Healthy LLMs? Benchmarking LLM Knowledge of UK Government Public Health Information	Joshua Harris et.al.	2505.06046	null
2025-05-09	Short-circuiting Shortcuts: Mechanistic Investigation of Shortcuts in Text Classification	Leon Eshuijs et.al.	2505.06032	link
2025-05-09	Unilogit: Robust Machine Unlearning for LLMs Using Uniform-Target Self-Distillation	Stefan Vasilev et.al.	2505.06027	null
2025-05-09	ArtRAG: Retrieval-Augmented Generation with Structured Context for Visual Art Understanding	Shuai Wang et.al.	2505.06020	null
2025-05-09	Exploring the Feasibility of Multilingual Grammatical Error Correction with a Single LLM up to 9B parameters: A Comparative Study of 17 Models	Dawid Wisniewski et.al.	2505.06004	link
2025-05-09	Task-Adapter++: Task-specific Adaptation with Order-aware Alignment for Few-shot Action Recognition	Congqi Cao et.al.	2505.06002	link
2025-05-09	Towards Developmentally Plausible Rewards: Communicative Success as a Learning Signal for Interactive Language Models	Lennart Stöpler et.al.	2505.05970	null
2025-05-08	Mogao: An Omni Foundation Model for Interleaved Multi-Modal Generation	Chao Liao et.al.	2505.05472	null
2025-05-08	Generating Physically Stable and Buildable LEGO Designs from Text	Ava Pun et.al.	2505.05469	link
2025-05-08	StreamBridge: Turning Your Offline Video Large Language Model into a Proactive Streaming Assistant	Haibo Wang et.al.	2505.05467	null
2025-05-08	ComPO: Preference Alignment via Comparison Oracles	Peter Chen et.al.	2505.05465	null
2025-05-08	Bring Reason to Vision: Understanding Perception and Reasoning through Model Merging	Shiqi Chen et.al.	2505.05464	link
2025-05-08	UKElectionNarratives: A Dataset of Misleading Narratives Surrounding Recent UK General Elections	Fatima Haouari et.al.	2505.05459	null
2025-05-08	SITE: towards Spatial Intelligence Thorough Evaluation	Wenqi Wang et.al.	2505.05456	null
2025-05-08	Conversational Process Model Redesign	Nataliia Klievtsova et.al.	2505.05453	null
2025-05-08	clem:todd: A Framework for the Systematic Benchmarking of LLM-Based Task-Oriented Dialogue System Realisations	Chalamalasetti Kranti et.al.	2505.05445	null
2025-05-08	GesPrompt: Leveraging Co-Speech Gestures to Augment LLM-Based Interaction in Virtual Reality	Xiyun Hu et.al.	2505.05441	null
2025-05-09	EcoAgent: An Efficient Edge-Cloud Collaborative Multi-Agent Framework for Mobile Automation	Biao Yi et.al.	2505.05440	null
2025-05-08	Ultra-FineWeb: Efficient Data Filtering and Verification for High-Quality LLM Training Data	Yudong Wang et.al.	2505.05427	null
2025-05-09	LiTransProQA: an LLM-based Literary Translation evaluation metric with Professional Question Answering	Ran Zhang et.al.	2505.05423	link
2025-05-08	Crosslingual Reasoning through Test-Time Scaling	Zheng-Xin Yong et.al.	2505.05408	link
2025-05-08	Frame In, Frame Out: Do LLMs Generate More Biased News Headlines than Humans?	Valeria Pastorino et.al.	2505.05406	null
2025-05-08	A Pain Assessment Framework based on multimodal data and Deep Machine Learning methods	Stefanos Gkikas et.al.	2505.05396	null
2025-05-08	DSDrive: Distilling Large Language Model for Lightweight End-to-End Autonomous Driving with Unified Reasoning and Planning	Wenru Liu et.al.	2505.05360	null
2025-05-08	Hearing and Seeing Through CLIP: A Framework for Self-Supervised Sound Source Localization	Sooyoung Park et.al.	2505.05343	link
2025-05-08	FLAM: Frame-Wise Language-Audio Modeling	Yusong Wu et.al.	2505.05335	null
2025-05-08	ICon: In-Context Contribution for Automatic Data Selection	Yixin Yang et.al.	2505.05327	null
2025-05-07	EchoInk-R1: Exploring Audio-Visual Reasoning in Multimodal LLMs via Reinforcement Learning	Zhenghao Xing et.al.	2505.04623	link
2025-05-07	On Path to Multimodal Generalist: General-Level and General-Bench	Hao Fei et.al.	2505.04620	null
2025-05-07	OmniGIRL: A Multilingual and Multimodal Benchmark for GitHub Issue Resolution	Lianghong Guo et.al.	2505.04606	link
2025-05-07	OpenVision: A Fully-Open, Cost-Effective Family of Advanced Vision Encoders for Multimodal Learning	Xianhang Li et.al.	2505.04601	null
2025-05-08	MonoCoP: Chain-of-Prediction for Monocular 3D Object Detection	Zhihao Zhang et.al.	2505.04594	null
2025-05-07	ZeroSearch: Incentivize the Search Capability of LLMs without Searching	Hao Sun et.al.	2505.04588	link
2025-05-07	SlideItRight: Using AI to Find Relevant Slides and Provide Feedback for Open-Ended Questions	Chloe Qianhui Zhao et.al.	2505.04584	link
2025-05-07	Fight Fire with Fire: Defending Against Malicious RL Fine-Tuning via Reward Neutralization	Wenjun Cao et.al.	2505.04578	null
2025-05-07	Communication-Efficient Federated Fine-Tuning of Language Models via Dynamic Update Schedules	Michail Theologitis et.al.	2505.04535	link
2025-05-07	Overcoming Data Scarcity in Generative Language Modelling for Low-Resource Languages: A Systematic Review	Josh McGiff et.al.	2505.04531	null
2025-05-07	Comparative Analysis of Carbon Footprint in Manual vs. LLM-Assisted Code Development	Kuen Sum Cheung et.al.	2505.04521	null
2025-05-07	Pangu Ultra MoE: How to Train Your Big MoE on Ascend NPUs	Yehui Tang et.al.	2505.04519	null
2025-05-07	"I Can See Forever!": Evaluating Real-time VideoLLMs for Assisting Individuals with Visual Impairments	Ziyi Zhang et.al.	2505.04488	null
2025-05-07	CAD-Llama: Leveraging Large Language Models for Computer-Aided Design Parametric 3D Model Generation	Jiahao Li et.al.	2505.04481	null
2025-05-07	TrajEvo: Designing Trajectory Prediction Heuristics via LLM-driven Evolution	Zhikai Zhao et.al.	2505.04480	link
2025-05-07	Miipher-2: A Universal Speech Restoration Model for Million-Hour Scale Data Restoration	Shigeki Karita et.al.	2505.04457	link
2025-05-07	M2Rec: Multi-scale Mamba for Efficient Sequential Recommendation	Qianru Zhang et.al.	2505.04445	null
2025-05-07	Towards Effectively Leveraging Execution Traces for Program Repair with Code LLMs	Mirazul Haque et.al.	2505.04441	null
2025-05-07	OBLIVIATE: Robust and Practical Machine Unlearning for Large Language Models	Xiaoyu Xu et.al.	2505.04416	null
2025-05-07	DeCLIP: Decoupled Learning for Open-Vocabulary Dense Perception	Junjie Wang et.al.	2505.04410	link
2025-05-06	VITA-Audio: Fast Interleaved Cross-Modal Token Generation for Efficient Large Speech-Language Model	Zuwei Long et.al.	2505.03739	link
2025-05-06	Decentralized Nonconvex Optimization under Heavy-Tailed Noise: Normalization and Optimal Convergence	Shuhua Yu et.al.	2505.03736	null
2025-05-06	Meta-Optimization and Program Search using Language Models for Task and Motion Planning	Denis Shcherba et.al.	2505.03725	null
2025-05-06	Fill the Gap: Quantifying and Reducing the Modality Gap in Image-Text Representation Learning	François Role et.al.	2505.03703	null
2025-05-06	Fairness of Automatic Speech Recognition in Cleft Lip and Palate Speech	Susmita Bhattacharjee et.al.	2505.03697	null
2025-05-06	Graph Drawing for LLMs: An Empirical Evaluation	Walter Didimo et.al.	2505.03678	null
2025-05-06	Distribution-Conditional Generation: From Class Distribution to Creative Generation	Fu Feng et.al.	2505.03667	null
2025-05-06	Binding threshold units with artificial oscillatory neurons	Vladimir Fanaskov et.al.	2505.03648	link
2025-05-06	PhysLLM: Harnessing Large Language Models for Cross-Modal Remote Physiological Sensing	Yiping Xie et.al.	2505.03621	null
2025-05-06	Learning Unknown Spoof Prompts for Generalized Face Anti-Spoofing Using Only Real Face Images	Fangling Jiang et.al.	2505.03611	null
2025-05-06	Learning Knowledge-based Prompts for Robust 3D Mask Presentation Attack Detection	Fangling Jiang et.al.	2505.03610	null
2025-05-06	DyGEnc: Encoding a Sequence of Textual Scene Graphs to Reason and Answer Questions in Dynamic Scenes	Sergey Linok et.al.	2505.03581	link
2025-05-06	LlamaFirewall: An open source guardrail system for building secure AI agents	Sahana Chennabasappa et.al.	2505.03574	null
2025-05-06	Say It Another Way: A Framework for User-Grounded Paraphrasing	Cléa Chataigner et.al.	2505.03563	null
2025-05-06	A Comprehensive Survey of Large AI Models for Future Communications: Foundations, Applications and Challenges	Feibo Jiang et.al.	2505.03556	link
2025-05-06	A Hashgraph-Inspired Consensus Mechanism for Reliable Multi-Model Reasoning	Kolawole E. Ogunsina et.al.	2505.03553	null
2025-05-06	STORY2GAME: Generating (Almost) Everything in an Interactive Fiction Game	Eric Zhou et.al.	2505.03547	null
2025-05-06	Faster MoE LLM Inference for Extremely Large Models	Haoqi Yang et.al.	2505.03531	null
2025-05-06	Ruled by the Representation Space: On the University's Embrace of Large Language Models	Katia Schwerzmann et.al.	2505.03513	null
2025-05-06	BadLingual: A Novel Lingual-Backdoor Attack against Large Language Models	Zihan Wang et.al.	2505.03501	null
2025-05-05	Scenethesis: A Language and Vision Agentic Framework for 3D Scene Generation	Lu Ling et.al.	2505.02836	null
2025-05-05	R1-Reward: Training Multimodal Reward Model Through Stable Reinforcement Learning	Yi-Fan Zhang et.al.	2505.02835	link
2025-05-05	No Other Representation Component Is Needed: Diffusion Transformers Can Provide Representation Guidance by Themselves	Dengyang Jiang et.al.	2505.02831	link
2025-05-05	LISAT: Language-Instructed Segmentation Assistant for Satellite Imagery	Jerome Quenum et.al.	2505.02829	null
2025-05-05	ReplaceMe: Network Simplification via Layer Pruning and Linear Transformations	Dmitriy Shopkhoev et.al.	2505.02819	link
2025-05-05	Knowing You Don't Know: Learning When to Continue Search in Multi-round RAG through Self-Practicing	Diji Yang et.al.	2505.02811	link
2025-05-05	Towards Quantifying the Hessian Structure of Neural Networks	Zhaorui Dong et.al.	2505.02809	link
2025-05-05	Generating HomeAssistant Automations Using an LLM-based Chatbot	Mathyas Giudici et.al.	2505.02802	null
2025-05-05	HSplitLoRA: A Heterogeneous Split Parameter-Efficient Fine-Tuning Framework for Large Language Models	Zheng Lin et.al.	2505.02795	null
2025-05-05	Giving Simulated Cells a Voice: Evolving Prompt-to-Intervention Models for Cellular Control	Nam H. Le et.al.	2505.02766	null
2025-05-05	Bye-bye, Bluebook? Automating Legal Procedure with Large Language Models	Matthew Dahl et.al.	2505.02763	null
2025-05-05	Using Knowledge Graphs to harvest datasets for efficient CLIP model training	Simon Ging et.al.	2505.02746	link
2025-05-06	Knowledge Graphs for Enhancing Large Language Models in Entity Disambiguation	Gerard Pons et.al.	2505.02737	null
2025-05-05	FormalMATH: Benchmarking Formal Mathematical Reasoning of Large Language Models	Zhouliang Yu et.al.	2505.02735	link
2025-05-05	Enhancing LLMs' Clinical Reasoning with Real-World Data from a Nationwide Sepsis Registry	Junu Kim et.al.	2505.02722	link
2025-05-05	Less is More: Efficient Weight Farcasting with 1-Layer Neural Network	Xiao Shou et.al.	2505.02714	null
2025-05-05	Technical Report: Evaluating Goal Drift in Language Model Agents	Rauno Arike et.al.	2505.02709	null
2025-05-05	Voila: Voice-Language Foundation Models for Real-Time Autonomous Interaction and Voice Role-Play	Yemin Shi et.al.	2505.02707	link
2025-05-05	AI Standardized Patient Improves Human Conversations in Advanced Cancer Care	Kurtis Haut et.al.	2505.02694	link
2025-05-05	Predicting Movie Hits Before They Happen with LLMs	Shaghayegh Agah et.al.	2505.02693	null
2025-05-02	How Effective are Large Time Series Models in Hydrology? A Study on Water Level Forecasting in Everglades	Rahuul Rangaraj et.al.	2505.01415	null
2025-05-02	Dynamic Robot Tool Use with Vision Language Models	Noah Trupin et.al.	2505.01399	null
2025-05-02	FreeInsert: Disentangled Text-Guided Object Insertion in 3D Gaussian Scene without Spatial Priors	Chenxi Li et.al.	2505.01322	null
2025-05-02	Helping Big Language Models Protect Themselves: An Enhanced Filtering and Summarization System	Sheikh Samit Muhaimin et.al.	2505.01315	null
2025-05-02	Enhancing SPARQL Query Rewriting for Complex Ontology Alignments	Anicet Lepetit Ondo et.al.	2505.01309	null
2025-05-02	Document Retrieval Augmented Fine-Tuning (DRAFT) for safety-critical software assessments	Regan Bolton et.al.	2505.01307	null
2025-05-02	FlowDubber: Movie Dubbing with LLM-based Semantic-aware Learning and Flow Matching based Voice Enhancing	Gaoxiang Cong et.al.	2505.01263	null
2025-05-02	Digital Pathway Curation (DPC): a comparative pipeline to assess the reproducibility, consensus and accuracy across Gemini, PubMed, and scientific reviewers in biomedical research	Flavio Lichtenstein et.al.	2505.01259	null
2025-05-02	Can Foundation Models Really Segment Tumors? A Benchmarking Odyssey in Lung CT Imaging	Elena Mulero Ayllón et.al.	2505.01239	null
2025-05-02	CaReAQA: A Cardiac and Respiratory Audio Question Answering Model for Open-Ended Diagnostic Reasoning	Tsai-Ning Wang et.al.	2505.01199	null
2025-05-02	Gender Bias in Explainability: Investigating Performance Disparity in Post-hoc Methods	Mahdi Dhaini et.al.	2505.01198	link
2025-05-05	TSTMotion: Training-free Scene-aware Text-to-motion Generation	Ziyan Guo et.al.	2505.01182	null
2025-05-02	LLM Security: Vulnerabilities, Attacks, Defenses, and Countermeasures	Francisco Aguilera-Martínez et.al.	2505.01177	null
2025-05-02	On the Limitations of Steering in Language Model Alignment	Chebrolu Niranjan et.al.	2505.01162	null
2025-05-02	Methodological Foundations for AI-Driven Survey Question Generation	Ted K. Mburu et.al.	2505.01150	null
2025-05-02	Retrieval-Augmented Generation in Biomedicine: A Survey of Technologies, Datasets, and Clinical Applications	Jiawei He et.al.	2505.01146	null
2025-05-02	MateICL: Mitigating Attention Dispersion in Large-Scale In-Context Learning	Murtadha Ahmed et.al.	2505.01110	null
2025-05-02	Self-Supervision Enhances Instance-based Multiple Instance Learning Methods in Digital Pathology: A Benchmark Study	Ali Mammadov et.al.	2505.01109	link
2025-05-02	Nesterov Method for Asynchronous Pipeline Parallel Optimization	Thalaiyasingam Ajanthan et.al.	2505.01099	link
2025-05-02	Evaluating Vision Language Model Adaptations for Radiology Report Generation in Low-Resource Languages	Marco Salmè et.al.	2505.01096	null
2025-05-01	T2I-R1: Reinforcing Image Generation with Collaborative Semantic-level and Token-level CoT	Dongzhi Jiang et.al.	2505.00703	link
2025-05-01	Robotic Visual Instruction	Yanbang Li et.al.	2505.00693	null
2025-05-01	Visual Test-time Scaling for GUI Agent Grounding	Tiange Luo et.al.	2505.00684	link
2025-05-01	Steering Large Language Models with Register Analysis for Arbitrary Style Transfer	Xinchen Yang et.al.	2505.00679	null
2025-05-01	Rethinking Memory in AI: Taxonomy, Operations, Topics, and Future Directions	Yiming Du et.al.	2505.00675	link
2025-05-01	DeepCritic: Deliberate Critique with Large Language Models	Wenkai Yang et.al.	2505.00662	link
2025-05-01	On the generalization of language models from in-context learning and finetuning: a controlled study	Andrew K. Lampinen et.al.	2505.00661	null
2025-05-01	Large Language Models Understanding: an Inherent Ambiguity Barrier	Daniel N. Nissani et.al.	2505.00654	null
2025-05-01	Open-Source LLM-Driven Federated Transformer for Predictive IoV Management	Yazan Otoum et.al.	2505.00651	null
2025-05-01	Investigating Task Arithmetic for Zero-Shot Information Retrieval	Marco Braga et.al.	2505.00649	link
2025-05-01	Brain Foundation Models with Hypergraph Dynamic Adapter for Brain Disease Analysis	Zhongying Deng et.al.	2505.00627	null
2025-05-01	The Illusion of Role Separation: Hidden Shortcuts in LLM Role Learning (and How to Fix Them)	Zihao Wang et.al.	2505.00626	null
2025-05-01	FineScope : Precision Pruning for Domain-Specialized Large Language Models Using SAE-Guided Self-Data Cultivation	Chaitali Bhattacharyya et.al.	2505.00624	null
2025-05-01	Pixel3DMM: Versatile Screen-Space Priors for Single-Image 3D Face Reconstruction	Simon Giebenhain et.al.	2505.00615	null
2025-05-01	Combining LLMs with Logic-Based Framework to Explain MCTS	Ziyan An et.al.	2505.00610	null
2025-05-01	Can LLMs Help Improve Analogical Reasoning For Strategic Decisions? Experimental Evidence from Humans and GPT-4	Phanish Puranam et.al.	2505.00603	null
2025-05-02	Fast and Low-Cost Genomic Foundation Models via Outlier Removal	Haozheng Luo et.al.	2505.00598	link
2025-05-01	Block Circulant Adapter for Large Language Models	Xinyu Ding et.al.	2505.00582	null
2025-05-01	Parameter-Efficient Fine-Tuning with Circulant and Diagonal Vectors	Xinyu Ding et.al.	2505.00580	null
2025-05-01	FreqKV: Frequency Domain Key-Value Compression for Efficient Context Window Extension	Jushi Kai et.al.	2505.00570	null
2025-04-30	TRUST: An LLM-Based Dialogue System for Trauma Understanding and Structured Assessments	Sichang Tu et.al.	2504.21851	null
2025-04-30	COMPACT: COMPositional Atomic-to-Complex Visual Capability Tuning	Xindi Wu et.al.	2504.21850	null
2025-04-30	Early Exit and Multi Stage Knowledge Distillation in VLMs for Video Summarization	Anas Anwarul Haq Khan et.al.	2504.21831	null
2025-04-30	Why Compress What You Can Generate? When GPT-4o Generation Ushers in Image Compression Fields	Yixin Gao et.al.	2504.21814	null
2025-04-30	A simple and effective approach for body part recognition on CT scans based on projection estimation	Franko Hrzic et.al.	2504.21810	null
2025-04-30	An Empirical Study on the Effectiveness of Large Language Models for Binary Code Understanding	Xiuwei Shang et.al.	2504.21803	null
2025-04-30	DeepSeek-Prover-V2: Advancing Formal Mathematical Reasoning via Reinforcement Learning for Subgoal Decomposition	Z. Z. Ren et.al.	2504.21801	link
2025-04-30	SWE-smith: Scaling Data for Software Engineering Agents	John Yang et.al.	2504.21798	null
2025-04-30	MAC-Tuning: LLM Multi-Compositional Problem Reasoning with Enhanced Knowledge Boundary Awareness	Junsheng Huang et.al.	2504.21773	null
2025-04-30	LASHED: LLMs And Static Hardware Analysis for Early Detection of RTL Bugs	Baleegh Ahmad et.al.	2504.21770	null
2025-04-30	LLM-based Interactive Imitation Learning for Robotic Manipulation	Jonas Werner et.al.	2504.21769	link
2025-04-30	Investigating Literary Motifs in Ancient and Medieval Novels with Large Language Models	Emelie Hallenberg et.al.	2504.21742	null
2025-04-30	TheraQuest: A Gamified, LLM-Powered Simulation for Massage Therapy Training	Shengqian Wang et.al.	2504.21735	null
2025-04-30	XBreaking: Explainable Artificial Intelligence for Jailbreaking LLMs	Marco Arazzi et.al.	2504.21700	null
2025-04-30	Visual Text Processing: A Comprehensive Review and Unified Evaluation	Yan Shu et.al.	2504.21682	link
2025-04-30	Hoist with His Own Petard: Inducing Guardrails to Facilitate Denial-of-Service Attacks on Retrieval-Augmented Generation of LLMs	Pan Suo et.al.	2504.21680	null
2025-04-30	Traceback of Poisoning Attacks to Retrieval-Augmented Generation	Baolei Zhang et.al.	2504.21668	null
2025-04-30	From Precision to Perception: User-Centred Evaluation of Keyword Extraction Algorithms for Internet-Scale Contextual Advertising	Jingwen Cai et.al.	2504.21667	null
2025-04-30	AdaR1: From Long-CoT to Hybrid-CoT via Bi-Level Adaptive Reasoning Optimization	Haotian Luo et.al.	2504.21659	link
2025-04-30	Sadeed: Advancing Arabic Diacritization Through Small Language Model	Zeina Aldallal et.al.	2504.21635	null
2025-04-29	Toward Efficient Exploration by Large Language Model Agents	Dilip Arumugam et.al.	2504.20997	null
2025-04-29	X-Fusion: Introducing New Modality to Frozen Large Language Models	Sicheng Mo et.al.	2504.20996	null
2025-04-29	ACE: A Security Architecture for LLM-Integrated App Systems	Evan Li et.al.	2504.20984	null
2025-04-29	Real-Time Wayfinding Assistant for Blind and Low-Vision Users	Dabbrata Das et.al.	2504.20976	null
2025-04-29	SetKE: Knowledge Editing for Knowledge Elements Overlap	Yifan Wei et.al.	2504.20972	null
2025-04-29	OSVBench: Benchmarking LLMs on Specification Generation Tasks for Operating System Verification	Shangyu Li et.al.	2504.20964	link
2025-04-29	Information Gravity: A Field-Theoretic Model for Token Selection in Large Language Models	Maryna Vyshnyvetska et.al.	2504.20951	null
2025-04-29	Trace-of-Thought: Enhanced Arithmetic Problem Solving via Reasoning Distillation From Large to Small Language Models	Tyler McDonald et.al.	2504.20946	null
2025-04-29	ChestX-Reasoner: Advancing Radiology Foundation Models with Reasoning through Step-by-Step Verification	Ziqing Fan et.al.	2504.20930	link
2025-04-29	An Empirical Study on the Capability of LLMs in Decomposing Bug Reports	Zhiyuan Chen et.al.	2504.20911	null
2025-04-29	Classifier-to-Bias: Toward Unsupervised Automatic Bias Detection for Visual Classifiers	Quentin Guimard et.al.	2504.20902	null
2025-04-29	LELANTE: LEveraging LLM for Automated ANdroid TEsting	Shamit Fatin et.al.	2504.20896	null
2025-04-29	FedMVP: Federated Multi-modal Visual Prompt Tuning for Vision-Language Models	Mainak Singha et.al.	2504.20860	null
2025-04-29	X-Cross: Dynamic Integration of Language Models for Cross-Domain Sequential Recommendation	Guy Hadad et.al.	2504.20859	null
2025-04-29	JaccDiv: A Metric and Benchmark for Quantifying Diversity of Generated Marketing Text in the Music Industry	Anum Afzal et.al.	2504.20849	null
2025-04-29	Language Model for Large-Text Transmission in Noisy Quantum Communications	Yuqi Li et.al.	2504.20842	null
2025-04-29	Universal language model with the intervention of quantum theory	D. -F. Qin et.al.	2504.20839	null
2025-04-29	Enhancing Non-Core Language Instruction-Following in Speech LLMs via Semi-Implicit Cross-Lingual CoT Reasoning	Hongfei Xue et.al.	2504.20835	null
2025-04-29	Reinforcement Learning for LLM Reasoning Under Memory Constraints	Alan Lee et.al.	2504.20834	null
2025-04-30	Ascendra: Dynamic Request Prioritization for Efficient LLM Serving	Azam Ikram et.al.	2504.20828	null
2025-04-28	Learning Streaming Video Representation via Multitask Training	Yibin Yan et.al.	2504.20041	null
2025-04-28	AutoJudge: Judge Decoding Without Manual Annotation	Roman Garipov et.al.	2504.20039	null
2025-04-28	SpatialReasoner: Towards Explicit and Generalizable 3D Spatial Reasoning	Wufei Ma et.al.	2504.20024	null
2025-04-28	Better To Ask in English? Evaluating Factual Accuracy of Multilingual LLMs in English and Low-Resource Languages	Pritika Rohera et.al.	2504.20022	null
2025-04-28	Modular Machine Learning: An Indispensable Path towards New-Generation Large Language Models	Xin Wang et.al.	2504.20020	null
2025-04-29	LLM-Generated Fake News Induces Truth Decay in News Ecosystem: A Case Study on Neural News Recommendation	Beizhe Hu et.al.	2504.20013	null
2025-04-28	Towards Automated Scoping of AI for Social Good Projects	Jacob Emmerson et.al.	2504.20010	null
2025-04-28	Knowledge Distillation of Domain-adapted LLMs for Question-Answering in Telecom	Rishika Sen et.al.	2504.20000	null
2025-04-28	HJRNO: Hamilton-Jacobi Reachability with Neural Operators	Yankai Li et.al.	2504.19989	null
2025-04-28	TD-EVAL: Revisiting Task-Oriented Dialogue Evaluation by Combining Turn-Level Precision with Dialogue-Level Comparisons	Emre Can Acikgoz et.al.	2504.19982	null
2025-04-28	Accurate and Diverse LLM Mathematical Reasoning via Automated PRM-Guided GFlowNets	Adam Younsi et.al.	2504.19981	null
2025-04-29	From Concept to Practice: an Automated LLM-aided UVM Machine for RTL Verification	Junhao Ye et.al.	2504.19959	null
2025-04-28	Enhancing Surgical Documentation through Multimodal Visual-Temporal Transformers and Generative AI	Hugo Georgenthum et.al.	2504.19918	null
2025-04-28	Can AI Agents Design and Implement Drug Discovery Pipelines?	Khachik Smbatyan et.al.	2504.19912	null
2025-04-28	GenCLS++: Pushing the Boundaries of Generative Classification in LLMs Through Comprehensive SFT and RL Studies Across Diverse Datasets	Mingqian He et.al.	2504.19898	null
2025-04-28	CineVerse: Consistent Keyframe Synthesis for Cinematic Scene Composition	Quynh Phung et.al.	2504.19894	null
2025-04-28	semi-PD: Towards Efficient LLM Serving via Phase-Wise Disaggregated Computation and Unified Storage	Ke Hong et.al.	2504.19867	null
2025-04-28	CoherenDream: Boosting Holistic Text Coherence in 3D Generation via Multimodal Large Language Models Feedback	Chenhan Jiang et.al.	2504.19860	null
2025-04-28	Efficient Domain-adaptive Continual Pretraining for the Process Industry in the German Language	Anastasia Zhukova et.al.	2504.19856	null
2025-04-29	The Automation Advantage in AI Red Teaming	Rob Mulla et.al.	2504.19855	null
2025-04-25	Generalization Capability for Imitation Learning	Yixiao Wang et.al.	2504.18538	null
2025-04-25	TRACE Back from the Future: A Probabilistic Reasoning Approach to Controllable Language Generation	Gwen Yidou Weng et.al.	2504.18535	null
2025-04-25	Eval3D: Interpretable and Fine-grained Evaluation for 3D Generation	Shivam Duggal et.al.	2504.18509	null
2025-04-25	Investigating Co-Constructive Behavior of Large Language Models in Explanation Dialogues	Leandra Fichtel et.al.	2504.18483	null
2025-04-25	Generative Induction of Dialogue Task Schemas with Streaming Refinement and Simulated Interactions	James D. Finch et.al.	2504.18474	null
2025-04-25	Fast-Slow Thinking for Large Vision-Language Model Reasoning	Wenyi Xiao et.al.	2504.18458	null
2025-04-25	Pseudo-Asynchronous Local SGD: Robust and Efficient Data-Parallel Training	Hiroki Naganuma et.al.	2504.18454	null
2025-04-25	Reason Like a Radiologist: Chain-of-Thought and Reinforcement Learning for Verifiable Report Generation	Peiyuan Jing et.al.	2504.18453	null
2025-04-25	Kimi-Audio Technical Report	KimiTeam et.al.	2504.18425	link
2025-04-25	LLMpatronous: Harnessing the Power of LLMs For Vulnerability Detection	Rajesh Yarra et.al.	2504.18423	null
2025-04-25	BitNet v2: Native 4-bit Activations with Hadamard Transformation for 1-bit LLMs	Hongyu Wang et.al.	2504.18415	null
2025-04-25	An Empirical Study of Evaluating Long-form Question Answering	Ning Xian et.al.	2504.18413	link
2025-04-25	Expressing stigma and inappropriate responses prevents LLMs from safely replacing mental health providers	Jared Moore et.al.	2504.18412	link
2025-04-25	HRScene: How Far Are VLMs from Effective High-Resolution Image Understanding?	Yusen Zhang et.al.	2504.18406	null
2025-04-25	Unsupervised Visual Chain-of-Thought Reasoning via Preference Optimization	Kesen Zhao et.al.	2504.18397	null
2025-04-25	Bridge the Domains: Large Language Models Enhanced Cross-domain Sequential Recommendation	Qidong Liu et.al.	2504.18383	null
2025-04-25	Pushing the boundary on Natural Language Inference	Pablo Miralles-González et.al.	2504.18376	null
2025-04-25	Auto-SLURP: A Benchmark Dataset for Evaluating Multi-Agent Frameworks in Smart Personal Assistant	Lei Shen et.al.	2504.18373	link
2025-04-25	ThreMoLIA: Threat Modeling of Large Language Model-Integrated Applications	Felix Viktor Jedrzejewski et.al.	2504.18369	null
2025-04-25	Testing Individual Fairness in Graph Neural Networks	Roya Nasiri et.al.	2504.18353	null
2025-04-24	Token-Shuffle: Towards High-Resolution Image Generation with Autoregressive Models	Xu Ma et.al.	2504.17789	null
2025-04-24	Replay to Remember: Retaining Domain Knowledge in Streaming Language Models	Sneh Pillai et.al.	2504.17780	null
2025-04-24	Conversational Assistants to support Heart Failure Patients: comparing a Neurosymbolic Architecture with ChatGPT	Anuja Tayal et.al.	2504.17753	null
2025-04-24	Towards Robust LLMs: an Adversarial Robustness Measurement Framework	Natan Levy et.al.	2504.17723	null
2025-04-24	Multilingual Performance Biases of Large Language Models in Education	Vansh Gupta et.al.	2504.17720	null
2025-04-24	PICO: Reconstructing 3D People In Contact with Objects	Alpár Cseke et.al.	2504.17695	null
2025-04-24	Ensemble Bayesian Inference: Leveraging Small Language Models to Achieve LLM-level Accuracy in Profile Matching Tasks	Haru-Tada Sato et.al.	2504.17685	null
2025-04-24	INSIGHT: Bridging the Student-Teacher Gap in Times of Large Language Models	Jarne Thys et.al.	2504.17677	null
2025-04-24	Energy Considerations of Large Language Model Inference and Efficiency Optimizations	Jared Fernandez et.al.	2504.17674	null
2025-04-24	Cross-region Model Training with Communication-Computation Overlapping and Delay Compensation	Ying Zhu et.al.	2504.17672	null
2025-04-25	Data-Driven Calibration of Prediction Sets in Large Vision-Language Models Based on Inductive Conformal Prediction	Yuanchang Ye et.al.	2504.17671	null
2025-04-24	Towards a HIPAA Compliant Agentic AI System in Healthcare	Subash Neupane et.al.	2504.17669	null
2025-04-24	Evaluating Grounded Reasoning by Code-Assisted Large Language Models for Mathematics	Zena Al-Khalili et.al.	2504.17665	null
2025-04-24	Effortless, Simulation-Efficient Bayesian Inference using Tabular Foundation Models	Julius Vetter et.al.	2504.17660	null
2025-04-24	Portability of Optimizations from SC to TSO	Akshay Gopalakrishnan et.al.	2504.17646	null
2025-04-24	L3: DIMM-PIM Integrated Architecture and Coordination for Scalable Long-Context LLM Inference	Qingyuan Liu et.al.	2504.17584	null
2025-04-25	DeepDistill: Enhancing LLM Reasoning Capabilities via Large-Scale Difficulty-Graded Data Training	Xiaoyu Tian et.al.	2504.17565	null
2025-04-24	When Does Metadata Conditioning (NOT) Work for Language Model Pre-Training? A Study with Context-Free Grammars	Rei Higuchi et.al.	2504.17562	null
2025-04-24	HalluLens: LLM Hallucination Benchmark	Yejin Bang et.al.	2504.17550	null
2025-04-24	A Comprehensive Survey of Knowledge-Based Vision Question Answering Systems: The Lifecycle of Knowledge in Visual Reasoning Task	Jiaqi Deng et.al.	2504.17547	null
2025-04-23	Generalized Neighborhood Attention: Multi-dimensional Sparse Attention at the Speed of Light	Ali Hassani et.al.	2504.16922	null
2025-04-23	IberBench: LLM Evaluation on Iberian Languages	José Ángel González et.al.	2504.16921	null
2025-04-23	Tracing Thought: Using Chain-of-Thought Reasoning to Identify the LLM Behind AI-Generated Text	Shifali Agrahari et.al.	2504.16913	null
2025-04-23	Do Large Language Models know who did what to whom?	Joseph M. Denning et.al.	2504.16884	null
2025-04-23	Enhancing Critical Thinking with AI: A Tailored Warning System for RAG Models	Xuyang Zhu et.al.	2504.16883	null
2025-04-23	Context-Enhanced Vulnerability Detection Based on Large Language Model	Yixin Yang et.al.	2504.16877	null
2025-04-24	Exploring How LLMs Capture and Represent Domain-Specific Knowledge	Mirian Hipolito Garcia et.al.	2504.16871	null
2025-04-23	Common Functional Decompositions Can Mis-attribute Differences in Outcomes Between Populations	Manuel Quintero et.al.	2504.16864	null
2025-04-23	Planning with Diffusion Models for Target-Oriented Dialogue Systems	Hanwen Du et.al.	2504.16858	null
2025-04-23	Emo Pillars: Knowledge Distillation to Support Fine-Grained Context-Aware and Context-Less Emotion Classification	Alexander Shvets et.al.	2504.16856	null
2025-04-23	Monte Carlo Planning with Large Language Model for Text-Based Game Agents	Zijing Shi et.al.	2504.16855	null
2025-04-23	Improving Significant Wave Height Prediction Using Chronos Models	Yilin Zhai et.al.	2504.16834	null
2025-04-23	LRASGen: LLM-based RESTful API Specification Generation	Sida Deng et.al.	2504.16833	null
2025-04-23	GreenMind: A Next-Generation Vietnamese Large Language Model for Structured and Logical Reasoning	Luu Quy Tung et.al.	2504.16832	null
2025-04-23	Decoupled Global-Local Alignment for Improving Compositional Understanding	Xiaoxing Hu et.al.	2504.16801	null
2025-04-23	MOOSComp: Improving Lightweight Long-Context Compressor via Mitigating Over-Smoothing and Incorporating Outlier Scores	Fengwei Zhou et.al.	2504.16786	null
2025-04-23	Graph2Nav: 3D Object-Relation Graph Generation to Robot Navigation	Tixiao Shan et.al.	2504.16782	null
2025-04-23	How Effective are Generative Large Language Models in Performing Requirements Classification?	Waad Alhoshan et.al.	2504.16768	null
2025-04-23	Lightweight Latent Verifiers for Efficient Meta-Generation Strategies	Bartosz Piotrowski et.al.	2504.16760	null
2025-04-23	HEMA : A Hippocampus-Inspired Extended Memory Architecture for Long-Context AI Conversations	Kwangseob Ahn et.al.	2504.16754	null
2025-04-22	TTRL: Test-Time Reinforcement Learning	Yuxin Zuo et.al.	2504.16084	link
2025-04-22	MMInference: Accelerating Pre-filling for Long-Context VLMs via Modality-Aware Permutation Sparse Attention	Yucheng Li et.al.	2504.16083	null
2025-04-22	MR. Video: "MapReduce" is the Principle for Long Video Understanding	Ziqi Pang et.al.	2504.16082	null
2025-04-22	From Reflection to Perfection: Scaling Inference-Time Optimization for Text-to-Image Diffusion Models via Reflection Tuning	Le Zhuo et.al.	2504.16080	null
2025-04-22	LLMs are Greedy Agents: Effects of RL Fine-tuning on Decision-Making Abilities	Thomas Schmied et.al.	2504.16078	null
2025-04-22	PHYBench: Holistic Evaluation of Physical Perception and Reasoning in Large Language Models	Shi Qiu et.al.	2504.16074	null
2025-04-22	Guiding VLM Agents with Process Rewards at Inference Time for GUI Navigation	Zhiyuan Hu et.al.	2504.16073	null
2025-04-22	Describe Anything: Detailed Localized Image and Video Captioning	Long Lian et.al.	2504.16072	null
2025-04-22	A Python Tool for Reconstructing Full News Text from GDELT	A. Fronzetti Colladon et.al.	2504.16063	link
2025-04-22	Vision language models are unreliable at trivial spatial cognition	Sangeet Khemlani et.al.	2504.16061	null
2025-04-22	Vision-Language Models Are Not Pragmatically Competent in Referring Expression Generation	Ziqiao Ma et.al.	2504.16060	link
2025-04-22	Automated Static Vulnerability Detection via a Holistic Neuro-symbolic Approach	Penghui Li et.al.	2504.16057	null
2025-04-22	Honey, I Shrunk the Language Model: Impact of Knowledge Distillation Methods on Performance and Explainability	Daniel Hendriks et.al.	2504.16056	null
2025-04-22	LongMamba: Enhancing Mamba's Long Context Capabilities via Training-Free Receptive Field Enlargement	Zhifan Ye et.al.	2504.16053	link
2025-04-22	Evaluating Vision Language Models (VLMs) for Radiology: A Comprehensive Analysis	Frank Li et.al.	2504.16047	null
2025-04-23	Certified Mitigation of Worst-Case LLM Copyright Infringement	Jingyu Zhang et.al.	2504.16046	null
2025-04-22	LLMs meet Federated Learning for Scalable and Secure IoT Management	Yazan Otoum et.al.	2504.16032	null
2025-04-22	LiveCC: Learning Video LLM with Streaming Speech Transcription at Scale	Joya Chen et.al.	2504.16030	null
2025-04-22	Benchmarking LLM for Code Smells Detection: OpenAI GPT-4.0 vs DeepSeek-V3	Ahmed R. Sadik et.al.	2504.16027	null
2025-04-22	Efficient Temporal Consistency in Diffusion-Based Video Editing with Adaptor Modules: A Theoretical Framework	Xinyuan Song et.al.	2504.16016	null
2025-04-21	Seeing from Another Perspective: Evaluating Multi-View Understanding in MLLMs	Chun-Hsiao Yeh et.al.	2504.15280	link
2025-04-21	VisuLogic: A Benchmark for Evaluating Visual Reasoning in Multi-modal Large Language Models	Weiye Xu et.al.	2504.15279	null
2025-04-21	Stop Summation: Min-Form Credit Assignment Is All Process Reward Model Needs for Reasoning	Jie Cheng et.al.	2504.15275	link
2025-04-21	Eagle 2.5: Boosting Long-Context Post-Training for Frontier Vision-Language Models	Guo Chen et.al.	2504.15271	null
2025-04-21	Roll the dice & look before you leap: Going beyond the creative limits of next-token prediction	Vaishnavh Nagarajan et.al.	2504.15266	link
2025-04-21	Interpretable Locomotion Prediction in Construction Using a Memory-Driven LLM Agent With Chain-of-Thought Reasoning	Ehsan Ahmadi et.al.	2504.15263	null
2025-04-21	Leveraging Language Models for Automated Patient Record Linkage	Mohammad Beheshti et.al.	2504.15261	null
2025-04-21	CRUST-Bench: A Comprehensive Benchmark for C-to-safe-Rust Transpilation	Anirudh Khatry et.al.	2504.15254	link
2025-04-21	Evaluating Judges as Evaluators: The JETTS Benchmark of LLM-as-Judges as Test-Time Scaling Evaluators	Yilun Zhou et.al.	2504.15253	link
2025-04-21	MR. Guard: Multilingual Reasoning Guardrail using Curriculum Learning	Yahan Yang et.al.	2504.15241	null
2025-04-21	Values in the Wild: Discovering and Analyzing Values in Real-World Language Model Interactions	Saffron Huang et.al.	2504.15236	null
2025-04-21	A Self-Improving Coding Agent	Maxime Robeyns et.al.	2504.15228	null
2025-04-21	EvalAgent: Discovering Implicit Evaluation Criteria from the Web	Manya Wadhwa et.al.	2504.15219	null
2025-04-21	Integrating Symbolic Execution into the Fine-Tuning of Code-Generating LLMs	Marina Sakharova et.al.	2504.15210	null
2025-04-21	Compute-Optimal LLMs Provably Generalize Better With Scale	Marc Finzi et.al.	2504.15208	null
2025-04-21	Support Evaluation for the TREC 2024 RAG Track: Comparing Human versus LLM Judges	Nandan Thakur et.al.	2504.15205	null
2025-04-22	Synergistic Weak-Strong Collaboration by Aligning Preferences	Yizhu Jiao et.al.	2504.15188	null
2025-04-21	DSPO: Direct Semantic Preference Optimization for Real-World Image Super-Resolution	Miaomiao Cai et.al.	2504.15176	null
2025-04-21	The Synthetic Imputation Approach: Generating Optimal Synthetic Texts For Underrepresented Categories In Supervised Classification Tasks	Joan C. Timoneda et.al.	2504.15160	null
2025-04-21	KGMEL: Knowledge Graph-Enhanced Multimodal Entity Linking	Juyeon Kim et.al.	2504.15135	link
2025-04-18	Generative AI Act II: Test Time Scaling Drives Cognition Engineering	Shijie Xia et.al.	2504.13828	link
2025-04-18	Feature Alignment and Representation Transfer in Knowledge Distillation for Large Language Models	Junjie Yang et.al.	2504.13825	null
2025-04-18	CheXWorld: Exploring Image World Modeling for Radiograph Representation Learning	Yang Yue et.al.	2504.13820	link
2025-04-18	Not All Rollouts are Useful: Down-Sampling Rollouts in LLM Reinforcement Learning	Yixuan Even Xu et.al.	2504.13818	null
2025-04-18	BadApex: Backdoor Attack Based on Adaptive Optimization Mechanism of Black-box Large Language Models	Zhengxian Wu et.al.	2504.13775	null
2025-04-18	DP2Unlearning: An Efficient and Guaranteed Unlearning Framework for LLMs	Tamim Al Mahmud et.al.	2504.13774	link
2025-04-18	Detecting Malicious Source Code in PyPI Packages with LLMs: Does RAG Come in Handy?	Motunrayo Ibiyo et.al.	2504.13769	null
2025-04-18	Decoding Vision Transformers: the Diffusion Steering Lens	Ryota Takatsuki et.al.	2504.13763	link
2025-04-18	Scaling sparse feature circuit finding for in-context learning	Dmitrii Kharlapenko et.al.	2504.13756	null
2025-04-18	Learning to Attribute with Attention	Benjamin Cohen-Wang et.al.	2504.13752	link
2025-04-18	Controlled Territory and Conflict Tracking (CONTACT): (Geo-)Mapping Occupied Territory from Open Source Intelligence	Paul K. Mandal et.al.	2504.13730	link
2025-04-18	OpenDeception: Benchmarking and Investigating AI Deceptive Behaviors via Open-ended Interaction Simulation	Yichen Wu et.al.	2504.13707	null
2025-04-18	Exploring Multimodal Prompt for Visualization Authoring with Large Language Models	Zhen Wen et.al.	2504.13700	null
2025-04-18	Analysing the Robustness of Vision-Language-Models to Common Corruptions	Muhammad Usama et.al.	2504.13690	null
2025-04-18	Intelligent Interaction Strategies for Context-Aware Cognitive Augmentation	Xiangrong et.al.	2504.13684	null
2025-04-18	Revisiting Uncertainty Quantification Evaluation in Language Models: Spurious Interactions with Response Length Bias Results	Andrea Santilli et.al.	2504.13677	null
2025-04-18	Large Language Models Will Change The Way Children Think About Technology And Impact Every Interaction Paradigm	Russell Beale et.al.	2504.13667	null
2025-04-18	Do Prompt Patterns Affect Code Quality? A First Empirical Assessment of ChatGPT-Generated Code	Antonio Della Porta et.al.	2504.13656	null
2025-04-18	EyecareGPT: Boosting Comprehensive Ophthalmology Understanding with Tailored Dataset, Benchmark and Model	Sijing Li et.al.	2504.13650	link
2025-04-18	Exploring the Potential for Large Language Models to Demonstrate Rational Probabilistic Beliefs	Gabriel Freedman et.al.	2504.13644	link
2025-04-17	Perception Encoder: The best visual embeddings are not at the output of the network	Daniel Bolya et.al.	2504.13181	null
2025-04-17	PerceptionLM: Open-Access Data and Models for Detailed Visual Understanding	Jang Hyun Cho et.al.	2504.13180	link
2025-04-17	It's All Connected: A Journey Through Test-Time Memorization, Attentional Bias, Retention, and Online Optimization	Ali Behrouz et.al.	2504.13173	null
2025-04-17	Sleep-time Compute: Beyond Inference Scaling at Test-time	Kevin Lin et.al.	2504.13171	link
2025-04-17	Generate, but Verify: Reducing Hallucination in Vision-Language Models with Retrospective Resampling	Tsung-Han Wu et.al.	2504.13169	link
2025-04-17	CLIMB: CLustering-based Iterative Data Mixture Bootstrapping for Language Model Pre-training	Shizhe Diao et.al.	2504.13161	null
2025-04-17	Digital Twin Generation from Visual Data: A Survey	Andrew Melnik et.al.	2504.13159	link
2025-04-17	MIB: A Mechanistic Interpretability Benchmark	Aaron Mueller et.al.	2504.13151	link
2025-04-17	Exploring Expert Failures Improves LLM Agent Tuning	Li-Cheng Lan et.al.	2504.13145	null
2025-04-17	Syntactic and Semantic Control of Large Language Models via Sequential Monte Carlo	João Loula et.al.	2504.13139	null
2025-04-17	Energy-Based Reward Models for Robust Language Model Alignment	Anamika Lochab et.al.	2504.13134	link
2025-04-17	LLMs Meet Finance: Fine-Tuning Foundation Models for the Open FinLLM Leaderboard	Varun Rao et.al.	2504.13125	null
2025-04-17	Low-hallucination Synthetic Captions for Large-Scale Vision-Language Model Pre-training	Xinsong Zhang et.al.	2504.13123	null
2025-04-17	VistaDPO: Video Hierarchical Spatial-Temporal Direct Preference Optimization for Large Video Models	Haojian Huang et.al.	2504.13122	link
2025-04-17	Probing and Inducing Combinational Creativity in Vision-Language Models	Yongqian Peng et.al.	2504.13120	null
2025-04-17	Object-Driven Narrative in AR: A Scenario-Metaphor Framework with VLM Integration	Yusi Sun et.al.	2504.13119	null
2025-04-17	Uncertainty-Aware Trajectory Prediction via Rule-Regularized Heteroscedastic Deep Classification	Kumar Manas et.al.	2504.13111	null
2025-04-17	EventVAD: Training-Free Event-Aware Video Anomaly Detection	Yihua Shao et.al.	2504.13092	null
2025-04-17	Retrieval-Augmented Generation with Conflicting Evidence	Han Wang et.al.	2504.13079	link
2025-04-18	SkyReels-V2: Infinite-length Film Generative Model	Guibin Chen et.al.	2504.13074	link
2025-04-16	BitNet b1.58 2B4T Technical Report	Shuming Ma et.al.	2504.12285	null
2025-04-16	HLS-Eval: A Benchmark and Framework for Evaluating LLMs on High-Level Synthesis Design Tasks	Stefan Abi-Karam et.al.	2504.12268	link
2025-04-16	FLIP Reasoning Challenge	Andreas Plesner et.al.	2504.12256	link
2025-04-16	AnomalyGen: An Automated Semantic Log Sequence Generation Framework with LLM for Anomaly Detection	Xinyu Li et.al.	2504.12250	null
2025-04-16	MOS: Towards Effective Smart Contract Vulnerability Detection through Mixture-of-Experts Tuning of Large Language Models	Hang Yuan et.al.	2504.12234	null
2025-04-16	Watermarking Needs Input Repetition Masking	David Khachaturov et.al.	2504.12229	null
2025-04-16	d1: Scaling Reasoning in Diffusion Large Language Models via Reinforcement Learning	Siyan Zhao et.al.	2504.12216	null
2025-04-16	What Do Large Language Models Know? Tacit Knowledge as a Potential Causal-Explanatory Structure	Céline Budding et.al.	2504.12187	null
2025-04-16	SALAD: Improving Robustness and Generalization through Contrastive Learning with Structure-Aware and LLM-Driven Augmented Data	Suyoung Bae et.al.	2504.12185	null
2025-04-16	Trusting CHATGPT: how minor tweaks in the prompts lead to major differences in sentiment classification	Jaime E. Cuellar et.al.	2504.12180	null
2025-04-16	Multilingual Contextualization of Large Language Models for Document-Level Machine Translation	Miguel Moura Ramos et.al.	2504.12140	null
2025-04-16	Efficient Contrastive Decoding with Probabilistic Hallucination Detection - Mitigating Hallucinations in Large Vision Language Models -	Laura Fieback et.al.	2504.12137	null
2025-04-16	Clarifying Ambiguities: on the Role of Ambiguity Types in Prompting Methods for Clarification Generation	Anfu Tang et.al.	2504.12113	null
2025-04-16	Entropy-Guided Watermarking for LLMs: A Test-Time Framework for Robust and Traceable Text Generation	Shizhan Cai et.al.	2504.12108	null
2025-04-16	Logits DeConfusion with CLIP for Few-Shot Learning	Shuo Li et.al.	2504.12104	link
2025-04-16	Gauging Overprecision in LLMs: An Empirical Study	Adil Bahaj et.al.	2504.12098	null
2025-04-16	Reasoning-Based AI for Startup Evaluation (R.A.I.S.E.): A Memory-Augmented, Multi-Step Decision Framework	Jack Preuveneers et.al.	2504.12090	null
2025-04-16	Self-alignment of Large Video Language Models with Refined Regularized Preference Optimization	Pritam Sarkar et.al.	2504.12083	null
2025-04-16	Selective Demonstration Retrieval for Improved Implicit Hate Speech Detection	Yumin Kim et.al.	2504.12082	null
2025-04-16	Subitizing-Inspired_Large_Language_Models_for_Floorplanning	Shao-Chien Lu et.al.	2504.12076	null
2025-04-16	Elucidating the Design Space of Multimodal Protein Language Models	Cheng-Yen Hsieh et.al.	2504.11454	null
2025-04-15	TextArena	Leon Guertler et.al.	2504.11442	link
2025-04-15	Masculine Defaults via Gendered Discourse in Podcasts and Large Language Models	Maria Teleki et.al.	2504.11431	link
2025-04-15	A Dual-Space Framework for General Knowledge Distillation of Large Language Models	Xue Zhang et.al.	2504.11426	null
2025-04-15	Reinforcing Compositional Retrieval: Retrieving Step-by-Step for Composing Informative Contexts	Quanyu Long et.al.	2504.11420	null
2025-04-15	Efficient Hybrid Language Model Compression through Group-Aware SSM Pruning	Ali Taghibakhshi et.al.	2504.11409	null
2025-04-15	DataDecide: How to Predict Best Pretraining Data with Small Experiments	Ian Magnusson et.al.	2504.11393	null
2025-04-15	RankAlign: A Ranking View of the Generator-Validator Gap in Large Language Models	Juan Diego Rodriguez et.al.	2504.11381	link
2025-04-15	Cancer-Myth: Evaluating AI Chatbot on Patient Questions with False Presuppositions	Wang Bill Zhu et.al.	2504.11373	link
2025-04-15	OpenTuringBench: An Open-Model-based Benchmark and Framework for Machine-Generated Text Detection and Attribution	Lucio La Cava et.al.	2504.11369	null
2025-04-15	From Gaze to Insight: Bridging Human Visual Attention and Vision Language Model Explanation for Weakly-Supervised Medical Image Segmentation	Jingkun Chen et.al.	2504.11368	null
2025-04-15	Teaching Large Language Models to Reason through Learning and Forgetting	Tianwei Ni et.al.	2504.11364	link
2025-04-15	Kimina-Prover Preview: Towards Large Formal Reasoning Models with Reinforcement Learning	Haiming Wang et.al.	2504.11354	link
2025-04-16	Seedream 3.0 Technical Report	Yu Gao et.al.	2504.11346	null
2025-04-15	A Minimalist Approach to LLM Reasoning: from Rejection Sampling to Reinforce	Wei Xiong et.al.	2504.11343	link
2025-04-15	REWARD CONSISTENCY: Improving Multi-Objective Alignment from a Data-Centric Perspective	Zhihao Xu et.al.	2504.11337	null
2025-04-15	Looking beyond the next token	Abitha Thankaraj et.al.	2504.11336	null
2025-04-15	Optimizing LLM Inference: Fluid-Guided Online Scheduling with Memory Constraints	Ruicheng Ao et.al.	2504.11320	link
2025-04-15	Learning to Be A Doctor: Searching for Effective Medical Agent Architectures	Yangyang Zhuang et.al.	2504.11301	null
2025-04-16	Automated Python Translation	Joshua Otten et.al.	2504.11290	null
2025-04-14	InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models	Jinguo Zhu et.al.	2504.10479	link
2025-04-14	Weight Ensembling Improves Reasoning in Language Models	Xingyu Dang et.al.	2504.10478	null
2025-04-14	MIEB: Massive Image Embedding Benchmark	Chenghao Xiao et.al.	2504.10471	link
2025-04-14	Pixel-SAIL: Single Transformer For Pixel-Grounded Understanding	Tao Zhang et.al.	2504.10465	link
2025-04-14	The Scalability of Simplicity: Empirical Analysis of Vision-Language Learning with a Single Transformer	Weixian Lei et.al.	2504.10462	link
2025-04-15	GUI-R1 : A Generalist R1-Style Vision-Language Action Model For GUI Agents	Xiaobo Xia et.al.	2504.10458	null
2025-04-14	M1: Towards Scalable Test-Time Compute with Mamba Reasoning Models	Junxiong Wang et.al.	2504.10449	link
2025-04-14	Multimodal Long Video Modeling Based on Temporal Dynamic Context	Haoran Hao et.al.	2504.10443	link
2025-04-14	LLM Can be a Dangerous Persuader: Empirical Study of Persuasion Safety in Large Language Models	Minqian Liu et.al.	2504.10430	null
2025-04-14	Foundation models for electronic health records: representation dynamics and transferability	Michael C. Burkhart et.al.	2504.10422	link
2025-04-14	Can We Edit LLMs for Long-Tail Biomedical Knowledge?	Xinhao Yi et.al.	2504.10421	link
2025-04-15	Unchecked and Overlooked: Addressing the Checkbox Blind Spot in Large Language Models with CheckboxQA	Michał Turski et.al.	2504.10419	link
2025-04-14	CliniChat: A Multi-Source Knowledge-Driven Framework for Clinical Interview Dialogue Reconstruction and Evaluation	Jing Chen et.al.	2504.10418	null
2025-04-14	LLM-SRBench: A New Benchmark for Scientific Equation Discovery with Large Language Models	Parshin Shojaee et.al.	2504.10415	link
2025-04-14	Performance of Large Language Models in Supporting Medical Diagnosis and Treatment	Diogo Sousa et.al.	2504.10405	null
2025-04-14	Satellite Federated Fine-Tuning for Foundation Models in Space Computing Power Networks	Yan zhu et.al.	2504.10403	null
2025-04-14	Can LLMs Assist Expert Elicitation for Probabilistic Causal Modeling?	Olha Shaposhnyk et.al.	2504.10397	null
2025-04-14	SymRTLO: Enhancing RTL Code Optimization with LLMs and Neuron-Inspired Symbolic Reasoning	Yiting Wang et.al.	2504.10369	null
2025-04-14	DICE: A Framework for Dimensional and Contextual Evaluation of Language Models	Aryan Shrivastava et.al.	2504.10359	null
2025-04-14	Pseudo-Autoregressive Neural Codec Language Models for Efficient Zero-Shot Text-to-Speech Synthesis	Yifan Yang et.al.	2504.10352	null
2025-04-11	Quantum Large Language Model Fine-Tuning	Sang Hyub Kim et.al.	2504.08732	null
2025-04-11	DocAgent: A Multi-Agent System for Automated Code Documentation Generation	Dayu Yang et.al.	2504.08725	link
2025-04-11	SWAN-GPT: An Efficient and Scalable Approach for Long-Context Language Modeling	Krishna C. Puvvada et.al.	2504.08719	null
2025-04-11	SWE-PolyBench: A multi-language benchmark for repository level evaluation of coding agents	Muhammad Shihab Rashid et.al.	2504.08703	link
2025-04-11	Large Language Models as Span Annotators	Zdeněk Kasner et.al.	2504.08697	null
2025-04-11	TP-RAG: Benchmarking Retrieval-Augmented Large Language Model Agents for Spatiotemporal-Aware Travel Planning	Hang Ni et.al.	2504.08694	null
2025-04-11	Fast-Slow-Thinking: Complex Task Solving with Large Language Models	Yiliu Sun et.al.	2504.08690	null
2025-04-11	Voice Interaction With Conversational AI Could Facilitate Thoughtful Reflection and Substantive Revision in Writing	Jiho Kim et.al.	2504.08687	null
2025-04-11	Seaweed-7B: Cost-Effective Training of Video Generation Foundation Model	Team Seawead et.al.	2504.08685	null
2025-04-11	Variability-Driven User-Story Generation using LLM and Triadic Concept Analysis	Alexandre Bazin et.al.	2504.08666	null
2025-04-11	Quality evaluation of Tabby coding assistant using real source code snippets	Marta Borek et.al.	2504.08650	link
2025-04-11	Do LLMs trust AI regulation? Emerging behaviour of game-theoretic LLM agents	Alessio Buscemi et.al.	2504.08640	null
2025-04-11	Latent Diffusion Autoencoders: Toward Efficient and Meaningful Unsupervised Representation Learning in Medical Imaging	Gabriele Lozupone et.al.	2504.08635	link
2025-04-11	MooseAgent: A LLM Based Multi-agent Framework for Automating Moose Simulation	Tao Zhang et.al.	2504.08621	link
2025-04-11	Analyzing 16,193 LLM Papers for Fun and Profits	Zhiqiu Xia et.al.	2504.08619	null
2025-04-11	Playpen: An Environment for Exploring Learning Through Conversational Interaction	Nicola Horst et.al.	2504.08590	link
2025-04-11	AstroLLaVA: towards the unification of astronomical data and natural language	Sharaf Zaman et.al.	2504.08583	null
2025-04-11	UoB-NLP at SemEval-2025 Task 11: Leveraging Adapters for Multilingual and Cross-Lingual Emotion Detection	Frances Laureano De Leon et.al.	2504.08543	null
2025-04-11	Embodied Image Captioning: Self-supervised Learning Agents for Spatially Coherent Image Descriptions	Tommaso Galliena et.al.	2504.08531	null
2025-04-11	On The Landscape of Spoken Language Models: A Comprehensive Survey	Siddhant Arora et.al.	2504.08528	null
2025-04-10	Cat, Rat, Meow: On the Alignment of Language Model and Human Term-Similarity Judgments	Lorenz Linhardt et.al.	2504.07965	null
2025-04-10	C3PO: Critical-Layer, Core-Expert, Collaborative Pathway Optimization for Test-Time Expert Re-Mixing	Zhongyang Li et.al.	2504.07964	link
2025-04-10	GLUS: Global-Local Reasoning Unified into A Single Large Language Model for Video Segmentation	Lang Lin et.al.	2504.07962	null
2025-04-10	Detect Anything 3D in the Wild	Hanxue Zhang et.al.	2504.07958	link
2025-04-10	MM-IFEngine: Towards Multimodal Instruction Following	Shengyuan Ding et.al.	2504.07957	link
2025-04-10	VCR-Bench: A Comprehensive Evaluation Framework for Video Chain-of-Thought Reasoning	Yukun Qi et.al.	2504.07956	null
2025-04-10	Dynamic Cheatsheet: Test-Time Learning with Adaptive Memory	Mirac Suzgun et.al.	2504.07952	link
2025-04-10	We Are All Creators: Generative AI, Collective Knowledge, and the Path Towards Human-AI Synergy	Jordi Linares-Pellicer et.al.	2504.07936	null
2025-04-10	Echo Chamber: RL Post-training Amplifies Behaviors Learned in Pretraining	Rosie Zhao et.al.	2504.07912	link
2025-04-10	Porting an LLM based Application from ChatGPT to an On-Premise Environment	Teemu Paloniemi et.al.	2504.07907	null
2025-04-10	Redefining Machine Translation on Social Network Services with Large Language Models	Hongcheng Guo et.al.	2504.07901	link
2025-04-10	How do Large Language Models Understand Relevance? A Mechanistic Interpretability Perspective	Qi Liu et.al.	2504.07898	link
2025-04-10	Fast Adaptation with Behavioral Foundation Models	Harshit Sikchi et.al.	2504.07896	null
2025-04-10	Benchmarking Adversarial Robustness to Bias Elicitation in Large Language Models: Scalable Automated Assessment with LLM-as-a-Judge	Riccardo Cantini et.al.	2504.07887	link
2025-04-11	An LLM-Driven Multi-Agent Debate System for Mendelian Diseases	Xinyang Zhou et.al.	2504.07881	null
2025-04-10	Token Level Routing Inference System for Edge Devices	Jianshu She et.al.	2504.07878	null
2025-04-10	SAMJAM: Zero-Shot Video Scene Graph Generation for Egocentric Kitchen Videos	Joshua Li et.al.	2504.07867	null
2025-04-11	Pangu Ultra: Pushing the Limits of Dense Large Language Models on Ascend NPUs	Yichun Yin et.al.	2504.07866	null
2025-04-10	Robust Hallucination Detection in LLMs via Adaptive Token Selection	Mengjia Niu et.al.	2504.07863	null
2025-04-10	2D-Curri-DPO: Two-Dimensional Curriculum Learning for Direct Preference Optimization	Mengyang Li et.al.	2504.07856	null
2025-04-09	Sculpting Subspaces: Constrained Full Fine-Tuning in LLMs for Continual Learning	Nikhil Shivakumar Nayak et.al.	2504.07097	link
2025-04-09	OLMoTrace: Tracing Language Model Outputs Back to Trillions of Training Tokens	Jiacheng Liu et.al.	2504.07096	null
2025-04-09	Are We Done with Object-Centric Learning?	Alexander Rubinstein et.al.	2504.07092	link
2025-04-09	KG-LLM-Bench: A Scalable Benchmark for Evaluating LLM Reasoning on Textualized Knowledge Graphs	Elan Markowitz et.al.	2504.07087	null
2025-04-09	A Sober Look at Progress in Language Model Reasoning: Pitfalls and Paths to Reproducibility	Andreas Hochlehnert et.al.	2504.07086	null
2025-04-09	Self-Steering Language Models	Gabriel Grand et.al.	2504.07081	null
2025-04-09	DeduCE: Deductive Consistency as a Framework to Evaluate LLM Reasoning	Atharva Pandey et.al.	2504.07080	null
2025-04-09	Kaleidoscope: In-language Exams for Massively Multilingual Vision Evaluation	Israfel Salazar et.al.	2504.07072	null
2025-04-09	A Survey on Personalized and Pluralistic Preference Alignment in Large Language Models	Zhouhang Xie et.al.	2504.07070	null
2025-04-09	HalluciNot: Hallucination Detection Through Context and Common Knowledge Verification	Bibek Paudel et.al.	2504.07069	null
2025-04-09	Teaching pathology foundation models to accurately predict gene expression with parameter efficient knowledge transfer	Shi Pan et.al.	2504.07061	null
2025-04-09	TASTE: Text-Aligned Speech Tokenization and Embedding for Spoken Language Modeling	Liang-Hsuan Tseng et.al.	2504.07053	link
2025-04-09	To Backtrack or Not to Backtrack: When Sequential Search Limits Model Reasoning	Tian Qin et.al.	2504.07052	null
2025-04-09	Evaluating Retrieval Augmented Generative Models for Document Queries in Transportation Safety	Chad Melton et.al.	2504.07022	null
2025-04-09	LLM-IFT: LLM-Powered Information Flow Tracking for Secure Hardware	Nowfel Mashnoor et.al.	2504.07015	null
2025-04-09	Towards LLMs Robustness to Changes in Prompt Format Styles	Lilian Ngweta et.al.	2504.06969	null
2025-04-09	Efficient Self-Supervised Learning for Earth Observation via Dynamic Dataset Curation	Thomas Kerdreux et.al.	2504.06962	null
2025-04-10	VideoChat-R1: Enhancing Spatio-Temporal Perception via Reinforcement Fine-Tuning	Xinhao Li et.al.	2504.06958	null
2025-04-09	Adaptive Computation Pruning for the Forgetting Transformer	Zhixuan Lin et.al.	2504.06949	null
2025-04-09	RuOpinionNE-2024: Extraction of Opinion Tuples from Russian News Texts	Natalia Loukachevitch et.al.	2504.06947	link
2025-04-08	GOLLuM: Gaussian Process Optimized LLMs -- Reframing LLM Finetuning through Bayesian Optimization	Bojana Ranković et.al.	2504.06265	link
2025-04-08	OmniSVG: A Unified Scalable Vector Graphics Generation Model	Yiying Yang et.al.	2504.06263	null
2025-04-09	Hogwild! Inference: Parallel LLM Generation via Concurrent Attention	Gleb Rodionov et.al.	2504.06261	null
2025-04-08	FEABench: Evaluating Language Models on Multiphysics Reasoning Ability	Nayantara Mudur et.al.	2504.06260	link
2025-04-08	Orb-v3: atomistic simulation at scale	Benjamin Rhodes et.al.	2504.06231	link
2025-04-08	LExT: Towards Evaluating Trustworthiness of Natural Language Explanations	Krithi Shailya et.al.	2504.06227	null
2025-04-08	Encoder-Decoder Gemma: Improving the Quality-Efficiency Trade-Off via Adaptation	Biao Zhang et.al.	2504.06225	null
2025-04-09	Earth-Adapter: Bridge the Geospatial Domain Gaps with Mixture of Frequency Adaptation	Xiaoxing Hu et.al.	2504.06220	link
2025-04-08	Can Performant LLMs Be Ethical? Quantifying the Impact of Web Crawling Opt-Outs	Dongyang Fan et.al.	2504.06219	null
2025-04-08	From 128K to 4M: Efficient Training of Ultra-Long Context Large Language Models	Chejian Xu et.al.	2504.06214	null
2025-04-08	TxGemma: Efficient and Agentic LLMs for Therapeutics	Eric Wang et.al.	2504.06196	null
2025-04-08	A Self-Supervised Framework for Space Object Behaviour Characterisation	Ian Groves et.al.	2504.06176	null
2025-04-08	Assessing how hyperparameters impact Large Language Models' sarcasm detection performance	Montgomery Gole et.al.	2504.06166	null
2025-04-09	Navigating the Rabbit Hole: Emergent Biases in LLM-Generated Attack Narratives Targeting Mental Health Groups	Rijul Magu et.al.	2504.06160	null
2025-04-08	A Large-Scale Analysis on Contextual Self-Supervised Video Representation Learning	Akash Kumar et.al.	2504.06153	null
2025-04-08	V-MAGE: A Game Evaluation Framework for Assessing Visual-Centric Capabilities in Multimodal Large Language Models	Xiangxi Zheng et.al.	2504.06148	link
2025-04-08	ARLO: A Tailorable Approach for Transforming Natural Language Software Requirements into Architecture using LLMs	Tooraj Helmi et.al.	2504.06143	null
2025-04-08	Adversarial Training of Reward Models	Alexander Bukharin et.al.	2504.06141	null
2025-04-08	A Multimedia Analytics Model for the Foundation Model Era	Marcel Worring et.al.	2504.06138	null
2025-04-08	QGen Studio: An Adaptive Question-Answer Generation, Training and Evaluation Platform	Movina Moses et.al.	2504.06136	null
2025-04-07	URECA: Unique Region Caption Anything	Sangbeom Lim et.al.	2504.05305	null
2025-04-07	InteractVLM: 3D Interaction Reasoning from 2D Foundational Models	Sai Kumar Dwivedi et.al.	2504.05303	link
2025-04-07	SmolVLM: Redefining small and efficient multimodal models	Andrés Marafioti et.al.	2504.05299	null
2025-04-07	Truthful or Fabricated? Using Causal Attribution to Mitigate Reward Hacking in Explanations	Pedro Ferreira et.al.	2504.05294	null
2025-04-07	The challenge of uncertainty quantification of large language models in medicine	Zahra Atf et.al.	2504.05278	null
2025-04-07	Enhancing LLM-Based Short Answer Grading with Retrieval-Augmented Generation	Yucheng Chu et.al.	2504.05276	null
2025-04-07	Do PhD-level LLMs Truly Grasp Elementary Addition? Probing Rule Learning vs. Memorization in Large Language Models	Yang Yan et.al.	2504.05262	null
2025-04-07	Learning to Reason Over Time: Timeline Self-Reflection for Improved Temporal Reasoning in Language Models	Adrián Bazaga et.al.	2504.05258	null
2025-04-07	Explaining Low Perception Model Competency with High-Competency Counterfactuals	Sara Pohland et.al.	2504.05254	null
2025-04-07	LLM-based Automated Grading with Human-in-the-Loop	Hang Li et.al.	2504.05239	null
2025-04-08	NoveltyBench: Evaluating Language Models for Humanlike Diversity	Yiming Zhang et.al.	2504.05228	null
2025-04-07	A Reality Check of Vision-Language Pre-training in Radiology: Have We Progressed Using Text?	Julio Silva-Rodríguez et.al.	2504.05227	null
2025-04-07	Vision-Language Model Predictive Control for Manipulation Planning and Trajectory Generation	Jiaming Chen et.al.	2504.05225	link
2025-04-08	Leveraging LLMs for Utility-Focused Annotation: Reducing Manual Effort for Retrieval and RAG	Hengran Zhang et.al.	2504.05220	null
2025-04-07	Unleashing the Power of LLMs in Dense Retrieval with Query Likelihood Modeling	Hengran Zhang et.al.	2504.05216	null
2025-04-07	Post-Training Language Models for Continual Relation Extraction	Sefika Efeoglu et.al.	2504.05214	null
2025-04-07	Quantum Program Linting with LLMs: Emerging Results from a Comparative Study	Seung Yeob Shin et.al.	2504.05204	null
2025-04-07	Training state-of-the-art pathology foundation models with orders of magnitude less data	Mikhail Karasikov et.al.	2504.05186	null
2025-04-07	Concise Reasoning via Reinforcement Learning	Mehdi Fatemi et.al.	2504.05185	link
2025-04-07	BRIDGES: Bridging Graph Modality and Large Language Models within EDA Tasks	Wei Li et.al.	2504.05180	null
2025-04-04	Shape My Moves: Text-Driven Shape-Aware Synthesis of Human Motions	Ting-Hsuan Liao et.al.	2504.03639	null
2025-04-04	Do Larger Language Models Imply Better Reasoning? A Pretraining Scaling Law for Reasoning	Xinyi Wang et.al.	2504.03635	null
2025-04-04	Align to Structure: Aligning Large Language Models with Structural Information	Zae Myung Kim et.al.	2504.03622	null
2025-04-04	VISTA-OCR: Towards generative and interactive end to end OCR models	Laziz Hamdi et.al.	2504.03621	null
2025-04-04	Multilingual Retrieval-Augmented Generation for Knowledge-Intensive Task	Leonardo Ranaldi et.al.	2504.03616	null
2025-04-04	AIR: A Systematic Analysis of Annotations, Instructions, and Response Pairs in Preference Dataset	Bingxiang He et.al.	2504.03612	null
2025-04-04	MedSAM2: Segment Anything in 3D Medical Images and Videos	Jun Ma et.al.	2504.03600	link
2025-04-04	EnrichIndex: Using LLMs to Enrich Retrieval Indices Offline	Peter Baile Chen et.al.	2504.03598	null
2025-04-04	PF3Det: A Prompted Foundation Feature Assisted Visual LiDAR 3D Detector	Kaidong Li et.al.	2504.03563	null
2025-04-04	Agentic Knowledgeable Self-awareness	Shuofei Qiao et.al.	2504.03553	link
2025-04-04	RANa: Retrieval-Augmented Navigation	Gianluca Monaci et.al.	2504.03524	null
2025-04-04	Neutralizing the Narrative: AI-Powered Debiasing of Online News Articles	Chen Wei Kuo et.al.	2504.03520	null
2025-04-04	SpectR: Dynamically Composing LM Experts with Spectral Routing	William Fleshman et.al.	2504.03454	null
2025-04-04	Optimizing Specific and Shared Parameters for Efficient Parameter Tuning	Van-Anh Nguyen et.al.	2504.03450	null
2025-04-04	LLMSched: Uncertainty-Aware Workload Scheduling for Compound LLM Applications	Botao Zhu et.al.	2504.03444	null
2025-04-04	Know What You do Not Know: Verbalized Uncertainty Estimation Robustness on Corrupted Images in Vision-Language Models	Mirko Borszukovszki et.al.	2504.03440	null
2025-04-04	Locations of Characters in Narratives: Andersen and Persuasion Datasets	Batuhan Ozyurt et.al.	2504.03434	link
2025-04-04	Online Difficulty Filtering for Reasoning Oriented Reinforcement Learning	Sanghwan Bae et.al.	2504.03380	null
2025-04-04	MultiClear: Multimodal Soft Exoskeleton Glove for Transparent Object Grasping Assistance	Chen Hu et.al.	2504.03379	null
2025-04-04	Sustainable LLM Inference for Edge AI: Evaluating Quantized LLMs for Energy Efficiency, Output Accuracy, and Inference Latency	Erik Johannes Husom et.al.	2504.03360	null
2025-04-03	STING-BEE: Towards Vision-Language Model for Real-World X-ray Baggage Security Inspection	Divya Velayudhan et.al.	2504.02823	null
2025-04-03	Sparse Autoencoders Learn Monosemantic Features in Vision-Language Models	Mateusz Pach et.al.	2504.02821	link
2025-04-03	Generative Evaluation of Complex Reasoning in Large Language Models	Haowei Lin et.al.	2504.02810	link
2025-04-03	MegaMath: Pushing the Limits of Open Math Corpora	Fan Zhou et.al.	2504.02807	link
2025-04-03	F-ViTA: Foundation Model Guided Visible to Thermal Translation	Jay N. Paranjape et.al.	2504.02801	link
2025-04-04	A Survey of Large Language Models in Mental Health Disorder Detection on Social Media	Zhuohan Ge et.al.	2504.02800	null
2025-04-03	Systematic Evaluation of Large Vision-Language Models for Surgical Artificial Intelligence	Anita Rau et.al.	2504.02799	null
2025-04-03	A Framework for Situating Innovations, Opportunities, and Challenges in Advancing Vertical Systems with Large AI Models	Gaurav Verma et.al.	2504.02793	null
2025-04-03	Unified World Models: Coupling Video and Action Diffusion for Pretraining on Large Robotic Datasets	Chuning Zhu et.al.	2504.02792	null
2025-04-03	A Framework for Robust Cognitive Evaluation of LLMs	Karin de Langis et.al.	2504.02789	null
2025-04-03	From Consumption to Collaboration: Measuring Interaction Patterns to Augment Human Cognition in Open-Ended Tasks	Joshua Holstein et.al.	2504.02780	null
2025-04-03	BT-ACTION: A Test-Driven Approach for Modular Understanding of User Instruction Leveraging Behaviour Trees and LLMs	Alexander Leszczynski et.al.	2504.02779	link
2025-04-03	How Deep Do Large Language Models Internalize Scientific Literature and Citation Practices?	Andres Algaba et.al.	2504.02767	link
2025-04-03	Robot-Led Vision Language Model Wellbeing Assessment of Children	Nida Itrat Abbasi et.al.	2504.02765	null
2025-04-03	Enhancing LLM Robustness to Perturbed Instructions: An Empirical Study	Aryan Agrawal et.al.	2504.02733	link
2025-04-04	Why do LLMs attend to the first token?	Federico Barbero et.al.	2504.02732	null
2025-04-03	ERPO: Advancing Safety Alignment via Ex-Ante Reasoning Preference Optimization	Kehua Feng et.al.	2504.02725	null
2025-04-03	TeleMoM: Consensus-Driven Telecom Intelligence via Mixture of Models	Xinquan Wang et.al.	2504.02712	null
2025-04-03	The Hidden Space of Safety: Understanding Preference-Tuned LLMs in Multilingual context	Nikhil Verma et.al.	2504.02708	null
2025-04-03	LLM for Complex Reasoning Task: An Exploratory Study in Fermi Problems	Zishuo Liu et.al.	2504.02671	null
2025-04-02	Slot-Level Robotic Placement via Visual Imitation from Single Human Video	Dandan Shan et.al.	2504.01959	null
2025-04-02	Towards Unified Referring Expression Segmentation Across Omni-Level Visual Target Granularities	Jing Liu et.al.	2504.01954	null
2025-04-02	The LLM Wears Prada: Analysing Gender Bias and Stereotypes through Online Shopping Data	Massimiliano Luca et.al.	2504.01951	null
2025-04-02	Efficient Federated Learning Tiny Language Models for Mobile Network Feature Prediction	Daniel Becking et.al.	2504.01947	null
2025-04-02	OpenCodeReasoning: Advancing Data Distillation for Competitive Coding	Wasi Uddin Ahmad et.al.	2504.01943	null
2025-04-02	Critical Thinking: Which Kinds of Complexity Govern Optimal Reasoning Length?	Celine Lee et.al.	2504.01935	link
2025-04-02	A thorough benchmark of automatic text classification: From traditional approaches to large language models	Washington Cunha et.al.	2504.01930	link
2025-04-02	Gen-C: Populating Virtual Worlds with Generative Crowds	Andreas Panayiotou et.al.	2504.01924	null
2025-04-02	Is Less Really More? Fake News Detection with Limited Information	Zhaoyang Cao et.al.	2504.01922	link
2025-04-03	Bridging the Linguistic Divide: A Survey on Leveraging Large Language Models for Machine Translation	Baban Gain et.al.	2504.01919	null
2025-04-02	FineLIP: Extending CLIP's Reach via Fine-Grained Alignment with Longer Text Inputs	Mothilal Asokan et.al.	2504.01916	link
2025-04-02	Advancing AI-Scientist Understanding: Making LLM Think Like a Physicist with Interpretable Reasoning	Yinggan Xu et.al.	2504.01911	null
2025-04-02	Is Temporal Prompting All We Need For Limited Labeled Action Recognition?	Shreyank N Gowda et.al.	2504.01890	null
2025-04-02	TransientTables: Evaluating LLMs' Reasoning on Temporally Evolving Semi-structured Tables	Abhilash Shankarampeta et.al.	2504.01879	null
2025-04-02	From Code Generation to Software Testing: AI Copilot with Context-Based RAG	Yuchen Wang et.al.	2504.01866	null
2025-04-02	Cross-Lingual Consistency: A Novel Inference Framework for Advancing Reasoning in Large Language Models	Zhiwei Yu et.al.	2504.01857	null
2025-04-02	Code Red! On the Harmfulness of Applying Off-the-shelf Large Language Models to Programming Tasks	Ali Al-Kaswan et.al.	2504.01850	null
2025-04-02	LARGE: Legal Retrieval Augmented Generation Evaluation Tool	Minhu Park et.al.	2504.01840	link
2025-04-02	Prompting Medical Vision-Language Models to Mitigate Diagnosis Bias by Generating Realistic Dermoscopic Images	Nusrat Munia et.al.	2504.01838	link
2025-04-02	YourBench: Easy Custom Evaluation Sets for Everyone	Sumuk Shashidhar et.al.	2504.01833	link
2025-03-31	Any2Caption:Interpreting Any Condition to Caption for Controllable Video Generation	Shengqiong Wu et.al.	2503.24379	null
2025-03-31	ACPBench Hard: Unrestrained Reasoning about Action, Change, and Planning	Harsha Kokel et.al.	2503.24378	null
2025-03-31	Harnessing the Reasoning Economy: A Survey of Efficient Reasoning for Large Language Models	Rui Wang et.al.	2503.24377	link
2025-03-31	Exploring the Effect of Reinforcement Learning on Video Understanding: Insights from SEED-Bench-R1	Yi Chen et.al.	2503.24376	link
2025-03-31	Effectively Controlling Reasoning Models through Thinking Intervention	Tong Wu et.al.	2503.24370	null
2025-03-31	Adapting Vision Foundation Models for Real-time Ultrasound Image Segmentation	Xiaoran Zhang et.al.	2503.24368	null
2025-03-31	ORAL: Prompting Your Large-Scale LoRAs via Conditional Recurrent Diffusion	Rana Muhammad Shahroz Khan et.al.	2503.24354	null
2025-03-31	PathOrchestra: A Comprehensive Foundation Model for Computational Pathology with Over 100 Diverse Clinical-Grade Tasks	Fang Yan et.al.	2503.24345	null
2025-03-31	Can Test-Time Scaling Improve World Foundation Model?	Wenyan Cong et.al.	2503.24320	link
2025-03-31	BEATS: Bias Evaluation and Assessment Test Suite for Large Language Models	Alok Abhishek et.al.	2503.24310	null
2025-03-31	A Systematic Evaluation of LLM Strategies for Mental Health Text Analysis: Fine-tuning vs. Prompt Engineering vs. RAG	Arshia Kermani et.al.	2503.24307	null
2025-03-31	Rec-R1: Bridging Generative Large Language Models and User-Centric Recommendation Systems via Reinforcement Learning	Jiacheng Lin et.al.	2503.24289	link
2025-03-31	Style Quantization for Data-Efficient GAN Training	Jian Wang et.al.	2503.24282	null
2025-03-31	Evaluating and Designing Sparse Autoencoders by Approximating Quasi-Orthogonality	Sewoong Lee et.al.	2503.24277	link
2025-03-31	Enhancing Large Language Models (LLMs) for Telecommunications using Knowledge Graphs and Retrieval-Augmented Generation	Dun Yuan et.al.	2503.24245	null
2025-03-31	What, How, Where, and How Well? A Survey on Test-Time Scaling in Large Language Models	Qiyuan Zhang et.al.	2503.24235	link
2025-03-31	Synthetic News Generation for Fake News Classification	Abdul Sittar et.al.	2503.24206	null
2025-03-31	TwT: Thinking without Tokens by Habitual Reasoning Distillation with Multi-Teachers' Guidance	Jingxian Xu et.al.	2503.24198	null
2025-04-02	Text2Tracks: Prompt-based Music Recommendation via Generative Retrieval	Enrico Palumbo et.al.	2503.24193	null
2025-03-31	Output Constraints as Attack Surface: Exploiting Structured Generation to Bypass LLM Safety Mechanisms	Shuoming Zhang et.al.	2503.24191	null
2025-03-28	Q-Insight: Understanding Image Quality via Visual Reinforcement Learning	Weiqi Li et.al.	2503.22679	link
2025-03-28	QuestBench: Can LLMs ask the right question to acquire information in reasoning tasks?	Belinda Z. Li et.al.	2503.22674	link
2025-03-28	Exploring the Effectiveness of Multi-stage Fine-tuning for Cross-encoder Re-rankers	Francesca Pezzuti et.al.	2503.22672	link
2025-03-28	Understanding Co-speech Gestures in-the-wild	Sindhu B Hegde et.al.	2503.22668	null
2025-03-28	Unicorn: Text-Only Data Synthesis for Vision Language Model Training	Xiaomin Yu et.al.	2503.22655	link
2025-03-28	Evaluating Multimodal Language Models as Visual Assistants for Visually Impaired Users	Antonia Karamolegkou et.al.	2503.22610	null
2025-03-28	On the Alignment of Post-Publication Reviews & Bibliometric and Altmetric Impact -- A Case Study on Expert Statements from the Science Media Center Germany	Dirk Tunger et.al.	2503.22594	null
2025-03-28	LLM-enabled Instance Model Generation	Fengjunjie Pan et.al.	2503.22587	null
2025-03-28	Historical Ink: Exploring Large Language Models for Irony Detection in 19th-Century Spanish	Kevin Cohen et.al.	2503.22585	link
2025-03-28	Beyond Vanilla Fine-Tuning: Leveraging Multistage, Multilingual, and Domain-Specific Methods for Low-Resource Machine Translation	Sarubi Thillainathan et.al.	2503.22582	null
2025-03-28	Breaking Language Barriers in Visual Language Models via Multilingual Textual Regularization	Iñigo Pikabea et.al.	2503.22577	null
2025-03-28	Niyama : Breaking the Silos of LLM Inference Serving	Kanishk Goel et.al.	2503.22562	null
2025-03-28	Bridging the Dimensional Chasm: Uncover Layer-wise Dimensional Reduction in Transformers through Token Correlation	Zhuo-Yang Song et.al.	2503.22547	null
2025-03-28	Exploiting Mixture-of-Experts Redundancy Unlocks Multimodal Generative Abilities	Raman Dutt et.al.	2503.22517	null
2025-03-28	Assessing Foundation Models for Sea Ice Type Segmentation in Sentinel-1 SAR Imagery	Samira Alkaee Taleghan et.al.	2503.22516	null
2025-03-28	Probabilistic Uncertain Reward Model: A Natural Generalization of Bradley-Terry Reward Model	Wangtao Sun et.al.	2503.22480	null
2025-03-28	WorkTeam: Constructing Workflows from Natural Language with Multi-Agents	Hanchao Liu et.al.	2503.22473	null
2025-03-28	Evaluating LLM-based Agents for Multi-Turn Conversations: A Survey	Shengyue Guan et.al.	2503.22458	null
2025-03-28	Entropy-guided sequence weighting for efficient exploration in RL-based LLM fine-tuning	Abdullah Vanlioglu et.al.	2503.22456	null
2025-03-28	STADE: Standard Deviation as a Pruning Metric	Diego Coello de Portugal Mecke et.al.	2503.22451	link
2025-03-27	Mobile-VideoGPT: Fast and Accurate Video Understanding Language Model	Abdelrahman Shaker et.al.	2503.21782	link
2025-03-27	Video-R1: Reinforcing Video Reasoning in MLLMs	Kaituo Feng et.al.	2503.21776	link
2025-03-27	Stable-SCore: A Stable Registration-based Framework for 3D Shape Correspondence	Haolin Liu et.al.	2503.21766	null
2025-03-27	Uni4D: Unifying Visual Foundation Models for 4D Modeling from a Single Video	David Yifan Yao et.al.	2503.21761	link
2025-03-27	MemInsight: Autonomous Memory Augmentation for LLM Agents	Rana Salama et.al.	2503.21760	null
2025-03-27	Fwd2Bot: LVLM Visual Token Compression with Double Forward Bottleneck	Adrian Bulat et.al.	2503.21757	null
2025-03-27	GateLens: A Reasoning-Enhanced LLM Agent for Automotive Software Release Analytics	Arsham Gholamzadeh Khoee et.al.	2503.21735	null
2025-03-27	Effective Skill Unlearning through Intervention and Abstention	Yongce Li et.al.	2503.21730	link
2025-03-27	Collab: Controlled Decoding using Mixture of Agents for LLM Alignment	Souradip Chakraborty et.al.	2503.21720	null
2025-03-28	Outlier dimensions favor frequent tokens in language models	Iuri Macocco et.al.	2503.21718	null
2025-03-27	As easy as PIE: understanding when pruning causes language models to disagree	Pietro Tropeano et.al.	2503.21714	link
2025-03-27	Enhancing Repository-Level Software Repair via Repository-Aware Knowledge Graphs	Boyang Yang et.al.	2503.21710	null
2025-03-27	LLM-Gomoku: A Large Language Model-Based System for Strategic Gomoku with Self-Play and Reinforcement Learning	Hui Wang et.al.	2503.21683	null
2025-03-27	JiraiBench: A Bilingual Benchmark for Evaluating Large Language Models' Detection of Human Self-Destructive Behavior Content in Jirai Community	Yunze Xiao et.al.	2503.21679	null
2025-03-27	How do language models learn facts? Dynamics, curricula and hallucinations	Nicolas Zucchet et.al.	2503.21676	null
2025-03-27	Intelligent IoT Attack Detection Design via ODLLM with Feature Ranking-based Knowledge Base	Satvik Verma et.al.	2503.21674	link
2025-03-27	Model Assembly Learning with Heterogeneous Layer Weight Merging	Yi-Kai Zhang et.al.	2503.21657	null
2025-03-27	UI-R1: Enhancing Action Prediction of GUI Agents by Reinforcement Learning	Zhengxi Lu et.al.	2503.21620	link
2025-03-27	Leveraging Language Models for Analyzing Longitudinal Experiential Data in Education	Ahatsham Hayat et.al.	2503.21617	null
2025-03-27	Evaluating book summaries from internal knowledge in Large Language Models: a cross-model and semantic consistency approach	Javier Coronado-Blázquez et.al.	2503.21613	null
2025-03-26	Mobile-MMLU: A Mobile Intelligence Language Understanding Benchmark	Sondos Mahmoud Bsharat et.al.	2503.20786	link
2025-03-26	Free4D: Tuning-free 4D Scene Generation with Spatial-Temporal Consistency	Tianqi Liu et.al.	2503.20785	link
2025-03-26	Feature4X: Bridging Any Monocular Video to 4D Agentic AI with Versatile Gaussian Feature Fields	Shijie Zhou et.al.	2503.20776	null
2025-03-26	ASGO: Adaptive Structured Gradient Optimization	Kang An et.al.	2503.20762	null
2025-03-26	MCTS-RAG: Enhancing Retrieval-Augmented Generation with Monte Carlo Tree Search	Yunhai Hu et.al.	2503.20757	null
2025-03-27	Reason-RFT: Reinforcement Fine-Tuning for Visual Reasoning	Huajie Tan et.al.	2503.20752	null
2025-03-26	UniSTD: Towards Unified Spatio-Temporal Learning across Diverse Disciplines	Chen Tang et.al.	2503.20748	null
2025-03-26	MATHGLANCE: Multimodal Large Language Models Do Not Know Where to Look in Mathematical Diagrams	Yanpeng Sun et.al.	2503.20745	null
2025-03-26	Dynamic Motion Blending for Versatile Motion Editing	Nan Jiang et.al.	2503.20724	null
2025-03-26	From Annotation to Adaptation: Metrics, Synthetic Data, and Aspect Extraction for Aspect-Based Sentiment Analysis with Large Language Models	Nikita Neveditsin et.al.	2503.20715	null
2025-03-26	MMMORRF: Multimodal Multilingual Modularized Reciprocal Rank Fusion	Saron Samuel et.al.	2503.20698	null
2025-03-26	Graph-Enhanced Model-Free Reinforcement Learning Agents for Efficient Power Grid Topological Control	Eloy Anguiano Batanero et.al.	2503.20688	null
2025-03-27	Flip Learning: Weakly Supervised Erase to Segment Nodules in Breast Ultrasound	Yuhao Huang et.al.	2503.20685	null
2025-03-27	Mitigating Low-Level Visual Hallucinations Requires Self-Awareness: Database, Model and Training Strategy	Yinan Sun et.al.	2503.20673	null
2025-03-26	TAMA: A Human-AI Collaborative Thematic Analysis Framework Using Multi-Agent LLMs for Clinical Interviews	Huimin Xu et.al.	2503.20666	null
2025-03-26	AutoRad-Lung: A Radiomic-Guided Prompting Autoregressive Vision-Language Model for Lung Nodule Malignancy Prediction	Sadaf Khademi et.al.	2503.20662	null
2025-03-26	AccidentSim: Generating Physically Realistic Vehicle Collision Videos from Real-World Accident Reports	Xiangwen Zhang et.al.	2503.20654	null
2025-03-26	Unlocking Efficient Long-to-Short LLM Reasoning with Model Merging	Han Wu et.al.	2503.20641	link
2025-03-26	Collaborative Storytelling and LLM: A Linguistic Analysis of Automatically-Generated Role-Playing Game Sessions	Alessandro Maisto et.al.	2503.20623	null
2025-03-26	IAP: Improving Continual Learning of Vision-Language Models via Instance-Aware Prompting	Hao Fu et.al.	2503.20612	link
2025-03-25	SuperFlow++: Enhanced Spatiotemporal Consistency for Cross-Modal Data Pretraining	Xiang Xu et.al.	2503.19912	link
2025-03-25	CoLLM: A Large Language Model for Composed Image Retrieval	Chuong Huynh et.al.	2503.19910	link
2025-03-25	FullDiT: Multi-Task Video Generative Foundation Model with Full Attention	Xuan Ju et.al.	2503.19907	null
2025-03-25	CAFe: Unifying Representation and Generation with Contrastive-Autoregressive Finetuning	Hao Yu et.al.	2503.19900	link
2025-03-25	A Multi-Agent Framework Integrating Large Language Models and Generative AI for Accelerated Metamaterial Design	Jie Tian et.al.	2503.19889	null
2025-03-25	CausalRAG: Integrating Causal Graphs into Retrieval-Augmented Generation	Nengbo Wang et.al.	2503.19878	null
2025-03-25	Scaling Evaluation-time Compute with Reasoning Models as Process Evaluators	Seungone Kim et.al.	2503.19877	null
2025-03-25	SLA-Awareness for AI-assisted coding	Kishanthan Thangarajah et.al.	2503.19876	null
2025-03-25	Think Twice: Enhancing LLM Reasoning by Scaling Multi-round Test-time Thinking	Xiaoyu Tian et.al.	2503.19855	null
2025-03-25	Towards Online Multi-Modal Social Interaction Understanding	Xinpeng Li et.al.	2503.19851	link
2025-03-25	FALCONEye: Finding Answers and Localizing Content in ONE-hour-long videos with multi-modal LLMs	Carlos Plou et.al.	2503.19850	null
2025-03-25	A Comparative Analysis of Word Segmentation, Part-of-Speech Tagging, and Named Entity Recognition for Historical Chinese Sources, 1900-1950	Zhao Fang et.al.	2503.19844	null
2025-03-25	FireEdit: Fine-grained Instruction-based Image Editing via Region-aware Vision Language Model	Jun Zhou et.al.	2503.19839	null
2025-03-25	Domain-incremental White Blood Cell Classification with Privacy-aware Continual Learning	Pratibha Kumari et.al.	2503.19819	null
2025-03-25	SeLIP: Similarity Enhanced Contrastive Language Image Pretraining for Multi-modal Head MRI	Zhiyang Liu et.al.	2503.19801	null
2025-03-25	SemEval-2025 Task 9: The Food Hazard Detection Challenge	Korbinian Randl et.al.	2503.19800	null
2025-03-25	PAVE: Patching and Adapting Video Large Language Models	Zhuoming Liu et.al.	2503.19794	link
2025-03-25	Fine-Grained Erasure in Text-to-Image Diffusion-based Foundation Models	Kartik Thakral et.al.	2503.19783	null
2025-03-25	LPOSS: Label Propagation Over Patches and Pixels for Open-vocabulary Semantic Segmentation	Vladan Stojnić et.al.	2503.19777	link
2025-03-25	OpenLex3D: A New Evaluation Benchmark for Open-Vocabulary 3D Scene Representations	Christina Kassab et.al.	2503.19764	null
2025-03-24	DINO in the Room: Leveraging 2D Foundation Models for 3D Segmentation	Karim Abou Zeid et.al.	2503.18944	link
2025-03-24	SlowFast-LLaVA-1.5: A Family of Token-Efficient Video Large Language Models for Long-Form Video Understanding	Mingze Xu et.al.	2503.18943	null
2025-03-24	Video-T1: Test-Time Scaling for Video Generation	Fangfu Liu et.al.	2503.18942	null
2025-03-24	Exploring Training and Inference Scaling Laws in Generative Retrieval	Hongru Cai et.al.	2503.18941	link
2025-03-24	CoMP: Continual Multimodal Pre-training for Vision Foundation Models	Yitong Chen et.al.	2503.18931	link
2025-03-24	Trajectory Balance with Asynchrony: Decoupling Exploration and Learning for Fast, Scalable LLM Post-Training	Brian R. Bartoldson et.al.	2503.18929	null
2025-03-24	Video SimpleQA: Towards Factuality Evaluation in Large Video Language Models	Meng Cao et.al.	2503.18923	null
2025-03-24	FFN Fusion: Rethinking Sequential Computation in Large Language Models	Akhiad Bercovich et.al.	2503.18908	null
2025-03-24	xKV: Cross-Layer SVD for KV-Cache Compression	Chi-Chih Chang et.al.	2503.18893	link
2025-03-24	AgentDropout: Dynamic Agent Elimination for Token-Efficient and High-Performance LLM-Based Multi-Agent Collaboration	Zhexuan Wang et.al.	2503.18891	link
2025-03-24	Toward building next-generation Geocoding systems: a systematic review	Zhengcong Yin et.al.	2503.18888	null
2025-03-24	I Have Covered All the Bases Here: Interpreting Reasoning Features in Large Language Models via Sparse Autoencoders	Andrey Galichin et.al.	2503.18878	link
2025-03-24	Efficient Self-Supervised Adaptation for Medical Image Analysis	Moein Sorkhei et.al.	2503.18873	link
2025-03-24	Reimagining Memory Access for LLM Inference: Compression-Aware Memory Controller Design	Rui Xie et.al.	2503.18869	null
2025-03-24	Reasoning to Learn from Latent Thoughts	Yangjun Ruan et.al.	2503.18866	null
2025-03-25	Structuring Scientific Innovation: A Framework for Modeling and Discovering Impactful Knowledge Combinations	Junlan Chen et.al.	2503.18865	null
2025-03-25	MC-LLaVA: Multi-Concept Personalized Vision-Language Model	Ruichuan An et.al.	2503.18854	link
2025-03-24	Enhanced OoD Detection through Cross-Modal Alignment of Multi-Modal Representations	Jeonghyeon Kim et.al.	2503.18817	link
2025-03-24	Defeating Prompt Injections by Design	Edoardo Debenedetti et.al.	2503.18813	null
2025-03-24	SKDU at De-Factify 4.0: Vision Transformer with Data Augmentation for AI-Generated Image Detection	Shrikant Malviya et.al.	2503.18812	link
2025-03-21	Dancing with Critiques: Enhancing LLM Reasoning with Stepwise Natural Language Self-Critique	Yansi Li et.al.	2503.17363	null
2025-03-21	HCAST: Human-Calibrated Autonomy Software Tasks	David Rein et.al.	2503.17354	link
2025-03-21	NdLinear Is All You Need for Representation Learning	Alex Reneau et.al.	2503.17353	link
2025-03-21	OpenVLThinker: An Early Exploration to Complex Vision-Language Reasoning via Iterative Self-Improvement	Yihe Deng et.al.	2503.17352	link
2025-03-21	Beyond Semantics: Rediscovering Spatial Awareness in Vision-Language Models	Jianing Qi et.al.	2503.17349	null
2025-03-21	Capturing Individual Human Preferences with Reward Features	André Barreto et.al.	2503.17338	null
2025-03-21	Efficient Intent-Based Filtering for Multi-Party Conversations Using Knowledge Distillation from LLMs	Reem Gody et.al.	2503.17336	null
2025-03-21	CVE-Bench: A Benchmark for AI Agents' Ability to Exploit Real-World Web Application Vulnerabilities	Yuxuan Zhu et.al.	2503.17332	link
2025-03-21	LLM+MAP: Bimanual Robot Task Planning using Large Language Models and Planning Domain Definition Language	Kun Chu et.al.	2503.17309	link
2025-03-21	Bugdar: AI-Augmented Secure Code Review for GitHub Pull Requests	John Naulty et.al.	2503.17302	null
2025-03-21	FastCuRL: Curriculum Reinforcement Learning with Progressive Context Extension for Efficient Training R1-like Reasoning Models	Mingyang Song et.al.	2503.17287	link
2025-03-21	CASE -- Condition-Aware Sentence Embeddings for Conditional Semantic Textual Similarity Measurement	Gaifan Zhang et.al.	2503.17279	null
2025-03-21	Revisiting End To End Sparse Autoencoder Training -- A Short Finetune is All You Need	Adam Karvonen et.al.	2503.17272	link
2025-03-21	SafeMERGE: Preserving Safety Alignment in Fine-Tuned Large Language Models via Selective Layer-Wise Model Merging	Aladin Djuhera et.al.	2503.17239	link
2025-03-21	Slide-Level Prompt Learning with Vision Language Models for Few-Shot Multiple Instance Learning in Histopathology	Devavrat Tomar et.al.	2503.17238	link
2025-03-21	FactSelfCheck: Fact-Level Black-Box Hallucination Detection for LLMs	Albert Sawczyn et.al.	2503.17229	null
2025-03-21	Automating Adjudication of Cardiovascular Events Using Large Language Models	Sonish Sivarajkumar et.al.	2503.17222	null
2025-03-21	A Language Anchor-Guided Method for Robust Noisy Domain Generalization	Zilin Dai et.al.	2503.17211	null
2025-03-21	TreeSynth: Synthesizing Diverse Data from Scratch via Tree-Guided Subspace Partitioning	Sheng Wang et.al.	2503.17195	null
2025-03-21	LLMs Love Python: A Study of LLMs' Bias for Programming Languages and Libraries	Lukas Twist et.al.	2503.17181	link
2025-03-20	DynamicVis: An Efficient and General Visual Foundation Model for Remote Sensing Image Understanding	Keyan Chen et.al.	2503.16426	link
2025-03-20	Stop Overthinking: A Survey on Efficient Reasoning for Large Language Models	Yang Sui et.al.	2503.16419	link
2025-03-20	M3: 3D-Spatial MultiModal Memory	Xueyan Zou et.al.	2503.16413	link
2025-03-20	The Emperor's New Clothes in Benchmarking? A Rigorous Examination of Mitigation Strategies for LLM Benchmark Data Contamination	Yifan Sun et.al.	2503.16402	link
2025-03-20	Exploring the Hidden Reasoning Process of Large Language Models by Misleading Them	Guanyu Chen et.al.	2503.16401	null
2025-03-20	Deconstructing Long Chain-of-Thought: A Structured Reasoning Optimization Framework for Long CoT Distillation	Yijia Luo et.al.	2503.16385	link
2025-03-20	LaPIG: Cross-Modal Generation of Paired Thermal and Visible Facial Images	Leyang Wang et.al.	2503.16376	null
2025-03-20	JARVIS-VLA: Post-Training Large-Scale Vision Language Models to Play Visual Games with Keyboards and Mouse	Muyao Li et.al.	2503.16365	null
2025-03-20	CaKE: Circuit-aware Editing Enables Generalizable Knowledge Learners	Yunzhi Yao et.al.	2503.16356	link
2025-03-20	Lyra: An Efficient and Expressive Subquadratic Architecture for Modeling Biological Sequences	Krithik Ramesh et.al.	2503.16351	null
2025-03-20	LLM Braces: Straightening Out LLM Predictions with Relevant Sub-Updates	Ying Shen et.al.	2503.16334	null
2025-03-20	OmniGeo: Towards a Multimodal Large Language Models for Geospatial Artificial Intelligence	Long Yuan et.al.	2503.16326	null
2025-03-20	Issue2Test: Generating Reproducing Test Cases from Issue Reports	Noor Nashid et.al.	2503.16320	null
2025-03-21	Bridging Technology and Humanities: Evaluating the Impact of Large Language Models on Social Sciences Research with DeepSeek-R1	Peiran Gu et.al.	2503.16304	null
2025-03-20	Generalized Few-shot 3D Point Cloud Segmentation with Vision-Language Model	Zhaochong An et.al.	2503.16282	link
2025-03-21	Uni-3DAR: Unified 3D Generation and Understanding via Autoregression on Compressed Spatial Tokens	Shuqi Lu et.al.	2503.16278	link
2025-03-20	Chain of Functions: A Programmatic Pipeline for Fine-Grained Chart Reasoning Data	Zijian Li et.al.	2503.16260	null
2025-03-20	Plug-and-Play 1.x-Bit KV Cache Quantization for Video Large Language Models	Keda Tao et.al.	2503.16257	null
2025-03-21	Fin-R1: A Large Language Model for Financial Reasoning through Reinforcement Learning	Zhaowei Liu et.al.	2503.16252	link
2025-03-20	Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't	Quy-Anh Dang et.al.	2503.16219	link
2025-03-19	TULIP: Towards Unified Language-Image Pretraining	Zineng Tang et.al.	2503.15485	null
2025-03-19	SWEET-RL: Training Multi-Turn LLM Agents on Collaborative Reasoning Tasks	Yifei Zhou et.al.	2503.15478	link
2025-03-19	What Makes a Reward Model a Good Teacher? An Optimization Perspective	Noam Razin et.al.	2503.15477	link
2025-03-19	Cube: A Roblox View of 3D Intelligence	Foundation AI Team et.al.	2503.15475	link
2025-03-19	EgoDTM: Towards 3D-Aware Egocentric Video-Language Pretraining	Boshen Xu et.al.	2503.15470	link
2025-03-19	From 1,000,000 Users to Every User: Scaling Up Personalized Preference for User-level Alignment	Jia-Nan Li et.al.	2503.15463	link
2025-03-19	SkyLadder: Better and Faster Pretraining via Context Window Scheduling	Tongyao Zhu et.al.	2503.15450	link
2025-03-19	VenusFactory: A Unified Platform for Protein Engineering Data Retrieval and Language Model Fine-Tuning	Yang Tan et.al.	2503.15438	link
2025-03-19	Visual Position Prompt for MLLM based Visual Grounding	Wei Tang et.al.	2503.15426	link
2025-03-19	Probing the topology of the space of tokens with structured prompts	Michael Robinson et.al.	2503.15421	null
2025-03-19	Visual Persona: Foundation Model for Full-Body Human Customization	Jisu Nam et.al.	2503.15406	null
2025-03-19	FedSCA: Federated Tuning with Similarity-guided Collaborative Aggregation for Heterogeneous Medical Image Segmentation	Yumin Zhang et.al.	2503.15390	null
2025-03-19	EfficientLLaVA:Generalizable Auto-Pruning for Large Vision-language Models	Yinan Liang et.al.	2503.15369	null
2025-03-19	SemEval-2025 Task 1: AdMIRe -- Advancing Multimodal Idiomaticity Representation	Thomas Pickard et.al.	2503.15358	null
2025-03-19	SPILL: Domain-Adaptive Intent Clustering based on Selection and Pooling with Large Language Models	I-Fan Lin et.al.	2503.15351	null
2025-03-19	TruthLens:A Training-Free Paradigm for DeepFake Detection	Ritabrata Chakraborty et.al.	2503.15342	null
2025-03-19	Uncertainty-Guided Chain-of-Thought for Code Generation with LLMs	Yuqi Zhu et.al.	2503.15341	null
2025-03-19	Solla: Towards a Speech-Oriented LLM That Hears Acoustic Context	Junyi Ao et.al.	2503.15338	link
2025-03-19	Recover and Match: Open-Vocabulary Multi-Label Recognition through Knowledge-Constrained Optimal Transport	Hao Tan et.al.	2503.15337	link
2025-03-19	Euclid Quick Data Release (Q1) Exploring galaxy properties with a multi-modal foundation model	Euclid Collaboration et.al.	2503.15312	link
2025-03-18	Aligning Multimodal LLM with Human Preference: A Survey	Tao Yu et.al.	2503.14504	link
2025-03-18	Engineering Scientific Assistants using Interactive Structured Induction of Programs	Shraddha Surana et.al.	2503.14488	null
2025-03-18	Gricean Norms as a Basis for Effective Collaboration	Fardin Saad et.al.	2503.14484	link
2025-03-19	Creation-MMBench: Assessing Context-Aware Creative Intelligence in MLLM	Xinyu Fang et.al.	2503.14478	link
2025-03-18	Characterizing Data Visualization Literacy: a Systematic Literature Review	Sara Beschi et.al.	2503.14468	null
2025-03-18	RWKV-7 "Goose" with Expressive Dynamic State Evolution	Bo Peng et.al.	2503.14456	link
2025-03-18	EnvBench: A Benchmark for Automated Environment Setup	Aleksandra Eliseeva et.al.	2503.14443	link
2025-03-18	LLM-FE: Automated Feature Engineering for Tabular Data with LLMs as Evolutionary Optimizers	Nikhil Abhyankar et.al.	2503.14434	link
2025-03-18	PLAY2PROMPT: Zero-shot Tool Instruction Optimization for LLM Agents via Tool Play	Wei Fang et.al.	2503.14432	null
2025-03-18	ExDDV: A New Dataset for Explainable Deepfake Detection in Video	Vlad Hondru et.al.	2503.14421	link
2025-03-18	Unifying Text Semantics and Graph Structures for Temporal Text-attributed Graphs with Large Language Models	Siwei Zhang et.al.	2503.14411	null
2025-03-18	Large Language Models for Virtual Human Gesture Selection	Parisa Ghanad Torshizi et.al.	2503.14408	null
2025-03-18	DUNE: Distilling a Universal Encoder from Heterogeneous 2D and 3D Teachers	Mert Bulent Sariyildiz et.al.	2503.14405	null
2025-03-18	From "Hallucination" to "Suture": Insights from Language Philosophy to Enhance Large Language Models	Qiantong Wang et.al.	2503.14392	null
2025-03-18	How much do LLMs learn from negative examples?	Shadi Hamdan et.al.	2503.14391	null
2025-03-18	Good/Evil Reputation Judgment of Celebrities by LLMs via Retrieval Augmented Generation	Rikuto Tsuchida et.al.	2503.14382	null
2025-03-18	On the Standard Performance Criteria for Applied Control Design: PID, MPC or Machine Learning Controller?	Pouria Sarhadi et.al.	2503.14379	link
2025-03-18	Tiled Flash Linear Attention: More Efficient Linear RNN and xLSTM Kernels	Maximilian Beck et.al.	2503.14376	link
2025-03-18	MAST-Pro: Dynamic Mixture-of-Experts for Adaptive Segmentation of Pan-Tumors with Knowledge-Driven Prompts	Runqi Meng et.al.	2503.14355	null
2025-03-19	MoonCast: High-Quality Zero-Shot Podcast Generation	Zeqian Ju et.al.	2503.14345	link
2025-03-17	MetaScale: Test-Time Scaling with Evolving Meta-Thoughts	Qin Liu et.al.	2503.13447	null
2025-03-17	MoManipVLA: Transferring Vision-language-action Models for General Mobile Manipulation	Zhenyu Wu et.al.	2503.13446	null
2025-03-17	Faithfulness of LLM Self-Explanations for Commonsense Tasks: Larger Is Better, and Instruction-Tuning Allows Trade-Offs but Not Pareto Dominance	Noah Y. Siegel et.al.	2503.13445	null
2025-03-17	VideoMind: A Chain-of-LoRA Agent for Long Video Reasoning	Ye Liu et.al.	2503.13444	link
2025-03-17	DPC: Dual-Prompt Collaboration for Tuning Vision-Language Models	Haoyang Li et.al.	2503.13443	link
2025-03-18	MaTVLM: Hybrid Mamba-Transformer for Efficient Vision-Language Modeling	Yingyue Li et.al.	2503.13440	link
2025-03-17	xLSTM 7B: A Recurrent LLM for Fast and Efficient Inference	Maximilian Beck et.al.	2503.13427	link
2025-03-17	SuperBPE: Space Travel for Language Models	Alisa Liu et.al.	2503.13423	null
2025-03-17	A Comprehensive Survey on Multi-Agent Cooperative Decision-Making: Scenarios, Approaches, Challenges and Perspectives	Weiqiang Jin et.al.	2503.13415	null
2025-03-18	DLPO: Towards a Robust, Efficient, and Generalizable Prompt Optimization Framework from a Deep-Learning Perspective	Dengyun Peng et.al.	2503.13413	link
2025-03-17	Using the Tools of Cognitive Science to Understand Large Language Models at Different Levels of Analysis	Alexander Ku et.al.	2503.13401	null
2025-03-17	MicroVQA: A Multimodal Reasoning Benchmark for Microscopy-Based Scientific Research	James Burgess et.al.	2503.13399	link
2025-03-17	Aligned Probing: Relating Toxic Behavior and Model Internals	Andreas Waldis et.al.	2503.13390	null
2025-03-17	Cream of the Crop: Harvesting Rich, Scalable and Transferable Multi-Modal Data for Instruction Fine-Tuning	Mengyao Lyu et.al.	2503.13383	null
2025-03-17	Sightation Counts: Leveraging Sighted User Feedback in Building a BLV-aligned Dataset of Diagram Descriptions	Wan Ju Kang et.al.	2503.13369	null
2025-03-17	Mitigating Visual Forgetting via Take-along Visual Conditioning for Multi-modal Long CoT Reasoning	Hai-Long Sun et.al.	2503.13360	null
2025-03-17	Agents Play Thousands of 3D Video Games	Zhongwen Xu et.al.	2503.13356	null
2025-03-17	Valid Text-to-SQL Generation with Unification-based DeepStochLog	Ying Jiao et.al.	2503.13342	link
2025-03-17	LearnMate: Enhancing Online Education with LLM-Powered Personalized Learning Plans and Support	Xinyu Jessica Wang et.al.	2503.13340	null
2025-03-17	Reliable and Efficient Amortized Model-based Evaluation	Sang Truong et.al.	2503.13335	null
2025-03-14	Tit-for-Tat: Safeguarding Large Vision-Language Models Against Jailbreak Attacks via Adversarial Defense	Shuyang Hao et.al.	2503.11619	null
2025-03-14	ASMA-Tune: Unlocking LLMs' Assembly Code Comprehension via Structural-Semantic Instruction Tuning	Xinyi Wang et.al.	2503.11617	link
2025-03-14	Rethinking Few-Shot Adaptation of Vision-Language Models in Two Stages	Matteo Farina et.al.	2503.11609	link
2025-03-14	Do Construction Distributions Shape Formal Language Learning In German BabyLMs?	Bastian Bunzeck et.al.	2503.11593	null
2025-03-14	Pathology Image Compression with Pre-trained Autoencoders	Srikar Yellapragada et.al.	2503.11591	null
2025-03-14	Broaden your SCOPE! Efficient Multi-turn Conversation Planning for LLMs using Semantic Space	Zhiliang Chen et.al.	2503.11586	link
2025-03-14	SmolDocling: An ultra-compact vision-language model for end-to-end multi-modal document conversion	Ahmed Nassar et.al.	2503.11576	null
2025-03-14	Synthesizing Access Control Policies using Large Language Models	Adarsh Vatsa et.al.	2503.11573	null
2025-03-14	Implicit Bias-Like Patterns in Reasoning Models	Messi H. J. Lee et.al.	2503.11572	null
2025-03-14	VERIFY: A Benchmark of Visual Explanation and Reasoning for Investigating Multimodal Reasoning Fidelity	Jing Bi et.al.	2503.11557	null
2025-03-14	Similarity-Aware Token Pruning: Your VLM but Faster	Ahmadreza Jeddi et.al.	2503.11549	link
2025-03-14	Potential of large language model-powered nudges for promoting daily water and energy conservation	Zonghan Li et.al.	2503.11531	null
2025-03-14	Exploring Typographic Visual Prompts Injection Threats in Cross-Modality Generation Models	Hao Cheng et.al.	2503.11519	null
2025-03-14	HiTVideo: Hierarchical Tokenizers for Enhancing Text-to-Video Generation with Autoregressive Large Language Models	Ziqin Zhou et.al.	2503.11513	null
2025-03-14	V-STaR: Benchmarking Video-LLMs on Video Spatio-Temporal Reasoning	Zixu Cheng et.al.	2503.11495	null
2025-03-14	A Review of DeepSeek Models' Key Innovative Techniques	Chengen Wang et.al.	2503.11486	null
2025-03-14	Integrating LLMs in Gamified Systems	Carlos J. Costa et.al.	2503.11458	null
2025-03-14	D3: Diversity, Difficulty, and Dependability-Aware Data Selection for Sample-Efficient LLM Instruction Tuning	Jia Zhang et.al.	2503.11441	null
2025-03-14	Text Compression for Efficient Language Generation	David Gu et.al.	2503.11426	null
2025-03-14	Empowering Time Series Analysis with Synthetic Data: A Survey and Outlook in the Era of Foundation Models	Xu Liu et.al.	2503.11411	null
2025-03-13	GoT: Unleashing Reasoning Capability of Multimodal Large Language Model for Visual Generation and Editing	Rongyao Fang et.al.	2503.10639	link
2025-03-13	A Frustratingly Simple Yet Highly Effective Attack Baseline: Over 90% Success Rate Against the Strong Black-box Models of GPT-4.5/4o/o1	Zhaoyi Li et.al.	2503.10635	link
2025-03-13	HybridVLA: Collaborative Diffusion and Autoregression in a Unified Vision-Language-Action Model	Jiaming Liu et.al.	2503.10631	null
2025-03-13	UniGoal: Towards Universal Zero-shot Goal-oriented Navigation	Hang Yin et.al.	2503.10630	null
2025-03-13	Transformers without Normalization	Jiachen Zhu et.al.	2503.10622	null
2025-03-13	From TOWER to SPIRE: Adding the Speech Modality to a Text-Only LLM	Kshitij Ambilduke et.al.	2503.10620	link
2025-03-13	Siege: Autonomous Multi-Turn Jailbreaking of Large Language Models with Tree Search	Andy Zhou et.al.	2503.10619	null
2025-03-13	Compositional Subspace Representation Fine-tuning for Adaptive Large Language Models	Andy Zhou et.al.	2503.10617	null
2025-03-13	R1-Onevision: Advancing Generalized Multimodal Reasoning through Cross-Modal Formalization	Yi Yang et.al.	2503.10615	link
2025-03-13	CoSTA $\ast$ : Cost-Sensitive Toolpath Agent for Multi-turn Image Editing	Advait Gupta et.al.	2503.10613	link
2025-03-13	TruthPrInt: Mitigating LVLM Object Hallucination Via Latent Truthful-Guided Pre-Intervention	Jinhao Duan et.al.	2503.10602	link
2025-03-13	GroundingSuite: Measuring Complex Multi-Granular Pixel Grounding	Rui Hu et.al.	2503.10596	link
2025-03-13	Unlock the Power of Unlabeled Data in Language Driving Model	Chaoqun Wang et.al.	2503.10586	null
2025-03-13	VisualWebInstruct: Scaling up Multimodal Instruction Data through Web Search	Yiming Jia et.al.	2503.10582	null
2025-03-13	Unveiling the Mathematical Reasoning in DeepSeek Models: A Comparative Study of Large Language Models	Afrar Jahin et.al.	2503.10573	null
2025-03-13	ASIDE: Architectural Separation of Instructions and Data in Language Models	Egor Zverev et.al.	2503.10566	null
2025-03-13	Short-term AI literacy intervention does not reduce over-reliance on incorrect ChatGPT recommendations	Brett Puppart et.al.	2503.10556	null
2025-03-13	KUDA: Keypoints to Unify Dynamics Learning and Visual Prompting for Open-Vocabulary Robotic Manipulation	Zixian Liu et.al.	2503.10546	null
2025-03-13	DP-GPL: Differentially Private Graph Prompt Learning	Jing Xu et.al.	2503.10544	null
2025-03-13	Language Models, Graph Searching, and Supervision Adulteration: When More Supervision is Less and How to Make More More	Arvid Frydenlund et.al.	2503.10542	null
2025-03-12	MoC: Mixtures of Text Chunking Learners for Retrieval-Augmented Generation System	Jihao Zhao et.al.	2503.09600	link
2025-03-12	How to Protect Yourself from 5G Radiation? Investigating LLM Responses to Implicit Misinformation	Ruohao Guo et.al.	2503.09598	link
2025-03-12	SimLingo: Vision-Only Closed-Loop Autonomous Driving with Language-Action Alignment	Katrin Renz et.al.	2503.09594	null
2025-03-12	BIMBA: Selective-Scan Compression for Long-Range Video Question Answering	Md Mohaiminul Islam et.al.	2503.09590	link
2025-03-12	Cost-Optimal Grouped-Query Attention for Long-Context LLMs	Yingfa Chen et.al.	2503.09579	link
2025-03-12	Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models	Marianne Arriola et.al.	2503.09573	link
2025-03-12	Plan-and-Act: Improving Planning of Agents for Long-Horizon Tasks	Lutfi Eren Erdogan et.al.	2503.09572	null
2025-03-13	Towards Reasoning Era: A Survey of Long Chain-of-Thought for Reasoning Large Language Models	Qiguang Chen et.al.	2503.09567	null
2025-03-12	PolyPythias: Stability and Outliers across Fifty Language Model Pre-Training Runs	Oskar van der Wal et.al.	2503.09543	link
2025-03-13	Large Language Models for Multi-Facility Location Mechanism Design	Nguyen Thach et.al.	2503.09533	null
2025-03-13	SAEBench: A Comprehensive Benchmark for Sparse Autoencoders in Language Model Interpretability	Adam Karvonen et.al.	2503.09532	null
2025-03-12	Search-R1: Training LLMs to Reason and Leverage Search Engines with Reinforcement Learning	Bowen Jin et.al.	2503.09516	link
2025-03-12	Reinforcement Learning is all You Need	Yongsheng Lian et.al.	2503.09512	null
2025-03-12	ReMA: Learning to Meta-think for LLMs with Multi-Agent Reinforcement Learning	Ziyu Wan et.al.	2503.09501	link
2025-03-12	MindGYM: Enhancing Vision-Language Models via Synthetic Self-Challenging Questions	Zhe Xu et.al.	2503.09499	link
2025-03-12	Parameter-Efficient Adaptation of Geospatial Foundation Models through Embedding Deflection	Romain Thoreau et.al.	2503.09493	null
2025-03-12	Project-Probe-Aggregate: Efficient Fine-Tuning for Group Robustness	Beier Zhu et.al.	2503.09487	null
2025-03-12	BAMBI: Developing Baby Language Models for Italian	Alice Suozzi et.al.	2503.09481	null
2025-03-12	SurgicalVLM-Agent: Towards an Interactive AI Co-Pilot for Pituitary Surgery	Jiayuan Huang et.al.	2503.09474	null
2025-03-12	Explicit Learning and the LLM in Machine Translation	Malik Marmonier et.al.	2503.09454	link
2025-03-11	QuoTA: Query-oriented Token Assignment via CoT Query Decouple for Long Video Comprehension	Yongdong Luo et.al.	2503.08689	link
2025-03-11	Randomness, Not Representation: The Unreliability of Evaluating Cultural Alignment in LLMs	Ariba Khan et.al.	2503.08688	link
2025-03-11	Perplexity Trap: PLM-Based Retrievers Overrate Low Perplexity Documents	Haoyu Wang et.al.	2503.08684	link
2025-03-11	Self-Taught Self-Correction for Small Language Models	Viktor Moskvoretskii et.al.	2503.08681	null
2025-03-11	Understanding and Mitigating Distribution Shifts For Machine Learning Force Fields	Tobias Kreiman et.al.	2503.08674	null
2025-03-11	Generating Robot Constitutions & Benchmarks for Semantic Safety	Pierre Sermanet et.al.	2503.08663	null
2025-03-11	Exploring the Word Sense Disambiguation Capabilities of Large Language Models	Pierpaolo Basile et.al.	2503.08662	null
2025-03-11	YuE: Scaling Open Foundation Models for Long-Form Music Generation	Ruibin Yuan et.al.	2503.08638	link
2025-03-11	LightGen: Efficient Image Generation through Knowledge Distillation and Direct Preference Optimization	Xianfeng Wu et.al.	2503.08619	link
2025-03-11	EMMOE: A Comprehensive Benchmark for Embodied Mobile Manipulation in Open Environments	Dongping Li et.al.	2503.08604	link
2025-03-11	NSF-SciFy: Mining the NSF Awards Database for Scientific Claims	Delip Rao et.al.	2503.08600	null
2025-03-11	Proc4Gem: Foundation models for physical agency through procedural generation	Yixin Lin et.al.	2503.08593	null
2025-03-11	BiasEdit: Debiasing Stereotyped Language Models via Model Editing	Xin Xu et.al.	2503.08588	link
2025-03-11	HierarQ: Task-Aware Hierarchical Q-Former for Enhanced Video Understanding	Shehreen Azad et.al.	2503.08585	null
2025-03-11	RAG-Adapter: A Plug-and-Play RAG-enhanced Framework for Long Video Understanding	Xichen Tan et.al.	2503.08576	null
2025-03-11	DeepReview: Improving LLM-based Paper Review with Human-like Deep Thinking Process	Minjun Zhu et.al.	2503.08569	null
2025-03-11	Reasoning and Sampling-Augmented MCQ Difficulty Prediction via LLMs	Wanyong Feng et.al.	2503.08551	null
2025-03-11	Transferring Extreme Subword Style Using Ngram Model-Based Logit Scaling	Craig Messner et.al.	2503.08550	null
2025-03-11	Graph of AI Ideas: Leveraging Knowledge Graphs and LLMs for AI Research Idea Generation	Xian Gao et.al.	2503.08549	null
2025-03-11	TLA: Tactile-Language-Action Model for Contact-Rich Manipulation	Peng Hao et.al.	2503.08548	null
2025-03-10	Robusto-1 Dataset: Comparing Humans and VLMs on real out-of-distribution Autonomous Driving VQA from Peru	Dunant Cusipuma et.al.	2503.07587	null
2025-03-10	Talking to GDELT Through Knowledge Graphs	Audun Myers et.al.	2503.07584	null
2025-03-10	VisBias: Measuring Explicit and Implicit Social Biases in Vision Language Models	Jen-tse Huang et.al.	2503.07575	link
2025-03-10	AutoSpatial: Visual-Language Reasoning for Social Robot Navigation through Efficient Spatial Reasoning Learning	Yangzhe Kong et.al.	2503.07557	null
2025-03-10	Junior Software Developers' Perspectives on Adopting LLMs for Software Engineering: a Systematic Literature Review	Samuel Ferino et.al.	2503.07556	null
2025-03-10	KSOD: Knowledge Supplement for LLMs On Demand	Haoran Li et.al.	2503.07550	null
2025-03-10	Bi-Directional Mental Model Reconciliation for Human-Robot Interaction with Large Language Models	Nina Moorman et.al.	2503.07547	null
2025-03-10	Queueing, Predictions, and LLMs: Challenges and Open Problems	Michael Mitzenmacher et.al.	2503.07545	null
2025-03-10	XIFBench: Evaluating Large Language Models on Multilingual Instruction Following	Zhenyu Li et.al.	2503.07539	null
2025-03-10	Building English ASR model with regional language support	Purvi Agrawal et.al.	2503.07522	null
2025-03-10	GRITHopper: Decomposition-Free Multi-Hop Dense Retrieval	Justus-Jonas Erker et.al.	2503.07519	link
2025-03-10	TokenButler: Token Importance is Predictable	Yash Akhauri et.al.	2503.07518	link
2025-03-10	Language Models Fail to Introspect About Their Knowledge of Language	Siyuan Song et.al.	2503.07513	link
2025-03-10	Plume: Scaffolding Text Composition in Dashboards	Maxim Lisnic et.al.	2503.07512	null
2025-03-10	Sometimes the Model doth Preach: Quantifying Religious Bias in Open LLMs through Demographic Analysis in Asian Nations	Hari Shankar et.al.	2503.07510	link
2025-03-10	Think Before You Segment: High-Quality Reasoning Segmentation with GPT Chain of Thoughts	Shiu-hong Kao et.al.	2503.07503	null
2025-03-10	V2Flow: Unifying Visual Tokenization and Large Language Model Vocabularies for Autoregressive Image Generation	Guiwei Zhang et.al.	2503.07493	link
2025-03-10	LLaVA-RadZ: Can Multimodal Large Language Models Effectively Tackle Zero-shot Radiology Recognition?	Bangyan Li et.al.	2503.07487	null
2025-03-10	Chameleon: Fast-slow Neuro-symbolic Lane Topology Extraction	Zongzheng Zhang et.al.	2503.07485	link
2025-03-10	VLRMBench: A Comprehensive and Challenging Benchmark for Vision-Language Reward Models	Jiacheng Ruan et.al.	2503.07478	link
2025-03-07	Fairness-Aware Low-Rank Adaptation Under Demographic Privacy Constraints	Parameswaran Kamalaruban et.al.	2503.05684	null
2025-03-07	Understanding the Limits of Lifelong Knowledge Editing in LLMs	Lukas Thede et.al.	2503.05683	null
2025-03-07	A Survey of Large Language Model Empowered Agents for Recommendation and Search: Towards Next-Generation Information Retrieval	Yu Zhang et.al.	2503.05659	link
2025-03-07	Learning LLM Preference over Intra-Dialogue Pairs: A Framework for Utterance-level Understandings	Xuanqing Liu et.al.	2503.05620	null
2025-03-07	A Survey on Sparse Autoencoders: Interpreting the Internal Mechanisms of Large Language Models	Dong Shu et.al.	2503.05613	null
2025-03-07	From Theory to Application: A Practical Introduction to Neural Operators in Scientific Computing	Prashant K. Jha et.al.	2503.05598	link
2025-03-07	R1-Searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning	Huatong Song et.al.	2503.05592	null
2025-03-07	Quantifying the Robustness of Retrieval-Augmented Language Models Against Spurious Features in Grounding Data	Shiping Yang et.al.	2503.05587	null
2025-03-07	Evaluating open-source Large Language Models for automated fact-checking	Nicolo' Fontana et.al.	2503.05565	null
2025-03-07	Revitalizing Saturated Benchmarks: A Weighted Metric Approach for Differentiating Large Language Model Performance	Bryan Etzine et.al.	2503.05551	null
2025-03-07	Leveraging Approximate Caching for Faster Retrieval-Augmented Generation	Shai Bergman et.al.	2503.05530	null
2025-03-07	PoSSUM: A Protocol for Surveying Social-media Users with Multimodal LLMs	Roberto Cerina et.al.	2503.05529	null
2025-03-07	Cognitive Bias Detection Using Advanced Prompt Engineering	Frederic Lemieux et.al.	2503.05516	null
2025-03-07	Grammar-Based Code Representation: Is It a Worthy Pursuit for LLMs?	Qingyuan Liang et.al.	2503.05507	null
2025-03-07	Statistical Guarantees of Correctness Coverage for Medical Multiple-Choice Question Answering	Yusong Ke et.al.	2503.05505	null
2025-03-07	Benchmarking LLMs in Recommendation Tasks: A Comparative Evaluation with Conventional Recommenders	Qijiong Liu et.al.	2503.05493	null
2025-03-07	Maximum Hallucination Standards for Domain-Specific Large Language Models	Tingmingke Lu et.al.	2503.05481	null
2025-03-07	The Society of HiveMind: Multi-Agent Optimization of Foundation Model Swarms to Unlock the Potential of Collective Intelligence	Noah Mamie et.al.	2503.05473	null
2025-03-07	Soft Policy Optimization: Online Off-Policy RL for Sequence Models	Taco Cohen et.al.	2503.05453	null
2025-03-07	LLM-based Iterative Approach to Metamodeling in Automotive	Nenad Petrovic et.al.	2503.05449	null
2025-03-06	L $^2$ M: Mutual Information Scaling Law for Long-Context Language Modeling	Zhuo Chen et.al.	2503.04725	link
2025-03-06	LLMVoX: Autoregressive Streaming Text-to-Speech Model for Any LLM	Sambal Shikhar et.al.	2503.04724	null
2025-03-07	Shifting Long-Context LLMs Research from Input to Output	Yuhao Wu et.al.	2503.04723	null
2025-03-06	Enough Coin Flips Can Make LLMs Act Bayesian	Ritwik Gupta et.al.	2503.04722	null
2025-03-06	Full-Duplex-Bench: A Benchmark to Evaluate Full-duplex Spoken Dialogue Models on Turn-taking Capabilities	Guan-Ting Lin et.al.	2503.04721	link
2025-03-06	Predictable Scale: Part I -- Optimal Hyperparameter Scaling Law in Large Language Model Pretraining	Houyi Li et.al.	2503.04715	null
2025-03-06	Scaling Rich Style-Prompted Text-to-Speech Datasets	Anuj Diwan et.al.	2503.04713	link
2025-03-06	Universality of Layer-Level Entropy-Weighted Quantization Beyond Model Architecture and Size	Alireza Behtash et.al.	2503.04704	null
2025-03-06	L1: Controlling How Long A Reasoning Model Thinks With Reinforcement Learning	Pranjal Aggarwal et.al.	2503.04697	null
2025-03-06	UIPE: Enhancing LLM Unlearning by Removing Knowledge Related to Forgetting Targets	Wenyu Wang et.al.	2503.04693	null
2025-03-06	Quantifying the Reasoning Abilities of LLMs on Real-world Clinical Cases	Pengcheng Qiu et.al.	2503.04691	null
2025-03-06	LLM-guided Plan and Retrieval: A Strategic Alignment for Interpretable User Satisfaction Estimation in Dialogue	Sangyeop Kim et.al.	2503.04675	null
2025-03-06	An Information-theoretic Multi-task Representation Learning Framework for Natural Language Understanding	Dou Hu et.al.	2503.04667	link
2025-03-06	CLDyB: Towards Dynamic Benchmarking for Continual Learning with Pre-trained Models	Shengzhuang Chen et.al.	2503.04655	link
2025-03-06	Transferable Foundation Models for Geometric Tasks on Point Cloud Representations: Geometric Neural Operators	Blaine Quackenbush et.al.	2503.04649	link
2025-03-06	Implicit Cross-Lingual Rewarding for Efficient Multilingual Preference Alignment	Wen Yang et.al.	2503.04647	null
2025-03-06	Enhancing SAM with Efficient Prompting and Preference Optimization for Semi-supervised Medical Image Segmentation	Aishik Konwer et.al.	2503.04639	null
2025-03-06	Mark Your LLM: Detecting the Misuse of Open-Source Large Language Models via Watermarking	Yijie Xu et.al.	2503.04636	null
2025-03-06	Better Process Supervision with Bi-directional Rewarding Signals	Wenxiang Chen et.al.	2503.04618	null
2025-03-06	Towards Data-Efficient Language Models: A Child-Inspired Approach to Language Learning	Mohammad Amin Ghanizadeh et.al.	2503.04611	null
2025-03-05	The MASK Benchmark: Disentangling Honesty From Accuracy in AI Systems	Richard Ren et.al.	2503.03750	null
2025-03-05	Process-based Self-Rewarding Language Models	Shimao Zhang et.al.	2503.03746	link
2025-03-05	CHOP: Mobile Operating Assistant with Constrained High-frequency Optimized Subtask Planning	Yuqi Zhou et.al.	2503.03743	link
2025-03-05	Towards Understanding Distilled Reasoning Models: A Representational Approach	David D. Baek et.al.	2503.03730	null
2025-03-05	Improving LLM Safety Alignment with Dual-Objective Optimization	Xuandong Zhao et.al.	2503.03710	link
2025-03-05	Effective LLM Knowledge Learning via Model Generalization	Mingkang Zhu et.al.	2503.03705	null
2025-03-05	A Practical Memory Injection Attack against LLM Agents	Shen Dong et.al.	2503.03704	null
2025-03-05	Developing and Utilizing a Large-Scale Cantonese Dataset for Multi-Tasking in Large Language Models	Jiyue Jiang et.al.	2503.03702	null
2025-03-05	Addressing Overprescribing Challenges: Fine-Tuning Large Language Models for Medication Recommendation Tasks	Zihao Zhao et.al.	2503.03687	link
2025-03-05	Attentive Reasoning Queries: A Systematic Method for Optimizing Instruction-Following in Large Language Models	Bar Karov et.al.	2503.03669	link
2025-03-05	Analogical Reasoning Inside Large Language Models: Concept Vectors and the Limits of Abstraction	Gustaw Opiełka et.al.	2503.03666	link
2025-03-05	Robust Learning of Diverse Code Edits	Tushar Aggarwal et.al.	2503.03656	null
2025-03-05	Improving Neutral Point of View Text Generation through Parameter-Efficient Reinforcement Learning and a Small-Scale High-Quality Dataset	Jessica Hoffmann et.al.	2503.03654	null
2025-03-05	Token-Level Privacy in Large Language Models	Re'em Harel et.al.	2503.03652	null
2025-03-05	Psy-Copilot: Visual Chain of Thought for Counseling	Keqi Chen et.al.	2503.03645	null
2025-03-05	Large language models in finance: estimating financial sentiment for stock prediction	Kemal Kirtac et.al.	2503.03612	null
2025-03-05	Enhancing the Accuracy and Comprehensibility in Architectural Tactics Detection via Small Model-Augmented Prompt Engineering	Lingli Cao et.al.	2503.03609	link
2025-03-05	Psy-Insight: Explainable Multi-turn Bilingual Dataset for Mental Health Counseling	Keqi Chen et.al.	2503.03607	null
2025-03-05	Feature-Level Insights into Artificial Text Detection with Sparse Autoencoders	Kristian Kuznetsov et.al.	2503.03601	null
2025-03-05	Small but Mighty: Enhancing Time Series Forecasting with Lightweight LLMs	Haoran Fan et.al.	2503.03594	link
2025-03-04	Wikipedia in the Era of LLMs: Evolution and Risks	Siming Huang et.al.	2503.02879	link
2025-03-04	Language Models can Self-Improve at State-Value Estimation for Better Search	Ethan Mendes et.al.	2503.02878	link
2025-03-04	SPIDER: A Comprehensive Multi-Organ Supervised Pathology Dataset and Baseline Models	Dmitry Nechaev et.al.	2503.02876	link
2025-03-04	The First Few Tokens Are All You Need: An Efficient and Effective Unsupervised Prefix Fine-Tuning Method for Reasoning Models	Ke Ji et.al.	2503.02875	null
2025-03-04	Prompting Generative AI with Interaction-Augmented Instructions	Leixian Shen et.al.	2503.02874	null
2025-03-04	FairSense-AI: Responsible AI Meets Sustainability	Shaina Raza et.al.	2503.02865	null
2025-03-04	Calibrating LLM Confidence with Semantic Steering: A Multi-Prompt Aggregation Framework	Ziang Zhou et.al.	2503.02863	null
2025-03-04	Privacy and Accuracy-Aware AI/ML Model Deduplication	Hong Guan et.al.	2503.02862	null
2025-03-04	(How) Do Language Models Track State?	Belinda Z. Li et.al.	2503.02854	null
2025-03-04	Shakespearean Sparks: The Dance of Hallucination and Creativity in LLMs' Decoding Layers	Zicong He et.al.	2503.02851	link
2025-03-04	Mask-DPO: Generalizable Fine-grained Factuality Alignment of LLMs	Yuzhe Gu et.al.	2503.02846	link
2025-03-04	Beyond Cosine Decay: On the effectiveness of Infinite Learning Rate Schedule for Continual Pre-training	Paul Janson et.al.	2503.02844	null
2025-03-04	AlignDistil: Token-Level Language Model Alignment as Adaptive Policy Distillation	Songming Zhang et.al.	2503.02832	null
2025-03-04	Developing a PET/CT Foundation Model for Cross-Modal Anatomical and Functional Imaging	Yujin Oh et.al.	2503.02824	null
2025-03-04	"What If Smart Homes Could See Our Homes?": Exploring DIY Smart Home Building Experiences with VLM-Based Camera Sensors	Sojeong Yun et.al.	2503.02816	null
2025-03-04	Q-Filters: Leveraging QK Geometry for Efficient KV Cache Compression	Nathan Godey et.al.	2503.02812	link
2025-03-04	RAAD-LLM: Adaptive Anomaly Detection Using LLMs and RAG Integration	Alicia Russell-Gilbert et.al.	2503.02800	null
2025-03-04	Multimodal AI predicts clinical outcomes of drug combinations from preclinical data	Yepeng Huang et.al.	2503.02781	link
2025-03-04	Implicit Bias in LLMs: A Survey	Xinru Lin et.al.	2503.02776	null
2025-03-04	InSerter: Speech Instruction Following with Unsupervised Interleaved Pre-training	Dingdong Wang et.al.	2503.02769	null
2025-02-28	LLM Post-Training: A Deep Dive into Reasoning Large Language Models	Komal Kumar et.al.	2502.21321	link
2025-02-28	Raccoon: Multi-stage Diffusion Training with Coarse-to-Fine Curating Videos	Zhiyu Tan et.al.	2502.21314	null
2025-02-28	FANformer: Improving Large Language Models Through Effective Periodicity Modeling	Yihong Dong et.al.	2502.21309	link
2025-02-28	Contextualizing biological perturbation experiments through language	Menghua Wu et.al.	2502.21290	link
2025-02-28	Adaptive Keyframe Sampling for Long Video Understanding	Xi Tang et.al.	2502.21271	null
2025-03-03	Foundation Models -- A Panacea for Artificial Intelligence in Pathology?	Nita Mulliqi et.al.	2502.21264	null
2025-02-28	Modeling Human Beliefs about AI Behavior for Scalable Oversight	Leon Lang et.al.	2502.21262	null
2025-02-28	PET Image Denoising via Text-Guided Diffusion: Integrating Anatomical Priors through Text Prompts	Boxiao Yu et.al.	2502.21260	null
2025-02-28	RoboBrain: A Unified Brain Model for Robotic Manipulation from Abstract to Concrete	Yuheng Ji et.al.	2502.21257	null
2025-02-28	TimesBERT: A BERT-Style Foundation Model for Time Series Understanding	Haoran Zhang et.al.	2502.21245	null
2025-02-28	Semantic Volume: Quantifying and Detecting both External and Internal Uncertainty in LLMs	Xiaomin Li et.al.	2502.21239	null
2025-02-28	Transforming Tuberculosis Care: Optimizing Large Language Models For Enhanced Clinician-Patient Communication	Daniil Filienko et.al.	2502.21236	null
2025-02-28	ByteScale: Efficient Scaling of LLM Training with a 2048K Context Length on More Than 12,000 GPUs	Hao Ge et.al.	2502.21231	null
2025-03-03	ECLeKTic: a Novel Challenge Set for Evaluation of Cross-Lingual Knowledge Transfer	Omer Goldman et.al.	2502.21228	null
2025-02-28	Transformers Learn to Implement Multi-step Gradient Descent with Chain of Thought	Jianhao Huang et.al.	2502.21212	null
2025-02-28	Chronologically Consistent Large Language Models	Songrun He et.al.	2502.21206	null
2025-02-28	$Δ$ -model correction of Foundation Model based on the models own understanding	Mads-Peter Verner Christiansen et.al.	2502.21179	null
2025-03-03	Causality Is Key to Understand and Balance Multiple Goals in Trustworthy ML and Foundation Models	Ruta Binkyte et.al.	2502.21123	null
2025-02-28	Optimizing Large Language Models for ESG Activity Detection in Financial Texts	Mattia Birti et.al.	2502.21112	link
2025-02-28	Large Language Model-Based Benchmarking Experiment Settings for Evolutionary Multi-Objective Optimization	Lie Meng Pang et.al.	2502.21108	null
2025-02-27	R2-T2: Re-Routing in Test-Time for Multimodal Mixture-of-Experts	Zhongyang Li et.al.	2502.20395	link
2025-02-27	Why Are Web AI Agents More Vulnerable Than Standalone LLMs? A Security Analysis	Jeffrey Yang Fan Chiang et.al.	2502.20383	null
2025-02-27	Multi-Agent Verification: Scaling Test-Time Compute with Multiple Verifiers	Shalev Lifshitz et.al.	2502.20379	null
2025-02-27	PhantomWiki: On-Demand Datasets for Reasoning and Retrieval Evaluation	Albert Gong et.al.	2502.20377	link
2025-02-27	Bridging Legal Knowledge and AI: Retrieval-Augmented Generation with Vector Stores, Knowledge Graphs, and Hierarchical Non-negative Matrix Factorization	Ryan C. Barron et.al.	2502.20364	link
2025-02-27	Bridging the Creativity Understanding Gap: Small-Scale Human Alignment Enables Expert-Level Humor Ranking in LLMs	Kuan Lok Zhou et.al.	2502.20356	null
2025-02-27	KEDRec-LM: A Knowledge-distilled Explainable Drug Recommendation Large Language Model	Kai Zhang et.al.	2502.20350	null
2025-02-27	Sparse Auto-Encoder Interprets Linguistic Features in Large Language Models	Yi Jing et.al.	2502.20344	null
2025-02-27	Thinking Slow, Fast: Scaling Inference Compute with Distilled Reasoners	Daniele Paliotta et.al.	2502.20339	null
2025-02-27	Expertise Is What We Want	Alan Ashworth et.al.	2502.20335	null
2025-02-27	Emergent Symbolic Mechanisms Support Abstract Reasoning in Large Language Models	Yukang Yang et.al.	2502.20332	null
2025-02-27	Long-Context Inference with Retrieval-Augmented Speculative Decoding	Guanzheng Chen et.al.	2502.20330	link
2025-02-27	LangProBe: a Language Programs Benchmark	Shangyin Tan et.al.	2502.20315	null
2025-02-27	EAIRA: Establishing a Methodology for Evaluating AI Models as Scientific Research Assistants	Franck Cappello et.al.	2502.20309	link
2025-02-27	M^3Builder: A Multi-Agent System for Automated Machine Learning in Medical Imaging	Jinghao Feng et.al.	2502.20301	null
2025-02-27	An exploration of features to improve the generalisability of fake news detection models	Nathaniel Hoy et.al.	2502.20299	null
2025-02-27	Judge a Book by its Cover: Investigating Multi-Modal LLMs for Multi-Page Handwritten Document Transcription	Benjamin Gutteridge et.al.	2502.20295	link
2025-02-27	Visual Adaptive Prompting for Compositional Zero-Shot Learning	Kyle Stein et.al.	2502.20292	null
2025-02-27	Conformal Tail Risk Control for Large Language Model Alignment	Catherine Yu-Chi Chen et.al.	2502.20285	null
2025-02-27	Evaluating Human Trust in LLM-Based Planners: A Preliminary Study	Shenghui Chen et.al.	2502.20284	null
2025-02-26	Hi Robot: Open-Ended Instruction Following with Hierarchical Vision-Language-Action Models	Lucy Xiaoyang Shi et.al.	2502.19417	null
2025-02-26	Norm Growth and Stability Challenges in Localized Sequential Knowledge Editing	Akshat Gupta et.al.	2502.19416	null
2025-02-26	Can Language Models Falsify? Evaluating Algorithmic Reasoning with Counterexample Creation	Shiven Sinha et.al.	2502.19414	link
2025-02-26	Project Alexandria: Towards Freeing Scientific Knowledge from Copyright Burdens via LLMs	Christoph Schuhmann et.al.	2502.19413	null
2025-02-26	Code to Think, Think to Code: A Survey on Code-Enhanced Reasoning and Reasoning-Driven Code Intelligence in LLMs	Dayu Yang et.al.	2502.19411	link
2025-02-26	Less or More: Towards Glanceable Explanations for LLM Recommendations Using Ultra-Small Devices	Xinru Wang et.al.	2502.19410	null
2025-02-26	ImageChain: Advancing Sequential Image-to-Text Reasoning in Multimodal Large Language Models	Danae Sánchez Villegas et.al.	2502.19409	null
2025-02-26	Learning Code-Edit Embedding to Model Student Debugging Behavior	Hasnain Heickal et.al.	2502.19407	null
2025-02-26	General Reasoning Requires Learning to Reason from the Get-go	Seungwook Han et.al.	2502.19402	null
2025-02-26	TheoremExplainAgent: Towards Multimodal Explanations for LLM Theorem Understanding	Max Ku et.al.	2502.19400	null
2025-02-26	LiDAR Registration with Visual Foundation Models	Niclas Vödisch et.al.	2502.19374	null
2025-02-26	Deep Learning For Time Series Analysis With Application On Human Motion	Ali Ismail-Fawaz et.al.	2502.19364	null
2025-02-26	DataMan: Data Manager for Pre-training Large Language Models	Ru Peng et.al.	2502.19363	null
2025-02-26	Can Large Language Models Detect Errors in Long Chain-of-Thought Reasoning?	Yancheng He et.al.	2502.19361	link
2025-02-26	Controlled Diversity: Length-optimized Natural Language Generation	Diana Marie Schenke et.al.	2502.19347	null
2025-02-26	Evaluating LLMs and Pre-trained Models for Text Summarization Across Diverse Datasets	Tohida Rehman et.al.	2502.19339	null
2025-02-26	I Know What I Don't Know: Improving Model Cascades Through Confidence Tuning	Stephan Rabanser et.al.	2502.19335	null
2025-02-26	Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems	Hao Peng et.al.	2502.19328	link
2025-02-26	Shh, don't say that! Domain Certification in LLMs	Cornelius Emde et.al.	2502.19320	null
2025-02-26	Rethinking LLM Unlearning Objectives: A Gradient Perspective and Go Beyond	Qizhou Wang et.al.	2502.19301	null
2025-02-25	DRAMA: Diverse Augmentation from Large Language Models to Smaller Dense Retrievers	Xueguang Ma et.al.	2502.18460	link
2025-02-25	LLM-Based Design Pattern Detection	Christian Schindler et.al.	2502.18458	null
2025-02-25	Evaluating the Effectiveness of Small Language Models in Detecting Refactoring Bugs	Rohit Gheyi et.al.	2502.18454	null
2025-02-25	FRIDA to the Rescue! Analyzing Synthetic Data Effectiveness in Object-Based Common Sense Reasoning for Disaster Response	Mollie Shichman et.al.	2502.18452	null
2025-02-25	SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution	Yuxiang Wei et.al.	2502.18449	null
2025-02-25	olmOCR: Unlocking Trillions of Tokens in PDFs with Vision Language Models	Jake Poznanski et.al.	2502.18443	link
2025-02-25	MAPoRL: Multi-Agent Post-Co-Training for Collaborative Large Language Models with Reinforcement Learning	Chanwoo Park et.al.	2502.18439	null
2025-02-25	Reversal Blessing: Thinking Backward May Outpace Thinking Forward in Multi-choice Questions	Yizhe Zhang et.al.	2502.18435	null
2025-02-25	Exploring Gender Disparities in Automatic Speech Recognition Technology	Hend ElGhazaly et.al.	2502.18434	null
2025-02-25	TextGames: Learning to Self-Play Text-Based Puzzle Games via Language Model Reasoning	Frederikus Hudi et.al.	2502.18431	link
2025-02-25	PyEvalAI: AI-assisted evaluation of Jupyter Notebooks for immediate personalized feedback	Nils Wandel et.al.	2502.18425	null
2025-02-25	Compressing Language Models for Specialized Domains	Miles Williams et.al.	2502.18424	null
2025-02-25	Rank1: Test-Time Compute for Reranking in Information Retrieval	Orion Weller et.al.	2502.18418	link
2025-02-25	OmniAlign-V: Towards Enhanced Alignment of MLLMs with Human Preference	Xiangyu Zhao et.al.	2502.18411	link
2025-02-25	Enhancing DNA Foundation Models to Address Masking Inefficiencies	Monireh Safari et.al.	2502.18405	null
2025-02-25	Monte Carlo Temperature: a robust sampling strategy for LLM's uncertainty quantification methods	Nicola Cecere et.al.	2502.18389	null
2025-02-25	How Far are LLMs from Real Search? A Comprehensive Study on Efficiency, Completeness, and Inherent Capabilities	Minhua Lin et.al.	2502.18387	null
2025-02-25	MindMem: Multimodal for Predicting Advertisement Memorability Using LLMs and Deep Learning	Sepehr Asgarian et.al.	2502.18371	null
2025-02-25	Responsible AI Agents	Deven R. Desai et.al.	2502.18359	null
2025-02-25	Which Contributions Deserve Credit? Perceptions of Attribution in Human-AI Co-Creation	Jessica He et.al.	2502.18357	null
2025-02-24	Introducing Visual Perception Token into Multimodal Large Language Model	Runpeng Yu et.al.	2502.17425	link
2025-02-24	MLLMs Know Where to Look: Training-free Perception of Small Visual Details with Multimodal LLMs	Jiarui Zhang et.al.	2502.17422	link
2025-02-24	LongSpec: Long-Context Speculative Decoding with Efficient Drafting and Verification	Penghui Yang et.al.	2502.17421	link
2025-02-24	The Geometry of Refusal in Large Language Models: Concept Cones and Representational Independence	Tom Wollschläger et.al.	2502.17420	null
2025-02-24	From System 1 to System 2: A Survey of Reasoning Large Language Models	Zhong-Zhi Li et.al.	2502.17419	link
2025-02-24	Reasoning with Latent Thoughts: On the Power of Looped Transformers	Nikunj Saunshi et.al.	2502.17416	null
2025-02-24	COSMOS: A Hybrid Adaptive Optimizer for Memory-Efficient Training of LLMs	Liming Liu et.al.	2502.17410	link
2025-02-24	Large Language Models are Powerful EHR Encoders	Stefan Hegselmann et.al.	2502.17403	link
2025-02-24	Big-Math: A Large-Scale, High-Quality Math Dataset for Reinforcement Learning in Language Models	Alon Albalak et.al.	2502.17387	link
2025-02-24	Bridging Gaps in Natural Language Processing for Yorùbá: A Systematic Review of a Decade of Progress and Prospects	Toheeb A. Jimoh et.al.	2502.17364	null
2025-02-24	A Closer Look at TabPFN v2: Strength, Limitation, and Extension	Han-Jia Ye et.al.	2502.17361	null
2025-02-24	RELICT: A Replica Detection Framework for Medical Image Generation	Orhun Utku Aydin et.al.	2502.17360	link
2025-02-24	DIS-CO: Discovering Copyrighted Content in VLMs Training Data	André V. Duarte et.al.	2502.17358	link
2025-02-24	Distributional Scaling Laws for Emergent Capabilities	Rosie Zhao et.al.	2502.17356	null
2025-02-24	On Relation-Specific Neurons in Large Language Models	Yihong Liu et.al.	2502.17355	link
2025-02-24	How Scientists Use Large Language Models to Program	Gabrielle O'Brien et.al.	2502.17348	null
2025-02-24	Time series forecasting based on optimized LLM for fault prediction in distribution power grid insulators	João Pedro Matos-Carvalho et.al.	2502.17341	null
2025-02-24	Tokenized SAEs: Disentangling SAE Reconstructions	Thomas Dooms et.al.	2502.17332	null
2025-02-24	HIPPO: Enhancing the Table Understanding Capability of Large Language Models through Hybrid-Modal Preference Optimization	Zhenghao Liu et.al.	2502.17315	link
2025-02-24	`Generalization is hallucination' through the lens of tensor completions	Liang Ze Wong et.al.	2502.17305	null
2025-02-21	ELIP: Enhanced Visual-Language Foundation Models for Image Retrieval	Guanqi Zhan et.al.	2502.15682	null
2025-02-21	Privacy Ripple Effects from Adding or Removing Personal Information in Language Model Training	Jaydeep Borkar et.al.	2502.15680	link
2025-02-21	BOSS: Benchmark for Observation Space Shift in Long-Horizon Task	Yue Yang et.al.	2502.15679	null
2025-02-21	Testing the limits of fine-tuning to improve reasoning in vision language models	Luca M. Schulze Buschoff et.al.	2502.15678	null
2025-02-21	FLEKE: Federated Locate-then-Edit Knowledge Editing	Zongkai Zhao et.al.	2502.15677	link
2025-02-21	AutoToM: Automated Bayesian Inverse Planning and Model Discovery for Open-ended Theory of Mind	Zhining Zhang et.al.	2502.15676	link
2025-02-21	Almost AI, Almost Human: The Challenge of Detecting AI-Polished Writing	Shoumik Saha et.al.	2502.15666	link
2025-02-21	Machine-generated text detection prevents language model collapse	George Drayson et.al.	2502.15654	link
2025-02-21	Empowering LLMs with Logical Reasoning: A Comprehensive Survey	Fengxiang Cheng et.al.	2502.15652	null
2025-02-21	Steering into New Embedding Spaces: Analyzing Cross-Lingual Alignment Induced by Model Interventions in Multilingual Language Models	Anirudh Sundar et.al.	2502.15639	null
2025-02-21	Mantis: Lightweight Calibrated Foundation Model for User-Friendly Time Series Classification	Vasilii Feofanov et.al.	2502.15637	link
2025-02-21	The Relationship Between Reasoning and Performance in Large Language Models -- o3 (mini) Thinks Harder, Not Longer	Marthe Ballon et.al.	2502.15631	link
2025-02-21	Extraction multi-étiquettes de relations en utilisant des couches de Transformer	Ngoc Luyen Le et.al.	2502.15619	null
2025-02-21	Probe Pruning: Accelerating LLMs through Dynamic Pruning via Model-Probing	Qi Le et.al.	2502.15618	link
2025-02-21	PDeepPP:A Deep learning framework with Pretrained Protein language for peptide classification	Jixiu Zhai et.al.	2502.15610	link
2025-02-21	On the Robustness of Transformers against Context Hijacking for Linear Classification	Tianle Li et.al.	2502.15609	null
2025-02-21	Cross-Format Retrieval-Augmented Generation in XR with LLMs for Context-Aware Maintenance Assistance	Akos Nagy et.al.	2502.15604	null
2025-02-21	Do Multilingual LLMs Think In English?	Lisa Schut et.al.	2502.15603	null
2025-02-21	WorldCraft: Photo-Realistic 3D World Creation and Customization via LLM Agents	Xinhang Liu et.al.	2502.15601	null
2025-02-21	SafeInt: Shielding Large Language Models from Jailbreak Attacks via Safety-Aware Representation Intervention	Jiaqi Wu et.al.	2502.15594	null
2025-02-20	LServe: Efficient Long-sequence LLM Serving with Unified Sparse Attention	Shang Yang et.al.	2502.14866	link
2025-02-20	Aligning LLMs to Ask Good Questions A Case Study in Clinical Reasoning	Shuyue Stella Li et.al.	2502.14860	link
2025-02-20	FR-Spec: Accelerating Large-Vocabulary Language Models via Frequency-Ranked Speculative Sampling	Weilin Zhao et.al.	2502.14856	null
2025-02-20	Prompt-to-Leaderboard	Evan Frick et.al.	2502.14855	link
2025-02-20	GATE: Graph-based Adaptive Tool Evolution Across Diverse Tasks	Jianwen Luo et.al.	2502.14848	link
2025-02-20	Red-Teaming LLM Multi-Agent Systems via Communication Attacks	Pengfei He et.al.	2502.14847	null
2025-02-20	Scaling Text-Rich Image Understanding via Code-Guided Synthetic Multimodal Data Generation	Yue Yang et.al.	2502.14846	null
2025-02-20	Revealing and Mitigating Over-Attention in Knowledge Editing	Pinzheng Wang et.al.	2502.14838	link
2025-02-20	LongWriter-V: Enabling Ultra-Long and High-Fidelity Generation in Vision-Language Models	Shangqing Tu et.al.	2502.14834	link
2025-02-20	Middle-Layer Representation Alignment for Cross-Lingual Transfer in Fine-Tuned LLMs	Danni Liu et.al.	2502.14830	link
2025-02-20	Measuring Faithfulness of Chains of Thought by Unlearning Reasoning Steps	Martin Tutek et.al.	2502.14829	link
2025-02-20	Exploring Advanced Techniques for Visual Question Answering: A Comprehensive Comparison	Aiswarya Baby et.al.	2502.14827	null
2025-02-20	A Survey of Model Architectures in Information Retrieval	Zhichao Xu et.al.	2502.14822	null
2025-02-20	eC-Tab2Text: Aspect-Based Text Generation from e-Commerce Product Tables	Luis Antonio Gutiérrez Guanilo et.al.	2502.14820	null
2025-02-20	Dynamic Low-Rank Sparse Adaptation for Large Language Models	Weizhong Huang et.al.	2502.14816	link
2025-02-20	FetalCLIP: A Visual-Language Foundation Model for Fetal Ultrasound Image Analysis	Fadillah Maani et.al.	2502.14807	link
2025-02-20	From RAG to Memory: Non-Parametric Continual Learning for Large Language Models	Bernal Jiménez Gutiérrez et.al.	2502.14802	link
2025-02-20	A Multi-Agent Perspective on Modern Information Retrieval	Haya Nachimovsky et.al.	2502.14796	null
2025-02-20	Rapid Word Learning Through Meta In-Context Learning	Wentao Wang et.al.	2502.14791	null
2025-02-20	SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features	Michael Tschannen et.al.	2502.14786	link
2025-02-19	Where's the Bug? Attention Probing for Scalable Fault Localization	Adam Stein et.al.	2502.13966	null
2025-02-19	Autellix: An Efficient Serving Engine for LLM Agents as General Programs	Michael Luo et.al.	2502.13965	null
2025-02-19	MuDAF: Long-Context Multi-Document Attention Focusing through Contrastive Learning on Attention Heads	Weihao Liu et.al.	2502.13963	link
2025-02-19	Is That Your Final Answer? Test-Time Scaling Improves Selective Question Answering	William Jurayj et.al.	2502.13962	null
2025-02-19	LIDDIA: Language-based Intelligent Drug Discovery Agent	Reza Averly et.al.	2502.13959	null
2025-02-19	Neurosymbolic artificial intelligence via large language models and coherence-driven inference	Steve Huntsman et.al.	2502.13953	null
2025-02-19	Why Safeguarded Ships Run Aground? Aligned Large Language Models' Safety Mechanisms Tend to Be Anchored in The Template Region	Chak Tou Leong et.al.	2502.13946	null
2025-02-19	A Chain-of-Thought Subspace Meta-Learning for Few-shot Image Captioning with Large Vision and Language Models	Hao Huang et.al.	2502.13942	null
2025-02-19	Symmetrical Visual Contrastive Optimization: Aligning Vision-Language Models with Minimal Contrastive Images	Shengguang Wu et.al.	2502.13928	null
2025-02-19	Beyond Single Frames: Can LMMs Comprehend Temporal and Contextual Narratives in Image Sequences?	Xiaochen Wang et.al.	2502.13925	null
2025-02-19	LongPO: Long Context Self-Evolution of Large Language Models through Short-to-Long Preference Optimization	Guanzheng Chen et.al.	2502.13922	link
2025-02-19	Exploring Code Language Models for Automated HLS-based Hardware Generation: Benchmark, Infrastructure and Analysis	Jiahao Gai et.al.	2502.13921	null
2025-02-19	Exploring Personalized Health Support through Data-Driven, Theory-Guided LLMs: A Case Study in Sleep Health	Xingbo Wang et.al.	2502.13920	link
2025-02-19	TESS 2: A Large-Scale Generalist Diffusion Language Model	Jaesung Tae et.al.	2502.13917	link
2025-02-19	How Do LLMs Perform Two-Hop Reasoning in Context?	Tianyu Guo et.al.	2502.13913	null
2025-02-19	Lost in Sequence: Do Large Language Models Understand Sequential Recommendation?	Sein Kim et.al.	2502.13909	link
2025-02-19	Judging the Judges: A Collection of LLM-Generated Relevance Judgements	Hossein A. Rahmani et.al.	2502.13908	link
2025-02-19	DataSciBench: An LLM Agent Benchmark for Data Science	Dan Zhang et.al.	2502.13897	link
2025-02-19	NavigateDiff: Visual Predictors are Zero-Shot Navigation Assistants	Yiran Qin et.al.	2502.13894	null
2025-02-19	Refining embeddings with fill-tuning: data-efficient generalised performance improvements for materials foundation models	Matthew P. Wilson et.al.	2502.13886	link
2025-02-18	Re-Align: Aligning Vision Language Models via Retrieval-Augmented Direct Preference Optimization	Shuo Xing et.al.	2502.13146	link
2025-02-18	Multimodal Mamba: Decoder-only Multimodal State Space Model via Quadratic to Linear Distillation	Bencheng Liao et.al.	2502.13145	link
2025-02-18	Pre-training Auto-regressive Robotic Models with 4D Representations	Dantong Niu et.al.	2502.13142	null
2025-02-18	UniGuardian: A Unified Defense for Detecting Prompt Injection, Backdoor Attacks and Adversarial Attacks in Large Language Models	Huawei Lin et.al.	2502.13141	link
2025-02-18	AIDE: AI-Driven Exploration in the Space of Code	Zhengyao Jiang et.al.	2502.13138	link
2025-02-18	Theorem Prover as a Judge for Synthetic Data Generation	Joshua Ong Jun Leang et.al.	2502.13137	null
2025-02-18	Sleepless Nights, Sugary Days: Creating Synthetic Users with Health Conditions for Realistic Coaching Agent Interactions	Taedong Yun et.al.	2502.13135	null
2025-02-18	Learning to Defer for Causal Discovery with Imperfect Experts	Oscar Clivio et.al.	2502.13132	null
2025-02-18	Rethinking Diverse Human Preference Learning through Principal Component Analysis	Feng Luo et.al.	2502.13131	null
2025-02-18	Magma: A Foundation Model for Multimodal AI Agents	Jianwei Yang et.al.	2502.13130	link
2025-02-18	Facilitating Long Context Understanding via Supervised Chain-of-Thought Reasoning	Jingyang Lin et.al.	2502.13127	null
2025-02-18	RuozhiBench: Evaluating LLMs with Logical Fallacies and Misleading Premises	Zenan Zhai et.al.	2502.13125	link
2025-02-18	Adapting Psycholinguistic Research for LLMs: Gender-inclusive Language in a Coreference Context	Marion Bartl et.al.	2502.13120	null
2025-02-18	STEER-ME: Assessing the Microeconomic Reasoning of Large Language Models	Narun Raman et.al.	2502.13119	null
2025-02-18	Performance Evaluation of Large Language Models in Statistical Programming	Xinyi Song et.al.	2502.13117	link
2025-02-18	MatterChat: A Multi-Modal LLM for Material Science	Yingheng Tang et.al.	2502.13107	null
2025-02-18	Understanding and Rectifying Safety Perception Distortion in VLMs	Xiaohan Zou et.al.	2502.13095	null
2025-02-18	Text2World: Benchmarking Large Language Models for Symbolic World Model Generation	Mengkang Hu et.al.	2502.13092	null
2025-02-18	KAPPA: A Generic Patent Analysis Framework with Keyphrase-Based Portraits	Xin Xia et.al.	2502.13076	null
2025-02-18	Cramming 1568 Tokens into a Single Vector and Back Again: Exploring the Limits of Embedding Space Capacity	Yuri Kuratov et.al.	2502.13063	link
2025-02-17	Idiosyncrasies in Large Language Models	Mingjie Sun et.al.	2502.12150	link
2025-02-17	HermesFlow: Seamlessly Closing the Gap in Multimodal Understanding and Generation	Ling Yang et.al.	2502.12148	link
2025-02-17	Fast or Better? Balancing Accuracy and Cost in Retrieval-Augmented Generation with Flexible User Control	Jinyan Su et.al.	2502.12145	link
2025-02-17	Small Models Struggle to Learn from Strong Reasoners	Yuetai Li et.al.	2502.12143	null
2025-02-17	SoftCoT: Soft Chain-of-Thought for Efficient Reasoning with LLMs	Yige Xu et.al.	2502.12134	link
2025-02-17	Transformer Dynamics: A neuroscientific approach to interpretability of large language models	Jesseba Fernando et.al.	2502.12131	null
2025-02-17	Scaling Autonomous Agents via Automatic Reward Modeling And Planning	Zhenfang Chen et.al.	2502.12130	null
2025-02-17	On the Query Complexity of Verifier-Assisted Language Generation	Edoardo Botta et.al.	2502.12123	null
2025-02-17	Minimal Ranks, Maximum Confidence: Parameter-efficient Uncertainty Quantification for LoRA	Patryk Marszałek et.al.	2502.12122	link
2025-02-17	LLMs on the Line: Data Determines Loss-to-Loss Scaling Laws	Prasanna Mayilvahanan et.al.	2502.12120	null
2025-02-17	PRISM: Self-Pruning Intrinsic Selection Method for Training-Free Multimodal Data Selection	Jinhe Bi et.al.	2502.12119	null
2025-02-17	A-MEM: Agentic Memory for LLM Agents	Wujiang Xu et.al.	2502.12110	link
2025-02-17	Personality Structured Interview for Large Language Model Simulation in Personality Research	Pengda Wang et.al.	2502.12109	null
2025-02-17	Relational Norms for Human-AI Cooperation	Brian D. Earp et.al.	2502.12102	null
2025-02-17	Token Communications: A Unified Framework for Cross-modal Context-aware Semantic Communications	Li Qiao et.al.	2502.12096	null
2025-02-17	Descriminative-Generative Custom Tokens for Vision-Language Models	Pramuditha Perera et.al.	2502.12095	null
2025-02-17	Meta-Statistical Learning: Supervised Learning of Statistical Inference	Maxime Peyrard et.al.	2502.12088	null
2025-02-17	APB: Accelerating Distributed Long-Context Inference by Passing Compressed Context Blocks across GPUs	Yuxiang Huang et.al.	2502.12085	link
2025-02-17	VLM $^2$ -Bench: A Closer Look at How Well VLMs Implicitly Link Explicit Matching Visual Cues	Jianshu Zhang et.al.	2502.12084	null
2025-02-17	AdaSplash: Adaptive Sparse Flash Attention	Nuno Gonçalves et.al.	2502.12082	link
2025-02-14	MM-RLHF: The Next Step Forward in Multimodal LLM Alignment	Yi-Fan Zhang et.al.	2502.10391	null
2025-02-14	Aspect-Oriented Summarization for Psychiatric Short-Term Readmission Prediction	WonJin Yoon et.al.	2502.10388	null
2025-02-14	Unknown Word Detection for English as a Second Language (ESL) Learners Using Gaze and Pre-trained Language Models	Jiexin Ding et.al.	2502.10378	null
2025-02-14	Robustness tests for biomedical foundation models should tailor to specification	R. Patrick Xian et.al.	2502.10374	link
2025-02-14	Enhancing Multilingual LLM Pretraining with Model-Based Data Selection	Bettina Messmer et.al.	2502.10361	null
2025-02-14	Organize the Web: Constructing Domains Enhances Pre-Training Data Curation	Alexander Wettig et.al.	2502.10341	null
2025-02-14	Evaluating the Meta- and Object-Level Reasoning of Large Language Models for Question Answering	Nick Ferguson et.al.	2502.10338	null
2025-02-14	LLM-Powered Preference Elicitation in Combinatorial Assignment	Ermis Soumalias et.al.	2502.10308	null
2025-02-14	SPIRIT: Short-term Prediction of solar IRradIance for zero-shot Transfer learning using Foundation Models	Aditya Mishra et.al.	2502.10307	null
2025-02-14	Open-Source AI-Powered Optimization in Scalene: Advancing Python Performance Profiling with DeepSeek-R1 and LLaMA 3.2	Saem Hasan et.al.	2502.10299	null
2025-02-14	DeltaProduct: Increasing the Expressivity of DeltaNet Through Products of Householders	Julien Siems et.al.	2502.10297	link
2025-02-14	Probing Perceptual Constancy in Large Vision Language Models	Haoran Sun et.al.	2502.10273	null
2025-02-14	Are Large Language Models the future crowd workers of Linguistics?	Iris Ferrazzo et.al.	2502.10266	null
2025-02-14	Large Language Models and Synthetic Data for Monitoring Dataset Mentions in Research Papers	Aivin V. Solatorio et.al.	2502.10263	link
2025-02-14	VisCon-100K: Leveraging Contextual Web Data for Fine-tuning Vision Language Models	Gokul Karthik Kumar et.al.	2502.10250	null
2025-02-14	Step-Video-T2V Technical Report: The Practice, Challenges, and Future of Video Foundation Model	Guoqing Ma et.al.	2502.10248	link
2025-02-14	Efficient Zero-Order Federated Finetuning of Language Models for Resource-Constrained Devices	Mohamed Aboelenien Ahmed et.al.	2502.10239	null
2025-02-14	AdaPTS: Adapting Univariate Foundation Models to Probabilistic Multivariate Time Series Forecasting	Abdelhakim Benechehab et.al.	2502.10235	link
2025-02-14	Do Large Language Models Reason Causally Like Us? Even Better?	Hanna M. Dettki et.al.	2502.10215	null
2025-02-14	Can Post-Training Quantization Benefit from an Additional QLoRA Integration?	Xiliang Zhu et.al.	2502.10202	null
2025-02-13	Theoretical Benefit and Limitation of Diffusion Language Model	Guhao Feng et.al.	2502.09622	null
2025-02-13	MME-CoT: Benchmarking Chain-of-Thought in Large Multimodal Models for Reasoning Quality, Robustness, and Efficiency	Dongzhi Jiang et.al.	2502.09621	null
2025-02-13	Exploring the Potential of Encoder-free Architectures in 3D LMMs	Yiwen Tang et.al.	2502.09620	link
2025-02-13	Human-LLM Coevolution: Evidence from Academic Writing	Mingmeng Geng et.al.	2502.09606	null
2025-02-13	SelfCite: Self-Supervised Alignment for Context Attribution in Large Language Models	Yung-Sung Chuang et.al.	2502.09604	link
2025-02-13	GAIA: A Global, Multi-modal, Multi-scale Vision-Language Dataset for Remote Sensing Image Analysis	Angelos Zavras et.al.	2502.09598	link
2025-02-13	Do LLMs Recognize Your Preferences? Evaluating Personalized Preference Following in LLMs	Siyan Zhao et.al.	2502.09597	link
2025-02-13	KIMAs: A Configurable Knowledge Integrated Multi-Agent System	Zitao Li et.al.	2502.09596	null
2025-02-13	Logical forms complement probability in understanding language model (and human) performance	Yixuan Wang et.al.	2502.09589	null
2025-02-13	Polymind: Parallel Visual Diagramming with Large Language Models to Support Prewriting Through Microtasks	Qian Wan et.al.	2502.09577	null
2025-02-13	MorphNLI: A Stepwise Approach to Natural Language Inference Using Text Morphing	Vlad Andrei Negru et.al.	2502.09567	null
2025-02-13	Zero-shot generation of synthetic neurosurgical data with large language models	Austin A. Barr et.al.	2502.09566	link
2025-02-13	MDCrow: Automating Molecular Dynamics Workflows with Large Language Models	Quintina Campbell et.al.	2502.09565	link
2025-02-13	EmbodiedBench: Comprehensive Benchmarking Multi-modal Large Language Models for Vision-Driven Embodied Agents	Rui Yang et.al.	2502.09560	null
2025-02-13	Explainable AI-assisted Optimization for Feynman Integral Reduction	Zhuo-Yang Song et.al.	2502.09544	null
2025-02-13	Mind the Gap! Choice Independence in Using Multilingual LLMs for Persuasive Co-Writing Tasks in Different Languages	Shreyan Biswas et.al.	2502.09532	null
2025-02-13	When and How Does CLIP Enable Domain and Compositional Generalization?	Elias Kempf et.al.	2502.09507	null
2025-02-13	Improve LLM-based Automatic Essay Scoring with Linguistic Features	Zhaoyi Joey Hou et.al.	2502.09497	null
2025-02-13	Foundation Neural-Network Quantum States	Riccardo Rende et.al.	2502.09488	null
2025-02-13	Objective quantification of mood states using large language models	Jakub Onysk et.al.	2502.09487	null
2025-02-12	SwiftSketch: A Diffusion Model for Image-to-Vector Sketch Generation	Ellie Arar et.al.	2502.08642	null
2025-02-12	Examining Multilingual Embedding Models Cross-Lingually Through LLM-Generated Adversarial Examples	Andrianos Michail et.al.	2502.08638	null
2025-02-12	Ensemble based approach to quantifying uncertainty of LLM based classifications	Srijith Rajamohan et.al.	2502.08631	null
2025-02-12	Continuous Cardiac Arrest Prediction in ICU using PPG Foundation Model	Saurabh Kataria et.al.	2502.08612	null
2025-02-12	Causal Analysis of ASR Errors for Children: Quantifying the Impact of Physiological, Cognitive, and Extrinsic Factors	Vishwanath Pratap Singh et.al.	2502.08587	null
2025-02-12	Commercial LLM Agents Are Already Vulnerable to Simple Yet Dangerous Attacks	Ang Li et.al.	2502.08586	null
2025-02-12	COAST: Intelligent Time-Adaptive Neural Operators	Zhikai Wu et.al.	2502.08574	null
2025-02-12	QA-Expand: Multi-Question Answer Generation for Enhanced Query Expansion in Information Retrieval	Wonduk Seo et.al.	2502.08557	null
2025-02-12	Human-Centric Foundation Models: Perception, Generation and Agentic Modeling	Shixiang Tang et.al.	2502.08556	link
2025-02-12	Fostering Appropriate Reliance on Large Language Models: The Role of Explanations, Sources, and Inconsistencies	Sunnie S. Y. Kim et.al.	2502.08554	null
2025-02-12	LLMs can implicitly learn from mistakes in-context	Lisa Alazraki et.al.	2502.08550	null
2025-02-12	Representation Learning to Advance Multi-institutional Studies with Electronic Health Record Data	Doudou Zhou et.al.	2502.08547	null
2025-02-12	Moment of Untruth: Dealing with Negative Queries in Video Moment Retrieval	Kevin Flanagan et.al.	2502.08544	link
2025-02-12	LLM Pretraining with Continuous Concepts	Jihoon Tack et.al.	2502.08524	null
2025-02-12	The Paradox of Stochasticity: Limited Creativity and Computational Decoupling in Temperature-Varied LLM Outputs of Structured Fictional Data	Evgenii Evstafev et.al.	2502.08515	null
2025-02-12	Faithful, Unfaithful or Ambiguous? Multi-Agent Debate with Initial Stance for Summary Evaluation	Mahnaz Koupaee et.al.	2502.08514	link
2025-02-12	Measuring Diversity in Synthetic Datasets	Yuchang Zhu et.al.	2502.08512	link
2025-02-12	Explanation based In-Context Demonstrations Retrieval for Multilingual Grammatical Error Correction	Wei Li et.al.	2502.08507	link
2025-02-12	Salamandra Technical Report	Aitor Gonzalez-Agirre et.al.	2502.08489	link
2025-02-12	One-Shot Federated Learning with Classifier-Free Diffusion Models	Obaidullah Zaland et.al.	2502.08488	null
2025-02-11	DarwinLM: Evolutionary Structured Pruning of Large Language Models	Shengkun Tang et.al.	2502.07780	link
2025-02-11	Auditing Prompt Caching in Language Model APIs	Chenchen Gu et.al.	2502.07776	link
2025-02-11	Automatic Robot Task Planning by Integrating Large Language Model with Genetic Programming	Azizjon Kobilov et.al.	2502.07772	null
2025-02-11	Breaking Down Bias: On The Limits of Generalizable Pruning Strategies	Sibo Ma et.al.	2502.07771	null
2025-02-11	Great Power Brings Great Responsibility: Personalizing Conversational AI for Diverse Problem-Solvers	Italo Santos et.al.	2502.07763	null
2025-02-11	Scalable Fingerprinting of Large Language Models	Anshul Nasery et.al.	2502.07760	null
2025-02-11	Towards Efficient Optimizer Design for LLM via Structured Fisher Approximation with a Low-Rank Extension	Wenbo Gong et.al.	2502.07752	null
2025-02-11	WHODUNIT: Evaluation benchmark for culprit detection in mystery stories	Kshitij Gupta et.al.	2502.07747	link
2025-02-11	The Economics of Large Language Models: Token Allocation, Fine-Tuning, and Optimal Pricing	Dirk Bergemann et.al.	2502.07736	null
2025-02-11	Economics of Sourcing Human Data	Sebastin Santy et.al.	2502.07732	null
2025-02-11	Verifying LLM-Generated Code in the Context of Software Verification with Ada/SPARK	Marcos Cramer et.al.	2502.07728	null
2025-02-11	Making Language Models Robust Against Negation	MohammadHossein Rezaei et.al.	2502.07717	link
2025-02-11	Magic 1-For-1: Generating One Minute Video Clips within One Minute	Hongwei Yi et.al.	2502.07701	link
2025-02-11	A Framework for LLM-powered Design Assistants	Swaroop Panda et.al.	2502.07698	null
2025-02-11	Large Language Models as Proxies for Theories of Human Linguistic Cognition	Imry Ziv et.al.	2502.07687	null
2025-02-11	SymGPT: Auditing Smart Contracts via Combining Symbolic Execution with Large Language Models	Shihao Xia et.al.	2502.07644	null
2025-02-11	FoQA: A Faroese Question-Answering Dataset	Annika Simonsen et.al.	2502.07642	null
2025-02-11	Goedel-Prover: A Frontier Model for Open-Source Automated Theorem Proving	Yong Lin et.al.	2502.07640	link
2025-02-11	Exploring Mobile Touch Interaction with Large Language Models	Tim Zindulka et.al.	2502.07629	null
2025-02-11	Scaling Pre-training to One Hundred Billion Data for Vision Language Models	Xiao Wang et.al.	2502.07617	null
2025-02-10	EVEv2: Improved Baselines for Encoder-Free Vision-Language Models	Haiwen Diao et.al.	2502.06788	link
2025-02-10	Visual Agentic AI for Spatial Reasoning with a Dynamic API	Damiano Marsili et.al.	2502.06787	null
2025-02-10	DeepCrossAttention: Supercharging Transformer Residual Connections	Mike Heddes et.al.	2502.06785	null
2025-02-10	Towards Internet-Scale Training For Agents	Brandon Trabucco et.al.	2502.06776	null
2025-02-10	Enhancing Trust in Language Model-Based Code Optimization through RLHF: A Research Design	Jingzhi Gong et.al.	2502.06769	null
2025-02-10	Exploiting Sparsity for Long Context Inference: Million Token Contexts on Commodity GPUs	Ryan Synk et.al.	2502.06766	link
2025-02-10	Rationalization Models for Text-to-SQL	Gaetano Rossiello et.al.	2502.06759	null
2025-02-10	Accelerating Data Processing and Benchmarking of AI Models for Pathology	Andrew Zhang et.al.	2502.06750	link
2025-02-10	Gradient Multi-Normalization for Stateless and Scalable LLM Training	Meyer Scetbon et.al.	2502.06742	null
2025-02-10	VersaPRM: Multi-Domain Process Reward Model via Synthetic Reasoning Data	Thomas Zeng et.al.	2502.06737	null
2025-02-10	Dynamic Loss-Based Sample Reweighting for Improved Large Language Model Pretraining	Daouda Sow et.al.	2502.06733	null
2025-02-10	Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling	Runze Liu et.al.	2502.06703	link
2025-02-10	EquiTabPFN: A Target-Permutation Equivariant Prior Fitted Networks	Michael Arbel et.al.	2502.06684	null
2025-02-10	Boosting Self-Efficacy and Performance of Large Language Models via Verbal Efficacy Stimulations	Rui Chen et.al.	2502.06669	null
2025-02-10	Automatic Evaluation of Healthcare LLMs Beyond Question-Answering	Anna Arias-Duart et.al.	2502.06666	null
2025-02-10	Evaluation of Deep Audio Representations for Hearables	Fabian Gröger et.al.	2502.06664	null
2025-02-10	EfficientLLM: Scalable Pruning-Aware Pretraining for Architecture-Agnostic Edge Language Models	Xingrun Xing et.al.	2502.06663	null
2025-02-10	Unbiased Evaluation of Large Language Models from a Causal Perspective	Meilin Chen et.al.	2502.06655	null
2025-02-10	In-Context Learning (and Unlearning) of Length Biases	Stephanie Schoch et.al.	2502.06653	null
2025-02-10	Transparent NLP: Using RAG and LLM Alignment for Privacy Q&A	Anna Leschanowsky et.al.	2502.06652	null
2025-02-07	Long-VITA: Scaling Large Multi-modal Models to 1 Million Tokens with Leading Short-Context Accuray	Yunhang Shen et.al.	2502.05177	link
2025-02-07	Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach	Jonas Geiping et.al.	2502.05171	link
2025-02-07	NoLiMa: Long-Context Evaluation Beyond Literal Matching	Ali Modarressi et.al.	2502.05167	link
2025-02-07	Multitwine: Multi-Object Compositing with Text and Layout Control	Gemma Canet Tarrés et.al.	2502.05165	null
2025-02-07	DuoGuard: A Two-Player RL-Driven Framework for Multilingual LLM Guardrails	Yihe Deng et.al.	2502.05163	link
2025-02-07	A Lightweight Method to Disrupt Memorized Sequences in LLM	Parjanya Prajakta Prashant et.al.	2502.05159	null
2025-02-07	Transforming Science with Large Language Models: A Survey on AI-assisted Scientific Discovery, Experimentation, Content Generation, and Evaluation	Steffen Eger et.al.	2502.05151	link
2025-02-07	CodeSCM: Causal Analysis for Multi-Modal Code Generation	Mukur Gupta et.al.	2502.05150	link
2025-02-07	An Annotated Reading of 'The Singer of Tales' in the LLM Era	Kush R. Varshney et.al.	2502.05148	null
2025-02-07	Chest X-ray Foundation Model with Global and Local Representations Integration	Zefan Yang et.al.	2502.05142	link
2025-02-07	Refining Integration-by-Parts Reduction of Feynman Integrals with Machine Learning	Matt von Hippel et.al.	2502.05121	null
2025-02-07	Flexible and Efficient Grammar-Constrained Decoding	Kanghee Park et.al.	2502.05111	null
2025-02-07	Lost in Time: Clock and Calendar Understanding Challenges in Multimodal LLMs	Rohit Saxena et.al.	2502.05092	null
2025-02-07	DCFormer: Efficient 3D Vision-Language Modeling with Decomposed Convolutions	Gorkem Can Ates et.al.	2502.05091	null
2025-02-07	Mitigating Unintended Memorization with LoRA in Federated Learning for LLMs	Thierry Bossy et.al.	2502.05087	link
2025-02-07	Causality can systematically address the monsters under the bench(marks)	Felix Leeb et.al.	2502.05085	null
2025-02-07	ChallengeMe: An Adversarial Learning-enabled Text Summarization Framework	Xiaoyu Deng et.al.	2502.05084	null
2025-02-07	Adaptive Graph of Thoughts: Test-Time Adaptive Reasoning Unifying Chain, Tree, and Graph Structures	Tushar Pandey et.al.	2502.05078	link
2025-02-07	nvAgent: Automated Data Visualization from Natural Language via Collaborative Agent Workflow	Geliang Ouyang et.al.	2502.05036	link
2025-02-07	EnseSmells: Deep ensemble and programming language models for automated code smells detection	Anh Ho et.al.	2502.05012	link
2025-02-06	Ola: Pushing the Frontiers of Omni-Modal Language Model with Progressive Modality Alignment	Zuyan Liu et.al.	2502.04328	link
2025-02-06	Speak Easy: Eliciting Harmful Jailbreaks from LLMs with Simple Interactions	Yik Siu Chan et.al.	2502.04322	link
2025-02-06	ConceptAttention: Diffusion Transformers Learn Highly Interpretable Features	Alec Helbling et.al.	2502.04320	link
2025-02-06	sshELF: Single-Shot Hierarchical Extrapolation of Latent Features for 3D Reconstruction from Sparse-Views	Eyvaz Najafli et.al.	2502.04318	null
2025-02-06	ChamaleonLLM: Batch-Aware Dynamic Low-Rank Adaptation via Inference-Time Clusters	Kamer Ali Yuksel et.al.	2502.04315	link
2025-02-06	Great Models Think Alike and this Undermines AI Oversight	Shashwat Goel et.al.	2502.04313	link
2025-02-06	ScoreFlow: Mastering LLM Agent Workflows via Score-based Preference Optimization	Yinjie Wang et.al.	2502.04306	link
2025-02-06	Beyond Prompt Content: Enhancing LLM Performance via Content-Format Integrated Prompt Optimization	Yuanye Liu et.al.	2502.04295	link
2025-02-06	PILAF: Optimal Human Preference Sampling for Reward Modeling	Yunzhen Feng et.al.	2502.04270	null
2025-02-06	How does a Multilingual LM Handle Multiple Languages?	Santhosh Kakarla et.al.	2502.04269	null
2025-02-06	Cross the Gap: Exposing the Intra-modal Misalignment in CLIP via Modality Inversion	Marco Mistretta et.al.	2502.04263	link
2025-02-06	Efficient Randomized Experiments Using Foundation Models	Piersilvio De Bartolomeis et.al.	2502.04262	link
2025-02-06	MAGA: MAssive Genre-Audience Reformulation to Pretraining Corpus Expansion	Xintong Hao et.al.	2502.04235	null
2025-02-06	Can LLMs Hack Enterprise Networks? Autonomous Assumed Breach Penetration-Testing Active Directory Networks	Andreas Happe et.al.	2502.04227	null
2025-02-06	Keep It Light! Simplifying Image Clustering Via Text-Free Adapters	Yicen Li et.al.	2502.04226	null
2025-02-06	Éclair -- Extracting Content and Layout with Integrated Reading Order for Documents	Ilia Karmanov et.al.	2502.04223	null
2025-02-06	Sports and Women's Sports: Gender Bias in Text Generation with Olympic Data	Laura Biester et.al.	2502.04218	null
2025-02-06	Algorithmic causal structure emerging through compression	Liang Wendong et.al.	2502.04210	null
2025-02-06	"Short-length" Adversarial Training Helps LLMs Defend "Long-length" Jailbreak Attacks: Theoretical and Empirical Evidence	Shaopeng Fu et.al.	2502.04204	link
2025-02-06	The Best Instruction-Tuning Data are Those That Fit	Dylan Zhang et.al.	2502.04194	null
2025-02-05	Do Large Language Model Benchmarks Test Reliability?	Joshua Vendrow et.al.	2502.03461	link
2025-02-05	Adapt-Pruner: Adaptive Structural Pruning for Efficient Small Language Model Training	Boyao Wang et.al.	2502.03460	null
2025-02-05	SKI Models: Skeleton Induced Vision-Language Embeddings for Understanding Activities of Daily Living	Arkaprava Sinha et.al.	2502.03459	null
2025-02-05	A Schema-Guided Reason-while-Retrieve framework for Reasoning on Scene Graphs with Large-Language-Models (LLMs)	Yiye Chen et.al.	2502.03450	null
2025-02-05	BFS-Prover: Scalable Best-First Tree Search for LLM-based Automatic Theorem Proving	Ran Xin et.al.	2502.03438	null
2025-02-05	On Fairness of Unified Multimodal Large Language Model for Image Generation	Ming Liu et.al.	2502.03429	null
2025-02-05	Harnessing Large Language Models for Curated Code Reviews	Oussama Ben Sghaier et.al.	2502.03425	link
2025-02-05	Think or Step-by-Step? UnZIPping the Black Box in Zero-Shot Prompts	Nikta Gohari Sadr et.al.	2502.03418	null
2025-02-05	SPRI: Aligning Large Language Models with Context-Situated Principles	Hongli Zhan et.al.	2502.03397	null
2025-02-05	Benchmarking Time Series Forecasting Models: From Statistical Techniques to Foundation Models in Real-World Applications	Issar Arab et.al.	2502.03395	null
2025-02-05	LIMO: Less is More for Reasoning	Yixin Ye et.al.	2502.03387	link
2025-02-05	Transformers and Their Roles as Time Series Foundation Models	Dennis Wu et.al.	2502.03383	null
2025-02-05	High-Fidelity Simultaneous Speech-To-Speech Translation	Tom Labiausse et.al.	2502.03382	link
2025-02-05	Demystifying Long Chain-of-Thought Reasoning in LLMs	Edward Yeo et.al.	2502.03373	link
2025-02-05	PalimpChat: Declarative and Interactive AI analytics	Chunwei Liu et.al.	2502.03368	null
2025-02-05	Minerva: A Programmable Memory Test Benchmark for Language Models	Menglin Xia et.al.	2502.03358	null
2025-02-05	RadVLM: A Multitask Conversational Vision-Language Model for Radiology	Nicolas Deperrois et.al.	2502.03333	null
2025-02-05	ECM: A Unified Electronic Circuit Model for Explaining the Emergence of In-Context Learning and Chain-of-Thought in Large Language Model	Qiguang Chen et.al.	2502.03325	null
2025-02-05	Out-of-Distribution Detection using Synthetic Data Generation	Momin Abbas et.al.	2502.03323	null
2025-02-05	Simplifying Formal Proof-Generating Models with ChatGPT and Basic Searching Techniques	Sangjun Han et.al.	2502.03321	null
2025-02-04	Articulate AnyMesh: Open-Vocabulary 3D Articulated Objects Modeling	Xiaowen Qiu et.al.	2502.02590	null
2025-02-04	COCONut-PanCap: Joint Panoptic Segmentation and Grounded Captions for Fine-Grained Understanding and Generation	Xueqing Deng et.al.	2502.02589	null
2025-02-04	A comparison of translation performance between DeepL and Supertext	Alex Flückiger et.al.	2502.02577	link
2025-02-04	Are Language Models Up to Sequential Optimization Problems? From Evaluation to a Hegelian-Inspired Enhancement	Soheil Abbasloo et.al.	2502.02573	null
2025-02-04	Learning the RoPEs: Better 2D and 3D Position Encodings with STRING	Connor Schenck et.al.	2502.02562	null
2025-02-04	Mosaic3D: Foundation Dataset and Model for Open-Vocabulary 3D Segmentation	Junha Lee et.al.	2502.02548	null
2025-02-04	LLMs for Generation of Architectural Components: An Exploratory Empirical Study in the Serverless World	Shrikara Arun et.al.	2502.02539	null
2025-02-04	Adaptive Self-improvement LLM Agentic System for ML Library Development	Genghan Zhang et.al.	2502.02534	link
2025-02-04	Multi-Agent Design: Optimizing Agents with Better Prompts and Topologies	Han Zhou et.al.	2502.02533	null
2025-02-04	Satori: Reinforcement Learning with Chain-of-Action-Thought Enhances LLM Reasoning via Autoregressive Search	Maohao Shen et.al.	2502.02508	null
2025-02-04	Analyzing Similarity Metrics for Data Selection for Language Model Pretraining	Dylan Sam et.al.	2502.02494	null
2025-02-04	EasySpec: Layer-Parallel Speculative Decoding for Efficient Multi-GPU Utilization	Yize Wu et.al.	2502.02493	null
2025-02-04	Multilingual Machine Translation with Open Large Language Models at Practical Scale: An Empirical Study	Menglong Cui et.al.	2502.02481	null
2025-02-04	Mind the Gap: Evaluating Patch Embeddings from General-Purpose and Histopathology Foundation Models for Cell Segmentation and Classification	Valentina Vadori et.al.	2502.02471	link
2025-02-04	Modular Training of Neural Networks aids Interpretability	Satvik Golechha et.al.	2502.02470	null
2025-02-04	SAISA: Towards Multimodal Large Language Models with Both Training and Inference Efficiency	Qianhao Yuan et.al.	2502.02458	link
2025-02-04	IMDPrompter: Adapting SAM to Image Manipulation Detection by Cross-View Automated Prompt Learning	Quan Zhang et.al.	2502.02454	null
2025-02-04	Personalization Toolkit: Training Free Personalization of Large Vision Language Models	Soroush Seifi et.al.	2502.02452	null
2025-02-04	Beyond English: Evaluating Automated Measurement of Moral Foundations in Non-English Discourse with a Chinese Case Study	Calvin Yixiang Cheng et.al.	2502.02451	link
2025-02-04	Generative Psycho-Lexical Approach for Constructing Value Systems in Large Language Models	Haoran Ye et.al.	2502.02444	null
2025-01-31	Low-Rank Adapting Models for Sparse Autoencoders	Matthew Chen et.al.	2501.19406	link
2025-01-31	Vintix: Action Model via In-Context Reinforcement Learning	Andrey Polubarov et.al.	2501.19400	link
2025-01-31	Scalable-Softmax Is Superior for Attention	Ken M. Nakanishi et.al.	2501.19399	null
2025-01-31	Do LLMs Strategically Reveal, Conceal, and Infer Information? A Theoretical and Empirical Analysis in The Chameleon Game	Mustafa O. Karabag et.al.	2501.19398	link
2025-02-03	s1: Simple test-time scaling	Niklas Muennighoff et.al.	2501.19393	link
2025-01-31	Cache Me If You Must: Adaptive Key-Value Quantization for Large Language Models	Alina Shutova et.al.	2501.19392	link
2025-01-31	Federated Sketching LoRA: On-Device Collaborative Fine-Tuning of Large Language Models	Wenzhi Fang et.al.	2501.19389	link
2025-01-31	Decoding-based Regression	Xingyou Song et.al.	2501.19383	link
2025-01-31	TableMaster: A Recipe to Advance Table Understanding with Language Models	Lang Cao et.al.	2501.19378	null
2025-02-03	SELMA: A Speech-Enabled Language Model for Virtual Assistant Interactions	Dominik Wagner et.al.	2501.19377	null
2025-01-31	We're Different, We're the Same: Creative Homogeneity Across LLMs	Emily Wenger et.al.	2501.19361	null
2025-01-31	Mechanical Properties of the Meninges: Large Language Model Assisted Systematic Review of over 25,000 Studies	Brandon P. Chelstrom et.al.	2501.19359	null
2025-01-31	The Energy Loss Phenomenon in RLHF: A New Perspective on Mitigating Reward Hacking	Yuchun Miao et.al.	2501.19358	null
2025-01-31	Towards Adaptive Self-Improvement for Smarter Energy Systems	Alexander Sommer et.al.	2501.19340	null
2025-01-31	PixelWorld: Towards Perceiving Everything as Pixels	Zhiheng Lyu et.al.	2501.19339	null
2025-01-31	Homogeneity Bias as Differential Sampling Uncertainty in Language Models	Messi H. J. Lee et.al.	2501.19337	null
2025-01-31	Reward-Guided Speculative Decoding for Efficient LLM Reasoning	Baohao Liao et.al.	2501.19324	null
2025-01-31	MINDSTORES: Memory-Informed Neural Decision Synthesis for Task-Oriented Reinforcement in Embodied Systems	Anirudh Chari et.al.	2501.19318	null
2025-01-31	LLM-based Affective Text Generation Quality Based on Different Quantization Values	Yarik Menchaca Resendiz et.al.	2501.19317	null
2025-01-31	An Efficient Approach for Machine Translation on Low-resource Languages: A Case Study in Vietnamese-Chinese	Tran Ngoc Son et.al.	2501.19314	null
2025-01-30	Foundational Models for 3D Point Clouds: A Survey and Outlook	Vishal Thengane et.al.	2501.18594	null
2025-01-30	Advances in Multimodal Adaptation and Generalization: From Traditional Approaches to Foundation Models	Hao Dong et.al.	2501.18592	link
2025-01-30	Thoughts Are All Over the Place: On the Underthinking of o1-Like LLMs	Yue Wang et.al.	2501.18585	null
2025-01-30	Prediction-Powered Inference with Imputed Covariates and Nonuniform Sampling	Dan M. Kluger et.al.	2501.18577	link
2025-01-30	Token-Hungry, Yet Precise: DeepSeek R1 Highlights the Need for Multi-Step Reasoning Over Speed in MATH	Evgenii Evstafev et.al.	2501.18576	null
2025-01-30	BounTCHA: A CAPTCHA Utilizing Boundary Identification in AI-extended Videos	Lehao Lin et.al.	2501.18565	null
2025-01-30	SAM2Act: Integrating Visual Foundation Model with A Memory Architecture for Robotic Manipulation	Haoquan Fang et.al.	2501.18564	link
2025-01-30	Semantic Web and Creative AI -- A Technical Report from ISWS 2023	Raia Abu Ahmad et.al.	2501.18542	null
2025-01-30	Loss Functions and Operators Generated by f-Divergences	Vincent Roulet et.al.	2501.18537	null
2025-01-30	Illusions of Relevance: Using Content Injection Attacks to Deceive Retrievers, Rerankers, and LLM Judges	Manveer Singh Tamber et.al.	2501.18536	link
2025-01-30	Rethinking Bottlenecks in Safety Fine-Tuning of Vision Language Models	Yi Ding et.al.	2501.18533	null
2025-01-30	Differentially Private Steering for Large Language Model Alignment	Anmol Goel et.al.	2501.18532	link
2025-01-30	Learn from the Past: Language-conditioned Object Rearrangement with Large Language Models	Guanqun Cao et.al.	2501.18516	null
2025-01-30	Streaming DiLoCo with overlapping communication: Towards a Distributed Free Lunch	Arthur Douillard et.al.	2501.18512	null
2025-01-30	WILDCHAT-50M: A Deep Dive Into the Role of Synthetic Data in Post-Training	Benjamin Feuer et.al.	2501.18511	link
2025-01-30	CLEAR: Cue Learning using Evolution for Accurate Recognition Applied to Sustainability Data Extraction	Peter J. Bentley et.al.	2501.18504	null
2025-01-30	A Tool for In-depth Analysis of Code Execution Reasoning of Large Language Models	Changshu Liu et.al.	2501.18482	null
2025-01-30	CLoQ: Enhancing Fine-Tuning of Quantized LLMs via Calibrated LoRA Initialization	Yanxia Deng et.al.	2501.18475	null
2025-01-30	Tuning Vision Foundation Model via Test-Time Prompt-Guided Training for VFSS Segmentations	Chengxi Zeng et.al.	2501.18474	null
2025-01-30	A Benchmark and Evaluation for Real-World Out-of-Distribution Detection Using Vision-Language Models	Shiho Noda et.al.	2501.18463	link
2025-01-29	Learning Beyond the Surface: How Far Can Continual Pre-Training with LoRA Enhance LLMs' Domain-Specific Insight Learning?	Pouya Pezeshkpour et.al.	2501.17840	link
2025-01-29	Matrix Product Sketching via Coordinated Sampling	Majid Daliri et.al.	2501.17836	null
2025-01-29	Aggregation Schemes for Single-Vector WSI Representation Learning in Digital Pathology	Sobhan Hemati et.al.	2501.17822	null
2025-01-29	Leveraging Multimodal LLM for Inspirational User Interface Search	Seokhyeon Park et.al.	2501.17799	link
2025-01-29	BreezyVoice: Adapting TTS for Taiwanese Mandarin with Enhanced Polyphone Disambiguation -- Challenges and Insights	Chan-Jan Hsu et.al.	2501.17790	null
2025-01-29	Reasoning Over the Glyphs: Evaluation of LLM's Decipherment of Rare Scripts	Yu-Fei Shih et.al.	2501.17785	null
2025-01-29	AdditiveLLM: Large Language Models Predict Defects in Additive Manufacturing	Peter Pak et.al.	2501.17784	null
2025-01-29	2SSP: A Two-Stage Framework for Structured Pruning of LLMs	Fabrizio Sandri et.al.	2501.17771	link
2025-01-29	Hybrid Graphs for Table-and-Text based Question Answering using LLMs	Ankush Agarwal et.al.	2501.17767	null
2025-01-29	On the Partitioning of GPU Power among Multi-Instances	Tirth Vamja et.al.	2501.17752	null
2025-01-29	Early External Safety Testing of OpenAI's o3-mini: Insights from the Pre-Deployment Evaluation	Aitor Arrieta et.al.	2501.17749	null
2025-01-29	A technical review of multi-omics data integration methods: from classical statistical to deep generative approaches	Ana R. Baião et.al.	2501.17729	null
2025-01-29	Using Code Generation to Solve Open Instances of Combinatorial Design Problems	Christopher D. Rosin et.al.	2501.17725	link
2025-01-29	RICoTA: Red-teaming of In-the-wild Conversation with Test Attempts	Eujeong Choi et.al.	2501.17715	link
2025-01-29	Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate	Yubo Wang et.al.	2501.17703	null
2025-01-29	Planning with Vision-Language Models and a Use Case in Robot-Assisted Teaching	Xuzhe Dang et.al.	2501.17665	null
2025-01-29	Exploring Vision Language Models for Multimodal and Multilingual Stance Detection	Jake Vasilakes et.al.	2501.17654	null
2025-01-29	Tonguescape: Exploring Language Models Understanding of Vowel Articulation	Haruki Sakajo et.al.	2501.17643	link
2025-01-29	Efficient Redundancy Reduction for Open-Vocabulary Semantic Segmentation	Lin Chen et.al.	2501.17642	null
2025-01-29	In-Context Meta LoRA Generation	Yihua Shao et.al.	2501.17635	null
2025-01-28	SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training	Tianzhe Chu et.al.	2501.17161	null
2025-01-28	AxBench: Steering LLMs? Even Simple Baselines Outperform Sparse Autoencoders	Zhengxuan Wu et.al.	2501.17148	link
2025-01-28	FactCG: Enhancing Fact Checkers with Graph-Based Multi-Hop Data	Deren Lei et.al.	2501.17144	link
2025-01-28	ASTRAL: Automated Safety Testing of Large Language Models	Miriam Ugarte et.al.	2501.17132	null
2025-01-28	Scenario Understanding of Traffic Scenes Through Large Visual Language Models	Rivera Esteban et.al.	2501.17131	null
2025-01-28	Histoires Morales: A French Dataset for Assessing Moral Alignment	Thibaud Leteno et.al.	2501.17117	link
2025-01-28	Optimizing Large Language Model Training Using FP4 Quantization	Ruizhe Wang et.al.	2501.17116	null
2025-01-28	Unlocking Transparent Alignment Through Enhanced Inverse Constitutional AI for Principle Extraction	Carl-Leander Henneking et.al.	2501.17112	null
2025-01-28	COS(M+O)S: Curiosity and RL-Enhanced MCTS for Exploring Story Space via Language Models	Tobias Materzok et.al.	2501.17104	null
2025-01-28	Token-by-Token Regeneration and Domain Biases: A Benchmark of LLMs on Advanced Mathematical Problem-Solving	Evgenii Evstafev et.al.	2501.17084	null
2025-01-28	Contextual Self-paced Learning for Weakly Supervised Spatio-Temporal Video Grounding	Akash Kumar et.al.	2501.17053	null
2025-01-28	How Linguistics Learned to Stop Worrying and Love the Language Models	Richard Futrell et.al.	2501.17047	null
2025-01-28	Enhanced Retrieval of Long Documents: Leveraging Fine-Grained Block Representations with Large Language Models	Minghan Li et.al.	2501.17039	null
2025-01-28	Challenges in Ensuring AI Safety in DeepSeek-R1 Models: The Shortcomings of Reinforcement Learning Strategies	Manojkumar Parmar et.al.	2501.17030	null
2025-01-28	Automated Refactoring of Non-Idiomatic Python Code: A Differentiated Replication with LLMs	Alessandro Midolo et.al.	2501.17024	link
2025-01-28	Mobile Manipulation Instruction Generation from Multiple Images with Automatic Metric Enhancement	Kei Katsumata et.al.	2501.17022	link
2025-01-28	Large Language Models for Code Generation: The Practitioners Perspective	Zeeshan Rasheed et.al.	2501.16998	link
2025-01-28	Artificial Intelligence Clones	Annie Liang et.al.	2501.16996	null
2025-01-28	FedEFM: Federated Endovascular Foundation Model with Unseen Data	Tuong Do et.al.	2501.16992	null
2025-01-28	Modulating CNN Features with Pre-Trained ViT Representations for Open-Vocabulary Object Detection	Xiangyu Gao et.al.	2501.16981	null
2025-01-27	LUCY: Linguistic Understanding and Control Yielding Early Stage of Her	Heting Gao et.al.	2501.16327	link
2025-01-27	Evaluating The Performance of Using Large Language Models to Automate Summarization of CT Simulation Orders in Radiation Oncology	Meiyun Cao et.al.	2501.16309	null
2025-01-27	RAPID: Retrieval-Augmented Parallel Inference Drafting for Text-Based Video Event Retrieval	Long Nguyen et.al.	2501.16303	null
2025-01-27	Matryoshka Re-Ranker: A Flexible Re-Ranking Architecture With Configurable Depth and Width	Zheng Liu et.al.	2501.16302	null
2025-01-27	Large Models in Dialogue for Active Perception and Anomaly Detection	Tzoulio Chamiti et.al.	2501.16300	link
2025-01-27	FALCON: Resolving Visual Redundancy and Fragmentation in High-resolution Multimodal Large Language Models via Visual Registers	Renshan Zhang et.al.	2501.16297	null
2025-01-27	Brain-Adapter: Enhancing Neurological Disorder Analysis with Adapter-Tuning Multimodal Large Language Models	Jing Zhang et.al.	2501.16282	null
2025-01-27	Do LLMs Have Visualization Literacy? An Evaluation on Modified Visualizations to Test Generalization in Data Interpretation	Jiayi Hong et.al.	2501.16277	link
2025-01-27	URAG: Implementing a Unified Hybrid RAG for Precise Answers in University Admission Chatbots -- A Case Study at HCMUT	Long Nguyen et.al.	2501.16276	null
2025-01-27	Return of the Encoder: Maximizing Parameter Efficiency for SLMs	Mohamed Elfeki et.al.	2501.16273	link
2025-01-27	A foundation model for human-AI collaboration in medical literature mining	Zifeng Wang et.al.	2501.16255	null
2025-01-27	Multi-Agent Geospatial Copilots for Remote Sensing Workflows	Chaehong Lee et.al.	2501.16254	null
2025-01-27	Zero-Shot Decision Tree Construction via Large Language Models	Lucas Carrasco et.al.	2501.16247	null
2025-01-27	CLISC: Bridging clip and sam by enhanced cam for unsupervised brain tumor segmentation	Xiaochuan Ma et.al.	2501.16246	null
2025-01-27	Phase Transitions in Large Language Models and the $O(N)$ Model	Youran Sun et.al.	2501.16241	null
2025-01-27	AiGet: Transforming Everyday Moments into Hidden Knowledge Discovery with AI Assistance on Smart Glasses	Runze Cai et.al.	2501.16240	link
2025-01-27	Distilling foundation models for robust and efficient models in digital pathology	Alexandre Filiot et.al.	2501.16239	null
2025-01-27	Language-Based Bayesian Optimization Research Assistant (BORA)	Abdoulatif Cissé et.al.	2501.16224	null
2025-01-27	Enhancing Visual Inspection Capability of Multi-Modal Large Language Models on Medical Time Series with Supportive Conformalized and Interpretable Small Specialized Models	Huayu Li et.al.	2501.16215	link
2025-01-27	Provence: efficient and robust context pruning for retrieval-augmented generation	Nadezhda Chirkova et.al.	2501.16214	null
2025-01-24	HERMES: A Unified Self-Driving World Model for Simultaneous 3D Scene Understanding and Generation	Xin Zhou et.al.	2501.14729	link
2025-01-24	Do LLMs Provide Consistent Answers to Health-Related Questions across Languages?	Ipek Baris Schlicht et.al.	2501.14719	null
2025-01-24	Towards Better Understanding Table Instruction Tuning: Decoupling the Effects from Data versus Models	Naihao Deng et.al.	2501.14717	null
2025-01-24	FlexiGPT: Pruning and Extending Large Language Models with Low-Rank Weight Sharing	James Seale Smith et.al.	2501.14713	null
2025-01-24	The Karp Dataset	Mason DiCicco et.al.	2501.14705	null
2025-01-24	Rethinking Table Instruction Tuning	Naihao Deng et.al.	2501.14693	null
2025-01-24	Rethinking Foundation Models for Medical Image Classification through a Benchmark Study on MedMNIST	Fuping Wu et.al.	2501.14685	null
2025-01-24	An Empirical Study on LLM-based Classification of Requirements-related Provisions in Food-safety Regulations	Shabnam Hassani et.al.	2501.14683	null
2025-01-24	Diffusion based Text-to-Music Generationwith Global and Local Text based Conditioning	Jisi Zhang et.al.	2501.14680	null
2025-01-24	MedAgentBench: Dataset for Benchmarking LLMs as Agents in Medical Applications	Yixing Jiang et.al.	2501.14654	link
2025-01-24	Investigating the (De)Composition Capabilities of Large Language Models in Natural-to-Formal Language Conversion	Ziyao Xu et.al.	2501.14649	link
2025-01-24	Recommending Actionable Strategies: A Semantic Approach to Integrating Analytical Frameworks with Decision Heuristics	Renato Ghisellini et.al.	2501.14634	null
2025-01-24	Extracting Problem Structure with LLMs for Optimized SAT Local Search	André Schilder et.al.	2501.14630	null
2025-01-24	ReferDINO: Referring Video Object Segmentation with Visual Grounding Foundations	Tianming Liang et.al.	2501.14607	null
2025-01-24	Knowledge Graphs Construction from Criminal Court Appeals: Insights from the French Cassation Court	Alexander V. Belikov et.al.	2501.14579	null
2025-01-24	ZETA: Leveraging Z-order Curves for Efficient Top-k Attention	Qiuhao Zeng et.al.	2501.14577	null
2025-01-24	Large-scale and Fine-grained Vision-language Pre-training for Enhanced CT Image Understanding	Zhongyi Shui et.al.	2501.14548	link
2025-01-24	Leveraging ChatGPT's Multimodal Vision Capabilities to Rank Satellite Images by Poverty Level: Advancing Tools for Social Science Research	Hamid Sarmadi et.al.	2501.14546	null
2025-01-24	VERUS-LM: a Versatile Framework for Combining LLMs with Symbolic Reasoning	Benjamin Callewaert et.al.	2501.14540	null
2025-01-24	Design and Implementation of a Psychiatry Resident Training System Based on Large Language Models	Zhenguang Zhong et.al.	2501.14530	link
2025-01-23	CRPO: Confidence-Reward Driven Preference Optimization for Machine Translation	Guofeng Cui et.al.	2501.13927	null
2025-01-23	The Breeze 2 Herd of Models: Traditional Chinese LLMs Based on Llama with Vision-Aware and Function-Calling Capabilities	Chan-Jan Hsu et.al.	2501.13921	link
2025-01-23	Analysis of Indic Language Capabilities in LLMs	Aatman Vaidya et.al.	2501.13912	null
2025-01-23	Privacy-Preserving Personalized Federated Prompt Learning for Multimodal Large Language Models	Linh Tran et.al.	2501.13904	null
2025-01-23	Exploring Finetuned Audio-LLM on Heart Murmur Features	Adrian Florea et.al.	2501.13884	null
2025-01-23	The machine learning platform for developers of large systems	Alexey Naikov et.al.	2501.13881	null
2025-01-23	A RAG-Based Institutional Assistant	Gustavo Kuratomi et.al.	2501.13880	null
2025-01-23	Dual-Modal Prototype Joint Learning for Compositional Zero-Shot Learning	Shiyu Zhang et.al.	2501.13859	null
2025-01-23	Large Vision-Language Models for Knowledge-Grounded Data Annotation of Memes	Shiling Deng et.al.	2501.13851	link
2025-01-23	Think Outside the Data: Colonial Biases and Systemic Issues in Automated Moderation Pipelines for Low-Resource Languages	Farhana Shahid et.al.	2501.13836	null
2025-01-23	On the Reasoning Capacity of AI Models and How to Quantify It	Santosh Kumar Radha et.al.	2501.13833	null
2025-01-23	Predicting Compact Phrasal Rewrites with Large Language Models for ASR Post Editing	Hao Zhang et.al.	2501.13831	null
2025-01-23	Hallucinations Can Improve Large Language Models in Drug Discovery	Shuzhou Yuan et.al.	2501.13824	null
2025-01-23	Large Language Model driven Policy Exploration for Recommender Systems	Jie Wang et.al.	2501.13816	null
2025-01-23	Enhancing LLMs for Governance with Human Oversight: Evaluating and Aligning LLMs on Expert Classification of Climate Misinformation for Detecting False or Misleading Claims about Climate Change	Mowafak Allaham et.al.	2501.13802	null
2025-01-23	PromptMono: Cross Prompting Attention for Self-Supervised Monocular Depth Estimation in Challenging Environments	Changhao Wang et.al.	2501.13796	null
2025-01-23	Training-Free Zero-Shot Temporal Action Detection with Vision-Language Models	Chaolei Han et.al.	2501.13795	link
2025-01-23	Parameter-Efficient Fine-Tuning for Foundation Models	Dan Zhang et.al.	2501.13787	link
2025-01-23	Not Every AI Problem is a Data Problem: We Should Be Intentional About Data Scaling	Tanya Rodchenko et.al.	2501.13779	null
2025-01-23	Explainable XR: Understanding User Behaviors of XR Environments using LLM-assisted Analytics Framework	Yoonsang Kim et.al.	2501.13778	link
2025-01-22	VideoLLaMA 3: Frontier Multimodal Foundation Models for Image and Video Understanding	Boqiang Zhang et.al.	2501.13106	link
2025-01-22	Refining Input Guardrails: Enhancing LLM-as-a-Judge Efficiency Through Chain-of-Thought Fine-Tuning and Alignment	Melissa Kazemi Rad et.al.	2501.13080	null
2025-01-22	Autonomy-of-Experts Models	Ang Lv et.al.	2501.13074	null
2025-01-22	Does Table Source Matter? Benchmarking and Improving Multimodal Scientific Table Understanding and Reasoning	Bohao Yang et.al.	2501.13042	link
2025-01-22	Pairwise RM: Perform Best-of-N Sampling with Knockout Tournament	Yantao Liu et.al.	2501.13007	link
2025-01-22	Large Language Model-Based Semantic Communication System for Image Transmission	Soheyb Ribouh et.al.	2501.12988	null
2025-01-22	LLM4WM: Adapting LLM for Wireless Multi-Tasking	Xuanyu Liu et.al.	2501.12983	null
2025-01-22	OnionEval: An Unified Evaluation of Fact-conflicting Hallucination for Small-Large Language Models	Chongren Sun et.al.	2501.12975	link
2025-01-22	Accessible Smart Contracts Verification: Synthesizing Formal Models with Tamed LLMs	Jan Corazza et.al.	2501.12972	link
2025-01-22	It's complicated. The relationship of algorithmic fairness and non-discrimination regulations in the EU AI Act	Kristof Meding et.al.	2501.12962	null
2025-01-22	Efficient Prompt Compression with Evaluator Heads for Long-Context Transformer Inference	Weizhi Fei et.al.	2501.12959	null
2025-01-22	GANQ: GPU-Adaptive Non-Uniform Quantization for Large Language Models	Pengxiang Zhao et.al.	2501.12956	null
2025-01-22	Correctness Assessment of Code Generated by Large Language Models Using Internal Representations	Tuan-Dung Bui et.al.	2501.12934	link
2025-01-22	DynamicEarth: How Far are We from Open-Vocabulary Change Detection?	Kaiyu Li et.al.	2501.12931	null
2025-01-22	A Functional Software Reference Architecture for LLM-Integrated Systems	Alessio Bucaioni et.al.	2501.12904	null
2025-01-22	Architectural Fusion Through Contextual Partitioning in Large Language Models: A Novel Approach to Parameterized Knowledge Integration	Offa Kingsleigh et.al.	2501.12901	null
2025-01-22	Test-Time Preference Optimization: On-the-Fly Alignment via Iterative Textual Feedback	Yafu Li et.al.	2501.12895	link
2025-01-22	Generative AI Misuse Potential in Cyber Security Education: A Case Study of a UK Degree Program	Carlton Shepherd et.al.	2501.12883	null
2025-01-22	WisdomBot: Tuning Large Language Models with Artificial Intelligence Knowledge	Jingyuan Chen et.al.	2501.12877	null
2025-01-22	HierPromptLM: A Pure PLM-based Framework for Representation Learning on Heterogeneous Text-rich Networks	Qiuyu Zhu et.al.	2501.12857	null
2025-01-21	InternVideo2.5: Empowering Video MLLMs with Long and Rich Context Modeling	Yi Wang et.al.	2501.12386	link
2025-01-21	MMVU: Measuring Expert-Level Multi-Discipline Video Understanding	Yilun Zhao et.al.	2501.12380	link
2025-01-21	Expertise elevates AI usage: experimental evidence comparing laypeople and professional artists	Thomas F. Eisenmann et.al.	2501.12374	link
2025-01-21	Is Long Context All You Need? Leveraging LLM's Extended Context for NL2SQL	Yeounoh Chung et.al.	2501.12372	link
2025-01-21	Parameters vs FLOPs: Scaling Laws for Optimal Sparsity for Mixture-of-Experts Language Models	Samira Abnar et.al.	2501.12370	null
2025-01-21	InternLM-XComposer2.5-Reward: A Simple Yet Effective Multi-Modal Reward Model	Yuhang Zang et.al.	2501.12368	link
2025-01-21	Vision-Language Models for Automated Chest X-ray Interpretation: Leveraging ViT and GPT-2	Md. Rakibul Islam et.al.	2501.12356	null
2025-01-21	Automatic Labelling with Open-source LLMs using Dynamic Label Schema Integration	Thomas Walshe et.al.	2501.12332	null
2025-01-21	Cinepro: Robust Training of Foundation Models for Cancer Detection in Prostate Ultrasound Cineloops	Mohamed Harmanani et.al.	2501.12331	link
2025-01-21	VARGPT: Unified Understanding and Generation in a Visual Autoregressive Multimodal Large Language Model	Xianwei Zhuang et.al.	2501.12327	link
2025-01-21	LLM-Assisted Knowledge Graph Completion for Curriculum and Domain Modelling in Personalized Higher Education Recommendations	Hasan Abu-Rasheed et.al.	2501.12300	null
2025-01-21	MoGERNN: An Inductive Traffic Predictor for Unobserved Locations in Dynamic Sensing Networks	Qishen Zhou et.al.	2501.12281	link
2025-01-21	Condor: Enhance LLM Alignment with Knowledge-Driven Data Synthesis and Refinement	Maosong Cao et.al.	2501.12273	link
2025-01-21	CBVLM: Training-free Explainable Concept-based Large Vision Language Models for Medical Image Classification	Cristiano Patrício et.al.	2501.12266	null
2025-01-21	FOCUS: First Order Concentrated Updating Scheme	Yizhou Liu et.al.	2501.12243	null
2025-01-21	InsTALL: Context-aware Instructional Task Assistance with Multi-modal Large Language Models	Pha Nguyen et.al.	2501.12231	null
2025-01-21	CDW-CoT: Clustered Distance-Weighted Chain-of-Thoughts Reasoning	Yuanheng Fang et.al.	2501.12226	null
2025-01-21	Leveraging Large Language Models for Realizing Truly Intelligent User Interfaces	Allard Oelen et.al.	2501.12221	null
2025-01-21	You Can't Eat Your Cake and Have It Too: The Performance Degradation of LLMs with Jailbreak Defense	Wuyuao Mai et.al.	2501.12210	null
2025-01-21	Fixing Imbalanced Attention to Mitigate In-Context Hallucination of Large Vision-Language Model	Kazi Hasan Ibn Arif et.al.	2501.12206	link
2025-01-17	FaceXBench: Evaluating Multimodal LLMs on Face Understanding	Kartik Narayan et.al.	2501.10360	link
2025-01-17	Agent4Edu: Generating Learner Response Data by Generative Agents for Intelligent Education Systems	Weibo Gao et.al.	2501.10332	link
2025-01-17	BoK: Introducing Bag-of-Keywords Loss for Interpretable Dialogue Response Generation	Suvodip Dey et.al.	2501.10328	link
2025-01-17	Large language models for automated scholarly paper review: A survey	Zhenzhen Zhuang et.al.	2501.10326	null
2025-01-17	Hierarchical Autoregressive Transformers: Combining Byte-~and Word-Level Processing for Robust, Adaptable Language Models	Pit Neitemeier et.al.	2501.10322	null
2025-01-17	HiMix: Reducing Computational Complexity in Large Vision-Language Models	Xuange Zhang et.al.	2501.10318	null
2025-01-17	Addressing Popularity Bias in Third-Party Library Recommendations Using LLMs	Claudio Di Sipio et.al.	2501.10313	null
2025-01-17	Computational Protein Science in the Era of Large Language Models (LLMs)	Wenqi Fan et.al.	2501.10282	null
2025-01-17	Test Wars: A Comparative Study of SBST, Symbolic Execution, and LLM-Based Approaches to Unit Test Generation	Azat Abdullin et.al.	2501.10200	null
2025-01-17	Generative Artificial Intelligence: Implications for Biomedical and Health Professions Education	William Hersh et.al.	2501.10186	null
2025-01-17	Multi-stage Training of Bilingual Islamic LLM for Neural Passage Retrieval	Vera Pavlova et.al.	2501.10175	null
2025-01-17	Dual Debiasing: Remove Stereotypes and Keep Factual Gender for Fair Language Modeling and Translation	Tomasz Limisiewicz et.al.	2501.10150	null
2025-01-17	A Vision-Language Framework for Multispectral Scene Representation Using Language-Grounded Features	Enes Karanfil et.al.	2501.10144	null
2025-01-17	Exploring the Impact of Generative Artificial Intelligence in Education: A Thematic Analysis	Abhishek Kaushik et.al.	2501.10134	null
2025-01-17	ComplexFuncBench: Exploring Multi-Step and Constrained Function Calling under Long-Context Scenario	Lucen Zhong et.al.	2501.10132	link
2025-01-17	PaSa: An LLM Agent for Comprehensive Academic Paper Search	Yichen He et.al.	2501.10120	link
2025-01-17	LLM Reasoner and Automated Planner: A new NPC approach	Israel Puerta-Merino et.al.	2501.10106	null
2025-01-17	Universal Actions for Enhanced Embodied Foundation Models	Jinliang Zheng et.al.	2501.10105	link
2025-01-17	Few-shot Structure-Informed Machinery Part Segmentation with Foundation Models and Graph Neural Networks	Michael Schwingshackl et.al.	2501.10080	link
2025-01-17	SpatialCoT: Advancing Spatial Reasoning through Coordinate Alignment and Chain-of-Thought for Embodied Task Planning	Yuecheng Liu et.al.	2501.10074	null
2025-01-16	Distilling Multi-modal Large Language Models for Autonomous Driving	Deepti Hegde et.al.	2501.09757	null
2025-01-16	Lost in Translation, Found in Context: Sign Language Translation with Contextual Cues	Youngjoon Jang et.al.	2501.09754	null
2025-01-16	OmniThink: Expanding Knowledge Boundaries in Machine Writing through Thinking	Zekun Xi et.al.	2501.09751	link
2025-01-16	Enhancing Lexicon-Based Text Embeddings with Large Language Models	Yibin Lei et.al.	2501.09749	null
2025-01-16	Suggesting Code Edits in Interactive Machine Learning Notebooks Using Large Language Models	Bihui Jin et.al.	2501.09745	null
2025-01-16	Inference-Time Scaling for Diffusion Models beyond Scaling Denoising Steps	Nanye Ma et.al.	2501.09732	null
2025-01-16	A Simple Aerial Detection Baseline of Multimodal Language Models	Qingyun Li et.al.	2501.09720	link
2025-01-16	CyberMentor: AI Powered Learning Tool Platform to Address Diverse Student Needs in Cybersecurity Education	Tianyu Wang et.al.	2501.09709	link
2025-01-16	Domain Adaptation of Foundation LLMs for e-Commerce	Christian Herold et.al.	2501.09706	null
2025-01-16	Cueless EEG imagined speech for subject identification: dataset and benchmarks	Ali Derakhshesh et.al.	2501.09700	link
2025-01-16	Mitigating Hallucinations in Large Vision-Language Models via DPO: On-Policy Data Hold the Key	Zhihe Yang et.al.	2501.09695	link
2025-01-16	Simulated Interactive Debugging	Yannic Noller et.al.	2501.09694	null
2025-01-16	Towards Large Reasoning Models: A Survey of Reinforced Reasoning with Large Language Models	Fengli Xu et.al.	2501.09686	null
2025-01-16	Reward-Guided Controlled Generation for Inference-Time Alignment in Diffusion Models: Tutorial and Review	Masatoshi Uehara et.al.	2501.09685	null
2025-01-16	Robin: a Suite of Multi-Scale Vision-Language Models and the CHIRP Evaluation Benchmark	Alexis Roger et.al.	2501.09672	null
2025-01-16	A Survey of Research in Large Language Models for Electronic Design Automation	Jingyu Pan et.al.	2501.09655	null
2025-01-16	The Heap: A Contamination-Free Multilingual Code Dataset for Evaluating Large Language Models	Jonathan Katzy et.al.	2501.09653	null
2025-01-16	CarMem: Enhancing Long-Term Memory in LLM Voice Assistants through Category-Bounding	Johannes Kirmayr et.al.	2501.09645	link
2025-01-16	LLM-Based Routing in Mixture of Experts: A Novel Framework for Trading	Kuan-Ming Liu et.al.	2501.09636	null
2025-01-16	Empowering Large Language Models in Wireless Communication: A Novel Dataset and Fine-Tuning Framework	Yushen Lin et.al.	2501.09631	null
2025-01-15	Towards Fast, Specialized Machine Learning Force Fields: Distilling Foundation Models via Energy Hessians	Ishan Amin et.al.	2501.09009	link
2025-01-15	Aegis2.0: A Diverse AI Safety Dataset and Risks Taxonomy for Alignment of LLM Guardrails	Shaona Ghosh et.al.	2501.09004	null
2025-01-15	Vision Foundation Models for Computed Tomography	Suraj Pai et.al.	2501.09001	link
2025-01-15	CityLoc: 6 DoF Localization of Text Descriptions in Large-Scale Scenes with Gaussian Representation	Qi Ma et.al.	2501.08982	null
2025-01-15	Development and Validation of the Provider Documentation Summarization Quality Instrument for Large Language Models	Emma Croxford et.al.	2501.08977	null
2025-01-15	Learning to Extract Cross-Domain Aspects and Understanding Sentiments Using Large Language Models	Karukriti Kaushik Ghosh et.al.	2501.08974	null
2025-01-15	Analyzing the Ethical Logic of Six Large Language Models	W. Russell Neuman et.al.	2501.08951	null
2025-01-15	Applying General Turn-taking Models to Conversational Human-Robot Interaction	Gabriel Skantze et.al.	2501.08946	null
2025-01-15	Disentangling Exploration of Large Language Models by Optimal Exploitation	Tim Grams et.al.	2501.08925	null
2025-01-15	GenAI Content Detection Task 3: Cross-Domain Machine-Generated Text Detection Challenge	Liam Dugan et.al.	2501.08913	link
2025-01-15	Leveraging Large Language Models as Knowledge-Driven Agents for Reliable Retrosynthesis Planning	Qinyu Ma et.al.	2501.08897	link
2025-01-15	Generative Planning with 3D-vision Language Pre-training for End-to-End Autonomous Driving	Tengpeng Li et.al.	2501.08861	link
2025-01-15	Exploring Task-Level Optimal Prompts for Visual In-Context Learning	Yan Zhu et.al.	2501.08841	null
2025-01-15	IDEA: Image Description Enhanced CLIP-Adapter	Zhipeng Ye et.al.	2501.08816	link
2025-01-15	How Developers Interact with AI: A Taxonomy of Human-AI Collaboration in Software Engineering	Christoph Treude et.al.	2501.08774	null
2025-01-15	Admitting Ignorance Helps the Video Question Answering Models to Answer	Haopeng Li et.al.	2501.08771	null
2025-01-15	Enhanced Large Language Models for Effective Screening of Depression and Anxiety	June M. Liu et.al.	2501.08769	null
2025-01-15	Leveraging LLM Agents for Translating Network Configurations	Yunze Wei et.al.	2501.08760	null
2025-01-15	Expanding Vietnamese SentiWordNet to Improve Performance of Vietnamese Sentiment Analysis Models	Hong-Viet Tran et.al.	2501.08758	null
2025-01-15	The Inherent Limits of Pretrained LLMs: The Unexpected Convergence of Instruction Tuning and In-Context Learning Capabilities	Irina Bigoulaeva et.al.	2501.08716	link
2025-01-14	PokerBench: Training Large Language Models to become Professional Poker Players	Richard Zhuang et.al.	2501.08328	link
2025-01-14	Omni-RGPT: Unifying Image and Video Region-level Understanding via Token Marks	Miran Heo et.al.	2501.08326	null
2025-01-14	ADAM-1: AI and Bioinformatics for Alzheimer's Detection and Microbiome-Clinical Data Integrations	Ziyuan Huang et.al.	2501.08324	null
2025-01-14	Exploring Robustness of Multilingual LLMs on Real-World Noisy Data	Amirhossein Aliakbarzadeh et.al.	2501.08322	link
2025-01-14	Enhancing Automated Interpretability with Output-Centric Feature Descriptions	Yoav Gur-Arieh et.al.	2501.08319	link
2025-01-14	MiniMax-01: Scaling Foundation Models with Lightning Attention	MiniMax et.al.	2501.08313	null
2025-01-14	HALoGEN: Fantastic LLM Hallucinations and Where to Find Them	Abhilasha Ravichander et.al.	2501.08292	null
2025-01-14	LLaVA-ST: A Multimodal Large Language Model for Fine-Grained Spatial-Temporal Understanding	Hongyu Li et.al.	2501.08282	link
2025-01-14	Exploring Robustness of LLMs to Sociodemographically-Conditioned Paraphrasing	Pulkit Arora et.al.	2501.08276	null
2025-01-14	Addressing the sustainable AI trilemma: a case study on LLM agents and RAG	Hui Wu et.al.	2501.08262	link
2025-01-14	Eliciting In-context Retrieval and Reasoning for Long-context Large Language Models	Yifu Qiu et.al.	2501.08248	null
2025-01-14	Text-Diffusion Red-Teaming of Large Language Models: Unveiling Harmful Behaviors with Proximity Constraints	Jonathan Nöther et.al.	2501.08246	null
2025-01-14	Investigating Energy Efficiency and Performance Trade-offs in LLM Inference Across Tasks and DVFS Settings	Paul Joe Maliakel et.al.	2501.08219	null
2025-01-14	ASTRID -- An Automated and Scalable TRIaD for the Evaluation of RAG-based Clinical Question Answering Systems	Mohita Chowdhury et.al.	2501.08208	null
2025-01-14	ArithmAttack: Evaluating Robustness of LLMs to Noisy Context in Math Problem Solving	Zain Ul Abedin et.al.	2501.08203	null
2025-01-14	CWEval: Outcome-driven Evaluation on Functionality and Security of LLM Code Generation	Jinjun Peng et.al.	2501.08200	link
2025-01-14	OpenCSG Chinese Corpus: A Series of High-quality Chinese Datasets for LLM Training	Yijiong Yu et.al.	2501.08197	link
2025-01-14	PRESERVE: Prefetching Model Weights and KV-Cache in Distributed LLM Serving	Ahmet Caner Yüzügüler et.al.	2501.08192	null
2025-01-14	A Critical Synthesis of Uncertainty Quantification and Foundation Models in Monocular Depth Estimation	Steven Landgraf et.al.	2501.08188	null
2025-01-14	A Multi-Modal AI Copilot for Single-Cell Analysis with Instruction Following	Yin Fang et.al.	2501.08187	link
2025-01-13	Training-Free Motion-Guided Video Generation with Enhanced Temporal Consistency Using Motion Consistency Loss	Xinyu Zhang et.al.	2501.07563	null
2025-01-13	SST-EM: Advanced Metrics for Evaluating Semantic, Spatial and Temporal Aspects in Video Editing	Varun Biyyala et.al.	2501.07554	link
2025-01-13	Imagine while Reasoning in Space: Multimodal Visualization-of-Thought	Chengzu Li et.al.	2501.07542	null
2025-01-13	ML Mule: Mobile-Driven Context-Aware Collaborative Learning	Haoxiang Yu et.al.	2501.07536	null
2025-01-13	Investigating Large Language Models in Inferring Personality Traits from User Conversations	Jianfeng Zhu et.al.	2501.07532	null
2025-01-13	RadAlign: Advancing Radiology Report Generation with Vision-Language Concept Alignment	Difei Gu et.al.	2501.07525	link
2025-01-13	Parallel Key-Value Cache Fusion for Position Invariant RAG	Philhoon Oh et.al.	2501.07523	null
**

Name		Name	Last commit message	Last commit date
Latest commit History 924 Commits
.github/workflows		.github/workflows
docs		docs
README.md		README.md
config.yaml		config.yaml
daily_arxiv.py		daily_arxiv.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Updated on 2025.06.11

Path Planning

Large Language Model

About

Uh oh!

Releases

Packages

Uh oh!

Languages

XuzhaoLi/ro-arxiv-daily

Folders and files

Latest commit

History

Repository files navigation

Updated on 2025.06.11

Path Planning

Large Language Model

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages