XuzhaoLi/ro-arxiv-daily

Updated on 2025.06.11

Table of Contents
  1. Path Planning
  2. Large Language Model
  3. Autonomous Driving

Path Planning

Publish Date Title Authors PDF Code
2025-06-10 Reinforce LLM Reasoning through Multi-Agent Reflection Yurun Yuan et.al. 2506.08379 null
2025-06-10 Dynamical System Optimization Emo Todorov et.al. 2506.08340 null
2025-06-09 Modelling Nonstationary Time Series using Trend-Stationary Hypothesis Zhandos Abdikhadir et.al. 2506.07987 null
2025-06-08 Stochastic Quadratic Dynamic Programming Vincent Guigues et.al. 2506.07314 null
2025-06-05 Resilient Pattern Mining Pengxin Bian et.al. 2506.04935 null
2025-06-05 Composing Agents to Minimize Worst-case Risk Guruprerana Shabadi et.al. 2506.04632 null
2025-06-04 Struct2D: A Perception-Guided Framework for Spatial Reasoning in Large Multimodal Models Fangrui Zhu et.al. 2506.04220 null
2025-05-28 Large Neighborhood and Hybrid Genetic Search for Inventory Routing Problems Jingyi Zhao et.al. 2506.03172 null
2025-06-03 Dynamic Programming Techniques for Enhancing Cognitive Representation in Knowledge Tracing Lixiang Xu et.al. 2506.02949 null
2025-06-03 Reachability Weighted Offline Goal-conditioned Resampling Wenyan Yang et.al. 2506.02577 null
2025-06-03 Multi-agent Markov Entanglement Shuze Chen et.al. 2506.02385 null
2025-06-02 Scalable In-Context Q-Learning Jinmei Liu et.al. 2506.01299 null
2025-06-01 Trilevel Memetic Algorithm for the Electric Vehicle Routing Problem Ivan Milinović et.al. 2506.01065 null
2025-06-01 Q-learning with Posterior Sampling Priyank Agrawal et.al. 2506.00917 null
2025-05-30 GridRoute: A Benchmark for LLM-Based Route Planning with Cardinal Movement in Grid Environments Kechen Li et.al. 2505.24306 null
2025-05-30 Winners vs. Losers: Momentum-based Strategies with Intertemporal Choice for ESG Portfolios Ayush Jha et.al. 2505.24250 null
2025-05-30 CLaSp: In-Context Layer Skip for Self-Speculative Decoding Longze Chen et.al. 2505.24196 null
2025-05-29 Spoken Language Modeling with Duration-Penalized Self-Supervised Units Nicol Visser et.al. 2505.23494 link
2025-05-29 Offline Map Matching Based on Localization Error Distribution Modeling Ruilin Xu et.al. 2505.23123 null
2025-05-29 DINGO: Constrained Inference for Diffusion LLMs Tarun Suresh et.al. 2505.23061 null
2025-05-27 Learning-Based Tracking Perimeter Control for Two-region Macroscopic Traffic Dynamics Can Chen et.al. 2505.21818 null
2025-05-27 When to Deceive: A Cross-Layer Stackelberg Game Framework for Strategic Timing of Cyber Deception Ya-Ting Yang et.al. 2505.21244 null
2025-05-23 Evaluating the Energy-Efficiency of the Code Generated by LLMs Md Arman Islam et.al. 2505.20324 null
2025-05-23 URB -- Urban Routing Benchmark for RL-equipped Connected Autonomous Vehicles Ahmet Onur Akman et.al. 2505.17734 null
2025-05-23 Distance Estimation in Outdoor Driving Environments Using Phase-only Correlation Method with Event Cameras Masataka Kobayashi et.al. 2505.17582 null
2025-05-22 Tropical Attention: Neural Algorithmic Reasoning for Combinatorial Algorithms Baran Hashemi et.al. 2505.17190 null
2025-05-22 Quantum Routing and Entanglement Dynamics Through Bottlenecks Dhruv Devulapalli et.al. 2505.16948 null
2025-05-22 Reward-Aware Proto-Representations in Reinforcement Learning Hon Tik Tse et.al. 2505.16217 null
2025-05-21 Toward Theoretical Insights into Diffusion Trajectory Distillation via Operator Merging Weiguo Gao et.al. 2505.16024 null
2025-05-21 Families of tractable problems with respect to vertex-interval-membership width and its generalisations Jessica Enright et.al. 2505.15699 null
2025-05-21 Deep Learning for Continuous-time Stochastic Control with Jumps Patrick Cheridito et.al. 2505.15602 null
2025-05-19 Finding Maximum Independent Sets in Dynamic Graphs using Unsupervised Learning Devendra Parkar et.al. 2505.13754 null
2025-05-24 Learning to Program Quantum Measurements for Machine Learning Samuel Yen-Chi Chen et.al. 2505.13525 null
2025-05-19 Dynamic programming and dimensionality in convex stochastic optimization and control Teemu Pennanen et.al. 2505.12787 null
2025-05-18 Resolving Latency and Inventory Risk in Market Making with Reinforcement Learning Junzhe Jiang et.al. 2505.12465 null
2025-05-16 Co-Evolutionary Defence of Active Directory Attack Graphs via GNN-Approximated Dynamic Programming Diksha Goel et.al. 2505.11710 null
2025-05-15 Multi-Objective Memory Bandwidth Regulation and Cache Partitioning for Multicore Real-Time Systems Binqi Sun et.al. 2505.11554 null
2025-05-16 Sobolev Training of End-to-End Optimization Proxies Andrew W. Rosemberg et.al. 2505.11342 null
2025-05-16 Beyond KL-divergence: Risk Aware Control Through Cross Entropy and Adversarial Entropy Regularization Menno van Zutphen et.al. 2505.11068 null
2025-05-15 Scalable Approximate Biclique Counting over Large Bipartite Graphs Jingbang Chen et.al. 2505.10471 null
2025-05-14 Reflected stochastic recursive control problems with jumps: dynamic programming and stochastic verification theorems Lu Liu et.al. 2505.09070 null
2025-05-13 Optimal Trajectory Planning with Collision Avoidance for Autonomous Vehicle Maneuvering Jason Zalev et.al. 2505.08724 null
2025-05-13 Distributionally Robust LQG with Kullback-Leibler Ambiguity Sets Marta Fochesato et.al. 2505.08370 null
2025-05-11 Optimal control of convective Brinkman-Forchheimer equations: Dynamic programming equation and Viscosity solutions Sagar Gautam et.al. 2505.07095 null
2025-05-10 Optimizing Railcar Movements to Create Outbound Trains in a Freight Railyard Ruonan Zhao et.al. 2505.06510 null
2025-05-09 Scheduled Jacobian Chaining Simon Märtens et.al. 2505.06056 link
2025-05-09 Universal Approximation Theorem for Deep Q-Learning via FBSDE System Qian Qi et.al. 2505.06023 null
2025-05-09 Data-driven pressure field prediction for ships in regular sea states Malte Loft et.al. 2505.06014 null
2025-05-09 Multi-armed Bandit for Stochastic Shortest Path in Mixed Autonomy Yu Bai et.al. 2505.05878 null
2025-05-10 Driving with Context: Online Map Matching for Complex Roads Using Lane Markings and Scenario Recognition Xin Bi et.al. 2505.05007 link
2025-05-08 Chain-of-Thought Tokens are Computer Program Variables Fangwei Zhu et.al. 2505.04955 link
2025-05-08 Network Digital Twin for Route Optimization in 5G/B5G Transport Slicing with What-If Analysis Rebecca Aben-Athar et.al. 2505.04879 null
2025-05-06 Stochastic scheduling with Bernoulli-type jobs through policy stratification Antonios Antoniadis et.al. 2505.03349 null
2025-05-05 A Fully Data-Driven Value Iteration for Stochastic LQR: Convergence, Robustness and Stability Leilei Cui et.al. 2505.02970 null
2025-05-03 Multistage stochastic optimization for drayage procurement in container logistics using stochastic dual dynamic programming Georgios Vassos et.al. 2505.01813 null
2025-05-03 Integrated optimization of operations and capacity planning under uncertainty for drayage procurement in container logistics Georgios Vassos et.al. 2505.01808 link
2025-05-03 Evaluating Input Modalities for Pilot-Centered Taxiway Navigation: Insights from a Wizard-of-Oz Simulation Chan Chea Mean et.al. 2505.01679 null
2025-05-03 Morello: Compiling Fast Neural Networks with Dynamic Programming and Spatial Compression Samuel J. Kaufman et.al. 2505.01637 link
2025-05-02 Global Collinearity-aware Polygonizer for Polygonal Building Mapping in Remote Sensing Fahong Zhang et.al. 2505.01385 null
2025-05-02 Power System Transition Planning: An Industry-Aligned Framework for Long-Term Optimization Ahmed Al-Shafei et.al. 2505.01331 null
2025-05-02 A stochastic Gordon-Loeb model for optimal cybersecurity investment under clustered attacks Giorgia Callegaro et.al. 2505.01221 null
2025-05-02 Remote Estimation over Packet-Dropping Wireless Channels with Partial State Information Ioannis Tzortzis et.al. 2505.01132 null
2025-05-01 Quantum Computing in Industrial Environments: Where Do We Stand and Where Are We Headed? Eneko Osaba et.al. 2505.00891 null
2025-05-01 Platoon Coordination and Leader Selection in Mixed Transportation Systems via Dynamic Programming Ying Wang et.al. 2505.00847 null
2025-04-24 Optimal Blackjack Betting Strategies Through Dynamic Programming and Expected Utility Theory Lucas Bordeu et.al. 2505.00724 null
2025-04-30 Galvatron: An Automatic Distributed System for Efficient Foundation Model Training Xinyi Liu et.al. 2504.21411 link
2025-04-29 DeeP-Mod: Deep Dynamic Programming based Environment Modelling using Feature Extraction Chris Child et.al. 2504.20535 null
2025-04-28 Warm-Starting QAOA with XY Mixers: A Novel Approach for Quantum-Enhanced Vehicle Routing Optimization Rafael S. do Carmo et.al. 2504.19934 null
2025-04-30 The frequency $K_i$s for symmetrical traveling salesman problem Yong Wang et.al. 2504.19608 null
2025-04-28 Symmetric Policy Design for Multi-Agent Dispatch Coordination in Supply Chains Sagar Sudhakara et.al. 2504.19397 null
2025-04-24 Efficient Tree Generation for Globally Optimal Decisions under Probabilistic Outcomes Berk Ozturk et.al. 2504.17983 null
2025-04-24 Ergodic control of McKean-Vlasov systems on the Wasserstein space Marco Fuhrman et.al. 2504.17958 null
2025-04-24 Fréchet Distance in Unweighted Planar Graphs Ivor van der Hoog et.al. 2504.17342 null
2025-04-24 Advancing Frontiers of Path Integral Theory for Stochastic Optimal Control Apurva Patil et.al. 2504.17154 null
2025-04-22 Distributed model predictive control without terminal cost under inexact distributed optimization Xiaoyu Liu et.al. 2504.15768 null
2025-04-22 Stochastic Programming for Dynamic Temperature Control of Refrigerated Road Transport Francesco Giliberto et.al. 2504.15741 null
2025-04-22 Exploring Inevitable Waypoints for Unsolvability Explanation in Hybrid Planning Problems Mir Md Sajid Sarwar et.al. 2504.15668 null
2025-04-24 A Quadratic Control Framework for Dynamic Systems Igor Ladnik et.al. 2504.15396 null
2025-04-21 The Iterative Chainlet Partitioning Algorithm for the Traveling Salesman Problem with Drone and Neural Acceleration Jae Hyeok Lee et.al. 2504.15147 null
2025-04-23 Feedback Stackelberg-Nash equilibria in difference games with quasi-hierarchical interactions and inequality constraints Partha Sarathi Mohapatra et.al. 2504.15019 null
2025-04-19 Optimal Operation and Valuation of Electricity Storages Jean-Philippe Chancelier et.al. 2504.14292 null
2025-04-18 Code generation for solving and differentiating through convex optimization problems Maximilian Schaller et.al. 2504.14099 null
2025-04-16 Beyond ISAC: Toward Integrated Heterogeneous Service Provisioning via Elastic Multi-Dimensional Multiple Access Jie Chen et.al. 2504.11692 null
2025-04-18 Traffic Adaptive Moving-window Service Patrolling for Real-time Incident Management during High-impact Events Haozhe Lei et.al. 2504.11570 null
2025-04-15 TransitReID: Transit OD Data Collection with Occlusion-Resistant Dynamic Passenger Re-Identification Kaicong Huang et.al. 2504.11500 null
2025-04-15 Integration of a high-fidelity model of quantum sensors with a map-matching filter for quantum-enhanced navigation Samuel Lellouch et.al. 2504.11119 null
2025-04-22 Breaking the Dimensional Barrier: A Pontryagin-Guided Direct Policy Optimization for Continuous-Time Multi-Asset Portfolio Jeonggyu Huh et.al. 2504.11116 null
2025-04-15 Hallucination-Aware Generative Pretrained Transformer for Cooperative Aerial Mobility Control Hyojun Ahn et.al. 2504.10831 null
2025-04-11 A Nonlinear Hash-based Optimization Method for SpMV on GPUs Chen Yan et.al. 2504.08860 null
2025-04-07 A Constraint Programming Model For Serial Batch Scheduling With Minimum Batch Size Jorge A. Huertas et.al. 2504.08793 null
2025-04-05 SLOs-Serve: Optimized Serving of Multi-SLO LLMs Siyuan Chen et.al. 2504.08784 null
2025-04-11 Interior Point Differential Dynamic Programming, Redux Ming Xu et.al. 2504.08278 link
2025-04-10 Quantum-assured magnetic navigation achieves positioning accuracy better than a strategic-grade INS in airborne and ground-based field trials Murat Muradoglu et.al. 2504.08167 null
2025-04-10 Low-Thrust Many-Revolution Transfer between Near Rectilinear Halo Orbit and Low Lunar Orbit Using Hybrid Differential Dynamic Programming Kohei Oue et.al. 2504.07723 null
2025-04-10 Joint Travel Route Optimization Framework for Platooning Akif Adas et.al. 2504.07623 null
2025-04-09 Rounding the Lovász Theta Function with a Value Function Approximation Rui Gong et.al. 2504.07204 null
2025-04-09 Evaluating Retrieval Augmented Generative Models for Document Queries in Transportation Safety Chad Melton et.al. 2504.07022 null
2025-04-17 Maximizing Battery Storage Profits via High-Frequency Intraday Trading David Schaurecker et.al. 2504.06932 null
2025-04-08 Linear-space LCS enumeration with quadratic-time delay for two strings Yoshifumi Sakai et.al. 2504.05742 null
2025-04-09 DDT: Decoupled Diffusion Transformer Shuai Wang et.al. 2504.05741 null
2025-04-08 Hamilton-Jacobi-Bellman equation and Viscosity solutions for an optimal control problem for stochastic convective Brinkman-Forchheimer equations Sagar Gautam et.al. 2504.05707 null
2025-04-06 Optimized Path Planning for Logistics Robots Using Ant Colony Algorithm under Multiple Constraints Haopeng Zhao et.al. 2504.05339 null
2025-04-07 Maximum Shortest Path Interdiction Problem by Upgrading Nodes on Trees under Unit Cost Qiao Zhang et.al. 2504.05190 null
2025-04-06 Memetic Search for Green Vehicle Routing Problem with Private Capacitated Refueling Stations Rui Xu et.al. 2504.04527 null
2025-04-05 Improving Question Embeddings with Cognitive Representation Optimization for Knowledge Tracing Lixiang Xu et.al. 2504.04121 null
2025-04-04 NeRFlex: Resource-aware Real-time High-quality Rendering of Complex Scenes on Mobile Devices Zhe Wang et.al. 2504.03415 null
2025-04-04 Block Toeplitz Sparse Precision Matrix Estimation for Large-Scale Interval-Valued Time Series Forecasting Wan Tian et.al. 2504.03322 null
2025-04-04 Quantum Optimization-Based Route Compression for Efficient Navigation Systems Shunsuke Sotobayashi et.al. 2504.03227 null
2025-04-11 Dynamic Treewidth in Logarithmic Time Tuukka Korhonen et.al. 2504.02790 null
2025-04-04 Controlled Social Learning: Altruism vs. Bias Raghu Arghal et.al. 2504.02648 null
2025-04-03 Reinforcement Learning for Solving the Pricing Problem in Column Generation: Applications to Vehicle Routing Abdo Abouelrous et.al. 2504.02383 null
2025-04-03 AI-Driven Framework for Multi-Service Multi-Modal Devices in NextG ORAN Systems Mrityunjoy Gain et.al. 2504.01730 null
2025-04-01 A Parametric Model for Near-Optimal Online Synthesis with Robust Reach-Avoid Guarantees Mario Gleirscher et.al. 2504.01006 null
2025-04-01 Linear models of dynamic optimization with linear constraints Somdeb Lahiri et.al. 2504.00630 null
2025-03-31 QUADRO: A Hybrid Quantum Optimization Framework for Drone Delivery James B. Holliday et.al. 2503.24301 null
2025-04-02 Unraveling tensor structures in correct-by-design controller synthesis Ruohan Wang et.al. 2503.24085 null
2025-03-31 Bi-Level Route Optimization and Path Planning with Hazard Exploration Jimin Choi et.al. 2503.24044 null
2025-03-31 Tree-Guided $L_1$-Convex Clustering Bingyuan Zhang et.al. 2503.24012 link
2025-03-30 A Systematic Decade Review of Trip Route Planning with Travel Time Estimation based on User Preferences and Behavior Nikil Jayasuriya et.al. 2503.23486 null
2025-03-29 A convergence technique for the game i-Mark Gabriel Nivasch et.al. 2503.23196 null
2025-03-29 PartialLoading: User Scheduling and Bandwidth Allocation for Parameter-sharing Edge Inference Guanqiao Qu et.al. 2503.22982 null
2025-03-28 Policy Optimization and Multi-agent Reinforcement Learning for Mean-variance Team Stochastic Games Junkai Hu et.al. 2503.22779 null
2025-04-04 The Price of Simplicity: Analyzing Decoupled Policies for Multi-Location Inventory Control Yohan John et.al. 2503.22639 null
2025-03-28 Scheduling problem of aircrafts on a same runway and dual runways Peng Lin et.al. 2503.22124 null
2025-03-27 Optimal Stepsize for Diffusion Sampling Jianning Pei et.al. 2503.21774 link
2025-03-26 A Hopf-Lax Type Formula for Multi-Agent Path Planning with Pattern Coordination Christian Parkinson et.al. 2503.20974 link
2025-03-26 Infinite Time Horizon Optimal Control of McKean-Vlasov SDEs Silvia Rudà et.al. 2503.20572 null
2025-03-26 Optimal reinsurance in a competitive market Lea Enzi et.al. 2503.20555 null
2025-03-26 Beyond Worst-Case Subset Sum: An Adaptive, Structure-Aware Solver with Sub-$2^{n/2}$ Enumeration Jesus Salas et.al. 2503.20162 null
2025-03-31 Graph neural networks extrapolate out-of-distribution for shortest paths Robert R. Nerem et.al. 2503.19173 null
2025-03-29 An Efficient Frequency-Based Approach for Maximal Square Detection in Binary Matrices Swastik Bhandari et.al. 2503.18974 null
2025-03-23 Agent-Based Models for Two Stocks with Superhedging Dario Crisci et.al. 2503.18165 null
2025-03-21 A New Segment Routing method with Swap Node Selection Strategy Based on Deep Reinforcement Learning for Software Defined Network Miao Ye et.al. 2503.16914 null
2025-03-20 Reinforcement Learning-based Heuristics to Guide Domain-Independent Dynamic Programming Minori Narita et.al. 2503.16371 link
2025-03-19 On the Functoriality of Belief Propagation Algorithms on finite Partially Ordered Sets Grégoire Sergeant-Perthuis et.al. 2503.15705 null
2025-03-24 Distribution and Purification of Entanglement States in Quantum Networks Xiaojie Fan et.al. 2503.14712 null
2025-03-18 Designing and Deploying AI Models for Sustainable Logistics Optimization: A Case Study on Eco-Efficient Supply Chains in the USA Reza E Rabbi Shawon et.al. 2503.14556 null
2025-03-17 Local-Global Learning of Interpretable Control Policies: The Interface between MPC and Reinforcement Learning Thomas Banker et.al. 2503.13289 null
2025-03-17 Exploring 3D Activity Reasoning and Planning: From Implicit Human Intentions to Route-Aware Planning Xueying Jiang et.al. 2503.12974 null
2025-03-17 Navigating Heat Exposure: Simulation of Route Planning Based on Visual Language Model Agents Haoran Ma et.al. 2503.12731 null
2025-03-16 Routing Guidance for Emerging Transportation Systems with Improved Dynamic Trip Equity Ting Bai et.al. 2503.12601 null
2025-03-14 Discrete Effort Distribution via Regrettable Greedy Algorithm Song Cao et.al. 2503.11107 null
2025-03-13 Dynamic Programming Algorithms for Finding Cost-Optimal Trajectory on the Terrain Majid E. Abbasov et.al. 2503.10922 null
2025-03-13 Enhanced Route Planning with Calibrated Uncertainty Set Lingxuan Tang et.al. 2503.10088 null
2025-03-12 PairVDN - Pair-wise Decomposed Value Functions Zak Buzzard et.al. 2503.09521 link
2025-03-11 Large Neighborhood Search and Bitmask Dynamic Programming for Wireless Mobile Charging Electric Vehicle Routing Problems in Medical Transportation Jingyi Zhao et.al. 2503.08752 null
2025-03-11 DISTINGUISH Workflow: A New Paradigm of Dynamic Well Placement Using Generative Machine Learning Sergey Alyaev et.al. 2503.08509 link
2025-03-10 Multi-Objective Routing Optimization Using Coherent Ising Machine in Wireless Multihop Networks Yu-Xuan Lin et.al. 2503.07924 null
2025-03-10 Co-Optimizing Distributed Energy Resources under Demand Charges and Bi-Directional Power Flow Ruixiao Yang et.al. 2503.07907 null
2025-03-10 Operational route planning under uncertainty for Demand Adaptive Systems Benedikt Lienkamp et.al. 2503.07812 link
2025-03-09 Pull-Based Query Scheduling for Goal-Oriented Semantic Communication Pouya Agheli et.al. 2503.06725 null
2025-03-08 A Neural Score Follower for Computer Accompaniment of Polyphonic Musical Instruments Ashwin Pillay et.al. 2503.06348 null
2025-03-11 Optimal Output Feedback Learning Control for Discrete-Time Linear Quadratic Regulation Kedi Xie et.al. 2503.06226 null
2025-03-08 Dynamic Programming in Ordered Vector Space Nisha Peng et.al. 2503.06055 null
2025-03-04 Establishment and Solution of a Multi-Stage Decision Model Based on Hypothesis Testing and Dynamic Programming Algorithm Ziyang Liu et.al. 2503.05807 null
2025-03-07 On Almost Fair and Equitable Allocations of Indivisible Items for Non-monotone Valuations Vittorio Bilò et.al. 2503.05695 null
2025-03-06 Efficient Algorithms for Verifying Kruskal Rank in Sparse Linear Regression and Related Applications Fengqin Zhou et.al. 2503.04986 null
2025-03-06 Mean field optimal stopping with uncontrolled state Andrea Cosso et.al. 2503.04269 null
2025-03-05 Endpoint-Explicit Differential Dynamic Programming via Exact Resolution Maria Parilli et.al. 2503.03897 null
2025-03-05 Composite Nonlinear Trajectory Tracking Control of Co-Driving Vehicles Using Self-Triggered Adaptive Dynamic Programming Chuan Hu et.al. 2503.03348 null
2025-03-04 Optimal power procurement for green cellular wireless networks under uncertainty and chance constraints Nadhir Ben Rached et.al. 2503.03051 null
2025-03-04 On the optimal stopping problem for diffusions and an approximation result for stopping times Andrea Cosso et.al. 2503.02514 null
2025-03-04 JPDS-NN: Reinforcement Learning-Based Dynamic Task Allocation for Agricultural Vehicle Routing Optimization Yixuan Fan et.al. 2503.02369 null
2025-03-04 Optimal Control for Remote Patient Monitoring with Multidimensional Health States Siddharth Chandak et.al. 2503.02292 null
2025-03-03 CorrA: Leveraging Large Language Models for Dynamic Obstacle Avoidance of Autonomous Vehicles Shanting Wang et.al. 2503.02076 null
2025-03-03 Mapping Spiking Neural Networks to Heterogeneous Crossbar Architectures using Integer Linear Programming Devin Pohl et.al. 2503.02033 null
2025-02-25 Tracking Control of Euler-Lagrangian Systems with Prescribed State, Input, and Temporal Constraints Chidre Shravista Kashyap et.al. 2503.01866 null
2025-03-03 CacheQuant: Comprehensively Accelerated Diffusion Models Xuewen Liu et.al. 2503.01323 null
2025-03-03 Parameter-free Video Segmentation for Vision and Language Understanding Louis Mahon et.al. 2503.01201 null
2025-03-02 Efficient End-to-end Visual Localization for Autonomous Driving with Decoupled BEV Neural Matching Jinyu Miao et.al. 2503.00862 null
2025-03-07 Llamarine: Open-source Maritime Industry-specific Large Language Model William Nguyen et.al. 2503.00203 null
2025-02-28 Time-optimal problem in the space of probabilities measures Yurii Averboukh et.al. 2502.20871 null
2025-02-27 Dynamic Program Slices Change How Developers Diagnose Gradual Run-Time Type Errors Felipe Bañados Schwerter et.al. 2502.20533 null
2025-02-27 Efficient Risk-sensitive Planning via Entropic Risk Measures Alexandre Marthe et.al. 2502.20423 null
2025-02-27 Pontryagin-Bellman Differential Dynamic Programming for Low-Thrust Trajectory Optimization with Path Constraints Yanis Sidhoum et.al. 2502.20291 null
2025-02-27 SSD: A State-based Stealthy Backdoor Attack For Navigation System in UAV Route Planning Zhaoxuan Wang et.al. 2502.20178 null
2025-02-27 GraphSparseNet: a Novel Method for Large Scale Traffic Flow Prediction Weiyang Kong et.al. 2502.19823 null
2025-03-04 Off-Policy Temporal Difference Learning for Perturbed Markov Decision Processes: Theoretical Insights and Extensive Simulations Ali Forootani et.al. 2502.18415 null
2025-02-25 Dynamic Factor Model-Based Multiperiod Mean-Variance Portfolio Selection with Portfolio Constraints Jianjun Gao et.al. 2502.17915 link
2025-02-24 A Deterministic and Linear Model of Dynamic Optimization Somdeb Lahiri et.al. 2502.17012 null
2025-02-24 Be CIM or Be Memory: A Dual-mode-aware DNN Compiler for CIM Accelerators Shixin Zhao et.al. 2502.17006 null
2025-02-23 Volume Optimality in Conformal Prediction with Structured Prediction Sets Chao Gao et.al. 2502.16658 null
2025-02-21 Near Optimal Decision Trees in a SPLIT Second Varun Babbar et.al. 2502.15988 null
2025-02-21 Zweistein: A Dynamic Programming Evaluation Function for Einstein Würfelt Nicht! Wei Lin. Hsueh et.al. 2502.15547 null
2025-02-21 Learning Maritime Inventory Routing Optimization Rui Chen et.al. 2502.15244 null
2025-02-19 Optimistically Optimistic Exploration for Provably Efficient Infinite-Horizon Reinforcement and Imitation Learning Antoine Moulin et.al. 2502.13900 null
2025-02-19 FPT algorithms over linear delta-matroids with applications Eduard Eiben et.al. 2502.13654 null
2025-03-01 Value Gradient Sampler: Sampling as Sequential Decision Making Sangwoong Yoon et.al. 2502.13280 link
2025-02-18 Autonomous Vehicles Using Multi-Agent Reinforcement Learning for Routing Decisions Can Harm Urban Traffic Anastasia Psarou et.al. 2502.13188 null
2025-02-18 GPU Memory Usage Optimization for Backward Propagation in Deep Network Training Ding-Yong Hong et.al. 2502.12499 null
2025-02-17 Logarithmic Approximation for Road Pricing on Grids Andrei Constantinescu et.al. 2502.11979 null
2025-02-17 Proactive Depot Discovery: A Generative Framework for Flexible Location-Routing Site Qu et.al. 2502.11715 null
2025-02-16 The Q-Spellbook: Crafting Surface Code Layouts and Magic State Protocols for Large-Scale Quantum Computing Avimita Chatterjee et.al. 2502.11253 null
2025-02-14 Customizable Contraction Hierarchies -- A Survey Thomas Bläsius et.al. 2502.10519 null
2025-02-14 Scheduling Strategies for Partially-Replicable Task Chains on Two Types of Resources Diane Orhan et.al. 2502.10000 null
2025-02-14 Thompson Sampling for Repeated Newsvendor Weizhou Zhang et.al. 2502.09900 null
2025-02-26 A quantum speedup algorithm for TSP based on quantum dynamic programming with very few qubits Bai Xujun et.al. 2502.08853 null
2025-02-12 Self-Evaluation for Job-Shop Scheduling Imanol Echeverria et.al. 2502.08684 null
2025-02-11 TRAVEL: Training-Free Retrieval and Alignment for Vision-and-Language Navigation Navid Rajabi et.al. 2502.07306 null
2025-02-05 RLOMM: An Efficient and Robust Online Map Matching Framework with Reinforcement Learning Minxiao Chen et.al. 2502.06825 null
2025-02-08 Counting Tree-Like Multigraphs with a Given Number of Vertices and Multiple Edges Muhammad Ilyas et.al. 2502.05529 null
2025-02-06 Aligner-Encoders: Self-Attention Transformers Can Be Self-Transducers Adam Stooke et.al. 2502.05232 null
2025-02-07 Stochastic internal habit formation and optimality Michele Aleandri et.al. 2502.05081 null
2025-02-07 Preference-aware compensation policies for crowdsourced on-demand services Georgina Nouli et.al. 2502.05060 null
2025-02-07 A non-zero-sum game with reinforcement learning under mean-variance framework Junyi Guo et.al. 2502.04788 null
2025-02-06 Unifying and Optimizing Data Values for Selection via Sequential-Decision-Making Hongliang Chi et.al. 2502.04554 null
2025-02-06 Solvability of Approximate Reach-Avoid Games Mario Gleirscher et.al. 2502.04544 null
2025-02-06 On the Number of Control Nodes in Boolean Networks with Degree Constraints Liangjie Sun et.al. 2502.03839 null
2025-02-06 Iterate to Accelerate: A Unified Framework for Iterative Reasoning and Feedback Convergence Jacob Fein-Ashley et.al. 2502.03787 null
2025-02-06 Cascaded Learned Bloom Filter for Optimal Model-Filter Size Balance and Fast Rejection Atsuki Sato et.al. 2502.03696 null
2025-02-06 Improving polynomial bounds for the Graphical Traveling Salesman Problem with release dates on paths Thailsson Clementino et.al. 2502.02680 null
2025-02-04 Optimal Routing in the Presence of Hooks: Three Case Studies Tarun Chitra et.al. 2502.02059 link
2025-02-03 Trajectory Map-Matching in Urban Road Networks Based on RSS Measurements Zheng Xing et.al. 2502.01280 null
2025-02-08 Minimum Riesz s-Energy Subset Selection in Ordered Point Sets via Dynamic Programming Michael Emmerich et.al. 2502.01163 null
2025-02-01 Model-Free Predictive Control: Introductory Algebraic Calculations, and a Comparison with HEOL and ANNs Cédric Join et.al. 2502.00443 null
2025-02-01 A polynomial-based constrained solver for fuel-optimal low-thrust trajectory optimization Thomas Caleb et.al. 2502.00398 null
2025-02-01 Left-Deep Join Order Selection with Higher-Order Unconstrained Binary Optimization on Quantum Computers Valter Uotila et.al. 2502.00362 null
2025-01-31 Epi-Consistent Approximation of Stochastic Dynamic Programs Dominic S. T. Keehan et.al. 2501.19028 null
2025-01-30 Model-Adaptive Approach to Dynamic Discrete Choice Models with Large State Spaces Ertian Chen et.al. 2501.18746 null
2025-02-05 Solving Drone Routing Problems with Quantum Computing: A Hybrid Approach Combining Quantum Annealing and Gate-Based Paradigms Eneko Osaba et.al. 2501.18432 null
2025-01-29 Stochastic scattering control of spider diffusion governed by an optimal diffraction probability measure selected from its own local-time Isaac Ohavi et.al. 2501.18057 null
2025-01-15 Low-Thrust Many-Revolution Trajectory Design Under Operational Uncertainties for DESTINY+ Mission Naoya Ozaki et.al. 2501.17867 null
2025-02-06 On characterizing optimal learning trajectories in a class of learning problems Getachew K Befekadu et.al. 2501.16521 null
2025-01-22 Modified Patankar Semi-Lagrangian Scheme for the Optimal Control of Production-Destruction systems Simone Cacace et.al. 2501.13085 null
2025-01-22 Optimizing Return Distributions with Distributional Dynamic Programming Bernardo Ávila Pires et.al. 2501.13028 null
2025-01-30 Pontryagin-Guided Deep Learning for Large-Scale Constrained Dynamic Portfolio Choice Jeonggyu Huh et.al. 2501.12600 null
2025-01-23 Treefix: Enabling Execution with a Tree of Prefixes Beatriz Souza et.al. 2501.12339 null
2025-01-21 A Dynamic Programming Framework for Generating Approximately Diverse and Optimal Solutions Waldo Gálvez et.al. 2501.12261 null
2025-01-21 Dissecting the NVIDIA Hopper Architecture through Microbenchmarking and Multiple Level Analysis Weile Luo et.al. 2501.12084 null
2025-01-20 Routing Optimization Based on Distributed Intelligent Network Softwarization for the Internet of Things Mohamed Ali Zormati et.al. 2501.11484 null
2025-02-01 OpenLiDARMap: Zero-Drift Point Cloud Mapping using Map Priors Dominik Kulmer et.al. 2501.11111 link
2025-01-25 BOOST: Microgrid Sizing using Ordinal Optimization Mohamad Fares El Hajj Chehade et.al. 2501.10842 null
2025-01-17 Multiclass Queue Scheduling Under Slowdown: An Approximate Dynamic Programming Approach Jing Dong et.al. 2501.10523 null
2025-01-17 Complexity of the Virtual Network Embedding with uniform demands Amal Benhamiche et.al. 2501.10154 null
2025-01-16 A Dynamic Unmanned Aerial Vehicle Routing Framework for Urban Traffic Monitoring Yumeng Bai et.al. 2501.09249 null
2025-01-15 Stochastic Optimal Control of Prosumers in a District Heating System Maalvladédon Ganet Somé et.al. 2501.09088 null
2025-01-15 Family-wise Error Rate Control with E-values Will Hartog et.al. 2501.09015 null
2025-01-31 Monte Carlo Tree Search for Comprehensive Exploration in LLM-Based Automatic Heuristic Design Zhi Zheng et.al. 2501.08603 link
2025-01-14 Cooperative Patrol Routing: Optimizing Urban Crime Surveillance through Multi-Agent Reinforcement Learning Juan Palma-Borda et.al. 2501.08020 link
2025-01-14 Optimal Classification Trees for Continuous Feature Data Using Dynamic Programming with Branch-and-Bound Catalin E. Brita et.al. 2501.07903 link
2025-01-09 A Multi-Layer CNN-GRUSKIP model based on transformer for spatial TEMPORAL traffic flow prediction Karimeh Ibrahim Mohammad Ata et.al. 2501.07593 null
2025-01-13 An Alternating Approach to Approximate Dynamic Programming Di Zhang et.al. 2501.06983 null
2025-01-11 A Linear Complexity Algorithm for Optimal Transport Problem with Log-type Cost Ziyuan Lyu et.al. 2501.06578 null
2025-01-10 Exploratory Randomization for Discrete-Time Linear Exponential Quadratic Gaussian (LEQG) Problem Sebastien Lleo et.al. 2501.06275 null
2025-01-09 Linear Algebraic Truncation Algorithm with A Posteriori Error Bounds for Computing Markov Chain Equilibrium Gradients Saied Mahdian et.al. 2501.06266 null
2025-01-09 ThriftLLM: On Cost-Effective Selection of Large Language Models for Classification Queries Keke Huang et.al. 2501.04901 null
2025-01-08 Semilinear Dynamic Programming: Analysis, Algorithms, and Certainty Equivalence Properties Yuchao Li et.al. 2501.04668 null
2025-01-08 HypeRL: Parameter-Informed Reinforcement Learning for Parametric PDEs Nicolò Botteghi et.al. 2501.04538 null
2025-01-08 Probabilistic Greedy Algorithm Solver Using Magnetic Tunneling Junctions for Traveling Salesman Problem Ran Zhang et.al. 2501.04447 null
2025-01-07 Exploring the Potential of Large Language Models in Public Transportation: San Antonio Case Study Ramya Jonnala et.al. 2501.03904 null
2025-01-07 Young domination on Hamming rectangles Janko Gravner et.al. 2501.03788 null
2025-01-06 Distributionally Robust Control Synthesis for Stochastic Systems with Safety and Reach-Avoid Specifications Yu Chen et.al. 2501.03137 null
2025-01-06 MDP3: A Training-free Approach for List-wise Frame Selection in Video-LLMs Hui Sun et.al. 2501.02885 null
2025-01-06 Local Reactive Control for Mobile Manipulators with Whole-Body Safety in Complex Environments Chunxin Zheng et.al. 2501.02815 null
2025-01-06 Enhancing Robot Route Optimization in Smart Logistics with Transformer and GNN Integration Hao Luo et.al. 2501.02749 null
2025-01-05 Approximate Dynamic Programming for a Remanufacture-to-Order System Amirreza Pashapour et.al. 2501.02656 null
2025-01-05 Neural Error Covariance Estimation for Precise LiDAR Localization Minoo Dolatabadi et.al. 2501.02558 null
2025-01-01 Large Language Models Are Read/Write Policy-Makers for Simultaneous Generation Shoutao Guo et.al. 2501.00868 link
2024-12-30 A randomisation method for mean-field control problems with common noise Robert Denkert et.al. 2412.20782 null
2024-12-28 RFPPO: Motion Dynamic RRT based Fluid Field - PPO for Dynamic TF/TA Routing Planning Rongkun Xue et.al. 2412.20098 null
2024-12-27 Game theoretical asymptotic mean value properties for non-homogeneous $p$-Laplace problems Félix del Teso et.al. 2412.19410 null
2024-12-24 Hybrid Many-Objective Optimization in Probabilistic Mission Design for Compliant and Effective UAV Routing Simon Kohaut et.al. 2412.18514 null
2024-12-23 AI-Driven Control of Chaos: A Transformer-Based Approach for Dynamical Systems David Valle et.al. 2412.17357 link
2024-12-21 A Bayesian Composite Risk Approach for Stochastic Optimal Control and Markov Decision Processes Wentao Ma et.al. 2412.16488 null
2024-12-20 Battery valuation on electricity intraday markets with liquidity costs Enzo Cognéville et.al. 2412.15959 null
2024-12-19 Robustness Evaluation of a Physical Internet-based Intermodal Logistic Network Federico Gallo et.al. 2412.14658 null
2024-12-17 A Scalable Method for Optimal Path Planning on Manifolds via a Hopf-Lax Type Formula Edward Huynh et.al. 2412.13346 link
2024-12-16 Using machine learning to inform harvest control rule design in complex fishery settings Felipe Montealegre-Mora et.al. 2412.12400 link
2024-12-12 SprayCraft: Graph-Based Route Optimization for Variable Rate Precision Spraying Kiran K. Kethineni et.al. 2412.12176 null
2024-12-16 Witty: An Efficient Solver for Computing Minimum-Size Decision Trees Luca Pascal Staus et.al. 2412.11954 null
2024-12-16 LLM-DaaS: LLM-driven Drone-as-a-Service Operations from Text User Requests Lillian Wassim et.al. 2412.11672 null
2024-12-14 An Active Parameter Learning Approach to The Identification of Safe Regions Aneesh Raghavan et.al. 2412.10627 null
2024-12-12 On Round-Off Errors and Gaussian Blur in Superresolution and in Image Registration Serap A. Savari et.al. 2412.09741 null
2024-12-20 MAPLE: A Framework for Active Preference Learning Guided by Large Language Models Saaduddin Mahmud et.al. 2412.07207 null
2024-12-09 Phaedrus: Exploring Dynamic Application Behavior with Lightweight Generative Models and Large-Language Models Bodhisatwa Chatterjee et.al. 2412.06994 null
2024-12-07 Timely reliable Bayesian decision-making enabled using memristors Lekai Song et.al. 2412.06838 null
2024-12-08 DiTer++: Diverse Terrain and Multi-modal Dataset for Multi-Robot SLAM in Multi-session Environments Juwon Kim et.al. 2412.05839 null
2024-12-08 SizeGS: Size-aware Compression of 3D Gaussians with Hierarchical Mixed Precision Quantization Shuzhao Xie et.al. 2412.05808 null
2024-12-07 Controlled rough SDEs, pathwise stochastic control and dynamic programming principles Peter K. Friz et.al. 2412.05698 null
2024-12-07 Quantum Annealing and Tensor Networks: a Powerful Combination to Solve Optimization Problems Miquel Albertí Binimelis et.al. 2412.05595 link
2024-12-07 Optimizing Returns from Experimentation Programs Timothy Sudijono et.al. 2412.05508 null
2024-12-06 Nonmyopic Global Optimisation via Approximate Dynamic Programming Filippo Airaldi et.al. 2412.04882 link
2024-12-05 Generating graph states with a single quantum emitter and the minimum number of fusions Matthias C. Löbl et.al. 2412.04587 null
2024-12-04 Summa Summarum: Moessner's Theorem without Dynamic Programming Olivier Danvy et.al. 2412.03127 null
2024-11-21 Quantum Annealing based Hybrid Strategies for Real Time Route Optimization Sushil Mario et.al. 2412.02720 null
2024-11-30 A Second Soul: Celebrating the Many Languages of Programming -- Festschrift in Honor of Peter Thiemann's Sixtieth Birthday Annette Bieniusa et.al. 2412.01856 null
2024-12-01 Optimization of Delivery Routes for Fresh E-commerce in Pre-warehouse Mode Alice Harward et.al. 2412.00634 null
2024-11-29 An Optimal Switching Approach for Bird Migration Jiawei Chu et.al. 2411.19467 null
2024-11-28 SPAgent: Adaptive Task Decomposition and Model Selection for General Video Generation and Editing Rong-Cheng Tu et.al. 2411.18983 null
2024-11-27 SCoTT: Wireless-Aware Path Planning with Vision Language Models and Strategic Chains-of-Thought Aladin Djuhera et.al. 2411.18212 null
2024-11-26 Structural Parameterization of Locating-Dominating Set and Test Cover Dipayan Chakraborty et.al. 2411.17948 null
2024-11-26 Pushing the Limits of Large Language Model Quantization via the Linearity Theorem Vladimir Malinovskii et.al. 2411.17525 null
2024-11-26 Weakly acyclic diagrams: A data structure for infinite-state symbolic verification Michael Blondin et.al. 2411.17250 null
2024-11-26 Dynamic Programming-Based Offline Redundancy Resolution of Redundant Manipulators Along Prescribed Paths with Real-Time Adjustment Zhihang Yin et.al. 2411.17052 null
2024-11-26 Dynamic Programming-Based Redundancy Resolution for Path Planning of Redundant Manipulators Considering Breakpoints Zhihang Yin et.al. 2411.17034 null
2024-11-26 Entropy-Based Dynamic Programming for Efficient Vehicle Parking Jean-Luc Lupien et.al. 2411.17014 null
2024-11-25 Any3DIS: Class-Agnostic 3D Instance Segmentation by 2D Mask Tracking Phuc Nguyen et.al. 2411.16183 null
2024-11-25 Using Drone Swarm to Stop Wildfire: A Predict-then-optimize Approach Shijie Pan et.al. 2411.16144 null
2024-11-24 Hiding Communication Cost in Distributed LLM Training via Micro-batch Co-execution Haiquan Wang et.al. 2411.15871 null
2024-11-24 Revenue Maximization in Choice-Based Matching Markets Dan Nissim et.al. 2411.15727 null
2024-11-22 Jovis: A Visualization Tool for PostgreSQL Query Optimizer Yoojin Choi et.al. 2411.14788 null
2024-11-22 Construction and Preliminary Validation of a Dynamic Programming Concept Inventory Matthew Ferland et.al. 2411.14655 null
2024-11-18 Controlled Occupied Processes and Viscosity Solutions H. Mete Soner et.al. 2411.12080 null
2024-11-18 A New Finite-Horizon Dynamic Programming Analysis of Nonanticipative Rate-Distortion Function for Markov Sources Zixuan He et.al. 2411.11698 null
2024-11-18 gpuPairHMM: High-speed Pair-HMM Forward Algorithm for DNA Variant Calling on GPUs Bertil Schmidt et.al. 2411.11547 link
2024-11-17 Dynamic Programming: Optimality at a Point Implies Optimality Everywhere John Stachurski et.al. 2411.11062 null
2024-11-15 AmoebaLLM: Constructing Any-Shape Large Language Models for Efficient and Instant Deployment Yonggan Fu et.al. 2411.10606 link
2024-11-14 Latency Optimization in LEO Satellite Communications with Hybrid Beam Pattern and Interference Control Qianqian Zhang et.al. 2411.09600 null
2024-11-13 On the numerical integration of the Fokker-Planck equation driven by a mechanical force and the Bismut-Elworthy-Li formula Julia Sanders et.al. 2411.08518 link
2024-11-13 Tractable Robust Markov Decision Processes Julien Grand-Clément et.al. 2411.08435 null
2024-11-12 dpvis: A Visual and Interactive Learning Tool for Dynamic Programming David H. Lee et.al. 2411.07705 link
2024-11-11 DP and QP Based Decision-making and Planning for Autonomous Vehicle Zhicheng Zhang et.al. 2411.06751 null
2024-11-11 Resilient control under denial-of-service and uncertainty: An adaptive dynamic programming approach Weinan Gao et.al. 2411.06689 null
2024-11-11 Two Kinds of Learning Algorithms for Continuous-Time VWAP Targeting Execution Xingyu Zhou et.al. 2411.06645 null
2024-11-10 Robust optimal stopping with regime switching Siyu Lv et.al. 2411.06522 null
2024-11-07 Optimal control under unknown intensity with Bayesian learning Nicolas Baradel et.al. 2411.04917 null
2024-11-07 Structure Matters: Dynamic Policy Gradient Sara Klein et.al. 2411.04913 null
2024-11-07 Minimax Linear Regulator Problems for Positive Systems Alba Gurpegui et.al. 2411.04809 null
2024-11-07 Optimal Execution under Incomplete Information Etienne Chevalier et.al. 2411.04616 null
2024-11-07 Convergence and Robustness of Value and Policy Iteration for the Linear Quadratic Regulator Bowen Song et.al. 2411.04548 link
2024-11-05 DP-HLS: A High-Level Synthesis Framework for Accelerating Dynamic Programming Algorithms in Bioinformatics Yingqi Cao et.al. 2411.03398 link
2024-11-04 Stochastic Optimal Control of an Industrial Power-to-Heat System with High-Temperature Heat Pump and Thermal Energy Storage Eric Pilling et.al. 2411.02211 null
2024-11-03 ParseCaps: An Interpretable Parsing Capsule Network for Medical Image Diagnosis Xinyu Geng et.al. 2411.01564 null
2024-10-31 EARL-BO: Reinforcement Learning for Multi-Step Lookahead, High-Dimensional Bayesian Optimization Mujin Cheon et.al. 2411.00171 null
2024-10-31 Q-learning for Quantile MDPs: A Decomposition, Performance, and Convergence Analysis Jia Lin Hau et.al. 2410.24128 link
2024-10-31 A dynamic programming principle for multiperiod control problems with bicausal constraints Ruslan Mirmominov et.al. 2410.23927 null
2024-10-30 Return Augmented Decision Transformer for Off-Dynamics Reinforcement Learning Ruhan Wang et.al. 2410.23450 null
2024-10-29 Approximately Counting Knapsack Solutions in Subquadratic Time Weiming Feng et.al. 2410.22267 null
2024-10-29 Beating Bellman's Algorithm for Subset Sum Karl Bringmann et.al. 2410.21942 null
2024-10-28 Analysis of Different Algorithmic Design Techniques for Seam Carving Owais Aijaz et.al. 2410.21207 null
2024-10-27 A New Method for Inserting Train Paths into a Timetable David Dekker et.al. 2410.20561 link
2024-10-27 On the I/O Complexity of the CYK Algorithm and of a Family of Related DP Algorithms Lorenzo De Stefani et.al. 2410.20337 null
2024-10-25 An Enhanced Hierarchical Planning Framework for Multi-Robot Autonomous Exploration Gengyuan Cai et.al. 2410.19373 null
2024-10-24 Stochastic dynamic programming under recursive Epstein-Zin preferences Anna Jaśkiewicz et.al. 2410.19181 null
2024-10-24 A Counterexample in Cross-Correlation Template Matching Serap A. Savari et.al. 2410.19085 null
2024-10-23 Trajectory Optimization for Spatial Microstructure Control in Electron Beam Metal Additive Manufacturing Mikhail Khrenov et.al. 2410.18207 null
2024-10-24 Estimating the Spectral Moments of the Kernel Integral Operator from Finite Sample Matrices Chanwoo Chun et.al. 2410.17998 null
2024-10-21 Policies with Sparse Inter-Agent Dependencies in Dynamic Games: A Dynamic Programming Approach Xinjie Liu et.al. 2410.16441 null
2024-10-21 All You Need is an Improving Column: Enhancing Column Generation for Parallel Machine Scheduling via Transformers Amira Hijazi et.al. 2410.15601 null
2024-10-21 How to Find the Exact Pareto Front for Multi-Objective MDPs? Yining Li et.al. 2410.15557 null
2024-10-20 CASET: Complexity Analysis using Simple Execution Traces for CS submissions* Aaryen Mehta et.al. 2410.15419 null
2024-10-19 The Constrained Layer Tree Problem and Applications to Solar Farm Cabling Thomas Bläsius et.al. 2410.15031 null
2024-10-18 On picking operations in e-commerce warehouses: Insights from the complete-information counterpart Catherine Lorenz et.al. 2410.14316 null
2024-10-17 Quasi-quantum states and the quasi-quantum PCP theorem Itai Arad et.al. 2410.13549 null
2024-10-17 Joint Antenna Selection and Covariance Matrix Optimization for ISAC Systems Michail Palaiologos et.al. 2410.13446 null
2024-10-17 Membership Testing for Semantic Regular Expressions Yifei Huang et.al. 2410.13262 null
2024-10-22 Research on Travel Route Planing Problems Based on Greedy Algorithm Yiquan Wang et.al. 2410.13226 link
2024-10-17 Algorithmic Content Selection and the Impact of User Disengagement Emilio Calvano et.al. 2410.13108 null
2024-10-16 Learning Representations for Reasoning: Generalizing Across Diverse Structures Zhaocheng Zhu et.al. 2410.13018 null
2024-10-16 Vehicle Localization in GPS-Denied Scenarios Using Arc-Length-Based Map Matching Nur Uddin Javed et.al. 2410.12208 null
2024-10-15 Incremental computation of the set of period sets Eric Rivals et.al. 2410.12077 null
2024-10-15 Routing and Scheduling Optimization for Urban Air Mobility Fleet Management using Quantum Annealing Renichiro Haba et.al. 2410.11231 null
2024-10-16 SplitLLM: Collaborative Inference of LLMs for Model Placement and Throughput Optimization Akrit Mudvari et.al. 2410.10759 null
2024-10-14 Learning Sub-Second Routing Optimization in Computer Networks requires Packet-Level Dynamics Andreas Boltres et.al. 2410.10377 null
2024-10-09 Rapid Computation of the Assembly Index of Molecular Graphs Ian Seet et.al. 2410.09100 null
2024-10-11 Deep Learning Algorithms for Mean Field Optimal Stopping in Finite Space and Discrete Time Lorenzo Magnino et.al. 2410.08850 null
2024-10-11 Hybrid Filtering Heuristic for the Sensor-Placement Problem to Discretize 2D Continuous Environments Jan Mikula et.al. 2410.08784 link
2024-10-10 Dynamic Programming based Local Search approaches for Multi-Agent Path Finding problems on Directed Graphs Irene Saccani et.al. 2410.07954 null
2024-10-10 Partitioning Trillion Edge Graphs on Edge Devices Adil Chhabra et.al. 2410.07732 null
2024-10-11 Q-WSL: Leveraging Dynamic Programming for Weighted Supervised Learning in Goal-conditioned RL Xing Lei et.al. 2410.06648 null
2024-10-08 Solvability of Equilibrium Riccati Equations: A Direct Approach Bowen Ma et.al. 2410.06090 null
2024-10-07 Dynamic HumTrans: Humming Transcription Using CNNs and Dynamic Programming Shubham Gupta et.al. 2410.05455 link
2024-10-07 A Predictive and Optimization Approach for Enhanced Urban Mobility Using Spatiotemporal Data Shambhavi Mishra et.al. 2410.05358 null
2024-10-05 AI as Humanity's Salieri: Quantifying Linguistic Creativity of Language Models via Systematic Attribution of Machine Text against Web Text Ximing Lu et.al. 2410.04265 link
2024-10-05 A branch-&-price approach to the unrooted maximum agreement forest problem Martin Frohn et.al. 2410.04122 null
2024-10-02 Electrification of Transportation: A Hybrid Benders/SDDP Algorithm for Optimal Charging Station Trading Farnaz Sohrabi et.al. 2410.03763 null
2024-10-02 Effects of eco-driving on energy consumption and battery degradation for electric vehicles at signalized intersections Yongqiang Wang et.al. 2410.01685 null
2024-10-02 Krylov-Safonov theory for Pucci-type extremal inequalities on random data clouds Ángel Arroyo et.al. 2410.01642 null
2024-10-02 Automated Curvy Waveguide Routing for Large-Scale Photonic Integrated Circuits Hongjian Zhou et.al. 2410.01260 link
2024-09-30 Generalised mixed effects models for changepoint analysis of biomedical time series data Mark B. Fiecas et.al. 2410.00183 null
2024-09-30 Opt2Skill: Imitating Dynamically-feasible Whole-Body Trajectories for Versatile Humanoid Loco-Manipulation Fukang Liu et.al. 2409.20514 null
2024-09-28 On Computing Elastic Shape Distances between Curves in d-dimensional Space Javier Bernal et.al. 2409.19380 null
2024-09-25 MaViLS, a Benchmark Dataset for Video-to-Slide Alignment, Assessing Baseline Accuracy with a Multimodal Alignment Algorithm Leveraging Speech, OCR, and Visual Features Katharina Anderer et.al. 2409.16765 link
2024-09-25 DeformStream: Deformation-based Adaptive Volumetric Video Streaming Boyan Li et.al. 2409.16615 null
2024-09-24 Partial Elastic Shape Registration of 3D Surfaces using Dynamic Programming Javier Bernal et.al. 2409.16462 null
2024-09-25 Efficient Nearest Neighbor Search Using Dynamic Programming Pengfei Wang et.al. 2409.15023 null
2024-09-22 Unsupervised Word Discovery: Boundary Detection with Clustering vs. Dynamic Programming Simon Malan et.al. 2409.14486 null
2024-09-24 Batch Predictive Inference Yonghoon Lee et.al. 2409.13990 link
2024-09-20 A Modified Algorithm for Optimal Picker Routing in a Single Block Warehouse George Dunn et.al. 2409.13219 null
2024-09-19 Program Slicing in the Era of Large Language Models Kimya Khakzad Shahandashti et.al. 2409.12369 null
2024-09-18 Differential dynamic programming with stagewise equality and inequality constraints using interior point method Siddharth Prabhu et.al. 2409.12048 link
2024-09-20 Second-Order Constrained Dynamic Optimization Yuichiro Aoyama et.al. 2409.11649 null
2024-09-18 Multi-stage stochastic linear programming for shared autonomous vehicle system operation and design with on-demand and pre-booked requests Riki Kawase et.al. 2409.11611 null
2024-09-17 Optimal Investment with Costly Expert Opinions Christoph Knochenhauer et.al. 2409.11569 null
2024-09-20 Exact Wavefront Propagation for Globally Optimal One-to-All Path Planning on 2D Cartesian Grids Ibrahim Ibrahim et.al. 2409.11545 link
2024-09-17 Neural Networks for Vehicle Routing Problem László Kovács et.al. 2409.11290 null
2024-09-17 Selective algorithm processing of subset sum distributions Nick Dawes et.al. 2409.11076 null
2024-09-17 Local discontinuous Galerkin method for nonlinear BSPDEs of Neumann boundary conditions with deep backward dynamic programming time-marching Yixiang Dai et.al. 2409.11004 null
2024-09-17 Relationship between stochastic maximum principle and dynamic programming principle under convex expectation Xiaojuan Li et.al. 2409.10987 null
2024-09-16 Direct Data-Driven Discounted Infinite Horizon Linear Quadratic Regulator with Robustness Guarantees Ramin Esmzad et.al. 2409.10703 null
2024-09-20 Motion Forecasting via Model-Based Risk Minimization Aron Distelzweig et.al. 2409.10585 null
2024-09-16 Estimates for Optimal Multistage Group Partition Testing Guojiang Shao et.al. 2409.10410 null
2024-09-16 Pareto Sums of Pareto Sets: Lower Bounds and Algorithms Daniel Funke et.al. 2409.10232 null
2024-09-12 Q-value Regularized Decision ConvFormer for Offline Reinforcement Learning Teng Yan et.al. 2409.08062 null
2024-09-12 Super Monotonic Alignment Search Junhyeok Lee et.al. 2409.07704 link
2024-09-10 Design of Threshold-Constrained Indirect Quantizers Ariel Doubchak et.al. 2409.06839 null
2024-09-10 Cooptimizing Safety and Performance with a Control-Constrained Formulation Hao Wang et.al. 2409.06696 link
2024-09-12 Valuation Model of Chinese Convertible Bonds Based on Monte Carlo Simulation Yu Liu et.al. 2409.06496 null
2024-09-09 OTFS-MDMA: An Elastic Multi-Domain Resource Utilization Mechanism for High Mobility Scenarios Jie Chen et.al. 2409.05724 null
2024-09-09 Enhancing Empathic Accuracy: Penalized Functional Alignment Method to Correct Misalignment in Emotional Perception Linh H Nghiem et.al. 2409.05343 null
2024-09-08 Cooperative Learning-Based Framework for VNF Caching and Placement Optimization over Low Earth Orbit Satellite Networks Khai Doan et.al. 2409.05025 null
2024-09-08 Fast Deep Predictive Coding Networks for Videos Feature Extraction without Labels Wenqian Xue et.al. 2409.04945 null
2024-09-17 Second-Order Stein Variational Dynamic Optimization Yuichiro Aoyama et.al. 2409.04644 null
2024-09-06 Refined Bounds on Near Optimality Finite Window Policies in POMDPs and Their Reinforcement Learning Yunus Emre Demirci et.al. 2409.04351 null
2024-09-05 Space-Efficient Algorithm for Integer Programming with Few Constraints Lars Rohwedder et.al. 2409.03681 null
2024-09-05 Fine-Grained Equivalence for Problems Related to Integer Linear Programming Lars Rohwedder et.al. 2409.03675 null
2024-09-06 Revenue Management with Calendar-Aware and Dependent Demands: Asymptotically Tight Fluid Approximations Weiyuan Li et.al. 2409.02637 null
2024-09-03 FuzzCoder: Byte-level Fuzzing Test via Large Language Model Liqun Yang et.al. 2409.01944 link
2024-09-03 Quantum Algorithms for One-Sided Crossing Minimization Susanna Caroppo et.al. 2409.01942 null
2024-09-02 Solving Integrated Process Planning and Scheduling Problem via Graph Neural Network Based Deep Reinforcement Learning Hongpei Li et.al. 2409.00968 link
2024-09-02 Multistage Robust Average Randomized Spectral Risk Optimization Qiong Wu et.al. 2409.00892 null
2024-09-01 An Optimized Binning and Probabilistic Slice Sharing Algorithm for Motion Correction in Abdominal DW-MRI Michelle Su et.al. 2409.00798 null
2024-09-01 Cooperative Path Planning with Asynchronous Multiagent Reinforcement Learning Jiaming Yin et.al. 2409.00754 null
2024-09-01 The landscape of deterministic and stochastic optimal control problems: One-shot Optimization versus Dynamic Programming Jihun Kim et.al. 2409.00655 null
2024-08-31 Foundations of Multivariate Distributional Reinforcement Learning Harley Wiltzer et.al. 2409.00328 null
2024-08-30 Approximation Algorithms for Anchored Multiwatchman Routes Joseph S. B. Mitchell et.al. 2408.17343 null
2024-08-30 Stationary Policies are Optimal in Risk-averse Total-reward MDPs with EVaR Xihong Su et.al. 2408.17286 link
2024-08-30 A Two-Timescale Decision-Hazard-Decision Formulation for Storage Usage Values Calculation Camila Martinez Parra et.al. 2408.17113 null
2024-08-29 Optimization Models for the Quadratic Traveling Salesperson Problem Yuxiao Chen et.al. 2408.16680 null
2024-08-27 On the parameterized complexity of computing good edge-labelings Davi de Andrade et.al. 2408.15181 null
2024-08-26 Achieving designed texture and flows in bulk active nematics using optimal control theory Saptorshi Ghosh et.al. 2408.14596 null
2024-08-25 Decentralized Stochastic Control in Standard Borel Spaces: Centralized MDP Reductions, Near Optimality of Finite Window Local Information, and Q-Learning Omar Mrani-Zentar et.al. 2408.13828 null
2024-08-23 The Ultimate Guide to Fine-Tuning LLMs from Basics to Breakthroughs: An Exhaustive Review of Technologies, Research, Best Practices, Applied Research Challenges and Opportunities Venkatesh Balavadhani Parthasarathy et.al. 2408.13296 null
2024-08-18 An Introduction to Cognidynamics Marco Gori et.al. 2408.13112 null
2024-08-20 Optimal Guarantees for Online Selection Over Time Sebastian Perez-Salazar et.al. 2408.11224 null
2024-08-20 Fault Tolerant Dynamic Task Assignment for UAV-based Search Teams Ali Nasir et.al. 2408.10564 null
2024-08-19 Efficient Exploration in Deep Reinforcement Learning: A Novel Bayesian Actor-Critic Algorithm Nikolai Rozanov et.al. 2408.10055 null
2024-08-19 Continuous-Time Dynamic Decision Making with Costly Information Christoph Knochenhauer et.al. 2408.09693 null
2024-08-19 Solving stochastic climate-economy models: A deep least-squares Monte Carlo approach Aleksandar Arandjelović et.al. 2408.09642 null
2024-08-18 Exploratory Optimal Stopping: A Singular Control Formulation Jodi Dianetti et.al. 2408.09335 null
2024-08-17 Optimal Strip Attitude Command of Earth Observation Satellite using Differential Dynamic Programming Seungyeop Han et.al. 2408.09244 null
2024-08-17 Twin Sorting Dynamic Programming Assisted User Association and Wireless Bandwidth Allocation for Hierarchical Federated Learning Rung-Hung Gau et.al. 2408.09076 null
2024-08-17 Atlas: Hierarchical Partitioning for Quantum Circuit Simulation on GPUs (Extended Version) Mingkuan Xu et.al. 2408.09055 null
2024-08-15 Optimal control problems with generalized mean-field dynamics and viscosity solution to Master Bellman equation Rainer Buckdahn et.al. 2408.08046 null
2024-08-14 Differentiating Policies for Non-Myopic Bayesian Optimization Darian Nwankwo et.al. 2408.07812 null
2024-08-11 Moderate Exponential-time Quantum Dynamic Programming Across the Subsets for Scheduling Problems Camille Grange et.al. 2408.05741 null
2024-08-10 Convergence Guarantee of Dynamic Programming for LTL Surrogate Reward Zetong Xuan et.al. 2408.05438 null
2024-08-09 MIDI-to-Tab: Guitar Tablature Inference via Masked Language Modeling Drew Edwards et.al. 2408.05024 null
2024-08-09 A Comprehensive System Architecture using Field Programmable Gate Arrays Technology, Dijkstra's Algorithm, and Edge Computing for Emergency Response in Smart Cities Mahamat Abdel Aziz Assoul et.al. 2408.04924 null
2024-08-08 Mathematical Programming For Adaptive Experiments Ethan Che et.al. 2408.04570 null
2024-08-08 Non-maximizing policies that fulfill multi-criterion aspirations in expectation Simon Dima et.al. 2408.04385 null
2024-08-08 Enhanced Traffic Flow Prediction with Multi-Segment Fusion Tensor Graph Convolutional Networks Wei Zhang et.al. 2408.04232 null
2024-08-06 A Course in Dynamic Optimization Bar Light et.al. 2408.03034 null
2024-08-05 Positive Dynamic Programming: A Critique Aaqib Peerzada et.al. 2408.02809 null
2024-08-05 Multi-level Traffic-Responsive Tilt Camera Surveillance through Predictive Correlated Online Learning Tao Li et.al. 2408.02208 null
2024-08-04 Non-local Hamilton-Jacobi-Bellman equations for the stochastic optimal control of path-dependent piecewise deterministic processes Elena Bandini et.al. 2408.02147 null
2024-08-03 Leveraging GNSS and Onboard Visual Data from Consumer Vehicles for Robust Road Network Estimation Balázs Opra et.al. 2408.01640 null
2024-08-02 Occasionally Observed Piecewise-deterministic Markov Processes Marissa Gee et.al. 2408.01335 null
2024-08-02 The Impact of Program Reduction on Automated Program Repair Linas Vidziunas et.al. 2408.01134 null
2024-08-11 Deep Learning Approach for Changepoint Detection: Penalty Parameter Optimization Tung L Nguyen et.al. 2408.00856 link
2024-07-31 Tractable and Provably Efficient Distributional Reinforcement Learning with General Value Function Approximation Taehyun Cho et.al. 2407.21260 null
2024-07-30 A Machine Learning Approach to Boost the Vehicle-2-Grid Scheduling Gabriele Agliardi et.al. 2407.20802 null
2024-07-30 Generalized replicator dynamics based on mean-field pairwise comparison dynamic Hidekazu Yoshioka et.al. 2407.20751 null
2024-08-10 A UAV-Enabled Time-Sensitive Data Collection Scheme for Grassland Monitoring Edge Networks Dongbin Jiao et.al. 2407.20585 null
2024-07-29 A Differential Dynamic Programming Framework for Inverse Reinforcement Learning Kun Cao et.al. 2407.19902 null
2024-07-27 Map-Matching Queries under Fréchet Distance on Low-Density Spanners Kevin Buchin et.al. 2407.19304 null
2024-07-26 RRO: A Regularized Routing Optimization Algorithm for Enhanced Throughput and Low Latency with Efficient Complexity David Zenati et.al. 2407.18683 null
2024-07-26 Mean-field control of non exchangeable systems Anna De Crescenzo et.al. 2407.18635 null
2024-08-01 Stochastic Games with Minimally Bounded Action Costs David Mguni et.al. 2407.18010 null
2024-07-25 Personalized and Context-aware Route Planning for Edge-assisted Vehicles Dinesh Cyril Selvaraj et.al. 2407.17980 null
2024-07-23 Data-Driven Optimal Feedback Laws via Kernel Mean Embeddings Petar Bevanda et.al. 2407.16407 null
2024-07-23 Data-driven Multistage Distributionally Robust Linear Optimization with Nested Distance Rui Gao et.al. 2407.16346 null
2024-07-22 Faster Optimal Coalition Structure Generation via Offline Coalition Selection and Graph-Based Search Redha Taguelmimt et.al. 2407.16092 null
2024-07-22 Scheduling on a Stochastic Number of Machines Moritz Buchem et.al. 2407.15737 null
2024-07-20 Interdiction of minimum spanning trees and other matroid bases Noah Weninger et.al. 2407.14906 link
2024-07-20 A Tale of Two Scales: Reconciling Horizontal and Vertical Scaling for Inference Serving Systems Kamran Razavi et.al. 2407.14843 null
2024-07-19 Dynamic Programming Techniques for Planar Orbital Transfer of Low Earth Orbit Satellites C. Ciancarelli et.al. 2407.14675 null
2024-07-19 Generalization Error Analysis of Deep Backward Dynamic Programming for Solving Nonlinear PDEs Du Ouyang et.al. 2407.14566 null
2024-07-19 On Policy Evaluation Algorithms in Distributional Reinforcement Learning Julian Gerstenberg et.al. 2407.14175 null
2024-07-18 Shaded Route Planning Using Active Segmentation and Identification of Satellite Images Longchao Da et.al. 2407.13689 null
2024-07-18 The Madness of Multiple Entries in March Madness Jeff Decary et.al. 2407.13438 null
2024-07-18 Double interdiction problem on trees on the sum of root-leaf distances by upgrading edges Xiao Li et.al. 2407.13391 null
2024-07-18 Deterministic Trajectory Optimization through Probabilistic Optimal Control Mohammad Mahmoudi Filabadi et.al. 2407.13316 null
2024-07-18 Integrated Hardware Architecture and Device Placement Search Irene Wang et.al. 2407.13143 link
2024-07-18 Multiobjective Vehicle Routing Optimization with Time Windows: A Hybrid Approach Using Deep Reinforcement Learning and NSGA-II Rixin Wu et.al. 2407.13113 null
2024-07-17 Dynamic Programming Principle and Hamilton-Jacobi-Bellman Equation for Optimal Control Problems with Uncertainty M. Soledad Aronna et.al. 2407.13045 null
2024-07-17 Estimating the Potential Impact of Combined Race and Ethnicity Reporting on Long-Term Earnings Statistics Kevin L. McKinney et.al. 2407.12775 null
2024-07-16 Enabling MCTS Explainability for Sequential Planning Through Computation Tree Logic Ziyan An et.al. 2407.10820 null
2024-07-14 Fine Grained Lower Bounds for Multidimensional Knapsack Ilan Doron-Arad et.al. 2407.10146 null
2024-07-12 Investigating the Interplay of Prioritized Replay and Generalization Parham Mohammad Panahi et.al. 2407.09702 null
2024-07-12 An efficient algorithm to compute the minimum free energy of interacting nucleic acid strands Ahmed Shalaby et.al. 2407.09676 null
2024-07-12 Hamilton-Jacobi Reachability in Reinforcement Learning: A Survey Milan Ganai et.al. 2407.09645 null
2024-07-12 Integer programs with nearly totally unimodular matrices: the cographic case Manuel Aprile et.al. 2407.09477 null
2024-07-12 A new approach to principal-agent problems with volatility control Alessandro Chiusolo et.al. 2407.09471 null
2024-07-12 CAACS: A Carbon Aware Ant Colony System Marina Lin et.al. 2407.09404 null
2024-07-12 Structure and Independence in Hyperbolic Uniform Disk Graphs Thomas Bläsius et.al. 2407.09362 null
2024-07-12 KUNPENG: An Embodied Large Model for Intelligent Maritime Naiyao Wang et.al. 2407.09048 link
2024-07-09 Trajectory Data Mining and Trip Travel Time Prediction on Specific Roads Muhammad Awais Amin et.al. 2407.07030 null
2024-07-08 Solving Multi-Model MDPs by Coordinate Ascent and Dynamic Programming Xihong Su et.al. 2407.06329 link
2024-07-08 Narrowing the Gap between Adversarial and Stochastic MDPs via Policy Optimization Daniil Tiapkin et.al. 2407.05704 null
2024-07-06 Advancing Algorithmic Approaches to Probabilistic Argumentation under the Constellation Approach Andrei Popescu et.al. 2407.05058 null
2024-07-05 Re-Tuning: Overcoming the Compositionality Limits of Large Language Models with Recursive Tuning Eric Pasewark et.al. 2407.04787 link
2024-07-05 GOALPlace: Begin with the End in Mind Anthony Agnesina et.al. 2407.04579 null
2024-07-04 Advanced Artificial Intelligence Strategy for Optimizing Urban Rail Network Design using Nature-Inspired Algorithms Hariram Sampath Kumar et.al. 2407.04087 null
2024-07-04 Multi-Time Scale Service Caching and Pricing in MEC Systems with Dynamic Program Popularity Yiming Chen et.al. 2407.03804 null
2024-07-03 Reconsidering utility: unveiling the limitations of synthetic mobility data generation algorithms in real-life scenarios Alexandra Kapp et.al. 2407.03237 null
2024-07-12 A Two-stage Identification Method for Switched Linear Systems Zheng Wenju et.al. 2407.02743 null
2024-07-02 DM3D: Distortion-Minimized Weight Pruning for Lossless 3D Object Detection Kaixin Xu et.al. 2407.02098 null
2024-06-28 Edge-DIRECT: A Deep Reinforcement Learning-based Method for Solving Heterogeneous Electric Vehicle Routing Problem with Time Window Constraints Arash Mozhdehi et.al. 2407.01615 null
2024-07-02 Contractual Reinforcement Learning: Pulling Arms with Invisible Hands Jibang Wu et.al. 2407.01458 null
2024-07-01 Exact statistical analysis for response-adaptive clinical trials: a general and computationally tractable approach Stef Baas et.al. 2407.01055 null
2024-06-30 Maximum Entropy Inverse Reinforcement Learning of Diffusion Models with Energy-Based Models Sangwoong Yoon et.al. 2407.00626 link
2024-06-30 Your Car Tells Me Where You Drove: A Novel Path Inference Attack via CAN Bus and OBD-II Data Tommaso Bianchi et.al. 2407.00585 null
2024-06-29 A Two-stage Reinforcement Learning-based Approach for Multi-entity Task Allocation Aicheng Gong et.al. 2407.00496 link
2024-06-29 Vector-valued robust stochastic control Igor Cialenco et.al. 2407.00266 null
2024-06-28 Leveraging Fixed-Parameter Tractability for Robot Inspection Planning Yosuke Mizutani et.al. 2407.00251 null
2024-06-28 Approximate Solutions for Multi-Trip Route Planning in Time-Sensitive Situations Bahar Cavdar et.al. 2407.00173 null
2024-06-28 Online Optimization of DNN Inference Network Utility in Collaborative Edge Computing Rui Li et.al. 2406.19613 null
2024-06-27 Efficient and Distributed Large-Scale 3D Map Registration using Tomographic Features Halil Utku Unlu et.al. 2406.19461 link
2024-06-27 Cuts in Graphs with Matroid Constraints Aritra Banik et.al. 2406.19134 null
2024-06-27 State and Input Constrained Output-Feedback Adaptive Optimal Control of Affine Nonlinear Systems Tochukwu Elijah Ogri et.al. 2406.18804 null
2024-06-26 Markov Decision Process and Approximate Dynamic Programming for a Patient Assignment Scheduling problem Malgorzata M. O'Reilly et.al. 2406.18618 null
2024-06-26 Tiered Service Architecture for Remote Patient Monitoring Siddharth Chandak et.al. 2406.18000 null
2024-06-25 Splitting Guarantees for Prophet Inequalities via Nonlinear Systems Johannes Brustle et.al. 2406.17767 null
2024-06-25 Using iterated local alignment to aggregate GPS trajectories into a traffic flow map Tarn Duong et.al. 2406.17500 null
2024-06-24 A multiplicative surface signature through its Magnus expansion Ilya Chevyrev et.al. 2406.16856 null
2024-06-24 Stochastic Path-Dependent Volatility Models for Price-Storage Dynamics in Natural Gas Markets and Discrete-Time Swing Option Pricing Jinniao Qiu et.al. 2406.16400 null
2024-06-21 Exact discovery is polynomial for sparse causal Bayesian networks Felix L. Rios et.al. 2406.15012 link
2024-06-19 A programmable wafer-scale chiroptical heterostructure of twisted aligned carbon nanotubes and phase change materials Jichao Fan et.al. 2406.13190 null
2024-06-14 Interpretable Cascading Mixture-of-Experts for Urban Traffic Congestion Prediction Wenzhao Jiang et.al. 2406.12923 null
2024-06-26 LayerMerge: Neural Network Depth Compression through Layer Pruning and Merging Jinuk Kim et.al. 2406.12837 link
2024-06-17 LibProf: A Python Profiler for Improving Cold Start Performance in Serverless Applications Syed Salauddin Mohammad Tariq et.al. 2406.11734 null
2024-06-17 Statistical Learning of Distributionally Robust Stochastic Control in Continuous State Spaces Shengbo Wang et.al. 2406.11281 null
2024-06-16 WeShap: Weak Supervision Source Evaluation with Shapley Values Naiqing Guan et.al. 2406.11010 null
2024-06-16 Solving Co-Path/Cycle Packing Faster than $3^k$ Yuxi Liu et.al. 2406.10829 null
2024-06-15 Scheduling two types of jobs with minimum makespan Song Cao et.al. 2406.10467 null
2024-06-14 CycleTrajectory: An End-to-End Pipeline for Enriching and Analyzing GPS Trajectories to Understand Cycling Behavior and Environment Meihui Wang et.al. 2406.10069 link
2024-06-13 Optimal Control of Agent-Based Dynamics under Deep Galerkin Feedback Laws Frederik Kelbel et.al. 2406.09141 link
2024-06-13 Coordinated Trading Strategies for Battery Storage in Reserve and Spot Markets Paul E. Seifert et.al. 2406.08390 null
2024-06-11 Flow Map Matching Nicholas M. Boffi et.al. 2406.07507 null
2024-06-11 Variational inequalities and smooth-fit principle for singular stochastic control problems in Hilbert spaces Salvatore Federico et.al. 2406.07242 null
2024-06-10 Stochastic Guidance of Buoyancy Controlled Vehicles under Ice Shelves using Ocean Currents Federico Rossi et.al. 2406.06724 null
2024-06-10 Leveraging Hyperscanning EEG and VR Omnidirectional Treadmill to Explore Inter-Brain Synchrony in Collaborative Spatial Navigation Chun-Hsiang Chuang et.al. 2406.06327 null
2024-06-09 Production and distribution planning, scheduling, and routing optimization in a yogurt supply chain under demand uncertainty: A case study Babak Javadi et.al. 2406.05803 null
2024-06-09 Heart Sound Segmentation Using Deep Learning Techniques Manas Madine et.al. 2406.05653 null
2024-06-11 Bisimulation Metrics are Optimal Transport Distances, and Can be Computed Efficiently Sergio Calo et.al. 2406.04056 null
2024-06-04 GrootVL: Tree Topology is All You Need in State Space Model Yicheng Xiao et.al. 2406.02395 link
2024-06-21 Branches: A Fast Dynamic Programming and Branch & Bound Algorithm for Optimal Decision Trees Ayman Chaouki et.al. 2406.02175 link
2024-06-03 An efficient solution to Hidden Markov Models on trees with coupled branches Farzan Vafa et.al. 2406.01663 null
2024-06-03 A New View on Planning in Online Reinforcement Learning Kevin Roice et.al. 2406.01562 null
2024-06-02 Dual Policy Reinforcement Learning for Real-time Rebalancing in Bike-sharing Systems Jiaqi Liang et.al. 2406.00868 null
2024-06-02 Computing Optimal Equilibria in Repeated Games with Restarts Ratip Emin Berker et.al. 2406.00851 null
2024-06-02 A Lazy Abstraction Algorithm for Markov Decision Processes: Theory and Initial Evaluation Dániel Szekeres et.al. 2406.00824 null
2024-06-10 Model Predictive Control and Reinforcement Learning: A Unified Framework Based on Dynamic Programming Dimitri P. Bertsekas et.al. 2406.00592 null
2024-06-01 Optimal Transmission Power Scheduling for Networked Control System under DoS Attack Siyi Wang et.al. 2406.00540 null
2024-06-01 A Single-Loop Robust Policy Gradient Method for Robust Markov Decision Processes Zhenwei Lin et.al. 2406.00274 link
2024-05-31 Finding Diverse Solutions Parameterized by Cliquewidth Karolina Drabik et.al. 2405.20931 null
2024-05-29 A numerical algorithm with linear complexity for Multi-marginal Optimal Transport with $L^1$ Cost Chunhui Chen et.al. 2405.19246 null
2024-05-28 A Pontryagin Perspective on Reinforcement Learning Onno Eberhard et.al. 2405.18100 null
2024-05-27 Q-value Regularized Transformer for Offline Reinforcement Learning Shengchao Hu et.al. 2405.17098 null
2024-05-25 A Bi-Objective Approach to Last-Mile Delivery Routing Considering Driver Preferences Juan Pablo Mesa et.al. 2405.16051 null
2024-06-03 Inference of Utilities and Time Preference in Sequential Decision-Making Haoyang Cao et.al. 2405.15975 null
2024-05-31 Stability and Performance Analysis of Model Predictive Control of Uncertain Linear Systems Changrui Liu et.al. 2405.15552 link
2024-05-24 An Approximate Dynamic Programming Framework for Occlusion-Robust Multi-Object Tracking Pratyusha Musunuru et.al. 2405.15137 null
2024-05-23 Two-Stage ML-Guided Decision Rules for Sequential Decision Making under Uncertainty Andrew Rosemberg et.al. 2405.14973 null
2024-05-23 A rolling horizon heuristic approach for a multi-stage stochastic waste collection problem Andrea Spinelli et.al. 2405.14499 link
2024-05-23 EdgeShard: Efficient LLM Inference via Collaborative Edge Computing Mingjin Zhang et.al. 2405.14371 null
2024-05-23 Optimal Whole Body Trajectory Planning for Mobile Manipulators in Planetary Exploration and Construction Federica Storiale et.al. 2405.14363 null
2024-05-23 Deterministic Policies for Constrained Reinforcement Learning in Polynomial-Time Jeremy McMahan et.al. 2405.14183 null
2024-05-22 Tackling Decision Processes with Non-Cumulative Objectives using Reinforcement Learning Maximilian Nägele et.al. 2405.13609 link
2024-05-21 Parallel Algorithm for Optimal Threshold Labeling of Ordinal Regression Methods Ryoya Yamasaki et.al. 2405.12756 link
2024-05-21 Short and simple introduction to Bellman filtering and smoothing Rutger-Jan Lange et.al. 2405.12668 null
2024-05-21 Data-driven Coordinated AC/DC Control Strategy for Frequency Safety Qianni Cao et.al. 2405.12546 null
2024-05-20 Semantic Trajectory Data Mining with LLM-Informed POI Classification Yifan Liu et.al. 2405.11715 null
2024-05-18 On the Trajectory Regularity of ODE-based Diffusion Sampling Defang Chen et.al. 2405.11326 link
2024-05-15 Harmonizing Human Insights and AI Precision: Hand in Hand for Advancing Knowledge Graph Task Shurong Wang et.al. 2405.09477 null
2024-05-14 Treatment Effect Estimation for User Interest Exploration on Recommender Systems Jiaju Chen et.al. 2405.08582 link
2024-05-27 Dynamic Programming for Symbolic Boolean Realizability and Synthesis Yi Lin et.al. 2405.07975 null
2024-05-13 Space Domain based Ecological Cooperative and Adaptive Cruise Control on Rolling Terrain Mingyue Lei et.al. 2405.07553 null
2024-05-12 Deciding regular games: a playground for exponential time algorithms Zihui Liang et.al. 2405.07188 null
2024-05-12 Trade execution games in a Markovian environment Masamitsu Ohnishi et.al. 2405.07184 null
2024-05-10 Dynamic programming principle and computable prices in financial market models with transaction costs Emmanuel Lepinette et.al. 2405.06623 null
2024-05-09 Change point localisation and inference in fragmented functional data Gengyu Xue et.al. 2405.05730 link
2024-05-09 Infinite horizon stochastic recursive control problems with jumps: dynamic programming and stochastic verification theorems Sheng Luo et.al. 2405.05561 null
2024-05-14 Robust Reward Placement under Uncertainty Petros Petsinis et.al. 2405.05433 null
2024-05-06 Novel Tour Construction Heuristic for Pick-Up and Delivery Routing Problems Mithun Goutham et.al. 2405.03774 null
2024-05-05 TSP Escapes the $O(2^n n^2)$ Curse Mihail Stoian et.al. 2405.03018 link
2024-05-02 DiffusionPipe: Training Large Diffusion Models with Efficient Pipelines Ye Tian et.al. 2405.01248 null
2024-05-02 Lipschitz constant estimation for general neural network architectures using control tools Patricia Pauli et.al. 2405.01125 link
2024-05-01 A biased random-key genetic algorithm with variable mutants to solve a vehicle routing problem Paola Festa et.al. 2405.00268 null
2024-04-28 Bi-objective optimization of a VRP problem applied to urban solid waste collection through a model that includes the visual attraction of routes Diego Rossit et.al. 2405.00068 null
2024-04-26 Energy Storage Arbitrage in Two-settlement Markets: A Transformer-Based Approach Saud Alghumayjan et.al. 2404.17683 null
2024-04-25 Path integral control under McKean-Vlasov dynamics Timothy Bennett et.al. 2404.17006 null
2024-04-25 Parallel and (Nearly) Work-Efficient Dynamic Programming Xiangyun Ding et.al. 2404.16314 link
2024-04-23 Prediction from compression for models with infinite memory, with applications to hidden Markov and renewal processes Yanjun Han et.al. 2404.15454 null
2024-04-26 Variational Dynamic Programming for Stochastic Optimal Control Marc Lambert et.al. 2404.14806 link
2024-04-22 Tile-Weighted Rate-Distortion Optimized Packet Scheduling for 360$^\circ$ VR Video Streaming Haopeng Wang et.al. 2404.14573 null
2024-04-21 Stochastic Multi-round Submodular Optimization with Budget Vincenzo Auletta et.al. 2404.13737 null
2024-04-21 Planning of Truck Platooning for Road-Network Capacitated Vehicle Routing Problem Yilang Hao et.al. 2404.13512 null
2024-04-20 Liquidity Pool Design on Automated Market Makers Xue Dong He et.al. 2404.13291 null
2024-04-19 Decentralized Coordination of Distributed Energy Resources through Local Energy Markets and Deep Reinforcement Learning Daniel May et.al. 2404.13142 null
2024-04-18 NLP-enabled trajectory map-matching in urban road networks using transformer sequence-to-sequence model Sevin Mohammadi et.al. 2404.12460 null
2024-04-18 Recursive stochastic differential games with non-Lipschitzian generators and viscosity solutions of Hamilton-Jacobi-Bellman-Isaacs equation Guangchen Wang et.al. 2404.12129 null
2024-04-18 Actor-Critic Reinforcement Learning with Phased Actor Ruofan Wu et.al. 2404.11834 null
2024-04-18 Itō and Itō-Wentzell chain rule for flows of conditional laws of continuous semimartingales: an easy approach Assil Fadle et.al. 2404.11010 null
2024-04-16 Zero-Sum Games for Volterra Integral Equations and Viscosity Solutions of Path-Dependent Hamilton-Jacobi Equations Mikhail I. Gomoyunov et.al. 2404.10428 null
2024-04-16 Urban Water Sprinkler Routing: A Multi-Depot Mixed Capacitated Arc Routing Problem Incorporating Real-Time Demands Hongtai Yang et.al. 2404.10230 null
2024-04-13 Fast Gradient Computation for Gromov-Wasserstein Distance Wei Zhang et.al. 2404.08970 null
2024-04-12 A Parametric Approach for Solving Convex Quadratic Optimization with Indicators Over Trees Aaresh Bhathena et.al. 2404.08178 link
2024-04-06 Viscosity solutions for mean field optimal switching with a two-time-scale Markov chain Tian Chen et.al. 2404.07998 null
2024-04-11 Parameterized Fast and Safe Tracking (FaSTrack) using Deepreach Hyun Joe Jeong et.al. 2404.07431 null
2024-04-09 Inexact Policy Iteration Methods for Large-Scale Markov Decision Processes Matilde Gargiani et.al. 2404.06136 null
2024-04-09 fastcpd: Fast Change Point Detection in R Xingchi Li et.al. 2404.05933 link
2024-04-08 Non-concave distributionally robust stochastic control in a discrete time finite horizon setting Ariel Neufeld et.al. 2404.05230 link
2024-04-07 Percentile Criterion Optimization in Offline Reinforcement Learning Elita A. Lobo et.al. 2404.05055 link
2024-04-05 A Ground Mobile Robot for Autonomous Terrestrial Laser Scanning-Based Field Phenotyping Javier Rodriguez-Sanchez et.al. 2404.04404 null
2024-04-04 Forecasting with Neuro-Dynamic Programming Pedro Afonso Fernandes et.al. 2404.03737 null
2024-04-03 Reinforcement Learning in Categorical Cybernetics Jules Hedges et.al. 2404.02688 null
2024-04-03 Transformer-based Stagewise Decomposition for Large-Scale Multistage Stochastic Optimization Chanyeong Kim et.al. 2404.02583 null
2024-04-01 Versatile Navigation under Partial Observability via Value-guided Diffusion Policy Gengyu Zhang et.al. 2404.02176 null
2024-03-31 Adversarially-Robust Inference on Trees via Belief Propagation Samuel B. Hopkins et.al. 2404.00768 null
2024-03-28 A Faster Algorithm for Pigeonhole Equal Sums Ce Jin et.al. 2403.19117 null
2024-03-27 Policy iteration for discrete-time systems with discounted costs: stability and near-optimality guarantees Jonathan de Brusse et.al. 2403.19007 null
2024-03-27 A Dynamic Programming Approach for Road Traffic Estimation Mattia Laurini et.al. 2403.18561 null
2024-03-26 Generalized Maximum Entropy Differential Dynamic Programming Yuichiro Aoyama et.al. 2403.18130 null
2024-03-26 Accuracy enhancement method for speech emotion recognition from spectrogram using temporal frequency correlation and positional information learning through knowledge transfer Jeong-Yoon Kim et.al. 2403.17327 link
2024-03-25 State-Augmented Linear Games with Antagonistic Error for High-Dimensional, Nonlinear Hamilton-Jacobi Reachability Will Sharpless et.al. 2403.16982 link
2024-03-25 Semantic-Aware Remote Estimation of Multiple Markov Sources Under Constraints Jiping Luo et.al. 2403.16855 null
2024-03-24 On the Navier-Stokes equations and the Hamilton-Jacobi-Bellman equation on the group of volume preserving diffeomorphisms Xiang-Dong Li et.al. 2403.15997 null
2024-03-23 On Merton's Optimal Portfolio Problem under Sporadic Bankruptcy Yaacov Kopeliovich et.al. 2403.15923 link
2024-03-22 Transactive Local Energy Markets Enable Community-Level Resource Coordination Using Individual Rewards Daniel C. May et.al. 2403.15617 null
2024-03-19 Most Likely Sequence Generation for $n$-Grams, Transformers, HMMs, and Markov Chains, by Using Rollout Algorithms Yuchao Li et.al. 2403.15465 null
2024-03-21 Conservative Linear Envelopes for High-Dimensional, Hamilton-Jacobi Reachability for Nonlinear Systems via the Hopf Formula Will Sharpless et.al. 2403.14184 null
2024-03-20 Optimal control of continuous-time symmetric systems with unknown dynamics and noisy measurements Hamed Taghavian et.al. 2403.13605 null
2024-03-19 Solving Combinatorial Pricing Problems using Embedded Dynamic Programming Models Quang Minh Bui et.al. 2403.12923 null
2024-03-18 AdaMER-CTC: Connectionist Temporal Classification with Adaptive Maximum Entropy Regularization for Automatic Speech Recognition SooHwan Eom et.al. 2403.11578 null
2024-03-17 Multiscale Quantile Regression with Local Error Control Zhi Liu et.al. 2403.11356 link
2024-03-15 Fast Generation of Feasible Trajectories in Direct Optimal Control David Kiessling et.al. 2403.10115 link
2024-03-14 Is Data All That Matters? The Role of Control Frequency for Learning-Based Sampled-Data Control of Uncertain Systems Ralf Römer et.al. 2403.09504 link
2024-03-14 Quantum Dynamic Programming Jeongrak Son et.al. 2403.09187 null
2024-03-15 Relationship between General MP and DPP for the Stochastic Recursive Optimal Control Problem With Jumps: Viscosity Solution Framework Bin Wang et.al. 2403.09044 null
2024-03-13 Model-free Resilient Controller Design based on Incentive Feedback Stackelberg Game and Q-learning Jiajun Shen et.al. 2403.08948 null
2024-03-13 Online Multi-Contact Feedback Model Predictive Control for Interactive Robotic Tasks Seo Wook Han et.al. 2403.08302 null
2024-03-12 Optimal Design and Implementation of an Open-source Emulation Platform for User-Centric Shared E-mobility Services Maqsood Hussain Shah et.al. 2403.07964 null
2024-03-12 The Primal Pathwidth SETH Michael Lampis et.al. 2403.07239 null
2024-03-10 A Unified Model for Spatio-Temporal Prediction Queries with Arbitrary Modifiable Areal Units Liyue Chen et.al. 2403.07022 link
2024-03-11 Domain-Independent Dynamic Programming and Constraint Programming Approaches for Assembly Line Balancing Problems with Setups Jiachen Zhang et.al. 2403.06780 null
2024-03-11 Balanced Substructures in Bicolored Graphs P. S. Ardra et.al. 2403.06608 null
2024-03-11 An Efficient Solution to the 2D Visibility Problem in Cartesian Grid Maps and its Application in Heuristic Path Planning Ibrahim Ibrahim et.al. 2403.06494 link
2024-03-11 AGAThA: Fast and Efficient GPU Acceleration of Guided Sequence Alignment for Long Read Mapping Seongyeon Park et.al. 2403.06478 link
2024-03-09 Spatial Clustering Approach for Vessel Path Identification Mohamed Abuella et.al. 2403.05778 link
2024-03-07 On $[1,2]$-Domination in Interval and Circle Graphs Mohsen Alambardar Meybodi et.al. 2403.04694 null
2024-03-07 Fill-and-Spill: Deep Reinforcement Learning Policy Gradient Methods for Reservoir Operation Decision and Control Sadegh Sadeghi Tabas et.al. 2403.04195 null
2024-03-06 Global Geolocated Realtime Data of Interfleet Urban Transit Bus Idling Nicholas Kunz et.al. 2403.03489 link
2024-03-06 SalienTime: User-driven Selection of Salient Time Steps for Large-Scale Geospatial Data Visualization Juntong Chen et.al. 2403.03449 link
2024-03-06 Leveraging The Finite States of Emotion Processing to Study Late-Life Mental Health Yuanzhe Huang et.al. 2403.03414 null
2024-03-04 Dynamic programming principle in cost-efficient sequential design: application to switching measurements Jeongmin Han et.al. 2403.02245 null
2024-03-04 Cooperative and Interaction-aware Driver Model for Lane Change Maneuver Jemin Woo et.al. 2403.01752 null
2024-03-01 DyPyBench: A Benchmark of Executable Python Software Islem Bouzenia et.al. 2403.00539 link
2024-03-01 Graph Construction with Flexible Nodes for Traffic Demand Prediction Jinyan Hou et.al. 2403.00276 link
2024-02-29 Lifelong Benchmarks: Efficient Model Evaluation in an Era of Rapid Progress Ameya Prabhu et.al. 2402.19472 link
2024-02-27 Globally Convergent Distributed Sequential Quadratic Programming with Overlapping Decomposition and Exact Augmented Lagrangian Merit Function Runxin Ni et.al. 2402.17170 null
2024-02-24 Selective Task offloading for Maximum Inference Accuracy and Energy efficient Real-Time IoT Sensing Systems Abdelkarim Ben Sada et.al. 2402.16904 null
2024-02-25 IKLink: End-Effector Trajectory Tracking with Minimal Reconfigurations Yeping Wang et.al. 2402.16154 link
2024-02-25 Evolving E-commerce Logistics Planning- Integrating Embedded Technology and Ant Colony Algorithm for Enhanced Efficiency Lynn Huang et.al. 2402.15965 null
2024-02-25 Budget-Constrained Tool Learning with Planning Yuanhang Zheng et.al. 2402.15960 link
2024-02-23 Neural optimal controller for stochastic systems via pathwise HJB operator Zhe Jiao et.al. 2402.15592 null
2024-02-23 Curve fitting on a quantum annealer for an advanced navigation method Philipp Isserstedt et.al. 2402.15308 null
2024-02-22 Quantum Markov Decision Processes Part II: Optimal Solutions and Algorithms Naci Saldi et.al. 2402.14651 null
2024-02-22 Quantum Markov Decision Processes Part I: General Theory, Approximations, and Classes of Policies Naci Saldi et.al. 2402.14649 null
2024-02-21 Quantum Annealing and Graph Neural Networks for Solving TSP with QUBO Haoqi He et.al. 2402.14036 null
2024-02-21 Do Efficient Transformers Really Save Computation? Kai Yang et.al. 2402.13934 null
2024-02-21 Benchmarking and Dissecting the Nvidia Hopper GPU Architecture Weile Luo et.al. 2402.13499 null
2024-02-20 An Improved Lower Bound on the Number of Pseudoline Arrangements Fernando Cortés Kühnast et.al. 2402.13107 null
2024-02-20 Smart Mobility Digital Twin Based Automated Vehicle Navigation System: A Proof of Concept Kui Wang et.al. 2402.12682 null
2024-02-19 An algorithm for counting number of all (normal) fuzzy subgroups in $U_{6n}$ Marek Hyčko et.al. 2402.12543 null
2024-02-29 Sequoia: Scalable, Robust, and Hardware-aware Speculative Decoding Zhuoming Chen et.al. 2402.12374 link
2024-02-19 Scalable Virtual Valuations Combinatorial Auction Design by Combining Zeroth-Order and First-Order Optimization Method Zhijian Duan et.al. 2402.11904 null
2024-02-19 Two Online Map Matching Algorithms Based on Analytic Hierarchy Process and Fuzzy Logic Jeremy J. Lin et.al. 2402.11866 null
2024-02-18 A Fisher Information based Receding Horizon Control Method for Signal Strength Model Estimation Yancheng Zhu et.al. 2402.11483 null
2024-02-16 Optimal Savings and Value of Population in A Stochastic Environment: Transient Behavior Hao Liu et.al. 2402.10768 null
2024-02-15 Engraving Oriented Joint Estimation of Pitch Spelling and Local and Global Keys Augustin Bouquillard et.al. 2402.10247 null
2024-02-14 Analyzing the Impact of Computation in Adaptive Dynamic Programming for Stochastic LQR Problem Wenhan Cao et.al. 2402.09575 null
2024-02-13 Approximate Sequential Optimization for Informative Path Planning Joshua Ott et.al. 2402.08841 link
2024-02-13 Sequence graphs realizations and ambiguity in language models Sammy Khalife et.al. 2402.08830 null
2024-02-11 GenSTL: General Sparse Trajectory Learning via Auto-regressive Generation of Feature Domains Yan Lin et.al. 2402.07232 link
2024-02-09 High-Precision Geosteering via Reinforcement Learning and Particle Filters Ressi Bonti Muhammad et.al. 2402.06377 null
2024-02-09 Bellman Conformal Inference: Calibrating Prediction Intervals For Time Series Zitong Yang et.al. 2402.05203 link
2024-02-04 Empowering Computing and Networks Convergence System with Distributed Cooperative Routing Yujiao Hu et.al. 2402.02381 null
2024-02-03 Multiple sequences Prophet Inequality Under Observation Constraints Aristomenis Tsopelakos et.al. 2402.02059 null
2024-02-02 Capturing waste collection planning expert knowledge in a fitness function through preference learning Laura Fernández Díaz et.al. 2402.01849 null
2024-02-02 Dynamic programming for the stochastic matching model on general graphs: the case of the 'N-graph' Loïc Jean et.al. 2402.01803 null
2024-02-01 AlphaRank: An Artificial Intelligence Approach for Ranking and Selection Problems Ruihan Zhou et.al. 2402.00907 null
2024-02-01 Cocco: Hardware-Mapping Co-Exploration towards Memory Capacity-Communication Optimization Zhanhong Tan et.al. 2402.00629 null
2024-02-02 Branch and Price for the Length-Constrained Cycle Partition Problem Mohammed Ghannam et.al. 2401.17937 link
2024-01-31 Revisiting speech segmentation and lexicon learning with better features Herman Kamper et.al. 2401.17902 null
2024-02-16 The computation of approximate feedback Stackelberg equilibria in multi-player nonlinear constrained dynamic games Jingqi Li et.al. 2401.15745 link
2024-01-28 HappyRouting: Learning Emotion-Aware Route Trajectories for Scalable In-The-Wild Navigation David Bethge et.al. 2401.15695 null
2024-01-28 Constrained Markov decision processes for response-adaptive procedures in clinical trials with binary outcomes Stef Baas et.al. 2401.15694 null
2024-01-27 Fair and Efficient Ridesharing: A Dynamic Programming-based Relocation Approach Aqsa Ashraf Makhdomi et.al. 2401.15363 null
2024-01-27 Optimal Sparse Survival Trees Rui Zhang et.al. 2401.15330 link
2024-01-25 Domain-Independent Dynamic Programming Ryo Kuroiwa et.al. 2401.13883 link
2024-01-27 Deep multitask neural networks for solving some stochastic optimal control problems Christian Yeo et.al. 2401.12923 link
2024-01-23 Optimal Stopping of Branching Diffusion Processes Idris Kharroubi et.al. 2401.12811 null
2024-01-22 On a class of interdiction problems with partition matroids: complexity and polynomial-time algorithms Sergey S. Ketkov et.al. 2401.12010 null
2024-01-22 Finite horizon optimal control of reaction-diffusion SIV epidemic system with stochastic environment Zong Wang et.al. 2401.11744 null
2024-01-20 Closing the Gap between TD Learning and Supervised Learning -- A Generalisation Point of View Raj Ghugare et.al. 2401.11237 link

Large Language Model

Publish Date Title Authors PDF Code
2025-06-10 VIKI-R: Coordinating Embodied Multi-Agent Cooperation via Reinforcement Learning Li Kang et.al. 2506.09049 null
2025-06-10 Same Task, Different Circuits: Disentangling Modality-Specific Mechanisms in VLMs Yaniv Nikankin et.al. 2506.09047 null
2025-06-10 Agentic Neural Networks: Self-Evolving Multi-Agent Systems via Textual Backpropagation Xiaowen Ma et.al. 2506.09046 null
2025-06-10 Cosmos-Drive-Dreams: Scalable Synthetic Driving Data Generation with World Foundation Models Xuanchi Ren et.al. 2506.09042 null
2025-06-10 Autoregressive Semantic Visual Reconstruction Helps VLMs Understand Better Dianyi Wang et.al. 2506.09040 null
2025-06-10 AbstentionBench: Reasoning LLMs Fail on Unanswerable Questions Polina Kirichenko et.al. 2506.09038 null
2025-06-10 FZOO: Fast Zeroth-Order Optimizer for Fine-Tuning Large Language Models towards Adam-Scale Speed Sizhe Dang et.al. 2506.09034 null
2025-06-10 Router-R1: Teaching LLMs Multi-Round Routing and Aggregation via Reinforcement Learning Haozhen Zhang et.al. 2506.09033 null
2025-06-10 Do MIL Models Transfer? Daniel Shao et.al. 2506.09022 null
2025-06-10 SPEED-RL: Faster Training of Reasoning Models via Online Curriculum Learning Ruiqi Zhang et.al. 2506.09016 null
2025-06-10 Learning to Reason Across Parallel Samples for LLM Reasoning Jianing Qi et.al. 2506.09014 null
2025-06-10 Boosting Rust Unit Test Coverage through Hybrid Program Analysis and Large Language Models Bei Chu et.al. 2506.09002 null
2025-06-10 Efficient Medical Vision-Language Alignment Through Adapting Masked Vision Models Chenyu Lian et.al. 2506.08990 null
2025-06-10 SwS: Self-aware Weakness-driven Problem Synthesis in Reinforcement Learning for LLM Reasoning Xiao Liang et.al. 2506.08989 null
2025-06-10 On Finetuning Tabular Foundation Models Ivan Rubachev et.al. 2506.08982 null
2025-06-10 AdaDec: Uncertainty-Guided Adaptive Decoding for LLM-based Code Generation Kaifeng He et.al. 2506.08980 null
2025-06-10 Propositional Logic for Probing Generalization in Neural Networks Anna Langedijk et.al. 2506.08978 null
2025-06-10 Atomic-to-Compositional Generalization for Mobile Agents with A New Benchmark and Scheduling System Yuan Guo et.al. 2506.08972 null
2025-06-10 ADAM: Autonomous Discovery and Annotation Model using LLMs for Context-Aware Annotations Amirreza Rouhi et.al. 2506.08968 null
2025-06-10 Step-Audio-AQAA: a Fully End-to-End Expressive Large Audio Language Model Ailin Huang et.al. 2506.08967 null
2025-06-09 GUI-Reflection: Empowering Multimodal GUI Models with Self-Reflection Behavior Penghao Wu et.al. 2506.08012 null
2025-06-09 Play to Generalize: Learning to Reason Through Game Play Yunfei Xie et.al. 2506.08011 null
2025-06-09 Vision Transformers Don't Need Trained Registers Nick Jiang et.al. 2506.08010 null
2025-06-09 Hidden in plain sight: VLMs overlook their visual representations Stephanie Fu et.al. 2506.08008 null
2025-06-09 Reinforcement Pre-Training Qingxiu Dong et.al. 2506.08007 null
2025-06-09 Reparameterized LLM Training via Orthogonal Equivalence Transformation Zeju Qiu et.al. 2506.08001 null
2025-06-09 Supporting Construction Worker Well-Being with a Multi-Agent Conversational AI System Fan Yang et.al. 2506.07997 null
2025-06-09 HeuriGym: An Agentic Benchmark for LLM-Crafted Heuristics in Combinatorial Optimization Hongzheng Chen et.al. 2506.07972 null
2025-06-09 CyberV: Cybernetics for Test-time Scaling in Video Understanding Jiahao Meng et.al. 2506.07971 null
2025-06-09 SpaCE-10: A Comprehensive Benchmark for Multimodal Large Language Models in Compositional Spatial Intelligence Ziyang Gong et.al. 2506.07966 null
2025-06-09 Reinforcing Multimodal Understanding and Generation with Dual Self-rewards Jixiang Hong et.al. 2506.07963 null
2025-06-09 Correlated Errors in Large Language Models Elliot Kim et.al. 2506.07962 null
2025-06-09 BridgeVLA: Input-Output Alignment for Efficient 3D Manipulation Learning with Vision-Language Models Peiyan Li et.al. 2506.07961 null
2025-06-09 Language Models over Canonical Byte-Pair Encodings Tim Vieira et.al. 2506.07956 null
2025-06-09 TokenBreak: Bypassing Text Classification Models Through Token Manipulation Kasimir Schulz et.al. 2506.07948 null
2025-06-09 Statistical Hypothesis Testing for Auditing Robustness in Language Models Paulius Rauba et.al. 2506.07947 null
2025-06-09 ProtocolLLM: RTL Benchmark for SystemVerilog Generation of Communication Protocols Arnav Sheth et.al. 2506.07945 null
2025-06-09 Decoupling the Image Perception and Multimodal Reasoning for Reasoning Segmentation with Digital Twin Representations Yizhen Li et.al. 2506.07943 null
2025-06-09 Adversarial Attack Classification and Robustness Testing for Large Language Models for Code Yang Liu et.al. 2506.07942 null
2025-06-09 Gradients: When Markets Meet Fine-tuning -- A Distributed Approach to Model Optimisation Christopher Subia-Waud et.al. 2506.07940 null
2025-06-06 TerraFM: A Scalable Foundation Model for Unified Multisensor Earth Observation Muhammad Sohail Danish et.al. 2506.06281 null
2025-06-06 Eigenspectrum Analysis of Neural Networks without Aspect Ratio Bias Yuanzhe Hu et.al. 2506.06280 null
2025-06-06 CoMemo: LVLMs Need Image Context with Image Memory Shi Liu et.al. 2506.06279 null
2025-06-06 Movie Facts and Fibs (MF$^2$): A Benchmark for Long Movie Understanding Emmanouil Zaranis et.al. 2506.06275 null
2025-06-06 AdvSumm: Adversarial Training for Bias Mitigation in Text Summarization Mukur Gupta et.al. 2506.06273 null
2025-06-06 RecGPT: A Foundation Model for Sequential Recommendation Yangqin Jiang et.al. 2506.06270 null
2025-06-09 Cartridges: Lightweight and general-purpose long context representations via self-study Sabri Eyuboglu et.al. 2506.06266 null
2025-06-06 PersonaAgent: When Large Language Model Agents Meet Personalization at Test Time Weizhi Zhang et.al. 2506.06254 null
2025-06-06 DesignBench: A Comprehensive Benchmark for MLLM-based Front-end Code Generation Jingyu Xiao et.al. 2506.06251 null
2025-06-06 Visual Graph Arena: Evaluating Visual Conceptualization of Vision and Multimodal Large Language Models Zahra Babaiee et.al. 2506.06242 null
2025-06-06 Bridging External and Parametric Knowledge: Mitigating Hallucination of LLMs with Shared-Private Semantic Synergy in Dual-Stream Knowledge Yi Sui et.al. 2506.06240 null
2025-06-06 Explaining Matters: Leveraging Definitions and Semantic Expansion for Sexism Detection Sahrish Khan et.al. 2506.06238 null
2025-06-06 Challenging Vision-Language Models with Surgical Data: A New Dataset and Broad Benchmarking Study Leon Mayer et.al. 2506.06232 null
2025-06-06 CompilerGPT: Leveraging Large Language Models for Analyzing and Acting on Compiler Optimization Reports Peter Pirkelbauer et.al. 2506.06227 null
2025-06-06 PROVSYN: Synthesizing Provenance Graphs for Data Augmentation in Intrusion Detection Systems Yi Huang et.al. 2506.06226 null
2025-06-06 GenIR: Generative Visual Feedback for Mental Image Retrieval Diji Yang et.al. 2506.06220 null
2025-06-06 STSBench: A Spatio-temporal Scenario Benchmark for Multi-modal Large Language Models in Autonomous Driving Christian Fruhwirth-Reisinger et.al. 2506.06218 null
2025-06-06 Corrector Sampling in Language Models Itai Gat et.al. 2506.06215 null
2025-06-06 Can Theoretical Physics Research Benefit from Language Agents? Sirui Lu et.al. 2506.06214 null
2025-06-06 PuzzleWorld: A Benchmark for Multimodal, Open-Ended Reasoning in Puzzlehunts Hengzhi Li et.al. 2506.06211 null
2025-06-05 Why LLM Safety Guardrails Collapse After Fine-tuning: A Similarity Analysis Between Alignment and Fine-tuning Datasets Lei Hsiung et.al. 2506.05346 null
2025-06-05 SparseMM: Head Sparsity Emerges from Visual Concept Responses in MLLMs Jiahui Wang et.al. 2506.05344 null
2025-06-05 Direct Numerical Layout Generation for 3D Indoor Scene Synthesis via Spatial Reasoning Xingjian Ran et.al. 2506.05341 null
2025-06-05 Flattery, Fluff, and Fog: Diagnosing and Mitigating Idiosyncratic Biases in Preference Models Anirudh Bharadwaj et.al. 2506.05339 null
2025-06-05 VideoMolmo: Spatio-Temporal Grounding Meets Pointing Ghazi Shazan Ahmad et.al. 2506.05336 null
2025-06-05 Search Arena: Analyzing Search-Augmented LLMs Mihran Miroyan et.al. 2506.05334 null
2025-06-05 Unleashing Hour-Scale Video Training for Long Video-Language Understanding Jingyang Lin et.al. 2506.05332 null
2025-06-05 MINT-CoT: Enabling Interleaved Visual Tokens in Mathematical Chain-of-Thought Reasoning Xinyan Chen et.al. 2506.05331 null
2025-06-05 LSM-2: Learning from Incomplete Wearable Sensor Data Maxwell A. Xu et.al. 2506.05321 null
2025-06-06 Does Your 3D Encoder Really Work? When Pretrain-SFT from 2D VLMs Meets 3D VLMs Haoyuan Li et.al. 2506.05318 null
2025-06-05 Improving Data Efficiency for LLM Reinforcement Fine-tuning Through Difficulty-targeted Online Data Selection and Rollout Replay Yifan Sun et.al. 2506.05316 null
2025-06-05 Constrained Entropic Unlearning: A Primal-Dual Framework for Large Language Models Taha Entesari et.al. 2506.05314 null
2025-06-05 ProRefine: Inference-time Prompt Refinement with Textual Feedback Deepak Pandita et.al. 2506.05305 null
2025-06-05 Perceive Anything: Recognize, Explain, Caption, and Segment Anything in Images and Videos Weifeng Lin et.al. 2506.05302 null
2025-06-05 Power Law Guided Dynamic Sifting for Efficient Attention Nirav Koley et.al. 2506.05300 null
2025-06-05 Control Tax: The Price of Keeping AI in Check Mikhail Terekhov et.al. 2506.05296 null
2025-06-05 Sample Complexity and Representation Ability of Test-time Scaling Paradigms Baihe Huang et.al. 2506.05295 null
2025-06-05 EOC-Bench: Can MLLMs Identify, Recall, and Forecast Objects in an Egocentric World? Yuqian Yuan et.al. 2506.05287 null
2025-06-05 Micro-Act: Mitigate Knowledge Conflict in Question Answering via Actionable Self-Reasoning Nan Huo et.al. 2506.05278 null
2025-06-06 Teaming in the AI Era: AI-Augmented Frameworks for Forming, Simulating, and Optimizing Human Teams Mohammed Almutairi et.al. 2506.05265 null
2025-06-04 OWMM-Agent: Open World Mobile Manipulation With Multi-modal Agentic Data Synthesis Junting Chen et.al. 2506.04217 null
2025-06-04 Language-Image Alignment with Fixed Text Encoders Jingfeng Yang et.al. 2506.04209 null
2025-06-04 Advancing Multimodal Reasoning: From Optimized Cold Start to Staged Reinforcement Learning Shuang Chen et.al. 2506.04207 null
2025-06-04 EPiC: Towards Lossless Speedup for Reasoning Training through Edge-Preserving CoT Condensation Jinghan Jia et.al. 2506.04205 null
2025-06-04 Cascadia: A Cascade Serving System for Large Language Models Youhe Jiang et.al. 2506.04203 null
2025-06-04 TracLLM: A Generic Framework for Attributing Long Context LLMs Yanting Wang et.al. 2506.04202 null
2025-06-04 R-Search: Empowering LLM Reasoning with Search via Multi-Reward Reinforcement Learning Qingfei Zhao et.al. 2506.04185 null
2025-06-04 SuperWriter: Reflection-Driven Long-Form Generation with Large Language Models Yuhao Wu et.al. 2506.04180 null
2025-06-04 SkipGPT: Dynamic Layer Pruning Reinvented with Token Awareness and Module Decoupling Anhao Zhao et.al. 2506.04179 null
2025-06-04 Does Prompt Design Impact Quality of Data Imputation by LLMs? Shreenidhi Srinivasan et.al. 2506.04172 null
2025-06-04 VISCA: Inferring Component Abstractions for Automated End-to-End Testing Parsa Alian et.al. 2506.04161 null
2025-06-04 Image Editing As Programs with Diffusion Models Yujia Hu et.al. 2506.04158 null
2025-06-04 A Dataset for Addressing Patient's Information Needs related to Clinical Course of Hospitalization Sarvesh Soni et.al. 2506.04156 null
2025-06-04 Establishing Trustworthy LLM Evaluation via Shortcut Neuron Analysis Kejian Zhu et.al. 2506.04142 null
2025-06-04 MMR-V: What's Left Unsaid? A Benchmark for Multimodal Deep Reasoning in Videos Kejian Zhu et.al. 2506.04141 null
2025-06-04 TRiSM for Agentic AI: A Review of Trust, Risk, and Security Management in LLM-based Agentic Multi-Agent Systems Shaina Raza et.al. 2506.04133 null
2025-06-04 Recent Advances in Medical Image Classification Loan Dao et.al. 2506.04129 null
2025-06-04 Guided Speculative Inference for Efficient Test-Time Alignment of LLMs Jonathan Geuter et.al. 2506.04118 null
2025-06-05 Rectified Sparse Attention Yutao Sun et.al. 2506.04108 null
2025-06-04 TextAtari: 100K Frames Game Playing with Language Agents Wenhao Li et.al. 2506.04098 link
2025-06-03 Causal Estimation of Tokenisation Bias Pietro Lesci et.al. 2506.03149 null
2025-06-04 UniWorld: High-Resolution Semantic Encoders for Unified Visual Understanding and Generation Bin Lin et.al. 2506.03147 null
2025-06-03 Entity-Augmented Neuroscience Knowledge Retrieval Using Ontology and Semantic Understanding Capability of LLM Pralaypati Ta et.al. 2506.03145 null
2025-06-03 Not All Tokens Are Meant to Be Forgotten Xiangyu Zhou et.al. 2506.03142 null
2025-06-03 SVGenius: Benchmarking LLMs in SVG Understanding, Editing and Generation Siqi Chen et.al. 2506.03139 null
2025-06-03 OmniSpatial: Towards Comprehensive Spatial Reasoning Benchmark for Vision Language Models Mengdi Jia et.al. 2506.03135 null
2025-06-03 Native-Resolution Image Synthesis Zidong Wang et.al. 2506.03131 null
2025-06-03 AnimeShooter: A Multi-Shot Animation Dataset for Reference-Guided Video Generation Lu Qiu et.al. 2506.03126 null
2025-06-03 AUTOCIRCUIT-RL: Reinforcement Learning-Driven LLM for Automated Circuit Topology Generation Prashanth Vijayaraghavan et.al. 2506.03122 null
2025-06-03 Targeted Forgetting of Image Subgroups in CLIP Models Zeliang Zhang et.al. 2506.03117 null
2025-06-04 Critique-GRPO: Advancing LLM Reasoning with Natural Language and Numerical Feedback Xiaoying Zhang et.al. 2506.03106 null
2025-06-03 Beyond Text Compression: Evaluating Tokenizers Across Scales Jonas F. Lotz et.al. 2506.03101 null
2025-06-03 TalkingMachines: Real-Time Audio-Driven FaceTime-Style Video via Autoregressive Diffusion Models Chetwin Low et.al. 2506.03099 null
2025-06-03 EgoVLM: Policy Optimization for Egocentric Video Understanding Ashwin Vinod et.al. 2506.03097 null
2025-06-03 DPO Learning with LLMs-Judge Signal for Computer Use Agents Man Luo et.al. 2506.03095 null
2025-06-03 From Flat to Hierarchical: Extracting Sparse Representations with Matching Pursuit Valérie Costa et.al. 2506.03093 null
2025-06-03 Literary Evidence Retrieval via Long-Context Language Models Katherine Thai et.al. 2506.03090 null
2025-06-03 StreamBP: Memory-Efficient Exact Backpropagation for Long Sequence Training of LLMs Qijun Luo et.al. 2506.03077 null
2025-06-03 LEG-SLAM: Real-Time Language-Enhanced Gaussian Splatting for SLAM Roman Titkov et.al. 2506.03073 null
2025-06-03 EDITOR: Effective and Interpretable Prompt Inversion for Text-to-Image Diffusion Models Mingzhe Li et.al. 2506.03067 null
2025-05-30 ReasonGen-R1: CoT for Autoregressive Image generation models through SFT and RL Yu Zhang et.al. 2505.24875 null
2025-05-30 The Road to Generalizable Neuro-Symbolic Learning Should be Paved with Foundation Models Adam Stein et.al. 2505.24874 null
2025-05-30 ProxyThinker: Test-Time Guidance through Small Visual Reasoners Zilin Xiao et.al. 2505.24872 null
2025-05-30 MoDoMoDo: Multi-Domain Data Mixtures for Multimodal LLM Reinforcement Learning Yiqing Liang et.al. 2505.24871 null
2025-05-30 GenSpace: Benchmarking Spatially-Aware Image Generation Zehan Wang et.al. 2505.24870 null
2025-05-30 SiLVR: A Simple Language-based Video Reasoning Framework Ce Zhang et.al. 2505.24869 link
2025-05-30 Time Blindness: Why Video-Language Models Can't See What Humans Can? Ujjwal Upadhyay et.al. 2505.24867 null
2025-05-30 ProRL: Prolonged Reinforcement Learning Expands Reasoning Boundaries in Large Language Models Mingjie Liu et.al. 2505.24864 link
2025-05-30 Beyond Multiple Choice: Evaluating Steering Vectors for Adaptive Free-Form Summarization Joschka Braun et.al. 2505.24859 null
2025-05-30 Accelerated Sampling from Masked Diffusion Models via Entropy Bounded Unmasking Heli Ben-Hamu et.al. 2505.24857 null
2025-05-30 MiCRo: Mixture Modeling and Context-aware Routing for Personalized Preference Learning Jingyan Shen et.al. 2505.24846 null
2025-05-30 Chameleon: A Flexible Data-mixing Framework for Language Model Pretraining and Finetuning Wanyun Xie et.al. 2505.24844 null
2025-05-30 Cascading Adversarial Bias from Injection to Distillation in Language Models Harsh Chaudhari et.al. 2505.24842 null
2025-05-30 Vision LLMs Are Bad at Hierarchical Visual Understanding, and LLMs Are the Bottleneck Yuwen Tan et.al. 2505.24840 null
2025-05-30 VideoCAD: A Large-Scale Video Dataset for Learning UI Interactions and 3D Reasoning from CAD Software Brandon Man et.al. 2505.24838 null
2025-06-02 How much do language models memorize? John X. Morris et.al. 2505.24832 null
2025-05-30 Improving Reliability and Explainability of Medical Question Answering through Atomic Fact Checking in Retrieval-Augmented LLMs Juraj Vladika et.al. 2505.24830 null
2025-05-30 LegalEval-Q: A New Benchmark for The Quality Evaluation of LLM-Generated Legal Text Li Yunhan et.al. 2505.24826 null
2025-05-30 PhySense: Principle-Based Physics Reasoning Benchmarking for Large Language Models Yinggan Xu et.al. 2505.24823 null
2025-05-30 Bi-Manual Joint Camera Calibration and Scene Representation Haozhan Tang et.al. 2505.24819 null
2025-05-29 TextRegion: Text-Aligned Region Tokens from Frozen Image-Text Models Yao Xiao et.al. 2505.23769 link
2025-05-29 Argus: Vision-Centric Reasoning with Grounded Chain-of-Thought Yunze Man et.al. 2505.23766 null
2025-05-29 From Chat Logs to Collective Insights: Aggregative Question Answering Wentao Zhang et.al. 2505.23765 null
2025-05-29 MMSI-Bench: A Benchmark for Multi-Image Spatial Intelligence Sihan Yang et.al. 2505.23764 null
2025-05-29 ZeroGUI: Automating Online GUI Learning at Zero Human Cost Chenyu Yang et.al. 2505.23762 link
2025-05-29 Differential Information: An Information-Theoretic Perspective on Preference Optimization Yunjae Won et.al. 2505.23761 null
2025-05-29 Puzzled by Puzzles: When Vision-Language Models Can't Take a Hint Heekyung Lee et.al. 2505.23759 link
2025-05-29 DeepTheorem: Advancing LLM Reasoning for Theorem Proving Through Natural Language and Reinforcement Learning Ziyin Zhang et.al. 2505.23754 link
2025-05-29 ThinkGeo: Evaluating Tool-Augmented Agents for Remote Sensing Tasks Akashah Shabbir et.al. 2505.23752 link
2025-05-29 Distortion of AI Alignment: Does Preference Optimization Optimize for Preferences? Paul Gölz et.al. 2505.23749 null
2025-05-29 Spatial-MLLM: Boosting MLLM Capabilities in Visual-based Spatial Intelligence Diankun Wu et.al. 2505.23747 null
2025-05-29 To Trust Or Not To Trust Your Vision-Language Model's Prediction Hao Dong et.al. 2505.23745 link
2025-05-29 LayerPeeler: Autoregressive Peeling for Layer-wise Image Vectorization Ronghuan Wu et.al. 2505.23740 null
2025-05-29 ATLAS: Learning to Optimally Memorize the Context at Test Time Ali Behrouz et.al. 2505.23735 null
2025-05-29 Bounded Rationality for LLMs: Satisficing Alignment at Inference-Time Mohamad Chehade et.al. 2505.23729 null
2025-05-29 PixelThink: Towards Efficient Chain-of-Pixel Reasoning Song Wang et.al. 2505.23727 null
2025-05-29 FMG-Det: Foundation Model Guided Robust Object Detection Darryl Hannan et.al. 2505.23726 null
2025-05-29 MuLoCo: Muon is a practical inner optimizer for DiLoCo Benjamin Thérien et.al. 2505.23725 null
2025-05-29 SC-LoRA: Balancing Efficient Fine-tuning and Knowledge Preservation via Subspace-Constrained LoRA Minrui Luo et.al. 2505.23724 null
2025-05-29 ML-Agent: Reinforcing LLM Agents for Autonomous Machine Learning Engineering Zexi Liu et.al. 2505.23723 link
2025-05-28 Zero-Shot Vision Encoder Grafting via LLM Surrogates Kaiyu Yue et.al. 2505.22664 link
2025-05-28 Training Free Stylized Abstraction Aimon Rahman et.al. 2505.22663 null
2025-05-28 AutoL2S: Auto Long-Short Reasoning for Efficient Large Language Models Feng Luo et.al. 2505.22662 null
2025-05-28 GuessArena: Guess Who I Am? A Self-Adaptive Framework for Evaluating LLMs in Domain-Specific Knowledge and Reasoning Qingchen Yu et.al. 2505.22661 null
2025-05-29 Maximizing Confidence Alone Improves Reasoning Mihir Prabhudesai et.al. 2505.22660 null
2025-05-28 3DLLM-Mem: Long-Term Spatial-Temporal Memory for Embodied 3D Large Language Model Wenbo Hu et.al. 2505.22657 null
2025-05-28 Position: Uncertainty Quantification Needs Reassessment for Large-language Model Agents Michael Kirchhof et.al. 2505.22655 null
2025-05-28 VScan: Rethinking Visual Token Reduction for Efficient Large Vision-Language Models Ce Zhang et.al. 2505.22654 null
2025-05-28 The Climb Carves Wisdom Deeper Than the Summit: On the Noisy Rewards in Learning to Reason Ang Lv et.al. 2505.22653 null
2025-05-28 Sherlock: Self-Correcting Reasoning in Vision-Language Models Yi Ding et.al. 2505.22651 null
2025-05-28 Characterizing Bias: Benchmarking Large Language Models in Simplified versus Traditional Chinese Hanjia Lyu et.al. 2505.22645 link
2025-05-28 Understanding (Un)Reliability of Steering Vectors in Language Models Joschka Braun et.al. 2505.22637 null
2025-05-28 Learning Composable Chains-of-Thought Fangcong Yin et.al. 2505.22635 null
2025-05-28 Spatial Knowledge Graph-Guided Multimodal Synthesis Yida Xue et.al. 2505.22633 null
2025-05-28 Stochastic Chameleons: Irrelevant Context Hallucinations Reveal Class-Based (Mis)Generalization in LLMs Ziling Cheng et.al. 2505.22630 null
2025-05-28 Principled Out-of-Distribution Generalization via Simplicity Jiawei Ge et.al. 2505.22622 null
2025-05-28 Fast-dLLM: Training-free Acceleration of Diffusion LLM by Enabling KV Cache and Parallel Decoding Chengyue Wu et.al. 2505.22618 null
2025-05-28 The Entropy Mechanism of Reinforcement Learning for Reasoning Language Models Ganqu Cui et.al. 2505.22617 null
2025-05-28 RICO: Improving Accuracy and Completeness in Image Recaptioning via Visual Reconstruction Yuchi Wang et.al. 2505.22613 null
2025-05-28 Effective and Efficient One-pass Compression of Speech Foundation Models Using Sparsity-aware Self-pinching Gates Haoning Xu et.al. 2505.22608 null
2025-05-27 Silence is Not Consensus: Disrupting Agreement Bias in Multi-Agent LLMs via Catfish Agent for Clinical Decision Making Yihan Wang et.al. 2505.21503 null
2025-05-27 ViewSpatial-Bench: Evaluating Multi-perspective Spatial Localization in Vision-Language Models Dingming Li et.al. 2505.21500 null
2025-05-27 AdInject: Real-World Black-Box Attacks on Web Agents via Advertising Delivery Haowei Wang et.al. 2505.21499 link
2025-05-27 Adversarial Attacks against Closed-Source MLLMs via Feature Optimal Alignment Xiaojun Jia et.al. 2505.21494 link
2025-05-27 Reinforcing General Reasoning without Verifiers Xiangxin Zhou et.al. 2505.21493 null
2025-05-27 Robust Hypothesis Generation: LLM-Automated Language Bias for Inductive Logic Programming Yang Yang et.al. 2505.21486 null
2025-05-27 Are Language Models Consequentialist or Deontological Moral Reasoners? Keenan Samway et.al. 2505.21479 null
2025-05-27 Policy Optimized Text-to-Image Pipeline Design Uri Gadot et.al. 2505.21478 null
2025-05-27 Mitigating Hallucination in Large Vision-Language Models via Adaptive Attention Calibration Mehrdad Fazli et.al. 2505.21472 null
2025-05-27 Scaling External Knowledge Input Beyond Context Windows of LLMs via Multi-Agent Collaboration Zijun Liu et.al. 2505.21471 link
2025-05-27 Accelerating Diffusion Language Model Inference via Efficient KV Caching and Guided Diffusion Zhanqiu Hu et.al. 2505.21467 null
2025-05-27 ID-Align: RoPE-Conscious Position Remapping for Dynamic High-Resolution Adaptation in Vision-Language Models Bozhou Li et.al. 2505.21465 null
2025-05-27 LazyVLM: Neuro-Symbolic Approach to Video Analytics Xiangru Jian et.al. 2505.21459 null
2025-05-27 Do LLMs Need to Think in One Language? Correlation between Latent Language and Task Performance Shintaro Ozaki et.al. 2505.21458 null
2025-05-27 Active-O3: Empowering Multimodal Large Language Models with Active Perception via GRPO Muzhi Zhu et.al. 2505.21457 null
2025-05-27 Can Large Reasoning Models Self-Train? Sheikh Shafayat et.al. 2505.21444 null
2025-05-27 Towards Better Instruction Following Retrieval Models Yuchen Zhuang et.al. 2505.21439 null
2025-05-27 Hume: Introducing System-2 Thinking in Visual-Language-Action Model Haoming Song et.al. 2505.21432 null
2025-05-27 Policy Induction: Predicting Startup Success via Explainable Memory-Augmented In-Context Learning Xianling Mu et.al. 2505.21427 null
2025-05-27 GUARD: Dual-Agent based Backdoor Defense on Chain-of-Thought in Neural Code Generation Naizhu Jin et.al. 2505.21425 null
2025-05-26 On Path to Multimodal Historical Reasoning: HistBench and HistAgent Jiahao Qiu et.al. 2505.20246 link
2025-05-26 KnowTrace: Bootstrapping Iterative Retrieval-Augmented Generation with Structured Knowledge Tracing Rui Li et.al. 2505.20245 link
2025-05-26 It's High Time: A Survey of Temporal Information Retrieval and Question Answering Bhawna Piryani et.al. 2505.20243 null
2025-05-26 RedAHD: Reduction-Based End-to-End Automatic Heuristic Design with Large Language Models Nguyen Thach et.al. 2505.20242 null
2025-05-26 DreamPRM: Domain-Reweighted Process Reward Model for Multimodal Reasoning Qi Cao et.al. 2505.20241 null
2025-05-26 Efficient Speech Translation through Model Compression and Knowledge Distillation Yasmin Moslem et.al. 2505.20237 link
2025-05-26 Seeing is Believing, but How Much? A Comprehensive Analysis of Verbalized Calibration in Vision-Language Models Weihao Xuan et.al. 2505.20236 null
2025-05-26 FLAME-MoE: A Transparent End-to-End Research Platform for Mixture-of-Experts Language Models Hao Kang et.al. 2505.20225 link
2025-05-26 Chain-of-Thought for Autonomous Driving: A Comprehensive Survey and Future Prospects Yixin Cui et.al. 2505.20223 link
2025-05-26 Fine-grained List-wise Alignment for Generative Medication Recommendation Chenxiao Fan et.al. 2505.20218 link
2025-05-26 Parameter-Efficient Fine-Tuning with Column Space Projection Junseo Hwang et.al. 2505.20211 null
2025-05-26 How to Improve the Robustness of Closed-Source Models on NLI Joe Stacey et.al. 2505.20209 null
2025-05-26 Evaluating Large Language Models for Code Review Umut Cihan et.al. 2505.20206 null
2025-05-26 PathBench: A comprehensive comparison benchmark for pathology foundation models towards precision oncology Jiabo Ma et.al. 2505.20202 null
2025-05-26 Reasoning Is Not All You Need: Examining LLMs for Multi-Turn Mental Health Conversations Mohit Chandra et.al. 2505.20201 null
2025-05-26 Adaptive Classifier-Free Guidance via Dynamic Low-Confidence Masking Pengxiang Li et.al. 2505.20199 link
2025-05-26 Temporal Sampling for Forgotten Reasoning in LLMs Yuetai Li et.al. 2505.20196 link
2025-05-26 FunReason: Enhancing Large Language Models' Function Calling via Self-Refinement Multiscale Loss and Automated Data Refinement Bingguang Hao et.al. 2505.20192 link
2025-05-26 THiNK: Can Large Language Models Think-aloud? Yongan Yu et.al. 2505.20184 link
2025-05-26 An Empirical Study on Strong-Weak Model Collaboration for Repo-level Code Generation Shubham Gandhi et.al. 2505.20182 link
2025-05-26 Pangu Light: Weight Re-Initialization for Pruning and Accelerating LLMs Hanting Chen et.al. 2505.20155 null
2025-05-26 UORA: Uniform Orthogonal Reinitialization Adaptation in Parameter-Efficient Fine-Tuning of Large Models Xueyan Zhang et.al. 2505.20154 null
2025-05-26 MineAnyBuild: Benchmarking Spatial Planning for Open-world AI Agents Ziming Wei et.al. 2505.20148 link
2025-05-26 FUDOKI: Discrete Flow-based Unified Understanding and Generation via Kinetic-Optimal Velocities Jin Wang et.al. 2505.20147 null
2025-05-26 SeMe: Training-Free Language Model Merging via Semantic Alignment Jian Gu et.al. 2505.20144 null
2025-05-26 StructEval: Benchmarking LLMs' Capabilities to Generate Structural Outputs Jialin Yang et.al. 2505.20139 null
2025-05-26 AweDist: Attention-aware Embedding Distillation for New Input Token Embeddings Konstantin Dobler et.al. 2505.20133 null
2025-05-26 Agentic 3D Scene Generation with Spatially Contextualized VLMs Xinhang Liu et.al. 2505.20129 null
2025-05-26 Iterative Self-Incentivization Empowers Large Language Models as Agentic Searchers Zhengliang Shi et.al. 2505.20128 link
2025-05-26 Agentic AI Process Observability: Discovering Behavioral Variability Fabiana Fournier et.al. 2505.20127 null
2025-05-26 MEBench: A Novel Benchmark for Understanding Mutual Exclusivity Bias in Vision-Language Models Anh Thai et.al. 2505.20122 null
2025-05-26 TrojanStego: Your Language Model Can Secretly Be A Steganographic Privacy Leaking Agent Dominik Meier et.al. 2505.20118 link
2025-05-26 Named Entity Recognition in Historical Italian: The Case of Giacomo Leopardi's Zibaldone Cristian Santini et.al. 2505.20113 null
2025-05-26 ResSVD: Residual Compensated SVD for Large Language Model Compression Haolei Bai et.al. 2505.20112 null
2025-05-26 Language-Agnostic Suicidal Risk Detection Using Large Language Models June-Woo Kim et.al. 2505.20109 null
2025-05-26 Adaptive Deep Reasoning: Triggering Deep Thinking When Needed Yunhao Wang et.al. 2505.20101 null
2025-05-26 AdaTP: Attention-Debiased Token Pruning for Video Large Language Models Fengyuan Sun et.al. 2505.20100 null
2025-05-26 Large Language Models Meet Knowledge Graphs for Question Answering: Synthesis and Opportunities Chuangtao Ma et.al. 2505.20099 link
2025-05-26 S2LPP: Small-to-Large Prompt Prediction across LLMs Liang Cheng et.al. 2505.20097 null
2025-05-26 Multi-Domain Explainability of Preferences Nitay Calderon et.al. 2505.20088 null
2025-05-23 Fann or Flop: A Multigenre, Multiera Benchmark for Arabic Poetry Understanding in LLMs Wafa Alghallabi et.al. 2505.18152 link
2025-05-23 First Finish Search: Efficient Test-Time Scaling in Large Language Models Aradhye Agarwal et.al. 2505.18149 null
2025-05-23 Lost in the Haystack: Smaller Needles are More Difficult for LLMs to Find Owen Bianchi et.al. 2505.18148 null
2025-05-23 Graph-Linguistic Fusion: Using Language Models for Wikidata Vandalism Detection Mykola Trokhymovych et.al. 2505.18136 null
2025-05-23 Gaming Tool Preferences in Agentic LLMs Kazem Faghih et.al. 2505.18135 link
2025-05-23 VideoGameBench: Can Vision-Language Models complete popular video games? Alex L. Zhang et.al. 2505.18134 null
2025-05-23 One RL to See Them All: Visual Triple Unified Reinforcement Learning Yan Ma et.al. 2505.18129 null
2025-05-23 Reward Model Overoptimisation in Iterated RLHF Lorenz Wolf et.al. 2505.18126 null
2025-05-23 TabSTAR: A Foundation Tabular Model With Semantically Target-Aware Representations Alan Arazi et.al. 2505.18125 null
2025-05-23 UNJOIN: Enhancing Multi-Table Text-to-SQL Generation via Schema Simplification Poojah Ganesan et.al. 2505.18122 null
2025-05-23 ProgRM: Build Better GUI Agents with Progress Rewards Danyang Zhang et.al. 2505.18121 null
2025-05-23 Bidirectional Knowledge Distillation for Enhancing Sequential Recommendation with Large Language Models Jiongran Wu et.al. 2505.18120 null
2025-05-23 Watch and Listen: Understanding Audio-Visual-Speech Moments with Multimodal LLM Zinuo Li et.al. 2505.18110 null
2025-05-23 ManuSearch: Democratizing Deep Search in Large Language Models with a Transparent and Open Multi-Agent Framework Lisheng Huang et.al. 2505.18105 null
2025-05-23 How Can I Publish My LLM Benchmark Without Giving the True Answers Away? Takashi Ishida et.al. 2505.18102 null
2025-05-23 Planning without Search: Refining Frontier LLMs with Offline Goal-Conditioned RL Joey Hong et.al. 2505.18098 null
2025-05-23 QwenLong-CPRS: Towards $\infty$-LLMs with Dynamic Context Optimization Weizhou Shen et.al. 2505.18092 null
2025-05-23 Data Mixing Can Induce Phase Transitions in Knowledge Acquisition Xinran Gu et.al. 2505.18091 null
2025-05-23 CXReasonBench: A Benchmark for Evaluating Structured Diagnostic Reasoning in Chest X-rays Hyungyung Lee et.al. 2505.18087 null
2025-05-23 Deep Video Discovery: Agentic Search with Tool Use for Long-form Video Understanding Xiaoyi Zhang et.al. 2505.18079 null
2025-05-22 CrossLMM: Decoupling Long Video Sequences from LMMs via Dual Cross-Attention Mechanisms Shilin Yan et.al. 2505.17020 link
2025-05-22 Let Androids Dream of Electric Sheep: A Human-like Image Implication Understanding and Reasoning Framework Chenhao Zhang et.al. 2505.17019 link
2025-05-22 SophiaVL-R1: Reinforcing MLLMs Reasoning with Thinking Reward Kaixuan Fan et.al. 2505.17018 link
2025-05-22 Delving into RL for Image Generation with CoT: A Study on DPO vs. GRPO Chengzhuo Tong et.al. 2505.17017 link
2025-05-22 Multi-SpatialMLLM: Multi-Frame Spatial Understanding with Multi-Modal Large Language Models Runsen Xu et.al. 2505.17015 null
2025-05-22 SpatialScore: Towards Unified Evaluation for Multimodal Spatial Understanding Haoning Wu et.al. 2505.17012 link
2025-05-22 R1-Searcher++: Incentivizing the Dynamic Knowledge Acquisition of LLMs via Reinforcement Learning Huatong Song et.al. 2505.17005 link
2025-05-22 Do Large Language Models Excel in Complex Logical Reasoning with Formal Language? Jin Jiang et.al. 2505.16998 link
2025-05-22 DecoupledESC: Enhancing Emotional Support Generation via Strategy-Response Decoupled Preference Optimization Chao Zhang et.al. 2505.16995 null
2025-05-22 Dimple: Discrete Diffusion Multimodal Large Language Model with Parallel Decoding Runpeng Yu et.al. 2505.16990 link
2025-05-22 T1: A Tool-Oriented Conversational Dataset for Multi-Turn Agentic Planning Amartya Chakraborty et.al. 2505.16986 null
2025-05-22 UFT: Unifying Supervised and Reinforcement Fine-Tuning Mingyang Liu et.al. 2505.16984 link
2025-05-22 LLM as Effective Streaming Processor: Bridging Streaming-Batch Mismatches with Group Position Encoding Junlong Tong et.al. 2505.16983 link
2025-05-22 Beyond Correlation: Towards Causal Large Language Model Agents in Biomedicine Adib Bazgir et.al. 2505.16982 null
2025-05-22 HyGenar: An LLM-Driven Hybrid Genetic Algorithm for Few-Shot Grammar Generation Weizhi Tang et.al. 2505.16978 link
2025-05-22 SWE-Dev: Evaluating and Training Autonomous Feature-Driven Software Development Yaxin Du et.al. 2505.16975 link
2025-05-22 CASS: Nvidia to AMD Transpilation with Data, Models, and Benchmark Ahmed Heakl et.al. 2505.16968 link
2025-05-22 Invisible Prompts, Visible Threats: Malicious Font Injection in External Resources for Large Language Models Junjie Xiong et.al. 2505.16957 null
2025-05-22 On Multilingual Encoder Language Model Compression for Low-Resource Languages Daniil Gurgurov et.al. 2505.16956 null
2025-05-22 A Comprehensive Evaluation of Contemporary ML-Based Solvers for Combinatorial Optimization Shengyu Feng et.al. 2505.16952 null
2025-05-21 InstructSAM: A Training-Free Framework for Instruction-Oriented Remote Sensing Object Recognition Yijie Zheng et.al. 2505.15818 link
2025-05-21 On the creation of narrow AI: hierarchy and nonlocality of neural network skills Eric J. Michaud et.al. 2505.15811 link
2025-05-21 MMaDA: Multimodal Large Diffusion Language Models Ling Yang et.al. 2505.15809 link
2025-05-21 The Atlas of In-Context Learning: How Attention Heads Shape In-Context Retrieval Augmentation Patrick Kahardipraja et.al. 2505.15807 link
2025-05-21 Keep Security! Benchmarking Security Policy Preservation in Large Language Model Contexts Against Indirect Attacks in Question Answering Hwan Chang et.al. 2505.15805 link
2025-05-21 STAR-R1: Spatial TrAnsformation Reasoning by Reinforcing Multimodal LLMs Zongzhao Li et.al. 2505.15804 null
2025-05-21 VerifyBench: Benchmarking Reference-based Reward Systems for Large Language Models Yuchen Yan et.al. 2505.15801 null
2025-05-21 Model Merging is Secretly Certifiable: Non-Vacuous Generalisation Bounds for Low-Shot Learning Taehoon Kim et.al. 2505.15798 null
2025-05-21 Reverse Engineering Human Preferences with Reinforcement Learning Lisa Alazraki et.al. 2505.15795 null
2025-05-21 HCRMP: A LLM-Hinted Contextual Reinforcement Learning Framework for Autonomous Driving Zhiwen Chen et.al. 2505.15793 null
2025-05-21 Large Language Models as Computable Approximations to Solomonoff Induction Jun Wan et.al. 2505.15784 null
2025-05-21 dKV-Cache: The Cache for Diffusion Language Models Xinyin Ma et.al. 2505.15781 link
2025-05-21 ConvSearch-R1: Enhancing Query Reformulation for Conversational Search with Reasoning via Reinforcement Learning Changtai Zhu et.al. 2505.15776 link
2025-05-21 Beyond Hard and Soft: Hybrid Context Compression for Balancing Local and Global Information Retention Huanxuan Liao et.al. 2505.15774 link
2025-05-21 MIKU-PAL: An Automated and Standardized Multi-Modal Method for Speech Paralinguistic and Affect Labeling Cheng Yifan et.al. 2505.15772 null
2025-05-21 An Empirical Analysis of Vulnerability Detection Tools for Solidity Smart Contracts Using Line Level Manually Annotated Vulnerabilities Francesco Salzano et.al. 2505.15756 null
2025-05-21 Exploring The Visual Feature Space for Multimodal Neural Decoding Weihao Xia et.al. 2505.15755 null
2025-05-21 Scalable Defense against In-the-wild Jailbreaking Attacks with Safety Context Retrieval Taiye Chen et.al. 2505.15753 null
2025-05-21 Multi-modal Integration Analysis of Alzheimer's Disease Using Large Language Models and Knowledge Graphs Kanan Kiguchi et.al. 2505.15747 null
2025-05-21 Evolutionary Computation and Large Language Models: A Survey of Methods, Synergies, and Applications Dikshit Chauhan et.al. 2505.15741 null
2025-05-20 Language Models use Lookbacks to Track Beliefs Nikhil Prakash et.al. 2505.14685 null
2025-05-21 Mind the Gap: Bridging Thought Leap for Improved Chain-of-Thought Tuning Haolei Xu et.al. 2505.14684 null
2025-05-20 Emerging Properties in Unified Multimodal Pretraining Chaorui Deng et.al. 2505.14683 null
2025-05-20 UniGen: Enhanced Training & Test-Time Strategies for Unified Multimodal Understanding and Generation Rui Tian et.al. 2505.14682 null
2025-05-20 UltraEdit: Training-, Subject-, and Memory-Free Lifelong Editing in Large Language Models Xiaojie Gu et.al. 2505.14679 link
2025-05-20 Visionary-R1: Mitigating Shortcuts in Visual Reasoning with Reinforcement Learning Jiaer Xia et.al. 2505.14677 null
2025-05-20 Reward Reasoning Model Jiaxin Guo et.al. 2505.14674 null
2025-05-20 UniCTokens: Boosting Personalized Understanding and Generation via Unified Concept Tokens Ruichuan An et.al. 2505.14671 null
2025-05-20 Quartet: Native FP4 Training Can Be Optimal for Large Language Models Roberto L. Castro et.al. 2505.14669 link
2025-05-20 ContextAgent: Context-Aware Proactive LLM Agents with Open-World Sensory Perceptions Bufang Yang et.al. 2505.14668 null
2025-05-20 Beyond Words: Multimodal LLM Knows When to Speak Zikai Liao et.al. 2505.14654 null
2025-05-21 General-Reasoner: Advancing LLM Reasoning Across All Domains Xueguang Ma et.al. 2505.14652 null
2025-05-20 Vox-Profile: A Speech Foundation Model Benchmark for Characterizing Diverse Speaker and Speech Traits Tiantian Feng et.al. 2505.14648 link
2025-05-20 CAD-Coder: An Open-Source Vision-Language Model for Computer-Aided Design Code Generation Anna C. Doris et.al. 2505.14646 link
2025-05-21 Think Only When You Need with Large Hybrid-Reasoning Models Lingjie Jiang et.al. 2505.14631 null
2025-05-20 KERL: Knowledge-Enhanced Personalized Recipe Recommendation using Large Language Models Fnu Mohbat et.al. 2505.14629 link
2025-05-20 Debating for Better Reasoning: An Unsupervised Multimodal Approach Ashutosh Adhikari et.al. 2505.14627 null
2025-05-20 TinyV: Reducing False Negatives in Verification Improves RL for LLM Reasoning Zhangchen Xu et.al. 2505.14625 link
2025-05-20 Enhancing Learned Knowledge in LoRA Adapters Through Efficient Contrastive Decoding on Ascend NPUs Morgan Lindsay Heisler et.al. 2505.14620 null
2025-05-20 Linear Control of Test Awareness Reveals Differential Compliance in Reasoning Models Sahar Abdelnabi et.al. 2505.14617 link
2025-05-19 CIE: Controlling Language Model Text Generations Using Continuous Signals Vinay Samuel et.al. 2505.13448 link
2025-05-19 Trust, But Verify: A Self-Verification Approach to Reinforcement Learning with Verifiable Rewards Xiaoyuan Liu et.al. 2505.13445 link
2025-05-19 ChartMuseum: Testing Visual Reasoning Capabilities of Large Vision-Language Models Liyan Tang et.al. 2505.13444 null
2025-05-19 GraspMolmo: Generalizable Task-Oriented Grasping via Large-Scale Synthetic Data Generation Abhay Deshpande et.al. 2505.13441 null
2025-05-19 Optimizing Anytime Reasoning via Budget Relative Policy Optimization Penghui Qi et.al. 2505.13438 link
2025-05-19 SMOTExT: SMOTE meets Large Language Models Mateusz Bystroński et.al. 2505.13434 null
2025-05-19 Fine-tuning Quantized Neural Networks with Zeroth-order Optimization Sifeng Shang et.al. 2505.13430 null
2025-05-19 MM-PRM: Enhancing Multimodal Mathematical Reasoning with Scalable Step-Level Supervision Lingxiao Du et.al. 2505.13427 link
2025-05-19 G1: Bootstrapping Perception and Reasoning Abilities of Vision-Language Model via Reinforcement Learning Liang Chen et.al. 2505.13426 link
2025-05-19 Learnware of Language Models: Specialized Small Language Models Can Do Big Zhi-Hao Tan et.al. 2505.13425 link
2025-05-19 Make Still Further Progress: Chain of Thoughts for Tabular Data Leaderboard Si-Yang Liu et.al. 2505.13421 null
2025-05-19 FEALLM: Advancing Facial Emotion Analysis in Multimodal Large Language Models with Emotional Synergy and Reasoning Zhuozhao Hu et.al. 2505.13419 link
2025-05-19 CoT-Kinetics: A Theoretical Modeling Assessing LRM Reasoning Process Jinhe Bi et.al. 2505.13408 null
2025-05-19 AutoMathKG: The automated mathematical knowledge graph based on LLM and vector database Rong Bian et.al. 2505.13406 null
2025-05-19 MR. Judge: Multimodal Reasoner as a Judge Renjie Pi et.al. 2505.13403 null
2025-05-19 R3: Robust Rubric-Agnostic Reward Models David Anugraha et.al. 2505.13388 link
2025-05-19 CompeteSMoE -- Statistically Guaranteed Mixture of Experts Training via Competition Nam V. Nguyen et.al. 2505.13380 link
2025-05-19 Thinkless: LLM Learns When to Think Gongfan Fang et.al. 2505.13379 link
2025-05-19 Seeing, Saying, Solving: An LLM-to-TL Framework for Cooperative Robots Dan BW Choe et.al. 2505.13376 null
2025-05-19 Multi-Armed Bandits Meet Large Language Models Djallel Bouneffouf et.al. 2505.13355 null
2025-05-16 Modeling cognitive processes of natural reading with transformer-based Language Models Bruno Bianchi et.al. 2505.11485 null
2025-05-16 msf-CNN: Patch-based Multi-Stage Fusion with Convolutional Neural Networks for TinyML Zhaolan Huang et.al. 2505.11483 link
2025-05-16 Improving Assembly Code Performance with Large Language Models via Reinforcement Learning Anjiang Wei et.al. 2505.11480 null
2025-05-16 HelpSteer3-Preference: Open Human-Annotated Preference Data across Diverse Tasks and Languages Zhilin Wang et.al. 2505.11475 null
2025-05-16 Disentangling Reasoning and Knowledge in Medical Large Language Models Rahul Thapa et.al. 2505.11462 null
2025-05-16 ProxyPrompt: Securing System Prompts against Prompt Extraction Attacks Zhixiong Zhuang et.al. 2505.11459 null
2025-05-16 LLMs unlock new paths to monetizing exploits Nicholas Carlini et.al. 2505.11449 null
2025-05-16 Is Compression Really Linear with Code Intelligence? Xianzhen Luo et.al. 2505.11441 null
2025-05-16 GODBench: A Benchmark for Multimodal Large Language Models in Video Comment Art Chenkai Zhang et.al. 2505.11436 link
2025-05-16 MegaScale-MoE: Large-Scale Communication-Efficient Training of Mixture-of-Experts Models in Production Chao Jin et.al. 2505.11432 null
2025-05-16 Mergenetic: a Simple Evolutionary Model Merging Library Adrian Robert Minut et.al. 2505.11427 link
2025-05-16 When Thinking Fails: The Pitfalls of Reasoning for Instruction-Following in LLMs Xiaomin Li et.al. 2505.11423 null
2025-05-16 Towards Cultural Bridge by Bahnaric-Vietnamese Translation Using Transfer Learning of Sequence-To-Sequence Pre-training Language Model Phan Tran Minh Dat et.al. 2505.11421 null
2025-05-16 EdgeWisePersona: A Dataset for On-Device User Profiling from Natural Language Interactions Patryk Bartkowiak et.al. 2505.11417 link
2025-05-16 MoE-CAP: Benchmarking Cost, Accuracy and Performance of Sparse Mixture-of-Experts Systems Yinsicheng Jiang et.al. 2505.11415 null
2025-05-16 CARES: Comprehensive Evaluation of Safety and Adversarial Robustness in Medical LLMs Sijia Chen et.al. 2505.11413 null
2025-05-16 Visual Planning: Let's Think Only with Images Yi Xu et.al. 2505.11409 link
2025-05-16 Large Language Model Use Impact Locus of Control Jenny Xiyu Fu et.al. 2505.11406 null
2025-05-16 EmotionHallucer: Evaluating Emotion Hallucinations in Multimodal Large Language Models Bohao Xing et.al. 2505.11405 link
2025-05-16 Patho-R1: A Multimodal Reinforcement Learning-Based Pathology Expert Reasoner Wenchuan Zhang et.al. 2505.11404 link
2025-05-15 End-to-End Vision Tokenizer Tuning Wenxuan Wang et.al. 2505.10562 null
2025-05-15 Neural Thermodynamic Laws for Large Language Model Training Ziming Liu et.al. 2505.10559 null
2025-05-15 Does Feasibility Matter? Understanding the Impact of Feasibility on Synthetic Training Data Yiwen Liu et.al. 2505.10551 link
2025-05-15 Real-Time Out-of-Distribution Failure Prevention via Multi-Modal Reasoning Milan Ganai et.al. 2505.10547 null
2025-05-15 Towards a Deeper Understanding of Reasoning Capabilities in Large Language Models Annie Wong et.al. 2505.10543 link
2025-05-15 Exploring Implicit Visual Misunderstandings in Multimodal Large Language Models through Attention Analysis Pengfei Wang et.al. 2505.10541 link
2025-05-15 S3C2 Summit 2024-09: Industry Secure Software Supply Chain Summit Imranur Rahman et.al. 2505.10538 null
2025-05-15 WorldPM: Scaling Human Preference Modeling Binghai Wang et.al. 2505.10527 link
2025-05-15 MASSV: Multimodal Adaptation and Self-Data Distillation for Speculative Decoding of Vision-Language Models Mugilan Ganesan et.al. 2505.10526 null
2025-05-15 Multi-Token Prediction Needs Registers Anastasios Gerontopoulos et.al. 2505.10518 link
2025-05-15 RouteNator: A Router-Based Multi-Modal Architecture for Generating Synthetic Training Data for Function Calling LLMs Vibha Belavadi et.al. 2505.10495 null
2025-05-15 Can You Really Trust Code Copilots? Evaluating Large Language Models from a Code Security Perspective Yutao Mou et.al. 2505.10494 link
2025-05-15 CL-RAG: Bridging the Gap in Retrieval-Augmented Generation with Curriculum Learning Shaohan Wang et.al. 2505.10493 null
2025-05-15 Campus AI vs Commercial AI: A Late-Breaking Study on How LLM As-A-Service Customizations Shape Trust and Usage Patterns Leon Hannig et.al. 2505.10490 null
2025-05-15 Parallel Scaling Law for Language Models Mouxiang Chen et.al. 2505.10475 link
2025-05-15 Large Language Models for Cancer Communication: Evaluating Linguistic Quality, Safety, and Accessibility in Generative AI Agnik Saha et.al. 2505.10472 null
2025-05-15 AI Agents vs. Agentic AI: A Conceptual Taxonomy, Applications and Challenges Ranjan Sapkota et.al. 2505.10468 null
2025-05-15 Superposition Yields Robust Neural Scaling Yizhou Liu et.al. 2505.10465 link
2025-05-15 Vision language models have difficulty recognizing virtual objects Tyler Tran et.al. 2505.10453 null
2025-05-15 Reinforcing the Diffusion Chain of Lateral Thought with Diffusion Language Models Zemin Huang et.al. 2505.10446 null
2025-05-14 Language Agents Mirror Human Causal Reasoning Biases. How Can We Help Them Think Like Scientists? Anthony GX-Chen et.al. 2505.09614 null
2025-05-14 Customizing a Large Language Model for VHDL Design of High-Performance Microprocessors Nicolas Dupuis et.al. 2505.09610 null
2025-05-14 Adversarial Suffix Filtering: a Defense Pipeline for LLMs David Khachaturov et.al. 2505.09602 null
2025-05-14 How Hungry is AI? Benchmarking Energy, Water, and Carbon Footprint of LLM Inference Nidhal Jegham et.al. 2505.09598 null
2025-05-14 WorldView-Bench: A Benchmark for Evaluating Global Cultural Perspectives in Large Language Models Abdullah Mushtaq et.al. 2505.09595 null
2025-05-14 Variational Visual Question Answering Tobias Jan Wieczorek et.al. 2505.09591 null
2025-05-15 Beyond Likes: How Normative Feedback Complements Engagement Signals on Social Media Yuchen Wu et.al. 2505.09583 null
2025-05-14 VTLA: Vision-Tactile-Language-Action Model with Preference Learning for Insertion Manipulation Chaofan Zhang et.al. 2505.09577 null
2025-05-14 Ethics and Persuasion in Reinforcement Learning from Human Feedback: A Procedural Rhetorical Approach Shannon Lodoen et.al. 2505.09576 null
2025-05-14 MIGRATION-BENCH: Repository-Level Code Migration Benchmark from Java 8 Linbo Liu et.al. 2505.09569 link
2025-05-14 Using Foundation Models as Pseudo-Label Generators for Pre-Clinical 4D Cardiac CT Segmentation Anne-Marie Rickmann et.al. 2505.09564 null
2025-05-14 WavReward: Spoken Dialogue Models With Generalist Reward Evaluators Shengpeng Ji et.al. 2505.09558 link
2025-05-14 PT-MoE: An Efficient Finetuning Framework for Integrating Mixture-of-Experts into Prompt Tuning Zongqian Li et.al. 2505.09519 link
2025-05-15 Towards Fair In-Context Learning with Tabular Foundation Models Patrik Kenfack et.al. 2505.09503 null
2025-05-14 Layered Unlearning for Adversarial Relearning Timothy Qian et.al. 2505.09500 link
2025-05-14 Flash-VL 2B: Optimizing Vision-Language Model Performance for Ultra-Low Latency and High Throughput Bo Zhang et.al. 2505.09498 null
2025-05-14 Card Sorting Simulator: Augmenting Design of Logical Information Architectures with Large Language Models Eduard Kuric et.al. 2505.09478 null
2025-05-14 Deploying Foundation Model-Enabled Air and Ground Robots in the Field: Challenges and Opportunities Zachary Ravichandran et.al. 2505.09477 null
2025-05-14 Evaluating GPT- and Reasoning-based Large Language Models on Physics Olympiad Problems: Surpassing Human Performance and Implications for Educational Assessment Paul Tschisgale et.al. 2505.09438 null
2025-05-14 CXMArena: Unified Dataset to benchmark performance in realistic CXM Scenarios Raghav Garg et.al. 2505.09436 link
2025-05-13 CodePDE: An Inference Framework for LLM-driven PDE Solver Generation Shanda Li et.al. 2505.08783 link
2025-05-13 HealthBench: Evaluating Large Language Models Towards Improved Human Health Rahul K. Arora et.al. 2505.08775 link
2025-05-14 Towards Autonomous UAV Visual Object Search in City Space: Benchmark and Agentic Methodology Yatai Ji et.al. 2505.08765 null
2025-05-13 Aya Vision: Advancing the Frontier of Multilingual Multimodality Saurabh Dash et.al. 2505.08751 null
2025-05-13 AC-Reason: Towards Theory-Guided Actual Causality Reasoning with Large Language Models Yanxi Zhang et.al. 2505.08750 link
2025-05-13 DeepMath-Creative: A Benchmark for Evaluating Mathematical Creativity of Large Language Models Xiaoyang Chen et.al. 2505.08744 link
2025-05-13 Probability Consistency in Large Language Models: Theoretical Foundations Meet Empirical Discrepancies Xiaoliang Luo et.al. 2505.08739 link
2025-05-13 Towards Foundation Models for Experimental Readout Systems Combining Discrete and Continuous Data James Giroux et.al. 2505.08736 link
2025-05-13 NurValues: Real-World Nursing Values Evaluation for Large Language Models in Clinical Context Ben Yao et.al. 2505.08734 null
2025-05-13 Securing RAG: A Risk Assessment and Mitigation Framework Lukas Ammann et.al. 2505.08728 null
2025-05-13 Memorization-Compression Cycles Improve Generalization Fangyuan Yu et.al. 2505.08727 null
2025-05-13 Extending Large Vision-Language Model for Diverse Interactive Tasks in Autonomous Driving Zongchuang Zhao et.al. 2505.08725 link
2025-05-13 TiMo: Spatiotemporal Foundation Model for Satellite Image Time Series Xiaolei Qin et.al. 2505.08723 link
2025-05-13 PWC-MoE: Privacy-Aware Wireless Collaborative Mixture of Experts Yang Su et.al. 2505.08719 null
2025-05-13 Controllable Image Colorization with Instance-aware Texts and Masks Yanru An et.al. 2505.08705 null
2025-05-13 LLM-based Prompt Ensemble for Reliable Medical Entity Recognition from EHRs K M Sajjadul Islam et.al. 2505.08704 null
2025-05-14 Granite-speech: open-source speech-aware LLMs with strong English ASR capabilities George Saon et.al. 2505.08699 null
2025-05-13 VizCV: AI-assisted visualization of researchers' publications tracks Vladimír Lazárik et.al. 2505.08691 null
2025-05-13 Adaptive Schema-aware Event Extraction with Retrieval-Augmented Generation Sheng Liang et.al. 2505.08690 null
2025-05-13 A Social Robot with Inner Speech for Dietary Guidance Valerio Belcamino et.al. 2505.08664 link
2025-05-12 DanceGRPO: Unleashing GRPO on Visual Generation Zeyue Xue et.al. 2505.07818 null
2025-05-12 Imagine, Verify, Execute: Memory-Guided Agentic Exploration with Vision-Language Models Seungjae Lee et.al. 2505.07815 null
2025-05-12 Learning Dynamics in Continual Pre-Training for Large Language Models Xingjin Wang et.al. 2505.07796 null
2025-05-12 Domain Regeneration: How well do LLMs match syntactic properties of text domains? Da Ju et.al. 2505.07784 null
2025-05-12 Relative Overfitting and Accept-Reject Framework Yanxin Liu et.al. 2505.07783 null
2025-05-12 MLE-Dojo: Interactive Environments for Empowering LLM Agents in Machine Learning Engineering Rushi Qiang et.al. 2505.07782 link
2025-05-12 Must Read: A Systematic Survey of Computational Persuasion Nimet Beyza Bozdag et.al. 2505.07775 link
2025-05-12 Agent RL Scaling Law: Agent RL with Spontaneous Code Execution for Mathematical Problem Solving Xinji Mai et.al. 2505.07773 link
2025-05-12 Enhancing Code Generation via Bidirectional Comment-Level Mutual Grounding Yifeng Di et.al. 2505.07768 link
2025-05-12 BodyGPS: Anatomical Positioning System Halid Ziya Yerebakan et.al. 2505.07744 null
2025-05-12 Assessing the Chemical Intelligence of Large Language Models Nicholas T. Runcie et.al. 2505.07735 link
2025-05-12 Spoken Language Understanding on Unseen Tasks With In-Context Learning Neeraj Agrawal et.al. 2505.07731 null
2025-05-12 Reproducibility, Replicability, and Insights into Visual Document Retrieval with Late Interaction Jingfen Qiao et.al. 2505.07730 link
2025-05-12 Circuit Partitioning Using Large Language Models for Quantum Compilation and Simulations Pranav Sinha et.al. 2505.07711 null
2025-05-12 Through the Looking Glass: Common Sense Consistency Evaluation of Weird Images Elisei Rykov et.al. 2505.07704 null
2025-05-12 PatchTrack: A Comprehensive Analysis of ChatGPT's Influence on Pull Request Outcomes Daniel Ogenrwot et.al. 2505.07700 null
2025-05-12 Beyond CLIP Generalization: Against Forward&Backward Forgetting Adapter for Continual Learning of Vision-Language Models Songlin Dong et.al. 2505.07690 null
2025-05-12 S-GRPO: Early Exit via Reinforcement Learning in Reasoning Models Muzhi Dai et.al. 2505.07686 null
2025-05-12 Multimodal Survival Modeling in the Age of Foundation Models Steven Song et.al. 2505.07683 link
2025-05-12 SpecRouter: Adaptive Routing for Multi-Level Speculative Decoding in Large Language Models Hang Wu et.al. 2505.07680 null
2025-05-09 Towards a Unified Representation Evaluation Framework Beyond Downstream Tasks Christos Plachouras et.al. 2505.06224 link
2025-05-09 Adapting a Segmentation Foundation Model for Medical Image Classification Pengfei Gu et.al. 2505.06217 null
2025-05-09 From Millions of Tweets to Actionable Insights: Leveraging LLMs for User Profiling Vahid Rahimzadeh et.al. 2505.06184 null
2025-05-09 A Large Language Model-Enhanced Q-learning for Capacitated Vehicle Routing Problem with Time Windows Linjiang Cao et.al. 2505.06178 null
2025-05-09 MonetGPT: Solving Puzzles Enhances MLLMs' Image Retouching Skills Niladri Shekhar Dutt et.al. 2505.06176 null
2025-05-09 Turbo-ICL: In-Context Learning-Based Turbo Equalization Zihang Song et.al. 2505.06175 null
2025-05-09 MM-Skin: Enhancing Dermatology Vision-Language Model with an Image-Text Dataset Derived from Textbooks Wenqi Zeng et.al. 2505.06152 link
2025-05-09 A Scaling Law for Token Efficiency in LLM Fine-Tuning Under Fixed Compute Budgets Ryan Lagasse et.al. 2505.06150 null
2025-05-09 Can Prompting LLMs Unlock Hate Speech Detection across Languages? A Zero-shot and Few-shot Study Faeze Ghorbanpour et.al. 2505.06149 null
2025-05-09 LLMs Get Lost In Multi-Turn Conversation Philippe Laban et.al. 2505.06120 link
2025-05-09 LLMs Outperform Experts on Challenging Biology Benchmarks Lennart Justen et.al. 2505.06108 null
2025-05-09 Free and Fair Hardware: A Pathway to Copyright Infringement-Free Verilog Generation using LLMs Sam Bush et.al. 2505.06096 null
2025-05-09 Assessing Tenstorrent's RISC-V MatMul Acceleration Capabilities Hiari Pizzini Cavagna et.al. 2505.06085 null
2025-05-09 Healthy LLMs? Benchmarking LLM Knowledge of UK Government Public Health Information Joshua Harris et.al. 2505.06046 null
2025-05-09 Short-circuiting Shortcuts: Mechanistic Investigation of Shortcuts in Text Classification Leon Eshuijs et.al. 2505.06032 link
2025-05-09 Unilogit: Robust Machine Unlearning for LLMs Using Uniform-Target Self-Distillation Stefan Vasilev et.al. 2505.06027 null
2025-05-09 ArtRAG: Retrieval-Augmented Generation with Structured Context for Visual Art Understanding Shuai Wang et.al. 2505.06020 null
2025-05-09 Exploring the Feasibility of Multilingual Grammatical Error Correction with a Single LLM up to 9B parameters: A Comparative Study of 17 Models Dawid Wisniewski et.al. 2505.06004 link
2025-05-09 Task-Adapter++: Task-specific Adaptation with Order-aware Alignment for Few-shot Action Recognition Congqi Cao et.al. 2505.06002 link
2025-05-09 Towards Developmentally Plausible Rewards: Communicative Success as a Learning Signal for Interactive Language Models Lennart Stöpler et.al. 2505.05970 null
2025-05-08 Mogao: An Omni Foundation Model for Interleaved Multi-Modal Generation Chao Liao et.al. 2505.05472 null
2025-05-08 Generating Physically Stable and Buildable LEGO Designs from Text Ava Pun et.al. 2505.05469 link
2025-05-08 StreamBridge: Turning Your Offline Video Large Language Model into a Proactive Streaming Assistant Haibo Wang et.al. 2505.05467 null
2025-05-08 ComPO: Preference Alignment via Comparison Oracles Peter Chen et.al. 2505.05465 null
2025-05-08 Bring Reason to Vision: Understanding Perception and Reasoning through Model Merging Shiqi Chen et.al. 2505.05464 link
2025-05-08 UKElectionNarratives: A Dataset of Misleading Narratives Surrounding Recent UK General Elections Fatima Haouari et.al. 2505.05459 null
2025-05-08 SITE: towards Spatial Intelligence Thorough Evaluation Wenqi Wang et.al. 2505.05456 null
2025-05-08 Conversational Process Model Redesign Nataliia Klievtsova et.al. 2505.05453 null
2025-05-08 clem:todd: A Framework for the Systematic Benchmarking of LLM-Based Task-Oriented Dialogue System Realisations Chalamalasetti Kranti et.al. 2505.05445 null
2025-05-08 GesPrompt: Leveraging Co-Speech Gestures to Augment LLM-Based Interaction in Virtual Reality Xiyun Hu et.al. 2505.05441 null
2025-05-09 EcoAgent: An Efficient Edge-Cloud Collaborative Multi-Agent Framework for Mobile Automation Biao Yi et.al. 2505.05440 null
2025-05-08 Ultra-FineWeb: Efficient Data Filtering and Verification for High-Quality LLM Training Data Yudong Wang et.al. 2505.05427 null
2025-05-09 LiTransProQA: an LLM-based Literary Translation evaluation metric with Professional Question Answering Ran Zhang et.al. 2505.05423 link
2025-05-08 Crosslingual Reasoning through Test-Time Scaling Zheng-Xin Yong et.al. 2505.05408 link
2025-05-08 Frame In, Frame Out: Do LLMs Generate More Biased News Headlines than Humans? Valeria Pastorino et.al. 2505.05406 null
2025-05-08 A Pain Assessment Framework based on multimodal data and Deep Machine Learning methods Stefanos Gkikas et.al. 2505.05396 null
2025-05-08 DSDrive: Distilling Large Language Model for Lightweight End-to-End Autonomous Driving with Unified Reasoning and Planning Wenru Liu et.al. 2505.05360 null
2025-05-08 Hearing and Seeing Through CLIP: A Framework for Self-Supervised Sound Source Localization Sooyoung Park et.al. 2505.05343 link
2025-05-08 FLAM: Frame-Wise Language-Audio Modeling Yusong Wu et.al. 2505.05335 null
2025-05-08 ICon: In-Context Contribution for Automatic Data Selection Yixin Yang et.al. 2505.05327 null
2025-05-07 EchoInk-R1: Exploring Audio-Visual Reasoning in Multimodal LLMs via Reinforcement Learning Zhenghao Xing et.al. 2505.04623 link
2025-05-07 On Path to Multimodal Generalist: General-Level and General-Bench Hao Fei et.al. 2505.04620 null
2025-05-07 OmniGIRL: A Multilingual and Multimodal Benchmark for GitHub Issue Resolution Lianghong Guo et.al. 2505.04606 link
2025-05-07 OpenVision: A Fully-Open, Cost-Effective Family of Advanced Vision Encoders for Multimodal Learning Xianhang Li et.al. 2505.04601 null
2025-05-08 MonoCoP: Chain-of-Prediction for Monocular 3D Object Detection Zhihao Zhang et.al. 2505.04594 null
2025-05-07 ZeroSearch: Incentivize the Search Capability of LLMs without Searching Hao Sun et.al. 2505.04588 link
2025-05-07 SlideItRight: Using AI to Find Relevant Slides and Provide Feedback for Open-Ended Questions Chloe Qianhui Zhao et.al. 2505.04584 link
2025-05-07 Fight Fire with Fire: Defending Against Malicious RL Fine-Tuning via Reward Neutralization Wenjun Cao et.al. 2505.04578 null
2025-05-07 Communication-Efficient Federated Fine-Tuning of Language Models via Dynamic Update Schedules Michail Theologitis et.al. 2505.04535 link
2025-05-07 Overcoming Data Scarcity in Generative Language Modelling for Low-Resource Languages: A Systematic Review Josh McGiff et.al. 2505.04531 null
2025-05-07 Comparative Analysis of Carbon Footprint in Manual vs. LLM-Assisted Code Development Kuen Sum Cheung et.al. 2505.04521 null
2025-05-07 Pangu Ultra MoE: How to Train Your Big MoE on Ascend NPUs Yehui Tang et.al. 2505.04519 null
2025-05-07 "I Can See Forever!": Evaluating Real-time VideoLLMs for Assisting Individuals with Visual Impairments Ziyi Zhang et.al. 2505.04488 null
2025-05-07 CAD-Llama: Leveraging Large Language Models for Computer-Aided Design Parametric 3D Model Generation Jiahao Li et.al. 2505.04481 null
2025-05-07 TrajEvo: Designing Trajectory Prediction Heuristics via LLM-driven Evolution Zhikai Zhao et.al. 2505.04480 link
2025-05-07 Miipher-2: A Universal Speech Restoration Model for Million-Hour Scale Data Restoration Shigeki Karita et.al. 2505.04457 link
2025-05-07 M2Rec: Multi-scale Mamba for Efficient Sequential Recommendation Qianru Zhang et.al. 2505.04445 null
2025-05-07 Towards Effectively Leveraging Execution Traces for Program Repair with Code LLMs Mirazul Haque et.al. 2505.04441 null
2025-05-07 OBLIVIATE: Robust and Practical Machine Unlearning for Large Language Models Xiaoyu Xu et.al. 2505.04416 null
2025-05-07 DeCLIP: Decoupled Learning for Open-Vocabulary Dense Perception Junjie Wang et.al. 2505.04410 link
2025-05-06 VITA-Audio: Fast Interleaved Cross-Modal Token Generation for Efficient Large Speech-Language Model Zuwei Long et.al. 2505.03739 link
2025-05-06 Decentralized Nonconvex Optimization under Heavy-Tailed Noise: Normalization and Optimal Convergence Shuhua Yu et.al. 2505.03736 null
2025-05-06 Meta-Optimization and Program Search using Language Models for Task and Motion Planning Denis Shcherba et.al. 2505.03725 null
2025-05-06 Fill the Gap: Quantifying and Reducing the Modality Gap in Image-Text Representation Learning François Role et.al. 2505.03703 null
2025-05-06 Fairness of Automatic Speech Recognition in Cleft Lip and Palate Speech Susmita Bhattacharjee et.al. 2505.03697 null
2025-05-06 Graph Drawing for LLMs: An Empirical Evaluation Walter Didimo et.al. 2505.03678 null
2025-05-06 Distribution-Conditional Generation: From Class Distribution to Creative Generation Fu Feng et.al. 2505.03667 null
2025-05-06 Binding threshold units with artificial oscillatory neurons Vladimir Fanaskov et.al. 2505.03648 link
2025-05-06 PhysLLM: Harnessing Large Language Models for Cross-Modal Remote Physiological Sensing Yiping Xie et.al. 2505.03621 null
2025-05-06 Learning Unknown Spoof Prompts for Generalized Face Anti-Spoofing Using Only Real Face Images Fangling Jiang et.al. 2505.03611 null
2025-05-06 Learning Knowledge-based Prompts for Robust 3D Mask Presentation Attack Detection Fangling Jiang et.al. 2505.03610 null
2025-05-06 DyGEnc: Encoding a Sequence of Textual Scene Graphs to Reason and Answer Questions in Dynamic Scenes Sergey Linok et.al. 2505.03581 link
2025-05-06 LlamaFirewall: An open source guardrail system for building secure AI agents Sahana Chennabasappa et.al. 2505.03574 null
2025-05-06 Say It Another Way: A Framework for User-Grounded Paraphrasing Cléa Chataigner et.al. 2505.03563 null
2025-05-06 A Comprehensive Survey of Large AI Models for Future Communications: Foundations, Applications and Challenges Feibo Jiang et.al. 2505.03556 link
2025-05-06 A Hashgraph-Inspired Consensus Mechanism for Reliable Multi-Model Reasoning Kolawole E. Ogunsina et.al. 2505.03553 null
2025-05-06 STORY2GAME: Generating (Almost) Everything in an Interactive Fiction Game Eric Zhou et.al. 2505.03547 null
2025-05-06 Faster MoE LLM Inference for Extremely Large Models Haoqi Yang et.al. 2505.03531 null
2025-05-06 Ruled by the Representation Space: On the University's Embrace of Large Language Models Katia Schwerzmann et.al. 2505.03513 null
2025-05-06 BadLingual: A Novel Lingual-Backdoor Attack against Large Language Models Zihan Wang et.al. 2505.03501 null
2025-05-05 Scenethesis: A Language and Vision Agentic Framework for 3D Scene Generation Lu Ling et.al. 2505.02836 null
2025-05-05 R1-Reward: Training Multimodal Reward Model Through Stable Reinforcement Learning Yi-Fan Zhang et.al. 2505.02835 link
2025-05-05 No Other Representation Component Is Needed: Diffusion Transformers Can Provide Representation Guidance by Themselves Dengyang Jiang et.al. 2505.02831 link
2025-05-05 LISAT: Language-Instructed Segmentation Assistant for Satellite Imagery Jerome Quenum et.al. 2505.02829 null
2025-05-05 ReplaceMe: Network Simplification via Layer Pruning and Linear Transformations Dmitriy Shopkhoev et.al. 2505.02819 link
2025-05-05 Knowing You Don't Know: Learning When to Continue Search in Multi-round RAG through Self-Practicing Diji Yang et.al. 2505.02811 link
2025-05-05 Towards Quantifying the Hessian Structure of Neural Networks Zhaorui Dong et.al. 2505.02809 link
2025-05-05 Generating HomeAssistant Automations Using an LLM-based Chatbot Mathyas Giudici et.al. 2505.02802 null
2025-05-05 HSplitLoRA: A Heterogeneous Split Parameter-Efficient Fine-Tuning Framework for Large Language Models Zheng Lin et.al. 2505.02795 null
2025-05-05 Giving Simulated Cells a Voice: Evolving Prompt-to-Intervention Models for Cellular Control Nam H. Le et.al. 2505.02766 null
2025-05-05 Bye-bye, Bluebook? Automating Legal Procedure with Large Language Models Matthew Dahl et.al. 2505.02763 null
2025-05-05 Using Knowledge Graphs to harvest datasets for efficient CLIP model training Simon Ging et.al. 2505.02746 link
2025-05-06 Knowledge Graphs for Enhancing Large Language Models in Entity Disambiguation Gerard Pons et.al. 2505.02737 null
2025-05-05 FormalMATH: Benchmarking Formal Mathematical Reasoning of Large Language Models Zhouliang Yu et.al. 2505.02735 link
2025-05-05 Enhancing LLMs' Clinical Reasoning with Real-World Data from a Nationwide Sepsis Registry Junu Kim et.al. 2505.02722 link
2025-05-05 Less is More: Efficient Weight Farcasting with 1-Layer Neural Network Xiao Shou et.al. 2505.02714 null
2025-05-05 Technical Report: Evaluating Goal Drift in Language Model Agents Rauno Arike et.al. 2505.02709 null
2025-05-05 Voila: Voice-Language Foundation Models for Real-Time Autonomous Interaction and Voice Role-Play Yemin Shi et.al. 2505.02707 link
2025-05-05 AI Standardized Patient Improves Human Conversations in Advanced Cancer Care Kurtis Haut et.al. 2505.02694 link
2025-05-05 Predicting Movie Hits Before They Happen with LLMs Shaghayegh Agah et.al. 2505.02693 null
2025-05-02 How Effective are Large Time Series Models in Hydrology? A Study on Water Level Forecasting in Everglades Rahuul Rangaraj et.al. 2505.01415 null
2025-05-02 Dynamic Robot Tool Use with Vision Language Models Noah Trupin et.al. 2505.01399 null
2025-05-02 FreeInsert: Disentangled Text-Guided Object Insertion in 3D Gaussian Scene without Spatial Priors Chenxi Li et.al. 2505.01322 null
2025-05-02 Helping Big Language Models Protect Themselves: An Enhanced Filtering and Summarization System Sheikh Samit Muhaimin et.al. 2505.01315 null
2025-05-02 Enhancing SPARQL Query Rewriting for Complex Ontology Alignments Anicet Lepetit Ondo et.al. 2505.01309 null
2025-05-02 Document Retrieval Augmented Fine-Tuning (DRAFT) for safety-critical software assessments Regan Bolton et.al. 2505.01307 null
2025-05-02 FlowDubber: Movie Dubbing with LLM-based Semantic-aware Learning and Flow Matching based Voice Enhancing Gaoxiang Cong et.al. 2505.01263 null
2025-05-02 Digital Pathway Curation (DPC): a comparative pipeline to assess the reproducibility, consensus and accuracy across Gemini, PubMed, and scientific reviewers in biomedical research Flavio Lichtenstein et.al. 2505.01259 null
2025-05-02 Can Foundation Models Really Segment Tumors? A Benchmarking Odyssey in Lung CT Imaging Elena Mulero Ayllón et.al. 2505.01239 null
2025-05-02 CaReAQA: A Cardiac and Respiratory Audio Question Answering Model for Open-Ended Diagnostic Reasoning Tsai-Ning Wang et.al. 2505.01199 null
2025-05-02 Gender Bias in Explainability: Investigating Performance Disparity in Post-hoc Methods Mahdi Dhaini et.al. 2505.01198 link
2025-05-05 TSTMotion: Training-free Scene-aware Text-to-motion Generation Ziyan Guo et.al. 2505.01182 null
2025-05-02 LLM Security: Vulnerabilities, Attacks, Defenses, and Countermeasures Francisco Aguilera-Martínez et.al. 2505.01177 null
2025-05-02 On the Limitations of Steering in Language Model Alignment Chebrolu Niranjan et.al. 2505.01162 null
2025-05-02 Methodological Foundations for AI-Driven Survey Question Generation Ted K. Mburu et.al. 2505.01150 null
2025-05-02 Retrieval-Augmented Generation in Biomedicine: A Survey of Technologies, Datasets, and Clinical Applications Jiawei He et.al. 2505.01146 null
2025-05-02 MateICL: Mitigating Attention Dispersion in Large-Scale In-Context Learning Murtadha Ahmed et.al. 2505.01110 null
2025-05-02 Self-Supervision Enhances Instance-based Multiple Instance Learning Methods in Digital Pathology: A Benchmark Study Ali Mammadov et.al. 2505.01109 link
2025-05-02 Nesterov Method for Asynchronous Pipeline Parallel Optimization Thalaiyasingam Ajanthan et.al. 2505.01099 link
2025-05-02 Evaluating Vision Language Model Adaptations for Radiology Report Generation in Low-Resource Languages Marco Salmè et.al. 2505.01096 null
2025-05-01 T2I-R1: Reinforcing Image Generation with Collaborative Semantic-level and Token-level CoT Dongzhi Jiang et.al. 2505.00703 link
2025-05-01 Robotic Visual Instruction Yanbang Li et.al. 2505.00693 null
2025-05-01 Visual Test-time Scaling for GUI Agent Grounding Tiange Luo et.al. 2505.00684 link
2025-05-01 Steering Large Language Models with Register Analysis for Arbitrary Style Transfer Xinchen Yang et.al. 2505.00679 null
2025-05-01 Rethinking Memory in AI: Taxonomy, Operations, Topics, and Future Directions Yiming Du et.al. 2505.00675 link
2025-05-01 DeepCritic: Deliberate Critique with Large Language Models Wenkai Yang et.al. 2505.00662 link
2025-05-01 On the generalization of language models from in-context learning and finetuning: a controlled study Andrew K. Lampinen et.al. 2505.00661 null
2025-05-01 Large Language Models Understanding: an Inherent Ambiguity Barrier Daniel N. Nissani et.al. 2505.00654 null
2025-05-01 Open-Source LLM-Driven Federated Transformer for Predictive IoV Management Yazan Otoum et.al. 2505.00651 null
2025-05-01 Investigating Task Arithmetic for Zero-Shot Information Retrieval Marco Braga et.al. 2505.00649 link
2025-05-01 Brain Foundation Models with Hypergraph Dynamic Adapter for Brain Disease Analysis Zhongying Deng et.al. 2505.00627 null
2025-05-01 The Illusion of Role Separation: Hidden Shortcuts in LLM Role Learning (and How to Fix Them) Zihao Wang et.al. 2505.00626 null
2025-05-01 FineScope: Precision Pruning for Domain-Specialized Large Language Models Using SAE-Guided Self-Data Cultivation Chaitali Bhattacharyya et.al. 2505.00624 null
2025-05-01 Pixel3DMM: Versatile Screen-Space Priors for Single-Image 3D Face Reconstruction Simon Giebenhain et.al. 2505.00615 null
2025-05-01 Combining LLMs with Logic-Based Framework to Explain MCTS Ziyan An et.al. 2505.00610 null
2025-05-01 Can LLMs Help Improve Analogical Reasoning For Strategic Decisions? Experimental Evidence from Humans and GPT-4 Phanish Puranam et.al. 2505.00603 null
2025-05-02 Fast and Low-Cost Genomic Foundation Models via Outlier Removal Haozheng Luo et.al. 2505.00598 link
2025-05-01 Block Circulant Adapter for Large Language Models Xinyu Ding et.al. 2505.00582 null
2025-05-01 Parameter-Efficient Fine-Tuning with Circulant and Diagonal Vectors Xinyu Ding et.al. 2505.00580 null
2025-05-01 FreqKV: Frequency Domain Key-Value Compression for Efficient Context Window Extension Jushi Kai et.al. 2505.00570 null
2025-04-30 TRUST: An LLM-Based Dialogue System for Trauma Understanding and Structured Assessments Sichang Tu et.al. 2504.21851 null
2025-04-30 COMPACT: COMPositional Atomic-to-Complex Visual Capability Tuning Xindi Wu et.al. 2504.21850 null
2025-04-30 Early Exit and Multi Stage Knowledge Distillation in VLMs for Video Summarization Anas Anwarul Haq Khan et.al. 2504.21831 null
2025-04-30 Why Compress What You Can Generate? When GPT-4o Generation Ushers in Image Compression Fields Yixin Gao et.al. 2504.21814 null
2025-04-30 A simple and effective approach for body part recognition on CT scans based on projection estimation Franko Hrzic et.al. 2504.21810 null
2025-04-30 An Empirical Study on the Effectiveness of Large Language Models for Binary Code Understanding Xiuwei Shang et.al. 2504.21803 null
2025-04-30 DeepSeek-Prover-V2: Advancing Formal Mathematical Reasoning via Reinforcement Learning for Subgoal Decomposition Z. Z. Ren et.al. 2504.21801 link
2025-04-30 SWE-smith: Scaling Data for Software Engineering Agents John Yang et.al. 2504.21798 null
2025-04-30 MAC-Tuning: LLM Multi-Compositional Problem Reasoning with Enhanced Knowledge Boundary Awareness Junsheng Huang et.al. 2504.21773 null
2025-04-30 LASHED: LLMs And Static Hardware Analysis for Early Detection of RTL Bugs Baleegh Ahmad et.al. 2504.21770 null
2025-04-30 LLM-based Interactive Imitation Learning for Robotic Manipulation Jonas Werner et.al. 2504.21769 link
2025-04-30 Investigating Literary Motifs in Ancient and Medieval Novels with Large Language Models Emelie Hallenberg et.al. 2504.21742 null
2025-04-30 TheraQuest: A Gamified, LLM-Powered Simulation for Massage Therapy Training Shengqian Wang et.al. 2504.21735 null
2025-04-30 XBreaking: Explainable Artificial Intelligence for Jailbreaking LLMs Marco Arazzi et.al. 2504.21700 null
2025-04-30 Visual Text Processing: A Comprehensive Review and Unified Evaluation Yan Shu et.al. 2504.21682 link
2025-04-30 Hoist with His Own Petard: Inducing Guardrails to Facilitate Denial-of-Service Attacks on Retrieval-Augmented Generation of LLMs Pan Suo et.al. 2504.21680 null
2025-04-30 Traceback of Poisoning Attacks to Retrieval-Augmented Generation Baolei Zhang et.al. 2504.21668 null
2025-04-30 From Precision to Perception: User-Centred Evaluation of Keyword Extraction Algorithms for Internet-Scale Contextual Advertising Jingwen Cai et.al. 2504.21667 null
2025-04-30 AdaR1: From Long-CoT to Hybrid-CoT via Bi-Level Adaptive Reasoning Optimization Haotian Luo et.al. 2504.21659 link
2025-04-30 Sadeed: Advancing Arabic Diacritization Through Small Language Model Zeina Aldallal et.al. 2504.21635 null
2025-04-29 Toward Efficient Exploration by Large Language Model Agents Dilip Arumugam et.al. 2504.20997 null
2025-04-29 X-Fusion: Introducing New Modality to Frozen Large Language Models Sicheng Mo et.al. 2504.20996 null
2025-04-29 ACE: A Security Architecture for LLM-Integrated App Systems Evan Li et.al. 2504.20984 null
2025-04-29 Real-Time Wayfinding Assistant for Blind and Low-Vision Users Dabbrata Das et.al. 2504.20976 null
2025-04-29 SetKE: Knowledge Editing for Knowledge Elements Overlap Yifan Wei et.al. 2504.20972 null
2025-04-29 OSVBench: Benchmarking LLMs on Specification Generation Tasks for Operating System Verification Shangyu Li et.al. 2504.20964 link
2025-04-29 Information Gravity: A Field-Theoretic Model for Token Selection in Large Language Models Maryna Vyshnyvetska et.al. 2504.20951 null
2025-04-29 Trace-of-Thought: Enhanced Arithmetic Problem Solving via Reasoning Distillation From Large to Small Language Models Tyler McDonald et.al. 2504.20946 null
2025-04-29 ChestX-Reasoner: Advancing Radiology Foundation Models with Reasoning through Step-by-Step Verification Ziqing Fan et.al. 2504.20930 link
2025-04-29 An Empirical Study on the Capability of LLMs in Decomposing Bug Reports Zhiyuan Chen et.al. 2504.20911 null
2025-04-29 Classifier-to-Bias: Toward Unsupervised Automatic Bias Detection for Visual Classifiers Quentin Guimard et.al. 2504.20902 null
2025-04-29 LELANTE: LEveraging LLM for Automated ANdroid TEsting Shamit Fatin et.al. 2504.20896 null
2025-04-29 FedMVP: Federated Multi-modal Visual Prompt Tuning for Vision-Language Models Mainak Singha et.al. 2504.20860 null
2025-04-29 X-Cross: Dynamic Integration of Language Models for Cross-Domain Sequential Recommendation Guy Hadad et.al. 2504.20859 null
2025-04-29 JaccDiv: A Metric and Benchmark for Quantifying Diversity of Generated Marketing Text in the Music Industry Anum Afzal et.al. 2504.20849 null
2025-04-29 Language Model for Large-Text Transmission in Noisy Quantum Communications Yuqi Li et.al. 2504.20842 null
2025-04-29 Universal language model with the intervention of quantum theory D.-F. Qin et.al. 2504.20839 null
2025-04-29 Enhancing Non-Core Language Instruction-Following in Speech LLMs via Semi-Implicit Cross-Lingual CoT Reasoning Hongfei Xue et.al. 2504.20835 null
2025-04-29 Reinforcement Learning for LLM Reasoning Under Memory Constraints Alan Lee et.al. 2504.20834 null
2025-04-30 Ascendra: Dynamic Request Prioritization for Efficient LLM Serving Azam Ikram et.al. 2504.20828 null
2025-04-28 Learning Streaming Video Representation via Multitask Training Yibin Yan et.al. 2504.20041 null
2025-04-28 AutoJudge: Judge Decoding Without Manual Annotation Roman Garipov et.al. 2504.20039 null
2025-04-28 SpatialReasoner: Towards Explicit and Generalizable 3D Spatial Reasoning Wufei Ma et.al. 2504.20024 null
2025-04-28 Better To Ask in English? Evaluating Factual Accuracy of Multilingual LLMs in English and Low-Resource Languages Pritika Rohera et.al. 2504.20022 null
2025-04-28 Modular Machine Learning: An Indispensable Path towards New-Generation Large Language Models Xin Wang et.al. 2504.20020 null
2025-04-29 LLM-Generated Fake News Induces Truth Decay in News Ecosystem: A Case Study on Neural News Recommendation Beizhe Hu et.al. 2504.20013 null
2025-04-28 Towards Automated Scoping of AI for Social Good Projects Jacob Emmerson et.al. 2504.20010 null
2025-04-28 Knowledge Distillation of Domain-adapted LLMs for Question-Answering in Telecom Rishika Sen et.al. 2504.20000 null
2025-04-28 HJRNO: Hamilton-Jacobi Reachability with Neural Operators Yankai Li et.al. 2504.19989 null
2025-04-28 TD-EVAL: Revisiting Task-Oriented Dialogue Evaluation by Combining Turn-Level Precision with Dialogue-Level Comparisons Emre Can Acikgoz et.al. 2504.19982 null
2025-04-28 Accurate and Diverse LLM Mathematical Reasoning via Automated PRM-Guided GFlowNets Adam Younsi et.al. 2504.19981 null
2025-04-29 From Concept to Practice: an Automated LLM-aided UVM Machine for RTL Verification Junhao Ye et.al. 2504.19959 null
2025-04-28 Enhancing Surgical Documentation through Multimodal Visual-Temporal Transformers and Generative AI Hugo Georgenthum et.al. 2504.19918 null
2025-04-28 Can AI Agents Design and Implement Drug Discovery Pipelines? Khachik Smbatyan et.al. 2504.19912 null
2025-04-28 GenCLS++: Pushing the Boundaries of Generative Classification in LLMs Through Comprehensive SFT and RL Studies Across Diverse Datasets Mingqian He et.al. 2504.19898 null
2025-04-28 CineVerse: Consistent Keyframe Synthesis for Cinematic Scene Composition Quynh Phung et.al. 2504.19894 null
2025-04-28 semi-PD: Towards Efficient LLM Serving via Phase-Wise Disaggregated Computation and Unified Storage Ke Hong et.al. 2504.19867 null
2025-04-28 CoherenDream: Boosting Holistic Text Coherence in 3D Generation via Multimodal Large Language Models Feedback Chenhan Jiang et.al. 2504.19860 null
2025-04-28 Efficient Domain-adaptive Continual Pretraining for the Process Industry in the German Language Anastasia Zhukova et.al. 2504.19856 null
2025-04-29 The Automation Advantage in AI Red Teaming Rob Mulla et.al. 2504.19855 null
2025-04-25 Generalization Capability for Imitation Learning Yixiao Wang et.al. 2504.18538 null
2025-04-25 TRACE Back from the Future: A Probabilistic Reasoning Approach to Controllable Language Generation Gwen Yidou Weng et.al. 2504.18535 null
2025-04-25 Eval3D: Interpretable and Fine-grained Evaluation for 3D Generation Shivam Duggal et.al. 2504.18509 null
2025-04-25 Investigating Co-Constructive Behavior of Large Language Models in Explanation Dialogues Leandra Fichtel et.al. 2504.18483 null
2025-04-25 Generative Induction of Dialogue Task Schemas with Streaming Refinement and Simulated Interactions James D. Finch et.al. 2504.18474 null
2025-04-25 Fast-Slow Thinking for Large Vision-Language Model Reasoning Wenyi Xiao et.al. 2504.18458 null
2025-04-25 Pseudo-Asynchronous Local SGD: Robust and Efficient Data-Parallel Training Hiroki Naganuma et.al. 2504.18454 null
2025-04-25 Reason Like a Radiologist: Chain-of-Thought and Reinforcement Learning for Verifiable Report Generation Peiyuan Jing et.al. 2504.18453 null
2025-04-25 Kimi-Audio Technical Report KimiTeam et.al. 2504.18425 link
2025-04-25 LLMpatronous: Harnessing the Power of LLMs For Vulnerability Detection Rajesh Yarra et.al. 2504.18423 null
2025-04-25 BitNet v2: Native 4-bit Activations with Hadamard Transformation for 1-bit LLMs Hongyu Wang et.al. 2504.18415 null
2025-04-25 An Empirical Study of Evaluating Long-form Question Answering Ning Xian et.al. 2504.18413 link
2025-04-25 Expressing stigma and inappropriate responses prevents LLMs from safely replacing mental health providers Jared Moore et.al. 2504.18412 link
2025-04-25 HRScene: How Far Are VLMs from Effective High-Resolution Image Understanding? Yusen Zhang et.al. 2504.18406 null
2025-04-25 Unsupervised Visual Chain-of-Thought Reasoning via Preference Optimization Kesen Zhao et.al. 2504.18397 null
2025-04-25 Bridge the Domains: Large Language Models Enhanced Cross-domain Sequential Recommendation Qidong Liu et.al. 2504.18383 null
2025-04-25 Pushing the boundary on Natural Language Inference Pablo Miralles-González et.al. 2504.18376 null
2025-04-25 Auto-SLURP: A Benchmark Dataset for Evaluating Multi-Agent Frameworks in Smart Personal Assistant Lei Shen et.al. 2504.18373 link
2025-04-25 ThreMoLIA: Threat Modeling of Large Language Model-Integrated Applications Felix Viktor Jedrzejewski et.al. 2504.18369 null
2025-04-25 Testing Individual Fairness in Graph Neural Networks Roya Nasiri et.al. 2504.18353 null
2025-04-24 Token-Shuffle: Towards High-Resolution Image Generation with Autoregressive Models Xu Ma et.al. 2504.17789 null
2025-04-24 Replay to Remember: Retaining Domain Knowledge in Streaming Language Models Sneh Pillai et.al. 2504.17780 null
2025-04-24 Conversational Assistants to support Heart Failure Patients: comparing a Neurosymbolic Architecture with ChatGPT Anuja Tayal et.al. 2504.17753 null
2025-04-24 Towards Robust LLMs: an Adversarial Robustness Measurement Framework Natan Levy et.al. 2504.17723 null
2025-04-24 Multilingual Performance Biases of Large Language Models in Education Vansh Gupta et.al. 2504.17720 null
2025-04-24 PICO: Reconstructing 3D People In Contact with Objects Alpár Cseke et.al. 2504.17695 null
2025-04-24 Ensemble Bayesian Inference: Leveraging Small Language Models to Achieve LLM-level Accuracy in Profile Matching Tasks Haru-Tada Sato et.al. 2504.17685 null
2025-04-24 INSIGHT: Bridging the Student-Teacher Gap in Times of Large Language Models Jarne Thys et.al. 2504.17677 null
2025-04-24 Energy Considerations of Large Language Model Inference and Efficiency Optimizations Jared Fernandez et.al. 2504.17674 null
2025-04-24 Cross-region Model Training with Communication-Computation Overlapping and Delay Compensation Ying Zhu et.al. 2504.17672 null
2025-04-25 Data-Driven Calibration of Prediction Sets in Large Vision-Language Models Based on Inductive Conformal Prediction Yuanchang Ye et.al. 2504.17671 null
2025-04-24 Towards a HIPAA Compliant Agentic AI System in Healthcare Subash Neupane et.al. 2504.17669 null
2025-04-24 Evaluating Grounded Reasoning by Code-Assisted Large Language Models for Mathematics Zena Al-Khalili et.al. 2504.17665 null
2025-04-24 Effortless, Simulation-Efficient Bayesian Inference using Tabular Foundation Models Julius Vetter et.al. 2504.17660 null
2025-04-24 Portability of Optimizations from SC to TSO Akshay Gopalakrishnan et.al. 2504.17646 null
2025-04-24 L3: DIMM-PIM Integrated Architecture and Coordination for Scalable Long-Context LLM Inference Qingyuan Liu et.al. 2504.17584 null
2025-04-25 DeepDistill: Enhancing LLM Reasoning Capabilities via Large-Scale Difficulty-Graded Data Training Xiaoyu Tian et.al. 2504.17565 null
2025-04-24 When Does Metadata Conditioning (NOT) Work for Language Model Pre-Training? A Study with Context-Free Grammars Rei Higuchi et.al. 2504.17562 null
2025-04-24 HalluLens: LLM Hallucination Benchmark Yejin Bang et.al. 2504.17550 null
2025-04-24 A Comprehensive Survey of Knowledge-Based Vision Question Answering Systems: The Lifecycle of Knowledge in Visual Reasoning Task Jiaqi Deng et.al. 2504.17547 null
2025-04-23 Generalized Neighborhood Attention: Multi-dimensional Sparse Attention at the Speed of Light Ali Hassani et.al. 2504.16922 null
2025-04-23 IberBench: LLM Evaluation on Iberian Languages José Ángel González et.al. 2504.16921 null
2025-04-23 Tracing Thought: Using Chain-of-Thought Reasoning to Identify the LLM Behind AI-Generated Text Shifali Agrahari et.al. 2504.16913 null
2025-04-23 Do Large Language Models know who did what to whom? Joseph M. Denning et.al. 2504.16884 null
2025-04-23 Enhancing Critical Thinking with AI: A Tailored Warning System for RAG Models Xuyang Zhu et.al. 2504.16883 null
2025-04-23 Context-Enhanced Vulnerability Detection Based on Large Language Model Yixin Yang et.al. 2504.16877 null
2025-04-24 Exploring How LLMs Capture and Represent Domain-Specific Knowledge Mirian Hipolito Garcia et.al. 2504.16871 null
2025-04-23 Common Functional Decompositions Can Mis-attribute Differences in Outcomes Between Populations Manuel Quintero et.al. 2504.16864 null
2025-04-23 Planning with Diffusion Models for Target-Oriented Dialogue Systems Hanwen Du et.al. 2504.16858 null
2025-04-23 Emo Pillars: Knowledge Distillation to Support Fine-Grained Context-Aware and Context-Less Emotion Classification Alexander Shvets et.al. 2504.16856 null
2025-04-23 Monte Carlo Planning with Large Language Model for Text-Based Game Agents Zijing Shi et.al. 2504.16855 null
2025-04-23 Improving Significant Wave Height Prediction Using Chronos Models Yilin Zhai et.al. 2504.16834 null
2025-04-23 LRASGen: LLM-based RESTful API Specification Generation Sida Deng et.al. 2504.16833 null
2025-04-23 GreenMind: A Next-Generation Vietnamese Large Language Model for Structured and Logical Reasoning Luu Quy Tung et.al. 2504.16832 null
2025-04-23 Decoupled Global-Local Alignment for Improving Compositional Understanding Xiaoxing Hu et.al. 2504.16801 null
2025-04-23 MOOSComp: Improving Lightweight Long-Context Compressor via Mitigating Over-Smoothing and Incorporating Outlier Scores Fengwei Zhou et.al. 2504.16786 null
2025-04-23 Graph2Nav: 3D Object-Relation Graph Generation to Robot Navigation Tixiao Shan et.al. 2504.16782 null
2025-04-23 How Effective are Generative Large Language Models in Performing Requirements Classification? Waad Alhoshan et.al. 2504.16768 null
2025-04-23 Lightweight Latent Verifiers for Efficient Meta-Generation Strategies Bartosz Piotrowski et.al. 2504.16760 null
2025-04-23 HEMA: A Hippocampus-Inspired Extended Memory Architecture for Long-Context AI Conversations Kwangseob Ahn et.al. 2504.16754 null
2025-04-22 TTRL: Test-Time Reinforcement Learning Yuxin Zuo et.al. 2504.16084 link
2025-04-22 MMInference: Accelerating Pre-filling for Long-Context VLMs via Modality-Aware Permutation Sparse Attention Yucheng Li et.al. 2504.16083 null
2025-04-22 MR. Video: "MapReduce" is the Principle for Long Video Understanding Ziqi Pang et.al. 2504.16082 null
2025-04-22 From Reflection to Perfection: Scaling Inference-Time Optimization for Text-to-Image Diffusion Models via Reflection Tuning Le Zhuo et.al. 2504.16080 null
2025-04-22 LLMs are Greedy Agents: Effects of RL Fine-tuning on Decision-Making Abilities Thomas Schmied et.al. 2504.16078 null
2025-04-22 PHYBench: Holistic Evaluation of Physical Perception and Reasoning in Large Language Models Shi Qiu et.al. 2504.16074 null
2025-04-22 Guiding VLM Agents with Process Rewards at Inference Time for GUI Navigation Zhiyuan Hu et.al. 2504.16073 null
2025-04-22 Describe Anything: Detailed Localized Image and Video Captioning Long Lian et.al. 2504.16072 null
2025-04-22 A Python Tool for Reconstructing Full News Text from GDELT A. Fronzetti Colladon et.al. 2504.16063 link
2025-04-22 Vision language models are unreliable at trivial spatial cognition Sangeet Khemlani et.al. 2504.16061 null
2025-04-22 Vision-Language Models Are Not Pragmatically Competent in Referring Expression Generation Ziqiao Ma et.al. 2504.16060 link
2025-04-22 Automated Static Vulnerability Detection via a Holistic Neuro-symbolic Approach Penghui Li et.al. 2504.16057 null
2025-04-22 Honey, I Shrunk the Language Model: Impact of Knowledge Distillation Methods on Performance and Explainability Daniel Hendriks et.al. 2504.16056 null
2025-04-22 LongMamba: Enhancing Mamba's Long Context Capabilities via Training-Free Receptive Field Enlargement Zhifan Ye et.al. 2504.16053 link
2025-04-22 Evaluating Vision Language Models (VLMs) for Radiology: A Comprehensive Analysis Frank Li et.al. 2504.16047 null
2025-04-23 Certified Mitigation of Worst-Case LLM Copyright Infringement Jingyu Zhang et.al. 2504.16046 null
2025-04-22 LLMs meet Federated Learning for Scalable and Secure IoT Management Yazan Otoum et.al. 2504.16032 null
2025-04-22 LiveCC: Learning Video LLM with Streaming Speech Transcription at Scale Joya Chen et.al. 2504.16030 null
2025-04-22 Benchmarking LLM for Code Smells Detection: OpenAI GPT-4.0 vs DeepSeek-V3 Ahmed R. Sadik et.al. 2504.16027 null
2025-04-22 Efficient Temporal Consistency in Diffusion-Based Video Editing with Adaptor Modules: A Theoretical Framework Xinyuan Song et.al. 2504.16016 null
2025-04-21 Seeing from Another Perspective: Evaluating Multi-View Understanding in MLLMs Chun-Hsiao Yeh et.al. 2504.15280 link
2025-04-21 VisuLogic: A Benchmark for Evaluating Visual Reasoning in Multi-modal Large Language Models Weiye Xu et.al. 2504.15279 null
2025-04-21 Stop Summation: Min-Form Credit Assignment Is All Process Reward Model Needs for Reasoning Jie Cheng et.al. 2504.15275 link
2025-04-21 Eagle 2.5: Boosting Long-Context Post-Training for Frontier Vision-Language Models Guo Chen et.al. 2504.15271 null
2025-04-21 Roll the dice & look before you leap: Going beyond the creative limits of next-token prediction Vaishnavh Nagarajan et.al. 2504.15266 link
2025-04-21 Interpretable Locomotion Prediction in Construction Using a Memory-Driven LLM Agent With Chain-of-Thought Reasoning Ehsan Ahmadi et.al. 2504.15263 null
2025-04-21 Leveraging Language Models for Automated Patient Record Linkage Mohammad Beheshti et.al. 2504.15261 null
2025-04-21 CRUST-Bench: A Comprehensive Benchmark for C-to-safe-Rust Transpilation Anirudh Khatry et.al. 2504.15254 link
2025-04-21 Evaluating Judges as Evaluators: The JETTS Benchmark of LLM-as-Judges as Test-Time Scaling Evaluators Yilun Zhou et.al. 2504.15253 link
2025-04-21 MR. Guard: Multilingual Reasoning Guardrail using Curriculum Learning Yahan Yang et.al. 2504.15241 null
2025-04-21 Values in the Wild: Discovering and Analyzing Values in Real-World Language Model Interactions Saffron Huang et.al. 2504.15236 null
2025-04-21 A Self-Improving Coding Agent Maxime Robeyns et.al. 2504.15228 null
2025-04-21 EvalAgent: Discovering Implicit Evaluation Criteria from the Web Manya Wadhwa et.al. 2504.15219 null
2025-04-21 Integrating Symbolic Execution into the Fine-Tuning of Code-Generating LLMs Marina Sakharova et.al. 2504.15210 null
2025-04-21 Compute-Optimal LLMs Provably Generalize Better With Scale Marc Finzi et.al. 2504.15208 null
2025-04-21 Support Evaluation for the TREC 2024 RAG Track: Comparing Human versus LLM Judges Nandan Thakur et.al. 2504.15205 null
2025-04-22 Synergistic Weak-Strong Collaboration by Aligning Preferences Yizhu Jiao et.al. 2504.15188 null
2025-04-21 DSPO: Direct Semantic Preference Optimization for Real-World Image Super-Resolution Miaomiao Cai et.al. 2504.15176 null
2025-04-21 The Synthetic Imputation Approach: Generating Optimal Synthetic Texts For Underrepresented Categories In Supervised Classification Tasks Joan C. Timoneda et.al. 2504.15160 null
2025-04-21 KGMEL: Knowledge Graph-Enhanced Multimodal Entity Linking Juyeon Kim et.al. 2504.15135 link
2025-04-18 Generative AI Act II: Test Time Scaling Drives Cognition Engineering Shijie Xia et.al. 2504.13828 link
2025-04-18 Feature Alignment and Representation Transfer in Knowledge Distillation for Large Language Models Junjie Yang et.al. 2504.13825 null
2025-04-18 CheXWorld: Exploring Image World Modeling for Radiograph Representation Learning Yang Yue et.al. 2504.13820 link
2025-04-18 Not All Rollouts are Useful: Down-Sampling Rollouts in LLM Reinforcement Learning Yixuan Even Xu et.al. 2504.13818 null
2025-04-18 BadApex: Backdoor Attack Based on Adaptive Optimization Mechanism of Black-box Large Language Models Zhengxian Wu et.al. 2504.13775 null
2025-04-18 DP2Unlearning: An Efficient and Guaranteed Unlearning Framework for LLMs Tamim Al Mahmud et.al. 2504.13774 link
2025-04-18 Detecting Malicious Source Code in PyPI Packages with LLMs: Does RAG Come in Handy? Motunrayo Ibiyo et.al. 2504.13769 null
2025-04-18 Decoding Vision Transformers: the Diffusion Steering Lens Ryota Takatsuki et.al. 2504.13763 link
2025-04-18 Scaling sparse feature circuit finding for in-context learning Dmitrii Kharlapenko et.al. 2504.13756 null
2025-04-18 Learning to Attribute with Attention Benjamin Cohen-Wang et.al. 2504.13752 link
2025-04-18 Controlled Territory and Conflict Tracking (CONTACT): (Geo-)Mapping Occupied Territory from Open Source Intelligence Paul K. Mandal et.al. 2504.13730 link
2025-04-18 OpenDeception: Benchmarking and Investigating AI Deceptive Behaviors via Open-ended Interaction Simulation Yichen Wu et.al. 2504.13707 null
2025-04-18 Exploring Multimodal Prompt for Visualization Authoring with Large Language Models Zhen Wen et.al. 2504.13700 null
2025-04-18 Analysing the Robustness of Vision-Language-Models to Common Corruptions Muhammad Usama et.al. 2504.13690 null
2025-04-18 Intelligent Interaction Strategies for Context-Aware Cognitive Augmentation Xiangrong et.al. 2504.13684 null
2025-04-18 Revisiting Uncertainty Quantification Evaluation in Language Models: Spurious Interactions with Response Length Bias Results Andrea Santilli et.al. 2504.13677 null
2025-04-18 Large Language Models Will Change The Way Children Think About Technology And Impact Every Interaction Paradigm Russell Beale et.al. 2504.13667 null
2025-04-18 Do Prompt Patterns Affect Code Quality? A First Empirical Assessment of ChatGPT-Generated Code Antonio Della Porta et.al. 2504.13656 null
2025-04-18 EyecareGPT: Boosting Comprehensive Ophthalmology Understanding with Tailored Dataset, Benchmark and Model Sijing Li et.al. 2504.13650 link
2025-04-18 Exploring the Potential for Large Language Models to Demonstrate Rational Probabilistic Beliefs Gabriel Freedman et.al. 2504.13644 link
2025-04-17 Perception Encoder: The best visual embeddings are not at the output of the network Daniel Bolya et.al. 2504.13181 null
2025-04-17 PerceptionLM: Open-Access Data and Models for Detailed Visual Understanding Jang Hyun Cho et.al. 2504.13180 link
2025-04-17 It's All Connected: A Journey Through Test-Time Memorization, Attentional Bias, Retention, and Online Optimization Ali Behrouz et.al. 2504.13173 null
2025-04-17 Sleep-time Compute: Beyond Inference Scaling at Test-time Kevin Lin et.al. 2504.13171 link
2025-04-17 Generate, but Verify: Reducing Hallucination in Vision-Language Models with Retrospective Resampling Tsung-Han Wu et.al. 2504.13169 link
2025-04-17 CLIMB: CLustering-based Iterative Data Mixture Bootstrapping for Language Model Pre-training Shizhe Diao et.al. 2504.13161 null
2025-04-17 Digital Twin Generation from Visual Data: A Survey Andrew Melnik et.al. 2504.13159 link
2025-04-17 MIB: A Mechanistic Interpretability Benchmark Aaron Mueller et.al. 2504.13151 link
2025-04-17 Exploring Expert Failures Improves LLM Agent Tuning Li-Cheng Lan et.al. 2504.13145 null
2025-04-17 Syntactic and Semantic Control of Large Language Models via Sequential Monte Carlo João Loula et.al. 2504.13139 null
2025-04-17 Energy-Based Reward Models for Robust Language Model Alignment Anamika Lochab et.al. 2504.13134 link
2025-04-17 LLMs Meet Finance: Fine-Tuning Foundation Models for the Open FinLLM Leaderboard Varun Rao et.al. 2504.13125 null
2025-04-17 Low-hallucination Synthetic Captions for Large-Scale Vision-Language Model Pre-training Xinsong Zhang et.al. 2504.13123 null
2025-04-17 VistaDPO: Video Hierarchical Spatial-Temporal Direct Preference Optimization for Large Video Models Haojian Huang et.al. 2504.13122 link
2025-04-17 Probing and Inducing Combinational Creativity in Vision-Language Models Yongqian Peng et.al. 2504.13120 null
2025-04-17 Object-Driven Narrative in AR: A Scenario-Metaphor Framework with VLM Integration Yusi Sun et.al. 2504.13119 null
2025-04-17 Uncertainty-Aware Trajectory Prediction via Rule-Regularized Heteroscedastic Deep Classification Kumar Manas et.al. 2504.13111 null
2025-04-17 EventVAD: Training-Free Event-Aware Video Anomaly Detection Yihua Shao et.al. 2504.13092 null
2025-04-17 Retrieval-Augmented Generation with Conflicting Evidence Han Wang et.al. 2504.13079 link
2025-04-18 SkyReels-V2: Infinite-length Film Generative Model Guibin Chen et.al. 2504.13074 link
2025-04-16 BitNet b1.58 2B4T Technical Report Shuming Ma et.al. 2504.12285 null
2025-04-16 HLS-Eval: A Benchmark and Framework for Evaluating LLMs on High-Level Synthesis Design Tasks Stefan Abi-Karam et.al. 2504.12268 link
2025-04-16 FLIP Reasoning Challenge Andreas Plesner et.al. 2504.12256 link
2025-04-16 AnomalyGen: An Automated Semantic Log Sequence Generation Framework with LLM for Anomaly Detection Xinyu Li et.al. 2504.12250 null
2025-04-16 MOS: Towards Effective Smart Contract Vulnerability Detection through Mixture-of-Experts Tuning of Large Language Models Hang Yuan et.al. 2504.12234 null
2025-04-16 Watermarking Needs Input Repetition Masking David Khachaturov et.al. 2504.12229 null
2025-04-16 d1: Scaling Reasoning in Diffusion Large Language Models via Reinforcement Learning Siyan Zhao et.al. 2504.12216 null
2025-04-16 What Do Large Language Models Know? Tacit Knowledge as a Potential Causal-Explanatory Structure Céline Budding et.al. 2504.12187 null
2025-04-16 SALAD: Improving Robustness and Generalization through Contrastive Learning with Structure-Aware and LLM-Driven Augmented Data Suyoung Bae et.al. 2504.12185 null
2025-04-16 Trusting CHATGPT: how minor tweaks in the prompts lead to major differences in sentiment classification Jaime E. Cuellar et.al. 2504.12180 null
2025-04-16 Multilingual Contextualization of Large Language Models for Document-Level Machine Translation Miguel Moura Ramos et.al. 2504.12140 null
2025-04-16 Efficient Contrastive Decoding with Probabilistic Hallucination Detection - Mitigating Hallucinations in Large Vision Language Models - Laura Fieback et.al. 2504.12137 null
2025-04-16 Clarifying Ambiguities: on the Role of Ambiguity Types in Prompting Methods for Clarification Generation Anfu Tang et.al. 2504.12113 null
2025-04-16 Entropy-Guided Watermarking for LLMs: A Test-Time Framework for Robust and Traceable Text Generation Shizhan Cai et.al. 2504.12108 null
2025-04-16 Logits DeConfusion with CLIP for Few-Shot Learning Shuo Li et.al. 2504.12104 link
2025-04-16 Gauging Overprecision in LLMs: An Empirical Study Adil Bahaj et.al. 2504.12098 null
2025-04-16 Reasoning-Based AI for Startup Evaluation (R.A.I.S.E.): A Memory-Augmented, Multi-Step Decision Framework Jack Preuveneers et.al. 2504.12090 null
2025-04-16 Self-alignment of Large Video Language Models with Refined Regularized Preference Optimization Pritam Sarkar et.al. 2504.12083 null
2025-04-16 Selective Demonstration Retrieval for Improved Implicit Hate Speech Detection Yumin Kim et.al. 2504.12082 null
2025-04-16 Subitizing-Inspired Large Language Models for Floorplanning Shao-Chien Lu et.al. 2504.12076 null
2025-04-16 Elucidating the Design Space of Multimodal Protein Language Models Cheng-Yen Hsieh et.al. 2504.11454 null
2025-04-15 TextArena Leon Guertler et.al. 2504.11442 link
2025-04-15 Masculine Defaults via Gendered Discourse in Podcasts and Large Language Models Maria Teleki et.al. 2504.11431 link
2025-04-15 A Dual-Space Framework for General Knowledge Distillation of Large Language Models Xue Zhang et.al. 2504.11426 null
2025-04-15 Reinforcing Compositional Retrieval: Retrieving Step-by-Step for Composing Informative Contexts Quanyu Long et.al. 2504.11420 null
2025-04-15 Efficient Hybrid Language Model Compression through Group-Aware SSM Pruning Ali Taghibakhshi et.al. 2504.11409 null
2025-04-15 DataDecide: How to Predict Best Pretraining Data with Small Experiments Ian Magnusson et.al. 2504.11393 null
2025-04-15 RankAlign: A Ranking View of the Generator-Validator Gap in Large Language Models Juan Diego Rodriguez et.al. 2504.11381 link
2025-04-15 Cancer-Myth: Evaluating AI Chatbot on Patient Questions with False Presuppositions Wang Bill Zhu et.al. 2504.11373 link
2025-04-15 OpenTuringBench: An Open-Model-based Benchmark and Framework for Machine-Generated Text Detection and Attribution Lucio La Cava et.al. 2504.11369 null
2025-04-15 From Gaze to Insight: Bridging Human Visual Attention and Vision Language Model Explanation for Weakly-Supervised Medical Image Segmentation Jingkun Chen et.al. 2504.11368 null
2025-04-15 Teaching Large Language Models to Reason through Learning and Forgetting Tianwei Ni et.al. 2504.11364 link
2025-04-15 Kimina-Prover Preview: Towards Large Formal Reasoning Models with Reinforcement Learning Haiming Wang et.al. 2504.11354 link
2025-04-16 Seedream 3.0 Technical Report Yu Gao et.al. 2504.11346 null
2025-04-15 A Minimalist Approach to LLM Reasoning: from Rejection Sampling to Reinforce Wei Xiong et.al. 2504.11343 link
2025-04-15 REWARD CONSISTENCY: Improving Multi-Objective Alignment from a Data-Centric Perspective Zhihao Xu et.al. 2504.11337 null
2025-04-15 Looking beyond the next token Abitha Thankaraj et.al. 2504.11336 null
2025-04-15 Optimizing LLM Inference: Fluid-Guided Online Scheduling with Memory Constraints Ruicheng Ao et.al. 2504.11320 link
2025-04-15 Learning to Be A Doctor: Searching for Effective Medical Agent Architectures Yangyang Zhuang et.al. 2504.11301 null
2025-04-16 Automated Python Translation Joshua Otten et.al. 2504.11290 null
2025-04-14 InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models Jinguo Zhu et.al. 2504.10479 link
2025-04-14 Weight Ensembling Improves Reasoning in Language Models Xingyu Dang et.al. 2504.10478 null
2025-04-14 MIEB: Massive Image Embedding Benchmark Chenghao Xiao et.al. 2504.10471 link
2025-04-14 Pixel-SAIL: Single Transformer For Pixel-Grounded Understanding Tao Zhang et.al. 2504.10465 link
2025-04-14 The Scalability of Simplicity: Empirical Analysis of Vision-Language Learning with a Single Transformer Weixian Lei et.al. 2504.10462 link
2025-04-15 GUI-R1: A Generalist R1-Style Vision-Language Action Model For GUI Agents Xiaobo Xia et.al. 2504.10458 null
2025-04-14 M1: Towards Scalable Test-Time Compute with Mamba Reasoning Models Junxiong Wang et.al. 2504.10449 link
2025-04-14 Multimodal Long Video Modeling Based on Temporal Dynamic Context Haoran Hao et.al. 2504.10443 link
2025-04-14 LLM Can be a Dangerous Persuader: Empirical Study of Persuasion Safety in Large Language Models Minqian Liu et.al. 2504.10430 null
2025-04-14 Foundation models for electronic health records: representation dynamics and transferability Michael C. Burkhart et.al. 2504.10422 link
2025-04-14 Can We Edit LLMs for Long-Tail Biomedical Knowledge? Xinhao Yi et.al. 2504.10421 link
2025-04-15 Unchecked and Overlooked: Addressing the Checkbox Blind Spot in Large Language Models with CheckboxQA Michał Turski et.al. 2504.10419 link
2025-04-14 CliniChat: A Multi-Source Knowledge-Driven Framework for Clinical Interview Dialogue Reconstruction and Evaluation Jing Chen et.al. 2504.10418 null
2025-04-14 LLM-SRBench: A New Benchmark for Scientific Equation Discovery with Large Language Models Parshin Shojaee et.al. 2504.10415 link
2025-04-14 Performance of Large Language Models in Supporting Medical Diagnosis and Treatment Diogo Sousa et.al. 2504.10405 null
2025-04-14 Satellite Federated Fine-Tuning for Foundation Models in Space Computing Power Networks Yan zhu et.al. 2504.10403 null
2025-04-14 Can LLMs Assist Expert Elicitation for Probabilistic Causal Modeling? Olha Shaposhnyk et.al. 2504.10397 null
2025-04-14 SymRTLO: Enhancing RTL Code Optimization with LLMs and Neuron-Inspired Symbolic Reasoning Yiting Wang et.al. 2504.10369 null
2025-04-14 DICE: A Framework for Dimensional and Contextual Evaluation of Language Models Aryan Shrivastava et.al. 2504.10359 null
2025-04-14 Pseudo-Autoregressive Neural Codec Language Models for Efficient Zero-Shot Text-to-Speech Synthesis Yifan Yang et.al. 2504.10352 null
2025-04-11 Quantum Large Language Model Fine-Tuning Sang Hyub Kim et.al. 2504.08732 null
2025-04-11 DocAgent: A Multi-Agent System for Automated Code Documentation Generation Dayu Yang et.al. 2504.08725 link
2025-04-11 SWAN-GPT: An Efficient and Scalable Approach for Long-Context Language Modeling Krishna C. Puvvada et.al. 2504.08719 null
2025-04-11 SWE-PolyBench: A multi-language benchmark for repository level evaluation of coding agents Muhammad Shihab Rashid et.al. 2504.08703 link
2025-04-11 Large Language Models as Span Annotators Zdeněk Kasner et.al. 2504.08697 null
2025-04-11 TP-RAG: Benchmarking Retrieval-Augmented Large Language Model Agents for Spatiotemporal-Aware Travel Planning Hang Ni et.al. 2504.08694 null
2025-04-11 Fast-Slow-Thinking: Complex Task Solving with Large Language Models Yiliu Sun et.al. 2504.08690 null
2025-04-11 Voice Interaction With Conversational AI Could Facilitate Thoughtful Reflection and Substantive Revision in Writing Jiho Kim et.al. 2504.08687 null
2025-04-11 Seaweed-7B: Cost-Effective Training of Video Generation Foundation Model Team Seawead et.al. 2504.08685 null
2025-04-11 Variability-Driven User-Story Generation using LLM and Triadic Concept Analysis Alexandre Bazin et.al. 2504.08666 null
2025-04-11 Quality evaluation of Tabby coding assistant using real source code snippets Marta Borek et.al. 2504.08650 link
2025-04-11 Do LLMs trust AI regulation? Emerging behaviour of game-theoretic LLM agents Alessio Buscemi et.al. 2504.08640 null
2025-04-11 Latent Diffusion Autoencoders: Toward Efficient and Meaningful Unsupervised Representation Learning in Medical Imaging Gabriele Lozupone et.al. 2504.08635 link
2025-04-11 MooseAgent: A LLM Based Multi-agent Framework for Automating Moose Simulation Tao Zhang et.al. 2504.08621 link
2025-04-11 Analyzing 16,193 LLM Papers for Fun and Profits Zhiqiu Xia et.al. 2504.08619 null
2025-04-11 Playpen: An Environment for Exploring Learning Through Conversational Interaction Nicola Horst et.al. 2504.08590 link
2025-04-11 AstroLLaVA: towards the unification of astronomical data and natural language Sharaf Zaman et.al. 2504.08583 null
2025-04-11 UoB-NLP at SemEval-2025 Task 11: Leveraging Adapters for Multilingual and Cross-Lingual Emotion Detection Frances Laureano De Leon et.al. 2504.08543 null
2025-04-11 Embodied Image Captioning: Self-supervised Learning Agents for Spatially Coherent Image Descriptions Tommaso Galliena et.al. 2504.08531 null
2025-04-11 On The Landscape of Spoken Language Models: A Comprehensive Survey Siddhant Arora et.al. 2504.08528 null
2025-04-10 Cat, Rat, Meow: On the Alignment of Language Model and Human Term-Similarity Judgments Lorenz Linhardt et.al. 2504.07965 null
2025-04-10 C3PO: Critical-Layer, Core-Expert, Collaborative Pathway Optimization for Test-Time Expert Re-Mixing Zhongyang Li et.al. 2504.07964 link
2025-04-10 GLUS: Global-Local Reasoning Unified into A Single Large Language Model for Video Segmentation Lang Lin et.al. 2504.07962 null
2025-04-10 Detect Anything 3D in the Wild Hanxue Zhang et.al. 2504.07958 link
2025-04-10 MM-IFEngine: Towards Multimodal Instruction Following Shengyuan Ding et.al. 2504.07957 link
2025-04-10 VCR-Bench: A Comprehensive Evaluation Framework for Video Chain-of-Thought Reasoning Yukun Qi et.al. 2504.07956 null
2025-04-10 Dynamic Cheatsheet: Test-Time Learning with Adaptive Memory Mirac Suzgun et.al. 2504.07952 link
2025-04-10 We Are All Creators: Generative AI, Collective Knowledge, and the Path Towards Human-AI Synergy Jordi Linares-Pellicer et.al. 2504.07936 null
2025-04-10 Echo Chamber: RL Post-training Amplifies Behaviors Learned in Pretraining Rosie Zhao et.al. 2504.07912 link
2025-04-10 Porting an LLM based Application from ChatGPT to an On-Premise Environment Teemu Paloniemi et.al. 2504.07907 null
2025-04-10 Redefining Machine Translation on Social Network Services with Large Language Models Hongcheng Guo et.al. 2504.07901 link
2025-04-10 How do Large Language Models Understand Relevance? A Mechanistic Interpretability Perspective Qi Liu et.al. 2504.07898 link
2025-04-10 Fast Adaptation with Behavioral Foundation Models Harshit Sikchi et.al. 2504.07896 null
2025-04-10 Benchmarking Adversarial Robustness to Bias Elicitation in Large Language Models: Scalable Automated Assessment with LLM-as-a-Judge Riccardo Cantini et.al. 2504.07887 link
2025-04-11 An LLM-Driven Multi-Agent Debate System for Mendelian Diseases Xinyang Zhou et.al. 2504.07881 null
2025-04-10 Token Level Routing Inference System for Edge Devices Jianshu She et.al. 2504.07878 null
2025-04-10 SAMJAM: Zero-Shot Video Scene Graph Generation for Egocentric Kitchen Videos Joshua Li et.al. 2504.07867 null
2025-04-11 Pangu Ultra: Pushing the Limits of Dense Large Language Models on Ascend NPUs Yichun Yin et.al. 2504.07866 null
2025-04-10 Robust Hallucination Detection in LLMs via Adaptive Token Selection Mengjia Niu et.al. 2504.07863 null
2025-04-10 2D-Curri-DPO: Two-Dimensional Curriculum Learning for Direct Preference Optimization Mengyang Li et.al. 2504.07856 null
2025-04-09 Sculpting Subspaces: Constrained Full Fine-Tuning in LLMs for Continual Learning Nikhil Shivakumar Nayak et.al. 2504.07097 link
2025-04-09 OLMoTrace: Tracing Language Model Outputs Back to Trillions of Training Tokens Jiacheng Liu et.al. 2504.07096 null
2025-04-09 Are We Done with Object-Centric Learning? Alexander Rubinstein et.al. 2504.07092 link
2025-04-09 KG-LLM-Bench: A Scalable Benchmark for Evaluating LLM Reasoning on Textualized Knowledge Graphs Elan Markowitz et.al. 2504.07087 null
2025-04-09 A Sober Look at Progress in Language Model Reasoning: Pitfalls and Paths to Reproducibility Andreas Hochlehnert et.al. 2504.07086 null
2025-04-09 Self-Steering Language Models Gabriel Grand et.al. 2504.07081 null
2025-04-09 DeduCE: Deductive Consistency as a Framework to Evaluate LLM Reasoning Atharva Pandey et.al. 2504.07080 null
2025-04-09 Kaleidoscope: In-language Exams for Massively Multilingual Vision Evaluation Israfel Salazar et.al. 2504.07072 null
2025-04-09 A Survey on Personalized and Pluralistic Preference Alignment in Large Language Models Zhouhang Xie et.al. 2504.07070 null
2025-04-09 HalluciNot: Hallucination Detection Through Context and Common Knowledge Verification Bibek Paudel et.al. 2504.07069 null
2025-04-09 Teaching pathology foundation models to accurately predict gene expression with parameter efficient knowledge transfer Shi Pan et.al. 2504.07061 null
2025-04-09 TASTE: Text-Aligned Speech Tokenization and Embedding for Spoken Language Modeling Liang-Hsuan Tseng et.al. 2504.07053 link
2025-04-09 To Backtrack or Not to Backtrack: When Sequential Search Limits Model Reasoning Tian Qin et.al. 2504.07052 null
2025-04-09 Evaluating Retrieval Augmented Generative Models for Document Queries in Transportation Safety Chad Melton et.al. 2504.07022 null
2025-04-09 LLM-IFT: LLM-Powered Information Flow Tracking for Secure Hardware Nowfel Mashnoor et.al. 2504.07015 null
2025-04-09 Towards LLMs Robustness to Changes in Prompt Format Styles Lilian Ngweta et.al. 2504.06969 null
2025-04-09 Efficient Self-Supervised Learning for Earth Observation via Dynamic Dataset Curation Thomas Kerdreux et.al. 2504.06962 null
2025-04-10 VideoChat-R1: Enhancing Spatio-Temporal Perception via Reinforcement Fine-Tuning Xinhao Li et.al. 2504.06958 null
2025-04-09 Adaptive Computation Pruning for the Forgetting Transformer Zhixuan Lin et.al. 2504.06949 null
2025-04-09 RuOpinionNE-2024: Extraction of Opinion Tuples from Russian News Texts Natalia Loukachevitch et.al. 2504.06947 link
2025-04-08 GOLLuM: Gaussian Process Optimized LLMs -- Reframing LLM Finetuning through Bayesian Optimization Bojana Ranković et.al. 2504.06265 link
2025-04-08 OmniSVG: A Unified Scalable Vector Graphics Generation Model Yiying Yang et.al. 2504.06263 null
2025-04-09 Hogwild! Inference: Parallel LLM Generation via Concurrent Attention Gleb Rodionov et.al. 2504.06261 null
2025-04-08 FEABench: Evaluating Language Models on Multiphysics Reasoning Ability Nayantara Mudur et.al. 2504.06260 link
2025-04-08 Orb-v3: atomistic simulation at scale Benjamin Rhodes et.al. 2504.06231 link
2025-04-08 LExT: Towards Evaluating Trustworthiness of Natural Language Explanations Krithi Shailya et.al. 2504.06227 null
2025-04-08 Encoder-Decoder Gemma: Improving the Quality-Efficiency Trade-Off via Adaptation Biao Zhang et.al. 2504.06225 null
2025-04-09 Earth-Adapter: Bridge the Geospatial Domain Gaps with Mixture of Frequency Adaptation Xiaoxing Hu et.al. 2504.06220 link
2025-04-08 Can Performant LLMs Be Ethical? Quantifying the Impact of Web Crawling Opt-Outs Dongyang Fan et.al. 2504.06219 null
2025-04-08 From 128K to 4M: Efficient Training of Ultra-Long Context Large Language Models Chejian Xu et.al. 2504.06214 null
2025-04-08 TxGemma: Efficient and Agentic LLMs for Therapeutics Eric Wang et.al. 2504.06196 null
2025-04-08 A Self-Supervised Framework for Space Object Behaviour Characterisation Ian Groves et.al. 2504.06176 null
2025-04-08 Assessing how hyperparameters impact Large Language Models' sarcasm detection performance Montgomery Gole et.al. 2504.06166 null
2025-04-09 Navigating the Rabbit Hole: Emergent Biases in LLM-Generated Attack Narratives Targeting Mental Health Groups Rijul Magu et.al. 2504.06160 null
2025-04-08 A Large-Scale Analysis on Contextual Self-Supervised Video Representation Learning Akash Kumar et.al. 2504.06153 null
2025-04-08 V-MAGE: A Game Evaluation Framework for Assessing Visual-Centric Capabilities in Multimodal Large Language Models Xiangxi Zheng et.al. 2504.06148 link
2025-04-08 ARLO: A Tailorable Approach for Transforming Natural Language Software Requirements into Architecture using LLMs Tooraj Helmi et.al. 2504.06143 null
2025-04-08 Adversarial Training of Reward Models Alexander Bukharin et.al. 2504.06141 null
2025-04-08 A Multimedia Analytics Model for the Foundation Model Era Marcel Worring et.al. 2504.06138 null
2025-04-08 QGen Studio: An Adaptive Question-Answer Generation, Training and Evaluation Platform Movina Moses et.al. 2504.06136 null
2025-04-07 URECA: Unique Region Caption Anything Sangbeom Lim et.al. 2504.05305 null
2025-04-07 InteractVLM: 3D Interaction Reasoning from 2D Foundational Models Sai Kumar Dwivedi et.al. 2504.05303 link
2025-04-07 SmolVLM: Redefining small and efficient multimodal models Andrés Marafioti et.al. 2504.05299 null
2025-04-07 Truthful or Fabricated? Using Causal Attribution to Mitigate Reward Hacking in Explanations Pedro Ferreira et.al. 2504.05294 null
2025-04-07 The challenge of uncertainty quantification of large language models in medicine Zahra Atf et.al. 2504.05278 null
2025-04-07 Enhancing LLM-Based Short Answer Grading with Retrieval-Augmented Generation Yucheng Chu et.al. 2504.05276 null
2025-04-07 Do PhD-level LLMs Truly Grasp Elementary Addition? Probing Rule Learning vs. Memorization in Large Language Models Yang Yan et.al. 2504.05262 null
2025-04-07 Learning to Reason Over Time: Timeline Self-Reflection for Improved Temporal Reasoning in Language Models Adrián Bazaga et.al. 2504.05258 null
2025-04-07 Explaining Low Perception Model Competency with High-Competency Counterfactuals Sara Pohland et.al. 2504.05254 null
2025-04-07 LLM-based Automated Grading with Human-in-the-Loop Hang Li et.al. 2504.05239 null
2025-04-08 NoveltyBench: Evaluating Language Models for Humanlike Diversity Yiming Zhang et.al. 2504.05228 null
2025-04-07 A Reality Check of Vision-Language Pre-training in Radiology: Have We Progressed Using Text? Julio Silva-Rodríguez et.al. 2504.05227 null
2025-04-07 Vision-Language Model Predictive Control for Manipulation Planning and Trajectory Generation Jiaming Chen et.al. 2504.05225 link
2025-04-08 Leveraging LLMs for Utility-Focused Annotation: Reducing Manual Effort for Retrieval and RAG Hengran Zhang et.al. 2504.05220 null
2025-04-07 Unleashing the Power of LLMs in Dense Retrieval with Query Likelihood Modeling Hengran Zhang et.al. 2504.05216 null
2025-04-07 Post-Training Language Models for Continual Relation Extraction Sefika Efeoglu et.al. 2504.05214 null
2025-04-07 Quantum Program Linting with LLMs: Emerging Results from a Comparative Study Seung Yeob Shin et.al. 2504.05204 null
2025-04-07 Training state-of-the-art pathology foundation models with orders of magnitude less data Mikhail Karasikov et.al. 2504.05186 null
2025-04-07 Concise Reasoning via Reinforcement Learning Mehdi Fatemi et.al. 2504.05185 link
2025-04-07 BRIDGES: Bridging Graph Modality and Large Language Models within EDA Tasks Wei Li et.al. 2504.05180 null
2025-04-04 Shape My Moves: Text-Driven Shape-Aware Synthesis of Human Motions Ting-Hsuan Liao et.al. 2504.03639 null
2025-04-04 Do Larger Language Models Imply Better Reasoning? A Pretraining Scaling Law for Reasoning Xinyi Wang et.al. 2504.03635 null
2025-04-04 Align to Structure: Aligning Large Language Models with Structural Information Zae Myung Kim et.al. 2504.03622 null
2025-04-04 VISTA-OCR: Towards generative and interactive end to end OCR models Laziz Hamdi et.al. 2504.03621 null
2025-04-04 Multilingual Retrieval-Augmented Generation for Knowledge-Intensive Task Leonardo Ranaldi et.al. 2504.03616 null
2025-04-04 AIR: A Systematic Analysis of Annotations, Instructions, and Response Pairs in Preference Dataset Bingxiang He et.al. 2504.03612 null
2025-04-04 MedSAM2: Segment Anything in 3D Medical Images and Videos Jun Ma et.al. 2504.03600 link
2025-04-04 EnrichIndex: Using LLMs to Enrich Retrieval Indices Offline Peter Baile Chen et.al. 2504.03598 null
2025-04-04 PF3Det: A Prompted Foundation Feature Assisted Visual LiDAR 3D Detector Kaidong Li et.al. 2504.03563 null
2025-04-04 Agentic Knowledgeable Self-awareness Shuofei Qiao et.al. 2504.03553 link
2025-04-04 RANa: Retrieval-Augmented Navigation Gianluca Monaci et.al. 2504.03524 null
2025-04-04 Neutralizing the Narrative: AI-Powered Debiasing of Online News Articles Chen Wei Kuo et.al. 2504.03520 null
2025-04-04 SpectR: Dynamically Composing LM Experts with Spectral Routing William Fleshman et.al. 2504.03454 null
2025-04-04 Optimizing Specific and Shared Parameters for Efficient Parameter Tuning Van-Anh Nguyen et.al. 2504.03450 null
2025-04-04 LLMSched: Uncertainty-Aware Workload Scheduling for Compound LLM Applications Botao Zhu et.al. 2504.03444 null
2025-04-04 Know What You do Not Know: Verbalized Uncertainty Estimation Robustness on Corrupted Images in Vision-Language Models Mirko Borszukovszki et.al. 2504.03440 null
2025-04-04 Locations of Characters in Narratives: Andersen and Persuasion Datasets Batuhan Ozyurt et.al. 2504.03434 link
2025-04-04 Online Difficulty Filtering for Reasoning Oriented Reinforcement Learning Sanghwan Bae et.al. 2504.03380 null
2025-04-04 MultiClear: Multimodal Soft Exoskeleton Glove for Transparent Object Grasping Assistance Chen Hu et.al. 2504.03379 null
2025-04-04 Sustainable LLM Inference for Edge AI: Evaluating Quantized LLMs for Energy Efficiency, Output Accuracy, and Inference Latency Erik Johannes Husom et.al. 2504.03360 null
2025-04-03 STING-BEE: Towards Vision-Language Model for Real-World X-ray Baggage Security Inspection Divya Velayudhan et.al. 2504.02823 null
2025-04-03 Sparse Autoencoders Learn Monosemantic Features in Vision-Language Models Mateusz Pach et.al. 2504.02821 link
2025-04-03 Generative Evaluation of Complex Reasoning in Large Language Models Haowei Lin et.al. 2504.02810 link
2025-04-03 MegaMath: Pushing the Limits of Open Math Corpora Fan Zhou et.al. 2504.02807 link
2025-04-03 F-ViTA: Foundation Model Guided Visible to Thermal Translation Jay N. Paranjape et.al. 2504.02801 link
2025-04-04 A Survey of Large Language Models in Mental Health Disorder Detection on Social Media Zhuohan Ge et.al. 2504.02800 null
2025-04-03 Systematic Evaluation of Large Vision-Language Models for Surgical Artificial Intelligence Anita Rau et.al. 2504.02799 null
2025-04-03 A Framework for Situating Innovations, Opportunities, and Challenges in Advancing Vertical Systems with Large AI Models Gaurav Verma et.al. 2504.02793 null
2025-04-03 Unified World Models: Coupling Video and Action Diffusion for Pretraining on Large Robotic Datasets Chuning Zhu et.al. 2504.02792 null
2025-04-03 A Framework for Robust Cognitive Evaluation of LLMs Karin de Langis et.al. 2504.02789 null
2025-04-03 From Consumption to Collaboration: Measuring Interaction Patterns to Augment Human Cognition in Open-Ended Tasks Joshua Holstein et.al. 2504.02780 null
2025-04-03 BT-ACTION: A Test-Driven Approach for Modular Understanding of User Instruction Leveraging Behaviour Trees and LLMs Alexander Leszczynski et.al. 2504.02779 link
2025-04-03 How Deep Do Large Language Models Internalize Scientific Literature and Citation Practices? Andres Algaba et.al. 2504.02767 link
2025-04-03 Robot-Led Vision Language Model Wellbeing Assessment of Children Nida Itrat Abbasi et.al. 2504.02765 null
2025-04-03 Enhancing LLM Robustness to Perturbed Instructions: An Empirical Study Aryan Agrawal et.al. 2504.02733 link
2025-04-04 Why do LLMs attend to the first token? Federico Barbero et.al. 2504.02732 null
2025-04-03 ERPO: Advancing Safety Alignment via Ex-Ante Reasoning Preference Optimization Kehua Feng et.al. 2504.02725 null
2025-04-03 TeleMoM: Consensus-Driven Telecom Intelligence via Mixture of Models Xinquan Wang et.al. 2504.02712 null
2025-04-03 The Hidden Space of Safety: Understanding Preference-Tuned LLMs in Multilingual context Nikhil Verma et.al. 2504.02708 null
2025-04-03 LLM for Complex Reasoning Task: An Exploratory Study in Fermi Problems Zishuo Liu et.al. 2504.02671 null
2025-04-02 Slot-Level Robotic Placement via Visual Imitation from Single Human Video Dandan Shan et.al. 2504.01959 null
2025-04-02 Towards Unified Referring Expression Segmentation Across Omni-Level Visual Target Granularities Jing Liu et.al. 2504.01954 null
2025-04-02 The LLM Wears Prada: Analysing Gender Bias and Stereotypes through Online Shopping Data Massimiliano Luca et.al. 2504.01951 null
2025-04-02 Efficient Federated Learning Tiny Language Models for Mobile Network Feature Prediction Daniel Becking et.al. 2504.01947 null
2025-04-02 OpenCodeReasoning: Advancing Data Distillation for Competitive Coding Wasi Uddin Ahmad et.al. 2504.01943 null
2025-04-02 Critical Thinking: Which Kinds of Complexity Govern Optimal Reasoning Length? Celine Lee et.al. 2504.01935 link
2025-04-02 A thorough benchmark of automatic text classification: From traditional approaches to large language models Washington Cunha et.al. 2504.01930 link
2025-04-02 Gen-C: Populating Virtual Worlds with Generative Crowds Andreas Panayiotou et.al. 2504.01924 null
2025-04-02 Is Less Really More? Fake News Detection with Limited Information Zhaoyang Cao et.al. 2504.01922 link
2025-04-03 Bridging the Linguistic Divide: A Survey on Leveraging Large Language Models for Machine Translation Baban Gain et.al. 2504.01919 null
2025-04-02 FineLIP: Extending CLIP's Reach via Fine-Grained Alignment with Longer Text Inputs Mothilal Asokan et.al. 2504.01916 link
2025-04-02 Advancing AI-Scientist Understanding: Making LLM Think Like a Physicist with Interpretable Reasoning Yinggan Xu et.al. 2504.01911 null
2025-04-02 Is Temporal Prompting All We Need For Limited Labeled Action Recognition? Shreyank N Gowda et.al. 2504.01890 null
2025-04-02 TransientTables: Evaluating LLMs' Reasoning on Temporally Evolving Semi-structured Tables Abhilash Shankarampeta et.al. 2504.01879 null
2025-04-02 From Code Generation to Software Testing: AI Copilot with Context-Based RAG Yuchen Wang et.al. 2504.01866 null
2025-04-02 Cross-Lingual Consistency: A Novel Inference Framework for Advancing Reasoning in Large Language Models Zhiwei Yu et.al. 2504.01857 null
2025-04-02 Code Red! On the Harmfulness of Applying Off-the-shelf Large Language Models to Programming Tasks Ali Al-Kaswan et.al. 2504.01850 null
2025-04-02 LARGE: Legal Retrieval Augmented Generation Evaluation Tool Minhu Park et.al. 2504.01840 link
2025-04-02 Prompting Medical Vision-Language Models to Mitigate Diagnosis Bias by Generating Realistic Dermoscopic Images Nusrat Munia et.al. 2504.01838 link
2025-04-02 YourBench: Easy Custom Evaluation Sets for Everyone Sumuk Shashidhar et.al. 2504.01833 link
2025-03-31 Any2Caption: Interpreting Any Condition to Caption for Controllable Video Generation Shengqiong Wu et.al. 2503.24379 null
2025-03-31 ACPBench Hard: Unrestrained Reasoning about Action, Change, and Planning Harsha Kokel et.al. 2503.24378 null
2025-03-31 Harnessing the Reasoning Economy: A Survey of Efficient Reasoning for Large Language Models Rui Wang et.al. 2503.24377 link
2025-03-31 Exploring the Effect of Reinforcement Learning on Video Understanding: Insights from SEED-Bench-R1 Yi Chen et.al. 2503.24376 link
2025-03-31 Effectively Controlling Reasoning Models through Thinking Intervention Tong Wu et.al. 2503.24370 null
2025-03-31 Adapting Vision Foundation Models for Real-time Ultrasound Image Segmentation Xiaoran Zhang et.al. 2503.24368 null
2025-03-31 ORAL: Prompting Your Large-Scale LoRAs via Conditional Recurrent Diffusion Rana Muhammad Shahroz Khan et.al. 2503.24354 null
2025-03-31 PathOrchestra: A Comprehensive Foundation Model for Computational Pathology with Over 100 Diverse Clinical-Grade Tasks Fang Yan et.al. 2503.24345 null
2025-03-31 Can Test-Time Scaling Improve World Foundation Model? Wenyan Cong et.al. 2503.24320 link
2025-03-31 BEATS: Bias Evaluation and Assessment Test Suite for Large Language Models Alok Abhishek et.al. 2503.24310 null
2025-03-31 A Systematic Evaluation of LLM Strategies for Mental Health Text Analysis: Fine-tuning vs. Prompt Engineering vs. RAG Arshia Kermani et.al. 2503.24307 null
2025-03-31 Rec-R1: Bridging Generative Large Language Models and User-Centric Recommendation Systems via Reinforcement Learning Jiacheng Lin et.al. 2503.24289 link
2025-03-31 Style Quantization for Data-Efficient GAN Training Jian Wang et.al. 2503.24282 null
2025-03-31 Evaluating and Designing Sparse Autoencoders by Approximating Quasi-Orthogonality Sewoong Lee et.al. 2503.24277 link
2025-03-31 Enhancing Large Language Models (LLMs) for Telecommunications using Knowledge Graphs and Retrieval-Augmented Generation Dun Yuan et.al. 2503.24245 null
2025-03-31 What, How, Where, and How Well? A Survey on Test-Time Scaling in Large Language Models Qiyuan Zhang et.al. 2503.24235 link
2025-03-31 Synthetic News Generation for Fake News Classification Abdul Sittar et.al. 2503.24206 null
2025-03-31 TwT: Thinking without Tokens by Habitual Reasoning Distillation with Multi-Teachers' Guidance Jingxian Xu et.al. 2503.24198 null
2025-04-02 Text2Tracks: Prompt-based Music Recommendation via Generative Retrieval Enrico Palumbo et.al. 2503.24193 null
2025-03-31 Output Constraints as Attack Surface: Exploiting Structured Generation to Bypass LLM Safety Mechanisms Shuoming Zhang et.al. 2503.24191 null
2025-03-28 Q-Insight: Understanding Image Quality via Visual Reinforcement Learning Weiqi Li et.al. 2503.22679 link
2025-03-28 QuestBench: Can LLMs ask the right question to acquire information in reasoning tasks? Belinda Z. Li et.al. 2503.22674 link
2025-03-28 Exploring the Effectiveness of Multi-stage Fine-tuning for Cross-encoder Re-rankers Francesca Pezzuti et.al. 2503.22672 link
2025-03-28 Understanding Co-speech Gestures in-the-wild Sindhu B Hegde et.al. 2503.22668 null
2025-03-28 Unicorn: Text-Only Data Synthesis for Vision Language Model Training Xiaomin Yu et.al. 2503.22655 link
2025-03-28 Evaluating Multimodal Language Models as Visual Assistants for Visually Impaired Users Antonia Karamolegkou et.al. 2503.22610 null
2025-03-28 On the Alignment of Post-Publication Reviews & Bibliometric and Altmetric Impact -- A Case Study on Expert Statements from the Science Media Center Germany Dirk Tunger et.al. 2503.22594 null
2025-03-28 LLM-enabled Instance Model Generation Fengjunjie Pan et.al. 2503.22587 null
2025-03-28 Historical Ink: Exploring Large Language Models for Irony Detection in 19th-Century Spanish Kevin Cohen et.al. 2503.22585 link
2025-03-28 Beyond Vanilla Fine-Tuning: Leveraging Multistage, Multilingual, and Domain-Specific Methods for Low-Resource Machine Translation Sarubi Thillainathan et.al. 2503.22582 null
2025-03-28 Breaking Language Barriers in Visual Language Models via Multilingual Textual Regularization Iñigo Pikabea et.al. 2503.22577 null
2025-03-28 Niyama: Breaking the Silos of LLM Inference Serving Kanishk Goel et.al. 2503.22562 null
2025-03-28 Bridging the Dimensional Chasm: Uncover Layer-wise Dimensional Reduction in Transformers through Token Correlation Zhuo-Yang Song et.al. 2503.22547 null
2025-03-28 Exploiting Mixture-of-Experts Redundancy Unlocks Multimodal Generative Abilities Raman Dutt et.al. 2503.22517 null
2025-03-28 Assessing Foundation Models for Sea Ice Type Segmentation in Sentinel-1 SAR Imagery Samira Alkaee Taleghan et.al. 2503.22516 null
2025-03-28 Probabilistic Uncertain Reward Model: A Natural Generalization of Bradley-Terry Reward Model Wangtao Sun et.al. 2503.22480 null
2025-03-28 WorkTeam: Constructing Workflows from Natural Language with Multi-Agents Hanchao Liu et.al. 2503.22473 null
2025-03-28 Evaluating LLM-based Agents for Multi-Turn Conversations: A Survey Shengyue Guan et.al. 2503.22458 null
2025-03-28 Entropy-guided sequence weighting for efficient exploration in RL-based LLM fine-tuning Abdullah Vanlioglu et.al. 2503.22456 null
2025-03-28 STADE: Standard Deviation as a Pruning Metric Diego Coello de Portugal Mecke et.al. 2503.22451 link
2025-03-27 Mobile-VideoGPT: Fast and Accurate Video Understanding Language Model Abdelrahman Shaker et.al. 2503.21782 link
2025-03-27 Video-R1: Reinforcing Video Reasoning in MLLMs Kaituo Feng et.al. 2503.21776 link
2025-03-27 Stable-SCore: A Stable Registration-based Framework for 3D Shape Correspondence Haolin Liu et.al. 2503.21766 null
2025-03-27 Uni4D: Unifying Visual Foundation Models for 4D Modeling from a Single Video David Yifan Yao et.al. 2503.21761 link
2025-03-27 MemInsight: Autonomous Memory Augmentation for LLM Agents Rana Salama et.al. 2503.21760 null
2025-03-27 Fwd2Bot: LVLM Visual Token Compression with Double Forward Bottleneck Adrian Bulat et.al. 2503.21757 null
2025-03-27 GateLens: A Reasoning-Enhanced LLM Agent for Automotive Software Release Analytics Arsham Gholamzadeh Khoee et.al. 2503.21735 null
2025-03-27 Effective Skill Unlearning through Intervention and Abstention Yongce Li et.al. 2503.21730 link
2025-03-27 Collab: Controlled Decoding using Mixture of Agents for LLM Alignment Souradip Chakraborty et.al. 2503.21720 null
2025-03-28 Outlier dimensions favor frequent tokens in language models Iuri Macocco et.al. 2503.21718 null
2025-03-27 As easy as PIE: understanding when pruning causes language models to disagree Pietro Tropeano et.al. 2503.21714 link
2025-03-27 Enhancing Repository-Level Software Repair via Repository-Aware Knowledge Graphs Boyang Yang et.al. 2503.21710 null
2025-03-27 LLM-Gomoku: A Large Language Model-Based System for Strategic Gomoku with Self-Play and Reinforcement Learning Hui Wang et.al. 2503.21683 null
2025-03-27 JiraiBench: A Bilingual Benchmark for Evaluating Large Language Models' Detection of Human Self-Destructive Behavior Content in Jirai Community Yunze Xiao et.al. 2503.21679 null
2025-03-27 How do language models learn facts? Dynamics, curricula and hallucinations Nicolas Zucchet et.al. 2503.21676 null
2025-03-27 Intelligent IoT Attack Detection Design via ODLLM with Feature Ranking-based Knowledge Base Satvik Verma et.al. 2503.21674 link
2025-03-27 Model Assembly Learning with Heterogeneous Layer Weight Merging Yi-Kai Zhang et.al. 2503.21657 null
2025-03-27 UI-R1: Enhancing Action Prediction of GUI Agents by Reinforcement Learning Zhengxi Lu et.al. 2503.21620 link
2025-03-27 Leveraging Language Models for Analyzing Longitudinal Experiential Data in Education Ahatsham Hayat et.al. 2503.21617 null
2025-03-27 Evaluating book summaries from internal knowledge in Large Language Models: a cross-model and semantic consistency approach Javier Coronado-Blázquez et.al. 2503.21613 null
2025-03-26 Mobile-MMLU: A Mobile Intelligence Language Understanding Benchmark Sondos Mahmoud Bsharat et.al. 2503.20786 link
2025-03-26 Free4D: Tuning-free 4D Scene Generation with Spatial-Temporal Consistency Tianqi Liu et.al. 2503.20785 link
2025-03-26 Feature4X: Bridging Any Monocular Video to 4D Agentic AI with Versatile Gaussian Feature Fields Shijie Zhou et.al. 2503.20776 null
2025-03-26 ASGO: Adaptive Structured Gradient Optimization Kang An et.al. 2503.20762 null
2025-03-26 MCTS-RAG: Enhancing Retrieval-Augmented Generation with Monte Carlo Tree Search Yunhai Hu et.al. 2503.20757 null
2025-03-27 Reason-RFT: Reinforcement Fine-Tuning for Visual Reasoning Huajie Tan et.al. 2503.20752 null
2025-03-26 UniSTD: Towards Unified Spatio-Temporal Learning across Diverse Disciplines Chen Tang et.al. 2503.20748 null
2025-03-26 MATHGLANCE: Multimodal Large Language Models Do Not Know Where to Look in Mathematical Diagrams Yanpeng Sun et.al. 2503.20745 null
2025-03-26 Dynamic Motion Blending for Versatile Motion Editing Nan Jiang et.al. 2503.20724 null
2025-03-26 From Annotation to Adaptation: Metrics, Synthetic Data, and Aspect Extraction for Aspect-Based Sentiment Analysis with Large Language Models Nikita Neveditsin et.al. 2503.20715 null
2025-03-26 MMMORRF: Multimodal Multilingual Modularized Reciprocal Rank Fusion Saron Samuel et.al. 2503.20698 null
2025-03-26 Graph-Enhanced Model-Free Reinforcement Learning Agents for Efficient Power Grid Topological Control Eloy Anguiano Batanero et.al. 2503.20688 null
2025-03-27 Flip Learning: Weakly Supervised Erase to Segment Nodules in Breast Ultrasound Yuhao Huang et.al. 2503.20685 null
2025-03-27 Mitigating Low-Level Visual Hallucinations Requires Self-Awareness: Database, Model and Training Strategy Yinan Sun et.al. 2503.20673 null
2025-03-26 TAMA: A Human-AI Collaborative Thematic Analysis Framework Using Multi-Agent LLMs for Clinical Interviews Huimin Xu et.al. 2503.20666 null
2025-03-26 AutoRad-Lung: A Radiomic-Guided Prompting Autoregressive Vision-Language Model for Lung Nodule Malignancy Prediction Sadaf Khademi et.al. 2503.20662 null
2025-03-26 AccidentSim: Generating Physically Realistic Vehicle Collision Videos from Real-World Accident Reports Xiangwen Zhang et.al. 2503.20654 null
2025-03-26 Unlocking Efficient Long-to-Short LLM Reasoning with Model Merging Han Wu et.al. 2503.20641 link
2025-03-26 Collaborative Storytelling and LLM: A Linguistic Analysis of Automatically-Generated Role-Playing Game Sessions Alessandro Maisto et.al. 2503.20623 null
2025-03-26 IAP: Improving Continual Learning of Vision-Language Models via Instance-Aware Prompting Hao Fu et.al. 2503.20612 link
2025-03-25 SuperFlow++: Enhanced Spatiotemporal Consistency for Cross-Modal Data Pretraining Xiang Xu et.al. 2503.19912 link
2025-03-25 CoLLM: A Large Language Model for Composed Image Retrieval Chuong Huynh et.al. 2503.19910 link
2025-03-25 FullDiT: Multi-Task Video Generative Foundation Model with Full Attention Xuan Ju et.al. 2503.19907 null
2025-03-25 CAFe: Unifying Representation and Generation with Contrastive-Autoregressive Finetuning Hao Yu et.al. 2503.19900 link
2025-03-25 A Multi-Agent Framework Integrating Large Language Models and Generative AI for Accelerated Metamaterial Design Jie Tian et.al. 2503.19889 null
2025-03-25 CausalRAG: Integrating Causal Graphs into Retrieval-Augmented Generation Nengbo Wang et.al. 2503.19878 null
2025-03-25 Scaling Evaluation-time Compute with Reasoning Models as Process Evaluators Seungone Kim et.al. 2503.19877 null
2025-03-25 SLA-Awareness for AI-assisted coding Kishanthan Thangarajah et.al. 2503.19876 null
2025-03-25 Think Twice: Enhancing LLM Reasoning by Scaling Multi-round Test-time Thinking Xiaoyu Tian et.al. 2503.19855 null
2025-03-25 Towards Online Multi-Modal Social Interaction Understanding Xinpeng Li et.al. 2503.19851 link
2025-03-25 FALCONEye: Finding Answers and Localizing Content in ONE-hour-long videos with multi-modal LLMs Carlos Plou et.al. 2503.19850 null
2025-03-25 A Comparative Analysis of Word Segmentation, Part-of-Speech Tagging, and Named Entity Recognition for Historical Chinese Sources, 1900-1950 Zhao Fang et.al. 2503.19844 null
2025-03-25 FireEdit: Fine-grained Instruction-based Image Editing via Region-aware Vision Language Model Jun Zhou et.al. 2503.19839 null
2025-03-25 Domain-incremental White Blood Cell Classification with Privacy-aware Continual Learning Pratibha Kumari et.al. 2503.19819 null
2025-03-25 SeLIP: Similarity Enhanced Contrastive Language Image Pretraining for Multi-modal Head MRI Zhiyang Liu et.al. 2503.19801 null
2025-03-25 SemEval-2025 Task 9: The Food Hazard Detection Challenge Korbinian Randl et.al. 2503.19800 null
2025-03-25 PAVE: Patching and Adapting Video Large Language Models Zhuoming Liu et.al. 2503.19794 link
2025-03-25 Fine-Grained Erasure in Text-to-Image Diffusion-based Foundation Models Kartik Thakral et.al. 2503.19783 null
2025-03-25 LPOSS: Label Propagation Over Patches and Pixels for Open-vocabulary Semantic Segmentation Vladan Stojnić et.al. 2503.19777 link
2025-03-25 OpenLex3D: A New Evaluation Benchmark for Open-Vocabulary 3D Scene Representations Christina Kassab et.al. 2503.19764 null
2025-03-24 DINO in the Room: Leveraging 2D Foundation Models for 3D Segmentation Karim Abou Zeid et.al. 2503.18944 link
2025-03-24 SlowFast-LLaVA-1.5: A Family of Token-Efficient Video Large Language Models for Long-Form Video Understanding Mingze Xu et.al. 2503.18943 null
2025-03-24 Video-T1: Test-Time Scaling for Video Generation Fangfu Liu et.al. 2503.18942 null
2025-03-24 Exploring Training and Inference Scaling Laws in Generative Retrieval Hongru Cai et.al. 2503.18941 link
2025-03-24 CoMP: Continual Multimodal Pre-training for Vision Foundation Models Yitong Chen et.al. 2503.18931 link
2025-03-24 Trajectory Balance with Asynchrony: Decoupling Exploration and Learning for Fast, Scalable LLM Post-Training Brian R. Bartoldson et.al. 2503.18929 null
2025-03-24 Video SimpleQA: Towards Factuality Evaluation in Large Video Language Models Meng Cao et.al. 2503.18923 null
2025-03-24 FFN Fusion: Rethinking Sequential Computation in Large Language Models Akhiad Bercovich et.al. 2503.18908 null
2025-03-24 xKV: Cross-Layer SVD for KV-Cache Compression Chi-Chih Chang et.al. 2503.18893 link
2025-03-24 AgentDropout: Dynamic Agent Elimination for Token-Efficient and High-Performance LLM-Based Multi-Agent Collaboration Zhexuan Wang et.al. 2503.18891 link
2025-03-24 Toward building next-generation Geocoding systems: a systematic review Zhengcong Yin et.al. 2503.18888 null
2025-03-24 I Have Covered All the Bases Here: Interpreting Reasoning Features in Large Language Models via Sparse Autoencoders Andrey Galichin et.al. 2503.18878 link
2025-03-24 Efficient Self-Supervised Adaptation for Medical Image Analysis Moein Sorkhei et.al. 2503.18873 link
2025-03-24 Reimagining Memory Access for LLM Inference: Compression-Aware Memory Controller Design Rui Xie et.al. 2503.18869 null
2025-03-24 Reasoning to Learn from Latent Thoughts Yangjun Ruan et.al. 2503.18866 null
2025-03-25 Structuring Scientific Innovation: A Framework for Modeling and Discovering Impactful Knowledge Combinations Junlan Chen et.al. 2503.18865 null
2025-03-25 MC-LLaVA: Multi-Concept Personalized Vision-Language Model Ruichuan An et.al. 2503.18854 link
2025-03-24 Enhanced OoD Detection through Cross-Modal Alignment of Multi-Modal Representations Jeonghyeon Kim et.al. 2503.18817 link
2025-03-24 Defeating Prompt Injections by Design Edoardo Debenedetti et.al. 2503.18813 null
2025-03-24 SKDU at De-Factify 4.0: Vision Transformer with Data Augmentation for AI-Generated Image Detection Shrikant Malviya et.al. 2503.18812 link
2025-03-21 Dancing with Critiques: Enhancing LLM Reasoning with Stepwise Natural Language Self-Critique Yansi Li et.al. 2503.17363 null
2025-03-21 HCAST: Human-Calibrated Autonomy Software Tasks David Rein et.al. 2503.17354 link
2025-03-21 NdLinear Is All You Need for Representation Learning Alex Reneau et.al. 2503.17353 link
2025-03-21 OpenVLThinker: An Early Exploration to Complex Vision-Language Reasoning via Iterative Self-Improvement Yihe Deng et.al. 2503.17352 link
2025-03-21 Beyond Semantics: Rediscovering Spatial Awareness in Vision-Language Models Jianing Qi et.al. 2503.17349 null
2025-03-21 Capturing Individual Human Preferences with Reward Features André Barreto et.al. 2503.17338 null
2025-03-21 Efficient Intent-Based Filtering for Multi-Party Conversations Using Knowledge Distillation from LLMs Reem Gody et.al. 2503.17336 null
2025-03-21 CVE-Bench: A Benchmark for AI Agents' Ability to Exploit Real-World Web Application Vulnerabilities Yuxuan Zhu et.al. 2503.17332 link
2025-03-21 LLM+MAP: Bimanual Robot Task Planning using Large Language Models and Planning Domain Definition Language Kun Chu et.al. 2503.17309 link
2025-03-21 Bugdar: AI-Augmented Secure Code Review for GitHub Pull Requests John Naulty et.al. 2503.17302 null
2025-03-21 FastCuRL: Curriculum Reinforcement Learning with Progressive Context Extension for Efficient Training R1-like Reasoning Models Mingyang Song et.al. 2503.17287 link
2025-03-21 CASE -- Condition-Aware Sentence Embeddings for Conditional Semantic Textual Similarity Measurement Gaifan Zhang et.al. 2503.17279 null
2025-03-21 Revisiting End To End Sparse Autoencoder Training -- A Short Finetune is All You Need Adam Karvonen et.al. 2503.17272 link
2025-03-21 SafeMERGE: Preserving Safety Alignment in Fine-Tuned Large Language Models via Selective Layer-Wise Model Merging Aladin Djuhera et.al. 2503.17239 link
2025-03-21 Slide-Level Prompt Learning with Vision Language Models for Few-Shot Multiple Instance Learning in Histopathology Devavrat Tomar et.al. 2503.17238 link
2025-03-21 FactSelfCheck: Fact-Level Black-Box Hallucination Detection for LLMs Albert Sawczyn et.al. 2503.17229 null
2025-03-21 Automating Adjudication of Cardiovascular Events Using Large Language Models Sonish Sivarajkumar et.al. 2503.17222 null
2025-03-21 A Language Anchor-Guided Method for Robust Noisy Domain Generalization Zilin Dai et.al. 2503.17211 null
2025-03-21 TreeSynth: Synthesizing Diverse Data from Scratch via Tree-Guided Subspace Partitioning Sheng Wang et.al. 2503.17195 null
2025-03-21 LLMs Love Python: A Study of LLMs' Bias for Programming Languages and Libraries Lukas Twist et.al. 2503.17181 link
2025-03-20 DynamicVis: An Efficient and General Visual Foundation Model for Remote Sensing Image Understanding Keyan Chen et.al. 2503.16426 link
2025-03-20 Stop Overthinking: A Survey on Efficient Reasoning for Large Language Models Yang Sui et.al. 2503.16419 link
2025-03-20 M3: 3D-Spatial MultiModal Memory Xueyan Zou et.al. 2503.16413 link
2025-03-20 The Emperor's New Clothes in Benchmarking? A Rigorous Examination of Mitigation Strategies for LLM Benchmark Data Contamination Yifan Sun et.al. 2503.16402 link
2025-03-20 Exploring the Hidden Reasoning Process of Large Language Models by Misleading Them Guanyu Chen et.al. 2503.16401 null
2025-03-20 Deconstructing Long Chain-of-Thought: A Structured Reasoning Optimization Framework for Long CoT Distillation Yijia Luo et.al. 2503.16385 link
2025-03-20 LaPIG: Cross-Modal Generation of Paired Thermal and Visible Facial Images Leyang Wang et.al. 2503.16376 null
2025-03-20 JARVIS-VLA: Post-Training Large-Scale Vision Language Models to Play Visual Games with Keyboards and Mouse Muyao Li et.al. 2503.16365 null
2025-03-20 CaKE: Circuit-aware Editing Enables Generalizable Knowledge Learners Yunzhi Yao et.al. 2503.16356 link
2025-03-20 Lyra: An Efficient and Expressive Subquadratic Architecture for Modeling Biological Sequences Krithik Ramesh et.al. 2503.16351 null
2025-03-20 LLM Braces: Straightening Out LLM Predictions with Relevant Sub-Updates Ying Shen et.al. 2503.16334 null
2025-03-20 OmniGeo: Towards a Multimodal Large Language Models for Geospatial Artificial Intelligence Long Yuan et.al. 2503.16326 null
2025-03-20 Issue2Test: Generating Reproducing Test Cases from Issue Reports Noor Nashid et.al. 2503.16320 null
2025-03-21 Bridging Technology and Humanities: Evaluating the Impact of Large Language Models on Social Sciences Research with DeepSeek-R1 Peiran Gu et.al. 2503.16304 null
2025-03-20 Generalized Few-shot 3D Point Cloud Segmentation with Vision-Language Model Zhaochong An et.al. 2503.16282 link
2025-03-21 Uni-3DAR: Unified 3D Generation and Understanding via Autoregression on Compressed Spatial Tokens Shuqi Lu et.al. 2503.16278 link
2025-03-20 Chain of Functions: A Programmatic Pipeline for Fine-Grained Chart Reasoning Data Zijian Li et.al. 2503.16260 null
2025-03-20 Plug-and-Play 1.x-Bit KV Cache Quantization for Video Large Language Models Keda Tao et.al. 2503.16257 null
2025-03-21 Fin-R1: A Large Language Model for Financial Reasoning through Reinforcement Learning Zhaowei Liu et.al. 2503.16252 link
2025-03-20 Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't Quy-Anh Dang et.al. 2503.16219 link
2025-03-19 TULIP: Towards Unified Language-Image Pretraining Zineng Tang et.al. 2503.15485 null
2025-03-19 SWEET-RL: Training Multi-Turn LLM Agents on Collaborative Reasoning Tasks Yifei Zhou et.al. 2503.15478 link
2025-03-19 What Makes a Reward Model a Good Teacher? An Optimization Perspective Noam Razin et.al. 2503.15477 link
2025-03-19 Cube: A Roblox View of 3D Intelligence Foundation AI Team et.al. 2503.15475 link
2025-03-19 EgoDTM: Towards 3D-Aware Egocentric Video-Language Pretraining Boshen Xu et.al. 2503.15470 link
2025-03-19 From 1,000,000 Users to Every User: Scaling Up Personalized Preference for User-level Alignment Jia-Nan Li et.al. 2503.15463 link
2025-03-19 SkyLadder: Better and Faster Pretraining via Context Window Scheduling Tongyao Zhu et.al. 2503.15450 link
2025-03-19 VenusFactory: A Unified Platform for Protein Engineering Data Retrieval and Language Model Fine-Tuning Yang Tan et.al. 2503.15438 link
2025-03-19 Visual Position Prompt for MLLM based Visual Grounding Wei Tang et.al. 2503.15426 link
2025-03-19 Probing the topology of the space of tokens with structured prompts Michael Robinson et.al. 2503.15421 null
2025-03-19 Visual Persona: Foundation Model for Full-Body Human Customization Jisu Nam et.al. 2503.15406 null
2025-03-19 FedSCA: Federated Tuning with Similarity-guided Collaborative Aggregation for Heterogeneous Medical Image Segmentation Yumin Zhang et.al. 2503.15390 null
2025-03-19 EfficientLLaVA: Generalizable Auto-Pruning for Large Vision-language Models Yinan Liang et.al. 2503.15369 null
2025-03-19 SemEval-2025 Task 1: AdMIRe -- Advancing Multimodal Idiomaticity Representation Thomas Pickard et.al. 2503.15358 null
2025-03-19 SPILL: Domain-Adaptive Intent Clustering based on Selection and Pooling with Large Language Models I-Fan Lin et.al. 2503.15351 null
2025-03-19 TruthLens: A Training-Free Paradigm for DeepFake Detection Ritabrata Chakraborty et.al. 2503.15342 null
2025-03-19 Uncertainty-Guided Chain-of-Thought for Code Generation with LLMs Yuqi Zhu et.al. 2503.15341 null
2025-03-19 Solla: Towards a Speech-Oriented LLM That Hears Acoustic Context Junyi Ao et.al. 2503.15338 link
2025-03-19 Recover and Match: Open-Vocabulary Multi-Label Recognition through Knowledge-Constrained Optimal Transport Hao Tan et.al. 2503.15337 link
2025-03-19 Euclid Quick Data Release (Q1) Exploring galaxy properties with a multi-modal foundation model Euclid Collaboration et.al. 2503.15312 link
2025-03-18 Aligning Multimodal LLM with Human Preference: A Survey Tao Yu et.al. 2503.14504 link
2025-03-18 Engineering Scientific Assistants using Interactive Structured Induction of Programs Shraddha Surana et.al. 2503.14488 null
2025-03-18 Gricean Norms as a Basis for Effective Collaboration Fardin Saad et.al. 2503.14484 link
2025-03-19 Creation-MMBench: Assessing Context-Aware Creative Intelligence in MLLM Xinyu Fang et.al. 2503.14478 link
2025-03-18 Characterizing Data Visualization Literacy: a Systematic Literature Review Sara Beschi et.al. 2503.14468 null
2025-03-18 RWKV-7 "Goose" with Expressive Dynamic State Evolution Bo Peng et.al. 2503.14456 link
2025-03-18 EnvBench: A Benchmark for Automated Environment Setup Aleksandra Eliseeva et.al. 2503.14443 link
2025-03-18 LLM-FE: Automated Feature Engineering for Tabular Data with LLMs as Evolutionary Optimizers Nikhil Abhyankar et.al. 2503.14434 link
2025-03-18 PLAY2PROMPT: Zero-shot Tool Instruction Optimization for LLM Agents via Tool Play Wei Fang et.al. 2503.14432 null
2025-03-18 ExDDV: A New Dataset for Explainable Deepfake Detection in Video Vlad Hondru et.al. 2503.14421 link
2025-03-18 Unifying Text Semantics and Graph Structures for Temporal Text-attributed Graphs with Large Language Models Siwei Zhang et.al. 2503.14411 null
2025-03-18 Large Language Models for Virtual Human Gesture Selection Parisa Ghanad Torshizi et.al. 2503.14408 null
2025-03-18 DUNE: Distilling a Universal Encoder from Heterogeneous 2D and 3D Teachers Mert Bulent Sariyildiz et.al. 2503.14405 null
2025-03-18 From "Hallucination" to "Suture": Insights from Language Philosophy to Enhance Large Language Models Qiantong Wang et.al. 2503.14392 null
2025-03-18 How much do LLMs learn from negative examples? Shadi Hamdan et.al. 2503.14391 null
2025-03-18 Good/Evil Reputation Judgment of Celebrities by LLMs via Retrieval Augmented Generation Rikuto Tsuchida et.al. 2503.14382 null
2025-03-18 On the Standard Performance Criteria for Applied Control Design: PID, MPC or Machine Learning Controller? Pouria Sarhadi et.al. 2503.14379 link
2025-03-18 Tiled Flash Linear Attention: More Efficient Linear RNN and xLSTM Kernels Maximilian Beck et.al. 2503.14376 link
2025-03-18 MAST-Pro: Dynamic Mixture-of-Experts for Adaptive Segmentation of Pan-Tumors with Knowledge-Driven Prompts Runqi Meng et.al. 2503.14355 null
2025-03-19 MoonCast: High-Quality Zero-Shot Podcast Generation Zeqian Ju et.al. 2503.14345 link
2025-03-17 MetaScale: Test-Time Scaling with Evolving Meta-Thoughts Qin Liu et.al. 2503.13447 null
2025-03-17 MoManipVLA: Transferring Vision-language-action Models for General Mobile Manipulation Zhenyu Wu et.al. 2503.13446 null
2025-03-17 Faithfulness of LLM Self-Explanations for Commonsense Tasks: Larger Is Better, and Instruction-Tuning Allows Trade-Offs but Not Pareto Dominance Noah Y. Siegel et.al. 2503.13445 null
2025-03-17 VideoMind: A Chain-of-LoRA Agent for Long Video Reasoning Ye Liu et.al. 2503.13444 link
2025-03-17 DPC: Dual-Prompt Collaboration for Tuning Vision-Language Models Haoyang Li et.al. 2503.13443 link
2025-03-18 MaTVLM: Hybrid Mamba-Transformer for Efficient Vision-Language Modeling Yingyue Li et.al. 2503.13440 link
2025-03-17 xLSTM 7B: A Recurrent LLM for Fast and Efficient Inference Maximilian Beck et.al. 2503.13427 link
2025-03-17 SuperBPE: Space Travel for Language Models Alisa Liu et.al. 2503.13423 null
2025-03-17 A Comprehensive Survey on Multi-Agent Cooperative Decision-Making: Scenarios, Approaches, Challenges and Perspectives Weiqiang Jin et.al. 2503.13415 null
2025-03-18 DLPO: Towards a Robust, Efficient, and Generalizable Prompt Optimization Framework from a Deep-Learning Perspective Dengyun Peng et.al. 2503.13413 link
2025-03-17 Using the Tools of Cognitive Science to Understand Large Language Models at Different Levels of Analysis Alexander Ku et.al. 2503.13401 null
2025-03-17 MicroVQA: A Multimodal Reasoning Benchmark for Microscopy-Based Scientific Research James Burgess et.al. 2503.13399 link
2025-03-17 Aligned Probing: Relating Toxic Behavior and Model Internals Andreas Waldis et.al. 2503.13390 null
2025-03-17 Cream of the Crop: Harvesting Rich, Scalable and Transferable Multi-Modal Data for Instruction Fine-Tuning Mengyao Lyu et.al. 2503.13383 null
2025-03-17 Sightation Counts: Leveraging Sighted User Feedback in Building a BLV-aligned Dataset of Diagram Descriptions Wan Ju Kang et.al. 2503.13369 null
2025-03-17 Mitigating Visual Forgetting via Take-along Visual Conditioning for Multi-modal Long CoT Reasoning Hai-Long Sun et.al. 2503.13360 null
2025-03-17 Agents Play Thousands of 3D Video Games Zhongwen Xu et.al. 2503.13356 null
2025-03-17 Valid Text-to-SQL Generation with Unification-based DeepStochLog Ying Jiao et.al. 2503.13342 link
2025-03-17 LearnMate: Enhancing Online Education with LLM-Powered Personalized Learning Plans and Support Xinyu Jessica Wang et.al. 2503.13340 null
2025-03-17 Reliable and Efficient Amortized Model-based Evaluation Sang Truong et.al. 2503.13335 null
2025-03-14 Tit-for-Tat: Safeguarding Large Vision-Language Models Against Jailbreak Attacks via Adversarial Defense Shuyang Hao et.al. 2503.11619 null
2025-03-14 ASMA-Tune: Unlocking LLMs' Assembly Code Comprehension via Structural-Semantic Instruction Tuning Xinyi Wang et.al. 2503.11617 link
2025-03-14 Rethinking Few-Shot Adaptation of Vision-Language Models in Two Stages Matteo Farina et.al. 2503.11609 link
2025-03-14 Do Construction Distributions Shape Formal Language Learning In German BabyLMs? Bastian Bunzeck et.al. 2503.11593 null
2025-03-14 Pathology Image Compression with Pre-trained Autoencoders Srikar Yellapragada et.al. 2503.11591 null
2025-03-14 Broaden your SCOPE! Efficient Multi-turn Conversation Planning for LLMs using Semantic Space Zhiliang Chen et.al. 2503.11586 link
2025-03-14 SmolDocling: An ultra-compact vision-language model for end-to-end multi-modal document conversion Ahmed Nassar et.al. 2503.11576 null
2025-03-14 Synthesizing Access Control Policies using Large Language Models Adarsh Vatsa et.al. 2503.11573 null
2025-03-14 Implicit Bias-Like Patterns in Reasoning Models Messi H. J. Lee et.al. 2503.11572 null
2025-03-14 VERIFY: A Benchmark of Visual Explanation and Reasoning for Investigating Multimodal Reasoning Fidelity Jing Bi et.al. 2503.11557 null
2025-03-14 Similarity-Aware Token Pruning: Your VLM but Faster Ahmadreza Jeddi et.al. 2503.11549 link
2025-03-14 Potential of large language model-powered nudges for promoting daily water and energy conservation Zonghan Li et.al. 2503.11531 null
2025-03-14 Exploring Typographic Visual Prompts Injection Threats in Cross-Modality Generation Models Hao Cheng et.al. 2503.11519 null
2025-03-14 HiTVideo: Hierarchical Tokenizers for Enhancing Text-to-Video Generation with Autoregressive Large Language Models Ziqin Zhou et.al. 2503.11513 null
2025-03-14 V-STaR: Benchmarking Video-LLMs on Video Spatio-Temporal Reasoning Zixu Cheng et.al. 2503.11495 null
2025-03-14 A Review of DeepSeek Models' Key Innovative Techniques Chengen Wang et.al. 2503.11486 null
2025-03-14 Integrating LLMs in Gamified Systems Carlos J. Costa et.al. 2503.11458 null
2025-03-14 D3: Diversity, Difficulty, and Dependability-Aware Data Selection for Sample-Efficient LLM Instruction Tuning Jia Zhang et.al. 2503.11441 null
2025-03-14 Text Compression for Efficient Language Generation David Gu et.al. 2503.11426 null
2025-03-14 Empowering Time Series Analysis with Synthetic Data: A Survey and Outlook in the Era of Foundation Models Xu Liu et.al. 2503.11411 null
2025-03-13 GoT: Unleashing Reasoning Capability of Multimodal Large Language Model for Visual Generation and Editing Rongyao Fang et.al. 2503.10639 link
2025-03-13 A Frustratingly Simple Yet Highly Effective Attack Baseline: Over 90% Success Rate Against the Strong Black-box Models of GPT-4.5/4o/o1 Zhaoyi Li et.al. 2503.10635 link
2025-03-13 HybridVLA: Collaborative Diffusion and Autoregression in a Unified Vision-Language-Action Model Jiaming Liu et.al. 2503.10631 null
2025-03-13 UniGoal: Towards Universal Zero-shot Goal-oriented Navigation Hang Yin et.al. 2503.10630 null
2025-03-13 Transformers without Normalization Jiachen Zhu et.al. 2503.10622 null
2025-03-13 From TOWER to SPIRE: Adding the Speech Modality to a Text-Only LLM Kshitij Ambilduke et.al. 2503.10620 link
2025-03-13 Siege: Autonomous Multi-Turn Jailbreaking of Large Language Models with Tree Search Andy Zhou et.al. 2503.10619 null
2025-03-13 Compositional Subspace Representation Fine-tuning for Adaptive Large Language Models Andy Zhou et.al. 2503.10617 null
2025-03-13 R1-Onevision: Advancing Generalized Multimodal Reasoning through Cross-Modal Formalization Yi Yang et.al. 2503.10615 link
2025-03-13 CoSTA$\ast$: Cost-Sensitive Toolpath Agent for Multi-turn Image Editing Advait Gupta et.al. 2503.10613 link
2025-03-13 TruthPrInt: Mitigating LVLM Object Hallucination Via Latent Truthful-Guided Pre-Intervention Jinhao Duan et.al. 2503.10602 link
2025-03-13 GroundingSuite: Measuring Complex Multi-Granular Pixel Grounding Rui Hu et.al. 2503.10596 link
2025-03-13 Unlock the Power of Unlabeled Data in Language Driving Model Chaoqun Wang et.al. 2503.10586 null
2025-03-13 VisualWebInstruct: Scaling up Multimodal Instruction Data through Web Search Yiming Jia et.al. 2503.10582 null
2025-03-13 Unveiling the Mathematical Reasoning in DeepSeek Models: A Comparative Study of Large Language Models Afrar Jahin et.al. 2503.10573 null
2025-03-13 ASIDE: Architectural Separation of Instructions and Data in Language Models Egor Zverev et.al. 2503.10566 null
2025-03-13 Short-term AI literacy intervention does not reduce over-reliance on incorrect ChatGPT recommendations Brett Puppart et.al. 2503.10556 null
2025-03-13 KUDA: Keypoints to Unify Dynamics Learning and Visual Prompting for Open-Vocabulary Robotic Manipulation Zixian Liu et.al. 2503.10546 null
2025-03-13 DP-GPL: Differentially Private Graph Prompt Learning Jing Xu et.al. 2503.10544 null
2025-03-13 Language Models, Graph Searching, and Supervision Adulteration: When More Supervision is Less and How to Make More More Arvid Frydenlund et.al. 2503.10542 null
2025-03-12 MoC: Mixtures of Text Chunking Learners for Retrieval-Augmented Generation System Jihao Zhao et.al. 2503.09600 link
2025-03-12 How to Protect Yourself from 5G Radiation? Investigating LLM Responses to Implicit Misinformation Ruohao Guo et.al. 2503.09598 link
2025-03-12 SimLingo: Vision-Only Closed-Loop Autonomous Driving with Language-Action Alignment Katrin Renz et.al. 2503.09594 null
2025-03-12 BIMBA: Selective-Scan Compression for Long-Range Video Question Answering Md Mohaiminul Islam et.al. 2503.09590 link
2025-03-12 Cost-Optimal Grouped-Query Attention for Long-Context LLMs Yingfa Chen et.al. 2503.09579 link
2025-03-12 Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models Marianne Arriola et.al. 2503.09573 link
2025-03-12 Plan-and-Act: Improving Planning of Agents for Long-Horizon Tasks Lutfi Eren Erdogan et.al. 2503.09572 null
2025-03-13 Towards Reasoning Era: A Survey of Long Chain-of-Thought for Reasoning Large Language Models Qiguang Chen et.al. 2503.09567 null
2025-03-12 PolyPythias: Stability and Outliers across Fifty Language Model Pre-Training Runs Oskar van der Wal et.al. 2503.09543 link
2025-03-13 Large Language Models for Multi-Facility Location Mechanism Design Nguyen Thach et.al. 2503.09533 null
2025-03-13 SAEBench: A Comprehensive Benchmark for Sparse Autoencoders in Language Model Interpretability Adam Karvonen et.al. 2503.09532 null
2025-03-12 Search-R1: Training LLMs to Reason and Leverage Search Engines with Reinforcement Learning Bowen Jin et.al. 2503.09516 link
2025-03-12 Reinforcement Learning is all You Need Yongsheng Lian et.al. 2503.09512 null
2025-03-12 ReMA: Learning to Meta-think for LLMs with Multi-Agent Reinforcement Learning Ziyu Wan et.al. 2503.09501 link
2025-03-12 MindGYM: Enhancing Vision-Language Models via Synthetic Self-Challenging Questions Zhe Xu et.al. 2503.09499 link
2025-03-12 Parameter-Efficient Adaptation of Geospatial Foundation Models through Embedding Deflection Romain Thoreau et.al. 2503.09493 null
2025-03-12 Project-Probe-Aggregate: Efficient Fine-Tuning for Group Robustness Beier Zhu et.al. 2503.09487 null
2025-03-12 BAMBI: Developing Baby Language Models for Italian Alice Suozzi et.al. 2503.09481 null
2025-03-12 SurgicalVLM-Agent: Towards an Interactive AI Co-Pilot for Pituitary Surgery Jiayuan Huang et.al. 2503.09474 null
2025-03-12 Explicit Learning and the LLM in Machine Translation Malik Marmonier et.al. 2503.09454 link
2025-03-11 QuoTA: Query-oriented Token Assignment via CoT Query Decouple for Long Video Comprehension Yongdong Luo et.al. 2503.08689 link
2025-03-11 Randomness, Not Representation: The Unreliability of Evaluating Cultural Alignment in LLMs Ariba Khan et.al. 2503.08688 link
2025-03-11 Perplexity Trap: PLM-Based Retrievers Overrate Low Perplexity Documents Haoyu Wang et.al. 2503.08684 link
2025-03-11 Self-Taught Self-Correction for Small Language Models Viktor Moskvoretskii et.al. 2503.08681 null
2025-03-11 Understanding and Mitigating Distribution Shifts For Machine Learning Force Fields Tobias Kreiman et.al. 2503.08674 null
2025-03-11 Generating Robot Constitutions & Benchmarks for Semantic Safety Pierre Sermanet et.al. 2503.08663 null
2025-03-11 Exploring the Word Sense Disambiguation Capabilities of Large Language Models Pierpaolo Basile et.al. 2503.08662 null
2025-03-11 YuE: Scaling Open Foundation Models for Long-Form Music Generation Ruibin Yuan et.al. 2503.08638 link
2025-03-11 LightGen: Efficient Image Generation through Knowledge Distillation and Direct Preference Optimization Xianfeng Wu et.al. 2503.08619 link
2025-03-11 EMMOE: A Comprehensive Benchmark for Embodied Mobile Manipulation in Open Environments Dongping Li et.al. 2503.08604 link
2025-03-11 NSF-SciFy: Mining the NSF Awards Database for Scientific Claims Delip Rao et.al. 2503.08600 null
2025-03-11 Proc4Gem: Foundation models for physical agency through procedural generation Yixin Lin et.al. 2503.08593 null
2025-03-11 BiasEdit: Debiasing Stereotyped Language Models via Model Editing Xin Xu et.al. 2503.08588 link
2025-03-11 HierarQ: Task-Aware Hierarchical Q-Former for Enhanced Video Understanding Shehreen Azad et.al. 2503.08585 null
2025-03-11 RAG-Adapter: A Plug-and-Play RAG-enhanced Framework for Long Video Understanding Xichen Tan et.al. 2503.08576 null
2025-03-11 DeepReview: Improving LLM-based Paper Review with Human-like Deep Thinking Process Minjun Zhu et.al. 2503.08569 null
2025-03-11 Reasoning and Sampling-Augmented MCQ Difficulty Prediction via LLMs Wanyong Feng et.al. 2503.08551 null
2025-03-11 Transferring Extreme Subword Style Using Ngram Model-Based Logit Scaling Craig Messner et.al. 2503.08550 null
2025-03-11 Graph of AI Ideas: Leveraging Knowledge Graphs and LLMs for AI Research Idea Generation Xian Gao et.al. 2503.08549 null
2025-03-11 TLA: Tactile-Language-Action Model for Contact-Rich Manipulation Peng Hao et.al. 2503.08548 null
2025-03-10 Robusto-1 Dataset: Comparing Humans and VLMs on real out-of-distribution Autonomous Driving VQA from Peru Dunant Cusipuma et.al. 2503.07587 null
2025-03-10 Talking to GDELT Through Knowledge Graphs Audun Myers et.al. 2503.07584 null
2025-03-10 VisBias: Measuring Explicit and Implicit Social Biases in Vision Language Models Jen-tse Huang et.al. 2503.07575 link
2025-03-10 AutoSpatial: Visual-Language Reasoning for Social Robot Navigation through Efficient Spatial Reasoning Learning Yangzhe Kong et.al. 2503.07557 null
2025-03-10 Junior Software Developers' Perspectives on Adopting LLMs for Software Engineering: a Systematic Literature Review Samuel Ferino et.al. 2503.07556 null
2025-03-10 KSOD: Knowledge Supplement for LLMs On Demand Haoran Li et.al. 2503.07550 null
2025-03-10 Bi-Directional Mental Model Reconciliation for Human-Robot Interaction with Large Language Models Nina Moorman et.al. 2503.07547 null
2025-03-10 Queueing, Predictions, and LLMs: Challenges and Open Problems Michael Mitzenmacher et.al. 2503.07545 null
2025-03-10 XIFBench: Evaluating Large Language Models on Multilingual Instruction Following Zhenyu Li et.al. 2503.07539 null
2025-03-10 Building English ASR model with regional language support Purvi Agrawal et.al. 2503.07522 null
2025-03-10 GRITHopper: Decomposition-Free Multi-Hop Dense Retrieval Justus-Jonas Erker et.al. 2503.07519 link
2025-03-10 TokenButler: Token Importance is Predictable Yash Akhauri et.al. 2503.07518 link
2025-03-10 Language Models Fail to Introspect About Their Knowledge of Language Siyuan Song et.al. 2503.07513 link
2025-03-10 Plume: Scaffolding Text Composition in Dashboards Maxim Lisnic et.al. 2503.07512 null
2025-03-10 Sometimes the Model doth Preach: Quantifying Religious Bias in Open LLMs through Demographic Analysis in Asian Nations Hari Shankar et.al. 2503.07510 link
2025-03-10 Think Before You Segment: High-Quality Reasoning Segmentation with GPT Chain of Thoughts Shiu-hong Kao et.al. 2503.07503 null
2025-03-10 V2Flow: Unifying Visual Tokenization and Large Language Model Vocabularies for Autoregressive Image Generation Guiwei Zhang et.al. 2503.07493 link
2025-03-10 LLaVA-RadZ: Can Multimodal Large Language Models Effectively Tackle Zero-shot Radiology Recognition? Bangyan Li et.al. 2503.07487 null
2025-03-10 Chameleon: Fast-slow Neuro-symbolic Lane Topology Extraction Zongzheng Zhang et.al. 2503.07485 link
2025-03-10 VLRMBench: A Comprehensive and Challenging Benchmark for Vision-Language Reward Models Jiacheng Ruan et.al. 2503.07478 link
2025-03-07 Fairness-Aware Low-Rank Adaptation Under Demographic Privacy Constraints Parameswaran Kamalaruban et.al. 2503.05684 null
2025-03-07 Understanding the Limits of Lifelong Knowledge Editing in LLMs Lukas Thede et.al. 2503.05683 null
2025-03-07 A Survey of Large Language Model Empowered Agents for Recommendation and Search: Towards Next-Generation Information Retrieval Yu Zhang et.al. 2503.05659 link
2025-03-07 Learning LLM Preference over Intra-Dialogue Pairs: A Framework for Utterance-level Understandings Xuanqing Liu et.al. 2503.05620 null
2025-03-07 A Survey on Sparse Autoencoders: Interpreting the Internal Mechanisms of Large Language Models Dong Shu et.al. 2503.05613 null
2025-03-07 From Theory to Application: A Practical Introduction to Neural Operators in Scientific Computing Prashant K. Jha et.al. 2503.05598 link
2025-03-07 R1-Searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning Huatong Song et.al. 2503.05592 null
2025-03-07 Quantifying the Robustness of Retrieval-Augmented Language Models Against Spurious Features in Grounding Data Shiping Yang et.al. 2503.05587 null
2025-03-07 Evaluating open-source Large Language Models for automated fact-checking Nicolo' Fontana et.al. 2503.05565 null
2025-03-07 Revitalizing Saturated Benchmarks: A Weighted Metric Approach for Differentiating Large Language Model Performance Bryan Etzine et.al. 2503.05551 null
2025-03-07 Leveraging Approximate Caching for Faster Retrieval-Augmented Generation Shai Bergman et.al. 2503.05530 null
2025-03-07 PoSSUM: A Protocol for Surveying Social-media Users with Multimodal LLMs Roberto Cerina et.al. 2503.05529 null
2025-03-07 Cognitive Bias Detection Using Advanced Prompt Engineering Frederic Lemieux et.al. 2503.05516 null
2025-03-07 Grammar-Based Code Representation: Is It a Worthy Pursuit for LLMs? Qingyuan Liang et.al. 2503.05507 null
2025-03-07 Statistical Guarantees of Correctness Coverage for Medical Multiple-Choice Question Answering Yusong Ke et.al. 2503.05505 null
2025-03-07 Benchmarking LLMs in Recommendation Tasks: A Comparative Evaluation with Conventional Recommenders Qijiong Liu et.al. 2503.05493 null
2025-03-07 Maximum Hallucination Standards for Domain-Specific Large Language Models Tingmingke Lu et.al. 2503.05481 null
2025-03-07 The Society of HiveMind: Multi-Agent Optimization of Foundation Model Swarms to Unlock the Potential of Collective Intelligence Noah Mamie et.al. 2503.05473 null
2025-03-07 Soft Policy Optimization: Online Off-Policy RL for Sequence Models Taco Cohen et.al. 2503.05453 null
2025-03-07 LLM-based Iterative Approach to Metamodeling in Automotive Nenad Petrovic et.al. 2503.05449 null
2025-03-06 L$^2$M: Mutual Information Scaling Law for Long-Context Language Modeling Zhuo Chen et.al. 2503.04725 link
2025-03-06 LLMVoX: Autoregressive Streaming Text-to-Speech Model for Any LLM Sambal Shikhar et.al. 2503.04724 null
2025-03-07 Shifting Long-Context LLMs Research from Input to Output Yuhao Wu et.al. 2503.04723 null
2025-03-06 Enough Coin Flips Can Make LLMs Act Bayesian Ritwik Gupta et.al. 2503.04722 null
2025-03-06 Full-Duplex-Bench: A Benchmark to Evaluate Full-duplex Spoken Dialogue Models on Turn-taking Capabilities Guan-Ting Lin et.al. 2503.04721 link
2025-03-06 Predictable Scale: Part I -- Optimal Hyperparameter Scaling Law in Large Language Model Pretraining Houyi Li et.al. 2503.04715 null
2025-03-06 Scaling Rich Style-Prompted Text-to-Speech Datasets Anuj Diwan et.al. 2503.04713 link
2025-03-06 Universality of Layer-Level Entropy-Weighted Quantization Beyond Model Architecture and Size Alireza Behtash et.al. 2503.04704 null
2025-03-06 L1: Controlling How Long A Reasoning Model Thinks With Reinforcement Learning Pranjal Aggarwal et.al. 2503.04697 null
2025-03-06 UIPE: Enhancing LLM Unlearning by Removing Knowledge Related to Forgetting Targets Wenyu Wang et.al. 2503.04693 null
2025-03-06 Quantifying the Reasoning Abilities of LLMs on Real-world Clinical Cases Pengcheng Qiu et.al. 2503.04691 null
2025-03-06 LLM-guided Plan and Retrieval: A Strategic Alignment for Interpretable User Satisfaction Estimation in Dialogue Sangyeop Kim et.al. 2503.04675 null
2025-03-06 An Information-theoretic Multi-task Representation Learning Framework for Natural Language Understanding Dou Hu et.al. 2503.04667 link
2025-03-06 CLDyB: Towards Dynamic Benchmarking for Continual Learning with Pre-trained Models Shengzhuang Chen et.al. 2503.04655 link
2025-03-06 Transferable Foundation Models for Geometric Tasks on Point Cloud Representations: Geometric Neural Operators Blaine Quackenbush et.al. 2503.04649 link
2025-03-06 Implicit Cross-Lingual Rewarding for Efficient Multilingual Preference Alignment Wen Yang et.al. 2503.04647 null
2025-03-06 Enhancing SAM with Efficient Prompting and Preference Optimization for Semi-supervised Medical Image Segmentation Aishik Konwer et.al. 2503.04639 null
2025-03-06 Mark Your LLM: Detecting the Misuse of Open-Source Large Language Models via Watermarking Yijie Xu et.al. 2503.04636 null
2025-03-06 Better Process Supervision with Bi-directional Rewarding Signals Wenxiang Chen et.al. 2503.04618 null
2025-03-06 Towards Data-Efficient Language Models: A Child-Inspired Approach to Language Learning Mohammad Amin Ghanizadeh et.al. 2503.04611 null
2025-03-05 The MASK Benchmark: Disentangling Honesty From Accuracy in AI Systems Richard Ren et.al. 2503.03750 null
2025-03-05 Process-based Self-Rewarding Language Models Shimao Zhang et.al. 2503.03746 link
2025-03-05 CHOP: Mobile Operating Assistant with Constrained High-frequency Optimized Subtask Planning Yuqi Zhou et.al. 2503.03743 link
2025-03-05 Towards Understanding Distilled Reasoning Models: A Representational Approach David D. Baek et.al. 2503.03730 null
2025-03-05 Improving LLM Safety Alignment with Dual-Objective Optimization Xuandong Zhao et.al. 2503.03710 link
2025-03-05 Effective LLM Knowledge Learning via Model Generalization Mingkang Zhu et.al. 2503.03705 null
2025-03-05 A Practical Memory Injection Attack against LLM Agents Shen Dong et.al. 2503.03704 null
2025-03-05 Developing and Utilizing a Large-Scale Cantonese Dataset for Multi-Tasking in Large Language Models Jiyue Jiang et.al. 2503.03702 null
2025-03-05 Addressing Overprescribing Challenges: Fine-Tuning Large Language Models for Medication Recommendation Tasks Zihao Zhao et.al. 2503.03687 link
2025-03-05 Attentive Reasoning Queries: A Systematic Method for Optimizing Instruction-Following in Large Language Models Bar Karov et.al. 2503.03669 link
2025-03-05 Analogical Reasoning Inside Large Language Models: Concept Vectors and the Limits of Abstraction Gustaw Opiełka et.al. 2503.03666 link
2025-03-05 Robust Learning of Diverse Code Edits Tushar Aggarwal et.al. 2503.03656 null
2025-03-05 Improving Neutral Point of View Text Generation through Parameter-Efficient Reinforcement Learning and a Small-Scale High-Quality Dataset Jessica Hoffmann et.al. 2503.03654 null
2025-03-05 Token-Level Privacy in Large Language Models Re'em Harel et.al. 2503.03652 null
2025-03-05 Psy-Copilot: Visual Chain of Thought for Counseling Keqi Chen et.al. 2503.03645 null
2025-03-05 Large language models in finance: estimating financial sentiment for stock prediction Kemal Kirtac et.al. 2503.03612 null
2025-03-05 Enhancing the Accuracy and Comprehensibility in Architectural Tactics Detection via Small Model-Augmented Prompt Engineering Lingli Cao et.al. 2503.03609 link
2025-03-05 Psy-Insight: Explainable Multi-turn Bilingual Dataset for Mental Health Counseling Keqi Chen et.al. 2503.03607 null
2025-03-05 Feature-Level Insights into Artificial Text Detection with Sparse Autoencoders Kristian Kuznetsov et.al. 2503.03601 null
2025-03-05 Small but Mighty: Enhancing Time Series Forecasting with Lightweight LLMs Haoran Fan et.al. 2503.03594 link
2025-03-04 Wikipedia in the Era of LLMs: Evolution and Risks Siming Huang et.al. 2503.02879 link
2025-03-04 Language Models can Self-Improve at State-Value Estimation for Better Search Ethan Mendes et.al. 2503.02878 link
2025-03-04 SPIDER: A Comprehensive Multi-Organ Supervised Pathology Dataset and Baseline Models Dmitry Nechaev et.al. 2503.02876 link
2025-03-04 The First Few Tokens Are All You Need: An Efficient and Effective Unsupervised Prefix Fine-Tuning Method for Reasoning Models Ke Ji et.al. 2503.02875 null
2025-03-04 Prompting Generative AI with Interaction-Augmented Instructions Leixian Shen et.al. 2503.02874 null
2025-03-04 FairSense-AI: Responsible AI Meets Sustainability Shaina Raza et.al. 2503.02865 null
2025-03-04 Calibrating LLM Confidence with Semantic Steering: A Multi-Prompt Aggregation Framework Ziang Zhou et.al. 2503.02863 null
2025-03-04 Privacy and Accuracy-Aware AI/ML Model Deduplication Hong Guan et.al. 2503.02862 null
2025-03-04 (How) Do Language Models Track State? Belinda Z. Li et.al. 2503.02854 null
2025-03-04 Shakespearean Sparks: The Dance of Hallucination and Creativity in LLMs' Decoding Layers Zicong He et.al. 2503.02851 link
2025-03-04 Mask-DPO: Generalizable Fine-grained Factuality Alignment of LLMs Yuzhe Gu et.al. 2503.02846 link
2025-03-04 Beyond Cosine Decay: On the effectiveness of Infinite Learning Rate Schedule for Continual Pre-training Paul Janson et.al. 2503.02844 null
2025-03-04 AlignDistil: Token-Level Language Model Alignment as Adaptive Policy Distillation Songming Zhang et.al. 2503.02832 null
2025-03-04 Developing a PET/CT Foundation Model for Cross-Modal Anatomical and Functional Imaging Yujin Oh et.al. 2503.02824 null
2025-03-04 "What If Smart Homes Could See Our Homes?": Exploring DIY Smart Home Building Experiences with VLM-Based Camera Sensors Sojeong Yun et.al. 2503.02816 null
2025-03-04 Q-Filters: Leveraging QK Geometry for Efficient KV Cache Compression Nathan Godey et.al. 2503.02812 link
2025-03-04 RAAD-LLM: Adaptive Anomaly Detection Using LLMs and RAG Integration Alicia Russell-Gilbert et.al. 2503.02800 null
2025-03-04 Multimodal AI predicts clinical outcomes of drug combinations from preclinical data Yepeng Huang et.al. 2503.02781 link
2025-03-04 Implicit Bias in LLMs: A Survey Xinru Lin et.al. 2503.02776 null
2025-03-04 InSerter: Speech Instruction Following with Unsupervised Interleaved Pre-training Dingdong Wang et.al. 2503.02769 null
2025-02-28 LLM Post-Training: A Deep Dive into Reasoning Large Language Models Komal Kumar et.al. 2502.21321 link
2025-02-28 Raccoon: Multi-stage Diffusion Training with Coarse-to-Fine Curating Videos Zhiyu Tan et.al. 2502.21314 null
2025-02-28 FANformer: Improving Large Language Models Through Effective Periodicity Modeling Yihong Dong et.al. 2502.21309 link
2025-02-28 Contextualizing biological perturbation experiments through language Menghua Wu et.al. 2502.21290 link
2025-02-28 Adaptive Keyframe Sampling for Long Video Understanding Xi Tang et.al. 2502.21271 null
2025-03-03 Foundation Models -- A Panacea for Artificial Intelligence in Pathology? Nita Mulliqi et.al. 2502.21264 null
2025-02-28 Modeling Human Beliefs about AI Behavior for Scalable Oversight Leon Lang et.al. 2502.21262 null
2025-02-28 PET Image Denoising via Text-Guided Diffusion: Integrating Anatomical Priors through Text Prompts Boxiao Yu et.al. 2502.21260 null
2025-02-28 RoboBrain: A Unified Brain Model for Robotic Manipulation from Abstract to Concrete Yuheng Ji et.al. 2502.21257 null
2025-02-28 TimesBERT: A BERT-Style Foundation Model for Time Series Understanding Haoran Zhang et.al. 2502.21245 null
2025-02-28 Semantic Volume: Quantifying and Detecting both External and Internal Uncertainty in LLMs Xiaomin Li et.al. 2502.21239 null
2025-02-28 Transforming Tuberculosis Care: Optimizing Large Language Models For Enhanced Clinician-Patient Communication Daniil Filienko et.al. 2502.21236 null
2025-02-28 ByteScale: Efficient Scaling of LLM Training with a 2048K Context Length on More Than 12,000 GPUs Hao Ge et.al. 2502.21231 null
2025-03-03 ECLeKTic: a Novel Challenge Set for Evaluation of Cross-Lingual Knowledge Transfer Omer Goldman et.al. 2502.21228 null
2025-02-28 Transformers Learn to Implement Multi-step Gradient Descent with Chain of Thought Jianhao Huang et.al. 2502.21212 null
2025-02-28 Chronologically Consistent Large Language Models Songrun He et.al. 2502.21206 null
2025-02-28 $\Delta$-model correction of Foundation Model based on the models own understanding Mads-Peter Verner Christiansen et.al. 2502.21179 null
2025-03-03 Causality Is Key to Understand and Balance Multiple Goals in Trustworthy ML and Foundation Models Ruta Binkyte et.al. 2502.21123 null
2025-02-28 Optimizing Large Language Models for ESG Activity Detection in Financial Texts Mattia Birti et.al. 2502.21112 link
2025-02-28 Large Language Model-Based Benchmarking Experiment Settings for Evolutionary Multi-Objective Optimization Lie Meng Pang et.al. 2502.21108 null
2025-02-27 R2-T2: Re-Routing in Test-Time for Multimodal Mixture-of-Experts Zhongyang Li et.al. 2502.20395 link
2025-02-27 Why Are Web AI Agents More Vulnerable Than Standalone LLMs? A Security Analysis Jeffrey Yang Fan Chiang et.al. 2502.20383 null
2025-02-27 Multi-Agent Verification: Scaling Test-Time Compute with Multiple Verifiers Shalev Lifshitz et.al. 2502.20379 null
2025-02-27 PhantomWiki: On-Demand Datasets for Reasoning and Retrieval Evaluation Albert Gong et.al. 2502.20377 link
2025-02-27 Bridging Legal Knowledge and AI: Retrieval-Augmented Generation with Vector Stores, Knowledge Graphs, and Hierarchical Non-negative Matrix Factorization Ryan C. Barron et.al. 2502.20364 link
2025-02-27 Bridging the Creativity Understanding Gap: Small-Scale Human Alignment Enables Expert-Level Humor Ranking in LLMs Kuan Lok Zhou et.al. 2502.20356 null
2025-02-27 KEDRec-LM: A Knowledge-distilled Explainable Drug Recommendation Large Language Model Kai Zhang et.al. 2502.20350 null
2025-02-27 Sparse Auto-Encoder Interprets Linguistic Features in Large Language Models Yi Jing et.al. 2502.20344 null
2025-02-27 Thinking Slow, Fast: Scaling Inference Compute with Distilled Reasoners Daniele Paliotta et.al. 2502.20339 null
2025-02-27 Expertise Is What We Want Alan Ashworth et.al. 2502.20335 null
2025-02-27 Emergent Symbolic Mechanisms Support Abstract Reasoning in Large Language Models Yukang Yang et.al. 2502.20332 null
2025-02-27 Long-Context Inference with Retrieval-Augmented Speculative Decoding Guanzheng Chen et.al. 2502.20330 link
2025-02-27 LangProBe: a Language Programs Benchmark Shangyin Tan et.al. 2502.20315 null
2025-02-27 EAIRA: Establishing a Methodology for Evaluating AI Models as Scientific Research Assistants Franck Cappello et.al. 2502.20309 link
2025-02-27 M^3Builder: A Multi-Agent System for Automated Machine Learning in Medical Imaging Jinghao Feng et.al. 2502.20301 null
2025-02-27 An exploration of features to improve the generalisability of fake news detection models Nathaniel Hoy et.al. 2502.20299 null
2025-02-27 Judge a Book by its Cover: Investigating Multi-Modal LLMs for Multi-Page Handwritten Document Transcription Benjamin Gutteridge et.al. 2502.20295 link
2025-02-27 Visual Adaptive Prompting for Compositional Zero-Shot Learning Kyle Stein et.al. 2502.20292 null
2025-02-27 Conformal Tail Risk Control for Large Language Model Alignment Catherine Yu-Chi Chen et.al. 2502.20285 null
2025-02-27 Evaluating Human Trust in LLM-Based Planners: A Preliminary Study Shenghui Chen et.al. 2502.20284 null
2025-02-26 Hi Robot: Open-Ended Instruction Following with Hierarchical Vision-Language-Action Models Lucy Xiaoyang Shi et.al. 2502.19417 null
2025-02-26 Norm Growth and Stability Challenges in Localized Sequential Knowledge Editing Akshat Gupta et.al. 2502.19416 null
2025-02-26 Can Language Models Falsify? Evaluating Algorithmic Reasoning with Counterexample Creation Shiven Sinha et.al. 2502.19414 link
2025-02-26 Project Alexandria: Towards Freeing Scientific Knowledge from Copyright Burdens via LLMs Christoph Schuhmann et.al. 2502.19413 null
2025-02-26 Code to Think, Think to Code: A Survey on Code-Enhanced Reasoning and Reasoning-Driven Code Intelligence in LLMs Dayu Yang et.al. 2502.19411 link
2025-02-26 Less or More: Towards Glanceable Explanations for LLM Recommendations Using Ultra-Small Devices Xinru Wang et.al. 2502.19410 null
2025-02-26 ImageChain: Advancing Sequential Image-to-Text Reasoning in Multimodal Large Language Models Danae Sánchez Villegas et.al. 2502.19409 null
2025-02-26 Learning Code-Edit Embedding to Model Student Debugging Behavior Hasnain Heickal et.al. 2502.19407 null
2025-02-26 General Reasoning Requires Learning to Reason from the Get-go Seungwook Han et.al. 2502.19402 null
2025-02-26 TheoremExplainAgent: Towards Multimodal Explanations for LLM Theorem Understanding Max Ku et.al. 2502.19400 null
2025-02-26 LiDAR Registration with Visual Foundation Models Niclas Vödisch et.al. 2502.19374 null
2025-02-26 Deep Learning For Time Series Analysis With Application On Human Motion Ali Ismail-Fawaz et.al. 2502.19364 null
2025-02-26 DataMan: Data Manager for Pre-training Large Language Models Ru Peng et.al. 2502.19363 null
2025-02-26 Can Large Language Models Detect Errors in Long Chain-of-Thought Reasoning? Yancheng He et.al. 2502.19361 link
2025-02-26 Controlled Diversity: Length-optimized Natural Language Generation Diana Marie Schenke et.al. 2502.19347 null
2025-02-26 Evaluating LLMs and Pre-trained Models for Text Summarization Across Diverse Datasets Tohida Rehman et.al. 2502.19339 null
2025-02-26 I Know What I Don't Know: Improving Model Cascades Through Confidence Tuning Stephan Rabanser et.al. 2502.19335 null
2025-02-26 Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems Hao Peng et.al. 2502.19328 link
2025-02-26 Shh, don't say that! Domain Certification in LLMs Cornelius Emde et.al. 2502.19320 null
2025-02-26 Rethinking LLM Unlearning Objectives: A Gradient Perspective and Go Beyond Qizhou Wang et.al. 2502.19301 null
2025-02-25 DRAMA: Diverse Augmentation from Large Language Models to Smaller Dense Retrievers Xueguang Ma et.al. 2502.18460 link
2025-02-25 LLM-Based Design Pattern Detection Christian Schindler et.al. 2502.18458 null
2025-02-25 Evaluating the Effectiveness of Small Language Models in Detecting Refactoring Bugs Rohit Gheyi et.al. 2502.18454 null
2025-02-25 FRIDA to the Rescue! Analyzing Synthetic Data Effectiveness in Object-Based Common Sense Reasoning for Disaster Response Mollie Shichman et.al. 2502.18452 null
2025-02-25 SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution Yuxiang Wei et.al. 2502.18449 null
2025-02-25 olmOCR: Unlocking Trillions of Tokens in PDFs with Vision Language Models Jake Poznanski et.al. 2502.18443 link
2025-02-25 MAPoRL: Multi-Agent Post-Co-Training for Collaborative Large Language Models with Reinforcement Learning Chanwoo Park et.al. 2502.18439 null
2025-02-25 Reversal Blessing: Thinking Backward May Outpace Thinking Forward in Multi-choice Questions Yizhe Zhang et.al. 2502.18435 null
2025-02-25 Exploring Gender Disparities in Automatic Speech Recognition Technology Hend ElGhazaly et.al. 2502.18434 null
2025-02-25 TextGames: Learning to Self-Play Text-Based Puzzle Games via Language Model Reasoning Frederikus Hudi et.al. 2502.18431 link
2025-02-25 PyEvalAI: AI-assisted evaluation of Jupyter Notebooks for immediate personalized feedback Nils Wandel et.al. 2502.18425 null
2025-02-25 Compressing Language Models for Specialized Domains Miles Williams et.al. 2502.18424 null
2025-02-25 Rank1: Test-Time Compute for Reranking in Information Retrieval Orion Weller et.al. 2502.18418 link
2025-02-25 OmniAlign-V: Towards Enhanced Alignment of MLLMs with Human Preference Xiangyu Zhao et.al. 2502.18411 link
2025-02-25 Enhancing DNA Foundation Models to Address Masking Inefficiencies Monireh Safari et.al. 2502.18405 null
2025-02-25 Monte Carlo Temperature: a robust sampling strategy for LLM's uncertainty quantification methods Nicola Cecere et.al. 2502.18389 null
2025-02-25 How Far are LLMs from Real Search? A Comprehensive Study on Efficiency, Completeness, and Inherent Capabilities Minhua Lin et.al. 2502.18387 null
2025-02-25 MindMem: Multimodal for Predicting Advertisement Memorability Using LLMs and Deep Learning Sepehr Asgarian et.al. 2502.18371 null
2025-02-25 Responsible AI Agents Deven R. Desai et.al. 2502.18359 null
2025-02-25 Which Contributions Deserve Credit? Perceptions of Attribution in Human-AI Co-Creation Jessica He et.al. 2502.18357 null
2025-02-24 Introducing Visual Perception Token into Multimodal Large Language Model Runpeng Yu et.al. 2502.17425 link
2025-02-24 MLLMs Know Where to Look: Training-free Perception of Small Visual Details with Multimodal LLMs Jiarui Zhang et.al. 2502.17422 link
2025-02-24 LongSpec: Long-Context Speculative Decoding with Efficient Drafting and Verification Penghui Yang et.al. 2502.17421 link
2025-02-24 The Geometry of Refusal in Large Language Models: Concept Cones and Representational Independence Tom Wollschläger et.al. 2502.17420 null
2025-02-24 From System 1 to System 2: A Survey of Reasoning Large Language Models Zhong-Zhi Li et.al. 2502.17419 link
2025-02-24 Reasoning with Latent Thoughts: On the Power of Looped Transformers Nikunj Saunshi et.al. 2502.17416 null
2025-02-24 COSMOS: A Hybrid Adaptive Optimizer for Memory-Efficient Training of LLMs Liming Liu et.al. 2502.17410 link
2025-02-24 Large Language Models are Powerful EHR Encoders Stefan Hegselmann et.al. 2502.17403 link
2025-02-24 Big-Math: A Large-Scale, High-Quality Math Dataset for Reinforcement Learning in Language Models Alon Albalak et.al. 2502.17387 link
2025-02-24 Bridging Gaps in Natural Language Processing for Yorùbá: A Systematic Review of a Decade of Progress and Prospects Toheeb A. Jimoh et.al. 2502.17364 null
2025-02-24 A Closer Look at TabPFN v2: Strength, Limitation, and Extension Han-Jia Ye et.al. 2502.17361 null
2025-02-24 RELICT: A Replica Detection Framework for Medical Image Generation Orhun Utku Aydin et.al. 2502.17360 link
2025-02-24 DIS-CO: Discovering Copyrighted Content in VLMs Training Data André V. Duarte et.al. 2502.17358 link
2025-02-24 Distributional Scaling Laws for Emergent Capabilities Rosie Zhao et.al. 2502.17356 null
2025-02-24 On Relation-Specific Neurons in Large Language Models Yihong Liu et.al. 2502.17355 link
2025-02-24 How Scientists Use Large Language Models to Program Gabrielle O'Brien et.al. 2502.17348 null
2025-02-24 Time series forecasting based on optimized LLM for fault prediction in distribution power grid insulators João Pedro Matos-Carvalho et.al. 2502.17341 null
2025-02-24 Tokenized SAEs: Disentangling SAE Reconstructions Thomas Dooms et.al. 2502.17332 null
2025-02-24 HIPPO: Enhancing the Table Understanding Capability of Large Language Models through Hybrid-Modal Preference Optimization Zhenghao Liu et.al. 2502.17315 link
2025-02-24 'Generalization is hallucination' through the lens of tensor completions Liang Ze Wong et.al. 2502.17305 null
2025-02-21 ELIP: Enhanced Visual-Language Foundation Models for Image Retrieval Guanqi Zhan et.al. 2502.15682 null
2025-02-21 Privacy Ripple Effects from Adding or Removing Personal Information in Language Model Training Jaydeep Borkar et.al. 2502.15680 link
2025-02-21 BOSS: Benchmark for Observation Space Shift in Long-Horizon Task Yue Yang et.al. 2502.15679 null
2025-02-21 Testing the limits of fine-tuning to improve reasoning in vision language models Luca M. Schulze Buschoff et.al. 2502.15678 null
2025-02-21 FLEKE: Federated Locate-then-Edit Knowledge Editing Zongkai Zhao et.al. 2502.15677 link
2025-02-21 AutoToM: Automated Bayesian Inverse Planning and Model Discovery for Open-ended Theory of Mind Zhining Zhang et.al. 2502.15676 link
2025-02-21 Almost AI, Almost Human: The Challenge of Detecting AI-Polished Writing Shoumik Saha et.al. 2502.15666 link
2025-02-21 Machine-generated text detection prevents language model collapse George Drayson et.al. 2502.15654 link
2025-02-21 Empowering LLMs with Logical Reasoning: A Comprehensive Survey Fengxiang Cheng et.al. 2502.15652 null
2025-02-21 Steering into New Embedding Spaces: Analyzing Cross-Lingual Alignment Induced by Model Interventions in Multilingual Language Models Anirudh Sundar et.al. 2502.15639 null
2025-02-21 Mantis: Lightweight Calibrated Foundation Model for User-Friendly Time Series Classification Vasilii Feofanov et.al. 2502.15637 link
2025-02-21 The Relationship Between Reasoning and Performance in Large Language Models -- o3 (mini) Thinks Harder, Not Longer Marthe Ballon et.al. 2502.15631 link
2025-02-21 Extraction multi-étiquettes de relations en utilisant des couches de Transformer Ngoc Luyen Le et.al. 2502.15619 null
2025-02-21 Probe Pruning: Accelerating LLMs through Dynamic Pruning via Model-Probing Qi Le et.al. 2502.15618 link
2025-02-21 PDeepPP: A Deep learning framework with Pretrained Protein language for peptide classification Jixiu Zhai et.al. 2502.15610 link
2025-02-21 On the Robustness of Transformers against Context Hijacking for Linear Classification Tianle Li et.al. 2502.15609 null
2025-02-21 Cross-Format Retrieval-Augmented Generation in XR with LLMs for Context-Aware Maintenance Assistance Akos Nagy et.al. 2502.15604 null
2025-02-21 Do Multilingual LLMs Think In English? Lisa Schut et.al. 2502.15603 null
2025-02-21 WorldCraft: Photo-Realistic 3D World Creation and Customization via LLM Agents Xinhang Liu et.al. 2502.15601 null
2025-02-21 SafeInt: Shielding Large Language Models from Jailbreak Attacks via Safety-Aware Representation Intervention Jiaqi Wu et.al. 2502.15594 null
2025-02-20 LServe: Efficient Long-sequence LLM Serving with Unified Sparse Attention Shang Yang et.al. 2502.14866 link
2025-02-20 Aligning LLMs to Ask Good Questions: A Case Study in Clinical Reasoning Shuyue Stella Li et.al. 2502.14860 link
2025-02-20 FR-Spec: Accelerating Large-Vocabulary Language Models via Frequency-Ranked Speculative Sampling Weilin Zhao et.al. 2502.14856 null
2025-02-20 Prompt-to-Leaderboard Evan Frick et.al. 2502.14855 link
2025-02-20 GATE: Graph-based Adaptive Tool Evolution Across Diverse Tasks Jianwen Luo et.al. 2502.14848 link
2025-02-20 Red-Teaming LLM Multi-Agent Systems via Communication Attacks Pengfei He et.al. 2502.14847 null
2025-02-20 Scaling Text-Rich Image Understanding via Code-Guided Synthetic Multimodal Data Generation Yue Yang et.al. 2502.14846 null
2025-02-20 Revealing and Mitigating Over-Attention in Knowledge Editing Pinzheng Wang et.al. 2502.14838 link
2025-02-20 LongWriter-V: Enabling Ultra-Long and High-Fidelity Generation in Vision-Language Models Shangqing Tu et.al. 2502.14834 link
2025-02-20 Middle-Layer Representation Alignment for Cross-Lingual Transfer in Fine-Tuned LLMs Danni Liu et.al. 2502.14830 link
2025-02-20 Measuring Faithfulness of Chains of Thought by Unlearning Reasoning Steps Martin Tutek et.al. 2502.14829 link
2025-02-20 Exploring Advanced Techniques for Visual Question Answering: A Comprehensive Comparison Aiswarya Baby et.al. 2502.14827 null
2025-02-20 A Survey of Model Architectures in Information Retrieval Zhichao Xu et.al. 2502.14822 null
2025-02-20 eC-Tab2Text: Aspect-Based Text Generation from e-Commerce Product Tables Luis Antonio Gutiérrez Guanilo et.al. 2502.14820 null
2025-02-20 Dynamic Low-Rank Sparse Adaptation for Large Language Models Weizhong Huang et.al. 2502.14816 link
2025-02-20 FetalCLIP: A Visual-Language Foundation Model for Fetal Ultrasound Image Analysis Fadillah Maani et.al. 2502.14807 link
2025-02-20 From RAG to Memory: Non-Parametric Continual Learning for Large Language Models Bernal Jiménez Gutiérrez et.al. 2502.14802 link
2025-02-20 A Multi-Agent Perspective on Modern Information Retrieval Haya Nachimovsky et.al. 2502.14796 null
2025-02-20 Rapid Word Learning Through Meta In-Context Learning Wentao Wang et.al. 2502.14791 null
2025-02-20 SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features Michael Tschannen et.al. 2502.14786 link
2025-02-19 Where's the Bug? Attention Probing for Scalable Fault Localization Adam Stein et.al. 2502.13966 null
2025-02-19 Autellix: An Efficient Serving Engine for LLM Agents as General Programs Michael Luo et.al. 2502.13965 null
2025-02-19 MuDAF: Long-Context Multi-Document Attention Focusing through Contrastive Learning on Attention Heads Weihao Liu et.al. 2502.13963 link
2025-02-19 Is That Your Final Answer? Test-Time Scaling Improves Selective Question Answering William Jurayj et.al. 2502.13962 null
2025-02-19 LIDDIA: Language-based Intelligent Drug Discovery Agent Reza Averly et.al. 2502.13959 null
2025-02-19 Neurosymbolic artificial intelligence via large language models and coherence-driven inference Steve Huntsman et.al. 2502.13953 null
2025-02-19 Why Safeguarded Ships Run Aground? Aligned Large Language Models' Safety Mechanisms Tend to Be Anchored in The Template Region Chak Tou Leong et.al. 2502.13946 null
2025-02-19 A Chain-of-Thought Subspace Meta-Learning for Few-shot Image Captioning with Large Vision and Language Models Hao Huang et.al. 2502.13942 null
2025-02-19 Symmetrical Visual Contrastive Optimization: Aligning Vision-Language Models with Minimal Contrastive Images Shengguang Wu et.al. 2502.13928 null
2025-02-19 Beyond Single Frames: Can LMMs Comprehend Temporal and Contextual Narratives in Image Sequences? Xiaochen Wang et.al. 2502.13925 null
2025-02-19 LongPO: Long Context Self-Evolution of Large Language Models through Short-to-Long Preference Optimization Guanzheng Chen et.al. 2502.13922 link
2025-02-19 Exploring Code Language Models for Automated HLS-based Hardware Generation: Benchmark, Infrastructure and Analysis Jiahao Gai et.al. 2502.13921 null
2025-02-19 Exploring Personalized Health Support through Data-Driven, Theory-Guided LLMs: A Case Study in Sleep Health Xingbo Wang et.al. 2502.13920 link
2025-02-19 TESS 2: A Large-Scale Generalist Diffusion Language Model Jaesung Tae et.al. 2502.13917 link
2025-02-19 How Do LLMs Perform Two-Hop Reasoning in Context? Tianyu Guo et.al. 2502.13913 null
2025-02-19 Lost in Sequence: Do Large Language Models Understand Sequential Recommendation? Sein Kim et.al. 2502.13909 link
2025-02-19 Judging the Judges: A Collection of LLM-Generated Relevance Judgements Hossein A. Rahmani et.al. 2502.13908 link
2025-02-19 DataSciBench: An LLM Agent Benchmark for Data Science Dan Zhang et.al. 2502.13897 link
2025-02-19 NavigateDiff: Visual Predictors are Zero-Shot Navigation Assistants Yiran Qin et.al. 2502.13894 null
2025-02-19 Refining embeddings with fill-tuning: data-efficient generalised performance improvements for materials foundation models Matthew P. Wilson et.al. 2502.13886 link
2025-02-18 Re-Align: Aligning Vision Language Models via Retrieval-Augmented Direct Preference Optimization Shuo Xing et.al. 2502.13146 link
2025-02-18 Multimodal Mamba: Decoder-only Multimodal State Space Model via Quadratic to Linear Distillation Bencheng Liao et.al. 2502.13145 link
2025-02-18 Pre-training Auto-regressive Robotic Models with 4D Representations Dantong Niu et.al. 2502.13142 null
2025-02-18 UniGuardian: A Unified Defense for Detecting Prompt Injection, Backdoor Attacks and Adversarial Attacks in Large Language Models Huawei Lin et.al. 2502.13141 link
2025-02-18 AIDE: AI-Driven Exploration in the Space of Code Zhengyao Jiang et.al. 2502.13138 link
2025-02-18 Theorem Prover as a Judge for Synthetic Data Generation Joshua Ong Jun Leang et.al. 2502.13137 null
2025-02-18 Sleepless Nights, Sugary Days: Creating Synthetic Users with Health Conditions for Realistic Coaching Agent Interactions Taedong Yun et.al. 2502.13135 null
2025-02-18 Learning to Defer for Causal Discovery with Imperfect Experts Oscar Clivio et.al. 2502.13132 null
2025-02-18 Rethinking Diverse Human Preference Learning through Principal Component Analysis Feng Luo et.al. 2502.13131 null
2025-02-18 Magma: A Foundation Model for Multimodal AI Agents Jianwei Yang et.al. 2502.13130 link
2025-02-18 Facilitating Long Context Understanding via Supervised Chain-of-Thought Reasoning Jingyang Lin et.al. 2502.13127 null
2025-02-18 RuozhiBench: Evaluating LLMs with Logical Fallacies and Misleading Premises Zenan Zhai et.al. 2502.13125 link
2025-02-18 Adapting Psycholinguistic Research for LLMs: Gender-inclusive Language in a Coreference Context Marion Bartl et.al. 2502.13120 null
2025-02-18 STEER-ME: Assessing the Microeconomic Reasoning of Large Language Models Narun Raman et.al. 2502.13119 null
2025-02-18 Performance Evaluation of Large Language Models in Statistical Programming Xinyi Song et.al. 2502.13117 link
2025-02-18 MatterChat: A Multi-Modal LLM for Material Science Yingheng Tang et.al. 2502.13107 null
2025-02-18 Understanding and Rectifying Safety Perception Distortion in VLMs Xiaohan Zou et.al. 2502.13095 null
2025-02-18 Text2World: Benchmarking Large Language Models for Symbolic World Model Generation Mengkang Hu et.al. 2502.13092 null
2025-02-18 KAPPA: A Generic Patent Analysis Framework with Keyphrase-Based Portraits Xin Xia et.al. 2502.13076 null
2025-02-18 Cramming 1568 Tokens into a Single Vector and Back Again: Exploring the Limits of Embedding Space Capacity Yuri Kuratov et.al. 2502.13063 link
2025-02-17 Idiosyncrasies in Large Language Models Mingjie Sun et.al. 2502.12150 link
2025-02-17 HermesFlow: Seamlessly Closing the Gap in Multimodal Understanding and Generation Ling Yang et.al. 2502.12148 link
2025-02-17 Fast or Better? Balancing Accuracy and Cost in Retrieval-Augmented Generation with Flexible User Control Jinyan Su et.al. 2502.12145 link
2025-02-17 Small Models Struggle to Learn from Strong Reasoners Yuetai Li et.al. 2502.12143 null
2025-02-17 SoftCoT: Soft Chain-of-Thought for Efficient Reasoning with LLMs Yige Xu et.al. 2502.12134 link
2025-02-17 Transformer Dynamics: A neuroscientific approach to interpretability of large language models Jesseba Fernando et.al. 2502.12131 null
2025-02-17 Scaling Autonomous Agents via Automatic Reward Modeling And Planning Zhenfang Chen et.al. 2502.12130 null
2025-02-17 On the Query Complexity of Verifier-Assisted Language Generation Edoardo Botta et.al. 2502.12123 null
2025-02-17 Minimal Ranks, Maximum Confidence: Parameter-efficient Uncertainty Quantification for LoRA Patryk Marszałek et.al. 2502.12122 link
2025-02-17 LLMs on the Line: Data Determines Loss-to-Loss Scaling Laws Prasanna Mayilvahanan et.al. 2502.12120 null
2025-02-17 PRISM: Self-Pruning Intrinsic Selection Method for Training-Free Multimodal Data Selection Jinhe Bi et.al. 2502.12119 null
2025-02-17 A-MEM: Agentic Memory for LLM Agents Wujiang Xu et.al. 2502.12110 link
2025-02-17 Personality Structured Interview for Large Language Model Simulation in Personality Research Pengda Wang et.al. 2502.12109 null
2025-02-17 Relational Norms for Human-AI Cooperation Brian D. Earp et.al. 2502.12102 null
2025-02-17 Token Communications: A Unified Framework for Cross-modal Context-aware Semantic Communications Li Qiao et.al. 2502.12096 null
2025-02-17 Descriminative-Generative Custom Tokens for Vision-Language Models Pramuditha Perera et.al. 2502.12095 null
2025-02-17 Meta-Statistical Learning: Supervised Learning of Statistical Inference Maxime Peyrard et.al. 2502.12088 null
2025-02-17 APB: Accelerating Distributed Long-Context Inference by Passing Compressed Context Blocks across GPUs Yuxiang Huang et.al. 2502.12085 link
2025-02-17 VLM$^2$-Bench: A Closer Look at How Well VLMs Implicitly Link Explicit Matching Visual Cues Jianshu Zhang et.al. 2502.12084 null
2025-02-17 AdaSplash: Adaptive Sparse Flash Attention Nuno Gonçalves et.al. 2502.12082 link
2025-02-14 MM-RLHF: The Next Step Forward in Multimodal LLM Alignment Yi-Fan Zhang et.al. 2502.10391 null
2025-02-14 Aspect-Oriented Summarization for Psychiatric Short-Term Readmission Prediction WonJin Yoon et.al. 2502.10388 null
2025-02-14 Unknown Word Detection for English as a Second Language (ESL) Learners Using Gaze and Pre-trained Language Models Jiexin Ding et.al. 2502.10378 null
2025-02-14 Robustness tests for biomedical foundation models should tailor to specification R. Patrick Xian et.al. 2502.10374 link
2025-02-14 Enhancing Multilingual LLM Pretraining with Model-Based Data Selection Bettina Messmer et.al. 2502.10361 null
2025-02-14 Organize the Web: Constructing Domains Enhances Pre-Training Data Curation Alexander Wettig et.al. 2502.10341 null
2025-02-14 Evaluating the Meta- and Object-Level Reasoning of Large Language Models for Question Answering Nick Ferguson et.al. 2502.10338 null
2025-02-14 LLM-Powered Preference Elicitation in Combinatorial Assignment Ermis Soumalias et.al. 2502.10308 null
2025-02-14 SPIRIT: Short-term Prediction of solar IRradIance for zero-shot Transfer learning using Foundation Models Aditya Mishra et.al. 2502.10307 null
2025-02-14 Open-Source AI-Powered Optimization in Scalene: Advancing Python Performance Profiling with DeepSeek-R1 and LLaMA 3.2 Saem Hasan et.al. 2502.10299 null
2025-02-14 DeltaProduct: Increasing the Expressivity of DeltaNet Through Products of Householders Julien Siems et.al. 2502.10297 link
2025-02-14 Probing Perceptual Constancy in Large Vision Language Models Haoran Sun et.al. 2502.10273 null
2025-02-14 Are Large Language Models the future crowd workers of Linguistics? Iris Ferrazzo et.al. 2502.10266 null
2025-02-14 Large Language Models and Synthetic Data for Monitoring Dataset Mentions in Research Papers Aivin V. Solatorio et.al. 2502.10263 link
2025-02-14 VisCon-100K: Leveraging Contextual Web Data for Fine-tuning Vision Language Models Gokul Karthik Kumar et.al. 2502.10250 null
2025-02-14 Step-Video-T2V Technical Report: The Practice, Challenges, and Future of Video Foundation Model Guoqing Ma et.al. 2502.10248 link
2025-02-14 Efficient Zero-Order Federated Finetuning of Language Models for Resource-Constrained Devices Mohamed Aboelenien Ahmed et.al. 2502.10239 null
2025-02-14 AdaPTS: Adapting Univariate Foundation Models to Probabilistic Multivariate Time Series Forecasting Abdelhakim Benechehab et.al. 2502.10235 link
2025-02-14 Do Large Language Models Reason Causally Like Us? Even Better? Hanna M. Dettki et.al. 2502.10215 null
2025-02-14 Can Post-Training Quantization Benefit from an Additional QLoRA Integration? Xiliang Zhu et.al. 2502.10202 null
2025-02-13 Theoretical Benefit and Limitation of Diffusion Language Model Guhao Feng et.al. 2502.09622 null
2025-02-13 MME-CoT: Benchmarking Chain-of-Thought in Large Multimodal Models for Reasoning Quality, Robustness, and Efficiency Dongzhi Jiang et.al. 2502.09621 null
2025-02-13 Exploring the Potential of Encoder-free Architectures in 3D LMMs Yiwen Tang et.al. 2502.09620 link
2025-02-13 Human-LLM Coevolution: Evidence from Academic Writing Mingmeng Geng et.al. 2502.09606 null
2025-02-13 SelfCite: Self-Supervised Alignment for Context Attribution in Large Language Models Yung-Sung Chuang et.al. 2502.09604 link
2025-02-13 GAIA: A Global, Multi-modal, Multi-scale Vision-Language Dataset for Remote Sensing Image Analysis Angelos Zavras et.al. 2502.09598 link
2025-02-13 Do LLMs Recognize Your Preferences? Evaluating Personalized Preference Following in LLMs Siyan Zhao et.al. 2502.09597 link
2025-02-13 KIMAs: A Configurable Knowledge Integrated Multi-Agent System Zitao Li et.al. 2502.09596 null
2025-02-13 Logical forms complement probability in understanding language model (and human) performance Yixuan Wang et.al. 2502.09589 null
2025-02-13 Polymind: Parallel Visual Diagramming with Large Language Models to Support Prewriting Through Microtasks Qian Wan et.al. 2502.09577 null
2025-02-13 MorphNLI: A Stepwise Approach to Natural Language Inference Using Text Morphing Vlad Andrei Negru et.al. 2502.09567 null
2025-02-13 Zero-shot generation of synthetic neurosurgical data with large language models Austin A. Barr et.al. 2502.09566 link
2025-02-13 MDCrow: Automating Molecular Dynamics Workflows with Large Language Models Quintina Campbell et.al. 2502.09565 link
2025-02-13 EmbodiedBench: Comprehensive Benchmarking Multi-modal Large Language Models for Vision-Driven Embodied Agents Rui Yang et.al. 2502.09560 null
2025-02-13 Explainable AI-assisted Optimization for Feynman Integral Reduction Zhuo-Yang Song et.al. 2502.09544 null
2025-02-13 Mind the Gap! Choice Independence in Using Multilingual LLMs for Persuasive Co-Writing Tasks in Different Languages Shreyan Biswas et.al. 2502.09532 null
2025-02-13 When and How Does CLIP Enable Domain and Compositional Generalization? Elias Kempf et.al. 2502.09507 null
2025-02-13 Improve LLM-based Automatic Essay Scoring with Linguistic Features Zhaoyi Joey Hou et.al. 2502.09497 null
2025-02-13 Foundation Neural-Network Quantum States Riccardo Rende et.al. 2502.09488 null
2025-02-13 Objective quantification of mood states using large language models Jakub Onysk et.al. 2502.09487 null
2025-02-12 SwiftSketch: A Diffusion Model for Image-to-Vector Sketch Generation Ellie Arar et.al. 2502.08642 null
2025-02-12 Examining Multilingual Embedding Models Cross-Lingually Through LLM-Generated Adversarial Examples Andrianos Michail et.al. 2502.08638 null
2025-02-12 Ensemble based approach to quantifying uncertainty of LLM based classifications Srijith Rajamohan et.al. 2502.08631 null
2025-02-12 Continuous Cardiac Arrest Prediction in ICU using PPG Foundation Model Saurabh Kataria et.al. 2502.08612 null
2025-02-12 Causal Analysis of ASR Errors for Children: Quantifying the Impact of Physiological, Cognitive, and Extrinsic Factors Vishwanath Pratap Singh et.al. 2502.08587 null
2025-02-12 Commercial LLM Agents Are Already Vulnerable to Simple Yet Dangerous Attacks Ang Li et.al. 2502.08586 null
2025-02-12 COAST: Intelligent Time-Adaptive Neural Operators Zhikai Wu et.al. 2502.08574 null
2025-02-12 QA-Expand: Multi-Question Answer Generation for Enhanced Query Expansion in Information Retrieval Wonduk Seo et.al. 2502.08557 null
2025-02-12 Human-Centric Foundation Models: Perception, Generation and Agentic Modeling Shixiang Tang et.al. 2502.08556 link
2025-02-12 Fostering Appropriate Reliance on Large Language Models: The Role of Explanations, Sources, and Inconsistencies Sunnie S. Y. Kim et.al. 2502.08554 null
2025-02-12 LLMs can implicitly learn from mistakes in-context Lisa Alazraki et.al. 2502.08550 null
2025-02-12 Representation Learning to Advance Multi-institutional Studies with Electronic Health Record Data Doudou Zhou et.al. 2502.08547 null
2025-02-12 Moment of Untruth: Dealing with Negative Queries in Video Moment Retrieval Kevin Flanagan et.al. 2502.08544 link
2025-02-12 LLM Pretraining with Continuous Concepts Jihoon Tack et.al. 2502.08524 null
2025-02-12 The Paradox of Stochasticity: Limited Creativity and Computational Decoupling in Temperature-Varied LLM Outputs of Structured Fictional Data Evgenii Evstafev et.al. 2502.08515 null
2025-02-12 Faithful, Unfaithful or Ambiguous? Multi-Agent Debate with Initial Stance for Summary Evaluation Mahnaz Koupaee et.al. 2502.08514 link
2025-02-12 Measuring Diversity in Synthetic Datasets Yuchang Zhu et.al. 2502.08512 link
2025-02-12 Explanation based In-Context Demonstrations Retrieval for Multilingual Grammatical Error Correction Wei Li et.al. 2502.08507 link
2025-02-12 Salamandra Technical Report Aitor Gonzalez-Agirre et.al. 2502.08489 link
2025-02-12 One-Shot Federated Learning with Classifier-Free Diffusion Models Obaidullah Zaland et.al. 2502.08488 null
2025-02-11 DarwinLM: Evolutionary Structured Pruning of Large Language Models Shengkun Tang et.al. 2502.07780 link
2025-02-11 Auditing Prompt Caching in Language Model APIs Chenchen Gu et.al. 2502.07776 link
2025-02-11 Automatic Robot Task Planning by Integrating Large Language Model with Genetic Programming Azizjon Kobilov et.al. 2502.07772 null
2025-02-11 Breaking Down Bias: On The Limits of Generalizable Pruning Strategies Sibo Ma et.al. 2502.07771 null
2025-02-11 Great Power Brings Great Responsibility: Personalizing Conversational AI for Diverse Problem-Solvers Italo Santos et.al. 2502.07763 null
2025-02-11 Scalable Fingerprinting of Large Language Models Anshul Nasery et.al. 2502.07760 null
2025-02-11 Towards Efficient Optimizer Design for LLM via Structured Fisher Approximation with a Low-Rank Extension Wenbo Gong et.al. 2502.07752 null
2025-02-11 WHODUNIT: Evaluation benchmark for culprit detection in mystery stories Kshitij Gupta et.al. 2502.07747 link
2025-02-11 The Economics of Large Language Models: Token Allocation, Fine-Tuning, and Optimal Pricing Dirk Bergemann et.al. 2502.07736 null
2025-02-11 Economics of Sourcing Human Data Sebastin Santy et.al. 2502.07732 null
2025-02-11 Verifying LLM-Generated Code in the Context of Software Verification with Ada/SPARK Marcos Cramer et.al. 2502.07728 null
2025-02-11 Making Language Models Robust Against Negation MohammadHossein Rezaei et.al. 2502.07717 link
2025-02-11 Magic 1-For-1: Generating One Minute Video Clips within One Minute Hongwei Yi et.al. 2502.07701 link
2025-02-11 A Framework for LLM-powered Design Assistants Swaroop Panda et.al. 2502.07698 null
2025-02-11 Large Language Models as Proxies for Theories of Human Linguistic Cognition Imry Ziv et.al. 2502.07687 null
2025-02-11 SymGPT: Auditing Smart Contracts via Combining Symbolic Execution with Large Language Models Shihao Xia et.al. 2502.07644 null
2025-02-11 FoQA: A Faroese Question-Answering Dataset Annika Simonsen et.al. 2502.07642 null
2025-02-11 Goedel-Prover: A Frontier Model for Open-Source Automated Theorem Proving Yong Lin et.al. 2502.07640 link
2025-02-11 Exploring Mobile Touch Interaction with Large Language Models Tim Zindulka et.al. 2502.07629 null
2025-02-11 Scaling Pre-training to One Hundred Billion Data for Vision Language Models Xiao Wang et.al. 2502.07617 null
2025-02-10 EVEv2: Improved Baselines for Encoder-Free Vision-Language Models Haiwen Diao et.al. 2502.06788 link
2025-02-10 Visual Agentic AI for Spatial Reasoning with a Dynamic API Damiano Marsili et.al. 2502.06787 null
2025-02-10 DeepCrossAttention: Supercharging Transformer Residual Connections Mike Heddes et.al. 2502.06785 null
2025-02-10 Towards Internet-Scale Training For Agents Brandon Trabucco et.al. 2502.06776 null
2025-02-10 Enhancing Trust in Language Model-Based Code Optimization through RLHF: A Research Design Jingzhi Gong et.al. 2502.06769 null
2025-02-10 Exploiting Sparsity for Long Context Inference: Million Token Contexts on Commodity GPUs Ryan Synk et.al. 2502.06766 link
2025-02-10 Rationalization Models for Text-to-SQL Gaetano Rossiello et.al. 2502.06759 null
2025-02-10 Accelerating Data Processing and Benchmarking of AI Models for Pathology Andrew Zhang et.al. 2502.06750 link
2025-02-10 Gradient Multi-Normalization for Stateless and Scalable LLM Training Meyer Scetbon et.al. 2502.06742 null
2025-02-10 VersaPRM: Multi-Domain Process Reward Model via Synthetic Reasoning Data Thomas Zeng et.al. 2502.06737 null
2025-02-10 Dynamic Loss-Based Sample Reweighting for Improved Large Language Model Pretraining Daouda Sow et.al. 2502.06733 null
2025-02-10 Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling Runze Liu et.al. 2502.06703 link
2025-02-10 EquiTabPFN: A Target-Permutation Equivariant Prior Fitted Networks Michael Arbel et.al. 2502.06684 null
2025-02-10 Boosting Self-Efficacy and Performance of Large Language Models via Verbal Efficacy Stimulations Rui Chen et.al. 2502.06669 null
2025-02-10 Automatic Evaluation of Healthcare LLMs Beyond Question-Answering Anna Arias-Duart et.al. 2502.06666 null
2025-02-10 Evaluation of Deep Audio Representations for Hearables Fabian Gröger et.al. 2502.06664 null
2025-02-10 EfficientLLM: Scalable Pruning-Aware Pretraining for Architecture-Agnostic Edge Language Models Xingrun Xing et.al. 2502.06663 null
2025-02-10 Unbiased Evaluation of Large Language Models from a Causal Perspective Meilin Chen et.al. 2502.06655 null
2025-02-10 In-Context Learning (and Unlearning) of Length Biases Stephanie Schoch et.al. 2502.06653 null
2025-02-10 Transparent NLP: Using RAG and LLM Alignment for Privacy Q&A Anna Leschanowsky et.al. 2502.06652 null
2025-02-07 Long-VITA: Scaling Large Multi-modal Models to 1 Million Tokens with Leading Short-Context Accuracy Yunhang Shen et.al. 2502.05177 link
2025-02-07 Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach Jonas Geiping et.al. 2502.05171 link
2025-02-07 NoLiMa: Long-Context Evaluation Beyond Literal Matching Ali Modarressi et.al. 2502.05167 link
2025-02-07 Multitwine: Multi-Object Compositing with Text and Layout Control Gemma Canet Tarrés et.al. 2502.05165 null
2025-02-07 DuoGuard: A Two-Player RL-Driven Framework for Multilingual LLM Guardrails Yihe Deng et.al. 2502.05163 link
2025-02-07 A Lightweight Method to Disrupt Memorized Sequences in LLM Parjanya Prajakta Prashant et.al. 2502.05159 null
2025-02-07 Transforming Science with Large Language Models: A Survey on AI-assisted Scientific Discovery, Experimentation, Content Generation, and Evaluation Steffen Eger et.al. 2502.05151 link
2025-02-07 CodeSCM: Causal Analysis for Multi-Modal Code Generation Mukur Gupta et.al. 2502.05150 link
2025-02-07 An Annotated Reading of 'The Singer of Tales' in the LLM Era Kush R. Varshney et.al. 2502.05148 null
2025-02-07 Chest X-ray Foundation Model with Global and Local Representations Integration Zefan Yang et.al. 2502.05142 link
2025-02-07 Refining Integration-by-Parts Reduction of Feynman Integrals with Machine Learning Matt von Hippel et.al. 2502.05121 null
2025-02-07 Flexible and Efficient Grammar-Constrained Decoding Kanghee Park et.al. 2502.05111 null
2025-02-07 Lost in Time: Clock and Calendar Understanding Challenges in Multimodal LLMs Rohit Saxena et.al. 2502.05092 null
2025-02-07 DCFormer: Efficient 3D Vision-Language Modeling with Decomposed Convolutions Gorkem Can Ates et.al. 2502.05091 null
2025-02-07 Mitigating Unintended Memorization with LoRA in Federated Learning for LLMs Thierry Bossy et.al. 2502.05087 link
2025-02-07 Causality can systematically address the monsters under the bench(marks) Felix Leeb et.al. 2502.05085 null
2025-02-07 ChallengeMe: An Adversarial Learning-enabled Text Summarization Framework Xiaoyu Deng et.al. 2502.05084 null
2025-02-07 Adaptive Graph of Thoughts: Test-Time Adaptive Reasoning Unifying Chain, Tree, and Graph Structures Tushar Pandey et.al. 2502.05078 link
2025-02-07 nvAgent: Automated Data Visualization from Natural Language via Collaborative Agent Workflow Geliang Ouyang et.al. 2502.05036 link
2025-02-07 EnseSmells: Deep ensemble and programming language models for automated code smells detection Anh Ho et.al. 2502.05012 link
2025-02-06 Ola: Pushing the Frontiers of Omni-Modal Language Model with Progressive Modality Alignment Zuyan Liu et.al. 2502.04328 link
2025-02-06 Speak Easy: Eliciting Harmful Jailbreaks from LLMs with Simple Interactions Yik Siu Chan et.al. 2502.04322 link
2025-02-06 ConceptAttention: Diffusion Transformers Learn Highly Interpretable Features Alec Helbling et.al. 2502.04320 link
2025-02-06 sshELF: Single-Shot Hierarchical Extrapolation of Latent Features for 3D Reconstruction from Sparse-Views Eyvaz Najafli et.al. 2502.04318 null
2025-02-06 ChamaleonLLM: Batch-Aware Dynamic Low-Rank Adaptation via Inference-Time Clusters Kamer Ali Yuksel et.al. 2502.04315 link
2025-02-06 Great Models Think Alike and this Undermines AI Oversight Shashwat Goel et.al. 2502.04313 link
2025-02-06 ScoreFlow: Mastering LLM Agent Workflows via Score-based Preference Optimization Yinjie Wang et.al. 2502.04306 link
2025-02-06 Beyond Prompt Content: Enhancing LLM Performance via Content-Format Integrated Prompt Optimization Yuanye Liu et.al. 2502.04295 link
2025-02-06 PILAF: Optimal Human Preference Sampling for Reward Modeling Yunzhen Feng et.al. 2502.04270 null
2025-02-06 How does a Multilingual LM Handle Multiple Languages? Santhosh Kakarla et.al. 2502.04269 null
2025-02-06 Cross the Gap: Exposing the Intra-modal Misalignment in CLIP via Modality Inversion Marco Mistretta et.al. 2502.04263 link
2025-02-06 Efficient Randomized Experiments Using Foundation Models Piersilvio De Bartolomeis et.al. 2502.04262 link
2025-02-06 MAGA: MAssive Genre-Audience Reformulation to Pretraining Corpus Expansion Xintong Hao et.al. 2502.04235 null
2025-02-06 Can LLMs Hack Enterprise Networks? Autonomous Assumed Breach Penetration-Testing Active Directory Networks Andreas Happe et.al. 2502.04227 null
2025-02-06 Keep It Light! Simplifying Image Clustering Via Text-Free Adapters Yicen Li et.al. 2502.04226 null
2025-02-06 Éclair -- Extracting Content and Layout with Integrated Reading Order for Documents Ilia Karmanov et.al. 2502.04223 null
2025-02-06 Sports and Women's Sports: Gender Bias in Text Generation with Olympic Data Laura Biester et.al. 2502.04218 null
2025-02-06 Algorithmic causal structure emerging through compression Liang Wendong et.al. 2502.04210 null
2025-02-06 "Short-length" Adversarial Training Helps LLMs Defend "Long-length" Jailbreak Attacks: Theoretical and Empirical Evidence Shaopeng Fu et.al. 2502.04204 link
2025-02-06 The Best Instruction-Tuning Data are Those That Fit Dylan Zhang et.al. 2502.04194 null
2025-02-05 Do Large Language Model Benchmarks Test Reliability? Joshua Vendrow et.al. 2502.03461 link
2025-02-05 Adapt-Pruner: Adaptive Structural Pruning for Efficient Small Language Model Training Boyao Wang et.al. 2502.03460 null
2025-02-05 SKI Models: Skeleton Induced Vision-Language Embeddings for Understanding Activities of Daily Living Arkaprava Sinha et.al. 2502.03459 null
2025-02-05 A Schema-Guided Reason-while-Retrieve framework for Reasoning on Scene Graphs with Large-Language-Models (LLMs) Yiye Chen et.al. 2502.03450 null
2025-02-05 BFS-Prover: Scalable Best-First Tree Search for LLM-based Automatic Theorem Proving Ran Xin et.al. 2502.03438 null
2025-02-05 On Fairness of Unified Multimodal Large Language Model for Image Generation Ming Liu et.al. 2502.03429 null
2025-02-05 Harnessing Large Language Models for Curated Code Reviews Oussama Ben Sghaier et.al. 2502.03425 link
2025-02-05 Think or Step-by-Step? UnZIPping the Black Box in Zero-Shot Prompts Nikta Gohari Sadr et.al. 2502.03418 null
2025-02-05 SPRI: Aligning Large Language Models with Context-Situated Principles Hongli Zhan et.al. 2502.03397 null
2025-02-05 Benchmarking Time Series Forecasting Models: From Statistical Techniques to Foundation Models in Real-World Applications Issar Arab et.al. 2502.03395 null
2025-02-05 LIMO: Less is More for Reasoning Yixin Ye et.al. 2502.03387 link
2025-02-05 Transformers and Their Roles as Time Series Foundation Models Dennis Wu et.al. 2502.03383 null
2025-02-05 High-Fidelity Simultaneous Speech-To-Speech Translation Tom Labiausse et.al. 2502.03382 link
2025-02-05 Demystifying Long Chain-of-Thought Reasoning in LLMs Edward Yeo et.al. 2502.03373 link
2025-02-05 PalimpChat: Declarative and Interactive AI analytics Chunwei Liu et.al. 2502.03368 null
2025-02-05 Minerva: A Programmable Memory Test Benchmark for Language Models Menglin Xia et.al. 2502.03358 null
2025-02-05 RadVLM: A Multitask Conversational Vision-Language Model for Radiology Nicolas Deperrois et.al. 2502.03333 null
2025-02-05 ECM: A Unified Electronic Circuit Model for Explaining the Emergence of In-Context Learning and Chain-of-Thought in Large Language Model Qiguang Chen et.al. 2502.03325 null
2025-02-05 Out-of-Distribution Detection using Synthetic Data Generation Momin Abbas et.al. 2502.03323 null
2025-02-05 Simplifying Formal Proof-Generating Models with ChatGPT and Basic Searching Techniques Sangjun Han et.al. 2502.03321 null
2025-02-04 Articulate AnyMesh: Open-Vocabulary 3D Articulated Objects Modeling Xiaowen Qiu et.al. 2502.02590 null
2025-02-04 COCONut-PanCap: Joint Panoptic Segmentation and Grounded Captions for Fine-Grained Understanding and Generation Xueqing Deng et.al. 2502.02589 null
2025-02-04 A comparison of translation performance between DeepL and Supertext Alex Flückiger et.al. 2502.02577 link
2025-02-04 Are Language Models Up to Sequential Optimization Problems? From Evaluation to a Hegelian-Inspired Enhancement Soheil Abbasloo et.al. 2502.02573 null
2025-02-04 Learning the RoPEs: Better 2D and 3D Position Encodings with STRING Connor Schenck et.al. 2502.02562 null
2025-02-04 Mosaic3D: Foundation Dataset and Model for Open-Vocabulary 3D Segmentation Junha Lee et.al. 2502.02548 null
2025-02-04 LLMs for Generation of Architectural Components: An Exploratory Empirical Study in the Serverless World Shrikara Arun et.al. 2502.02539 null
2025-02-04 Adaptive Self-improvement LLM Agentic System for ML Library Development Genghan Zhang et.al. 2502.02534 link
2025-02-04 Multi-Agent Design: Optimizing Agents with Better Prompts and Topologies Han Zhou et.al. 2502.02533 null
2025-02-04 Satori: Reinforcement Learning with Chain-of-Action-Thought Enhances LLM Reasoning via Autoregressive Search Maohao Shen et.al. 2502.02508 null
2025-02-04 Analyzing Similarity Metrics for Data Selection for Language Model Pretraining Dylan Sam et.al. 2502.02494 null
2025-02-04 EasySpec: Layer-Parallel Speculative Decoding for Efficient Multi-GPU Utilization Yize Wu et.al. 2502.02493 null
2025-02-04 Multilingual Machine Translation with Open Large Language Models at Practical Scale: An Empirical Study Menglong Cui et.al. 2502.02481 null
2025-02-04 Mind the Gap: Evaluating Patch Embeddings from General-Purpose and Histopathology Foundation Models for Cell Segmentation and Classification Valentina Vadori et.al. 2502.02471 link
2025-02-04 Modular Training of Neural Networks aids Interpretability Satvik Golechha et.al. 2502.02470 null
2025-02-04 SAISA: Towards Multimodal Large Language Models with Both Training and Inference Efficiency Qianhao Yuan et.al. 2502.02458 link
2025-02-04 IMDPrompter: Adapting SAM to Image Manipulation Detection by Cross-View Automated Prompt Learning Quan Zhang et.al. 2502.02454 null
2025-02-04 Personalization Toolkit: Training Free Personalization of Large Vision Language Models Soroush Seifi et.al. 2502.02452 null
2025-02-04 Beyond English: Evaluating Automated Measurement of Moral Foundations in Non-English Discourse with a Chinese Case Study Calvin Yixiang Cheng et.al. 2502.02451 link
2025-02-04 Generative Psycho-Lexical Approach for Constructing Value Systems in Large Language Models Haoran Ye et.al. 2502.02444 null
2025-01-31 Low-Rank Adapting Models for Sparse Autoencoders Matthew Chen et.al. 2501.19406 link
2025-01-31 Vintix: Action Model via In-Context Reinforcement Learning Andrey Polubarov et.al. 2501.19400 link
2025-01-31 Scalable-Softmax Is Superior for Attention Ken M. Nakanishi et.al. 2501.19399 null
2025-01-31 Do LLMs Strategically Reveal, Conceal, and Infer Information? A Theoretical and Empirical Analysis in The Chameleon Game Mustafa O. Karabag et.al. 2501.19398 link
2025-02-03 s1: Simple test-time scaling Niklas Muennighoff et.al. 2501.19393 link
2025-01-31 Cache Me If You Must: Adaptive Key-Value Quantization for Large Language Models Alina Shutova et.al. 2501.19392 link
2025-01-31 Federated Sketching LoRA: On-Device Collaborative Fine-Tuning of Large Language Models Wenzhi Fang et.al. 2501.19389 link
2025-01-31 Decoding-based Regression Xingyou Song et.al. 2501.19383 link
2025-01-31 TableMaster: A Recipe to Advance Table Understanding with Language Models Lang Cao et.al. 2501.19378 null
2025-02-03 SELMA: A Speech-Enabled Language Model for Virtual Assistant Interactions Dominik Wagner et.al. 2501.19377 null
2025-01-31 We're Different, We're the Same: Creative Homogeneity Across LLMs Emily Wenger et.al. 2501.19361 null
2025-01-31 Mechanical Properties of the Meninges: Large Language Model Assisted Systematic Review of over 25,000 Studies Brandon P. Chelstrom et.al. 2501.19359 null
2025-01-31 The Energy Loss Phenomenon in RLHF: A New Perspective on Mitigating Reward Hacking Yuchun Miao et.al. 2501.19358 null
2025-01-31 Towards Adaptive Self-Improvement for Smarter Energy Systems Alexander Sommer et.al. 2501.19340 null
2025-01-31 PixelWorld: Towards Perceiving Everything as Pixels Zhiheng Lyu et.al. 2501.19339 null
2025-01-31 Homogeneity Bias as Differential Sampling Uncertainty in Language Models Messi H. J. Lee et.al. 2501.19337 null
2025-01-31 Reward-Guided Speculative Decoding for Efficient LLM Reasoning Baohao Liao et.al. 2501.19324 null
2025-01-31 MINDSTORES: Memory-Informed Neural Decision Synthesis for Task-Oriented Reinforcement in Embodied Systems Anirudh Chari et.al. 2501.19318 null
2025-01-31 LLM-based Affective Text Generation Quality Based on Different Quantization Values Yarik Menchaca Resendiz et.al. 2501.19317 null
2025-01-31 An Efficient Approach for Machine Translation on Low-resource Languages: A Case Study in Vietnamese-Chinese Tran Ngoc Son et.al. 2501.19314 null
2025-01-30 Foundational Models for 3D Point Clouds: A Survey and Outlook Vishal Thengane et.al. 2501.18594 null
2025-01-30 Advances in Multimodal Adaptation and Generalization: From Traditional Approaches to Foundation Models Hao Dong et.al. 2501.18592 link
2025-01-30 Thoughts Are All Over the Place: On the Underthinking of o1-Like LLMs Yue Wang et.al. 2501.18585 null
2025-01-30 Prediction-Powered Inference with Imputed Covariates and Nonuniform Sampling Dan M. Kluger et.al. 2501.18577 link
2025-01-30 Token-Hungry, Yet Precise: DeepSeek R1 Highlights the Need for Multi-Step Reasoning Over Speed in MATH Evgenii Evstafev et.al. 2501.18576 null
2025-01-30 BounTCHA: A CAPTCHA Utilizing Boundary Identification in AI-extended Videos Lehao Lin et.al. 2501.18565 null
2025-01-30 SAM2Act: Integrating Visual Foundation Model with A Memory Architecture for Robotic Manipulation Haoquan Fang et.al. 2501.18564 link
2025-01-30 Semantic Web and Creative AI -- A Technical Report from ISWS 2023 Raia Abu Ahmad et.al. 2501.18542 null
2025-01-30 Loss Functions and Operators Generated by f-Divergences Vincent Roulet et.al. 2501.18537 null
2025-01-30 Illusions of Relevance: Using Content Injection Attacks to Deceive Retrievers, Rerankers, and LLM Judges Manveer Singh Tamber et.al. 2501.18536 link
2025-01-30 Rethinking Bottlenecks in Safety Fine-Tuning of Vision Language Models Yi Ding et.al. 2501.18533 null
2025-01-30 Differentially Private Steering for Large Language Model Alignment Anmol Goel et.al. 2501.18532 link
2025-01-30 Learn from the Past: Language-conditioned Object Rearrangement with Large Language Models Guanqun Cao et.al. 2501.18516 null
2025-01-30 Streaming DiLoCo with overlapping communication: Towards a Distributed Free Lunch Arthur Douillard et.al. 2501.18512 null
2025-01-30 WILDCHAT-50M: A Deep Dive Into the Role of Synthetic Data in Post-Training Benjamin Feuer et.al. 2501.18511 link
2025-01-30 CLEAR: Cue Learning using Evolution for Accurate Recognition Applied to Sustainability Data Extraction Peter J. Bentley et.al. 2501.18504 null
2025-01-30 A Tool for In-depth Analysis of Code Execution Reasoning of Large Language Models Changshu Liu et.al. 2501.18482 null
2025-01-30 CLoQ: Enhancing Fine-Tuning of Quantized LLMs via Calibrated LoRA Initialization Yanxia Deng et.al. 2501.18475 null
2025-01-30 Tuning Vision Foundation Model via Test-Time Prompt-Guided Training for VFSS Segmentations Chengxi Zeng et.al. 2501.18474 null
2025-01-30 A Benchmark and Evaluation for Real-World Out-of-Distribution Detection Using Vision-Language Models Shiho Noda et.al. 2501.18463 link
2025-01-29 Learning Beyond the Surface: How Far Can Continual Pre-Training with LoRA Enhance LLMs' Domain-Specific Insight Learning? Pouya Pezeshkpour et.al. 2501.17840 link
2025-01-29 Matrix Product Sketching via Coordinated Sampling Majid Daliri et.al. 2501.17836 null
2025-01-29 Aggregation Schemes for Single-Vector WSI Representation Learning in Digital Pathology Sobhan Hemati et.al. 2501.17822 null
2025-01-29 Leveraging Multimodal LLM for Inspirational User Interface Search Seokhyeon Park et.al. 2501.17799 link
2025-01-29 BreezyVoice: Adapting TTS for Taiwanese Mandarin with Enhanced Polyphone Disambiguation -- Challenges and Insights Chan-Jan Hsu et.al. 2501.17790 null
2025-01-29 Reasoning Over the Glyphs: Evaluation of LLM's Decipherment of Rare Scripts Yu-Fei Shih et.al. 2501.17785 null
2025-01-29 AdditiveLLM: Large Language Models Predict Defects in Additive Manufacturing Peter Pak et.al. 2501.17784 null
2025-01-29 2SSP: A Two-Stage Framework for Structured Pruning of LLMs Fabrizio Sandri et.al. 2501.17771 link
2025-01-29 Hybrid Graphs for Table-and-Text based Question Answering using LLMs Ankush Agarwal et.al. 2501.17767 null
2025-01-29 On the Partitioning of GPU Power among Multi-Instances Tirth Vamja et.al. 2501.17752 null
2025-01-29 Early External Safety Testing of OpenAI's o3-mini: Insights from the Pre-Deployment Evaluation Aitor Arrieta et.al. 2501.17749 null
2025-01-29 A technical review of multi-omics data integration methods: from classical statistical to deep generative approaches Ana R. Baião et.al. 2501.17729 null
2025-01-29 Using Code Generation to Solve Open Instances of Combinatorial Design Problems Christopher D. Rosin et.al. 2501.17725 link
2025-01-29 RICoTA: Red-teaming of In-the-wild Conversation with Test Attempts Eujeong Choi et.al. 2501.17715 link
2025-01-29 Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate Yubo Wang et.al. 2501.17703 null
2025-01-29 Planning with Vision-Language Models and a Use Case in Robot-Assisted Teaching Xuzhe Dang et.al. 2501.17665 null
2025-01-29 Exploring Vision Language Models for Multimodal and Multilingual Stance Detection Jake Vasilakes et.al. 2501.17654 null
2025-01-29 Tonguescape: Exploring Language Models Understanding of Vowel Articulation Haruki Sakajo et.al. 2501.17643 link
2025-01-29 Efficient Redundancy Reduction for Open-Vocabulary Semantic Segmentation Lin Chen et.al. 2501.17642 null
2025-01-29 In-Context Meta LoRA Generation Yihua Shao et.al. 2501.17635 null
2025-01-28 SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training Tianzhe Chu et.al. 2501.17161 null
2025-01-28 AxBench: Steering LLMs? Even Simple Baselines Outperform Sparse Autoencoders Zhengxuan Wu et.al. 2501.17148 link
2025-01-28 FactCG: Enhancing Fact Checkers with Graph-Based Multi-Hop Data Deren Lei et.al. 2501.17144 link
2025-01-28 ASTRAL: Automated Safety Testing of Large Language Models Miriam Ugarte et.al. 2501.17132 null
2025-01-28 Scenario Understanding of Traffic Scenes Through Large Visual Language Models Rivera Esteban et.al. 2501.17131 null
2025-01-28 Histoires Morales: A French Dataset for Assessing Moral Alignment Thibaud Leteno et.al. 2501.17117 link
2025-01-28 Optimizing Large Language Model Training Using FP4 Quantization Ruizhe Wang et.al. 2501.17116 null
2025-01-28 Unlocking Transparent Alignment Through Enhanced Inverse Constitutional AI for Principle Extraction Carl-Leander Henneking et.al. 2501.17112 null
2025-01-28 COS(M+O)S: Curiosity and RL-Enhanced MCTS for Exploring Story Space via Language Models Tobias Materzok et.al. 2501.17104 null
2025-01-28 Token-by-Token Regeneration and Domain Biases: A Benchmark of LLMs on Advanced Mathematical Problem-Solving Evgenii Evstafev et.al. 2501.17084 null
2025-01-28 Contextual Self-paced Learning for Weakly Supervised Spatio-Temporal Video Grounding Akash Kumar et.al. 2501.17053 null
2025-01-28 How Linguistics Learned to Stop Worrying and Love the Language Models Richard Futrell et.al. 2501.17047 null
2025-01-28 Enhanced Retrieval of Long Documents: Leveraging Fine-Grained Block Representations with Large Language Models Minghan Li et.al. 2501.17039 null
2025-01-28 Challenges in Ensuring AI Safety in DeepSeek-R1 Models: The Shortcomings of Reinforcement Learning Strategies Manojkumar Parmar et.al. 2501.17030 null
2025-01-28 Automated Refactoring of Non-Idiomatic Python Code: A Differentiated Replication with LLMs Alessandro Midolo et.al. 2501.17024 link
2025-01-28 Mobile Manipulation Instruction Generation from Multiple Images with Automatic Metric Enhancement Kei Katsumata et.al. 2501.17022 link
2025-01-28 Large Language Models for Code Generation: The Practitioners Perspective Zeeshan Rasheed et.al. 2501.16998 link
2025-01-28 Artificial Intelligence Clones Annie Liang et.al. 2501.16996 null
2025-01-28 FedEFM: Federated Endovascular Foundation Model with Unseen Data Tuong Do et.al. 2501.16992 null
2025-01-28 Modulating CNN Features with Pre-Trained ViT Representations for Open-Vocabulary Object Detection Xiangyu Gao et.al. 2501.16981 null
2025-01-27 LUCY: Linguistic Understanding and Control Yielding Early Stage of Her Heting Gao et.al. 2501.16327 link
2025-01-27 Evaluating The Performance of Using Large Language Models to Automate Summarization of CT Simulation Orders in Radiation Oncology Meiyun Cao et.al. 2501.16309 null
2025-01-27 RAPID: Retrieval-Augmented Parallel Inference Drafting for Text-Based Video Event Retrieval Long Nguyen et.al. 2501.16303 null
2025-01-27 Matryoshka Re-Ranker: A Flexible Re-Ranking Architecture With Configurable Depth and Width Zheng Liu et.al. 2501.16302 null
2025-01-27 Large Models in Dialogue for Active Perception and Anomaly Detection Tzoulio Chamiti et.al. 2501.16300 link
2025-01-27 FALCON: Resolving Visual Redundancy and Fragmentation in High-resolution Multimodal Large Language Models via Visual Registers Renshan Zhang et.al. 2501.16297 null
2025-01-27 Brain-Adapter: Enhancing Neurological Disorder Analysis with Adapter-Tuning Multimodal Large Language Models Jing Zhang et.al. 2501.16282 null
2025-01-27 Do LLMs Have Visualization Literacy? An Evaluation on Modified Visualizations to Test Generalization in Data Interpretation Jiayi Hong et.al. 2501.16277 link
2025-01-27 URAG: Implementing a Unified Hybrid RAG for Precise Answers in University Admission Chatbots -- A Case Study at HCMUT Long Nguyen et.al. 2501.16276 null
2025-01-27 Return of the Encoder: Maximizing Parameter Efficiency for SLMs Mohamed Elfeki et.al. 2501.16273 link
2025-01-27 A foundation model for human-AI collaboration in medical literature mining Zifeng Wang et.al. 2501.16255 null
2025-01-27 Multi-Agent Geospatial Copilots for Remote Sensing Workflows Chaehong Lee et.al. 2501.16254 null
2025-01-27 Zero-Shot Decision Tree Construction via Large Language Models Lucas Carrasco et.al. 2501.16247 null
2025-01-27 CLISC: Bridging clip and sam by enhanced cam for unsupervised brain tumor segmentation Xiaochuan Ma et.al. 2501.16246 null
2025-01-27 Phase Transitions in Large Language Models and the $O(N)$ Model Youran Sun et.al. 2501.16241 null
2025-01-27 AiGet: Transforming Everyday Moments into Hidden Knowledge Discovery with AI Assistance on Smart Glasses Runze Cai et.al. 2501.16240 link
2025-01-27 Distilling foundation models for robust and efficient models in digital pathology Alexandre Filiot et.al. 2501.16239 null
2025-01-27 Language-Based Bayesian Optimization Research Assistant (BORA) Abdoulatif Cissé et.al. 2501.16224 null
2025-01-27 Enhancing Visual Inspection Capability of Multi-Modal Large Language Models on Medical Time Series with Supportive Conformalized and Interpretable Small Specialized Models Huayu Li et.al. 2501.16215 link
2025-01-27 Provence: efficient and robust context pruning for retrieval-augmented generation Nadezhda Chirkova et.al. 2501.16214 null
2025-01-24 HERMES: A Unified Self-Driving World Model for Simultaneous 3D Scene Understanding and Generation Xin Zhou et.al. 2501.14729 link
2025-01-24 Do LLMs Provide Consistent Answers to Health-Related Questions across Languages? Ipek Baris Schlicht et.al. 2501.14719 null
2025-01-24 Towards Better Understanding Table Instruction Tuning: Decoupling the Effects from Data versus Models Naihao Deng et.al. 2501.14717 null
2025-01-24 FlexiGPT: Pruning and Extending Large Language Models with Low-Rank Weight Sharing James Seale Smith et.al. 2501.14713 null
2025-01-24 The Karp Dataset Mason DiCicco et.al. 2501.14705 null
2025-01-24 Rethinking Table Instruction Tuning Naihao Deng et.al. 2501.14693 null
2025-01-24 Rethinking Foundation Models for Medical Image Classification through a Benchmark Study on MedMNIST Fuping Wu et.al. 2501.14685 null
2025-01-24 An Empirical Study on LLM-based Classification of Requirements-related Provisions in Food-safety Regulations Shabnam Hassani et.al. 2501.14683 null
2025-01-24 Diffusion based Text-to-Music Generation with Global and Local Text based Conditioning Jisi Zhang et.al. 2501.14680 null
2025-01-24 MedAgentBench: Dataset for Benchmarking LLMs as Agents in Medical Applications Yixing Jiang et.al. 2501.14654 link
2025-01-24 Investigating the (De)Composition Capabilities of Large Language Models in Natural-to-Formal Language Conversion Ziyao Xu et.al. 2501.14649 link
2025-01-24 Recommending Actionable Strategies: A Semantic Approach to Integrating Analytical Frameworks with Decision Heuristics Renato Ghisellini et.al. 2501.14634 null
2025-01-24 Extracting Problem Structure with LLMs for Optimized SAT Local Search André Schilder et.al. 2501.14630 null
2025-01-24 ReferDINO: Referring Video Object Segmentation with Visual Grounding Foundations Tianming Liang et.al. 2501.14607 null
2025-01-24 Knowledge Graphs Construction from Criminal Court Appeals: Insights from the French Cassation Court Alexander V. Belikov et.al. 2501.14579 null
2025-01-24 ZETA: Leveraging Z-order Curves for Efficient Top-k Attention Qiuhao Zeng et.al. 2501.14577 null
2025-01-24 Large-scale and Fine-grained Vision-language Pre-training for Enhanced CT Image Understanding Zhongyi Shui et.al. 2501.14548 link
2025-01-24 Leveraging ChatGPT's Multimodal Vision Capabilities to Rank Satellite Images by Poverty Level: Advancing Tools for Social Science Research Hamid Sarmadi et.al. 2501.14546 null
2025-01-24 VERUS-LM: a Versatile Framework for Combining LLMs with Symbolic Reasoning Benjamin Callewaert et.al. 2501.14540 null
2025-01-24 Design and Implementation of a Psychiatry Resident Training System Based on Large Language Models Zhenguang Zhong et.al. 2501.14530 link
2025-01-23 CRPO: Confidence-Reward Driven Preference Optimization for Machine Translation Guofeng Cui et.al. 2501.13927 null
2025-01-23 The Breeze 2 Herd of Models: Traditional Chinese LLMs Based on Llama with Vision-Aware and Function-Calling Capabilities Chan-Jan Hsu et.al. 2501.13921 link
2025-01-23 Analysis of Indic Language Capabilities in LLMs Aatman Vaidya et.al. 2501.13912 null
2025-01-23 Privacy-Preserving Personalized Federated Prompt Learning for Multimodal Large Language Models Linh Tran et.al. 2501.13904 null
2025-01-23 Exploring Finetuned Audio-LLM on Heart Murmur Features Adrian Florea et.al. 2501.13884 null
2025-01-23 The machine learning platform for developers of large systems Alexey Naikov et.al. 2501.13881 null
2025-01-23 A RAG-Based Institutional Assistant Gustavo Kuratomi et.al. 2501.13880 null
2025-01-23 Dual-Modal Prototype Joint Learning for Compositional Zero-Shot Learning Shiyu Zhang et.al. 2501.13859 null
2025-01-23 Large Vision-Language Models for Knowledge-Grounded Data Annotation of Memes Shiling Deng et.al. 2501.13851 link
2025-01-23 Think Outside the Data: Colonial Biases and Systemic Issues in Automated Moderation Pipelines for Low-Resource Languages Farhana Shahid et.al. 2501.13836 null
2025-01-23 On the Reasoning Capacity of AI Models and How to Quantify It Santosh Kumar Radha et.al. 2501.13833 null
2025-01-23 Predicting Compact Phrasal Rewrites with Large Language Models for ASR Post Editing Hao Zhang et.al. 2501.13831 null
2025-01-23 Hallucinations Can Improve Large Language Models in Drug Discovery Shuzhou Yuan et.al. 2501.13824 null
2025-01-23 Large Language Model driven Policy Exploration for Recommender Systems Jie Wang et.al. 2501.13816 null
2025-01-23 Enhancing LLMs for Governance with Human Oversight: Evaluating and Aligning LLMs on Expert Classification of Climate Misinformation for Detecting False or Misleading Claims about Climate Change Mowafak Allaham et.al. 2501.13802 null
2025-01-23 PromptMono: Cross Prompting Attention for Self-Supervised Monocular Depth Estimation in Challenging Environments Changhao Wang et.al. 2501.13796 null
2025-01-23 Training-Free Zero-Shot Temporal Action Detection with Vision-Language Models Chaolei Han et.al. 2501.13795 link
2025-01-23 Parameter-Efficient Fine-Tuning for Foundation Models Dan Zhang et.al. 2501.13787 link
2025-01-23 Not Every AI Problem is a Data Problem: We Should Be Intentional About Data Scaling Tanya Rodchenko et.al. 2501.13779 null
2025-01-23 Explainable XR: Understanding User Behaviors of XR Environments using LLM-assisted Analytics Framework Yoonsang Kim et.al. 2501.13778 link
2025-01-22 VideoLLaMA 3: Frontier Multimodal Foundation Models for Image and Video Understanding Boqiang Zhang et.al. 2501.13106 link
2025-01-22 Refining Input Guardrails: Enhancing LLM-as-a-Judge Efficiency Through Chain-of-Thought Fine-Tuning and Alignment Melissa Kazemi Rad et.al. 2501.13080 null
2025-01-22 Autonomy-of-Experts Models Ang Lv et.al. 2501.13074 null
2025-01-22 Does Table Source Matter? Benchmarking and Improving Multimodal Scientific Table Understanding and Reasoning Bohao Yang et.al. 2501.13042 link
2025-01-22 Pairwise RM: Perform Best-of-N Sampling with Knockout Tournament Yantao Liu et.al. 2501.13007 link
2025-01-22 Large Language Model-Based Semantic Communication System for Image Transmission Soheyb Ribouh et.al. 2501.12988 null
2025-01-22 LLM4WM: Adapting LLM for Wireless Multi-Tasking Xuanyu Liu et.al. 2501.12983 null
2025-01-22 OnionEval: An Unified Evaluation of Fact-conflicting Hallucination for Small-Large Language Models Chongren Sun et.al. 2501.12975 link
2025-01-22 Accessible Smart Contracts Verification: Synthesizing Formal Models with Tamed LLMs Jan Corazza et.al. 2501.12972 link
2025-01-22 It's complicated. The relationship of algorithmic fairness and non-discrimination regulations in the EU AI Act Kristof Meding et.al. 2501.12962 null
2025-01-22 Efficient Prompt Compression with Evaluator Heads for Long-Context Transformer Inference Weizhi Fei et.al. 2501.12959 null
2025-01-22 GANQ: GPU-Adaptive Non-Uniform Quantization for Large Language Models Pengxiang Zhao et.al. 2501.12956 null
2025-01-22 Correctness Assessment of Code Generated by Large Language Models Using Internal Representations Tuan-Dung Bui et.al. 2501.12934 link
2025-01-22 DynamicEarth: How Far are We from Open-Vocabulary Change Detection? Kaiyu Li et.al. 2501.12931 null
2025-01-22 A Functional Software Reference Architecture for LLM-Integrated Systems Alessio Bucaioni et.al. 2501.12904 null
2025-01-22 Architectural Fusion Through Contextual Partitioning in Large Language Models: A Novel Approach to Parameterized Knowledge Integration Offa Kingsleigh et.al. 2501.12901 null
2025-01-22 Test-Time Preference Optimization: On-the-Fly Alignment via Iterative Textual Feedback Yafu Li et.al. 2501.12895 link
2025-01-22 Generative AI Misuse Potential in Cyber Security Education: A Case Study of a UK Degree Program Carlton Shepherd et.al. 2501.12883 null
2025-01-22 WisdomBot: Tuning Large Language Models with Artificial Intelligence Knowledge Jingyuan Chen et.al. 2501.12877 null
2025-01-22 HierPromptLM: A Pure PLM-based Framework for Representation Learning on Heterogeneous Text-rich Networks Qiuyu Zhu et.al. 2501.12857 null
2025-01-21 InternVideo2.5: Empowering Video MLLMs with Long and Rich Context Modeling Yi Wang et.al. 2501.12386 link
2025-01-21 MMVU: Measuring Expert-Level Multi-Discipline Video Understanding Yilun Zhao et.al. 2501.12380 link
2025-01-21 Expertise elevates AI usage: experimental evidence comparing laypeople and professional artists Thomas F. Eisenmann et.al. 2501.12374 link
2025-01-21 Is Long Context All You Need? Leveraging LLM's Extended Context for NL2SQL Yeounoh Chung et.al. 2501.12372 link
2025-01-21 Parameters vs FLOPs: Scaling Laws for Optimal Sparsity for Mixture-of-Experts Language Models Samira Abnar et.al. 2501.12370 null
2025-01-21 InternLM-XComposer2.5-Reward: A Simple Yet Effective Multi-Modal Reward Model Yuhang Zang et.al. 2501.12368 link
2025-01-21 Vision-Language Models for Automated Chest X-ray Interpretation: Leveraging ViT and GPT-2 Md. Rakibul Islam et.al. 2501.12356 null
2025-01-21 Automatic Labelling with Open-source LLMs using Dynamic Label Schema Integration Thomas Walshe et.al. 2501.12332 null
2025-01-21 Cinepro: Robust Training of Foundation Models for Cancer Detection in Prostate Ultrasound Cineloops Mohamed Harmanani et.al. 2501.12331 link
2025-01-21 VARGPT: Unified Understanding and Generation in a Visual Autoregressive Multimodal Large Language Model Xianwei Zhuang et.al. 2501.12327 link
2025-01-21 LLM-Assisted Knowledge Graph Completion for Curriculum and Domain Modelling in Personalized Higher Education Recommendations Hasan Abu-Rasheed et.al. 2501.12300 null
2025-01-21 MoGERNN: An Inductive Traffic Predictor for Unobserved Locations in Dynamic Sensing Networks Qishen Zhou et.al. 2501.12281 link
2025-01-21 Condor: Enhance LLM Alignment with Knowledge-Driven Data Synthesis and Refinement Maosong Cao et.al. 2501.12273 link
2025-01-21 CBVLM: Training-free Explainable Concept-based Large Vision Language Models for Medical Image Classification Cristiano Patrício et.al. 2501.12266 null
2025-01-21 FOCUS: First Order Concentrated Updating Scheme Yizhou Liu et.al. 2501.12243 null
2025-01-21 InsTALL: Context-aware Instructional Task Assistance with Multi-modal Large Language Models Pha Nguyen et.al. 2501.12231 null
2025-01-21 CDW-CoT: Clustered Distance-Weighted Chain-of-Thoughts Reasoning Yuanheng Fang et.al. 2501.12226 null
2025-01-21 Leveraging Large Language Models for Realizing Truly Intelligent User Interfaces Allard Oelen et.al. 2501.12221 null
2025-01-21 You Can't Eat Your Cake and Have It Too: The Performance Degradation of LLMs with Jailbreak Defense Wuyuao Mai et.al. 2501.12210 null
2025-01-21 Fixing Imbalanced Attention to Mitigate In-Context Hallucination of Large Vision-Language Model Kazi Hasan Ibn Arif et.al. 2501.12206 link
2025-01-17 FaceXBench: Evaluating Multimodal LLMs on Face Understanding Kartik Narayan et.al. 2501.10360 link
2025-01-17 Agent4Edu: Generating Learner Response Data by Generative Agents for Intelligent Education Systems Weibo Gao et.al. 2501.10332 link
2025-01-17 BoK: Introducing Bag-of-Keywords Loss for Interpretable Dialogue Response Generation Suvodip Dey et.al. 2501.10328 link
2025-01-17 Large language models for automated scholarly paper review: A survey Zhenzhen Zhuang et.al. 2501.10326 null
2025-01-17 Hierarchical Autoregressive Transformers: Combining Byte- and Word-Level Processing for Robust, Adaptable Language Models Pit Neitemeier et.al. 2501.10322 null
2025-01-17 HiMix: Reducing Computational Complexity in Large Vision-Language Models Xuange Zhang et.al. 2501.10318 null
2025-01-17 Addressing Popularity Bias in Third-Party Library Recommendations Using LLMs Claudio Di Sipio et.al. 2501.10313 null
2025-01-17 Computational Protein Science in the Era of Large Language Models (LLMs) Wenqi Fan et.al. 2501.10282 null
2025-01-17 Test Wars: A Comparative Study of SBST, Symbolic Execution, and LLM-Based Approaches to Unit Test Generation Azat Abdullin et.al. 2501.10200 null
2025-01-17 Generative Artificial Intelligence: Implications for Biomedical and Health Professions Education William Hersh et.al. 2501.10186 null
2025-01-17 Multi-stage Training of Bilingual Islamic LLM for Neural Passage Retrieval Vera Pavlova et.al. 2501.10175 null
2025-01-17 Dual Debiasing: Remove Stereotypes and Keep Factual Gender for Fair Language Modeling and Translation Tomasz Limisiewicz et.al. 2501.10150 null
2025-01-17 A Vision-Language Framework for Multispectral Scene Representation Using Language-Grounded Features Enes Karanfil et.al. 2501.10144 null
2025-01-17 Exploring the Impact of Generative Artificial Intelligence in Education: A Thematic Analysis Abhishek Kaushik et.al. 2501.10134 null
2025-01-17 ComplexFuncBench: Exploring Multi-Step and Constrained Function Calling under Long-Context Scenario Lucen Zhong et.al. 2501.10132 link
2025-01-17 PaSa: An LLM Agent for Comprehensive Academic Paper Search Yichen He et.al. 2501.10120 link
2025-01-17 LLM Reasoner and Automated Planner: A new NPC approach Israel Puerta-Merino et.al. 2501.10106 null
2025-01-17 Universal Actions for Enhanced Embodied Foundation Models Jinliang Zheng et.al. 2501.10105 link
2025-01-17 Few-shot Structure-Informed Machinery Part Segmentation with Foundation Models and Graph Neural Networks Michael Schwingshackl et.al. 2501.10080 link
2025-01-17 SpatialCoT: Advancing Spatial Reasoning through Coordinate Alignment and Chain-of-Thought for Embodied Task Planning Yuecheng Liu et.al. 2501.10074 null
2025-01-16 Distilling Multi-modal Large Language Models for Autonomous Driving Deepti Hegde et.al. 2501.09757 null
2025-01-16 Lost in Translation, Found in Context: Sign Language Translation with Contextual Cues Youngjoon Jang et.al. 2501.09754 null
2025-01-16 OmniThink: Expanding Knowledge Boundaries in Machine Writing through Thinking Zekun Xi et.al. 2501.09751 link
2025-01-16 Enhancing Lexicon-Based Text Embeddings with Large Language Models Yibin Lei et.al. 2501.09749 null
2025-01-16 Suggesting Code Edits in Interactive Machine Learning Notebooks Using Large Language Models Bihui Jin et.al. 2501.09745 null
2025-01-16 Inference-Time Scaling for Diffusion Models beyond Scaling Denoising Steps Nanye Ma et.al. 2501.09732 null
2025-01-16 A Simple Aerial Detection Baseline of Multimodal Language Models Qingyun Li et.al. 2501.09720 link
2025-01-16 CyberMentor: AI Powered Learning Tool Platform to Address Diverse Student Needs in Cybersecurity Education Tianyu Wang et.al. 2501.09709 link
2025-01-16 Domain Adaptation of Foundation LLMs for e-Commerce Christian Herold et.al. 2501.09706 null
2025-01-16 Cueless EEG imagined speech for subject identification: dataset and benchmarks Ali Derakhshesh et.al. 2501.09700 link
2025-01-16 Mitigating Hallucinations in Large Vision-Language Models via DPO: On-Policy Data Hold the Key Zhihe Yang et.al. 2501.09695 link
2025-01-16 Simulated Interactive Debugging Yannic Noller et.al. 2501.09694 null
2025-01-16 Towards Large Reasoning Models: A Survey of Reinforced Reasoning with Large Language Models Fengli Xu et.al. 2501.09686 null
2025-01-16 Reward-Guided Controlled Generation for Inference-Time Alignment in Diffusion Models: Tutorial and Review Masatoshi Uehara et.al. 2501.09685 null
2025-01-16 Robin: a Suite of Multi-Scale Vision-Language Models and the CHIRP Evaluation Benchmark Alexis Roger et.al. 2501.09672 null
2025-01-16 A Survey of Research in Large Language Models for Electronic Design Automation Jingyu Pan et.al. 2501.09655 null
2025-01-16 The Heap: A Contamination-Free Multilingual Code Dataset for Evaluating Large Language Models Jonathan Katzy et.al. 2501.09653 null
2025-01-16 CarMem: Enhancing Long-Term Memory in LLM Voice Assistants through Category-Bounding Johannes Kirmayr et.al. 2501.09645 link
2025-01-16 LLM-Based Routing in Mixture of Experts: A Novel Framework for Trading Kuan-Ming Liu et.al. 2501.09636 null
2025-01-16 Empowering Large Language Models in Wireless Communication: A Novel Dataset and Fine-Tuning Framework Yushen Lin et.al. 2501.09631 null
2025-01-15 Towards Fast, Specialized Machine Learning Force Fields: Distilling Foundation Models via Energy Hessians Ishan Amin et.al. 2501.09009 link
2025-01-15 Aegis2.0: A Diverse AI Safety Dataset and Risks Taxonomy for Alignment of LLM Guardrails Shaona Ghosh et.al. 2501.09004 null
2025-01-15 Vision Foundation Models for Computed Tomography Suraj Pai et.al. 2501.09001 link
2025-01-15 CityLoc: 6 DoF Localization of Text Descriptions in Large-Scale Scenes with Gaussian Representation Qi Ma et.al. 2501.08982 null
2025-01-15 Development and Validation of the Provider Documentation Summarization Quality Instrument for Large Language Models Emma Croxford et.al. 2501.08977 null
2025-01-15 Learning to Extract Cross-Domain Aspects and Understanding Sentiments Using Large Language Models Karukriti Kaushik Ghosh et.al. 2501.08974 null
2025-01-15 Analyzing the Ethical Logic of Six Large Language Models W. Russell Neuman et.al. 2501.08951 null
2025-01-15 Applying General Turn-taking Models to Conversational Human-Robot Interaction Gabriel Skantze et.al. 2501.08946 null
2025-01-15 Disentangling Exploration of Large Language Models by Optimal Exploitation Tim Grams et.al. 2501.08925 null
2025-01-15 GenAI Content Detection Task 3: Cross-Domain Machine-Generated Text Detection Challenge Liam Dugan et.al. 2501.08913 link
2025-01-15 Leveraging Large Language Models as Knowledge-Driven Agents for Reliable Retrosynthesis Planning Qinyu Ma et.al. 2501.08897 link
2025-01-15 Generative Planning with 3D-vision Language Pre-training for End-to-End Autonomous Driving Tengpeng Li et.al. 2501.08861 link
2025-01-15 Exploring Task-Level Optimal Prompts for Visual In-Context Learning Yan Zhu et.al. 2501.08841 null
2025-01-15 IDEA: Image Description Enhanced CLIP-Adapter Zhipeng Ye et.al. 2501.08816 link
2025-01-15 How Developers Interact with AI: A Taxonomy of Human-AI Collaboration in Software Engineering Christoph Treude et.al. 2501.08774 null
2025-01-15 Admitting Ignorance Helps the Video Question Answering Models to Answer Haopeng Li et.al. 2501.08771 null
2025-01-15 Enhanced Large Language Models for Effective Screening of Depression and Anxiety June M. Liu et.al. 2501.08769 null
2025-01-15 Leveraging LLM Agents for Translating Network Configurations Yunze Wei et.al. 2501.08760 null
2025-01-15 Expanding Vietnamese SentiWordNet to Improve Performance of Vietnamese Sentiment Analysis Models Hong-Viet Tran et.al. 2501.08758 null
2025-01-15 The Inherent Limits of Pretrained LLMs: The Unexpected Convergence of Instruction Tuning and In-Context Learning Capabilities Irina Bigoulaeva et.al. 2501.08716 link
2025-01-14 PokerBench: Training Large Language Models to become Professional Poker Players Richard Zhuang et.al. 2501.08328 link
2025-01-14 Omni-RGPT: Unifying Image and Video Region-level Understanding via Token Marks Miran Heo et.al. 2501.08326 null
2025-01-14 ADAM-1: AI and Bioinformatics for Alzheimer's Detection and Microbiome-Clinical Data Integrations Ziyuan Huang et.al. 2501.08324 null
2025-01-14 Exploring Robustness of Multilingual LLMs on Real-World Noisy Data Amirhossein Aliakbarzadeh et.al. 2501.08322 link
2025-01-14 Enhancing Automated Interpretability with Output-Centric Feature Descriptions Yoav Gur-Arieh et.al. 2501.08319 link
2025-01-14 MiniMax-01: Scaling Foundation Models with Lightning Attention MiniMax et.al. 2501.08313 null
2025-01-14 HALoGEN: Fantastic LLM Hallucinations and Where to Find Them Abhilasha Ravichander et.al. 2501.08292 null
2025-01-14 LLaVA-ST: A Multimodal Large Language Model for Fine-Grained Spatial-Temporal Understanding Hongyu Li et.al. 2501.08282 link
2025-01-14 Exploring Robustness of LLMs to Sociodemographically-Conditioned Paraphrasing Pulkit Arora et.al. 2501.08276 null
2025-01-14 Addressing the sustainable AI trilemma: a case study on LLM agents and RAG Hui Wu et.al. 2501.08262 link
2025-01-14 Eliciting In-context Retrieval and Reasoning for Long-context Large Language Models Yifu Qiu et.al. 2501.08248 null
2025-01-14 Text-Diffusion Red-Teaming of Large Language Models: Unveiling Harmful Behaviors with Proximity Constraints Jonathan Nöther et.al. 2501.08246 null
2025-01-14 Investigating Energy Efficiency and Performance Trade-offs in LLM Inference Across Tasks and DVFS Settings Paul Joe Maliakel et.al. 2501.08219 null
2025-01-14 ASTRID -- An Automated and Scalable TRIaD for the Evaluation of RAG-based Clinical Question Answering Systems Mohita Chowdhury et.al. 2501.08208 null
2025-01-14 ArithmAttack: Evaluating Robustness of LLMs to Noisy Context in Math Problem Solving Zain Ul Abedin et.al. 2501.08203 null
2025-01-14 CWEval: Outcome-driven Evaluation on Functionality and Security of LLM Code Generation Jinjun Peng et.al. 2501.08200 link
2025-01-14 OpenCSG Chinese Corpus: A Series of High-quality Chinese Datasets for LLM Training Yijiong Yu et.al. 2501.08197 link
2025-01-14 PRESERVE: Prefetching Model Weights and KV-Cache in Distributed LLM Serving Ahmet Caner Yüzügüler et.al. 2501.08192 null
2025-01-14 A Critical Synthesis of Uncertainty Quantification and Foundation Models in Monocular Depth Estimation Steven Landgraf et.al. 2501.08188 null
2025-01-14 A Multi-Modal AI Copilot for Single-Cell Analysis with Instruction Following Yin Fang et.al. 2501.08187 link
2025-01-13 Training-Free Motion-Guided Video Generation with Enhanced Temporal Consistency Using Motion Consistency Loss Xinyu Zhang et.al. 2501.07563 null
2025-01-13 SST-EM: Advanced Metrics for Evaluating Semantic, Spatial and Temporal Aspects in Video Editing Varun Biyyala et.al. 2501.07554 link
2025-01-13 Imagine while Reasoning in Space: Multimodal Visualization-of-Thought Chengzu Li et.al. 2501.07542 null
2025-01-13 ML Mule: Mobile-Driven Context-Aware Collaborative Learning Haoxiang Yu et.al. 2501.07536 null
2025-01-13 Investigating Large Language Models in Inferring Personality Traits from User Conversations Jianfeng Zhu et.al. 2501.07532 null
2025-01-13 RadAlign: Advancing Radiology Report Generation with Vision-Language Concept Alignment Difei Gu et.al. 2501.07525 link
2025-01-13 Parallel Key-Value Cache Fusion for Position Invariant RAG Philhoon Oh et.al. 2501.07523 null

About

Automatically updates arXiv papers on Path Planning, LLM, and Autonomous Driving using GitHub Actions; running since 2024.2.
