Updated on 2025.08.30

Table of Contents

6D Pose
Point Cloud Registration
Point Cloud Segmentation
Zero-shot

6D Pose

Publish Date	Title	Authors	PDF	Code
2025-07-23	RemixFusion: Residual-based Mixed Representation for Large-scale Online RGB-D Reconstruction	Yuqing Lan et.al.	2507.17594v1	null
2025-07-23	Physics-based Human Pose Estimation from a Single Moving RGB Camera	Ayce Idil Aytekin et.al.	2507.17406v1	null
2025-07-21	Toward a Real-Time Framework for Accurate Monocular 3D Human Pose Estimation with Geometric Priors	Mohamed Adjel et.al.	2507.16850v1	null
2025-07-22	Adaptive Relative Pose Estimation Framework with Dual Noise Tuning for Safe Approaching Maneuvers	Batu Candan et.al.	2507.16214v1	null
2025-07-21	TONUS: Neuromorphic human pose estimation for artistic sound co-creation	Jules Lecomte et.al.	2507.15734v1	null
2025-07-21	Hi^2-GSLoc: Dual-Hierarchical Gaussian-Specific Visual Relocalization for Remote Sensing	Boni Hu et.al.	2507.15683v1	null
2025-07-21	Dense-depth map guided deep Lidar-Visual Odometry with Sparse Point Clouds and Images	JunYing Huang et.al.	2507.15496v1	null
2025-07-20	3-Dimensional CryoEM Pose Estimation and Shift Correction Pipeline	Kaishva Chintan Shah et.al.	2507.14924v1	null
2025-07-20	An Evaluation of DUSt3R/MASt3R/VGGT 3D Reconstruction on Photogrammetric Aerial Blocks	Xinyi Wu et.al.	2507.14798v1	null
2025-07-22	AI-Enhanced Precision in Sport Taekwondo: Increasing Fairness, Speed, and Trust in Competition (FST.ai)	Keivan Shariatmadar et.al.	2507.14657v2	null
2025-07-18	C-DOG: Training-Free Multi-View Multi-Object Association in Dense Scenes Without Visual Feature via Connected δ-Overlap Graphs	Yung-Hong Sun et.al.	2507.14095v1	null
2025-07-21	PCR-GS: COLMAP-Free 3D Gaussian Splatting via Pose Co-Regularizations	Yu Wei et.al.	2507.13891v2	null
2025-07-18	MaskHOI: Robust 3D Hand-Object Interaction Estimation via Masked Pre-training	Yuechen Xie et.al.	2507.13673v1	null
2025-07-17	$π^3$: Scalable Permutation-Equivariant Visual Geometry Learning	Yifan Wang et.al.	2507.13347v1	null
2025-07-17	Revisiting Reliability in the Reasoning-based Pose Estimation Benchmark	Junsu Kim et.al.	2507.13314v1	null
2025-07-17	DINO-VO: A Feature-based Visual Odometry Leveraging a Visual Foundation Model	Maulana Bisyir Azhari et.al.	2507.13145v1	null
2025-07-17	AthleticsPose: Authentic Sports Motion Dataset on Athletic Field and Evaluation of Monocular 3D Pose Estimation Ability	Tomohiro Suzuki et.al.	2507.12905v1	null
2025-07-17	From Neck to Head: Bio-Impedance Sensing for Head Pose Estimation	Mengxi Liu et.al.	2507.12884v1	null
2025-07-19	SpatialTrackerV2: 3D Point Tracking Made Easy	Yuxi Xiao et.al.	2507.12462v2	null
2025-07-16	Spontaneous Spatial Cognition Emerges during Egocentric Video Viewing through Non-invasive BCI	Weichen Dai et.al.	2507.12417v1	null
2025-07-16	Efficient Calisthenics Skills Classification through Foreground Instance Selection and Depth Estimation	Antonio Finocchiaro et.al.	2507.12292v1	null
2025-07-16	UniLGL: Learning Uniform Place Recognition for FOV-limited/Panoramic LiDAR Global Localization	Hongming Shen et.al.	2507.12194v1	null
2025-07-16	BRUM: Robust 3D Vehicle Reconstruction from 360 Sparse Images	Davide Di Nucci et.al.	2507.12095v1	null
2025-07-16	SGLoc: Semantic Localization System for Camera Pose Estimation from 3D Gaussian Splatting Representation	Beining Xu et.al.	2507.12027v1	null
2025-07-16	SEPose: A Synthetic Event-based Human Pose Estimation Dataset for Pedestrian Monitoring	Kaustav Chanda et.al.	2507.11910v1	null
2025-07-15	GKNet: Graph-based Keypoints Network for Monocular Pose Estimation of Non-cooperative Spacecraft	Weizhao Ma et.al.	2507.11077v1	null
2025-07-15	Joint angle model based learning to refine kinematic human pose estimation	Chang Peng et.al.	2507.11075v1	null
2025-07-14	Raci-Net: Ego-vehicle Odometry Estimation in Adverse Weather Conditions	Mohammadhossein Talebi et.al.	2507.10376v1	null
2025-07-14	Kaleidoscopic Background Attack: Disrupting Pose Estimation with Multi-Fold Radial Symmetry Textures	Xinlong Ding et.al.	2507.10265v1	null
2025-07-14	ProGait: A Multi-Purpose Video Dataset and Benchmark for Transfemoral Prosthesis Users	Xiangyu Yin et.al.	2507.10223v1	null
2025-07-13	VST-Pose: A Velocity-Integrated Spatiotemporal Attention Network for Human WiFi Pose Estimation	Xinyu Zhang et.al.	2507.09672v1	null
2025-07-13	EHPE: A Segmented Architecture for Enhanced Hand Pose Estimation	Bolun Zheng et.al.	2507.09560v1	null
2025-07-13	Self-supervised pretraining of vision transformers for animal behavioral analysis and neural encoding	Yanchen Wang et.al.	2507.09513v1	null
2025-07-12	PoseLLM: Enhancing Language-Guided Human Pose Estimation with MLP Alignment	Dewen Zhang et.al.	2507.09139v1	null
2025-07-10	RegGS: Unposed Sparse Views Gaussian Splatting with 3DGS Registration	Chong Cheng et.al.	2507.08136v1	null
2025-07-10	SCREP: Scene Coordinate Regression and Evidential Learning-based Perception-Aware Trajectory Generation	Juyeop Han et.al.	2507.07467v1	null
2025-07-09	g2o vs. Ceres: Optimizing Scan Matching in Cartographer SLAM	Quanjie Qiu et.al.	2507.07142v1	null
2025-07-09	Smartphone Exergames with Real-Time Markerless Motion Capture: Challenges and Trade-offs	Mathieu Phosanarack et.al.	2507.06669v1	null
2025-07-09	MK-Pose: Category-Level Object Pose Estimation via Multimodal-Based Keypoint Learning	Yifan Yang et.al.	2507.06662v1	null
2025-07-09	Failure Forecasting Boosts Robustness of Sim2Real Rhythmic Insertion Policies	Yuhan Liu et.al.	2507.06519v1	null
2025-07-09	Mask6D: Masked Pose Priors For 6D Object Pose Estimation	Yuechen Xie et.al.	2507.06486v1	null
2025-07-08	SenseShift6D: Multimodal RGB-D Benchmarking for Robust 6D Pose Estimation across Environment and Sensor Variations	Yegyu Han et.al.	2507.05751v1	null
2025-07-08	Event-RGB Fusion for Spacecraft Pose Estimation Under Harsh Lighting	Mohsi Jawaid et.al.	2507.05698v1	null
2025-07-07	W2W: A Simulated Exploration of IMU Placement Across the Human Body for Designing Smarter Wearable	Lala Shakti Swarup Ray et.al.	2507.05532v1	null
2025-07-07	UDF-GMA: Uncertainty Disentanglement and Fusion for General Movement Assessment	Zeqi Luo et.al.	2507.04814v1	null
2025-07-06	Thousand-Brains Systems: Sensorimotor Intelligence for Rapid, Robust Learning and Inference	Niels Leadholm et.al.	2507.04494v1	null
2025-07-09	Gaussian-LIC2: LiDAR-Inertial-Camera Gaussian Splatting SLAM	Xiaolei Lang et.al.	2507.04004v2	null
2025-07-05	Accurate Pose Estimation Using Contact Manifold Sampling for Safe Peg-in-Hole Insertion of Complex Geometries	Abhay Negi et.al.	2507.03925v1	null
2025-07-02	Markerless Stride Length estimation in Athletic using Pose Estimation with monocular vision	Patryk Skorupski et.al.	2507.03016v1	null
2025-07-03	Reconstructing Close Human Interaction with Appearance and Proxemics Reasoning	Buzhen Huang et.al.	2507.02565v1	null
2025-07-03	IMASHRIMP: Automatic White Shrimp (Penaeus vannamei) Biometrical Analysis from Laboratory Images Using Computer Vision and Deep Learning	Abiam Remache González et.al.	2507.02519v1	null
2025-07-03	3D Heart Reconstruction from Sparse Pose-agnostic 2D Echocardiographic Slices	Zhurong Chen et.al.	2507.02411v1	null
2025-07-03	LMPNet for Weakly-supervised Keypoint Discovery	Pei Guo et.al.	2507.02308v1	null
2025-07-02	What does really matter in image goal navigation?	Gianluca Monaci et.al.	2507.01667v1	null
2025-07-01	2024 NASA SUITS Report: LLM-Driven Immersive Augmented Reality User Interface for Robotics and Space Exploration	Kathy Zhuang et.al.	2507.01206v1	null
2025-07-04	Robotic Manipulation by Imitating Generated Videos Without Physical Demonstrations	Shivansh Patel et.al.	2507.00990v2	null
2025-07-01	Multi-Modal Graph Convolutional Network with Sinusoidal Encoding for Robust Human Action Segmentation	Hao Xing et.al.	2507.00752v1	null
2025-07-01	LoD-Loc v2: Aerial Visual Localization over Low Level-of-Detail City Models using Explicit Silhouette Alignment	Juelin Zhu et.al.	2507.00659v1	null
2025-06-30	Computer Vision for Objects used in Group Work: Challenges and Opportunities	Changsoo Jung et.al.	2507.00224v1	null
2025-06-30	Validation of AI-Based 3D Human Pose Estimation in a Cyber-Physical Environment	Lisa Marie Otto et.al.	2506.23739v1	null
2025-06-30	MGPRL: Distributed Multi-Gaussian Processes for Wi-Fi-based Multi-Robot Relative Localization in Large Indoor Environments	Sai Krishna Ghanta et.al.	2506.23514v1	null
2025-06-29	TVG-SLAM: Robust Gaussian Splatting SLAM with Tri-view Geometric Constraints	Zhen Tan et.al.	2506.23207v1	null
2025-06-28	Deterministic Object Pose Confidence Region Estimation	Jinghao Wang et.al.	2506.22720v1	null
2025-06-27	Evaluating Pointing Gestures for Target Selection in Human-Robot Collaboration	Noora Sassali et.al.	2506.22116v1	null
2025-06-27	Single-Scanline Relative Pose Estimation for Rolling Shutter Cameras	Petr Hruby et.al.	2506.22069v1	null
2025-06-24	ICP-3DGS: SfM-free 3D Gaussian Splatting for Large-scale Unbounded Scenes	Chenhao Zhang et.al.	2506.21629v1	null
2025-06-26	EndoFlow-SLAM: Real-Time Endoscopic SLAM with Flow-Constrained Gaussian Splatting	Taoyu Wu et.al.	2506.21420v1	null
2025-06-26	CURL-SLAM: Continuous and Compact LiDAR Mapping	Kaicheng Zhang et.al.	2506.21077v1	null
2025-06-27	DidSee: Diffusion-Based Depth Completion for Material-Agnostic Robotic Perception and Manipulation	Wenzhou Lyu et.al.	2506.21034v2	null
2025-06-25	How do Foundation Models Compare to Skeleton-Based Approaches for Gesture Recognition in Human-Robot Interaction?	Stephanie Käs et.al.	2506.20795v1	null
2025-06-26	Consensus-Driven Uncertainty for Robotic Grasping based on RGB Perception	Eric C. Joyce et.al.	2506.20045v2	null
2025-06-24	Systematic Comparison of Projection Methods for Monocular 3D Human Pose Estimation on Fisheye Images	Stephanie Käs et.al.	2506.19747v1	null
2025-06-23	RAG-6DPose: Retrieval-Augmented 6D Pose Estimation via Leveraging CAD as Knowledge Base	Kuanning Wang et.al.	2506.18856v1	null
2025-06-19	Reproducible Evaluation of Camera Auto-Exposure Methods in the Field: Platform, Benchmark and Lessons Learned	Olivier Gamache et.al.	2506.18844v1	null
2025-06-23	SViP: Sequencing Bimanual Visuomotor Policies with Object-Centric Motion Primitives	Yizhou Chen et.al.	2506.18825v1	null
2025-06-20	RGBTrack: Fast, Robust Depth-Free 6D Pose Estimation and Tracking	Teng Guo et.al.	2506.17119v1	link
2025-06-20	Monocular One-Shot Metric-Depth Alignment for RGB-Based Robot Grasping	Teng Guo et.al.	2506.17110v1	null
2025-06-20	LunarLoc: Segment-Based Global Localization on the Moon	Annika Thomas et.al.	2506.16940v1	link
2025-06-19	ControlVLA: Few-shot Object-centric Adaptation for Pre-trained Vision-Language-Action Models	Puhao Li et.al.	2506.16211v1	null
2025-06-19	STAR-Pose: Efficient Low-Resolution Video Human Pose Estimation via Spatial-Temporal Adaptive Super-Resolution	Yucheng Jin et.al.	2506.16061v1	null
2025-06-19	KARL: Kalman-Filter Assisted Reinforcement Learner for Dynamic Object Tracking and Grasping	Kowndinya Boyalakuntla et.al.	2506.15945v1	null
2025-06-19	Beyond Audio and Pose: A General-Purpose Framework for Video Synchronization	Yosub Shin et.al.	2506.15937v1	null
2025-06-18	Improving Robotic Manipulation: Techniques for Object Pose Estimation, Accommodating Positional Uncertainty, and Disassembly Tasks from Examples	Viral Rasik Galaiya et.al.	2506.15865v1	null
2025-06-18	PRISM-Loc: a Lightweight Long-range LiDAR Localization in Urban Environments with Topological Maps	Kirill Muravyev et.al.	2506.15849v1	null
2025-06-18	Human Motion Capture from Loose and Sparse Inertial Sensors with Garment-aware Diffusion Models	Andela Ilic et.al.	2506.15290v1	null
2025-06-18	RA-NeRF: Robust Neural Radiance Field Reconstruction with Accurate Camera Pose Estimation under Complex Trajectories	Qingsong Yan et.al.	2506.15242v1	null
2025-06-17	PoseGRAF: Geometric-Reinforced Adaptive Fusion for Monocular 3D Human Pose Estimation	Ming Xu et.al.	2506.14596v1	link
2025-06-17	Non-Overlap-Aware Egocentric Pose Estimation for Collaborative Perception in Connected Autonomy	Hong Huang et.al.	2506.14180v1	null
2025-06-17	TACS-Graphs: Traversability-Aware Consistent Scene Graphs for Ground Robot Indoor Localization and Mapping	Jeewon Kim et.al.	2506.14178v1	null
2025-06-16	Diffusion-based Inverse Observation Model for Artificial Skin	Ante Maric et.al.	2506.13986v1	null
2025-06-16	ATK: Automatic Task-driven Keypoint Selection for Robust Policy Learning	Yunchu Zhang et.al.	2506.13867v1	null
2025-06-16	PF-LHM: 3D Animatable Avatar Reconstruction from Pose-free Articulated Human Images	Lingteng Qiu et.al.	2506.13766v1	null
2025-06-16	JENGA: Object selection and pose estimation for robotic grasping from a stack	Sai Srinivas Jeevanandam et.al.	2506.13425v1	null
2025-06-16	Automatic Multi-View X-Ray/CT Registration Using Bone Substructure Contours	Roman Flepp et.al.	2506.13292v1	null
2025-06-16	DETRPose: Real-time end-to-end transformer model for multi-person pose estimation	Sebastian Janampa et.al.	2506.13027v1	link
2025-06-15	A large-scale, physically-based synthetic dataset for satellite pose estimation	Szabolcs Velkei et.al.	2506.12782v1	null
2025-06-13	ViTaSCOPE: Visuo-tactile Implicit Representation for In-hand Pose and Extrinsic Contact Estimation	Jayjun Lee et.al.	2506.12239v1	null
2025-06-10	Monocular 3D Hand Pose Estimation with Implicit Camera Alignment	Christos Pantazopoulos et.al.	2506.11133v1	link
2025-06-12	Occlusion-Aware 3D Hand-Object Pose Estimation with Masked AutoEncoders	Hui Yang et.al.	2506.10816v1	null
2025-06-12	In-Hand Object Pose Estimation via Visual-Tactile Fusion	Felix Nonnengießer et.al.	2506.10787v1	null
2025-06-11	Fluoroscopic Shape and Pose Tracking of Catheters with Custom Radiopaque Markers	Jared Lawson et.al.	2506.09934v1	null
2025-06-11	EquiCaps: Predictor-Free Pose-Aware Pre-Trained Capsule Networks	Athinoulla Konstantinou et.al.	2506.09895v1	link
2025-06-11	Accurate and efficient zero-shot 6D pose estimation with frozen foundation models	Andrea Caraffa et.al.	2506.09784v1	null
2025-06-11	CHIP: A multi-sensor dataset for 6D pose estimation of chairs in industrial settings	Mattia Nardon et.al.	2506.09699v1	null
2025-06-10	Princeton365: A Diverse Dataset with Accurate Camera Pose	Karhan Kayan et.al.	2506.09035v1	null
2025-06-10	ArrowPose: Segmentation, Detection, and 5 DoF Pose Estimation Network for Colorless Point Clouds	Frederik Hagelskjaer et.al.	2506.08699v1	null
2025-06-09	UA-Pose: Uncertainty-Aware 6D Object Pose Estimation and Online Object Completion with Partial References	Ming-Feng Li et.al.	2506.07996v1	null
2025-06-09	Hierarchical Scoring with 3D Gaussian Splatting for Instance Image-Goal Navigation	Yijie Deng et.al.	2506.07338v1	null
2025-06-10	From Generation to Generalization: Emergent Few-Shot Learning in Video Diffusion Models	Pablo Acuaviva et.al.	2506.07280v2	null
2025-06-08	GoTrack: Generic 6DoF Object Pose Refinement and Tracking	Van Nguyen Nguyen et.al.	2506.07155v1	null
2025-06-08	UNO: Unified Self-Supervised Monocular Odometry for Platform-Agnostic Deployment	Wentao Zhao et.al.	2506.07013v1	null
2025-06-07	Deep Inertial Pose: A deep learning approach for human pose estimation	Sara M. Cerqueira et.al.	2506.06850v1	null
2025-06-06	Dy3DGS-SLAM: Monocular 3D Gaussian Splatting SLAM for Dynamic Environments	Mingrui Li et.al.	2506.05965v1	null
2025-06-06	SurGSplat: Progressive Geometry-Constrained Gaussian Splatting for Surgical Scene Reconstruction	Yuchao Zheng et.al.	2506.05935v1	null
2025-06-06	CryoFastAR: Fast Cryo-EM Ab Initio Reconstruction Made Easy	Jiakai Zhang et.al.	2506.05864v1	null
2025-06-06	You Only Estimate Once: Unified, One-stage, Real-Time Category-level Articulated Object 6D Pose Estimation for Robotic Grasping	Jingshun Huang et.al.	2506.05719v1	null
2025-06-05	On-the-fly Reconstruction for Large-Scale Novel View Synthesis from Unposed Images	Andreas Meuleman et.al.	2506.05558v1	null
2025-06-05	Rectified Point Flow: Generic Point Cloud Pose Estimation	Tao Sun et.al.	2506.05282v1	null
2025-06-05	Realizing Text-Driven Motion Generation on NAO Robot: A Reinforcement Learning-Optimized Control Pipeline	Zihan Xu et.al.	2506.05117v1	link
2025-06-05	CzechLynx: A Dataset for Individual Identification and Pose Estimation of the Eurasian Lynx	Lukas Picek et.al.	2506.04931v1	null
2025-06-05	SupeRANSAC: One RANSAC to Rule Them All	Daniel Barath et.al.	2506.04803v1	link
2025-06-05	LGM-Pose: A Lightweight Global Modeling Network for Real-time Human Pose Estimation	Biao Guo et.al.	2506.04561v1	null
2025-06-04	Photoreal Scene Reconstruction from an Egocentric Device	Zhaoyang Lv et.al.	2506.04444v1	link
2025-06-04	cuVSLAM: CUDA accelerated visual odometry	Alexander Korovko et.al.	2506.04359v1	link
2025-06-04	Voyager: Long-Range and World-Consistent Video Diffusion for Explorable 3D Scene Generation	Tianyu Huang et.al.	2506.04225v1	null
2025-06-04	Accelerating SfM-based Pose Estimation with Dominating Set	Joji Joseph et.al.	2506.03667v1	null
2025-06-03	Learning Pyramid-structured Long-range Dependencies for 3D Human Pose Estimation	Mingjie Wei et.al.	2506.02853v1	link
2025-06-03	GeneA-SLAM2: Dynamic SLAM with AutoEncoder-Preprocessed Genetic Keypoints Resampling and Depth Variance-Guided Dynamic Region Removal	Shufan Qing et.al.	2506.02736v1	link
2025-06-02	Rig3R: Rig-Aware Conditioning for Learned 3D Reconstruction	Samuel Li et.al.	2506.02265v1	null
2025-06-02	E3D-Bench: A Benchmark for End-to-End 3D Geometric Foundation Models	Wenyan Cong et.al.	2506.01933v1	null
2025-06-02	SteerPose: Simultaneous Extrinsic Camera Calibration and Matching from Articulation	Sang-Eun Lee et.al.	2506.01691v1	null
2025-06-01	TIGeR: Text-Instructed Generation and Refinement for Template-Free Hand-Object Interaction	Yiyao Huang et.al.	2506.00953v1	null
2025-05-31	XYZ-IBD: High-precision Bin-picking Dataset for Object 6D Pose Estimation Capturing Real-world Industrial Complexity	Junwen Huang et.al.	2506.00599v1	null
2025-05-30	Lazy Heuristic Search for Solving POMDPs with Expensive-to-Compute Belief Transitions	Muhammad Suhail Saleem et.al.	2506.00285v1	null
2025-05-30	6D Pose Estimation on Point Cloud Data through Prior Knowledge Integration: A Case Study in Autonomous Disassembly	Chengzhi Wu et.al.	2505.24669v1	null
2025-05-30	Category-Level 6D Object Pose Estimation in Agricultural Settings Using a Lattice-Deformation Framework and Diffusion-Augmented Synthetic Data	Marios Glytsos et.al.	2505.24636v1	null
2025-05-30	PCIE_Pose Solution for EgoExo4D Pose and Proficiency Estimation Challenge	Feng Chen et.al.	2505.24411v1	null
2025-05-29	Pose-free 3D Gaussian splatting via shape-ray estimation	Youngju Na et.al.	2505.22978v1	null
2025-05-28	TwinTrack: Bridging Vision and Contact Physics for Real-Time Tracking of Unknown Dynamic Objects	Wen Yang et.al.	2505.22882v1	null
2025-05-28	4DTAM: Non-Rigid Tracking and Mapping via Dynamic Surface Gaussians	Hidenobu Matsuki et.al.	2505.22859v1	null
2025-05-28	MultiFormer: A Multi-Person Pose Estimation System Based on CSI and Attention Mechanism	Yanyi Qu et.al.	2505.22555v1	null
2025-05-28	Event-based Egocentric Human Pose Estimation in Dynamic Environment	Wataru Ikeda et.al.	2505.22007v1	null
2025-05-27	Spectral Compression Transformer with Line Pose Graph for Monocular 3D Human Pose Estimation	Zenghao Zheng et.al.	2505.21309v1	null
2025-05-29	ReassembleNet: Learnable Keypoints and Diffusion for 2D Fresco Reconstruction	Adeela Islam et.al.	2505.21117v2	null
2025-05-27	HS-SLAM: A Fast and Hybrid Strategy-Based SLAM Approach for Low-Speed Autonomous Driving	Bingxiang Kang et.al.	2505.20906v1	null
2025-05-27	Mamba-Driven Topology Fusion for Monocular 3-D Human Pose Estimation	Zenghao Zheng et.al.	2505.20611v1	null
2025-05-28	HAND Me the Data: Fast Robot Adaptation via Hand Path Retrieval	Matthew Hong et.al.	2505.20455v2	null
2025-05-25	Learning the Contact Manifold for Accurate Pose Estimation During Peg-in-Hole Insertion of Complex Geometries	Abhay Negi et.al.	2505.19215v1	null
2025-05-24	Why Not Replace? Sustaining Long-Term Visual Localization via Handcrafted-Learned Feature Collaboration on CPU	Yicheng Lin et.al.	2505.18652v1	link
2025-05-24	An Inertial Sequence Learning Framework for Vehicle Speed Estimation via Smartphone IMU	Xuan Xiao et.al.	2505.18490v1	null
2025-05-23	Pose Splatter: A 3D Gaussian Splatting Model for Quantifying Animal Pose and Appearance	Jack Goffinet et.al.	2505.18342v1	null
2025-05-23	To Glue or Not to Glue? Classical vs Learned Image Matching for Mobile Mapping Cameras to Textured Semantic 3D Building Models	Simone Gaisbauer et.al.	2505.17973v1	link
2025-05-23	Towards Dynamic 3D Reconstruction of Hand-Instrument Interaction in Ophthalmic Surgery	Ming Hu et.al.	2505.17677v1	null
2025-05-23	PoseBH: Prototypical Multi-Dataset Training Beyond Human Pose Estimation	Uyoung Jeong et.al.	2505.17475v1	link
2025-05-22	Towards Texture- And Shape-Independent 3D Keypoint Estimation in Birds	Valentin Schmuker et.al.	2505.16633v1	null
2025-05-22	GMatch: Geometry-Constrained Feature Matching for RGB-D Object Pose Estimation	Ming Yang et.al.	2505.16144v1	null
2025-05-21	Object-Focus Actor for Data-efficient Robot Generalization Dexterous Manipulation	Yihang Li et.al.	2505.15098v1	null
2025-05-20	UPTor: Unified 3D Human Pose Dynamics and Trajectory Prediction for Human-Robot Interaction	Nisarga Nilavadi et.al.	2505.14866v1	null
2025-05-19	Recollection from Pensieve: Novel View Synthesis via Learning from Uncalibrated Videos	Ruoyu Wang et.al.	2505.13440v1	link
2025-05-19	KinTwin: Imitation Learning with Torque and Muscle Driven Biomechanical Models Enables Precise Replication of Able-Bodied and Impaired Movement from Markerless Motion Capture	R. James Cotton et.al.	2505.13436v1	null
2025-05-19	The Way Up: A Dataset for Hold Usage Detection in Sport Climbing	Anna Maschek et.al.	2505.12854v1	null
2025-05-17	Keypoints as Dynamic Centroids for Unified Human Pose and Segmentation	Niaz Ahmad et.al.	2505.12130v1	null
2025-05-17	Black-box Adversaries from Latent Space: Unnoticeable Attacks on Human Pose and Shape Estimation	Zhiying Li et.al.	2505.12009v1	null
2025-05-17	ElderFallGuard: Real-Time IoT and Computer Vision-Based Fall Detection System for Elderly Safety	Tasrifur Riahi et.al.	2505.11845v1	null
2025-05-16	SurgPose: Generalisable Surgical Instrument Pose Estimation using Zero-Shot Learning and Stereo Vision	Utsav Rai et.al.	2505.11439v1	null
2025-05-16	MTevent: A Multi-Task Event Camera Dataset for 6D Pose Estimation and Moving Object Detection	Shrutarv Awasthi et.al.	2505.11282v1	link
2025-05-16	PoseBench3D: A Cross-Dataset Analysis Framework for 3D Human Pose Estimation	Saad Manzur et.al.	2505.10888v1	link
2025-05-16	RefPose: Leveraging Reference Geometric Correspondences for Accurate 6D Pose Estimation of Unseen Objects	Jaeguk Kim et.al.	2505.10841v1	null
2025-05-14	UMotion: Uncertainty-driven Human Motion Estimation from Inertial and Ultra-wideband Units	Huakun Liu et.al.	2505.09393v1	link
2025-05-14	APR-Transformer: Initial Pose Estimation for Localization in Complex Environments through Absolute Pose Regression	Srinivas Ravuri et.al.	2505.09356v1	link
2025-05-13	Real-time Capable Learning-based Visual Tool Pose Correction via Differentiable Simulation	Shuyuan Yang et.al.	2505.08875v1	null
2025-05-12	Sleep Position Classification using Transfer Learning for Bed-based Pressure Sensors	Olivier Papillon et.al.	2505.08111v1	null
2025-05-07	Pose Estimation for Intra-cardiac Echocardiography Catheter via AI-Based Anatomical Understanding	Jaeyoung Huh et.al.	2505.07851v1	null
2025-05-12	Enabling Privacy-Aware AI-Based Ergonomic Analysis	Sander De Coninck et.al.	2505.07306v1	null
2025-05-13	Human Motion Prediction via Test-domain-aware Adaptation with Easily-available Human Motions Estimated from Videos	Katsuki Shimbo et.al.	2505.07301v2	null
2025-05-12	When Dance Video Archives Challenge Computer Vision	Philippe Colantoni et.al.	2505.07249v1	null
2025-05-10	CompSLAM: Complementary Hierarchical Multi-Modal Localization and Mapping for Robot Autonomy in Underground Environments	Shehryar Khattak et.al.	2505.06483v1	null
2025-05-09	Active Perception for Tactile Sensing: A Task-Agnostic Attention-Based Approach	Tim Schneider et.al.	2505.06182v1	null
2025-05-08	Progressive Inertial Poser: Progressive Real-Time Kinematic Chain Estimation for 3D Full-Body Pose from Three IMU Sensors	Zunjie Zhu et.al.	2505.05336v1	null
2025-05-08	Improving Global Motion Estimation in Sparse IMU-based Motion Capture with Physics	Xinyu Yi et.al.	2505.05010v1	null
2025-05-08	An Efficient Method for Accurate Pose Estimation and Error Correction of Cuboidal Objects	Utsav Rai et.al.	2505.04962v1	null
2025-05-07	Comparison of Visual Trackers for Biomechanical Analysis of Running	Luis F. Gomez et.al.	2505.04713v1	null
2025-05-07	Do We Still Need to Work on Odometry for Autonomous Driving?	Cedric Le Gentil et.al.	2505.04438v1	null
2025-05-07	HDiffTG: A Lightweight Hybrid Diffusion-Transformer-GCN Architecture for 3D Human Pose Estimation	Yajie Fu et.al.	2505.04276v1	link
2025-05-07	One2Any: One-Reference 6D Pose Estimation for Any Object	Mengya Liu et.al.	2505.04109v1	null
2025-05-06	Polar Coordinate-Based 2D Pose Prior with Neural Distance Field	Qi Gan et.al.	2505.03445v1	null
2025-05-06	LiftFeat: 3D Geometry-Aware Local Feature Matching	Yepeng Liu et.al.	2505.03422v1	link
2025-05-06	Artificial Behavior Intelligence: Technology, Challenges, and Future Directions	Kanghyun Jo et.al.	2505.03315v1	null
2025-05-05	Dance of Fireworks: An Interactive Broadcast Gymnastics Training System Based on Pose Estimation	Haotian Chen et.al.	2505.02690v1	null
2025-05-05	Corr2Distrib: Making Ambiguous Correspondences an Ally to Predict Reliable 6D Pose Distributions	Asma Brazi et.al.	2505.02501v1	null
2025-05-05	Finger Pose Estimation for Under-screen Fingerprint Sensor	Xiongjun Guan et.al.	2505.02481v1	link
2025-05-05	6D Pose Estimation on Spoons and Hands	Kevin Tan et.al.	2505.02335v1	null
2025-05-04	Continuous Normalizing Flows for Uncertainty-Aware Human Pose Estimation	Shipeng Liu et.al.	2505.02287v1	null
2025-05-04	A Birotation Solution for Relative Pose Problems	Hongbo Zhao et.al.	2505.02025v1	null
2025-05-03	Near-field 5D Pose Estimation using Reconfigurable Intelligent Surfaces	Srikar Sharma Sadhu et.al.	2505.01829v1	null
2025-05-03	AquaGS: Fast Underwater Scene Reconstruction with SfM-Free Gaussian Splatting	Junhao Shi et.al.	2505.01799v1	null
2025-05-03	PosePilot: Steering Camera Pose for Generative World Models with Self-supervised Depth	Bu Jin et.al.	2505.01729v1	null
2025-05-02	T-Graph: Enhancing Sparse-view Camera Pose Estimation by Pairwise Translation Graph	Qingyu Xian et.al.	2505.01207v1	null
2025-05-02	3D Human Pose Estimation via Spatial Graph Order Attention and Temporal Body Aware Transformer	Kamel Aouaidjia et.al.	2505.01003v1	link
2025-05-01	Are Minimal Radial Distortion Solvers Really Necessary for Relative Pose Estimation?	Viktor Kocur et.al.	2505.00866v1	link
2025-05-01	P2P-Insole: Human Pose Estimation Using Foot Pressure Distribution and Motion Sensors	Atsuya Watanabe et.al.	2505.00755v1	null
2025-05-01	Dietary Intake Estimation via Continuous 3D Reconstruction of Food	Wallace Lee et.al.	2505.00606v1	null
2025-05-02	InterLoc: LiDAR-based Intersection Localization using Road Segmentation with Automated Evaluation Method	Nguyen Hoang Khoi Tran et.al.	2505.00512v2	null
2025-04-30	Self-Supervised Monocular Visual Drone Model Identification through Improved Occlusion Handling	Stavrow A. Bahnam et.al.	2504.21695v1	null
2025-04-30	Multiview Point Cloud Registration via Optimization in an Autoencoder Latent Space	Luc Vedrenne et.al.	2504.21467v1	null
2025-04-29	Dance Style Recognition Using Laban Movement Analysis	Muhammad Turab et.al.	2504.21166v1	null
2025-04-29	Adept: Annotation-Denoising Auxiliary Tasks with Discrete Cosine Transform Map and Keypoint for Human-Centric Pretraining	Weizhen He et.al.	2504.20800v1	null
2025-04-29	A Survey on Event-based Optical Marker Systems	Nafiseh Jabbari Tofighi et.al.	2504.20736v1	null
2025-04-29	Large-scale visual SLAM for in-the-wild videos	Shuo Sun et.al.	2504.20496v1	null
2025-05-01	GSFeatLoc: Visual Localization Using Feature Correspondence on 3D Gaussian Splatting	Jongwon Lee et.al.	2504.20379v2	null
2025-05-01	PRISM-DP: Spatial Pose-based Observations for Diffusion-Policies via Segmentation, Mesh Generation, and Pose Tracking	Xiatao Sun et.al.	2504.20359v2	null
2025-04-28	Transformation & Translation Occupancy Grid Mapping: 2-Dimensional Deep Learning Refined SLAM	Leon Davies et.al.	2504.19654v1	null
2025-04-28	GAN-SLAM: Real-Time GAN Aided Floor Plan Creation Through SLAM	Leon Davies et.al.	2504.19653v1	null
2025-04-28	Category-Level and Open-Set Object Pose Estimation for Robotics	Peter Hönig et.al.	2504.19572v1	null
2025-04-25	Certifiably-Correct Mapping for Safe Navigation Despite Odometry Drift	Devansh R. Agrawal et.al.	2504.18713v1	null
2025-04-25	SSD-Poser: Avatar Pose Estimation with State Space Duality from Sparse Observations	Shuting Zhao et.al.	2504.18332v1	null
2025-04-25	S3MOT: Monocular 3D Object Tracking with Selective State Space Model	Zhuohao Yan et.al.	2504.18068v1	null
2025-04-22	SmallGS: Gaussian Splatting-based Camera Pose Estimation for Small-Baseline Videos	Yuxin Yao et.al.	2504.17810v1	null
2025-04-24	Dynamic Camera Poses and Where to Find Them	Chris Rockwell et.al.	2504.17788v1	null
2025-04-24	A Guide to Structureless Visual Localization	Vojtech Panek et.al.	2504.17636v1	null
2025-04-24	Object Pose Estimation by Camera Arm Control Based on the Next Viewpoint Estimation	Tomoki Mizuno et.al.	2504.17424v1	null
2025-04-24	Bias-Eliminated PnP for Stereo Visual Odometry: Provably Consistent and Large-Scale Localization	Guangyang Zeng et.al.	2504.17410v1	null
2025-04-23	WiFi based Human Fall and Activity Recognition using Transformer based Encoder Decoder and Graph Neural Networks	Younggeol Cho et.al.	2504.16655v1	null
2025-04-23	Assessing the Feasibility of Internet-Sourced Video for Automatic Cattle Lameness Detection	Md Fahimuzzman Sohan et.al.	2504.16404v1	null
2025-04-22	SignX: The Foundation Model for Sign Recognition	Sen Fang et.al.	2504.16315v1	null
2025-04-22	GADS: A Super Lightweight Model for Head Pose Estimation	Menan Velayuthan et.al.	2504.15751v1	null
2025-04-21	Field Report on Ground Penetrating Radar for Localization at the Mars Desert Research Station	Anja Sheppard et.al.	2504.15455v1	null
2025-04-21	Vision6D: 3D-to-2D Interactive Visualization and Annotation Tool for 6D Pose Estimation	Yike Zhang et.al.	2504.15329v1	link
2025-04-21	Seeing from Another Perspective: Evaluating Multi-View Understanding in MLLMs	Chun-Hsiao Yeh et.al.	2504.15280v1	link
2025-04-21	Instance-Adaptive Keypoint Learning with Local-to-Global Geometric Aggregation for Category-Level Object Pose Estimation	Xiao Zhang et.al.	2504.15134v1	null
2025-04-20	Back on Track: Bundle Adjustment for Dynamic Scene Reconstruction	Weirong Chen et.al.	2504.14516v1	null
2025-04-20	SG-Reg: Generalizable and Efficient Scene Graph Registration	Chuhao Liu et.al.	2504.14440v1	link
2025-04-18	Imitation Learning with Precisely Labeled Human Demonstrations	Yilong Song et.al.	2504.13803v1	null
2025-04-18	Mono3R: Exploiting Monocular Cues for Geometric 3D Reconstruction	Wenyu Li et.al.	2504.13419v1	null
2025-04-17	ViTa-Zero: Zero-shot Visuotactile Object 6D Pose Estimation	Hongyu Li et.al.	2504.13179v1	null
2025-04-18	ODHSR: Online Dense 3D Reconstruction of Humans and Scenes from Monocular Videos	Zetong Zhang et.al.	2504.13167v2	null
2025-04-17	Unsupervised Cross-Domain 3D Human Pose Estimation via Pseudo-Label-Guided Global Transforms	Jingjing Liu et.al.	2504.12699v1	null
2025-04-16	MobilePoser: Real-Time Full-Body Pose Estimation and 3D Human Translation from IMUs in Mobile Consumer Devices	Vasco Xu et.al.	2504.12492v1	link
2025-04-16	Diffusion Based Robust LiDAR Place Recognition	Benjamin Krummenacher et.al.	2504.12412v1	null
2025-04-16	Regist3R: Incremental Registration with Stereo Foundation Model	Sidun Liu et.al.	2504.12356v1	null
2025-04-16	CoMotion: Concurrent Multi-person 3D Motion	Alejandro Newell et.al.	2504.12186v1	link
2025-04-16	No Fuss, Just Function -- A Proposal for Non-Intrusive Full Body Tracking in XR for Meaningful Spatial Interactions	Elisabeth Mayer et.al.	2504.11987v1	null
2025-04-16	An Online Adaptation Method for Robust Depth Estimation and Visual Odometry in the Open World	Xingwu Ji et.al.	2504.11698v1	link
2025-04-17	CAP-Net: A Unified Network for 6D Pose and Size Estimation of Categorical Articulated Parts from a Single RGB-D Image	Jingshun Huang et.al.	2504.11230v2	null
2025-04-15	DMAGaze: Gaze Estimation Based on Feature Disentanglement and Multi-Scale Attention	Haohan Chen et.al.	2504.11160v1	null
2025-04-14	MonoDiff9D: Monocular Category-Level 9D Object Pose Estimation via Diffusion Model	Jian Liu et.al.	2504.10433v1	link
2025-04-14	Benchmarking 3D Human Pose Estimation Models Under Occlusions	Filipa Lino et.al.	2504.10350v1	null
2025-04-15	Differentially Private 2D Human Pose Estimation	Kaushik Bhargav Sivangi et.al.	2504.10190v2	null
2025-04-14	TT3D: Table Tennis 3D Reconstruction	Thomas Gossard et.al.	2504.10035v1	null
2025-04-14	Efficient 2D to Full 3D Human Pose Uplifting including Joint Rotations	Katja Ludwig et.al.	2504.09953v1	null
2025-04-14	NeRF-Based Transparent Object Grasping Enhanced by Shape Priors	Yi Han et.al.	2504.09868v1	null
2025-04-13	EasyREG: Easy Depth-Based Markerless Registration and Tracking using Augmented Reality Device for Surgical Guidance	Yue Yang et.al.	2504.09498v1	null
2025-04-12	SCFlow2: Plug-and-Play Object Pose Refiner with Shape-Constraint Scene Flow	Qingyuan Wang et.al.	2504.09160v1	null
2025-04-12	A Constrained Optimization Approach for Gaussian Splatting from Coarsely-posed Images and Noisy Lidar Point Clouds	Jizong Peng et.al.	2504.09129v1	null
2025-04-12	BIGS: Bimanual Category-agnostic Interaction Reconstruction from Monocular Videos via 3D Gaussian Splatting	Jeongwan On et.al.	2504.09097v1	null
2025-04-11	The Invisible EgoHand: 3D Hand Forecasting through EgoBody Pose Estimation	Masashi Hatano et.al.	2504.08654v1	null
2025-04-11	MBE-ARI: A Multimodal Dataset Mapping Bi-directional Engagement in Animal-Robot Interaction	Ian Noronha et.al.	2504.08646v1	link
2025-04-11	Hardware, Algorithms, and Applications of the Neuromorphic Vision Sensor: a Review	Claudio Cimarelli et.al.	2504.08588v1	null
2025-04-11	Multi-person Physics-based Pose Estimation for Combat Sports	Hossein Feiz et.al.	2504.08175v1	null
2025-04-10	Towards Unconstrained 2D Pose Estimation of the Human Spine	Muhammad Saif Ullah Khan et.al.	2504.08110v1	null
2025-04-10	BoxDreamer: Dreaming Box Corners for Generalizable Object Pose Estimation	Yuanhong Yu et.al.	2504.07955v1	null
2025-04-09	DLTPose: 6DoF Pose Estimation From Accurate Dense Surface Point Estimates	Akash Jadhav et.al.	2504.07335v1	null
2025-04-09	Two by Two: Learning Multi-Task Pairwise Objects Assembly for Generalizable Robot Manipulation	Yu Qi et.al.	2504.06961v1	null
2025-04-09	GraspClutter6D: A Large-scale Real-world Dataset for Robust Perception and Grasping in Cluttered Scenes	Seunghyeok Back et.al.	2504.06866v1	link
2025-04-09	Setup-Invariant Augmented Reality for Teaching by Demonstration with Surgical Robots	Alexandre Banks et.al.	2504.06677v1	link
2025-04-09	HGMamba: Enhancing 3D Human Pose Estimation with a HyperGCN-Mamba Network	Hu Cui et.al.	2504.06638v1	null
2025-04-08	Leveraging Synthetic Adult Datasets for Unsupervised Infant Pose Estimation	Sarosij Bose et.al.	2504.05789v1	null
2025-04-08	SAP-CoPE: Social-Aware Planning using Cooperative Pose Estimation with Infrastructure Sensor Nodes	Minghao Ning et.al.	2504.05727v1	link
2025-04-08	POMATO: Marrying Pointmap Matching with Temporal Motion for Dynamic 3D Reconstruction	Songyan Zhang et.al.	2504.05692v1	link
2025-04-10	Learning Affine Correspondences by Integrating Geometric Constraints	Pengju Sun et.al.	2504.04834v2	link
2025-04-10	A Convex and Global Solution for the P$n$P Problem in 2D Forward-Looking Sonar	Jiayi Su et.al.	2504.04445v2	null
2025-04-05	3R-GS: Best Practice in Optimizing Camera Poses Along with 3DGS	Zhisheng Huang et.al.	2504.04294v1	null
2025-04-02	A Geometric Approach For Pose and Velocity Estimation Using IMU and Inertial/Body-Frame Measurements	Sifeddine Benahmed et.al.	2504.03764v1	null
2025-04-04	Robust Human Registration with Body Part Segmentation on Noisy Point Clouds	Kai Lascheit et.al.	2504.03602v1	null
2025-04-04	Endo3R: Unified Online Reconstruction from Dynamic Monocular Endoscopic Video	Jiaxin Guo et.al.	2504.03198v1	null
2025-04-03	Cooperative Inference for Real-Time 3D Human Pose Estimation in Multi-Device Edge Networks	Hyun-Ho Choi et.al.	2504.03052v1	link
2025-04-03	BOP Challenge 2024 on Model-Based and Model-Free 6D Object Pose Estimation	Van Nguyen Nguyen et.al.	2504.02812v1	null
2025-04-03	PicoPose: Progressive Pixel-to-Pixel Correspondence Learning for Novel Object Pose Estimation	Lihua Liu et.al.	2504.02617v1	link
2025-04-02	Dual-stream Transformer-GCN Model with Contextualized Representations Learning for Monocular 3D Human Pose Estimation	Mingrui Ye et.al.	2504.01764v1	link
2025-04-02	ForestVO: Enhancing Visual Odometry in Forest Environments through ForestGlue	Thomas Pritchard et.al.	2504.01261v1	link
2025-04-01	AP-CAP: Advancing High-Quality Data Synthesis for Animal Pose Estimation via a Controllable Image Generation Pipeline	Lei Wang et.al.	2504.00394v1	null
2025-03-31	Easi3R: Estimating Disentangled Motion from DUSt3R Without Training	Xingyu Chen et.al.	2503.24391v1	link
2025-03-31	LiM-Loc: Visual Localization with Dense and Accurate 3D Reference Maps Directly Corresponding 2D Keypoints to 3D LiDAR Point Clouds	Masahiko Tsuji et.al.	2503.23664v1	null
2025-03-30	PhysPose: Refining 6D Object Poses with Physical Constraints	Martin Malenický et.al.	2503.23587v1	null
2025-03-30	Improving Indoor Localization Accuracy by Using an Efficient Implicit Neural Map Representation	Haofei Kuang et.al.	2503.23480v1	link
2025-03-30	SparseLoc: Sparse Open-Set Landmark-based Global Localization for Autonomous Navigation	Pranjal Paul et.al.	2503.23465v1	null
2025-03-30	HiPART: Hierarchical Pose AutoRegressive Transformer for Occluded 3D Human Pose Estimation	Hongwei Zheng et.al.	2503.23331v1	null
2025-03-29	Incorporating GNSS Information with LIDAR-Inertial Odometry for Accurate Land-Vehicle Localization	Jintao Cheng et.al.	2503.23199v1	null
2025-03-29	FRAME: Floor-aligned Representation for Avatar Motion from Egocentric Video	Andrea Boscolo Camiletto et.al.	2503.23094v1	null
2025-03-28	ForcePose: A Deep Learning Approach for Force Calculation Based on Action Recognition Using MediaPipe Pose Estimation Combined with Object Detection	Nandakishor M et.al.	2503.22363v1	null
2025-03-28	GCRayDiffusion: Pose-Free Surface Reconstruction via Geometric Consistent Ray Diffusion	Li-Heng Chen et.al.	2503.22349v1	null
2025-03-27	NeRF-based Point Cloud Reconstruction using a Stationary Camera for Agricultural Applications	Kibon Ku et.al.	2503.21958v1	null
2025-03-27	Uni4D: Unifying Visual Foundation Models for 4D Modeling from a Single Video	David Yifan Yao et.al.	2503.21761v1	link
2025-03-27	Reconstructing Humans with a Biomechanically Accurate Skeleton	Yan Xia et.al.	2503.21751v1	null
2025-03-27	OccRobNet: Occlusion Robust Network for Accurate 3D Interacting Hand-Object Pose Estimation	Mallika Garg et.al.	2503.21723v1	null
2025-03-27	RapidPoseTriangulation: Multi-view Multi-person Whole-body Human Pose Triangulation in a Millisecond	Daniel Bermuth et.al.	2503.21692v1	null
2025-03-27	STAMICS: Splat, Track And Map with Integrated Consistency and Semantics for Dense RGB-D SLAM	Yongxu Wang et.al.	2503.21425v1	null
2025-03-27	Lidar-only Odometry based on Multiple Scan-to-Scan Alignments over a Moving Window	Aaron Kurda et.al.	2503.21293v1	null
2025-03-27	Recurrent Feature Mining and Keypoint Mixup Padding for Category-Agnostic Pose Estimation	Junjie Chen et.al.	2503.21140v1	link
2025-03-26	DINeMo: Learning Neural Mesh Models with no 3D Annotations	Weijie Guo et.al.	2503.20220v1	null
2025-03-25	Zero-Shot Human-Object Interaction Synthesis with Multimodal Priors	Yuke Lou et.al.	2503.20118v1	null
2025-03-25	Vanishing Depth: A Depth Adapter with Positional Depth Encoding for Generalized Image Encoders	Paul Koch et.al.	2503.19947v1	link
2025-03-25	Visuo-Tactile Object Pose Estimation for a Multi-Finger Robot Hand with Low-Resolution In-Hand Tactile Sensing	Lukas Mack et.al.	2503.19893v1	null
2025-03-25	Semi-SD: Semi-Supervised Metric Depth Estimation via Surrounding Cameras for Autonomous Driving	Yusen Xie et.al.	2503.19713v1	link
2025-03-25	DynOPETs: A Versatile Benchmark for Dynamic Object Pose Estimation and Tracking in Moving Camera Scenarios	Xiangting Meng et.al.	2503.19625v1	null
2025-03-25	Pose-Based Fall Detection System: Efficient Monitoring on Standard CPUs	Vinayak Mali et.al.	2503.19501v1	null
2025-03-25	Multi-modal 3D Pose and Shape Estimation with Computed Tomography	Mingxiao Tu et.al.	2503.19405v1	null
2025-03-25	From Sparse to Dense: Camera Relocalization with Scene-Specific Detector from Feature Gaussian Splatting	Zhiwei Huang et.al.	2503.19358v1	null
2025-03-25	Analyzing the Synthetic-to-Real Domain Gap in 3D Hand Pose Estimation	Zhuoran Zhao et.al.	2503.19307v1	link
2025-03-25	Any6D: Model-free 6D Pose Estimation of Novel Objects	Taeyeop Lee et.al.	2503.18673v2	null
2025-03-24	Structure-Aware Correspondence Learning for Relative Pose Estimation	Yihan Chen et.al.	2503.18671v1	null
2025-03-24	TrackID3x3: A Dataset and Algorithm for Multi-Player Tracking with Identification and Pose Estimation in 3x3 Basketball Full-court Videos	Kazuhiro Yamada et.al.	2503.18282v1	link
2025-03-23	Selecting and Pruning: A Differentiable Causal Sequentialized State-Space Model for Two-View Correspondence Learning	Xiang Fang et.al.	2503.17938v1	null
2025-03-22	Co-op: Correspondence-based Novel Object Pose Estimation	Sungphill Moon et.al.	2503.17731v1	null
2025-03-21	Image as an IMU: Estimating Camera Motion from a Single Motion-Blurred Image	Jerred Chen et.al.	2503.17358v1	null
2025-03-21	Pow3R: Empowering Unconstrained 3D Reconstruction with Camera and Scene Priors	Wonbong Jang et.al.	2503.17316v1	null
2025-03-20	ContactFusion: Stochastic Poisson Surface Maps from Visual and Contact Sensing	Aditya Kamireddypalli et.al.	2503.16592v1	null
2025-03-20	Probabilistic Prompt Distribution Learning for Animal Pose Estimation	Jiyong Rao et.al.	2503.16120v1	link
2025-03-20	PoseTraj: Pose-Aware Trajectory Control in Video Diffusion	Longbin Ji et.al.	2503.16068v1	null
2025-03-20	Automating 3D Dataset Generation with Neural Radiance Fields	P. Schulz et.al.	2503.15997v1	link
2025-03-20	Learning to Efficiently Adapt Foundation Models for Self-Supervised Endoscopic 3D Scene Reconstruction from Any Cameras	Beilei Cui et.al.	2503.15917v1	null
2025-03-19	EdgeRegNet: Edge Feature-based Multimodal Registration Network between Images and LiDAR Point Clouds	Yuanchao Yue et.al.	2503.15284v1	link
2025-03-20	GIVEPose: Gradual Intra-class Variation Elimination for RGB-based Category-Level Object Pose Estimation	Zinqin Huang et.al.	2503.15110v2	link
2025-03-20	Distilling 3D distinctive local descriptors for 6D pose estimation	Amir Hamza et.al.	2503.15106v2	null
2025-03-18	Validation of Human Pose Estimation and Human Mesh Recovery for Extracting Clinically Relevant Motion Data from Videos	Kai Armstrong et.al.	2503.14760v1	null
2025-03-18	SIR-DIFF: Sparse Image Sets Restoration with Multi-View Diffusion Model	Yucheng Mao et.al.	2503.14463v1	null
2025-03-18	SCJD: Sparse Correlation and Joint Distillation for Efficient 3D Human Pose Estimation	Weihong Chen et.al.	2503.14097v1	null
2025-03-18	Foundation Feature-Driven Online End-Effector Pose Estimation: A Marker-Free and Learning-Free Approach	Tianshu Wu et.al.	2503.14051v1	null
2025-03-19	Learning Shape-Independent Transformation via Spherical Representations for Category-Level Object Pose Estimation	Huan Ren et.al.	2503.13926v2	null
2025-03-20	STEP: Simultaneous Tracking and Estimation of Pose for Animals and Humans	Shashikant Verma et.al.	2503.13344v2	link
2025-03-17	UniHOPE: A Unified Approach for Hand-Only and Hand-Object Pose Estimation	Yinqiao Wang et.al.	2503.13303v1	null
2025-03-17	Uncertainty-Aware Knowledge Distillation for Compact and Efficient 6DoF Pose Estimation	Nassim Ali Ousalah et.al.	2503.13053v1	null
2025-03-17	PoseSyn: Synthesizing Diverse 3D Pose Data from In-the-Wild 2D Data	ChangHee Yang et.al.	2503.13025v1	null
2025-03-15	Gun Detection Using Combined Human Pose and Weapon Appearance	Amulya Reddy Maligireddy et.al.	2503.12215v1	null
2025-03-15	TACO: Taming Diffusion for in-the-wild Video Amodal Completion	Ruijie Lu et.al.	2503.12049v1	null
2025-03-14	Bring Your Rear Cameras for Egocentric 3D Human Pose Estimation	Hiroyasu Akada et.al.	2503.11652v1	null
2025-03-14	Online Test-time Adaptation for 3D Human Pose Estimation: A Practical Perspective with Estimated 2D Poses	Qiuxia Lin et.al.	2503.11194v1	null
2025-03-14	Fast and Robust Localization for Humanoid Soccer Robot via Iterative Landmark Matching	Ruochen Hou et.al.	2503.11020v1	null
2025-03-13	Clothes-Changing Person Re-identification Based On Skeleton Dynamics	Asaf Joseph et.al.	2503.10759v1	null
2025-03-13	Consistent multi-animal pose estimation in cattle using dynamic Kalman filter based tracking	Maarten Perneel et.al.	2503.10450v1	link
2025-03-13	6D Object Pose Tracking in Internet Videos for Robotic Manipulation	Georgy Ponimatkin et.al.	2503.10307v1	null
2025-03-13	VicaSplat: A Single Run is All You Need for 3D Gaussian Splatting and Camera Estimation from Unposed Video Frames	Zhiqi Li et.al.	2503.10286v1	null
2025-03-12	Physics-Aware Human-Object Rendering from Sparse Views via 3D Gaussian Splatting	Weiquan Wang et.al.	2503.09640v1	null
2025-03-12	GenHPE: Generative Counterfactuals for 3D Human Pose Estimation with Radio Frequency Signals	Shuokang Huang et.al.	2503.09537v1	null
2025-03-12	MonoSLAM: Robust Monocular SLAM with Global Structure Optimization	Bingzheng Jiang et.al.	2503.09296v1	null
2025-03-12	Better Together: Unified Motion Capture and 3D Avatar Reconstruction	Arthur Moreau et.al.	2503.09293v1	null
2025-03-11	Acoustic Neural 3D Reconstruction Under Pose Drift	Tianxiang Lin et.al.	2503.08930v1	null
2025-03-11	Keypoint Semantic Integration for Improved Feature Matching in Outdoor Agricultural Environments	Rajitha de Silva et.al.	2503.08843v1	null
2025-03-11	Keypoint Detection and Description for Raw Bayer Images	Jiakai Lin et.al.	2503.08673v1	null
2025-03-11	SGNetPose+: Stepwise Goal-Driven Networks with Pose Information for Trajectory Prediction in Autonomous Driving	Akshat Ghiya et.al.	2503.08016v1	null
2025-03-10	Better Pose Initialization for Fast and Robust 2D/3D Pelvis Registration	Yehyun Suh et.al.	2503.07767v1	null
2025-03-10	HumanMM: Global Human Motion Recovery from Multi-shot Videos	Yuhong Zhang et.al.	2503.07597v1	link
2025-03-11	AthletePose3D: A Benchmark Dataset for 3D Human Pose Estimation and Kinematic Validation in Athletic Movements	Calvin Yeung et.al.	2503.07499v2	link
2025-03-10	Multi-Robot System for Cooperative Exploration in Unknown Environments: A Survey	Chuqi Wang et.al.	2503.07278v1	null
2025-03-12	Endo-FASt3r: Endoscopic Foundation model Adaptation for Structure from motion	Mona Sheikh Zeinoddin et.al.	2503.07204v2	null
2025-03-10	Multi-Modal 3D Mesh Reconstruction from Images and Text	Melvin Reka et.al.	2503.07190v1	null
2025-03-11	PoseLess: Depth-Free Vision-to-Joint Control via Direct Image Mapping with VLM	Alan Dao et.al.	2503.07111v2	null
2025-03-09	AxisPose: Model-Free Matching-Free Single-Shot 6D Object Pose Estimation via Axis Generation	Yang Zou et.al.	2503.06660v1	null
2025-03-08	NeuraLoc: Visual Localization in Neural Implicit Map with Dual Complementary Features	Hongjia Zhai et.al.	2503.06117v1	null
2025-03-08	Fish2Mesh Transformer: 3D Human Mesh Recovery from Egocentric Vision	David C. Jeong et.al.	2503.06089v1	null
2025-03-08	ReJSHand: Efficient Real-Time Hand Pose Estimation and Mesh Reconstruction Using Refined Joint and Skeleton Features	Shan An et.al.	2503.05995v1	link
2025-03-07	Differentiable Rendering-based Pose Estimation for Surgical Robotic Instruments	Zekai Liang et.al.	2503.05953v1	null
2025-03-07	Novel Object 6D Pose Estimation with a Single Reference View	Jian Liu et.al.	2503.05578v1	link
2025-03-07	Multi-Grained Feature Pruning for Video-Based Human Pose Estimation	Zhigang Wang et.al.	2503.05365v1	null
2025-03-07	Persistent Object Gaussian Splat (POGS) for Tracking Human and Robot Manipulation of Irregularly Shaped Objects	Justin Yu et.al.	2503.05189v1	null
2025-03-07	SplatPose: Geometry-Aware 6-DoF Pose Estimation from Single RGB Image via 3D Gaussian Splatting	Linqi Yang et.al.	2503.05174v1	null
2025-03-07	GaussianCAD: Robust Self-Supervised CAD Reconstruction from Three Orthographic Views Using 3D Gaussian Splatting	Zheng Zhou et.al.	2503.05161v1	null
2025-03-06	MarsLGPR: Mars Rover Localization with Ground Penetrating Radar	Anja Sheppard et.al.	2503.04944v1	null
2025-03-09	ReynoldsFlow: Exquisite Flow Estimation via Reynolds Transport Theorem	Yu-Hsi Chen et.al.	2503.04500v2	link
2025-03-05	Active 6D Pose Estimation for Textureless Objects using Multi-View RGB Frames	Jun Yang et.al.	2503.03726v1	null
2025-03-05	Machine Learning in Biomechanics: Key Applications and Limitations in Walking, Running, and Sports Movements	Carlo Dindorf et.al.	2503.03717v1	null
2025-03-05	Improving 6D Object Pose Estimation of metallic Household and Industry Objects	Thomas Pöllabauer et.al.	2503.03655v1	null
2025-03-05	Tiny Lidars for Manipulator Self-Awareness: Sensor Characterization and Initial Localization Experiments	Giammarco Caroleo et.al.	2503.03449v1	null
2025-03-05	Direct Sparse Odometry with Continuous 3D Gaussian Maps for Indoor Environments	Jie Deng et.al.	2503.03373v1	link
2025-03-05	Supervised Visual Docking Network for Unmanned Surface Vehicles Using Auto-labeling in Real-world Water Environments	Yijie Chu et.al.	2503.03282v1	null
2025-03-05	SCORE: Saturated Consensus Relocalization in Semantic Line Maps	Haodong Jiang et.al.	2503.03254v1	link
2025-03-04	Monocular Person Localization under Camera Ego-motion	Yu Zhan et.al.	2503.02916v1	null
2025-03-04	PIDLoc: Cross-View Pose Optimization Network Inspired by PID Controllers	Wooju Lee et.al.	2503.02388v1	null
2025-03-04	DQO-MAP: Dual Quadrics Multi-Object mapping with Gaussian Splatting	Haoyuan Li et.al.	2503.02223v1	link
2025-03-04	Zero-Shot Sim-to-Real Visual Quadrotor Control with Hard Constraints	Yan Miao et.al.	2503.02198v1	null
2025-03-03	Constraint-Based Modeling of Dynamic Entities in 3D Scene Graphs for Robust SLAM	Marco Giberna et.al.	2503.02050v1	null
2025-03-05	Category-level Meta-learned NeRF Priors for Efficient Object Mapping	Saad Ejaz et.al.	2503.01582v2	null
2025-03-03	RUSSO: Robust Underwater SLAM with Sonar Optimization against Visual Degradation	Shu Pan et.al.	2503.01434v1	null
2025-03-03	ecg2o: A Seamless Extension of g2o for Equality-Constrained Factor Graph Optimization	Anas Abdelkarim et.al.	2503.01311v1	link
2025-03-03	Convex Hull-based Algebraic Constraint for Visual Quadric SLAM	Xiaolong Yu et.al.	2503.01254v1	link
2025-03-04	Floorplan-SLAM: A Real-Time, High-Accuracy, and Long-Term Multi-Session Point-Plane SLAM for Efficient Floorplan Reconstruction	Haolin Wang et.al.	2503.00397v2	null
2025-03-01	BGM2Pose: Active 3D Human Pose Estimation with Non-Stationary Sounds	Yuto Shibata et.al.	2503.00389v1	null
2025-02-28	BST: Badminton Stroke-type Transformer for Skeleton-based Action Recognition in Racket Sports	Jing-Yuan Chang et.al.	2502.21085v1	link
2025-02-28	Two-Stream Spatial-Temporal Transformer Framework for Person Identification via Natural Conversational Keypoints	Masoumeh Chapariniya et.al.	2502.20803v1	null
2025-02-27	Cutting-edge 3D reconstruction solutions for underwater coral reef images: A review and comparison	Jiageng Zhong et.al.	2502.20154v1	null
2025-02-27	BEV-DWPVO: BEV-based Differentiable Weighted Procrustes for Low Scale-drift Monocular Visual Odometry on Ground	Yufei Wei et.al.	2502.20078v1	null
2025-02-28	SegLocNet: Multimodal Localization Network for Autonomous Driving via Bird's-Eye-View Segmentation	Zijie Zhou et.al.	2502.20077v2	link
2025-02-27	RUBIK: A Structured Benchmark for Image Matching across Geometric Challenges	Thibaut Loiseau et.al.	2502.19955v1	null
2025-02-27	QORT-Former: Query-optimized Real-time Transformer for Understanding Two Hands Manipulating Objects	Elkhan Ismayilzada et.al.	2502.19769v1	null
2025-02-27	Accurate Pose Estimation for Flight Platforms based on Divergent Multi-Aperture Imaging System	Shunkun Liang et.al.	2502.19708v1	null
2025-02-26	Increasing the Task Flexibility of Heavy-Duty Manipulators Using Visual 6D Pose Estimation of Objects	Petri Mäkinen et.al.	2502.19169v1	null
2025-02-25	EgoSim: An Egocentric Multi-view Simulator and Real Dataset for Body-worn Cameras during Motion and Activity	Dominik Hollidt et.al.	2502.18373v1	null
2025-02-25	Learning Structure-Supporting Dependencies via Keypoint Interactive Transformer for General Mammal Pose Estimation	Tianyang Xu et.al.	2502.18214v1	link
2025-02-24	V-HOP: Visuo-Haptic 6D Object Pose Tracking	Hongyu Li et.al.	2502.17434v1	null
2025-02-23	Orchestrating Joint Offloading and Scheduling for Low-Latency Edge SLAM	Yao Zhang et.al.	2502.16495v1	null
2025-02-23	DeProPose: Deficiency-Proof 3D Human Pose Estimation via Adaptive Multi-View Fusion	Jianbin Jiao et.al.	2502.16419v1	link
2025-02-21	RGB-Only Gaussian Splatting SLAM for Unbounded Outdoor Scenes	Sicheng Yu et.al.	2502.15633v1	null
2025-02-21	SiMHand: Mining Similar Hands for Large-Scale 3D Hand Pose Pre-training	Nie Lin et.al.	2502.15251v1	link
2025-02-21	Nonlinear Dynamical Systems for Automatic Face Annotation in Head Tracking and Pose Estimation	Thoa Thieu et.al.	2502.15179v1	null
2025-02-20	Design of a Visual Pose Estimation Algorithm for Moon Landing	Atakan Süslü et.al.	2502.14942v1	null
2025-02-20	Hier-SLAM++: Neuro-Symbolic Semantic SLAM with a Hierarchically Categorical Gaussian Splatting	Boying Li et.al.	2502.14931v1	null
2025-02-19	EfficientPose 6D: Scalable and Efficient 6D Object Pose Estimation	Zixuan Fang et.al.	2502.14061v1	null
2025-02-19	Active Illumination for Visual Ego-Motion Estimation in the Dark	Francesco Crocetti et.al.	2502.13708v1	null
2025-02-19	Object-Pose Estimation With Neural Population Codes	Heiko Hoffmann et.al.	2502.13403v1	null
2025-02-18	Spatiotemporal Multi-Camera Calibration using Freely Moving People	Sang-Eun Lee et.al.	2502.12546v1	null
2025-02-18	Learning Transformation-Isomorphic Latent Space for Accurate Hand Pose Estimation	Kaiwen Ren et.al.	2502.12535v1	null
2025-02-19	FLARE: Feed-forward Geometry, Appearance and Camera Estimation from Uncalibrated Sparse Views	Shangzhan Zhang et.al.	2502.12138v2	null
2025-02-17	Enhancing Transparent Object Pose Estimation: A Fusion of GDR-Net and Edge Detection	Tessa Pulli et.al.	2502.12027v1	null
2025-02-17	SurgPose: a Dataset for Articulated Robotic Surgical Tool Pose Estimation and Tracking	Zijian Wu et.al.	2502.11534v1	null
2025-02-18	VarGes: Improving Variation in Co-Speech 3D Gesture Generation via StyleCLIPS	Ming Meng et.al.	2502.10729v2	link
2025-02-15	Semantics-aware Test-time Adaptation for 3D Human Pose Estimation	Qiuxia Lin et.al.	2502.10724v1	null
2025-02-15	Learning semantical dynamics and spatiotemporal collaboration for human pose estimation in video	Runyang Feng et.al.	2502.10616v1	null
2025-02-14	HIPPo: Harnessing Image-to-3D Priors for Model-free Zero-shot 6D Pose Estimation	Yibo Liu et.al.	2502.10606v1	null
2025-02-14	Manual2Skill: Learning to Read Manuals and Acquire Robotic Skills for Furniture Assembly Using Vision-Language Models	Chenrui Tie et.al.	2502.10090v1	link
2025-02-13	Metamorphic Testing for Pose Estimation Systems	Matias Duran et.al.	2502.09460v1	null
2025-02-13	BevSplat: Resolving Height Ambiguity via Feature-Based Gaussian Primitives for Weakly-Supervised Cross-View Localization	Qiwei Wang et.al.	2502.09080v1	null
2025-02-14	Siren Song: Manipulating Pose Estimation in XR Headsets Using Acoustic Attacks	Zijian Huang et.al.	2502.08865v2	null
2025-02-12	LIR-LIVO: A Lightweight, Robust LiDAR/Vision/Inertial Odometry with Illumination-Resilient Deep Features	Shujie Zhou et.al.	2502.08676v1	link
2025-02-12	CordViP: Correspondence-based Visuomotor Policy for Dexterous Manipulation in Real-World	Yankai Fu et.al.	2502.08449v1	null
2025-02-11	GaRLIO: Gravity enhanced Radar-LiDAR-Inertial Odometry	Chiyun Noh et.al.	2502.07703v1	link
2025-02-11	Matrix3D: Large Photogrammetry Model All-in-One	Yuanxun Lu et.al.	2502.07685v1	null
2025-02-08	Vision-in-the-loop Simulation for Deep Monocular Pose Estimation of UAV in Ocean Environment	Maneesha Wickramasuriya et.al.	2502.05409v1	null
2025-02-06	Measuring Physical Plausibility of 3D Human Poses Using Physics Simulation	Nathan Louis et.al.	2502.04483v1	link
2025-02-06	GCE-Pose: Global Context Enhancement for Category-level Object Pose Estimation	Weihang Li et.al.	2502.04293v1	null
2025-02-06	Advanced Object Detection and Pose Estimation with Hybrid Task Cascade and High-Resolution Networks	Yuhui Jin et.al.	2502.03877v1	null
2025-02-05	Mapping and Localization Using LiDAR Fiducial Markers	Yibo Liu et.al.	2502.03510v1	null
2025-02-04	Diff9D: Diffusion-Based Domain-Generalized Category-Level 9-DoF Object Pose Estimation	Jian Liu et.al.	2502.02525v1	link
2025-02-03	CleanPose: Category-Level Object Pose Estimation via Causal Learning and Knowledge Distillation	Xiao Lin et.al.	2502.01312v1	null
2025-02-03	Enhancing Feature Tracking Reliability for Visual Navigation using Real-Time Safety Filter	Dabin Kim et.al.	2502.01092v1	null
2025-02-03	ZeroBP: Learning Position-Aware Correspondence for Zero-shot 6D Pose Estimation in Bin-Picking	Jianqiu Chen et.al.	2502.01004v1	null
2025-01-31	A Direct Semi-Exhaustive Search Method for Robust, Partial-to-Full Point Cloud Registration	Richard Cheng et.al.	2502.00115v1	null
2025-01-31	XRF V2: A Dataset for Action Summarization with Wi-Fi Signals, and IMUs in Phones, Watches, Earbuds, and Glasses	Bo Lan et.al.	2501.19034v1	link
2025-01-30	SimpleDepthPose: Fast and Reliable Human Pose Estimation with RGBD-Images	Daniel Bermuth et.al.	2501.18478v1	link
2025-01-29	Online Trajectory Replanner for Dynamically Grasping Irregular Objects	Minh Nhat Vu et.al.	2501.17968v1	null
2025-01-28	DebugAgent: Efficient and Interpretable Error Slice Discovery for Comprehensive Model Debugging	Muxi Chen et.al.	2501.16751v1	null
2025-01-27	Toward Efficient Generalization in 3D Human Pose Estimation via a Canonical Domain Approach	Hoosang Lee et.al.	2501.16146v1	null
2025-01-27	NanoHTNet: Nano Human Topology Network for Efficient 3D Human Pose Estimation	Jialun Cai et.al.	2501.15763v1	null
2025-01-25	Towards Better Robustness: Progressively Joint Pose-3DGS Learning for Arbitrarily Long Videos	Zhen-Hui Dong et.al.	2501.15096v1	null
2025-01-25	SpatioTemporal Learning for Human Pose Estimation in Sparsely-Labeled Videos	Yingying Jiao et.al.	2501.15073v1	null
2025-01-24	3D/2D Registration of Angiograms using Silhouette-based Differentiable Rendering	Taewoong Lee et.al.	2501.14918v1	link
2025-01-24	Light3R-SfM: Towards Feed-forward Structure-from-Motion	Sven Elflein et.al.	2501.14914v1	null
2025-01-24	Glissando-Net: Deep sinGLe vIew category level poSe eStimation ANd 3D recOnstruction	Bo Sun et.al.	2501.14896v1	null
2025-01-24	Optimizing Grasping Precision for Industrial Pick-and-Place Tasks Through a Novel Visual Servoing Approach	Khairidine Benali et.al.	2501.14557v1	null
2025-01-24	LiDAR-Based Vehicle Detection and Tracking for Autonomous Racing	Marcello Cellina et.al.	2501.14502v1	null
2025-01-24	Optimizing Human Pose Estimation Through Focused Human and Joint Regions	Yingying Jiao et.al.	2501.14439v1	null
2025-01-24	Causal-Inspired Multitask Learning for Video-Based Human Pose Estimation	Haipeng Chen et.al.	2501.14356v1	null
2025-01-24	HAMMER: Heterogeneous, Multi-Robot Semantic Gaussian Splatting	Javier Yu et.al.	2501.14147v1	null
2025-01-23	Fast3R: Towards 3D Reconstruction of 1000+ Images in One Forward Pass	Jianing Yang et.al.	2501.13928v1	link
2025-01-23	EgoHand: Ego-centric Hand Pose Estimation and Gesture Recognition with Head-mounted Millimeter-wave Radar and IMUs	Yizhe Lv et.al.	2501.13805v1	link
2025-01-23	VIGS SLAM: IMU-based Large-Scale 3D Gaussian Splatting SLAM	Gyuhyeon Pak et.al.	2501.13402v1	null
2025-01-22	Deep Learning-Based Image Recovery and Pose Estimation for Resident Space Objects	Louis Aberdeen et.al.	2501.13009v1	null
2025-01-21	BlanketGen2-Fit3D: Synthetic Blanket Augmentation Towards Improving Real-World In-Bed Blanket Occluded Human Pose Estimation	Tamás Karácsony et.al.	2501.12318v1	null
2025-01-19	Refinement Module based on Parse Graph of Feature Map for Human Pose Estimation	Shibang Liu et.al.	2501.11069v1	null
2025-01-18	RoMu4o: A Robotic Manipulation Unit For Orchard Operations Automating Proximal Hyperspectral Leaf Sensing	Mehrad Mortazavi et.al.	2501.10621v1	link
2025-01-17	landmarker: a Toolkit for Anatomical Landmark Localization in 2D/3D Images	Jef Jonkers et.al.	2501.10098v1	link
2025-01-16	A New Teacher-Reviewer-Student Framework for Semi-supervised 2D Human Pose Estimation	Wulian Yun et.al.	2501.09565v1	null
2025-01-21	Towards Robust and Realistic Human Pose Estimation via WiFi Signals	Yang Chen et.al.	2501.09411v2	link
2025-01-16	RoboReflect: Robotic Reflective Reasoning for Grasping Ambiguous-Condition Objects	Zhen Luo et.al.	2501.09307v1	null
2025-01-16	BRIGHT-VO: Brightness-Guided Hybrid Transformer for Visual Odometry with Multi-modality Refinement Module	Dongzhihan Wang et.al.	2501.08659v2	null
2025-01-14	Poseidon: A ViT-based Architecture for Multi-Frame Pose Estimation with Adaptive Frame Weighting and Multi-Scale Feature Fusion	Cesare Davide Pace et.al.	2501.08446v1	link
2025-01-14	Leveraging 2D Masked Reconstruction for Domain Adaptation of 3D Pose Estimation	Hansoo Park et.al.	2501.08408v1	null
2025-01-14	Predicting 4D Hand Trajectory from Monocular Videos	Yufei Ye et.al.	2501.08329v1	null
2025-01-14	A Critical Synthesis of Uncertainty Quantification and Foundation Models in Monocular Depth Estimation	Steven Landgraf et.al.	2501.08188v1	null
2025-01-14	AgentPose: Progressive Distribution Alignment via Feature Agent for Human Pose Distillation	Feng Zhang et.al.	2501.08088v1	null
2025-01-14	Robust Low-Light Human Pose Estimation through Illumination-Texture Modulation	Feng Zhang et.al.	2501.08038v1	null
2025-01-14	BioPose: Biomechanically-accurate 3D Pose Estimation from Monocular Videos	Farnoosh Koleini et.al.	2501.07800v1	null
2025-01-13	Fixing the Scale and Shift in Monocular Depth For Camera Pose Estimation	Yaqing Ding et.al.	2501.07742v1	link
2025-01-13	Efficiently Closing Loops in LiDAR-Based SLAM Using Point Cloud Density Maps	Saurabh Gupta et.al.	2501.07399v1	null
2025-01-13	Collaborative Learning for 3D Hand-Object Reconstruction and Compositional Action Recognition from Egocentric RGB Videos Using Superquadrics	Tze Ho Elden Tse et.al.	2501.07100v1	null
2025-01-10	eKalibr: Dynamic Intrinsic Calibration for Event Cameras From First Principles of Events	Shuolong Chen et.al.	2501.05688v1	link
2025-01-09	Relative Pose Estimation through Affine Corrections of Monocular Depth Priors	Yifan Yu et.al.	2501.05446v1	link
2025-01-09	From Simple to Complex Skills: The Case of In-Hand Object Reorientation	Haozhi Qi et.al.	2501.05439v1	null
2025-01-11	Towards Balanced Continual Multi-Modal Learning in Human Pose Estimation	Jiaxuan Peng et.al.	2501.05264v2	null
2025-01-08	KN-LIO: Geometric Kinematics and Neural Field Coupled LiDAR-Inertial Odometry	Zhong Wang et.al.	2501.04263v1	null
2025-01-07	OmniManip: Towards General Robotic Manipulation via Object-Centric Interaction Primitives as Spatial Constraints	Mingjie Pan et.al.	2501.03841v1	null
2025-01-10	MC-VTON: Minimal Control Virtual Try-On Diffusion Transformer	Junsheng Luan et.al.	2501.03630v2	null
2025-01-07	TexHOI: Reconstructing Textures of 3D Unknown Objects in Monocular Hand-Object Interaction Scenes	Alakh Aggarwal et.al.	2501.03525v1	link
2025-01-06	Mobile Augmented Reality Framework with Fusional Localization and Pose Estimation	Songlin Hou et.al.	2501.03336v1	null
2025-01-06	SurgRIPE challenge: Benchmark of Surgical Robot Instrument Pose Estimation	Haozheng Xu et.al.	2501.02990v1	null
2025-01-06	HaWoR: World-Space Hand Motion Reconstruction from Egocentric Videos	Jinglei Zhang et.al.	2501.02973v1	null
2025-01-06	Spiking monocular event based 6D pose estimation for space application	Jonathan Courtois et.al.	2501.02916v1	null
2025-01-06	Universal Features Guided Zero-Shot Category-Level Object Pose Estimation	Wentian Qu et.al.	2501.02831v1	null
2025-01-06	Unsupervised Domain Adaptation for Occlusion Resilient Human Pose Estimation	Arindam Dutta et.al.	2501.02773v1	null
2025-01-06	WorldPose: A World Cup Dataset for Global 3D Human Pose Estimation	Tianjian Jiang et.al.	2501.02771v1	null
2025-01-05	LP-ICP: General Localizability-Aware Point Cloud Registration for Robust Localization in Extreme Unstructured Environments	Haosong Yue et.al.	2501.02580v1	link
2025-01-04	ROLO-SLAM: Rotation-Optimized LiDAR-Only SLAM in Uneven Terrain with Ground Vehicle	Yinchuan Wang et.al.	2501.02166v1	link
2025-01-03	TCPFormer: Learning Temporal Correlation with Implicit Pose Proxy for 3D Human Pose Estimation	Jiajie Liu et.al.	2501.01770v1	link
2025-01-03	Laparoscopic Scene Analysis for Intraoperative Visualisation of Gamma Probe Signals in Minimally Invasive Cancer Surgery	Baoru Huang et.al.	2501.01752v1	null
2025-01-03	Free-Form Motion Control: A Synthetic Video Generation Dataset with Controllable Camera and Object Motions	Xincheng Shuai et.al.	2501.01425v2	null
2025-01-02	On Unifying Video Generation and Camera Pose Estimation	Chun-Hao Paul Huang et.al.	2501.01409v1	null
2025-01-02	L3D-Pose: Lifting Pose for 3D Avatars from a Single Camera in the Wild	Soumyaratna Debnath et.al.	2501.01174v1	null
2024-12-31	Relative Pose Observability Analysis Using Dual Quaternions	Nicholas B. Andrews et.al.	2501.00657v1	null
2024-12-31	VinT-6D: A Large-Scale Object-in-hand Dataset from Vision, Touch and Proprioception	Zhaoliang Wan et.al.	2501.00510v1	null
2024-12-30	Hierarchical Pose Estimation and Mapping with Multi-Scale Neural Feature Fields	Evgenii Kruzhkov et.al.	2412.20976v1	null
2024-12-30	ReFlow6D: Refraction-Guided Transparent Object 6D Pose Estimation via Intermediate Representation Learning	Hrishikesh Gupta et.al.	2412.20830v1	link
2024-12-30	Frequency-aware Event Cloud Network	Hongwei Ren et.al.	2412.20803v1	null
2024-12-30	KeyGS: A Keyframe-Centric Gaussian Splatting Method for Monocular Image Sequences	Keng-Wei Chang et.al.	2412.20767v1	null
2024-12-30	Towards nation-wide analytical healthcare infrastructures: A privacy-preserving augmented knee rehabilitation case study	Boris Bačić et.al.	2412.20733v1	link
2024-12-29	Exploiting Aggregation and Segregation of Representations for Domain Adaptive Human Pose Estimation	Qucheng Peng et.al.	2412.20538v1	link
2024-12-28	MambaVO: Deep Visual Odometry Based on Sequential Matching Refinement and Training Smoothing	Shuo Wang et.al.	2412.20082v1	null
2024-12-28	GSplatLoc: Ultra-Precise Camera Localization via 3D Gaussian Splatting	Atticus J. Zeller et.al.	2412.20056v1	link
2024-12-27	Optimizing Local-Global Dependencies for Accurate 3D Human Pose Estimation	Guangsheng Xu et.al.	2412.19676v1	link
2024-12-27	Dust to Tower: Coarse-to-Fine Photo-Realistic Scene Reconstruction from Sparse Uncalibrated Images	Xudong Cai et.al.	2412.19518v1	null
2024-12-26	Humans as a Calibration Pattern: Dynamic 3D Scene Reconstruction from Unsynchronized and Uncalibrated Videos	Changwoon Choi et.al.	2412.19089v1	null
2024-12-23	Reconstructing People, Places, and Cameras	Lea Müller et.al.	2412.17806v1	link
2024-12-22	Leveraging Consistent Spatio-Temporal Correspondence for Robust Visual Odometry	Zhaoxing Zhang et.al.	2412.16923v1	link
2024-12-21	EasyVis2: A Real Time Multi-view 3D Visualization for Laparoscopic Surgery Training Enhanced by a Deep Neural Network YOLOv8-Pose	Yung-Hong Sun et.al.	2412.16742v1	null
2024-12-21	FACTS: Fine-Grained Action Classification for Tactical Sports	Christopher Lai et.al.	2412.16454v1	null
2024-12-20	Can Generative Video Models Help Pose Estimation?	Ruojin Cai et.al.	2412.16155v1	null
2024-12-20	Monkey Transfer Learning Can Improve Human Pose Estimation	Bradley Scott et.al.	2412.15966v1	null
2024-12-19	Scaling 4D Representations	João Carreira et.al.	2412.15212v1	null
2024-12-13	IMPROVE: Impact of Mobile Phones on Remote Online Virtual Education	Roberto Daza et.al.	2412.14195v1	link
2024-12-18	Level-Set Parameters: Novel Representation for 3D Shape Analysis	Huan Lei et.al.	2412.13502v1	null
2024-12-18	Pre-training a Density-Aware Pose Transformer for Robust LiDAR-based 3D Human Pose Estimation	Xiaoqi An et.al.	2412.13454v1	link
2024-12-17	CondiMen: Conditional Multi-Person Mesh Recovery	Brégier Romain et.al.	2412.13058v1	null
2024-12-17	ShotVL: Human-Centric Highlight Frame Retrieval via Language Queries	Wangyu Xue et.al.	2412.12675v1	null
2024-12-16	Category Level 6D Object Pose Estimation from a Single RGB Image using Diffusion	Adam Bethell et.al.	2412.11420v1	null
2024-12-13	ExeChecker: Where Did I Go Wrong?	Yiwen Gu et.al.	2412.10573v1	link
2024-12-11	CUPS: Improving Human Pose-Shape Estimators with Conformalized Deep Uncertainty	Harry Zhang et.al.	2412.10431v1	null
2024-12-13	RP-SLAM: Real-time Photorealistic SLAM with Efficient 3D Gaussian Splatting	Lizhi Bai et.al.	2412.09868v1	null
2024-12-12	Stereo4D: Learning How Things Move in 3D from Internet Stereo Videos	Linyi Jin et.al.	2412.09621v1	null
2024-12-12	FreeSplatter: Pose-free Gaussian Splatting for Sparse-view 3D Reconstruction	Jiale Xu et.al.	2412.09573v1	null
2024-12-11	BLADE: Single-view Body Mesh Learning through Accurate Depth Estimation	Shengze Wang et.al.	2412.08640v1	null
2024-12-12	Drift-free Visual SLAM using Digital Twins	Roxane Merat et.al.	2412.08496v2	null
2024-12-11	Reloc3r: Large-Scale Training of Relative Camera Pose Regression for Generalizable, Fast, and Accurate Visual Localization	Siyan Dong et.al.	2412.08376v1	link
2024-12-10	LoRA3D: Low-Rank Self-Calibration of 3D Geometric Foundation Models	Ziqi Lu et.al.	2412.07746v1	null
2024-12-09	MV-DUSt3R+: Single-Stage Scene Reconstruction from Sparse Views In 2 Seconds	Zhenggang Tang et.al.	2412.06974v1	null
2024-12-09	An Efficient Scene Coordinate Encoding and Relocalization Method	Kuan Xu et.al.	2412.06488v1	link
2024-12-09	Attention-Enhanced Lightweight Hourglass Network for Human Pose Estimation	Marsha Mariya Kappan et.al.	2412.06227v1	null
2024-12-06	CCS: Continuous Learning for Customized Incremental Wireless Sensing Services	Qunhang Fu et.al.	2412.04821v1	null
2024-12-05	ProPLIKS: Probabilistic 3D human body pose estimation	Karthik Shetty et.al.	2412.04665v1	null
2024-12-05	DualPM: Dual Posed-Canonical Point Maps for 3D Shape and Pose Reconstruction	Ben Kaye et.al.	2412.04464v1	null
2024-12-05	Targeted Hard Sample Synthesis Based on Estimated Pose and Occlusion Error for Improved Object Pose Estimation	Alan Li et.al.	2412.04279v1	null
2024-12-04	Sparse-view Pose Estimation and Reconstruction via Analysis by Generative Synthesis	Qitao Zhao et.al.	2412.03570v1	null
2024-12-06	NVComposer: Boosting Generative Novel View Synthesis with Multiple Sparse and Unposed Images	Lingen Li et.al.	2412.03517v2	null
2024-12-05	A Bidirectional Siamese Recurrent Neural Network for Accurate Gait Recognition Using Body Landmarks	Proma Hossain Progga et.al.	2412.03498v2	null
2024-12-04	MCVO: A Generic Visual Odometry for Arbitrarily Arranged Multi-Cameras	Huai Yu et.al.	2412.03146v1	link
2024-12-04	An indoor DSO-based ceiling-vision odometry system for indoor industrial environments	Abdelhak Bougouffa et.al.	2412.02950v1	null
2024-12-03	EgoCast: Forecasting Egocentric Human Pose in the Wild	Maria Escobar et.al.	2412.02903v1	null
2024-12-02	emg2pose: A Large and Diverse Benchmark for Surface Electromyographic Hand Pose Estimation	Sasha Salter et.al.	2412.02725v1	link
2024-12-03	ProbPose: A Probabilistic Approach to 2D Human Pose Estimation	Miroslav Purkrabek et.al.	2412.02254v1	link
2024-12-03	Cascaded Multi-Scale Attention for Enhanced Multi-Scale Feature Extraction and Interaction with Low-Resolution Images	Xiangyong Lu et.al.	2412.02197v1	link
2024-12-03	CLERF: Contrastive LEaRning for Full Range Head Pose Estimation	Ting-Ruen Wei et.al.	2412.02066v1	null
2024-12-02	Detection, Pose Estimation and Segmentation for Multiple Bodies: Closing the Virtuous Circle	Miroslav Purkrabek et.al.	2412.01562v1	link
2024-12-02	6DOPE-GS: Online 6D Object Pose Estimation using Gaussian Splatting	Yufeng Jin et.al.	2412.01543v1	null
2024-12-02	HandOS: 3D Hand Reconstruction in One Stage	Xingyu Chen et.al.	2412.01537v1	null
2024-12-02	SF-Loc: A Visual Mapping and Geo-Localization System based on Sparse Visual Structure Frames	Yuxuan Zhou et.al.	2412.01500v1	link
2024-12-02	MamKPD: A Simple Mamba Baseline for Real-Time 2D Keypoint Detection	Yonghao Dang et.al.	2412.01422v1	null
2024-12-02	Cross-Modal Visual Relocalization in Prior LiDAR Maps Utilizing Intensity Textures	Qiyuan Shen et.al.	2412.01299v1	null
2024-12-02	CRISP: Object Pose and Shape Estimation with Test-Time Adaptation	Jingnan Shi et.al.	2412.01052v1	null
2024-11-29	Diorama: Unleashing Zero-shot Single-view 3D Scene Modeling	Qirui Wu et.al.	2411.19492v1	null
2024-11-29	Multiview Equivariance Improves 3D Correspondence Understanding with Minimal Feature Finetuning	Yang You et.al.	2411.19458v1	link
2024-11-28	GMS-VINS: Multi-category Dynamic Objects Semantic Segmentation for Enhanced Visual-Inertial Odometry Using a Promptable Foundation Model	Rui Zhou et.al.	2411.19289v1	null
2024-11-28	HOT3D: Hand and Object Tracking in 3D from Egocentric Multi-View Videos	Prithviraj Banerjee et.al.	2411.19167v1	null
2024-11-28	Lost & Found: Updating Dynamic 3D Scene Graphs from Egocentric Observations	Tjark Behrens et.al.	2411.19162v1	link
2024-11-28	Distributed Dual Quaternion Extended Kalman Filtering for Spacecraft Pose Estimation	Mathias Hudoba de Badyn et.al.	2411.19033v1	null
2024-11-28	Waterfall Transformer for Multi-person Pose Estimation	Navin Ranjan et.al.	2411.18944v1	null
2024-12-02	AC3D: Analyzing and Improving 3D Camera Control in Video Diffusion Transformers	Sherwin Bahmani et.al.	2411.18673v2	null
2024-11-27	XR-MBT: Multi-modal Full Body Tracking for XR through Self-Supervision with Learned Depth Point Cloud Registration	Denys Rozumnyi et.al.	2411.18377v1	null
2024-11-27	Manual-PA: Learning 3D Part Assembly from Instruction Diagrams	Jiahao Zhang et.al.	2411.18011v1	null
2024-11-26	Self-supervised Monocular Depth and Pose Estimation for Endoscopy with Generative Latent Priors	Ziang Xu et.al.	2411.17790v1	null
2024-11-26	Geometric Point Attention Transformer for 3D Shape Reassembly	Jiahan Li et.al.	2411.17788v1	null
2024-11-26	RoboPEPP: Vision-Based Robot Pose and Joint Angle Estimation through Embedding Predictive Pre-Training	Raktim Gautam Goswami et.al.	2411.17662v1	link
2024-11-26	Communication-Efficient Cooperative SLAMMOT via Determining the Number of Collaboration Vehicles	Susu Fang et.al.	2411.17432v1	null
2024-11-26	Boost 3D Reconstruction using Diffusion-based Monocular Camera Calibration	Junyuan Deng et.al.	2411.17240v1	link
2024-11-28	SelfSplat: Pose-Free and 3D Prior-Free Generalizable 3D Gaussian Splatting	Gyeongjin Kang et.al.	2411.17190v3	null
2024-11-26	GMFlow: Global Motion-Guided Recurrent Flow for 6D Object Pose Estimation	Xin Liu et.al.	2411.17174v1	null
2024-11-25	Diffusion Features for Zero-Shot 6DoF Object Pose Estimation	Bernd Von Gimborn et.al.	2411.16668v1	null
2024-11-25	Edge Weight Prediction For Category-Agnostic Pose Estimation	Or Hirschorn et.al.	2411.16665v1	link
2024-11-25	SplatFlow: Multi-View Rectified Flow Model for 3D Gaussian Splatting Synthesis	Hyojun Go et.al.	2411.16443v1	link
2024-11-25	One Diffusion to Generate Them All	Duong H. Le et.al.	2411.16318v1	link
2024-11-25	UNOPose: Unseen Object Pose Estimation with an Unposed RGB-D Reference Image	Xingyu Liu et.al.	2411.16106v1	null
2024-11-24	Generalizable Single-view Object Pose Estimation by Two-side Generating and Matching	Yujing Sun et.al.	2411.15860v1	link
2024-11-24	PEnG: Pose-Enhanced Geo-Localisation	Tavis Shore et.al.	2411.15742v1	link
2024-11-22	Personalization of Wearable Sensor-Based Joint Kinematic Estimation Using Computer Vision for Hip Exoskeleton Applications	Changseob Song et.al.	2411.15366v1	null
2024-11-22	Enhancing Exploration with Diffusion Policies in Hybrid Off-Policy RL: Application to Non-Prehensile Manipulation	Huy Le et.al.	2411.14913v1	null
2024-11-22	mmWave Radar for Sit-to-Stand Analysis: A Comparative Study with Wearables and Kinect	Shuting Hu et.al.	2411.14656v1	null
2024-11-21	DINO-X: A Unified Vision Model for Open-World Object Detection and Understanding	Tianhe Ren et.al.	2411.14347v1	link
2024-11-21	SEMPose: A Single End-to-end Network for Multi-object Pose Estimation	Xin Liu et.al.	2411.14002v1	null
2024-11-21	Dehazing-aided Multi-Rate Multi-Modal Pose Estimation Framework for Mitigating Visual Disturbances in Extreme Underwater Domain	Vidya Sudevan et.al.	2411.13988v1	null
2024-11-21	Hybrid-Neuromorphic Approach for Underwater Robotics Applications: A Conceptual Framework	Vidya Sudevan et.al.	2411.13962v1	null
2024-11-20	Developing Normative Gait Cycle Parameters for Clinical Analysis Using Human Pose Estimation	Rahm Ranjan et.al.	2411.13716v1	null
2024-11-20	Robust SG-NeRF: Robust Scene Graph Aided Neural Surface Reconstruction	Yi Gu et.al.	2411.13620v1	null
2024-11-19	VioPose: Violin Performance 4D Pose Estimation by Hierarchical Audiovisual Inference	Seong Jong Yoo et.al.	2411.13607v1	link
2024-11-20	DATAP-SfM: Dynamic-Aware Tracking Any Point for Robust Structure from Motion in the Wild	Weicai Ye et.al.	2411.13291v1	null
2024-11-20	X as Supervision: Contending with Depth Ambiguity in Unsupervised Monocular 3D Pose Estimation	Yuchen Yang et.al.	2411.13026v1	link
2024-11-19	IoT-Based 3D Pose Estimation and Motion Optimization for Athletes: Application of C3D and OpenPose	Fei Ren et.al.	2411.12676v1	null
2024-11-15	SPARS3R: Semantic Prior Alignment and Regularization for Sparse 3D Reconstruction	Yutao Tang et.al.	2411.12592v1	link
2024-11-19	GLOVER: Generalizable Open-Vocabulary Affordance Reasoning for Task-Oriented Grasping	Teli Ma et.al.	2411.12286v1	null
2024-11-18	IKEA Manuals at Work: 4D Grounding of Assembly Instructions on Internet Videos	Yunong Liu et.al.	2411.11409v1	link
2024-11-15	USP-Gaussian: Unifying Spike-based Image Reconstruction, Pose Correction and Gaussian Splatting	Kang Chen et.al.	2411.10504v1	link
2024-11-13	ReMP: Reusable Motion Prior for Multi-domain 3D Human Pose Estimation and Motion Inbetweening	Hojun Jang et.al.	2411.09435v1	null
2024-11-13	Generalized Pose Space Embeddings for Training In-the-Wild using Analysis-by-Synthesis	Dominik Borer et.al.	2411.08603v1	null
2024-11-13	DG-SLAM: Robust Dynamic Gaussian Splatting SLAM with Hybrid Pose Optimization	Yueming Xu et.al.	2411.08373v1	null
2024-11-16	RINO: Accurate, Robust Radar-Inertial Odometry with Non-Iterative Estimation	Shuocheng Yang et.al.	2411.07699v2	link
2024-11-12	Human Arm Pose Estimation with a Shoulder-worn Force-Myography Device for Human-Robot Interaction	Rotem Atari et.al.	2411.07644v1	null
2024-11-12	Towards Seamless Integration of Magnetic Tracking into Fluoroscopy-guided Interventions	Shuwei Xing et.al.	2411.07495v1	null
2024-11-08	Acoustic-based 3D Human Pose Estimation Robust to Human Position	Yusuke Oumi et.al.	2411.07165v1	null
2024-11-11	CapeLLM: Support-Free Category-Agnostic Pose Estimation with Multimodal Large Language Models	Junho Kim et.al.	2411.06869v1	null
2024-11-11	GenZ-ICP: Generalizable and Degeneracy-Robust LiDAR Odometry Using an Adaptive Weighting	Daehan Lee et.al.	2411.06766v1	link
2024-11-11	GTA-Net: An IoT-Integrated 3D Human Pose Estimation System for Real-Time Adolescent Sports Posture Correction	Shizhe Yuan et.al.	2411.06725v1	null
2024-11-10	Magnetic Field Aided Vehicle Localization with Acceleration Correction	Mrunmayee Deshpande et.al.	2411.06543v1	null
2024-11-10	Visuotactile-Based Learning for Insertion with Compliant Hands	Osher Azulay et.al.	2411.06408v1	link
2024-11-08	Poze: Sports Technique Feedback under Data Constraints	Agamdeep Singh et.al.	2411.05734v1	null
2024-11-08	DeepArUco++: Improved detection of square fiducial markers in challenging lighting conditions	Rafael Berral-Soler et.al.	2411.05552v1	link
2024-11-08	Tightly-Coupled, Speed-aided Monocular Visual-Inertial Localization in Topological Map	Chanuk Yang et.al.	2411.05497v1	null
2024-11-08	Relative Pose Estimation for Nonholonomic Robot Formation with UWB-IO Measurements	Kunrui Ze et.al.	2411.05481v1	null
2024-11-07	Social EgoMesh Estimation	Luca Scofano et.al.	2411.04598v1	link
2024-11-07	Pose2Trajectory: Using Transformers on Body Pose to Predict Tennis Player's Trajectory	Ali K. AlShami et.al.	2411.04501v1	null
2024-11-08	SuperQ-GRASP: Superquadrics-based Grasp Pose Estimation on Larger Objects for Mobile-Manipulation	Xun Tu et.al.	2411.04386v2	null
2024-11-08	GS2Pose: Two-stage 6D Object Pose Estimation Guided by Gaussian Splatting	Jilan Mei et.al.	2411.03807v3	null
2024-11-06	Estimation of Psychosocial Work Environment Exposures Through Video Object Detection. Proof of Concept Using CCTV Footage	Claus D. Hansen et.al.	2411.03724v1	null
2024-11-05	Estimating Ego-Body Pose from Doubly Sparse Egocentric Video Data	Seunggeun Chi et.al.	2411.03561v1	null
2024-11-05	HFGaussian: Learning Generalizable Gaussian Human with Integrated Human Features	Arnab Dey et.al.	2411.03086v1	null
2024-11-04	Semantic Masking and Visual Feature Matching for Robust Localization	Luisa Mao et.al.	2411.01804v1	null
2024-11-03	Activating Self-Attention for Multi-Scene Absolute Pose Regression	Miso Lee et.al.	2411.01443v1	link
2024-11-04	3D Equivariant Pose Regression via Direct Wigner-D Harmonics Prediction	Jongmin Lee et.al.	2411.00543v2	null
2024-10-31	Whole-Herd Elephant Pose Estimation from Drone Data for Collective Behavior Analysis	Brody McNutt et.al.	2411.00196v1	null
2024-10-31	No Pose, No Problem: Surprisingly Simple 3D Gaussian Splats from Sparse Unposed Images	Botao Ye et.al.	2410.24207v1	link
2024-11-06	SceneComplete: Open-World 3D Scene Completion in Complex Real World Environments for Robot Manipulation	Aditya Agarwal et.al.	2410.23643v2	null
2024-10-30	SCRREAM: SCan, Register, REnder And Map: A Framework for Annotating Accurate and Dense 3D Indoor Scenes with a Benchmark	HyunJun Jung et.al.	2410.22715v1	link
2024-10-29	LiVisSfM: Accurate and Robust Structure-from-Motion with LiDAR and Visual Cues	Hanqing Jiang et.al.	2410.22213v1	null
2024-10-29	PF3plat: Pose-Free Feed-Forward 3D Gaussian Splatting	Sunghwan Hong et.al.	2410.22128v1	link
2024-10-29	HRPVT: High-Resolution Pyramid Vision Transformer for medium and small-scale human pose estimation	Zhoujie Xu et.al.	2410.22079v1	null
2024-10-29	EI-Nexus: Towards Unmediated and Flexible Inter-Modality Local Feature Extraction and Matching for Event-Image Data	Zhonghua Yi et.al.	2410.21743v1	link
2024-10-28	Synthetica: Large Scale Synthetic Data for Robot Perception	Ritvik Singh et.al.	2410.21153v1	null
2024-10-29	BLAPose: Enhancing 3D Human Pose Estimation with Bone Length Adjustment	Chih-Hsiang Hsu et.al.	2410.20731v2	link
2024-11-01	RopeTP: Global Human Motion Recovery via Integrating Robust Pose Estimation with Diffusion Trajectory Prior	Mingjiang Liang et.al.	2410.20358v2	null
2024-10-27	Harmony4D: A Video Dataset for In-The-Wild Close Human Interactions	Rawal Khirodkar et.al.	2410.20294v1	null
2024-10-26	Neural Fields in Robotics: A Survey	Muhammad Zubair Irshad et.al.	2410.20220v1	link
2024-10-25	DECADE: Towards Designing Efficient-yet-Accurate Distance Estimation Modules for Collision Avoidance in Mobile Advanced Driver Assistance Systems	Muhammad Zaeem Shahzad et.al.	2410.19336v1	null
2024-10-24	Where Am I and What Will I See: An Auto-Regressive Model for Spatial Localization and View Prediction	Junyi Chen et.al.	2410.18962v1	null
2024-10-24	VoxelKeypointFusion: Generalizable Multi-View Multi-Person Pose Estimation	Daniel Bermuth et.al.	2410.18723v1	link
2024-10-23	Robust Two-View Geometry Estimation with Implicit Differentiation	Vladislav Pyatov et.al.	2410.17983v1	link
2024-10-23	YOLOv11: An Overview of the Key Architectural Enhancements	Rahima Khanam et.al.	2410.17725v1	link
2024-10-21	Assisted Physical Interaction: Autonomous Aerial Robots with Neural Network Detection, Navigation, and Safety Layers	Andrea Berra et.al.	2410.15802v1	null
2024-10-21	ARTS: Semi-Analytical Regressor using Disentangled Skeletal Representations for Human Mesh Recovery from Videos	Tao Tang et.al.	2410.15582v1	link
2024-10-20	Neural Active Structure-from-Motion in Dark and Textureless Environment	Kazuto Ichimaru et.al.	2410.15378v1	null
2024-10-20	POSE: Pose estimation Of virtual Sync Exhibit system	Hao-Tang Tsui et.al.	2410.15343v1	link
2024-10-18	Graph Optimality-Aware Stochastic LiDAR Bundle Adjustment with Progressive Spatial Smoothing	Jianping Li et.al.	2410.14565v1	null
2024-10-18	Multi-modal Pose Diffuser: A Multimodal Generative Conditional Pose Prior	Calvin-Khang Ta et.al.	2410.14540v1	null
2024-10-18	Sim2real Cattle Joint Estimation in 3D point clouds	Okour Mohammad et.al.	2410.14419v1	null
2024-10-18	Unlabeled Action Quality Assessment Based on Multi-dimensional Adaptive Constrained Dynamic Time Warping	Renguang Chen et.al.	2410.14161v1	null
2024-10-15	From Real Artifacts to Virtual Reference: A Robust Framework for Translating Endoscopic Images	Junyang Wu et.al.	2410.13896v1	null
2024-10-17	DualQuat-LOAM: LiDAR Odometry and Mapping parametrized on Dual Quaternions	Edison P. Velasco-Sánchez et.al.	2410.13541v1	null
2024-10-17	Object Pose Estimation Using Implicit Representation For Transparent Objects	Varun Burde et.al.	2410.13465v1	null
2024-10-16	Optimizing Multi-Task Learning for Accurate Spacecraft Pose Estimation	Francesco Evangelisti et.al.	2410.12679v1	null
2024-10-15	Contrastive Touch-to-Touch Pretraining	Samanta Rodriguez et.al.	2410.11834v1	null
2024-10-18	X-Fi: A Modality-Invariant Foundation Model for Multimodal Human Sensing	Xinyan Chen et.al.	2410.10167v2	null
2024-10-13	Occluded Human Pose Estimation based on Limb Joint Augmentation	Gangtao Han et.al.	2410.09885v1	null
2024-10-12	Enhancing Single Image to 3D Generation using Gaussian Splatting and Hybrid Diffusion Priors	Hritam Basak et.al.	2410.09467v1	null
2024-10-12	Towards Multi-Modal Animal Pose Estimation: An In-Depth Analysis	Qianyi Deng et.al.	2410.09312v1	link
2024-10-11	CVAM-Pose: Conditional Variational Autoencoder for Multi-Object Monocular Pose Estimation	Jianyu Zhao et.al.	2410.09010v1	link
2024-10-11	Look Gauss, No Pose: Novel View Synthesis using Gaussian Splatting without Accurate Pose Initialization	Christian Schmidt et.al.	2410.08743v1	link
2024-10-10	Generalizing Stochastic Smoothing for Differentiation and Gradient Estimation	Felix Petersen et.al.	2410.08125v1	null
2024-10-10	Robotic framework for autonomous manipulation of laboratory equipment with different degrees of transparency via 6D pose estimation	Maria Makarova et.al.	2410.07801v1	null
2024-10-10	Optimal-State Dynamics Estimation for Physics-based Human Motion Capture from Videos	Cuong Le et.al.	2410.07795v1	link
2024-10-12	Autonomous Driving in Unstructured Environments: How Far Have We Come?	Chen Min et.al.	2410.07701v2	link
2024-10-10	Invisibility Cloak: Disappearance under Human Pose Estimation via Backdoor Attacks	Minxing Zhang et.al.	2410.07670v1	null
2024-10-09	OmniPose6D: Towards Short-Term Object Pose Tracking in Dynamic Scenes from Monocular RGB	Yunzhi Lin et.al.	2410.06694v1	null
2024-10-08	SpecTrack: Learned Multi-Rotation Tracking via Speckle Imaging	Ziyang Chen et.al.	2410.06028v1	link
2024-10-08	AIVIO: Closed-loop, Object-relative Navigation of UAVs with AI-aided Visual Inertial Odometry	Thomas Jantos et.al.	2410.05996v1	null
2024-10-08	Are Minimal Radial Distortion Solvers Necessary for Relative Pose Estimation?	Charalambos Tzamos et.al.	2410.05984v1	link
2024-10-08	FürElise: Capturing and Physically Synthesizing Hand Motions of Piano Performance	Ruocheng Wang et.al.	2410.05791v1	null
2024-10-07	Comparison of marker-less 2D image-based methods for infant pose estimation	Lennart Jahn et.al.	2410.04980v1	null
2024-10-06	Enhancing 3D Human Pose Estimation Amidst Severe Occlusion with Dual Transformer Fusion	Mehwish Ghafoor et.al.	2410.04574v1	link
2024-10-06	LiteVLoc: Map-Lite Visual Localization for Image Goal Navigation	Jianhao Jiao et.al.	2410.04419v1	null
2024-10-05	Test-Time Adaptation for Keypoint-Based Spacecraft Pose Estimation Based on Predicted-View Synthesis	Juan Ignacio Bravo Pérez-Villar et.al.	2410.04298v1	link
2024-10-05	A Framework for Reproducible Benchmarking and Performance Diagnosis of SLAM Systems	Nikola Radulov et.al.	2410.04242v1	link
2024-10-04	Unsupervised Prior Learning: Discovering Categorical Pose Priors from Videos	Ziyu Wang et.al.	2410.03858v1	null
2024-10-04	Universal Global State Estimation for Inertial Navigation Systems	Sifeddine Benahmed et.al.	2410.03846v1	null
2024-10-04	MonST3R: A Simple Approach for Estimating Geometry in the Presence of Motion	Junyi Zhang et.al.	2410.03825v1	null
2024-10-04	Dessie: Disentanglement for Articulated 3D Horse Shape and Pose Estimation from Images	Ci Li et.al.	2410.03438v1	null
2024-10-04	HRVMamba: High-Resolution Visual State Space Model for Dense Prediction	Hao Zhang et.al.	2410.03174v1	null
2024-10-04	CLIP-Clique: Graph-based Correspondence Matching Augmented by Vision Language Models for Object-based Global Localization	Shigemichi Matsuzaki et.al.	2410.03054v1	null
2024-10-03	Why Sample Space Matters: Keyframe Sampling Optimization for LiDAR-based Place Recognition	Nikolaos Stathoulopoulos et.al.	2410.02643v1	link
2024-10-03	Key-Grid: Unsupervised 3D Keypoints Detection using Grid Heatmap Features	Chengkai Hou et.al.	2410.02237v1	null
2024-10-02	SGBA: Semantic Gaussian Mixture Model-Based LiDAR Bundle Adjustment	Xingyu Ji et.al.	2410.01618v1	null
2024-10-02	SurgeoNet: Realtime 3D Pose Estimation of Articulated Surgical Instruments from Stereo Images using a Synthetically-trained Network	Ahmed Tawfik Aboukhadra et.al.	2410.01293v1	null
2024-10-01	Pose Estimation of Buried Deep-Sea Objects using 3D Vision Deep Learning Models	Jerry Yan et.al.	2410.01061v1	null
2024-10-01	RAD: A Dataset and Benchmark for Real-Life Anomaly Detection with Robotic Observations	Kaichen Zhou et.al.	2410.00713v1	link
2024-10-01	GERA: Geometric Embedding for Efficient Point Registration Analysis	Geng Li et.al.	2410.00589v1	null
2024-09-30	Continual Human Pose Estimation for Incremental Integration of Keypoints and Pose Variations	Muhammad Saif Ullah Khan et.al.	2409.20469v1	null
2024-09-30	Classroom-Inspired Multi-Mentor Distillation with Adaptive Learning Strategies	Shalini Sarode et.al.	2409.20237v1	null
2024-09-30	PuzzleBoard: A New Camera Calibration Pattern with Position Encoding	Peer Stelldinger et.al.	2409.20127v1	link
2024-09-30	Robust Gaussian Splatting SLAM by Leveraging Loop Closure	Zunjie Zhu et.al.	2409.20111v1	null
2024-09-30	GearTrack: Automating 6D Pose Estimation	Yu Deng et.al.	2409.19986v1	null
2024-09-29	PPLNs: Parametric Piecewise Linear Networks for Event-Based Temporal Modeling and Beyond	Chen Song et.al.	2409.19772v1	link
2024-09-29	GelSlim 4.0: Focusing on Touch and Reproducibility	Andrea Sipos et.al.	2409.19770v1	null
2024-09-27	Robust Proximity Operations using Probabilistic Markov Models	Deep Parikh et.al.	2409.19062v1	null
2024-09-27	Exploiting Motion Prior for Accurate Pose Estimation of Dashboard Cameras	Yipeng Lu et.al.	2409.18673v1	null
2024-09-27	DynaWeightPnP: Toward global real-time 3D-2D solver in PnP without correspondences	Jingwei Song et.al.	2409.18457v1	null
2024-09-30	Omni6D: Large-Vocabulary 3D Object Dataset for Category-Level 6D Object Pose Estimation	Mengchen Zhang et.al.	2409.18261v2	link
2024-09-26	AI-Powered Augmented Reality for Satellite Assembly, Integration and Test	Alvaro Patricio et.al.	2409.18101v1	null
2024-09-27	Leveraging Anthropometric Measurements to Improve Human Mesh Estimation and Ensure Consistent Body Shapes	Katja Ludwig et.al.	2409.17671v2	null
2024-09-25	Safe Leaf Manipulation for Accurate Shape and Pose Estimation of Occluded Fruits	Shaoxiong Yao et.al.	2409.17389v1	null
2024-09-25	Hierarchical Tri-manual Planning for Vision-assisted Fruit Harvesting with Quadrupedal Robots	Zhichao Liu et.al.	2409.17116v1	null
2024-09-25	Self-Sensing for Proprioception and Contact Detection in Soft Robots Using Shape Memory Alloy Artificial Muscles	Ran Jing et.al.	2409.17111v1	null
2024-09-25	Online 6DoF Pose Estimation in Forests using Cross-View Factor Graph Optimisation and Deep Learned Re-localisation	Lucas Carvalho de Lima et.al.	2409.16680v1	null
2024-09-25	FAFA: Frequency-Aware Flow-Aided Self-Supervision for Underwater Object Pose Estimation	Jingyi Tang et.al.	2409.16600v1	null
2024-09-25	Robo-Platform: A Robotic System for Recording Sensors and Controlling Robots	Masoud Dayani Najafabadi et.al.	2409.16595v1	link
2024-09-24	PseudoNeg-MAE: Self-Supervised Point Cloud Learning using Conditional Pseudo-Negative Embeddings	Sutharsan Mahendren et.al.	2409.15832v1	null
2024-09-24	LaPose: Laplacian Mixture Shape Modeling for RGB-Based Category-Level Object Pose Estimation	Ruida Zhang et.al.	2409.15727v1	link
2024-09-23	Framework for Robust Localization of UUVs and Mapping of Net Pens	David Botta et.al.	2409.15475v1	null
2024-09-23	FisheyeDepth: A Real Scale Self-Supervised Depth Estimation Model for Fisheye Camera	Guoyang Zhao et.al.	2409.15054v1	link
2024-09-23	BranchPoseNet: Characterizing tree branching with a deep learning-based pose estimation approach	Stefano Puliti et.al.	2409.14755v1	link
2024-09-23	ERPoT: Effective and Reliable Pose Tracking for Mobile Robots Based on Lightweight and Compact Polygon Maps	Haiming Gao et.al.	2409.14723v1	link
2024-09-22	Tactile Functasets: Neural Implicit Representations of Tactile Datasets	Sikai Li et.al.	2409.14592v1	null
2024-09-22	AR Overlay: Training Image Pose Estimation on Curved Surface in a Synthetic Way	Sining Huang et.al.	2409.14577v1	null
2024-09-22	DROP: Dexterous Reorientation via Online Planning	Albert H. Li et.al.	2409.14562v1	null
2024-09-21	Combining Absolute and Semi-Generalized Relative Poses for Visual Localization	Vojtech Panek et.al.	2409.14269v1	null
2024-09-18	SpotLight: Robotic Scene Understanding through Interaction and Affordance Detection	Tim Engelbracht et.al.	2409.11870v1	link
2024-09-18	End-to-End Probabilistic Geometry-Guided Regression for 6DoF Object Pose Estimation	Thomas Pöllabauer et.al.	2409.11819v1	null
2024-09-18	Bridging Domain Gap for Flight-Ready Spaceborne Vision	Tae Ha Park et.al.	2409.11661v1	null
2024-09-17	Good Grasps Only: A data engine for self-supervised fine-tuning of pose estimation using grasp poses for verification	Frederik Hagelskjær et.al.	2409.11512v1	null
2024-09-17	Training Datasets Generation for Machine Learning: Application to Vision Based Navigation	Jérémy Lebreton et.al.	2409.11383v1	null
2024-09-17	OmniGen: Unified Image Generation	Shitao Xiao et.al.	2409.11340v1	link
2024-09-17	ULOC: Learning to Localize in Complex Large-Scale Environments with Ultra-Wideband Ranges	Thien-Minh Nguyen et.al.	2409.11122v1	link
2024-09-17	Depth-based Privileged Information for Boosting 3D Human Pose Estimation on RGB	Alessandro Simoni et.al.	2409.11104v1	null
2024-09-21	HGSLoc: 3DGS-based Heuristic Camera Pose Refinement	Zhongyan Niu et.al.	2409.10925v2	null
2024-09-17	Pose estimation of CubeSats via sensor fusion and Error-State Extended Kalman Filter	Deep Parikh et.al.	2409.10815v1	null
2024-09-16	CtRNet-X: Camera-to-Robot Pose Estimation in Real-world Conditions Using a Single Camera	Jingpei Lu et.al.	2409.10441v1	null
2024-09-16	HiFi-CS: Towards Open Vocabulary Visual Grounding For Robotic Grasping Using Vision-Language Models	Vineet Bhat et.al.	2409.10419v1	link
2024-09-16	2D or not 2D: How Does the Dimensionality of Gesture Representation Affect 3D Co-Speech Gesture Generation?	Téo Guichoux et.al.	2409.10357v1	null
2024-09-16	Human Insights Driven Latent Space for Different Driving Perspectives: A Unified Encoder for Efficient Multi-Task Inference	Huy-Dung Nguyen et.al.	2409.10095v1	null
2024-09-15	Precise Pick-and-Place using Score-Based Diffusion Networks	Shih-Wei Guo et.al.	2409.09725v1	null
2024-09-15	Pre-Training for 3D Hand Pose Estimation with Contrastive Learning on Large-Scale Hand Images in the Wild	Nie Lin et.al.	2409.09714v1	null
2024-09-15	Proximity operations of CubeSats via sensor fusion of ultra-wideband range measurements with rate gyroscopes, accelerometers and monocular vision	Deep Parikh et.al.	2409.09665v1	null
2024-09-15	A Scalable Tabletop Satellite Automation Testbed:Design And Experiments	Deep Parikh et.al.	2409.09633v1	null
2024-09-14	MAC-VO: Metrics-aware Covariance for Learning-based Stereo Visual Odometry	Yuheng Qiu et.al.	2409.09479v1	null
2024-09-14	Distributed Invariant Kalman Filter for Object-level Multi-robot Pose SLAM	Haoying Li et.al.	2409.09410v1	null
2024-09-13	Causal Transformer for Fusion and Pose Estimation in Deep Visual Inertial Odometry	Yunus Bilge Kurt et.al.	2409.08769v1	link
2024-09-13	WheelPoser: Sparse-IMU Based Body Pose Estimation for Wheelchair Users	Yunzhi Li et.al.	2409.08494v1	link
2024-09-12	Bayesian Inverse Graphics for Few-Shot Concept Learning	Octavio Arriaga et.al.	2409.08351v1	link
2024-09-12	Touch2Touch: Cross-Modal Tactile Generation for Object Manipulation	Samanta Rodriguez et.al.	2409.08269v1	null
2024-09-12	Covariance Intersection-based Invariant Kalman Filtering(DInCIKF) for Distributed Pose Estimation	Haoying Li et.al.	2409.07933v1	null
2024-09-12	GateAttentionPose: Enhancing Pose Estimation with Agent Attention and Improved Gated Convolutions	Liang Feng et.al.	2409.07798v1	null
2024-09-12	GatedUniPose: A Novel Approach for Pose Estimation Combining UniRepLKNet and Gated Convolution	Liang Feng et.al.	2409.07752v1	null
2024-09-11	FaVoR: Features via Voxel Rendering for Camera Relocalization	Vincenzo Polizzi et.al.	2409.07571v1	link
2024-09-11	Benchmarking 2D Egocentric Hand Pose Datasets	Olga Taran et.al.	2409.07337v1	null
2024-09-11	iKalibr-RGBD: Partially-Specialized Target-Free Visual-Inertial Spatiotemporal Calibration For RGBDs via Continuous-Time Velocity Estimation	Shuolong Chen et.al.	2409.07116v1	link
2024-09-11	Equivariant Filter for Tightly Coupled LiDAR-Inertial Odometry	Anbo Tao et.al.	2409.06948v1	null
2024-09-13	A Bayesian framework for active object recognition, pose estimation and shape transfer learning through touch	Haodong Zheng et.al.	2409.06912v2	null
2024-09-11	Alignist: CAD-Informed Orientation Distribution Estimation by Fusing Shape and Correspondences	Shishir Reddy Vutukur et.al.	2409.06683v2	link
2024-09-10	PoseEmbroider: Towards a 3D, Visual, Semantic-aware Human Pose Representation	Ginger Delmas et.al.	2409.06535v1	null
2024-09-10	Test-Time Certifiable Self-Supervision to Bridge the Sim2Real Gap in Event-Based Satellite Pose Estimation	Mohsi Jawaid et.al.	2409.06240v1	null
2024-09-09	From Words to Poses: Enhancing Novel Object Pose Estimation with Vision Language Models	Tessa Pulli et.al.	2409.05413v1	null
2024-09-08	HelmetPoser: A Helmet-Mounted IMU Dataset for Data-Driven Estimation of Human Head Motion in Diverse Conditions	Jianping Li et.al.	2409.05006v1	null
2024-09-06	Casper DPM: Cascaded Perceptual Dynamic Projection Mapping onto Hands	Yotam Erel et.al.	2409.04397v1	null
2024-09-06	GST: Precise 3D Human Body from a Single Image with Gaussian Splatting Transformers	Lorenza Prospero et.al.	2409.04196v1	link
2024-09-06	Dense Hand-Object(HO) GraspNet with Full Grasping Taxonomy and Dynamics	Woojin Cho et.al.	2409.04033v1	null
2024-09-06	Matched Filtering based LiDAR Place Recognition for Urban and Natural Environments	Therese Joseph et.al.	2409.03998v1	null
2024-09-09	The Influence of Faulty Labels in Data Sets on Human Pose Estimation	Arnold Schwarz et.al.	2409.03887v2	null
2024-09-05	MaskVal: Simple but Effective Uncertainty Quantification for 6D Pose Estimation	Philipp Quentin et.al.	2409.03556v1	null
2024-09-05	UAV (Unmanned Aerial Vehicles): Diverse Applications of UAV Datasets in Segmentation, Classification, Detection, and Tracking	Md. Mahfuzur Rahman et.al.	2409.03245v1	null
2024-09-01	Recoverable Anonymization for Pose Estimation: A Privacy-Enhancing Approach	Wenjun Huang et.al.	2409.02715v1	null
2024-09-04	Object Gaussian for Monocular 6D Pose Estimation from Sparse Views	Luqing Luo et.al.	2409.02581v1	null
2024-09-03	EgoPressure: A Dataset for Hand Pressure and Pose Estimation in Egocentric Vision	Yiming Zhao et.al.	2409.02224v1	null
2024-09-03	Deep learning for objective estimation of Parkinsonian tremor severity	Felipe Duque-Quiceno et.al.	2409.02011v1	null
2024-09-03	SPiKE: 3D Human Pose from Point Cloud Sequences	Irene Ballester et.al.	2409.01879v1	link
2024-09-02	Kalman Filtering for Precise Indoor Position and Orientation Estimation Using IMU and Acoustics on Riemannian Manifolds	Mohammed H. AlSharif et.al.	2409.01002v1	null
2024-09-01	Detection, Recognition and Pose Estimation of Tabletop Objects	Sanjuksha Nirgude et.al.	2409.00869v1	null
2024-09-01	DSLO: Deep Sequence LiDAR Odometry Based on Inconsistent Spatio-temporal Propagation	Huixin Zhang et.al.	2409.00744v1	link
2024-09-01	MoManifold: Learning to Measure 3D Human Motion via Decoupled Joint Acceleration Manifolds	Ziqiang Dang et.al.	2409.00736v1	null
2024-08-31	ActionPose: Pretraining 3D Human Pose Estimation with the Dark Knowledge of Action	Longyun Liao et.al.	2409.00449v1	null
2024-09-04	Augmented Reality without Borders: Achieving Precise Localization Without Maps	Albert Gassol Puigjaner et.al.	2408.17373v3	null
2024-08-30	BOP-D: Revisiting 6D Pose Estimation Benchmark for Better Evaluation under Visual Ambiguities	Boris Meden et.al.	2408.17297v1	null
2024-08-30	EMHI: A Multimodal Egocentric Human Motion Dataset with HMD and Body-Worn IMUs	Zhen Fan et.al.	2408.17168v1	null
2024-09-01	Generic Objects as Pose Probes for Few-Shot View Synthesis	Zhirui Gao et.al.	2408.16690v2	null
2024-08-29	OP-Align: Object-level and Part-level Alignment for Self-supervised Category-level Articulated Object Pose Estimation	Yuchen Che et.al.	2408.16547v1	link
2024-08-29	GRPose: Learning Graph Relations for Human Image Generation with Pose Priors	Xiangchen Yin et.al.	2408.16540v1	link
2024-08-28	Are Pose Estimators Ready for the Open World? STAGE: Synthetic Data Generation Toolkit for Auditing 3D Human Pose Estimators	Nikita Kister et.al.	2408.16536v1	null
2024-08-28	Multi-view Pose Fusion for Occlusion-Aware 3D Human Pose Estimation	Laura Bragagnolo et.al.	2408.15810v1	link
2024-08-30	Addressing the challenges of loop detection in agricultural environments	Nicolás Soncini et.al.	2408.15761v2	link
2024-08-28	Str-L Pose: Integrating Point and Structured Line for Relative Pose Estimation in Dual-Graph	Zherong Zhang et.al.	2408.15750v1	null
2024-08-28	Benchmarking ML Approaches to UWB-Based Range-Only Posture Recognition for Human Robot-Interaction	Salma Salimi et.al.	2408.15717v1	null
2024-08-26	Bengali Sign Language Recognition through Hand Pose Estimation using Multi-Branch Spatial-Temporal Attention Model	Abu Saleh Musa Miah et.al.	2408.14111v1	null
2024-08-25	InterTrack: Tracking Human Object Interaction without Object Templates	Xianghui Xie et.al.	2408.13953v1	null
2024-08-24	Temporally-consistent 3D Reconstruction of Birds	Johannes Hägerlind et.al.	2408.13629v1	null
2024-08-24	Explainable Convolutional Networks for Crater Detection and Lunar Landing Navigation	Jianing Song et.al.	2408.13587v1	null
2024-08-27	Sapiens: Foundation for Human Vision Models	Rawal Khirodkar et.al.	2408.12569v3	null
2024-08-21	GaussianOcc: Fully Self-supervised and Efficient 3D Occupancy Estimation with Gaussian Splatting	Wanshui Gan et.al.	2408.11447v1	link
2024-08-20	GSLoc: Efficient Camera Pose Refinement via 3D Gaussian Splatting	Changkun Liu et.al.	2408.11085v1	link
2024-08-20	ZebraPose: Zebra Detection and Pose Estimation using only Synthetic Data	Elia Bonetto et.al.	2408.10831v1	null
2024-08-20	MPL: Lifting 3D Human Pose from Multi-view 2D Poses	Seyed Abolfazl Ghasemzadeh et.al.	2408.10805v1	link
2024-08-19	RUMI: Rummaging Using Mutual Information	Sheng Zhong et.al.	2408.10450v1	null
2024-08-19	SpaRP: Fast 3D Object Reconstruction and Pose Estimation from Sparse Views	Chao Xu et.al.	2408.10195v1	null
2024-08-19	SHARP: Segmentation of Hands and Arms by Range using Pseudo-Depth for Enhanced Egocentric 3D Hand Pose Estimation and Action Recognition	Wiktor Mucha et.al.	2408.10037v1	link
2024-08-19	Pose-GuideNet: Automatic Scanning Guidance for Fetal Head Ultrasound from Pose Estimation	Qianhui Men et.al.	2408.09931v1	null
2024-08-18	OPPH: A Vision-Based Operator for Measuring Body Movements for Personal Healthcare	Chen Long-fei et.al.	2408.09409v1	null
2024-08-17	An Open-Source American Sign Language Fingerspell Recognition and Semantic Pose Retrieval Interface	Kevin Jose Thomas et.al.	2408.09311v1	link
2024-08-16	ADen: Adaptive Density Representations for Sparse-view Camera Pose Estimation	Hao Tang et.al.	2408.09042v1	null
2024-08-16	Correspondence-Guided SfM-Free 3D Gaussian Splatting for NVS	Wei Sun et.al.	2408.08723v1	null
2024-08-16	SketchRef: A Benchmark Dataset and Evaluation Metrics for Automated Sketch Synthesis	Xingyue Lin et.al.	2408.08623v1	null
2024-08-15	HyperTaxel: Hyper-Resolution for Taxel-Based Tactile Signals Through Contrastive Learning	Hongyu Li et.al.	2408.08312v1	null
2024-08-15	Comparative Evaluation of 3D Reconstruction Methods for Object Pose Estimation	Varun Burde et.al.	2408.08234v1	link
2024-08-15	Towards Practical Human Motion Prediction with LiDAR Point Clouds	Xiao Han et.al.	2408.08202v1	null
2024-08-15	Your Turn: Real-World Turning Angle Estimation for Parkinson's Disease Severity Assessment	Qiushuo Cheng et.al.	2408.08182v1	null
2024-08-15	Polaris: Open-ended Interactive Robotic Manipulation via Syn2Real Visual Grounding and Large Language Models	Tianyu Wang et.al.	2408.07975v1	null
2024-08-15	GOReloc: Graph-based Object-Level Relocalization for Visual SLAM	Yutong Wang et.al.	2408.07917v1	link
2024-08-13	Grasping by Hanging: a Learning-Free Grasping Detection Method for Previously Unseen Objects	Wanze Li et.al.	2408.06734v1	null
2024-08-13	A Miniature Vision-Based Localization System for Indoor Blimps	Shicong Ma et.al.	2408.06648v1	null
2024-08-12	UniT: Unified Tactile Representation for Robot Learning	Zhengtong Xu et.al.	2408.06481v1	link
2024-08-12	Moo-ving Beyond Tradition: Revolutionizing Cattle Behavioural Phenotyping with Pose Estimation Techniques	Navid Ghassemi et.al.	2408.06336v1	null
2024-08-12	CAD-Mesher: A Convenient, Accurate, Dense Mesh-based Mapping Module in SLAM for Dynamic Environments	Yanpeng Jia et.al.	2408.05981v1	null
2024-08-12	PAFormer: Part Aware Transformer for Person Re-identification	Hyeono Jung et.al.	2408.05918v1	null
2024-08-11	SABER-6D: Shape Representation Based Implicit Object Pose Estimation	Shishir Reddy Vutukur et.al.	2408.05867v1	null
2024-08-10	Visual SLAM with 3D Gaussian Primitives and Depth Priors Enabling Novel View Synthesis	Zhongche Qu et.al.	2408.05635v1	null
2024-08-10	Anticipation through Head Pose Estimation: a preliminary study	Federico Figari Tomenotti et.al.	2408.05516v1	null
2024-08-09	Mesh-based Object Tracking for Dynamic Semantic 3D Scene Graphs via Ray Tracing	Lennart Niecksch et.al.	2408.04979v1	null
2024-08-07	PoseMamba: Monocular 3D Human Pose Estimation with Bidirectional Global-Local Spatio-Temporal State Space Model	Yunlong Huang et.al.	2408.03540v1	link
2024-08-06	Line-based 6-DoF Object Pose Estimation and Tracking With an Event Camera	Zibin Liu et.al.	2408.03225v1	link
2024-08-06	Training on the Fly: On-device Self-supervised Learning aboard Nano-drones within 20 mW	Elia Cereda et.al.	2408.03168v1	null
2024-08-06	BodySLAM: A Generalized Monocular Visual SLAM Framework for Surgical Applications	G. Manni et.al.	2408.03078v1	link
2024-08-07	Pose Magic: Efficient and Temporally Consistent Human Pose Estimation with a Hybrid Mamba-GCN Network	Xinyi Zhang et.al.	2408.02922v2	null
2024-08-05	Analyzing Data Efficiency and Performance of Machine Learning Algorithms for Assessing Low Back Pain Physical Rehabilitation Exercises	Aleksa Marusic et.al.	2408.02855v1	null
2024-08-05	Joint-Motion Mutual Learning for Pose Estimation in Videos	Sifan Wu et.al.	2408.02285v1	null
2024-08-04	AvatarPose: Avatar-guided 3D Pose Estimation of Close Human Interaction from Sparse Multi-view Videos	Feichi Lu et.al.	2408.02110v1	null
2024-08-04	Generalized Maximum Likelihood Estimation for Perspective-n-Point Problem	Tian Zhan et.al.	2408.01945v1	null
2024-08-03	MotionTrace: IMU-based Field of View Prediction for Smartphone AR Interactions	Rahul Islam et.al.	2408.01850v1	null
2024-08-03	BEVPlace++: Fast, Robust, and Lightweight LiDAR Global Localization for Unmanned Ground Vehicles	Lun Luo et.al.	2408.01841v1	link
2024-08-03	E$^3$NeRF: Efficient Event-Enhanced Neural Radiance Fields from Blurry Images	Yunshan Qi et.al.	2408.01840v1	null
2024-08-03	Survey on Emotion Recognition through Posture Detection and the possibility of its application in Virtual Reality	Leina Elansary et.al.	2408.01728v1	null
2024-08-03	Stimulating Imagination: Towards General-purpose Object Rearrangement	Jianyang Wu et.al.	2408.01655v1	null
2024-08-02	Full-range Head Pose Geometric Data Augmentations	Huei-Chung Hu et.al.	2408.01566v1	null
2024-07-31	Adapting Skills to Novel Grasps: A Self-Supervised Approach	Georgios Papagiannis et.al.	2408.00178v1	null
2024-07-31	Certifying Robustness of Learning-Based Keypoint Detection and Pose Estimation Methods	Xusheng Luo et.al.	2408.00117v1	null
2024-07-30	StackFLOW: Monocular Human-Object Reconstruction by Stacked Normalizing Flow with Offset	Chaofan Huo et.al.	2407.20545v1	link
2024-07-30	HandDAGT: A Denoising Adaptive Graph Transformer for 3D Hand Pose Estimation	Wencan Cheng et.al.	2407.20542v1	link
2024-07-30	Markers Identification for Relative Pose Estimation of an Uncooperative Target	Batu Candan et.al.	2407.20515v1	null
2024-07-29	BaseBoostDepth: Exploiting Larger Baselines For Self-supervised Monocular Depth Estimation	Kieran Saunders et.al.	2407.20437v1	null
2024-07-28	Skeleton-based Group Activity Recognition via Spatial-Temporal Panoramic Graph	Zhengcen Li et.al.	2407.19497v1	link
2024-07-26	Flexible graph convolutional network for 3D human pose estimation	Abu Taib Mohammed Shahjahan et.al.	2407.19077v1	link
2024-07-26	From 2D to 3D: AISG-SLA Visual Localization Challenge	Jialin Gao et.al.	2407.18590v1	null
2024-07-28	HumanVid: Demystifying Training Data for Camera-controllable Human Image Animation	Zhenzhi Wang et.al.	2407.17438v2	link
2024-07-24	Active Loop Closure for OSM-guided Robotic Mapping in Large-Scale Urban Environments	Wei Gao et.al.	2407.17078v1	null
2024-07-30	DreamCar: Leveraging Car-specific Prior for in-the-wild 3D Car Reconstruction	Xiaobiao Du et.al.	2407.16988v2	link
2024-07-24	Pose Estimation from Camera Images for Underwater Inspection	Luyuan Peng et.al.	2407.16961v1	null
2024-07-23	COALA: A Practical and Vision-Centric Federated Learning Platform	Weiming Zhuang et.al.	2407.16560v1	link
2024-07-23	Probabilistic Parameter Estimators and Calibration Metrics for Pose Estimation from Image Features	Romeo Valentin et.al.	2407.16223v1	null
2024-07-23	Optimal camera-robot pose estimation in linear time from points and lines	Guangyang Zeng et.al.	2407.16151v1	null
2024-07-23	3D-UGCN: A Unified Graph Convolutional Network for Robust 3D Human Pose Estimation from Monocular RGB Images	Jie Zhao et.al.	2407.16137v1	null
2024-07-21	CatVTON: Concatenation Is All You Need for Virtual Try-On with Diffusion Models	Zheng Chong et.al.	2407.15886v1	link
2024-07-22	RADA: Robust and Accurate Feature Learning with Domain Adaptation	Jingtai He et.al.	2407.15791v1	null
2024-07-22	Local Occupancy-Enhanced Object Grasping with Multiple Triplanar Projection	Kangqi Ma et.al.	2407.15771v1	null
2024-07-22	6DGS: 6D Pose Estimation from a Single Image and a 3D Gaussian Splatting Model	Matteo Bortolon et.al.	2407.15484v1	null
2024-07-23	Domain-Adaptive 2D Human Pose Estimation via Dual Teachers in Extremely Low-Light Conditions	Yihao Ai et.al.	2407.15451v2	link
2024-07-22	avaTTAR: Table Tennis Stroke Training with On-body and Detached Visualization in Augmented Reality	Dizhi Ma et.al.	2407.15373v1	null
2024-07-20	From Underground Mines to Offices: A Versatile and Robust Framework for Range-Inertial SLAM	Lorenzo Montano-Oliván et.al.	2407.14797v1	null
2024-07-19	ESCAPE: Energy-based Selective Adaptive Correction for Out-of-distribution 3D Human Pose Estimation	Luke Bidulka et.al.	2407.14605v1	null
2024-07-19	6DoF Head Pose Estimation through Explicit Bidirectional Interaction with Face Geometry	Sungho Chun et.al.	2407.14136v1	link
2024-07-18	RT-Pose: A 4D Radar Tensor-based 3D Human Pose Estimation and Localization Benchmark	Yuan-Hao Ho et.al.	2407.13930v1	null
2024-07-19	GlobalPointer: Large-Scale Plane Adjustment with Bi-Convex Relaxation	Bangyan Liao et.al.	2407.13537v2	link
2024-07-18	SCAPE: A Simple and Strong Category-Agnostic Pose Estimator	Yujia Liang et.al.	2407.13483v1	link
2024-07-17	SG-NeRF: Neural Surface Reconstruction with Scene Graph Optimization	Yiyang Chen et.al.	2407.12667v1	link
2024-07-17	Invertible Neural Warp for NeRF	Shin-Fang Chng et.al.	2407.12354v1	null
2024-07-16	NeuSurfEmb: A Complete Pipeline for Dense Correspondence-based 6D Object Pose Estimation without CAD Models	Francesco Milano et.al.	2407.12207v1	link
2024-07-16	Monocular pose estimation of articulated surgical instruments in open surgery	Robert Spektor et.al.	2407.12138v1	null
2024-07-17	GV-Bench: Benchmarking Local Feature Matching for Geometric Verification of Long-term Loop Closure Detection	Jingwen Yu et.al.	2407.11736v2	link
2024-07-16	TCFormer: Visual Recognition via Token Clustering Transformer	Wang Zeng et.al.	2407.11321v1	link
2024-07-15	A BlueROV2-based platform for underwater mapping experiments	Tudor Alinei-Poiana et.al.	2407.10901v1	link
2024-07-15	LVCP: LiDAR-Vision Tightly Coupled Collaborative Real-time Relative Positioning	Zhuozhu Jian et.al.	2407.10782v1	null
2024-07-15	Domain Generalization for 6D Pose Estimation Through NeRF-based Image Synthesis	Antoine Legrand et.al.	2407.10762v1	null
2024-07-16	GTPT: Group-based Token Pruning Transformer for Efficient Human Pose Estimation	Haonan Wang et.al.	2407.10756v2	null
2024-07-15	Learning to Estimate the Pose of a Peer Robot in a Camera Image by Predicting the States of its LEDs	Nicholas Carlotti et.al.	2407.10661v1	null
2024-07-15	Deep-Learning-Based Markerless Pose Estimation Systems in Gait Analysis: DeepLabCut Custom Training and the Refinement Function	Giulia Panconi et.al.	2407.10590v1	null
2024-07-14	3D Foundation Models Enable Simultaneous Geometry and Pose Estimation of Grasped Objects	Weiming Zhi et.al.	2407.10331v1	null
2024-07-16	psifx -- Psychological and Social Interactions Feature Extraction Package	Guillaume Rochette et.al.	2407.10266v2	null
2024-07-14	PAFUSE: Part-based Diffusion for 3D Whole-Body Pose Estimation	Nermin Samet et.al.	2407.10220v1	link
2024-07-14	3DEgo: 3D Editing on the Go!	Umar Khalid et.al.	2407.10102v1	null
2024-07-12	iNeMo: Incremental Neural Mesh Models for Robust Class-Incremental Learning	Tom Fischer et.al.	2407.09271v1	link
2024-07-12	HUP-3D: A 3D multi-view synthetic dataset for assisted-egocentric hand-ultrasound pose estimation	Manuel Birlo et.al.	2407.09215v1	null
2024-07-12	KGpose: Keypoint-Graph Driven End-to-End Multi-Object 6D Pose Estimation via Point-Wise Pose Voting	Andrew Jeong et.al.	2407.08909v1	null
2024-07-11	RTMW: Real-Time Multi-Person 2D and 3D Whole-body Pose Estimation	Tao Jiang et.al.	2407.08634v1	link
2024-07-11	SRPose: Two-view Relative Pose Estimation with Sparse Keypoints	Rui Yin et.al.	2407.08199v1	link
2024-07-11	SGLC: Semantic Graph-Guided Coarse-Fine-Refine Full Loop Closing for LiDAR SLAM	Neng Wang et.al.	2407.08106v1	link
2024-07-10	RoCap: A Robotic Data Collection Pipeline for the Pose Estimation of Appearance-Changing Objects	Jiahao Nick Li et.al.	2407.08081v1	null
2024-07-10	Hybrid Structure-from-Motion and Camera Relocalization for Enhanced Egocentric Localization	Jinjie Mai et.al.	2407.08023v1	link
2024-07-10	Greit-HRNet: Grouped Lightweight High-Resolution Network for Human Pose Estimation	Junjia Han et.al.	2407.07389v1	null
2024-07-09	Category-level Object Detection, Pose Estimation and Reconstruction from Stereo Images	Chuanrui Zhang et.al.	2407.06984v1	null
2024-07-09	Computer vision tasks for intelligent aerospace missions: An overview	Huilin Chen et.al.	2407.06513v1	null
2024-07-08	GeoNLF: Geometry guided Pose-Free Neural LiDAR Fields	Weiyi Xue et.al.	2407.05597v1	null
2024-07-10	On the power of data augmentation for head pose estimation	Michael Welter et.al.	2407.05357v2	link
2024-07-07	SCIPaD: Incorporating Spatial Clues into Unsupervised Pose-Depth Joint Learning	Yi Feng et.al.	2407.05283v1	link
2024-07-05	Unsupervised Learning of Category-Level 3D Pose from Object-Centric Videos	Leonhard Sommer et.al.	2407.04384v1	link
2024-07-04	Towards Cross-View-Consistent Self-Supervised Surround Depth Estimation	Laiyan Ding et.al.	2407.04041v1	link
2024-07-04	Markerless Multi-view 3D Human Pose Estimation: a survey	Ana Filipa Rodrigues Nogueira et.al.	2407.03817v1	null
2024-07-04	A Fast Dynamic Point Detection Method for LiDAR-Inertial Odometry in Driving Scenarios	Zikang Yuan et.al.	2407.03590v1	link
2024-07-03	Graph and Skipped Transformer: Exploiting Spatial and Temporal Modeling Capacities for Efficient 3D Human Pose Estimation	Mengmeng Cui et.al.	2407.02990v1	null
2024-07-03	Free-SurGS: SfM-Free 3D Gaussian Splatting for Surgical Scene Reconstruction	Jiaxin Guo et.al.	2407.02918v1	link
2024-07-02	SUPER: Seated Upper Body Pose Estimation using mmWave Radars	Bo Zhang et.al.	2407.02455v1	null
2024-07-02	ReliaAvatar: A Robust Real-Time Avatar Animator with Integrated Motion Prediction	Bo Qian et.al.	2407.02129v1	null
2024-07-02	Joint-Dataset Learning and Cross-Consistent Regularization for Text-to-Motion Retrieval	Nicola Messina et.al.	2407.02104v1	null
2024-07-01	Active Human Pose Estimation via an Autonomous UAV Agent	Jingxi Chen et.al.	2407.01811v1	null
2024-07-01	RoDyn-SLAM: Robust Dynamic Dense RGB-D SLAM with Neural Radiance Fields	Haochen Jiang et.al.	2407.01303v1	link
2024-07-01	Collaborative Graph Exploration with Reduced Pose-SLAM Uncertainty via Submodular Optimization	Ruofei Bai et.al.	2407.01013v1	link
2024-06-30	Ego-to-Exo: Interfacing Third Person Visuals from Egocentric Views in Real-time for Improved ROV Teleoperation	Adnan Abdullah et.al.	2407.00848v1	null
2024-06-29	When Robots Get Chatty: Grounding Multimodal Human-Robot Conversation and Collaboration	Philipp Allgeuer et.al.	2407.00518v1	link
2024-06-28	Assistive Image Annotation Systems with Deep Learning and Natural Language Capabilities: A Review	Moseli Mots'oehli et.al.	2407.00252v1	null
2024-06-28	EPOCH: Jointly Estimating the 3D Pose of Cameras and Humans	Nicola Garau et.al.	2406.19726v1	null
2024-06-28	CLOi-Mapper: Consistent, Lightweight, Robust, and Incremental Mapper With Embedded Systems for Commercial Robot Services	DongKi Noh et.al.	2406.19634v1	null
2024-06-27	Multimodal Visual-haptic pose estimation in the presence of transient occlusion	Michael Zechmair et.al.	2406.19323v1	null
2024-06-27	Human Modelling and Pose Estimation Overview	Pawel Knap et.al.	2406.19290v1	null
2024-06-26	Towards Human-Level 3D Relative Pose Estimation: Generalizable, Training-Free, with Single Reference	Yuan Gao et.al.	2406.18453v1	link
2024-06-27	Automatic infant 2D pose estimation from videos: comparing seven deep neural network methods	Filipe Gama et.al.	2406.17382v2	null
2024-06-24	High-resolution open-vocabulary object 6D pose estimation	Jaime Corsetti et.al.	2406.16384v1	null
2024-06-23	Breaking the Frame: Image Retrieval by Visual Overlap Prediction	Tong Wei et.al.	2406.16204v1	link
2024-06-21	Efficient Human Pose Estimation: Leveraging Advanced Techniques with MediaPipe	Sandeep Singh Sengar et.al.	2406.15649v1	link
2024-06-24	Investigating the impact of 2D gesture representation on co-speech gesture generation	Teo Guichoux et.al.	2406.15111v2	null
2024-06-20	Benchmarking Monocular 3D Dog Pose Estimation Using In-The-Wild Motion Capture Data	Moira Shooter et.al.	2406.14412v1	null
2024-06-20	PoseBench: Benchmarking the Robustness of Pose Estimation Models under Corruptions	Sihan Ma et.al.	2406.14367v1	null
2024-06-19	NeRF-Feat: 6D Object Pose Estimation using Feature Rendering	Shishir Reddy Vutukur et.al.	2406.13796v1	null
2024-06-19	CNN Based Flank Predictor for Quadruped Animal Species	Vanessa Suessle et.al.	2406.13588v1	null
2024-06-19	MVSBoost: An Efficient Point Cloud-based 3D Reconstruction	Umair Haroon et.al.	2406.13515v1	null
2024-06-19	An Efficient yet High-Performance Method for Precise Radar-Based Imaging of Human Hand Poses	Johanna Bräunig et.al.	2406.13464v1	null
2024-06-18	Head Pose Estimation and 3D Neural Surface Reconstruction via Monocular Camera in situ for Navigation and Safe Insertion into Natural Openings	Ruijie Tang et.al.	2406.13048v1	null
2024-06-17	Matching Query Image Against Selected NeRF Feature for Efficient and Scalable Localization	Huaiji Zhou et.al.	2406.11766v1	null
2024-06-17	Domain Generalization for In-Orbit 6D Pose Estimation	Antoine Legrand et.al.	2406.11743v1	null
2024-06-17	SeamPose: Repurposing Seams as Capacitive Sensors in a Shirt for Upper-Body Pose Tracking	Tianhong Catherine Yu et.al.	2406.11645v1	null
2024-06-14	Galibr: Targetless LiDAR-Camera Extrinsic Calibration Method via Ground Plane Initialization	Wonho Song et.al.	2406.11599v1	null
2024-06-15	MMVR: Millimeter-wave Multi-View Radar Dataset and Benchmark for Indoor Perception	M. Mahbubur Rahman et.al.	2406.10708v1	link
2024-06-15	Improving Ab-Initio Cryo-EM Reconstruction with Semi-Amortized Pose Inference	Shayan Shekarforoush et.al.	2406.10455v1	null
2024-06-14	The BabyView dataset: High-resolution egocentric videos of infants' and young children's everyday experiences	Bria Long et.al.	2406.10447v1	null
2024-06-14	OpenCapBench: A Benchmark to Bridge Pose Estimation and Biomechanics	Yoni Gozlan et.al.	2406.09788v1	null
2024-06-13	ImageNet3D: Towards General-Purpose Object-Level 3D Understanding	Wufei Ma et.al.	2406.09613v1	link
2024-06-13	Deep Transformer Network for Monocular Pose Estimation of Ship-Based UAV	Maneesha Wickramasuriya et.al.	2406.09260v1	link
2024-06-14	Language-Driven Closed-Loop Grasping with Model-Predictive Trajectory Replanning	Huy Hoang Nguyen et.al.	2406.09039v2	null
2024-06-14	VisionLLM v2: An End-to-End Generalist Multimodal Large Language Model for Hundreds of Vision-Language Tasks	Jiannan Wu et.al.	2406.08394v2	link
2024-06-12	Asymptotic Unbiased Sample Sampling to Speed Up Sharpness-Aware Minimization	Jiaxin Deng et.al.	2406.08001v1	null
2024-06-12	IFTD: Image Feature Triangle Descriptor for Loop Detection in Driving Scenes	Fengtian Lang et.al.	2406.07937v1	link
2024-06-12	From Variance to Veracity: Unbundling and Mitigating Gradient Variance in Differentiable Bundle Adjustment Layers	Swaminathan Gurumurthy et.al.	2406.07785v1	link
2024-06-12	SPIN: Spacecraft Imagery for Navigation	Javier Montalvo et.al.	2406.07500v2	link
2024-06-11	Realistic Data Generation for 6D Pose Estimation of Surgical Instruments	Juan Antonio Barragan et.al.	2406.07328v1	link
2024-06-11	SignMusketeers: An Efficient Multi-Stream Approach for Sign Language Translation at Scale	Shester Gueuwou et.al.	2406.06907v1	null
2024-06-10	Multicam-SLAM: Non-overlapping Multi-camera SLAM for Indirect Visual Localization and Navigation	Shenghao Li et.al.	2406.06374v1	link
2024-06-08	A preprocessing-based planning framework for utilizing contacts in high-precision insertion tasks	Muhammad Suhail Saleem et.al.	2406.05522v1	null
2024-06-06	GLACE: Global Local Accelerated Coordinate Encoding	Fangjinhua Wang et.al.	2406.04340v1	link
2024-06-06	Omni6DPose: A Benchmark and Model for Universal 6D Object Pose Estimation and Tracking	Jiyao Zhang et.al.	2406.04316v1	null
2024-06-05	Hi5: 2D Hand Pose Estimation with Zero Human Annotation	Masum Hasan et.al.	2406.03599v1	null
2024-06-05	Sparse Color-Code Net: Real-Time RGB-Based 6D Object Pose Estimation on Edge Devices	Xingjian Yang et.al.	2406.02977v1	null
2024-06-04	CamCo: Camera-Controllable 3D-Consistent Image-to-Video Generation	Dejia Xu et.al.	2406.02509v1	null
2024-06-04	HPE-CogVLM: New Head Pose Grounding Task Exploration on Vision Language Model	Yu Tian et.al.	2406.01914v1	null
2024-06-03	A Robust Filter for Marker-less Multi-person Tracking in Human-Robot Interaction Scenarios	Enrico Martini et.al.	2406.01832v1	link
2024-06-01	Equivariant amortized inference of poses for cryo-EM	Larissa de Ruijter et.al.	2406.01630v1	null
2024-06-03	3D WholeBody Pose Estimation based on Semantic Graph Attention Network and Distance Information	Sihan Wen et.al.	2406.01196v1	null
2024-06-01	CapeX: Category-Agnostic Pose Estimation from Textual Point Explanation	Matan Rusanovsky et.al.	2406.00384v1	link
2024-05-30	Estimating Human Poses Across Datasets: A Unified Skeleton and Multi-Teacher Distillation Approach	Muhammad Saif Ullah Khan et.al.	2405.20084v1	null
2024-05-30	TAMBRIDGE: Bridging Frame-Centered Tracking and 3D Gaussian Splatting for Enhanced SLAM	Peifeng Jiang et.al.	2405.19614v1	null
2024-05-29	Real-Time Dynamic Robot-Assisted Hand-Object Interaction via Motion Primitives	Mingqi Yuan et.al.	2405.19531v1	null
2024-05-29	Exploring AI-based Anonymization of Industrial Image and Video Data in the Context of Feature Preservation	Sabrina Cynthia Triess et.al.	2405.19173v1	null
2024-05-28	World Models for General Surgical Grasping	Hongbin Lin et.al.	2405.17940v1	null
2024-05-27	MoSca: Dynamic Gaussian Fusion from Casual Videos via 4D Motion Scaffolds	Jiahui Lei et.al.	2405.17421v1	link
2024-05-27	Occlusion Handling in 3D Human Pose Estimation with Perturbed Positional Encoding	Niloofar Azizi et.al.	2405.17397v1	null
2024-05-27	$\text{Di}^2\text{Pose}$: Discrete Diffusion Model for Occluded 3D Human Pose Estimation	Weiquan Wang et.al.	2405.17016v1	null
2024-05-27	Clustering-based Learning for UAV Tracking and Pose Estimation	Jiaping Xiao et.al.	2405.16867v1	null
2024-05-26	Multi-Modal UAV Detection, Classification and Tracking Algorithm -- Technical Report for CVPR 2024 UG2 Challenge	Tianchen Deng et.al.	2405.16464v1	link
2024-05-25	Intensity and Texture Correction of Omnidirectional Image Using Camera Images for Indirect Augmented Reality	Hakim Ikebayashi et.al.	2405.16008v1	null
2024-05-23	CoPeD-Advancing Multi-Robot Collaborative Perception: A Comprehensive Dataset in Real-World Environments	Yang Zhou et.al.	2405.14731v1	link
2024-05-23	Segformer++: Efficient Token-Merging Strategies for High-Resolution Semantic Segmentation	Daniel Kienzle et.al.	2405.14467v1	link
2024-05-21	Geometric Transformation Uncertainty for Improving 3D Fetal Brain Pose Prediction from Freehand 2D Ultrasound Videos	Jayroop Ramesh et.al.	2405.13235v1	link
2024-05-21	Leveraging Neural Radiance Fields for Pose Estimation of an Unknown Space Object during Proximity Operations	Antoine Legrand et.al.	2405.12728v1	null
2024-05-21	PoseGravity: Pose Estimation from Points and Lines with Axis Prior	Akshay Chandrasekhar et.al.	2405.12646v1	link
2024-05-19	Focus on Low-Resolution Information: Multi-Granular Information-Lossless Model for Low-Resolution Human Pose Estimation	Zejun Gu et.al.	2405.12247v1	null
2024-05-20	AutoSoccerPose: Automated 3D posture Analysis of Soccer Shot Movements	Calvin Yeung et.al.	2405.12070v1	link
2024-05-19	Advancing 6-DoF Instrument Pose Estimation in Variable X-Ray Imaging Geometries	Christiaan G. A. Viviers et.al.	2405.11677v1	link
2024-05-19	Cross-Domain Knowledge Distillation for Low-Resolution Human Pose Estimation	Zejun Gu et.al.	2405.11448v1	null
2024-05-18	PS6D: Point Cloud Based Symmetry-Aware 6D Object Pose Estimation in Robot Bin-Picking	Yifan Yang et.al.	2405.11257v1	null
2024-05-18	MotionGS: Compact Gaussian Splatting SLAM by Motion Filter	Xinli Guo et.al.	2405.11129v1	link
2024-05-17	Resolving Symmetry Ambiguity in Correspondence-based Methods for Instance-level Object Pose Estimation	Yongliang Lin et.al.	2405.10557v1	null
2024-05-16	Diversity-Aware Sign Language Production through a Pose Encoding Variational Autoencoder	Mohamed Ilyes Lakhal et.al.	2405.10423v1	null
2024-05-17	Toon3D: Seeing Cartoons from a New Perspective	Ethan Weber et.al.	2405.10320v2	null
2024-05-15	Task-adaptive Q-Face	Haomiao Sun et.al.	2405.09059v1	null
2024-05-14	RDPN6D: Residual-based Dense Point-wise Network for 6Dof Object Pose Estimation Based on RGB-D Images	Zong-Wei Hong et.al.	2405.08483v1	link
2024-05-14	TP3M: Transformer-based Pseudo 3D Image Matching with Reference	Liming Han et.al.	2405.08434v1	null
2024-05-13	Deep Learning-Based Object Pose Estimation: A Comprehensive Survey	Jian Liu et.al.	2405.07801v1	link
2024-05-13	JointLoc: A Real-time Visual Localization Framework for Planetary UAVs Based on Joint Relative and Absolute Pose Estimation	Xubo Luo et.al.	2405.07429v1	link
2024-05-11	TD-NeRF: Novel Truncated Depth Prior for Joint Camera Pose and Neural Radiance Field Optimization	Zhen Tan et.al.	2405.07027v1	link
2024-05-11	AHPPEBot: Autonomous Robot for Tomato Harvesting based on Phenotyping and Pose Estimation	Xingxu Li et.al.	2405.06959v1	null
2024-05-10	CasCalib: Cascaded Calibration for Motion Capture from Sparse Unsynchronized Cameras	James Tang et.al.	2405.06845v1	link
2024-05-10	MGS-SLAM: Monocular Sparse Tracking and Gaussian Mapping with Depth Smooth Regularization	Pengcheng Zhu et.al.	2405.06241v1	null
2024-05-10	Free-Moving Object Reconstruction and Pose Estimation with Virtual Camera	Haixin Shi et.al.	2405.05858v2	null
2024-05-09	Semi-Autonomous Laparoscopic Robot Docking with Learned Hand-Eye Information Fusion	Huanyu Tian et.al.	2405.05817v1	null
2024-05-09	NeuRSS: Enhancing AUV Localization and Bathymetric Mapping with Neural Rendering for Sidescan SLAM	Yiping Xie et.al.	2405.05807v1	null
2024-05-09	Benchmarking Neural Radiance Fields for Autonomous Robots: An Overview	Yuhang Ming et.al.	2405.05526v1	null
2024-05-08	Adversary-Guided Motion Retargeting for Skeleton Anonymization	Thomas Carr et.al.	2405.05428v1	null
2024-05-08	FinePOSE: Fine-Grained Prompt-Driven 3D Human Pose Estimation via Diffusion Models	Jinglin Xu et.al.	2405.05216v1	link
2024-05-08	ProbRadarM3F: mmWave Radar based Human Skeletal Pose Estimation with Probability Map Guided Multi-Format Feature Fusion	Bing Zhu et.al.	2405.05164v1	null
2024-05-08	GISR: Geometric Initialization and Silhouette-based Refinement for Single-View Robot Pose and Configuration Estimation	Ivan Bilić et.al.	2405.04890v1	null
2024-05-07	Learning Distributional Demonstration Spaces for Task-Specific Cross-Pose Estimation	Jenny Wang et.al.	2405.04609v1	null
2024-05-07	Bayesian Simultaneous Localization and Multi-Lane Tracking Using Onboard Sensors and a SD Map	Yuxuan Xia et.al.	2405.04290v1	null
2024-05-07	Speak the Same Language: Global LiDAR Registration on BIM Using Pose Hough Transform	Zhijian Qiao et.al.	2405.03969v1	null
2024-05-07	Joint Estimation of Identity Verification and Relative Pose for Partial Fingerprints	Xiongjun Guan et.al.	2405.03959v1	link
2024-05-06	Pose Priors from Language Models	Sanjay Subramanian et.al.	2405.03689v1	null
2024-05-06	Optimizing Hand Region Detection in MediaPipe Holistic Full-Body Pose Estimation to Improve Accuracy and Avoid Downstream Errors	Amit Moryossef et.al.	2405.03545v1	link
2024-05-05	Multi-hop graph transformer network for 3D human pose estimation	Zaedul Islam et.al.	2405.03055v1	null
2024-05-05	Blending Distributed NeRFs with Tri-stage Robust Pose Optimization	Baijun Ye et.al.	2405.02880v1	null
2024-05-03	WeightedPose: Generalizable Cross-Pose Estimation via Weighted SVD	Xuxin Cheng et.al.	2405.02241v1	link
2024-05-03	Probablistic Restoration with Adaptive Noise Sampling for 3D Human Pose Estimation	Xianzhou Zeng et.al.	2405.02114v1	link
2024-05-03	An Onboard Framework for Staircases Modeling Based on Point Clouds	Chun Qing et.al.	2405.01918v1	null
2024-05-06	ShadowNav: Autonomous Global Localization for Lunar Navigation in Darkness	Deegan Atha et.al.	2405.01673v2	null
2024-05-02	IntervenGen: Interventional Data Generation for Robust and Data-Efficient Robot Imitation Learning	Ryan Hoque et.al.	2405.01472v1	null
2024-05-02	Behavior Imitation for Manipulator Control and Grasping with Deep Reinforcement Learning	Liu Qiyuan et.al.	2405.01284v1	null
2024-05-02	Sports Analysis and VR Viewing System Based on Player Tracking and Pose Estimation with Multimodal and Multiview Sensors	Wenxuan Guo et.al.	2405.01112v1	null
2024-05-02	CoViS-Net: A Cooperative Visual Spatial Foundation Model for Multi-Robot Applications	Jan Blumenkamp et.al.	2405.01107v1	null
2024-05-04	HandSSCA: 3D Hand Mesh Reconstruction with State Space Channel Attention from RGB images	Zixun Jiao et.al.	2405.01066v2	null
2024-05-01	Radar-Based Localization For Autonomous Ground Vehicles In Suburban Neighborhoods	Andrew J. Kramer et.al.	2405.00600v1	null
2024-04-30	Ultra Inertial Poser: Scalable Motion Capture and Tracking from Sparse Inertial Sensors and Ultra-Wideband Ranging	Rayan Armani et.al.	2404.19541v1	link
2024-04-30	UniFS: Universal Few-shot Instance Perception with Point Representations	Sheng Jin et.al.	2404.19401v1	link
2024-04-30	Quater-GCN: Enhancing 3D Human Pose Estimation with Orientation and Semi-supervised Training	Xingyu Song et.al.	2404.19279v1	link
2024-04-30	XFeat: Accelerated Features for Lightweight Image Matching	Guilherme Potje et.al.	2404.19174v1	null
2024-04-29	Self-Avatar Animation in Virtual Reality: Impact of Motion Signals Artifacts on the Full-Body Pose Reconstruction	Antoine Maiorca et.al.	2404.18628v1	null
2024-04-29	Mesh-based Photorealistic and Real-time 3D Mapping for Robust Visual Perception of Autonomous Underwater Vehicle	Jungwoo Lee et.al.	2404.18395v1	null
2024-04-29	Reconstructing Satellites in 3D from Amateur Telescope Images	Zhiming Chang et.al.	2404.18394v1	null
2024-04-27	Hybrid 3D Human Pose Estimation with Monocular Video and Sparse IMUs	Yiming Bao et.al.	2404.17837v1	null
2024-04-26	Localization Through Particle Filter Powered Neural Network Estimated Monocular Camera Poses	Yi Shen et.al.	2404.17685v1	null
2024-04-26	SLAM for Indoor Mapping of Wide Area Construction Environments	Vincent Ress et.al.	2404.17215v1	null
2024-04-25	WheelPose: Data Synthesis Techniques to Improve Pose Estimation Performance on Wheelchair Users	William Huang et.al.	2404.17063v1	link
2024-04-25	Transformer-Based Local Feature Matching for Multimodal Image Registration	Remi Delaunay et.al.	2404.16802v1	null
2024-04-25	DeepKalPose: An Enhanced Deep-Learning Kalman Filter for Temporally Consistent Monocular Vehicle Pose Estimation	Leandro Di Bella et.al.	2404.16558v1	null
2024-04-25	Efficient Solution of Point-Line Absolute Pose	Petr Hruby et.al.	2404.16552v1	link
2024-04-25	COBRA -- COnfidence score Based on shape Regression Analysis for method-independent quality assessment of object pose estimation from single images	Panagiotis Sapoutzoglou et.al.	2404.16471v1	link
2024-04-25	MegaParticles: Range-based 6-DoF Monte Carlo Localization with GPU-Accelerated Stein Particle Filter	Kenji Koide et.al.	2404.16370v1	null
2024-04-24	3D Human Pose Estimation with Occlusions: Introducing BlendMimic3D Dataset and GCN Refinement	Filipa Lino et.al.	2404.16136v1	link
2024-04-23	SMPLer: Taming Transformers for Monocular 3D Human Shape and Pose Estimation	Xiangyu Xu et.al.	2404.15276v1	link
2024-04-25	Domain adaptive pose estimation via multi-level alignment	Yugan Chen et.al.	2404.14885v2	link
2024-04-23	Semi-supervised 2D Human Pose Estimation via Adaptive Keypoint Masking	Kexin Meng et.al.	2404.14835v1	null
2024-04-23	UPose3D: Uncertainty-Aware 3D Human Pose Estimation with Cross-View and Temporal Cues	Vandad Davoodnia et.al.	2404.14634v1	null
2024-04-22	DHRNet: A Dual-Path Hierarchical Relation Network for Multi-Person Pose Estimation	Yonghao Dang et.al.	2404.14025v1	link
2024-04-23	CT-NeRF: Incremental Optimizing Neural Radiance Field and Poses with Complex Trajectory	Yunlong Ran et.al.	2404.13896v2	null
2024-04-21	Resampling-free Particle Filters in High-dimensions	Akhilan Boopathy et.al.	2404.13698v1	link
2024-04-20	EC-SLAM: Real-time Dense Neural RGB-D SLAM System with Effectively Constrained Global Bundle Adjustment	Guanghao Li et.al.	2404.13346v1	link
2024-04-18	Spot-Compose: A Framework for Open-Vocabulary Object Retrieval and Drawer Manipulation in Point Clouds	Oliver Lemke et.al.	2404.12440v1	null
2024-04-18	Gait Recognition from Highly Compressed Videos	Andrei Niculae et.al.	2404.12183v1	null
2024-04-17	Mushroom Segmentation and 3D Pose Estimation from Point Clouds using Fully Convolutional Geometric Features and Implicit Pose Encoding	George Retsinas et.al.	2404.12144v1	link
2024-04-17	Kathakali Hand Gesture Recognition With Minimal Data	Kavitha Raju et.al.	2404.11205v1	null
2024-04-17	GeoReF: Geometric Alignment Across Shape Variation for Category-level Object Pose Refinement	Linfang Zheng et.al.	2404.11139v1	null
2024-04-17	CorrNet+: Sign Language Recognition and Translation via Spatial-Temporal Correlation	Lianyu Hu et.al.	2404.11111v1	link
2024-04-16	HumMUSS: Human Motion Understanding using State Space Models	Arnab Kumar Mondal et.al.	2404.10880v1	null
2024-04-16	Invariant Kalman Filtering with Noise-Free Pseudo-Measurements	Sven Goffin et.al.	2404.10687v1	null
2024-04-16	The Unreasonable Effectiveness of Pre-Trained Features for Camera Pose Refinement	Gabriele Trivigno et.al.	2404.10438v1	null
2024-04-16	GaitPoint+: A Gait Recognition Network Incorporating Point Cloud Analysis and Recycling	Huantao Ren et.al.	2404.10213v1	null
2024-04-16	LWIRPOSE: A novel LWIR Thermal Image Dataset and Benchmark	Avinash Upadhyay et.al.	2404.10212v1	link
2024-04-15	LetsGo: Large-Scale Garage Modeling and Rendering via LiDAR-Assisted Gaussian Primitives	Jiadi Cui et.al.	2404.09748v1	null
2024-04-14	In My Perspective, In My Hands: Accurate Egocentric 2D Hand Pose and Action Recognition	Wiktor Mucha et.al.	2404.09308v1	link
2024-04-13	DeDoDe v2: Analyzing and Improving the DeDoDe Keypoint Detector	Johan Edstedt et.al.	2404.08928v1	link
2024-04-16	3D Human Scan With A Moving Event Camera	Kai Kohyama et.al.	2404.08504v2	null
2024-04-11	Separated Attention: An Improved Cycle GAN Based Under Water Image Enhancement Method	Tashmoy Ghosh et.al.	2404.07649v1	null
2024-04-11	GLID: Pre-training a Generalist Encoder-Decoder Vision Model	Jihao Liu et.al.	2404.07603v1	null
2024-04-10	Measuring proximity to standard planes during fetal brain ultrasound scanning	Chiara Di Vece et.al.	2404.07124v1	null
2024-04-10	MoCap-to-Visual Domain Adaptation for Efficient Human Mesh Estimation from 2D Keypoints	Bedirhan Uguz et.al.	2404.07094v1	null
2024-04-10	Gaussian-LIC: Photo-realistic LiDAR-Inertial-Camera SLAM with 3D Gaussian Splatting	Xiaolei Lang et.al.	2404.06926v1	null
2024-04-09	Matching 2D Images in 3D: Metric Relative Pose from Metric Correspondences	Axel Barroso-Laguna et.al.	2404.06337v1	link
2024-04-09	Incremental Joint Learning of Depth, Pose and Implicit Scene Representation on Monocular Camera in Large-scale Scenes	Tianchen Deng et.al.	2404.06050v1	null
2024-04-08	Learning 3D-Aware GANs from Unposed Images with Template Feature Field	Xinya Chen et.al.	2404.05705v1	null
2024-04-08	Learning a Category-level Object Pose Estimator without Pose Annotations	Fengrui Tian et.al.	2404.05626v1	null
2024-04-08	DepthMOT: Depth Cues Lead to a Strong Multi-Object Tracker	Jiapeng Wu et.al.	2404.05518v1	link
2024-04-08	Two Hands Are Better Than One: Resolving Hand to Hand Intersections via Occupancy Networks	Maksym Ivashechkin et.al.	2404.05414v1	null
2024-04-08	STITCH: Augmented Dexterity for Suture Throws Including Thread Coordination and Handoffs	Kush Hari et.al.	2404.05151v1	null
2024-04-05	ToolEENet: Tool Affordance 6D Pose Estimation	Yunlong Wang et.al.	2404.04193v1	null
2024-04-04	SDPose: Tokenized Pose Estimation via Circulation-Guide Self-Distillation	Sichen Chen et.al.	2404.03518v1	link
2024-04-04	Multi Positive Contrastive Learning with Pose-Consistent Generated Images	Sho Inayoshi et.al.	2404.03256v1	null
2024-04-04	HandDiff: 3D Hand Pose Estimation with Diffusion on Image-Point Cloud	Wencan Cheng et.al.	2404.03159v1	link
2024-04-03	Fusing Multi-sensor Input with State Information on TinyML Brains for Autonomous Nano-drones	Luca Crupi et.al.	2404.02567v1	null
2024-04-03	Semi-Supervised Unconstrained Head Pose Estimation in the Wild	Huayi Zhou et.al.	2404.02544v1	link
2024-04-02	3D Congealing: 3D-Aware Image Alignment in the Wild	Yunzhi Zhang et.al.	2404.02125v1	null
2024-04-02	SelfPose3d: Self-Supervised Multi-Person Multi-View 3d Pose Estimation	Vinkle Srivastav et.al.	2404.02041v1	link
2024-04-01	Marrying NeRF with Feature Matching for One-step Pose Estimation	Ronghan Chen et.al.	2404.00891v1	null
2024-03-31	Graph-Based vs. Error State Kalman Filter-Based Fusion Of 5G And Inertial Data For MAV Indoor Pose Estimation	Meisam Kabiri et.al.	2404.00691v1	null
2024-03-31	OmniLocalRF: Omnidirectional Local Radiance Fields from Dynamic Videos	Dongyoung Choi et.al.	2404.00676v1	null
2024-04-02	KTPFormer: Kinematics and Trajectory Prior Knowledge-Enhanced Transformer for 3D Human Pose Estimation	Jihua Peng et.al.	2404.00658v2	link
2024-03-29	FetalDiffusion: Pose-Controllable 3D Fetal MRI Synthesis with Conditional Diffusion Model	Molin Zhang et.al.	2404.00132v1	null
2024-03-29	Latent Embedding Clustering for Occlusion Robust Head Pose Estimation	José Celestino et.al.	2403.20251v1	null
2024-03-29	A Unified Framework for Human-centric Point Cloud Video Understanding	Yiteng Xu et.al.	2403.20031v1	null
2024-04-01	Video-Based Human Pose Regression via Decoupled Space-Time Aggregation	Jijie He et.al.	2403.19926v2	link
2024-03-28	Instance-Adaptive and Geometric-Aware Keypoint Learning for Category-Level 6D Object Pose Estimation	Xiao Lin et.al.	2403.19527v1	link
2024-03-27	Object Pose Estimation via the Aggregation of Diffusion Features	Tianfu Wang et.al.	2403.18791v1	link
2024-03-27	RoboKeyGen: Robot Pose and Joint Angles Estimation via Diffusion-based 3D Keypoint Generation	Yang Tian et.al.	2403.18259v1	null
2024-03-26	Mathematical Foundation and Corrections for Full Range Head Pose Estimation	Huei-Chung Hu et.al.	2403.18104v1	null
2024-03-26	EgoPoseFormer: A Simple Baseline for Egocentric 3D Human Pose Estimation	Chenhongyi Yang et.al.	2403.18080v1	link
2024-03-26	A Survey on 3D Egocentric Human Pose Estimation	Md Mushfiqur Azam et.al.	2403.17893v1	link
2024-03-26	GTA-HDR: A Large-Scale Synthetic Dataset for HDR Image Reconstruction	Hrishav Bakul Barua et.al.	2403.17837v1	link
2024-03-26	DiffH2O: Diffusion-Based Synthesis of Hand-Object Interactions from Textual Descriptions	Sammy Christen et.al.	2403.17827v1	null
2024-03-26	System Calibration of a Field Phenotyping Robot with Multiple High-Precision Profile Laser Scanners	Felix Esser et.al.	2403.17788v1	null
2024-03-25	Animal Avatars: Reconstructing Animatable 3D Animals from Casual Videos	Remy Sabathier et.al.	2403.17103v1	link
2024-03-25	Characterisation of the Intel RealSense D415 Stereo Depth Camera for Motion-Corrected CT Perfusion Imaging	Mahdieh Dashtbani Moghari et.al.	2403.16490v1	null
2024-03-25	Benchmarks and Challenges in Pose Estimation for Egocentric Hand Interactions with Objects	Zicong Fan et.al.	2403.16428v1	link
2024-03-25	A Geometric Perspective on Fusing Gaussian Distributions on Lie Groups	Yixiao Ge et.al.	2403.16411v1	null
2024-03-25	ASDF: Assembly State Detection Utilizing Late Fusion by Integrating 6D Pose Estimation	Hannah Schieber et.al.	2403.16400v1	link
2024-03-24	KITchen: A Real-World Benchmark and Dataset for 6D Object Pose Estimation in Kitchen Environments	Abdelrahman Younes et.al.	2403.16238v1	null
2024-03-24	Diffusion Model is a Good Pose Estimator from 3D RF-Vision	Junqiao Fan et.al.	2403.16198v1	null
2024-03-23	UPNeRF: A Unified Framework for Monocular 3D Object Reconstruction and Pose Estimation	Yuliang Guo et.al.	2403.15705v1	link
2024-03-22	InterFusion: Text-Driven Generation of 3D Human-Object Interaction	Sisi Dai et.al.	2403.15612v1	link
2024-03-22	Augmented Reality Warnings in Roadway Work Zones: Evaluating the Effect of Modality on Worker Reaction Times	Sepehr Sabeti et.al.	2403.15571v1	null
2024-03-22	Gesture-Controlled Aerial Robot Formation for Human-Swarm Interaction in Safety Monitoring Applications	Vít Krátký et.al.	2403.15333v1	null
2024-03-22	WSCLoc: Weakly-Supervised Sparse-View Camera Relocalization	Jialu Wang et.al.	2403.15272v1	null
2024-03-22	DITTO: Demonstration Imitation by Trajectory Transformation	Nick Heppert et.al.	2403.15203v1	link
2024-03-22	Cartoon Hallucinations Detection: Pose-aware In Context Visual Learning	Bumsoo Kim et.al.	2403.15048v1	null
2024-03-22	Trajectory Regularization Enhances Self-Supervised Geometric Representation	Jiayun Wang et.al.	2403.14973v1	link
2024-03-21	VURF: A General-purpose Reasoning and Self-refinement Framework for Video Understanding	Ahmad Mahmood et.al.	2403.14743v1	link
2024-03-21	Visibility-Aware Keypoint Localization for 6DoF Object Pose Estimation	Ruyi Lian et.al.	2403.14559v1	null
2024-03-23	Exploring 3D Human Pose Estimation and Forecasting from the Robot's Perspective: The HARPER Dataset	Andrea Avogaro et.al.	2403.14447v2	null
2024-03-21	Evaluation and Deployment of LiDAR-based Place Recognition in Dense Forests	Haedam Oh et.al.	2403.14326v1	null
2024-03-21	Zero123-6D: Zero-shot Novel View Synthesis for RGB Category-level 6D Pose Estimation	Francesco Di Felice et.al.	2403.14279v1	null
2024-03-20	DVMNet: Computing Relative Pose for Unseen Objects Beyond Hypotheses	Chen Zhao et.al.	2403.13683v1	link
2024-03-20	Meta-Point Learning and Refining for Category-Agnostic Pose Estimation	Junjie Chen et.al.	2403.13647v1	link
2024-03-20	Advancing 6D Pose Estimation in Augmented Reality -- Overcoming Projection Ambiguity with Uncontrolled Imagery	Mayura Manawadu et.al.	2403.13434v1	null
2024-03-20	DOR3D-Net: Dense Ordinal Regression Network for 3D Hand Pose Estimation	Yamin Mao et.al.	2403.13405v1	null
2024-03-20	ManiPose: A Comprehensive Benchmark for Pose-aware Object Manipulation in Robotics	Qiaojun Yu et.al.	2403.13365v1	null
2024-03-20	MULAN-WC: Multi-Robot Localization Uncertainty-aware Active NeRF with Wireless Coordination	Weiying Wang et.al.	2403.13348v1	null
2024-03-19	FaceXFormer: A Unified Transformer for Facial Analysis	Kartik Narayan et.al.	2403.12960v1	link
2024-03-19	WHAC: World-grounded Humans and Cameras	Wanqi Yin et.al.	2403.12959v1	link
2024-03-19	Diffusion-Driven Self-Supervised Learning for Shape Reconstruction and Pose Estimation	Jingtao Sun et.al.	2403.12728v1	link
2024-03-19	IFFNeRF: Initialisation Free and Fast 6DoF pose estimation from a single image and a NeRF model	Matteo Bortolon et.al.	2403.12682v1	null
2024-03-19	In-Hand Following of Deformable Linear Objects Using Dexterous Fingers with Tactile Sensing	Mingrui Yu et.al.	2403.12676v1	null
2024-03-19	Self-learning Canonical Space for Multi-view 3D Human Pose Estimation	Xiaoben Li et.al.	2403.12440v1	null
2024-03-20	Human Mesh Recovery from Arbitrary Multi-view Images	Xiaoben Li et.al.	2403.12434v2	link
2024-03-19	XPose: eXplainable Human Pose Estimation	Luyu Qiu et.al.	2403.12370v1	null
2024-03-18	HOIDiffusion: Generating Realistic 3D Hand-Object Interaction Data	Mengqi Zhang et.al.	2403.12011v1	null
2024-03-18	Normalized Validity Scores for DNNs in Regression based Eye Feature Extraction	Wolfgang Fuhl et.al.	2403.11665v1	null
2024-03-18	An Accurate and Real-time Relative Pose Estimation from Triple Point-line Images by Decoupling Rotation and Translation	Zewen Xu et.al.	2403.11639v1	null
2024-03-18	LoRA-Composer: Leveraging Low-Rank Adaptation for Multi-Concept Customization in Training-Free Diffusion Models	Yang Yang et.al.	2403.11627v1	link
2024-03-18	GenFlow: Generalizable Recurrent Flow for 6D Pose Refinement of Novel Objects	Sungphill Moon et.al.	2403.11510v1	null
2024-03-17	A Dual-Augmentor Framework for Domain Generalization in 3D Human Pose Estimation	Qucheng Peng et.al.	2403.11310v1	link
2024-03-17	Compact 3D Gaussian Splatting For Dense Visual SLAM	Tianchen Deng et.al.	2403.11247v1	link
2024-03-16	Robotic Task Success Evaluation Under Multi-modal Non-Parametric Object Pose Uncertainty	Lakshadeep Naik et.al.	2403.10874v1	null
2024-03-16	DPPE: Dense Pose Estimation in a Plenoxels Environment using Gradient Approximation	Christopher Kolios et.al.	2403.10773v1	null
2024-03-15	GS-Pose: Cascaded Framework for Generalizable Segmentation-based 6D Object Pose Estimation	Dingding Cai et.al.	2403.10683v1	null
2024-03-15	CLOSURE: Fast Quantification of Pose Uncertainty Sets	Yihuai Gao et.al.	2403.09990v1	null
2024-03-14	ThermoHands: A Benchmark for 3D Hand Pose Estimation from Egocentric Thermal Image	Fangqiang Ding et.al.	2403.09871v1	null
2024-03-14	BOP Challenge 2023 on Detection, Segmentation and Pose Estimation of Seen and Unseen Rigid Objects	Tomas Hodan et.al.	2403.09799v1	null
2024-03-14	Scalable Autonomous Drone Flight in the Forest with Visual-Inertial SLAM and Dense Submaps Built without LiDAR	Sebastián Barbas Laina et.al.	2403.09596v1	null
2024-03-14	Improving Real-Time Omnidirectional 3D Multi-Person Human Pose Estimation with People Matching and Unsupervised 2D-3D Lifting	Pawel Knap et.al.	2403.09437v1	null
2024-03-14	LM2D: Lyrics- and Music-Driven Dance Synthesis	Wenjie Yin et.al.	2403.09407v1	null
2024-03-14	SD-Net: Symmetric-Aware Keypoint Prediction and Domain Adaptation for 6D Pose Estimation In Bin-picking Scenarios	Ding-Tao Huang et.al.	2403.09317v1	link
2024-03-14	MOTPose: Multi-object 6D Pose Estimation for Dynamic Video Sequences using Attention-based Temporal Fusion	Arul Selvam Periyasamy et.al.	2403.09309v1	null
2024-03-13	Data Augmentation in Human-Centric Vision	Wentao Jiang et.al.	2403.08650v1	null
2024-03-15	PRAGO: Differentiable Multi-View Pose Optimization From Objectness Detections	Matteo Taiana et.al.	2403.08586v2	null
2024-03-13	NeRF-Supervised Feature Point Detection and Description	Ali Youssef et.al.	2403.08156v1	link
2024-03-12	Q-SLAM: Quadric Representations for Monocular SLAM	Chensheng Peng et.al.	2403.08125v1	null
2024-03-12	MRC-Net: 6-DoF Pose Estimation with MultiScale Residual Correlation	Yuelong Li et.al.	2403.08019v1	link
2024-03-12	Uncertainty Quantification with Deep Ensembles for 6D Object Pose Estimation	Kira Wursthorn et.al.	2403.07741v1	null
2024-03-12	Adaptive Fusion of Single-View and Multi-View Depth for Autonomous Driving	JunDa Cheng et.al.	2403.07535v1	link
2024-03-12	Category-Agnostic Pose Estimation for Point Clouds	Bowen Liu et.al.	2403.07437v1	null
2024-03-12	Monocular Microscope to CT Registration using Pose Estimation of the Incus for Augmented Reality Cochlear Implant Surgery	Yike Zhang et.al.	2403.07219v1	null
2024-03-11	Real-Time Simulated Avatar from Head-Mounted Sensors	Zhengyi Luo et.al.	2403.06862v1	null
2024-03-11	Transformer-based Fusion of 2D-pose and Spatio-temporal Embeddings for Distracted Driver Action Recognition	Erkut Akdag et.al.	2403.06577v1	null
2024-03-10	Platypose: Calibrated Zero-Shot Multi-Hypothesis 3D Human Motion Estimation	Paweł A. Pierzchlewicz et.al.	2403.06164v1	link
2024-03-10	Diffusion Models Trained with Large Data Are Transferable Visual Models	Guangkai Xu et.al.	2403.06090v1	link
2024-03-08	Prepared for the Worst: A Learning-Based Adversarial Attack for Resilience Analysis of the ICP Algorithm	Ziyu Zhang et.al.	2403.05666v1	null
2024-03-11	Exploiting polar symmetry in designing equivariant observers for vision-based motion estimation	Tarek Bouazza et.al.	2403.05450v2	null
2024-03-07	Real-Time Planning Under Uncertainty for AUVs Using Virtual Maps	Ivana Collado-Gonzalez et.al.	2403.04936v1	null
2024-03-07	That's My Point: Compact Object-centric LiDAR Pose Estimation for Large-scale Outdoor Localisation	Georgi Pramatarov et.al.	2403.04755v1	null
2024-03-07	Disentangled Diffusion-Based 3D Human Pose Estimation with Hierarchical Spatial and Temporal Denoiser	Qingyuan Cai et.al.	2403.04444v1	link
2024-03-09	Single-to-Dual-View Adaptation for Egocentric 3D Hand Pose Estimation	Ruicong Liu et.al.	2403.04381v2	link
2024-03-05	FAR: Flexible, Accurate and Robust 6DoF Relative Camera Pose Estimation	Chris Rockwell et.al.	2403.03221v1	null
2024-03-05	NRDF: Neural Riemannian Distance Fields for Learning Articulated Pose Priors	Yannan He et.al.	2403.03122v1	null
2024-03-05	Improved LiDAR Odometry and Mapping using Deep Semantic Segmentation and Novel Outliers Detection	Mohamed Afifi et.al.	2403.03111v1	null
2024-03-05	Splat-Nav: Safe Real-Time Robot Navigation in Gaussian Splatting Maps	Timothy Chen et.al.	2403.02751v1	link
2024-03-04	PowerSkel: A Device-Free Framework Using CSI Signal for Human Skeleton Estimation in Power Station	Cunyi Yin et.al.	2403.01913v1	link
2024-03-04	A Simple Baseline for Efficient Hand Mesh Reconstruction	Zhishan Zhou et.al.	2403.01813v1	null
2024-03-03	MatchU: Matching Unseen Objects for 6D Pose Estimation from RGB-D Images	Junwen Huang et.al.	2403.01517v1	null
2024-03-02	Single-image camera calibration with model-free distortion correction	Katia Genovese et.al.	2403.01263v1	null
2024-03-02	Grid-based Fast and Structural Visual Odometry	Zhang Zhihe et.al.	2403.01110v1	null
2024-03-01	Optimal Robot Formations: Balancing Range-Based Observability and User-Defined Configurations	Syed Shabbir Ahmed et.al.	2403.00988v1	null
2024-03-04	TEXterity -- Tactile Extrinsic deXterity: Simultaneous Tactile Estimation and Control for Extrinsic Dexterity	Sangwoon Kim et.al.	2403.00049v2	null
2024-03-01	Graph Convolutional Neural Networks for Automated Echocardiography View Recognition: A Holistic Approach	Sarina Thomas et.al.	2402.19062v2	null
2024-02-29	Deep Learning for 3D Human Pose Estimation and Mesh Recovery: A Survey	Yang Liu et.al.	2402.18844v1	link
2024-02-28	Attention-Propagation Network for Egocentric Heatmap to 3D Pose Lifting	Taeho Kang et.al.	2402.18330v1	link
2024-02-28	Location-guided Head Pose Estimation for Fisheye Image	Bing Li et.al.	2402.18320v1	null
2024-02-28	NToP: NeRF-Powered Large-scale Dataset Generation for 2D and 3D Human Pose Estimation in Top-View Fisheye Images	Jingrui Yu et.al.	2402.18196v1	link
2024-02-28	Six-Point Method for Multi-Camera Systems with Reduced Solution Space	Banglei Guan et.al.	2402.18066v1	link
2024-02-27	Real-Time Estimation of Relative Pose for UAVs Using a Dual-Channel Feature Association	Zhaoying Wang et.al.	2402.17504v1	null
2024-02-26	HOISDF: Constraining 3D Hand-Object Pose Estimation with Global Signed Distance Fields	Haozhe Qi et.al.	2402.17062v1	link
2024-02-26	DRSI-Net: Dual-Residual Spatial Interaction Network for Multi-Person Pose Estimation	Shang Wu et.al.	2402.16640v1	null
2024-02-26	GEA: Reconstructing Expressive 3D Gaussian Avatar from Monocular Video	Xinqi Liu et.al.	2402.16607v1	null
2024-02-26	DreamUp3D: Object-Centric Generative Models for Single-View 3D Scene Understanding and Real-to-Sim Transfer	Yizhe Wu et.al.	2402.16308v1	null
2024-02-25	XAI-based gait analysis of patients walking with Knee-Ankle-Foot orthosis using video cameras	Arnav Mishra et.al.	2402.16175v1	null
2024-02-25	VOLoc: Visual Place Recognition by Querying Compressed Lidar Map	Xudong Cai et.al.	2402.15961v1	link
2024-02-24	CLIPose: Category-Level Object Pose Estimation with Pre-trained Vision-Language Knowledge	Xiao Lin et.al.	2402.15726v1	null
2024-02-23	Optimized Deployment of Deep Neural Networks for Visual Pose Estimation on Nano-drones	Matteo Risso et.al.	2402.15273v1	null
2024-02-22	Cameras as Rays: Pose Estimation via Ray Diffusion	Jason Y. Zhang et.al.	2402.14817v1	null
2024-02-22	S^2Former-OR: Single-Stage Bimodal Transformer for Scene Graph Generation in OR	Jialun Pei et.al.	2402.14461v1	link
2024-02-22	VLPose: Bridging the Domain Gap in Pose Estimation with Language-Vision Tuning	Jingyao Li et.al.	2402.14456v1	null
2024-02-22	Modeling 3D Infant Kinetics Using Adaptive Graph Convolutional Networks	Daniel Holmberg et.al.	2402.14400v1	link
2024-02-22	Secure Navigation using Landmark-based Localization in a GPS-denied Environment	Ganesh Sapkota et.al.	2402.14280v1	null
2024-02-21	SecurePose: Automated Face Blurring and Human Movement Kinematics Extraction from Videos Recorded in Clinical Settings	Rishabh Bajpai et.al.	2402.14143v1	null
2024-02-21	High-throughput Visual Nano-drone to Nano-drone Relative Localization using Onboard Fully Convolutional Networks	Luca Crupi et.al.	2402.13756v1	null
2024-02-21	EffLoc: Lightweight Vision Transformer for Efficient 6-DOF Camera Relocalization	Zhendong Xiao et.al.	2402.13537v1	null
2024-02-20	DiffusionNOCS: Managing Symmetry and Uncertainty in Sim2Real Multi-Modal Category-level Pose Estimation	Takuya Ikeda et.al.	2402.12647v1	link
2024-02-19	Landmark-based Localization using Stereo Vision and Deep Learning in GPS-Denied Battlefield Environment	Ganesh Sapkota et.al.	2402.12551v1	null
2024-02-18	Boosting Semi-Supervised 2D Human Pose Estimation by Revisiting Data Augmentation and Consistency Training	Huayi Zhou et.al.	2402.11566v1	link
2024-02-17	Enhancing Surgical Performance in Cardiothoracic Surgery with Innovations from Computer Vision and Artificial Intelligence: A Narrative Review	Merryn D. Constable et.al.	2402.11288v1	null
2024-02-17	Dense Matchers for Dense Tracking	Tomáš Jelínek et.al.	2402.11287v1	null
2024-02-16	Occlusion Resilient 3D Human Pose Estimation	Soumava Kumar Roy et.al.	2402.11036v1	null
2024-02-16	3D Diffuser Actor: Policy Diffusion with 3D Scene Representations	Tsung-Wei Ke et.al.	2402.10885v1	null
2024-02-15	Lester: rotoscope animation through video object segmentation and tracking	Ruben Tous et.al.	2402.09883v1	link
2024-02-15	Foul prediction with estimated poses from soccer broadcast video	Jiale Fang et.al.	2402.09650v1	null
2024-02-16	IMUOptimize: A Data-Driven Approach to Optimal IMU Placement for Human Pose Estimation with Transformer Architecture	Varun Ramani et.al.	2402.08923v2	null
2024-02-13	Are Semi-Dense Detector-Free Methods Good at Matching Local Features?	Matthieu Vilain et.al.	2402.08671v1	null
2024-02-13	Gaussian-Sum Filter for Range-based 3D Relative Pose Estimation in the Presence of Ambiguities	Syed S. Ahmed et.al.	2402.08566v1	null
2024-02-13	Learning to Produce Semi-dense Correspondences for Visual Localization	Khang Truong Giang et.al.	2402.08359v1	link
2024-02-12	Extending 3D body pose estimation for robotic-assistive therapies of autistic children	Laura Santos et.al.	2402.08006v1	null
2024-02-12	GBOT: Graph-Based 3D Object Tracking for Augmented Reality-Assisted Assembly Guidance	Shiyu Li et.al.	2402.07677v1	link
2024-02-12	UAV-assisted Visual SLAM Generating Reconstructed 3D Scene Graphs in GPS-denied Environments	Ahmed Radwan et.al.	2402.07537v1	null
2024-02-09	Improving 2D-3D Dense Correspondences with Diffusion Models for 6D Object Pose Estimation	Peter Hönig et.al.	2402.06436v1	null
2024-02-08	Real-time Holistic Robot Pose Estimation with Unknown States	Shikun Ban et.al.	2402.05655v1	link
2024-02-08	Extending 6D Object Pose Estimators for Stereo Vision	Thomas Pöllabauer et.al.	2402.05610v1	null
2024-02-09	NCRF: Neural Contact Radiance Fields for Free-Viewpoint Rendering of Hand-Object Interaction	Zhongqun Zhang et.al.	2402.05532v2	null
2024-02-07	Detection and Pose Estimation of flat, Texture-less Industry Objects on HoloLens using synthetic Training	Thomas Pöllabauer et.al.	2402.04979v1	null
2024-02-07	4-Dimensional deformation part model for pose estimation using Kalman filter constraints	Enrique Martinez-Berti et.al.	2402.04953v1	null
2024-02-07	STAR: Shape-focused Texture Agnostic Representations for Improved Object Detection and 6D Pose Estimation	Peter Hönig et.al.	2402.04878v1	link
2024-02-05	A Computer Vision Based Approach for Stalking Detection Using a CNN-LSTM-MLP Hybrid Fusion Model	Murad Hasan et.al.	2402.03417v1	null
2024-02-05	SGS-SLAM: Semantic Gaussian Splatting For Neural Dense SLAM	Mingrui Li et.al.	2402.03246v1	link
2024-02-05	Extreme Two-View Geometry From Object Poses with Diffusion Models	Yujing Sun et.al.	2402.02800v1	link
2024-02-04	Uncertainty-Aware Testing-Time Optimization for 3D Human Pose Estimation	Ti Wang et.al.	2402.02339v1	null
2024-02-01	mmID: High-Resolution mmWave Imaging for Human Identification	Sakila S. Jayaweera et.al.	2402.00996v1	null
2024-02-01	In-Bed Pose Estimation: A Review	Ziya Ata Yazıcı et.al.	2402.00700v1	null
2024-02-01	WayFASTER: a Self-Supervised Traversability Prediction for Increased Navigation Awareness	Mateus Valverde Gasparino et.al.	2402.00683v1	link
2024-02-02	CMRNext: Camera to LiDAR Matching in the Wild for Localization and Extrinsic Calibration	Daniele Cattaneo et.al.	2402.00129v2	null
2024-01-31	Improved Scene Landmark Detection for Camera Localization	Tien Do et.al.	2401.18083v1	link
2024-01-30	Navigating the Unknown: Uncertainty-Aware Compute-in-Memory Autonomy of Edge Robotics	Nastaran Darabi et.al.	2401.17481v1	null
2024-01-30	MESA: Matching Everything by Segmenting Anything	Yesheng Zhang et.al.	2401.16741v1	null
2024-01-30	Towards Precise 3D Human Pose Estimation with Multi-Perspective Spatial-Temporal Relational Transformers	Jianbin Jiao et.al.	2401.16700v1	link
2024-01-29	Leveraging Positional Encoding for Robust Multi-Reference-Based Object 6D Pose Estimation	Jaewoo Park et.al.	2401.16284v1	null
2024-01-29	Reconstructing Close Human Interactions from Multiple Views	Qing Shuai et.al.	2401.16173v1	link
2024-01-28	Multi-Person 3D Pose Estimation from Multi-View Uncalibrated Depth Cameras	Yu-Jhe Li et.al.	2401.15616v1	null
2024-01-30	Multi-Robot Relative Pose Estimation in SE(2) with Observability Analysis: A Comparison of Extended Kalman Filtering and Robust Pose Graph Optimization	Kihoon Shin et.al.	2401.15313v2	null
2024-01-26	Adaptive Deep Learning for Efficient Visual Pose Estimation aboard Ultra-low-power Nano-drones	Beatrice Alessandra Motetti et.al.	2401.15236v1	null
2024-01-26	SimpleEgo: Predicting Probabilistic Body Pose from Egocentric Cameras	Hanz Cuevas-Velasquez et.al.	2401.14785v1	null
2024-01-24	Synthetic data enables faster annotation and robust segmentation for multi-object grasping in clutter	Dongmyoung Lee et.al.	2401.13405v1	null
2024-01-24	Linear Relative Pose Estimation Founded on Pose-only Imaging Geometry	Qi Cai et.al.	2401.13357v1	null
2024-01-23	SemanticSLAM: Learning based Semantic Map Construction and Robust Camera Localization	Mingyang Li et.al.	2401.13076v1	link
2024-01-24	RGBD Objects in the Wild: Scaling Real-World 3D Object Learning from RGB-D Videos	Hongchi Xia et.al.	2401.12592v2	null
2024-01-26	MobileARLoc: On-device Robust Absolute Localisation for Pervasive Markerless Mobile AR	Changkun Liu et.al.	2401.11511v2	null
2024-01-19	SCENES: Subpixel Correspondence Estimation With Epipolar Supervision	Dominik A. Kloepfer et.al.	2401.10886v1	null
2024-01-19	Source-Free and Image-Only Unsupervised Domain Adaptation for Category Level Object Pose Estimation	Prakhar Kaushik et.al.	2401.10848v1	null
2024-01-22	TEXterity: Tactile Extrinsic deXterity	Antonia Bronars et.al.	2401.10230v2	null
2024-01-18	Exploring Latent Cross-Channel Embedding for Accurate 3D Human Pose Reconstruction in a Diffusion Framework	Junkun Jiang et.al.	2401.09836v1	link
2024-01-17	DK-SLAM: Monocular Visual SLAM with Deep Keypoints Adaptive Learning, Tracking and Loop-Closing	Hao Qu et.al.	2401.09160v1	null
2024-01-17	PIN-SLAM: LiDAR SLAM Using a Point-Based Implicit Neural Representation for Achieving Global Map Consistency	Yue Pan et.al.	2401.09101v1	link
2024-01-16	AdaSem: Adaptive Goal-Oriented Semantic Communications for End-to-End Camera Relocalization	Qi Liao et.al.	2401.08360v1	null
2024-01-16	S3M: Semantic Segmentation Sparse Mapping for UAVs with RGB-D Camera	Thanh Nguyen Canh et.al.	2401.08134v1	null
2024-01-15	Collaboratively Self-supervised Video Representation Learning for Action Recognition	Jie Zhang et.al.	2401.07584v1	null
2024-01-14	3D Landmark Detection on Human Point Clouds: A Benchmark and A Dual Cascade Point Transformer Framework	Fan Zhang et.al.	2401.07251v1	null
2024-01-11	On the representation and methodology for wide and short range head pose estimation	Alejandro Cobo et.al.	2401.05807v1	link
2024-01-10	Structure from Duplicates: Neural Inverse Graphics from a Pile of Objects	Tianhang Cheng et.al.	2401.05236v1	link
2024-01-10	Video-based Automatic Lameness Detection of Dairy Cows using Pose Estimation and Multiple Locomotion Traits	Helena Russello et.al.	2401.05202v1	null
2024-01-10	Diffusion-based Pose Refinement and Multi-hypothesis Generation for 3D Human Pose Estimation	Hongbo Kang et.al.	2401.04921v1	link
2024-01-15	Towards Real-World Aerial Vision Guidance with Categorical 6D Pose Tracker	Jingtao Sun et.al.	2401.04377v2	link
2024-01-07	RHOBIN Challenge: Reconstruction of Human Object Interaction	Xianghui Xie et.al.	2401.04143v1	null
2024-01-08	D3PRefiner: A Diffusion-based Denoise Method for 3D Human Pose Refinement	Danqi Yan et.al.	2401.03914v1	null
2024-01-07	Big Data and Deep Learning in Smart Cities: A Comprehensive Dataset for AI-Driven Traffic Accident Detection and Computer Vision Systems	Victor Adewopo et.al.	2401.03587v1	null
2024-01-04	Survey of 3D Human Body Pose and Shape Estimation Methods for Contemporary Dance Applications	Darshan Venkatrayappa et.al.	2401.02383v1	null
2024-01-04	Fit-NGP: Fitting Object Models to Neural Graphics Primitives	Marwan Taher et.al.	2401.02357v1	null
2024-01-04	PEGASUS: Physically Enhanced Gaussian Splatting Simulation System for 6DOF Object Pose Dataset Generation	Lukas Meyer et.al.	2401.02281v1	link
2024-01-03	Real-Time Human Fall Detection using a Lightweight Pose Estimation Technique	Ekram Alam et.al.	2401.01587v1	link
2024-01-05	PLE-SLAM: A Visual-Inertial SLAM Based on Point-Line Features and Efficient IMU Initialization	Jiaming He et.al.	2401.01081v2	link
2023-12-30	3D Human Pose Perception from Egocentric Stereo Videos	Hiroyasu Akada et.al.	2401.00889v1	null
2024-01-01	Geometry Depth Consistency in RGBD Relative Pose Estimation	Sourav Kumar et.al.	2401.00639v1	null
2023-12-30	A comprehensive framework for occluded human pose estimation	Linhao Xu et.al.	2401.00155v1	null
2024-01-02	6D-Diff: A Keypoint Diffusion Framework for 6D Object Pose Estimation	Li Xu et.al.	2401.00029v2	null
2023-12-29	MURP: Multi-Agent Ultra-Wideband Relative Pose Estimation with Constrained Communications in 3D Environments	Andrew Fishberg et.al.	2312.17731v1	link
2023-12-28	iFusion: Inverting Diffusion for Pose-Free Reconstruction from Sparse Views	Chin-Hsuan Wu et.al.	2312.17250v1	link
2023-12-28	EvPlug: Learn a Plug-and-Play Module for Event and Image Fusion	Jianping Jiang et.al.	2312.16933v1	null
2023-12-28	SR-LIVO: LiDAR-Inertial-Visual Odometry and Mapping with Sweep Reconstruction	Zikang Yuan et.al.	2312.16800v1	link
2023-12-28	L-LO: Enhancing Pose Estimation Precision via a Landmark-Based LiDAR Odometry	Feiya Li et.al.	2312.16787v1	null
2023-12-27	HMP: Hand Motion Priors for Pose and Shape Estimation from Video	Enes Duran et.al.	2312.16737v1	null
2023-12-27	Camera calibration for the surround-view system: a benchmark and dataset	L Qin et.al.	2312.16499v1	null
2023-12-24	TEMP3D: Temporally Continuous 3D Human Pose Estimation Under Occlusions	Rohit Lal et.al.	2312.16221v1	link
2023-12-26	Graph Context Transformation Learning for Progressive Correspondence Pruning	Junwen Guo et.al.	2312.15971v1	link
2023-12-25	Lifting by Image -- Leveraging Image Cues for Accurate 3D Human Pose Estimation	Feng Zhou et.al.	2312.15636v1	null
2023-12-25	APTv2: Benchmarking Animal Pose Estimation and Tracking with a Large-scale Dataset and Beyond	Yuxiang Yang et.al.	2312.15612v1	link
2023-12-23	PACE: Pose Annotations in Cluttered Environments	Yang You et.al.	2312.15130v1	link
2023-12-22	PoseGen: Learning to Generate 3D Human Pose Dataset with NeRF	Mohsen Gholami et.al.	2312.14915v1	link
2023-12-22	Harnessing Diffusion Models for Visual Perception with Meta Prompts	Qiang Wan et.al.	2312.14733v1	link
2023-12-22	Pola4All: survey of polarimetric applications and an open-source toolkit to analyze polarization	Joaquin Rodriguez et.al.	2312.14697v1	link
2023-12-22	PoseViNet: Distracted Driver Action Recognition Framework Using Multi-View Pose Estimation and Vision Transformer	Neha Sengar et.al.	2312.14577v1	null
2023-12-22	Scalable 3D Reconstruction From Single Particle X-Ray Diffraction Images Based on Online Machine Learning	Jay Shenoy et.al.	2312.14432v1	null
2023-12-21	3D Pose Estimation of Two Interacting Hands from a Monocular Event Camera	Christen Millerdurai et.al.	2312.14157v1	null
2023-12-21	DUSt3R: Geometric 3D Vision Made Easy	Shuzhe Wang et.al.	2312.14132v1	link
2023-12-20	NeRF-VO: Real-Time Sparse Visual Odometry with Neural Radiance Fields	Jens Naumann et.al.	2312.13471v1	null
2023-12-20	Brain-Inspired Visual Odometry: Balancing Speed and Interpretability through a System of Systems Approach	Habib Boloorchi Tabrizi et.al.	2312.13162v1	link
2023-12-18	Unified framework for diffusion generative models in SO(3): applications in computer vision and astrophysics	Yesukhei Jagvaral et.al.	2312.11707v1	null
2023-12-18	Underwater Robot Pose Estimation Using Acoustic Methods and Intermittent Position Measurements at the Surface	Vicu-Mihalis Maer et.al.	2312.11401v1	null
2023-12-17	SHaRPose: Sparse High-Resolution Representation for Human Pose Estimation	Xiaoqi An et.al.	2312.10758v1	link
2023-12-17	PNeRFLoc: Visual Localization with Point-based Neural Radiance Fields	Boming Zhao et.al.	2312.10649v1	null
2023-12-15	SoloPose: One-Shot Kinematic 3D Human Pose Estimation with Video Data Augmentation	David C. Jeong et.al.	2312.10195v1	link
2023-12-14	iComMa: Inverting 3D Gaussians Splatting for Camera Pose Estimation via Comparing and Matching	Yuan Sun et.al.	2312.09031v1	null
2023-12-14	Scene 3-D Reconstruction System in Scattering Medium	Zhuoyifan Zhang et.al.	2312.09005v1	null
2023-12-14	CattleEyeView: A Multi-task Top-down View Cattle Dataset for Smarter Precision Livestock Farming	Kian Eng Ong et.al.	2312.08764v1	link
2023-12-20	PnP for Two-Dimensional Pose Estimation	Joshua Wang et.al.	2312.08488v2	link
2023-12-13	Pose and shear-based tactile servoing	John Lloyd et.al.	2312.08411v1	null
2023-12-13	FoundationPose: Unified 6D Pose Estimation and Tracking of Novel Objects	Bowen Wen et.al.	2312.08344v1	link
2023-12-13	Efficient Multi-Object Pose Estimation using Multi-Resolution Deformable Attention and Query Aggregation	Arul Selvam Periyasamy et.al.	2312.08268v1	null
2023-12-13	CenterGrasp: Object-Aware Implicit Representation Learning for Simultaneous Shape Reconstruction and 6-DoF Grasp Estimation	Eugenio Chisari et.al.	2312.08240v1	null
2023-12-13	C-BEV: Contrastive Bird's Eye View Training for Cross-View Image Retrieval and 3-DoF Pose Estimation	Florian Fervers et.al.	2312.08060v1	null
2023-12-13	Three-Filters-to-Normal+: Revisiting Discontinuity Discrimination in Depth-to-Normal Translation	Jingwei Yang et.al.	2312.07964v1	null
2023-12-13	Diffusion Models Enable Zero-Shot Pose Estimation for Lower-Limb Prosthetic Users	Tianxun Zhou et.al.	2312.07854v1	null
2023-12-12	RTMO: Towards High-Performance One-Stage Real-Time Multi-Person Pose Estimation	Peng Lu et.al.	2312.07526v1	link
2023-12-12	COLMAP-Free 3D Gaussian Splatting	Yang Fu et.al.	2312.07504v1	link
2023-12-12	RMS: Redundancy-Minimizing Point Cloud Sampling for Real-Time Pose Estimation in Degenerated Environments	Pavel Petracek et.al.	2312.07337v1	link
2023-12-12	Unifying Correspondence, Pose and NeRF for Pose-Free Novel View Synthesis from Stereo Pairs	Sunghwan Hong et.al.	2312.07246v1	link
2023-12-12	Mask as Supervision: Leveraging Unified Mask Information for Unsupervised 3D Pose Estimation	Yuchen Yang et.al.	2312.07051v1	link
2023-12-12	Towards Enhanced Human Activity Recognition through Natural Language Generation and Pose Estimation	Nikhil Kashyap et.al.	2312.06965v1	null
2023-12-12	Exploring Novel Object Recognition and Spontaneous Location Recognition Machine Learning Analysis Techniques in Alzheimer's Mice	Soham Bafana et.al.	2312.06914v1	link
2023-12-11	Keypoint-based Stereophotoclinometry for Characterizing and Navigating Small Bodies: A Factor Graph Approach	Travis Driver et.al.	2312.06865v1	link
2023-12-11	Improving the Robustness of 3D Human Pose Estimation: A Benchmark and Learning from Noisy Input	Trung-Hieu Hoang et.al.	2312.06797v1	null
2023-12-11	3D Hand Pose Estimation in Egocentric Images in the Wild	Aditya Prakash et.al.	2312.06583v1	null
2023-12-11	PointVoxel: A Simple and Effective Pipeline for Multi-View Multi-Modal 3D Human Pose Estimation	Zhiyu Pan et.al.	2312.06409v1	null
2023-12-11	ManiPose: Manifold-Constrained Multi-Hypothesis 3D Human Pose Estimation	Cédric Rommel et.al.	2312.06386v1	link
2023-12-10	From Correspondences to Pose: Non-minimal Certifiably Optimal Relative Pose without Disambiguation	Javier Tirado-Garín et.al.	2312.05995v1	link
2023-12-09	You Only Learn One Query: Learning Unified Human Query for Single-Stage Multi-Person Multi-Task Human-Centric Perception	Sheng Jin et.al.	2312.05525v1	link
2023-12-07	Image and AIS Data Fusion Technique for Maritime Computer Vision Applications	Emre Gülsoylu et.al.	2312.05270v1	link
2023-12-07	Correspondences of the Third Kind: Camera Pose Estimation from Object Reflection	Kohei Yamashita et.al.	2312.04527v1	null
2023-12-07	Detecting and Restoring Non-Standard Hands in Stable Diffusion Generated Images	Yiqun Zhang et.al.	2312.04236v1	null
2023-12-06	Skeleton-in-Context: Unified Skeleton Sequence Modeling with In-Context Learning	Xinshun Wang et.al.	2312.03703v1	link
2023-12-06	Cooperative Probabilistic Trajectory Forecasting under Occlusion	Anshul Nayak et.al.	2312.03296v1	null
2023-12-05	A Unified Simulation Framework for Visual and Behavioral Fidelity in Crowd Analysis	Niccolò Bisagno et.al.	2312.02613v1	null
2023-12-05	6D Assembly Pose Estimation by Point Cloud Registration for Robot Manipulation	K. Samarawickrama et.al.	2312.02593v1	link
2023-12-05	PolyFit: A Peg-in-hole Assembly Framework for Unseen Polygon Shapes via Sim-to-real Adaptation	Geonhyup Lee et.al.	2312.02531v1	null
2023-12-04	GenEM: Physics-Informed Generative Cryo-Electron Microscopy	Jiakai Zhang et.al.	2312.02235v1	null
2023-12-02	Dynamic Inertial Poser (DynaIP): Part-Based Motion Dynamics Learning for Enhanced Human Pose Estimation with Sparse Inertial Sensors	Yu Zhang et.al.	2312.02196v1	link
2023-12-04	iMatching: Imperative Correspondence Learning	Zitong Zhan et.al.	2312.02141v1	link
2023-12-04	SplaTAM: Splat, Track & Map 3D Gaussians for Dense RGB-D SLAM	Nikhil Keetha et.al.	2312.02126v1	link
2023-12-04	Disentangled Interaction Representation for One-Stage Human-Object Interaction Detection	Xubin Zhong et.al.	2312.01713v1	null
2023-12-05	Hulk: A Universal Knowledge Translator for Human-Centric Tasks	Yizhou Wang et.al.	2312.01697v2	link
2023-12-04	Multi-View Person Matching and 3D Pose Estimation with Arbitrary Uncalibrated Camera Networks	Yan Xu et.al.	2312.01561v1	null
2023-12-01	Object 6D pose estimation meets zero-shot learning	Andrea Caraffa et.al.	2312.00947v1	null
2023-12-01	Open-vocabulary object 6D pose estimation	Jaime Corsetti et.al.	2312.00690v1	null
2023-12-01	Global Localization: Utilizing Relative Spatio-Temporal Geometric Constraints from Adjacent and Distant Cameras	Mohammad Altillawi et.al.	2312.00500v1	null
2023-12-01	Learning Unorthogonalized Matrices for Rotation Estimation	Kerui Gu et.al.	2312.00462v1	null
2023-11-30	PoseGPT: Chatting about 3D Human Pose	Yao Feng et.al.	2311.18836v1	null
2023-11-30	FoundPose: Unseen Object Pose Estimation with Foundation Features	Evin Pınar Örnek et.al.	2311.18809v1	null
2023-11-30	Pose Estimation and Tracking for ASIST	Ari Goodman et.al.	2311.18665v1	null
2023-11-29	A Stochastic-Geometrical Framework for Object Pose Estimation based on Mixture Models Avoiding the Correspondence Problem	Wolfgang Hoegele et.al.	2311.18107v1	null
2023-11-29	Pose Anything: A Graph-Based Approach for Category-Agnostic Pose Estimation	Or Hirschorn et.al.	2311.17891v1	link
2023-11-29	Cinematic Behavior Transfer via NeRF-based Differentiable Filming	Xuekun Jiang et.al.	2311.17754v1	null
2023-11-29	PViT-6D: Overclocking Vision Transformers for 6D Pose Estimation with Confidence-Level Prediction and Pose Tokens	Sebastian Stapf et.al.	2311.17504v1	null
2023-11-28	On the Calibration of Human Pose Estimation	Kerui Gu et.al.	2311.17105v1	null
2023-11-28	Telling Left from Right: Identifying Geometry-Aware Semantic Correspondence	Junyi Zhang et.al.	2311.17034v1	link
2023-11-28	HandyPriors: Physically Consistent Perception of Hand-Object Interactions with Differentiable Priors	Shutong Zhang et.al.	2311.16552v1	null
2023-11-28	Egocentric Whole-Body Motion Capture with FisheyeViT and Diffusion-Based Motion Refinement	Jian Wang et.al.	2311.16495v1	null
2023-11-24	UniHPE: Towards Unified Human Pose Estimation via Contrastive Learning	Zhongyu Jiang et.al.	2311.16477v1	null
2023-11-27	DiffSLVA: Harnessing Diffusion Models for Sign Language Video Anonymization	Zhaoyang Xia et.al.	2311.16060v1	link
2023-11-27	Uncertainty Quantification of Set-Membership Estimation in Control and Perception: Revisiting the Minimum Enclosing Ellipsoid	Yukai Tang et.al.	2311.15962v1	null
2023-11-27	Computer Vision for Carriers: PATRIOT	Ari Goodman et.al.	2311.15914v1	null
2023-11-27	SAM-6D: Segment Anything Model Meets Zero-Shot 6D Object Pose Estimation	Jiehong Lin et.al.	2311.15707v1	link
2023-11-24	RSB-Pose: Robust Short-Baseline Binocular 3D Human Pose Estimation with Occlusion Handling	Xiaoyue Wan et.al.	2311.14242v1	null
2023-11-23	Appearance-based gaze estimation enhanced with synthetic images using deep neural networks	Dmytro Herashchenko et.al.	2311.14175v1	link
2023-11-23	GigaPose: Fast and Robust Novel Object Pose Estimation via One Correspondence	Van Nguyen Nguyen et.al.	2311.14155v1	link
2023-11-23	GS-Pose: Category-Level Object Pose Estimation via Geometric and Semantic Correspondence	Pengyuan Wang et.al.	2311.13777v1	null
2023-11-22	HEViTPose: High-Efficiency Vision Transformer for Human Pose Estimation	Chengpeng Wu et.al.	2311.13615v1	link
2023-11-24	Calibration System and Algorithm Design for a Soft Hinged Micro Scanning Mirror with a Triaxial Hall Effect Sensor	Di Wang et.al.	2311.12778v2	null
2023-11-21	HiPose: Hierarchical Binary Surface Encoding and Correspondence Pruning for RGB-D 6DoF Object Pose Estimation	Yongliang Lin et.al.	2311.12588v1	link
2023-11-21	CoVOR-SLAM: Cooperative SLAM using Visual Odometry and Ranges for Multi-Robot Systems	Young-Hee Lee et.al.	2311.12580v1	null
2023-11-21	HCA-Net: Hierarchical Context Attention Network for Intervertebral Disc Semantic Labeling	Afshin Bozorgpour et.al.	2311.12486v1	link
2023-11-21	Two Views Are Better than One: Monocular 3D Pose Estimation with Multiview Consistency	Christian Keilstrup Ingwersen et.al.	2311.12421v1	null
2023-11-20	Fingerspelling PoseNet: Enhancing Fingerspelling Translation with Pose-Based Transformer Models	Pooya Fayyazsanavi et.al.	2311.12128v1	link
2023-11-20	Hourglass Tokenizer for Efficient Transformer-Based 3D Human Pose Estimation	Wenhao Li et.al.	2311.12028v1	link
2023-11-20	SniffyArt: The Dataset of Smelling Persons	Mathias Zinnen et.al.	2311.11888v1	null
2023-11-21	Robot Hand-Eye Calibration using Structure-from-Motion	Nicolas Andreff et.al.	2311.11808v2	null
2023-11-18	SecondPose: SE(3)-Consistent Dual-Stream Feature Fusion for Category-Level Pose Estimation	Yamei Chen et.al.	2311.11125v1	link
2023-11-18	Synthetic Data Generation for Bridging Sim2Real Gap in a Production Environment	Parth Rawal et.al.	2311.11039v1	null
2023-11-18	Multiple View Geometry Transformers for 3D Human Pose Estimation	Ziwei Liao et.al.	2311.10983v1	link
2023-11-18	Jenga Stacking Based on 6D Pose Estimation for Architectural Form Finding Process	Zixun Huang et.al.	2311.10918v1	null
2023-11-17	BiHRNet: A Binary high-resolution network for Human Pose Estimation	Zhicheng Zhang et.al.	2311.10296v1	null
2023-11-16	Match and Locate: low-frequency monocular odometry based on deep feature matching	Stepan Konev et.al.	2311.10034v1	null
2023-11-16	LIO-EKF: High Frequency LiDAR-Inertial Odometry using Extended Kalman Filters	Yibin Wu et.al.	2311.09887v1	link
2023-11-16	Improved TokenPose with Sparsity	Anning Li et.al.	2311.09653v1	null
2023-11-16	Pseudo-keypoints RKHS Learning for Self-supervised 6DoF Pose Estimation	Yangzheng Wu et.al.	2311.09500v1	null
2023-11-15	NormNet: Scale Normalization for 6D Pose Estimation in Stacked Scenarios	En-Te Lin et.al.	2311.09269v1	link
2023-11-15	Range-Visual-Inertial Sensor Fusion for Micro Aerial Vehicle Localization and Navigation	Abhishek Goudar et.al.	2311.09056v1	link
2023-11-14	LocaliseBot: Multi-view 3D object localisation with differentiable rendering for robot grasping	Sujal Vijayaraghavan et.al.	2311.08438v1	null
2023-11-13	SPHINX: The Joint Mixing of Weights, Tasks, and Visual Embeddings for Multi-modal Large Language Models	Ziyi Lin et.al.	2311.07575v1	link
2023-11-13	Bio-Inspired Grasping Controller for Sensorized 2-DoF Grippers	Luca Lach et.al.	2311.07257v1	link
2023-11-10	CESPED: a new benchmark for supervised particle pose estimation in Cryo-EM	Ruben Sanchez-Garcia et.al.	2311.06194v1	link
2023-11-10	2D Image head pose estimation via latent space regression under occlusion settings	José Celestino et.al.	2311.06038v1	link
2023-11-10	Robust Adversarial Attacks Detection for Deep Learning based Relative Pose Estimation for Space Rendezvous	Ziwei Wang et.al.	2311.05992v1	null
2023-11-10	A Practical Guide to Implementing Off-Axis Stereo Projection Using Existing Ray Tracing Libraries	Stefan Zellmann et.al.	2311.05887v1	link
2023-11-09	Visually Guided Model Predictive Robot Control via 6D Object Pose Localization and Tracking	Mederic Fourmy et.al.	2311.05344v1	null
2023-11-09	Spatial Attention-based Distribution Integration Network for Human Pose Estimation	Sihan Gao et.al.	2311.05323v1	null
2023-11-09	SPADES: A Realistic Spacecraft Pose Estimation Dataset using Event Sensing	Arunkumar Rathinam et.al.	2311.05310v1	null
2023-11-09	Differentiable Cloth Parameter Identification and State Estimation in Manipulation	Dongzhe Zheng et.al.	2311.05141v1	null
2023-11-09	POISE: Pose Guided Human Silhouette Extraction under Occlusions	Arindam Dutta et.al.	2311.05077v1	link
2023-11-08	Active Transfer Learning for Efficient Video-Specific Human Pose Estimation	Hiromu Taketsugu et.al.	2311.05041v1	link
2023-11-08	3D Pose Estimation of Tomato Peduncle Nodes using Deep Keypoint Detection and Point Cloud	Jianchao Ci et.al.	2311.04699v1	null
2023-11-09	Rethinking Human Pose Estimation for Autonomous Driving with 3D Event Representations	Xiaoting Yin et.al.	2311.04591v2	link
2023-11-08	Learning Robust Multi-Scale Representation for Neural Radiance Fields from Unposed Images	Nishant Jain et.al.	2311.04521v1	null
2023-11-08	PLV-IEKF: Consistent Visual-Inertial Odometry using Points, Lines, and Vanishing Points	Tong Hua et.al.	2311.04477v1	null
2023-11-08	UP-NeRF: Unconstrained Pose-Prior-Free Neural Radiance Fields	Injae Kim et.al.	2311.03784v2	link
2023-11-06	A Single 2D Pose with Context is Worth Hundreds for 3D Human Pose Estimation	Qitao Zhao et.al.	2311.03312v1	null
2023-11-06	Enabling In-Situ Resources Utilisation by leveraging collaborative robotics and astronaut-robot interaction	Silvia Romero-Azpitarte et.al.	2311.03146v1	null
2023-11-06	Simultaneous Time Synchronization and Mutual Localization for Multi-robot System	Xiangyong Wen et.al.	2311.02948v1	null
2023-11-06	Initialisation of Autonomous Aircraft Visual Inspection Systems via CNN-Based Camera Pose Estimation	Xueyan Oh et.al.	2311.02900v1	null
2023-11-06	Efficient, Self-Supervised Human Pose Estimation with Inductive Prior Tuning	Nobline Yoo et.al.	2311.02815v1	link
2023-11-03	Generating Unbiased Pseudo-labels via a Theoretically Guaranteed Chebyshev Constraint to Unify Semi-supervised Classification and Regression	Jiaqi Wu et.al.	2311.01782v1	link
2023-11-03	Modeling the Uncertainty with Maximum Discrepant Students for Semi-supervised 2D Pose Estimation	Jiaqi Wu et.al.	2311.01770v1	null
2023-11-02	Sim2Real Bilevel Adaptation for Object Surface Classification using Vision-Based Tactile Sensors	Gabriele M. Caddeo et.al.	2311.01380v1	link
2023-11-01	A Spatial-Temporal Transformer based Framework For Human Pose Assessment And Correction in Education Scenarios	Wenyang Hu et.al.	2311.00401v1	null
2023-10-31	HAP: Structure-Aware Masked Image Modeling for Human-Centric Perception	Junkun Yuan et.al.	2310.20695v1	link
2023-10-31	Pose-to-Motion: Cross-Domain Motion Retargeting with Pose Prior	Qingqing Zhao et.al.	2310.20249v1	null
2023-10-30	FetusMapV2: Enhanced Fetal Pose Estimation in 3D Ultrasound	Chaoyu Chen et.al.	2310.19293v1	null
2023-10-29	Distributed Nonlinear Filtering using Triangular Transport Maps	Daniel Grange et.al.	2310.19000v1	null
2023-10-29	TIC-TAC: A Framework To Learn And Evaluate Your Covariance	Megh Shukla et.al.	2310.18953v1	link
2023-10-29	Improving Multi-Person Pose Tracking with A Confidence Network	Zehua Fu et.al.	2310.18920v1	null
2023-10-29	HDMNet: A Hierarchical Matching Network with Double Attention for Large-scale Outdoor LiDAR Point Cloud Registration	Weiyi Xue et.al.	2310.18874v1	null
2023-10-28	Enhancing Grasping Performance of Novel Objects through an Improved Fine-Tuning Process	Xiao Hu et.al.	2310.18569v1	null
2023-10-27	ProcNet: Deep Predictive Coding Model for Robust-to-occlusion Visual Segmentation and Pose Estimation	Michael Zechmair et.al.	2310.18009v1	null
2023-10-26	Learning Extrinsic Dexterity with Parameterized Manipulation Primitives	Shih-Min Yang et.al.	2310.17785v1	null
2023-10-26	6-DoF Stability Field via Diffusion Models	Takuma Yoneda et.al.	2310.17649v1	null
2023-10-26	SE(3) Diffusion Model-based Point Cloud Registration for Robust 6D Object Pose Estimation	Haobo Jiang et.al.	2310.17359v1	null
2023-10-26	Automatic Edge Error Judgment in Figure Skating Using 3D Pose Estimation from a Monocular Camera and IMUs	Ryota Tanaka et.al.	2310.17193v1	link
2023-10-25	Real-time 6-DoF Pose Estimation by an Event-based Camera using Active LED Markers	Gerald Ebmer et.al.	2310.16618v1	null
2023-10-25	ChimpACT: A Longitudinal Dataset for Understanding Chimpanzee Behaviors	Xiaoxuan Ma et.al.	2310.16447v1	link
2023-10-25	MotionAGFormer: Enhancing 3D Human Pose Estimation with a Transformer-GCNFormer Network	Soroush Mehraban et.al.	2310.16288v1	link
2023-10-25	TransPose: 6D Object Pose Estimation with Geometry-Aware Transformer	Xiao Lin et.al.	2310.16279v1	null
2023-10-23	Converting Depth Images and Point Clouds for Feature-based Pose Estimation	Robert Lösch et.al.	2310.14924v1	link
2023-10-23	Object Pose Estimation Annotation Pipeline for Multi-view Monocular Camera Systems in Industrial Settings	Hazem Youssef et.al.	2310.14914v1	null
2023-10-23	Player Re-Identification Using Body Part Appearances	Mahesh Bhosale et.al.	2310.14469v1	null
2023-10-20	LanPose: Language-Instructed 6D Object Pose Estimation for Robotic Assembly	Bowen Fu et.al.	2310.13819v1	null
2023-10-20	FMRT: Learning Accurate Feature Matching with Reconciliatory Transformer	Xinyu Zhang et.al.	2310.13605v1	null
2023-10-20	ColAG: A Collaborative Air-Ground Framework for Perception-Limited UGVs' Navigation	Zhehan Li et.al.	2310.13324v1	link
2023-10-20	CylinderTag: An Accurate and Flexible Marker for Cylinder-Shape Objects Pose Estimation Based on Projective Invariants	Shaoan Wang et.al.	2310.13320v1	link
2023-10-19	Human Pose-based Estimation, Tracking and Action Recognition with Deep Learning: A Survey	Lijuan Zhou et.al.	2310.13039v1	null
2023-10-19	FSD: Fast Self-Supervised Single RGB-D to Categorical 3D Objects	Mayank Lunayach et.al.	2310.12974v1	link
2023-10-18	Mesh Represented Recycle Learning for 3D Hand Pose and Mesh Estimation	Bosang Kim et.al.	2310.12189v1	null
2023-10-18	One-Shot Imitation Learning: A Pose Estimation Perspective	Pietro Vitiello et.al.	2310.12077v1	null
2023-10-18	ShapeGraFormer: GraFormer-Based Network for Hand-Object Reconstruction from a Single Depth Map	Ahmed Tawfik Aboukhadra et.al.	2310.11811v1	null
2023-10-17	Holistic Parking Slot Detection with Polygon-Shaped Representations	Lihao Wang et.al.	2310.11629v1	null
2023-10-17	Diver Interest via Pointing in Three Dimensions: 3D Pointing Reconstruction for Diver-AUV Communication	Chelsey Edge et.al.	2310.11536v1	null
2023-10-18	AP$n$P: A Less-constrained P$n$P Solver for Pose Estimation with Unknown Anisotropic Scaling or Focal Lengths	Jiaxin Wei et.al.	2310.09982v2	link
2023-10-15	Tabletop Transparent Scene Reconstruction via Epipolar-Guided Optical Flow with Monocular Depth Completion Prior	Xiaotong Chen et.al.	2310.09956v1	null
2023-10-15	Socially reactive navigation models for mobile robots in dynamic environments	Ricarte Ribeiro et.al.	2310.09916v1	link
2023-10-15	MoEmo Vision Transformer: Integrating Cross-Attention and Movement Vectors in 3D Pose Estimation for HRI Emotion Detection	David C. Jeong et.al.	2310.09757v1	link
2023-10-16	IMU Preintegration for Multi-Robot Systems in the Presence of Bias and Communication Constraints	Mohammed Ayman Shalaby et.al.	2310.08686v2	null
2023-10-12	Towards Design and Development of an ArUco Markers-Based Quantitative Surface Tactile Sensor	Ozdemir Can Kara et.al.	2310.08398v1	null
2023-10-12	Multimodal Active Measurement for Human Mesh Recovery in Close Proximity	Takahiro Maeda et.al.	2310.08116v1	link
2023-10-12	X-HRNet: Towards Lightweight Human Pose Estimation with Spatially Unidimensional Self-Attention	Yixuan Zhou et.al.	2310.08042v1	link
2023-10-12	PoRF: Pose Residual Field for Accurate Neural Surface Reconstruction	Jia-Wang Bian et.al.	2310.07449v2	link
2023-10-11	SAGE-ICP: Semantic Information-Assisted ICP	Jiaming Cui et.al.	2310.07237v1	link
2023-10-11	DeepSimHO: Stable Pose Estimation for Hand-Object Interaction via Physics Simulation	Rong Wang et.al.	2310.07206v1	link
2023-10-12	FABind: Fast and Accurate Protein-Ligand Binding	Qizhi Pei et.al.	2310.06763v2	link
2023-10-10	EARL: Eye-on-Hand Reinforcement Learner for Dynamic Grasping with Active Pose Estimation	Baichuan Huang et.al.	2310.06751v1	null
2023-10-09	Augmenting Vision-Based Human Pose Estimation with Rotation Matrix	Milad Vazan et.al.	2310.06068v1	null
2023-10-07	Federated Self-Supervised Learning of Monocular Depth Estimators for Autonomous Vehicles	Elton F. de S. Soares et.al.	2310.04837v1	null
2023-10-10	1st Place Solution of Egocentric 3D Hand Pose Estimation Challenge 2023 Technical Report: A Concise Pipeline for Egocentric Hand Pose Reconstruction	Zhishan Zhou et.al.	2310.04769v2	null
2023-10-06	SwimXYZ: A large-scale dataset of synthetic swimming motions and videos	Fiche Guénolé et.al.	2310.04360v1	null
2023-10-05	BID-NeRF: RGB-D image pose estimation with inverted Neural Radiance Fields	Ágoston István Csehi et.al.	2310.03563v1	null
2023-10-05	3D-Aware Hypothesis & Verification for Generalizable Relative Object Pose Estimation	Chen Zhao et.al.	2310.03534v1	null
2023-10-05	RGBManip: Monocular Image-based Robotic Manipulation through Active Object Pose Estimation	Boshi An et.al.	2310.03478v1	null
2023-10-05	Cyber Physical System Information Collection: Robot Location and Navigation Method Based on QR Code	Hongwei Li et.al.	2310.03470v1	null
2023-10-04	Condition numbers in multiview geometry, instability in relative pose estimation, and RANSAC	Hongyi Fan et.al.	2310.02719v1	null
2023-10-05	USB-NeRF: Unrolling Shutter Bundle Adjusted Neural Radiance Fields	Moyang Li et.al.	2310.02687v2	link
2023-10-03	Beyond the Benchmark: Detecting Diverse Anomalies in Videos	Yoav Arad et.al.	2310.01904v1	link
2023-10-03	MFOS: Model-Free & One-Shot Object Pose Estimation	JongMin Lee et.al.	2310.01897v1	null
2023-10-02	LEAP: Liberate Sparse-view 3D Modeling from Camera Poses	Hanwen Jiang et.al.	2310.01410v1	link
2023-10-02	H-InDex: Visual Reinforcement Learning with Hand-Informed Representations for Dexterous Manipulation	Yanjie Ze et.al.	2310.01404v1	link
2023-10-04	Self-supervised Learning of Contextualized Local Visual Embeddings	Thalles Santos Silva et.al.	2310.00527v3	link
2023-09-30	Diff-DOPE: Differentiable Deep Object Pose Estimation	Jonathan Tremblay et.al.	2310.00463v1	null
2023-09-29	Diver Identification Using Anthropometric Data Ratios for Underwater Multi-Human-Robot Collaboration	Jungseok Hong et.al.	2310.00146v1	null
2023-09-29	Denoising and Selecting Pseudo-Heatmaps for Semi-Supervised Human Pose Estimation	Zhuoran Yu et.al.	2310.00099v1	null
2023-09-29	Revisiting Cephalometric Landmark Detection from the view of Human Pose Estimation with Lightweight Super-Resolution Head	Qian Wu et.al.	2309.17143v1	link
2023-09-29	AdaPose: Towards Cross-Site Device-Free Human Pose Estimation with Commodity WiFi	Yunjiao Zhou et.al.	2309.16964v1	null
2023-09-28	End-to-End (Instance)-Image Goal Navigation through Correspondence as an Emergent Phenomenon	Guillaume Bono et.al.	2309.16634v1	null
2023-09-28	Off-the-shelf bin picking workcell with visual pose estimation: A case study on the world robot summit 2018 kitting task	Frederik Hagelskjær et.al.	2309.16221v1	null
2023-09-28	Cloth2Body: Generating 3D Human Body Mesh from 2D Clothing	Lu Dai et.al.	2309.16189v1	null
2023-09-28	Laboratory Automation: Precision Insertion with Adaptive Fingers utilizing Contact through Sliding with Tactile-based Pose Estimation	Sameer Pai et.al.	2309.16170v1	null
2023-09-28	CLIP-Hand3D: Exploiting 3D Hand Pose Estimation via Context-Aware Prompting	Shaoxiang Guo et.al.	2309.16140v1	null
2023-09-28	A Modular Bio-inspired Robotic Hand with High Sensitivity	Chao Liu et.al.	2309.16081v1	null
2023-09-27	Handbook on Leveraging Lines for Two-View Relative Pose Estimation	Petr Hruby et.al.	2309.16040v1	null
2023-09-27	Q-REG: End-to-End Trainable Point Cloud Registration with Surface Curvature	Shengze Jin et.al.	2309.16023v1	null
2023-09-27	Analysis on Multi-robot Relative 6-DOF Pose Estimation Error Based on UWB Range	Xinran Li et.al.	2309.15367v1	null
2023-09-26	Unsupervised Reconstruction of 3D Human Pose Interactions From 2D Poses Alone	Peter Hardy et.al.	2309.14865v1	null
2023-09-26	Learning Vision-Based Bipedal Locomotion for Challenging Terrain	Helei Duan et.al.	2309.14594v1	null
2023-09-25	Spring-IMU Fusion Based Proprioception for Feedback Control of Soft Manipulators	Yinan Meng et.al.	2309.14279v1	null
2023-09-25	Industrial Application of 6D Pose Estimation for Robotic Manipulation in Automotive Internal Logistics	Philipp Quentin et.al.	2309.14265v1	null
2023-09-25	BoIR: Box-Supervised Instance Representation for Multi-Person Pose Estimation	Uyoung Jeong et.al.	2309.14072v1	link
2023-09-24	Towards Subcentimeter Accuracy Digital-Twin Tracking via An RGBD-based Transformer Model and A Comprehensive Mobile Dataset	Zixun Huang et.al.	2309.13570v1	link
2023-09-21	ORTexME: Occlusion-Robust Human Shape and Pose via Temporal Average Texture and Mesh Encoding	Yu Cheng et.al.	2309.12183v1	null
2023-09-21	ZS6D: Zero-shot 6D Object Pose Estimation using Vision Transformers	Philipp Ausserlechner et.al.	2309.11986v1	null
2023-09-21	Ego3DPose: Capturing 3D Cues from Binocular Egocentric Views	Taeho Kang et.al.	2309.11962v1	link
2023-09-21	A Real-Time Multi-Task Learning System for Joint Detection of Face, Facial Landmark and Head Pose	Qingtian Wu et.al.	2309.11773v1	null
2023-09-20	Understanding Pose and Appearance Disentanglement in 3D Human Pose Estimation	Krishna Kanth Nakka et.al.	2309.11667v1	null
2023-09-20	Online Supervised Training of Spaceborne Vision during Proximity Operations using Adaptive Kalman Filtering	Tae Ha Park et.al.	2309.11645v1	null
2023-09-20	OCC-VO: Dense Mapping via 3D Occupancy-Based Visual Odometry for Autonomous Driving	Heng Li et.al.	2309.11011v1	link
2023-09-19	Language-Conditioned Affordance-Pose Detection in 3D Point Clouds	Toan Nguyen et.al.	2309.10911v1	null
2023-09-19	MAGIC-TBR: Multiview Attention Fusion for Transformer-based Bodily Behavior Recognition in Group Settings	Surbhi Madan et.al.	2309.10765v1	link
2023-09-19	SHOWMe: Benchmarking Object-agnostic Hand-Object 3D Reconstruction	Anilkumar Swamy et.al.	2309.10748v1	null
2023-09-20	GloPro: Globally-Consistent Uncertainty-Aware 3D Human Pose Estimation & Tracking in the Wild	Simon Schaefer et.al.	2309.10369v2	null
2023-09-19	RGB-based Category-level Object Pose Estimation via Decoupled Metric Scale Recovery	Jiaxin Wei et.al.	2309.10255v1	link
2023-09-18	Hierarchical Attention and Graph Neural Networks: Toward Drift-Free Pose Estimation	Kathia Melbouci et.al.	2309.09934v1	null
2023-09-18	Application-driven Validation of Posteriors in Inverse Problems	Tim J. Adler et.al.	2309.09764v1	null
2023-09-18	RIDE: Self-Supervised Learning of Rotation-Equivariant Keypoint Detection and Invariant Description for Endoscopy	Mert Asim Karaoglu et.al.	2309.09563v1	null
2023-09-18	Sparse and Privacy-enhanced Representation for Human Pose Estimation	Ting-Ying Lin et.al.	2309.09515v1	null
2023-09-19	RenderIH: A Large-scale Synthetic Dataset for 3D Interacting Hand Pose Estimation	Lijun Li et.al.	2309.09301v2	link
2023-09-16	Optimal Initialization Strategies for Range-Only Trajectory Estimation	Abhishek Goudar et.al.	2309.09011v1	null
2023-09-16	DynaMoN: Motion-Aware Fast And Robust Camera Localization for Dynamic NeRF	Mert Asim Karaoglu et.al.	2309.08927v1	link
2023-09-16	Outram: One-shot Global Localization via Triangulated Scene Graph and Global Outlier Pruning	Pengyu Yin et.al.	2309.08914v1	link
2023-09-15	Towards Robust and Smooth 3D Multi-Person Pose Estimation from Monocular Videos in the Wild	Sungchan Park et.al.	2309.08644v1	null
2023-09-15	YCB-Ev: Event-vision dataset for 6DoF object pose estimation	Pavel Rojtberg et.al.	2309.08482v1	link
2023-09-15	Fast and Accurate Deep Loop Closing and Relocalization for Reliable LiDAR SLAM	Chenghao Shi et.al.	2309.08086v1	null
2023-09-14	Gradient based Grasp Pose Optimization on a NeRF that Approximates Grasp Success	Gergely Sóti et.al.	2309.08040v1	null
2023-09-14	TEMPO: Efficient Multi-View Pose Estimation, Tracking, and Forecasting	Rohan Choudhury et.al.	2309.07910v1	null
2023-09-14	Towards Robust and Unconstrained Full Range of Rotation Head Pose Estimation	Thorsten Hempel et.al.	2309.07654v1	link
2023-09-14	EP2P-Loc: End-to-End 3D Point to 2D Pixel Localization for Large-Scale Visual Localization	Minjung Kim et.al.	2309.07471v1	link
2023-09-14	Unleashing the Power of Depth and Pose Estimation Neural Networks by Designing Compatible Endoscopic Images	Junyang Wu et.al.	2309.07390v1	null
2023-09-13	LInKs "Lifting Independent Keypoints" -- Partial Pose Lifting for Occlusion Handling with Improved Accuracy in 2D-3D Human Pose Estimation	Peter Hardy et.al.	2309.07243v1	null
2023-09-13	3D Active Metric-Semantic SLAM	Yuezhan Tao et.al.	2309.06950v1	null
2023-09-11	ViHOPE: Visuotactile In-Hand Object 6D Pose Estimation with Shape Completion	Hongyu Li et.al.	2309.05662v1	null
2023-09-11	Towards Intuitive HMI for UAV Control	Filip Zoric et.al.	2309.05460v1	null
2023-09-12	FreeMan: Towards Benchmarking 3D Human Pose Estimation in the Wild	Jiong Wang et.al.	2309.05073v2	link
2023-09-09	Probabilistic Triangulation for Uncalibrated Multi-View 3D Human Pose Estimation	Boyuan Jiang et.al.	2309.04756v1	link
2023-09-09	Mirror-Aware Neural Humans	Daniel Ajisafe et.al.	2309.04750v1	link
2023-09-08	Robot Localization and Mapping Final Report -- Sequential Adversarial Learning for Self-Supervised Deep Visual Odometry	Akankshya Kar et.al.	2309.04147v1	null
2023-09-07	ArtiGrasp: Physically Plausible Synthesis of Bi-Manual Dexterous Grasping and Articulation	Hui Zhang et.al.	2309.03891v1	null
2023-09-05	An automated, high-resolution phenotypic assay for adult Brugia malayi and microfilaria	Upender Kalwa et.al.	2309.03235v1	null
2023-09-05	A Robust Localization Solution for an Uncrewed Ground Vehicle in Unstructured Outdoor GNSS-Denied Environments	W. Jacob Wagner et.al.	2309.02569v1	null
2023-09-05	GO-SLAM: Global Optimization for Consistent 3D Instant Reconstruction	Youmin Zhang et.al.	2309.02436v1	link
2023-09-05	DR-Pose: A Two-stage Deformation-and-Registration Pipeline for Category-level 6D Object Pose Estimation	Lei Zhou et.al.	2309.01925v1	link
2023-09-04	On the Query Strategies for Efficient Online Active Distillation	Michele Boldo et.al.	2309.01612v1	null
2023-09-04	DiffHPE: Robust, Coherent 3D Human Pose Lifting with Diffusion	Cédric Rommel et.al.	2309.01575v1	null
2023-09-06	Refined Temporal Pyramidal Compression-and-Amplification Transformer for 3D Human Pose Estimation	Hanbing Liu et.al.	2309.01365v2	link
2023-09-04	SKoPe3D: A Synthetic Dataset for Vehicle Keypoint Perception in 3D from Traffic Monitoring Cameras	Himanshu Pahadia et.al.	2309.01324v1	null
2023-09-03	BodySLAM++: Fast and Tightly-Coupled Visual-Inertial Camera and Human Motion Tracking	Dorian F. Henning et.al.	2309.01236v1	null
2023-09-02	Mitigating Motion Blur for Robust 3D Baseball Player Pose Modeling for Pitch Analysis	Jerrin Bright et.al.	2309.01010v1	null
2023-09-01	Fusing Monocular Images and Sparse IMU Signals for Real-time Human Motion Capture	Shaohua Pan et.al.	2309.00310v1	link
2023-08-31	EMDB: The Electromagnetic Database of Global 3D Human Pose and Shape in the Wild	Manuel Kaufmann et.al.	2308.16894v1	link
2023-08-31	SA6D: Self-Adaptive Few-Shot 6D Pose Estimator for Novel and Occluded Objects	Ning Gao et.al.	2308.16528v1	null
2023-08-30	Two-Stage Violence Detection Using ViTPose and Classification Models at Smart Airports	İrem Üstek et.al.	2308.16325v1	link
2023-08-30	SignDiff: Learning Diffusion Models for American Sign Language Production	Sen Fang et.al.	2308.16082v1	null
2023-08-30	Learning Structure-from-Motion with Graph Attention Networks	Lucas Brynte et.al.	2308.15984v1	link
2023-08-30	Reconstructing Groups of People with Hypergraph Relational Reasoning	Buzhen Huang et.al.	2308.15844v1	link
2023-08-29	3D-MuPPET: 3D Multi-Pigeon Pose Estimation and Tracking	Urs Waldmann et.al.	2308.15316v1	link
2023-08-29	Spatio-temporal MLP-graph network for 3D human pose estimation	Tanvir Hassan et.al.	2308.15313v1	link
2023-08-29	Pose-Free Neural Radiance Fields via Implicit Pose Regularization	Jiahui Zhang et.al.	2308.15049v1	null
2023-08-28	R3D3: Dense 3D Reconstruction of Dynamic Scenes from Multiple Cameras	Aron Schmied et.al.	2308.14713v1	null
2023-08-28	Video-Based Hand Pose Estimation for Remote Assessment of Bradykinesia in Parkinson's Disease	Gabriela T. Acevedo Trebbau et.al.	2308.14679v1	null
2023-08-28	Active Pose Refinement for Textureless Shiny Objects using the Structured Light Camera	Jun Yang et.al.	2308.14665v1	null
2023-08-28	CPFES: Physical Fitness Evaluation Based on Canadian Agility and Movement Skill Assessment	Pengcheng Dong et.al.	2308.14324v1	null
2023-08-27	LDL: Line Distance Functions for Panoramic Localization	Junho Kim et.al.	2308.13989v1	link
2023-08-26	Prior-guided Source-free Domain Adaptation for Human Pose Estimation	Dripta S. Raychaudhuri et.al.	2308.13954v1	null
2023-08-26	Vision-Based Human Pose Estimation via Deep Learning: A Survey	Gongjin Lan et.al.	2308.13872v1	null
2023-08-24	POCO: 3D Pose and Shape Estimation with Confidence	Sai Kumar Dwivedi et.al.	2308.12965v1	link
2023-08-24	Robot Pose Nowcasting: Forecast the Future to Improve the Present	Alessandro Simoni et.al.	2308.12914v1	null
2023-08-23	Certifiably Optimal Rotation and Pose Estimation Based on the Cayley Map	Timothy D Barfoot et.al.	2308.12418v1	null
2023-08-22	Animal3D: A Comprehensive Dataset of 3D Animal Pose and Shape	Jiacong Xu et.al.	2308.11737v1	null
2023-08-22	TrackFlow: Multi-Object Tracking with Normalizing Flows	Gianluca Mancusi et.al.	2308.11513v1	null
2023-08-22	A LiDAR-Inertial SLAM Tightly-Coupled with Dropout-Tolerant GNSS Fusion for Autonomous Mine Service Vehicles	Yusheng Wang et.al.	2308.11492v1	null
2023-08-22	PoseGraphNet++: Enriching 3D Human Pose with Orientation Estimation	Soubarna Banik et.al.	2308.11440v1	null
2023-08-22	Novel-view Synthesis and Pose Estimation for Hand-Object Interaction from Sparse Views	Wentian Qu et.al.	2308.11198v1	null
2023-08-21	Spectral Graphormer: Spectral Graph-based Transformer for Egocentric Two-Hand Reconstruction using Multi-View Color Images	Tze Ho Elden Tse et.al.	2308.11015v1	null
2023-08-21	Polarimetric Information for Multi-Modal 6D Pose Estimation of Photometrically Challenging Objects with Limited Data	Patrick Ruhkamp et.al.	2308.10627v1	null
2023-08-21	GaitPT: Skeletons Are All You Need For Gait Recognition	Andy Catruna et.al.	2308.10623v1	null
2023-08-21	Approximately Equivariant Graph Networks	Ningyuan Huang et.al.	2308.10436v1	link
2023-08-21	In-Rack Test Tube Pose Estimation Using RGB-D Data	Hao Chen et.al.	2308.10411v1	null
2023-08-20	Co-Evolution of Pose and Mesh for 3D Human Body Estimation from Video	Yingxuan You et.al.	2308.10305v1	link
2023-08-20	OCHID-Fi: Occlusion-Robust Hand Pose Estimation in 3D via RF-Vision	Shujie Zhang et.al.	2308.10146v1	link
2023-08-19	3D-Aware Neural Body Fitting for Occlusion Robust 3D Human Pose Estimation	Yi Zhang et.al.	2308.10123v1	link
2023-08-19	Pseudo Flow Consistency for Self-Supervised 6D Object Pose Estimation	Yang Hai et.al.	2308.10016v1	link
2023-08-19	UniAP: Towards Universal Animal Perception in Vision via Few-shot Learning	Meiqi Sun et.al.	2308.09953v1	null
2023-08-22	Scene-Aware Feature Matching	Xiaoyong Lu et.al.	2308.09949v2	null
2023-08-18	PoSynDA: Multi-Hypothesis Pose Synthesis Domain Adaptation for Robust 3D Human Pose Estimation	Hanbing Liu et.al.	2308.09678v1	link
2023-08-18	Improving 3D Pose Estimation for Sign Language	Maksym Ivashechkin et.al.	2308.09525v1	null
2023-08-18	Denoising Diffusion for 3D Hand Pose Estimation from Images	Maksym Ivashechkin et.al.	2308.09523v1	null
2023-08-18	ResQ: Residual Quantization for Video Perception	Davide Abati et.al.	2308.09511v1	null
2023-08-17	MovePose: A High-performance Human Pose Estimation Algorithm on Mobile and Edge Devices	Dongyang Yu et.al.	2308.09084v1	null
2023-08-17	Pedestrian Environment Model for Automated Driving	Adrian Holzbock et.al.	2308.09080v1	link
2023-08-17	Exploiting Point-Wise Attention in 6D Object Pose Estimation Based on Bidirectional Prediction	Yuhao Yang et.al.	2308.08518v2	null
2023-08-16	View Consistent Purification for Accurate Cross-View Localization	Shan Wang et.al.	2308.08110v1	null
2023-08-15	Learning Better Keypoints for Multi-Object 6DoF Pose Estimation	Yangzheng Wu et.al.	2308.07827v1	link
2023-08-14	Group Pose: A Simple Baseline for End-to-End Multi-person Pose Estimation	Huan Liu et.al.	2308.07313v1	link
2023-08-12	4DRVO-Net: Deep 4D Radar-Visual Odometry Using Multi-Modal and Multi-Scale Adaptive Fusion	Guirong Zhuo et.al.	2308.06573v1	null
2023-08-17	EgoPoser: Robust Real-Time Ego-Body Pose Estimation in Large Scenes	Jiaxi Jiang et.al.	2308.06493v2	null
2023-08-11	Aggressive Aerial Grasping using a Soft Drone with Onboard Perception	Samuel Ubellacker et.al.	2308.06351v1	null
2023-08-11	VERF: Runtime Monitoring of Pose Estimation with Neural Radiance Fields	Dominic Maggio et.al.	2308.05939v1	null
2023-08-10	Toward Globally Optimal State Estimation Using Automatically Tightened Semidefinite Relaxations	Frederike Dümbgen et.al.	2308.05783v1	link
2023-08-10	KS-APR: Keyframe Selection for Robust Absolute Pose Regression	Changkun Liu et.al.	2308.05459v1	null
2023-08-10	How-to Augmented Lagrangian on Factor Graphs	Barbara Bazzana et.al.	2308.05444v1	null
2023-08-10	Deep Fusion Transformer Network with Weighted Vector-Wise Keypoints Voting for Robust 6D Object Pose Estimation	Jun Zhou et.al.	2308.05438v1	link
2023-08-10	Robust Localization with Visual-Inertial Odometry Constraints for Markerless Mobile AR	Changkun Liu et.al.	2308.05394v1	null
2023-08-10	Double-chain Constraints for 3D Human Pose Estimation in Images and Videos	Hongbo Kang et.al.	2308.05298v1	link
2023-08-09	ACE-HetEM for ab initio Heterogenous Cryo-EM 3D Reconstruction	Weijie Chen et.al.	2308.04956v1	null
2023-08-07	SEM-GAT: Explainable Semantic Pose Estimation using Learned Graph Attention	Efimia Panagiotaki et.al.	2308.03718v1	link
2023-08-07	A Horse with no Labels: Self-Supervised Horse Pose Estimation from Unlabelled Images and Synthetic Prior	Jose Sosa et.al.	2308.03411v1	null
2023-08-06	Source-free Domain Adaptive Human Pose Estimation	Qucheng Peng et.al.	2308.03202v1	link
2023-08-04	Diffusion-Augmented Depth Prediction with Sparse Annotations	Jiaqi Li et.al.	2308.02283v1	null
2023-08-04	DTF-Net: Category-Level Pose Estimation and Shape Reconstruction via Deformable Template Field	Haowen Wang et.al.	2308.02239v1	null
2023-08-07	Robust Self-Supervised Extrinsic Self-Calibration	Takayuki Kanai et.al.	2308.02153v2	null
2023-08-03	Sim-to-Real Vision-depth Fusion CNNs for Robust Pose Estimation Aboard Autonomous Nano-quadcopter	Luca Crupi et.al.	2308.01833v1	null
2023-08-03	Active Acoustic Sensing for Robot Manipulation	Shihan Lu et.al.	2308.01600v1	null
2023-08-02	HANDAL: A Dataset of Real-World Manipulable Object Categories with Pose Annotations, Affordances, and Reconstructions	Andrew Guo et.al.	2308.01477v1	null
2023-08-06	Human-M3: A Multi-view Multi-modal Dataset for 3D Human Pose Estimation in Outdoor Scenes	Bohao Fan et.al.	2308.00628v2	link
2023-08-01	Markerless human pose estimation for biomedical applications: a survey	Andrea Avogaro et.al.	2308.00519v1	null
2023-08-01	Kidnapping Deep Learning-based Multirotors using Optimized Flying Adversarial Patches	Pia Hanfeld et.al.	2308.00344v1	link
2023-08-01	Fine-Grained Sports, Yoga, and Dance Postures Recognition: A Benchmark Analysis	Asish Bera et.al.	2308.00323v1	null
2023-08-01	Robust Single-view Cone-beam X-ray Pose Estimation with Neural Tuned Tomography (NeTT) and Masked Neural Radiance Fields (mNeRF)	Chaochao Zhou et.al.	2308.00214v1	null
2023-07-31	Lightweight Super-Resolution Head for Human Pose Estimation	Haonan Wang et.al.	2307.16765v1	link
2023-07-31	DiffPose: SpatioTemporal Diffusion Model for Video-Based Human Pose Estimation	Runyang Feng et.al.	2307.16687v1	null
2023-07-30	Touch if it's transparent! ACTOR: Active Tactile-based Category-Level Transparent Object Reconstruction	Prajval Kumar Murali et.al.	2307.16254v1	null
2023-07-30	Successive Pose Estimation and Beam Tracking for mmWave Vehicular Communication Systems	Cen Liu et.al.	2307.16117v1	link
2023-07-29	Iterative Graph Filtering Network for 3D Human Pose Estimation	Zaedul Islam et.al.	2307.16074v1	link
2023-07-29	HandMIM: Pose-Aware Self-Supervised Learning for 3D Hand Mesh Estimation	Zuyan Liu et.al.	2307.16061v1	null
2023-07-29	Effective Whole-body Pose Estimation with Two-stages Distillation	Zhendong Yang et.al.	2307.15880v1	link
2023-07-28	TrackAgent: 6D Object Tracking via Reinforcement Learning	Konstantin Röhrl et.al.	2307.15671v1	null
2023-07-28	Revisiting Fully Convolutional Geometric Features for Object 6D Pose Estimation	Jaime Corsetti et.al.	2307.15514v1	link
2023-07-28	Robust Visual Sim-to-Real Transfer for Robotic Manipulation	Ricardo Garcia et.al.	2307.15320v1	null
2023-07-27	Weakly Supervised Multi-Modal 3D Human Body Pose Estimation for Autonomous Driving	Peter Bauer et.al.	2307.14889v1	null
2023-07-26	Attention of Robot Touch: Tactile Saliency Prediction for Robust Sim-to-Real Tactile Control	Yijiong Lin et.al.	2307.14510v1	null
2023-07-28	CBGL: Fast Monte Carlo Passive Global Localisation of 2D LIDAR Sensor	Alexandros Filotheou et.al.	2307.14247v2	link
2023-07-26	Deep Robust Multi-Robot Re-localisation in Natural Environments	Milad Ramezani et.al.	2307.13950v1	null
2023-07-25	Of Mice and Pose: 2D Mouse Pose Estimation from Unlabelled Data and Synthetic Prior	Jose Sosa et.al.	2307.13361v1	null
2023-07-23	TransNet: Transparent Object Manipulation Through Category-Level Pose Estimation	Huijie Zhang et.al.	2307.12400v1	null
2023-07-25	FDCT: Fast Depth Completion for Transparent Objects	Tianan Li et.al.	2307.12274v2	link
2023-07-22	Challenges for Monocular 6D Object Pose Estimation in Robotics	Stefan Thalhammer et.al.	2307.12172v1	null
2023-07-22	Pyramid Semantic Graph-based Global Point Cloud Registration with Low Overlap	Zhijian Qiao et.al.	2307.12116v1	link
2023-07-22	Robot Structure Prior Guided Temporal Attention for Camera-to-Robot Pose Estimation from Image Sequence	Yang Tian et.al.	2307.12106v1	link
2023-07-26	LAMP: Leveraging Language Prompts for Multi-person Pose Estimation	Shengnan Hu et.al.	2307.11934v2	link
2023-07-21	YOLOPose V2: Understanding and Improving Transformer-based 6D Pose Estimation	Arul Selvam Periyasamy et.al.	2307.11550v1	null
2023-07-21	KVN: Keypoints Voting Network with Differentiable RANSAC for Stereo Pose Estimation	Ivano Donadi et.al.	2307.11543v1	link
2023-07-21	Semantically-enhanced Deep Collision Prediction for Autonomous Navigation using Aerial Robots	Mihir Kulkarni et.al.	2307.11522v1	null
2023-07-20	SimCol3D -- 3D Reconstruction during Colonoscopy Challenge	Anita Rau et.al.	2307.11261v1	link
2023-07-20	MSQNet: Actor-agnostic Action Recognition with Multi-modal Query	Anindya Mondal et.al.	2307.10763v1	link
2023-07-19	POV-Surgery: A Dataset for Egocentric Hand and Tool Pose Estimation During Surgical Activities	Rui Wang et.al.	2307.10387v1	link
2023-07-18	ActionPrompt: Action-Guided 3D Human Pose Estimation With Text and Pose Prompting	Hongwei Zheng et.al.	2307.09026v1	null
2023-07-17	Human Emergency Detection during Autonomous Hospital Transports	Andreas Zachariae et.al.	2307.08359v1	link
2023-07-17	Self-supervised Monocular Depth Estimation: Let's Talk About The Weather	Kieran Saunders et.al.	2307.08357v1	null
2023-07-20	Boosting 3-DoF Ground-to-Satellite Camera Localization Accuracy via Geometry-Guided Cross-View Transformer	Yujiao Shi et.al.	2307.08015v3	link
2023-07-15	Tightly-Coupled LiDAR-Visual SLAM Based on Geometric Features for Mobile Agents	Ke Cao et.al.	2307.07763v1	null
2023-07-13	Haptic-guided assisted telemanipulation approach for grasping desired objects from heaps	Maxime Adjigble et.al.	2307.07053v1	null
2023-07-13	Improving 2D Human Pose Estimation across Unseen Camera Views with Synthetic Data	Miroslav Purkrábek et.al.	2307.06737v1	link
2023-07-12	Deep learning-based estimation of whole-body kinematics from multi-view images	Kien X. Nguyen et.al.	2307.05896v1	link
2023-07-12	GLA-GCN: Global-local Adaptive Graph Convolutional Network for 3D Human	Bruce X. B. Yu et.al.	2307.05853v1	link
2023-07-09	TransPose: A Transformer-based 6D Object Pose Estimation Network with Depth Refinement	Mahmoud Abdulsalam et.al.	2307.05561v1	null
2023-07-11	ResMatch: Residual Attention Learning for Local Feature Matching	Yuxin Deng et.al.	2307.05180v1	link
2023-07-07	Proximity and Visuotactile Point Cloud Fusion for Contact Patches in Extreme Deformation	Jessica Yin et.al.	2307.03839v1	null
2023-07-07	Back to Optimization: Diffusion-based Zero-Shot 3D Human Pose Estimation	Zhongyu Jiang et.al.	2307.03833v1	link
2023-07-07	Equivariant Single View Pose Prediction Via Induced and Restricted Representations	Owen Howell et.al.	2307.03704v1	null
2023-07-07	RCDN -- Robust X-Corner Detection Algorithm based on Advanced CNN Model	Ben Chen et.al.	2307.03505v1	null
2023-07-06	Self-supervised Optimization of Hand Pose Estimation using Anatomical Features and Iterative Learning	Christian Jauch et.al.	2307.03007v1	null
2023-07-06	Recognition and Estimation of Human Finger Pointing with an RGB Camera for Robot Directive	Eran Bamani et.al.	2307.02949v1	null
2023-07-06	A Real-time Human Pose Estimation Approach for Optimal Sensor Placement in Sensor-based Human Activity Recognition	Orhan Konak et.al.	2307.02906v1	null
2023-07-04	Secure Deep Learning-based Distributed Intelligence on Pocket-sized Drones	Elia Cereda et.al.	2307.01559v1	null
2023-07-03	Joint Coordinate Regression and Association For Multi-Person Pose Estimation, A Pure Neural Network Approach	Dongyang Yu et.al.	2307.01004v1	null
2023-07-01	Automatic Solver Generator for Systems of Laurent Polynomial Equations	Evgeniy Martyushev et.al.	2307.00320v1	link
2023-07-01	SyMFM6D: Symmetry-aware Multi-directional Fusion for Multi-View 6D Object Pose Estimation	Fabian Duffhauss et.al.	2307.00306v1	link
2023-06-30	GIRA: Gaussian Mixture Models for Inference and Robot Autonomy	Kshitij Goel et.al.	2307.00071v1	link
2023-06-30	Towards the extraction of robust sign embeddings for low resource sign language recognition	Mathieu De Coster et.al.	2306.17558v1	null
2023-06-30	Fusion of Visual-Inertial Odometry with LiDAR Relative Localization for Cooperative Guidance of a Micro-Scale Aerial Vehicle	Václav Pritzl et.al.	2306.17544v1	link
2023-06-30	Locking On: Leveraging Dynamic Vehicle-Imposed Motion Constraints to Improve Visual Localization	Stephen Hausler et.al.	2306.17529v1	null
2023-06-29	ID-Pose: Sparse-view Camera Pose Estimation by Inverting Diffusion Models	Weihao Cheng et.al.	2306.17140v1	null
2023-06-29	Learning Structure-Guided Diffusion Model for 2D Human Pose Estimation	Zhongwei Qiu et.al.	2306.17074v1	null
2023-06-28	Hierarchical Graph Neural Networks for Proprioceptive 6D Pose Estimation of In-hand Objects	Alireza Rezazadeh et.al.	2306.15858v1	null
2023-06-09	Data-Link: High Fidelity Manufacturing Datasets for Model2Real Transfer under Industrial Settings	Sunny Katyara et.al.	2306.05766v1	null
2023-05-28	Counter-Hypothetical Particle Filters for Single Object Pose Tracking	Elizabeth A. Olson et.al.	2305.17828v1	null
2023-05-25	Enhanced 6D Pose Estimation for Robotic Fruit Picking	Marco Costanzo et.al.	2305.15856v1	null
2023-05-22	You Only Look at One: Category-Level Object Representations for Pose Estimation From a Single Example	Walter Goodwin et.al.	2305.12626v1	null
2023-05-18	Manifold-Aware Self-Training for Unsupervised Domain Adaptation on Regressing 6D Object Pose	Yichen Zhang et.al.	2305.10808v1	link
2023-05-08	RelPose++: Recovering 6D Poses from Sparse-view Observations	Amy Lin et.al.	2305.04926v1	link
2023-04-17	Uncovering the Background-Induced bias in RGB based 6-DoF Object Pose Estimation	Elena Govi et.al.	2304.08230v1	link
2023-03-28	CARTO: Category and Joint Agnostic Reconstruction of ARTiculated Objects	Nick Heppert et.al.	2303.15782v1	link
2023-03-23	Prior-free Category-level Pose Estimation with Implicit Space Transformation	Jianhui Liu et.al.	2303.13479v1	link
2023-06-21	6D Object Pose Estimation from Approximate 3D Models for Orbital Robotics	Maximilian Ulmer et.al.	2303.13241v3	null
2023-03-22	Rigidity-Aware Detection for 6D Object Pose Estimation	Yang Hai et.al.	2303.12396v1	link
2023-03-22	Object Pose Estimation with Statistical Guarantees: Conformal Keypoint Detection and Geometric Uncertainty Propagation	Heng Yang et.al.	2303.12246v1	link
2023-03-21	Linear-Covariance Loss for End-to-End Learning of 6D Pose Estimation	Fulin Liu et.al.	2303.11516v1	link
2023-03-18	SOCS: Semantically-aware Object Coordinate Space for Category-Level 6D Object Pose Estimation under Large Shape Variations	Boyan Wan et.al.	2303.10346v1	null
2023-03-12	Module-Wise Network Quantization for 6D Object Pose Estimation	Saqib Javed et.al.	2303.06753v1	link
2023-03-09	SpyroPose: Importance Sampling Pyramids for Object Pose Distribution Estimation in SE(3)	Rasmus Laurvig Haugaard et.al.	2303.05308v1	null
2023-03-03	Depth-based 6DoF Object Pose Estimation using Swin Transformer	Zhujun Li et.al.	2303.02133v1	link
2023-03-02	Canonical mapping as a general-purpose object descriptor for robotic manipulation	Benjamin Joffe et.al.	2303.01331v1	null
2023-02-14	MSDA: Monocular Self-supervised Domain Adaptation for 6D Object Pose Estimation	Dingding Cai et.al.	2302.07300v1	null
2023-02-14	Model-Based Underwater 6D Pose Estimation from RGB	Davide Sapienza et.al.	2302.06821v1	null
2023-02-02	A Projective Geometric View for 6D Pose Estimation in mmWave MIMO Systems	Shengqiang Shen et.al.	2302.00227v2	null
2023-01-31	Collision-aware In-hand 6D Object Pose Estimation using Multiple Vision-based Tactile Sensors	Gabriele M. Caddeo et.al.	2301.13667v1	link
2023-01-19	Learning ultrasound plane pose regression: assessing generalized pose coordinates in the fetal brain	Chiara Di Vece et.al.	2301.08317v1	null
2023-01-19	RGB-D-Based Categorical Object Pose and Shape Estimation: Methods, Datasets, and Evaluation	Leonard Bruns et.al.	2301.08147v1	link
2022-12-21	HouseCat6D -- A Large-Scale Multi-Modal Category Level 6D Object Pose Dataset with Household Objects in Realistic Scenarios	HyunJun Jung et.al.	2212.10428v2	link
2022-12-13	MegaPose: 6D Pose Estimation of Novel Objects via Render & Compare	Yann Labbé et.al.	2212.06870v1	null
2022-12-11	Context-aware 6D Pose Estimation of Known Objects using RGB-D data	Ankit Kumar et.al.	2212.05560v1	null
2023-01-30	Category-Level 6D Object Pose Estimation with Flexible Vector-Based Rotation Representation	Wei Chen et.al.	2212.04632v2	null

Point Cloud Registration

Publish Date	Title	Authors	PDF	Code
2025-07-20	Decision PCR: Decision version of the Point Cloud Registration task	Yaojie Zhang et.al.	2507.14965v1	null
2025-07-19	GPI-Net: Gestalt-Guided Parallel Interaction Network via Orthogonal Geometric Consistency for Robust Point Cloud Registration	Weikang Gu et.al.	2507.14452v1	null
2025-07-16	A Multi-Level Similarity Approach for Single-View Object Grasping: Matching, Planning, and Fine-Tuning	Hao Chen et.al.	2507.11938v1	null
2025-07-09	Diff $^2$ I2P: Differentiable Image-to-Point Cloud Registration with Diffusion Prior	Juncheng Mu et.al.	2507.06651v1	null
2025-07-07	Simultaneous Localization and Mapping Using Active mmWave Sensing in 5G NR	Tao Du et.al.	2507.04662v1	null
2025-07-06	Lidar Variability: A Novel Dataset and Comparative Study of Solid-State and Spinning Lidars	Doumegna Mawuto Koudjo Felix et.al.	2507.04321v1	null
2025-07-03	TurboReg: TurboClique for Robust and Efficient Point Cloud Registration	Shaocheng Yan et.al.	2507.01439v2	null
2025-06-26	CA-I2P: Channel-Adaptive Registration Network with Global Optimal Selection	Zhixin Cheng et.al.	2506.21364v1	null
2025-06-18	Correspondence-Free Multiview Point Cloud Registration via Depth-Guided Joint Optimisation	Yiran Zhou et.al.	2506.18922v1	null
2025-06-16	MT-PCR: A Hybrid Mamba-Transformer with Spatial Serialization for Hierarchical Point Cloud Registration	Bingxi Liu et.al.	2506.13183v1	null
2025-06-13	Robust Filtering -- Novel Statistical Learning and Inference Algorithms with Applications	Aamir Hussain Chughtai et.al.	2506.11530v1	null
2025-06-05	Rectified Point Flow: Generic Point Cloud Pose Estimation	Tao Sun et.al.	2506.05282v1	null
2025-05-30	A 3D Mobile Crowdsensing Framework for Sustainable Urban Digital Twins	Taku Yamazaki et.al.	2505.24348v1	null
2025-05-23	A Coarse to Fine 3D LiDAR Localization with Deep Local Features for Long Term Robot Navigation in Large Environments	Míriam Máximo et.al.	2505.18340v1	link
2025-05-22	D-LIO: 6DoF Direct LiDAR-Inertial Odometry based on Simultaneous Truncated Distance Field Mapping	Lucia Coto-Elena et.al.	2505.16726v1	link
2025-05-19	Cross-modal feature fusion for robust point cloud registration with ambiguous geometry	Zhaoyi Wang et.al.	2505.13088v1	link
2025-05-17	MonoMobility: Zero-Shot 3D Mobility Analysis from Monocular Videos	Hongyi Zhou et.al.	2505.11868v1	null
2025-05-15	VGC-RIO: A Tightly Integrated Radar-Inertial Odometry with Spatial Weighted Doppler Velocity and Local Geometric Constrained RCS Histograms	Jianguang Xiang et.al.	2505.09103v2	null
2025-05-08	An Efficient Method for Accurate Pose Estimation and Error Correction of Cuboidal Objects	Utsav Rai et.al.	2505.04962v1	null
2025-05-07	Registration of 3D Point Sets Using Exponential-based Similarity Matrix	Ashutosh Singandhupe et.al.	2505.04540v1	link
2025-05-08	FA-KPConv: Introducing Euclidean Symmetries to KPConv via Frame Averaging	Ali Alawieh et.al.	2505.04485v2	null
2025-05-06	Matching Distance and Geometric Distribution Aided Learning Multiview Point Cloud Registration	Shiqi Li et.al.	2505.03692v1	link
2025-05-04	Enhancing Lidar Point Cloud Sampling via Colorization and Super-Resolution of Lidar Imagery	Sier Ha et.al.	2505.02049v1	null
2025-05-09	3D Hand-Eye Calibration for Collaborative Robot Arm: Look at Robot Base Once	Leihui Li et.al.	2504.21619v2	link
2025-04-30	Multiview Point Cloud Registration via Optimization in an Autoencoder Latent Space	Luc Vedrenne et.al.	2504.21467v1	null
2025-04-10	Investigating Vision-Language Model for Point Cloud-based Vehicle Classification	Yiqiao Li et.al.	2504.08154v1	null
2025-04-09	A Pointcloud Registration Framework for Relocalization in Subterranean Environments	David Akhihiero et.al.	2504.07231v1	null
2025-04-09	FACT: Multinomial Misalignment Classification for Point Cloud Registration	Ludvig Dillén et.al.	2504.06627v1	null
2025-04-08	Implementation of a Zed 2i Stereo Camera for High-Frequency Shoreline Change and Coastal Elevation Monitoring	José A. Pilartes-Congo et.al.	2504.06464v1	null
2025-04-02	Bridge 2D-3D: Uncertainty-aware Hierarchical Registration Network with Domain Alignment	Zhixin Cheng et.al.	2504.01641v1	null
2025-03-21	R2LDM: An Efficient 4D Radar Super-Resolution Framework Leveraging Diffusion Model	Boyuan Zheng et.al.	2503.17097v1	null
2025-03-21	ColabSfM: Collaborative Structure-from-Motion by Point Cloud Registration	Johan Edstedt et.al.	2503.17093v1	link
2025-03-17	MT-PCR: Leveraging Modality Transformation for Large-Scale Point Cloud Registration with Limited Overlap	Yilong Wu et.al.	2503.12833v1	null
2025-03-13	Unlocking Generalization Power in LiDAR Point Cloud Registration	Zhenxuan Zeng et.al.	2503.10149v1	link
2025-03-11	BUFFER-X: Towards Zero-Shot Point Cloud Registration in Diverse Scenes	Minkyun Seo et.al.	2503.07940v1	link
2025-03-10	SANDRO: a Robust Solver with a Splitting Strategy for Point Cloud Registration	Michael Adlerstein et.al.	2503.07743v1	link
2025-03-10	HybridReg: Robust 3D Point Cloud Registration with Hybrid Motions	Keyu Du et.al.	2503.07019v1	link
2025-03-07	Diff-Reg v2: Diffusion-Based Matching Matrix Estimation for Image Matching and 3D Registration	Qianliang Wu et.al.	2503.04127v2	null
2025-03-04	HyperGCT: A Dynamic Hyper-GNN-Learned Geometric Constraint for 3D Registration	Xiyu Zhang et.al.	2503.02195v1	null
2025-03-02	Semantic-ICP: Iterative Closest Point for Non-rigid Multi-Organ Point Cloud Registration	Wanwen Chen et.al.	2503.00972v1	null
2025-02-26	BEV-LIO(LC): BEV Image Assisted LiDAR-Inertial Odometry with Loop Closure	Haoxin Cai et.al.	2502.19242v1	link
2025-02-15	Occlusion-aware Non-Rigid Point Cloud Registration via Unsupervised Neural Deformation Correntropy	Mingyang Zhao et.al.	2502.10704v1	link
2025-02-12	Fully-Geometric Cross-Attention for Point Cloud Registration	Weijie Wang et.al.	2502.08285v1	null
2025-02-11	Multiview Point Cloud Registration Based on Minimum Potential Energy for Free-Form Blade Measurement	Zijie Wu et.al.	2502.07680v1	null
2025-02-10	DefTransNet: A Transformer-based Method for Non-Rigid Point Cloud Registration in the Simulation of Soft Tissue Deformation	Sara Monji-Azad et.al.	2502.06336v1	null
2025-02-05	Mapping and Localization Using LiDAR Fiducial Markers	Yibo Liu et.al.	2502.03510v1	null
2025-01-31	A Direct Semi-Exhaustive Search Method for Robust, Partial-to-Full Point Cloud Registration	Richard Cheng et.al.	2502.00115v1	null
2025-01-18	PSReg: Prior-guided Sparse Mixture of Experts for Point Cloud Registration	Xiaoshui Huang et.al.	2501.07762v2	null
2025-01-10	LPRnet: A self-supervised registration network for LiDAR and photogrammetric point clouds	Chen Wang et.al.	2501.05669v1	null
2025-01-09	LP-ICP: General Localizability-Aware Point Cloud Registration for Robust Localization in Extreme Unstructured Environments	Haosong Yue et.al.	2501.02580v2	link
2025-01-03	MRG: A Multi-Robot Manufacturing Digital Scene Generation Method Using Multi-Instance Point Cloud Registration	Songjie Han et.al.	2501.02041v1	null
2024-12-29	Towards Explaining Uncertainty Estimates in Point Cloud Registration	Ziyuan Qin et.al.	2412.20612v1	null
2024-12-26	Resolving the Ambiguity of Complete-to-Partial Point Cloud Registration for Image-Guided Liver Surgery with Patches-to-Partial Matching	Zixin Yang et.al.	2412.19328v1	null
2024-12-25	Cross-PCR: A Robust Cross-Source Point Cloud Registration Framework	Guiyu Zhao et.al.	2412.18873v1	null
2024-12-23	PointVoxelFormer -- Reviving point cloud networks for 3D medical imaging	Mattias Paul Heinrich et.al.	2412.17390v1	null
2024-12-19	3D Registration in 30 Years: A Survey	Jiaqi Yang et.al.	2412.13735v2	link
2024-12-13	TrafficLoc: Localizing Traffic Surveillance Cameras in 3D Scenes	Yan Xia et.al.	2412.10308v1	null
2024-12-10	A Real-time Degeneracy Sensing and Compensation Method for Enhanced LiDAR SLAM	Zongbo Liao et.al.	2412.07513v1	null
2024-12-07	AutoURDF: Unsupervised Robot Modeling from Point Cloud Frames Using Cluster Registration	Jiong Lin et.al.	2412.05507v1	null
2024-12-06	GS-Matching: Reconsidering Feature Matching task in Point Cloud Registration	Yaojie Zhang et.al.	2412.04855v1	null
2024-12-04	AffordDP: Generalizable Diffusion Policy with Transferable Affordance	Shijie Wu et.al.	2412.03142v1	null
2024-12-04	QuadricsReg: Large-Scale Point Cloud Registration using Quadric Primitives	Ji Wu et.al.	2412.02998v1	null
2024-12-01	FlashSLAM: Accelerated RGB-D SLAM for Real-Time 3D Scene Reconstruction with Gaussian Splatting	Phu Pham et.al.	2412.00682v1	null
2024-11-27	XR-MBT: Multi-modal Full Body Tracking for XR through Self-Supervision with Learned Depth Point Cloud Registration	Denys Rozumnyi et.al.	2411.18377v1	null
2024-11-22	EADReg: Probabilistic Correspondence Generation with Efficient Autoregressive Diffusion Model for Outdoor Point Cloud Registration	Linrui Gong et.al.	2411.15271v1	null
2024-11-20	Automatic marker-free registration based on similar tetrahedras for single-tree point clouds	Jing Ren et.al.	2411.13069v1	null
2024-11-19	3D Reconstruction by Looking: Instantaneous Blind Spot Detector for Indoor SLAM through Mixed Reality	Hanbeom Chang et.al.	2411.12514v1	null
2024-11-16	Deep Loss Convexification for Learning Iterative Models	Ziming Zhang et.al.	2411.10649v1	null
2024-11-12	3D Focusing-and-Matching Network for Multi-Instance Point Cloud Registration	Liyuan Zhang et.al.	2411.07740v1	link
2024-11-04	Mining and Transferring Feature-Geometry Coherence for Unsupervised Point Cloud Registration	Kezheng Xiong et.al.	2411.01870v1	link
2024-10-30	UniRiT: Towards Few-Shot Non-Rigid Point Cloud Registration	Geng Li et.al.	2410.22909v1	null
2024-10-29	Micro-Structures Graph-Based Point Cloud Registration for Balancing Efficiency and Accuracy	Rongling Zhang et.al.	2410.21857v1	null
2024-10-29	Memory-Efficient Point Cloud Registration via Overlapping Region Sampling	Tomoyasu Shimada et.al.	2410.21753v1	null
2024-10-21	RANSAC Back to SOTA: A Two-stage Consensus Filtering for Real-time 3D Registration	Pengcheng Shi et.al.	2410.15682v1	link
2024-10-14	A Consistency-Aware Spot-Guided Transformer for Versatile and Hierarchical Point Cloud Registration	Renlang Huang et.al.	2410.10295v1	link
2024-10-14	Kinematic-ICP: Enhancing LiDAR Odometry with Kinematic Constraints for Wheeled Mobile Robots Moving on Planar Surfaces	Tiziano Guadagnino et.al.	2410.10277v1	null
2024-10-10	LiPO: LiDAR Inertial Odometry for ICP Comparison	Darwin Mick et.al.	2410.08097v1	null
2024-10-08	Equi-GSPR: Equivariant SE(3) Graph Network Model for Sparse Point Cloud Registration	Xueyang Kang et.al.	2410.05729v1	link
2024-10-07	Enhanced Multi-Robot SLAM System with Cross-Validation Matching and Exponential Threshold Keyframe Selection	Ang He et.al.	2410.05017v1	null
2024-10-03	LoGDesc: Local geometric features aggregation for robust point cloud registration	Karim Slimani et.al.	2410.02420v1	link
2024-10-01	GERA: Geometric Embedding for Efficient Point Registration Analysis	Geng Li et.al.	2410.00589v1	null
2024-10-01	TFCT-I2P: Three stream fusion network with color aware transformer for image-to-point cloud registration	Muyao Peng et.al.	2410.00360v1	link
2024-10-06	KISS-Matcher: Fast and Robust Point Cloud Registration Revisited	Hyungtae Lim et.al.	2409.15615v2	link
2024-09-23	MATCH POLICY: A Simple Pipeline from Point Cloud Registration to Manipulation Policies	Haojie Huang et.al.	2409.15517v1	null
2024-09-22	SynBench: A Synthetic Benchmark for Non-rigid 3D Point Cloud Registration	Sara Monji-Azad et.al.	2409.14474v1	null
2024-09-27	FracGM: A Fast Fractional Programming Technique for Geman-McClure Robust Estimator	Bang-Shien Chen et.al.	2409.13978v2	link
2024-09-17	Enhancing the Reliability of LiDAR Point Cloud Sampling: A Colorization and Super-Resolution Approach Based on LiDAR-Generated Images	Sier Ha et.al.	2409.11532v1	null
2024-09-14	Registration between Point Cloud Streams and Sequential Bounding Boxes via Gradient Descent	Xuesong Li et.al.	2409.09312v1	null
2024-09-11	Unsupervised Point Cloud Registration with Self-Distillation	Christian Löwens et.al.	2409.07558v1	link
2024-09-10	Mahalanobis k-NN: A Statistical Lens for Robust Point-Cloud Registrations	Tejas Anvekar et.al.	2409.06267v1	link
2024-09-09	From Words to Poses: Enhancing Novel Object Pose Estimation with Vision Language Models	Tessa Pulli et.al.	2409.05413v1	null
2024-09-08	Sight View Constraint for Robust Point Cloud Registration	Yaojie Zhang et.al.	2409.05065v1	null
2024-08-23	UMERegRobust - Universal Manifold Embedding Compatible Features for Robust Point Cloud Registration	Yuval Haitman et.al.	2408.12380v2	link
2024-08-21	Informed, Constrained, Aligned: A Field Analysis on Degeneracy-aware Point Cloud Registration in the Wild	Turcan Tuna et.al.	2408.11809v1	null
2024-08-20	LoopSplat: Loop Closure by Registering 3D Gaussian Splats	Liyuan Zhu et.al.	2408.10154v2	link
2024-08-05	CMR-Agent: Learning a Cross-Modal Agent for Iterative Image-to-Point Cloud Registration	Gongxin Yao et.al.	2408.02394v1	null
2024-08-05	MaFreeI2P: A Matching-Free Image-to-Point Cloud Registration Paradigm with Active Camera Pose Retrieval	Gongxin Yao et.al.	2408.02392v1	null
2024-07-29	Correspondence-Free SE(3) Point Cloud Registration in RKHS via Unsupervised Equivariant Learning	Ray Zhang et.al.	2407.20223v1	null
2024-07-24	Robust Point Cloud Registration in Robotic Inspection with Locally Consistent Gaussian Mixture Model	Lingjie Su et.al.	2407.17183v1	null
2024-07-23	SE3ET: SE(3)-Equivariant Transformer for Low-Overlap Point Cloud Registration	Chien Erh Lin et.al.	2407.16823v1	link
2024-07-19	PointRegGPT: Boosting 3D Point Cloud Registration using Generative Point-Cloud Pairs for Training	Suyi Chen et.al.	2407.14054v1	link
2024-07-19	GlobalPointer: Large-Scale Plane Adjustment with Bi-Convex Relaxation	Bangyan Liao et.al.	2407.13537v2	link
2024-07-22	Snail-Radar: A large-scale diverse dataset for the evaluation of 4D-radar-based SLAM systems	Jianzhu Huai et.al.	2407.11705v2	null
2024-07-14	PARE-Net: Position-Aware Rotation-Equivariant Networks for Robust Point Cloud Registration	Runzhao Yao et.al.	2407.10142v1	link
2024-07-13	ML-SemReg: Boosting Point Cloud Registration with Multi-level Semantic Consistency	Shaocheng Yan et.al.	2407.09862v1	link
2024-07-11	BiEquiFormer: Bi-Equivariant Representations for Global Point Cloud Registration	Stefanos Pertigkiozoglou et.al.	2407.08729v1	null
2024-07-10	Incremental Multiview Point Cloud Registration with Two-stage Candidate Retrieval	Shiqi Li et.al.	2407.07525v1	null
2024-07-08	SGOR: Outlier Removal by Leveraging Semantic and Geometric Information for Robust Point Cloud Registration	Guiyu Zhao et.al.	2407.06297v1	link
2024-07-08	GeoNLF: Geometry guided Pose-Free Neural LiDAR Fields	Weiyi Xue et.al.	2407.05597v1	null
2024-07-07	GaussReg: Fast 3D Registration with Gaussian Splatting	Jiahao Chang et.al.	2407.05254v1	null
2024-07-06	Incremental Multiview Point Cloud Registration	Xiaoya Cheng et.al.	2407.05021v1	link
2024-06-25	Point Tree Transformer for Point Cloud Registration	Meiling Wang et.al.	2406.17530v1	null
2024-06-17	Correspondence Free Multivector Cloud Registration using Conformal Geometric Algebra	Francisco Xavier Vasconcelos et.al.	2406.11732v1	link
2024-06-05	L-PR: Exploiting LiDAR Fiducial Marker for Unordered Low Overlap Multiview Point Cloud Registration	Yibo Liu et.al.	2406.03298v1	link
2024-05-25	Deep-PE: A Learning-Based Pose Evaluator for Point Cloud Registration	Junjie Gao et.al.	2405.16085v1	null
2024-05-26	NV-LIO: LiDAR-Inertial Odometry using Normal Vectors Towards Robust SLAM in Multifloor Environments	Dongha Chung et.al.	2405.12563v2	link
2024-05-13	RGBD-Glue: General Feature Combination for Robust RGB-D Point Cloud Registration	Congjia Chen et.al.	2405.07594v1	null
2024-05-10	Benchmarking Classical and Learning-Based Multibeam Point Cloud Registration	Li Ling et.al.	2405.06279v1	link
2024-05-09	Rotation Initialization and Stepwise Refinement for Universal LiDAR Calibration	Yifan Duan et.al.	2405.05589v1	null
2024-05-07	Speak the Same Language: Global LiDAR Registration on BIM Using Pose Hough Transform	Zhijian Qiao et.al.	2405.03969v1	null
2024-05-06	Deep Learning-based Point Cloud Registration for Augmented Reality-guided Surgery	Maximilian Weber et.al.	2405.03314v1	null
2024-04-27	FRAME: A Modular Framework for Autonomous Map-merging: Advancements in the Field	Nikolaos Stathoulopoulos et.al.	2404.18006v1	null
2024-04-22	PointDifformer: Robust Point Cloud Registration With Neural Diffusion and Transformer	Rui She et.al.	2404.14034v1	null
2024-04-22	A Comprehensive Survey and Taxonomy on Point Cloud Registration Based on Deep Learning	Yu-Xin Zhang et.al.	2404.13830v1	link
2024-04-09	Efficient and Robust Point Cloud Registration via Heuristics-guided Parameter Search	Tianyu Huang et.al.	2404.06155v1	link
2024-04-08	Rendering-Enhanced Automatic Image-to-Point Cloud Registration for Roadside Scenes	Yu Sheng et.al.	2404.05164v1	null
2024-04-06	Learning Instance-Aware Correspondences for Robust Multi-Instance Point Cloud Registration in Cluttered Scenes	Zhiyuan Yu et.al.	2404.04557v1	link
2024-04-05	A Ground Mobile Robot for Autonomous Terrestrial Laser Scanning-Based Field Phenotyping	Javier Rodriguez-Sanchez et.al.	2404.04404v1	null
2024-04-01	FPGA-Accelerated Correspondence-free Point Cloud Registration with PointNet Features	Keisuke Sugiura et.al.	2404.01237v1	null
2024-03-28	SG-PGM: Partial Graph Matching Network with Semantic Geometric Fusion for 3D Scene Graph Alignment and Its Downstream Tasks	Yaxu Xie et.al.	2403.19474v1	link
2024-03-26	Global Point Cloud Registration Network for Large Transformations	Hanz Cuevas-Velasquez et.al.	2403.18040v1	link
2024-03-28	Exploring Accurate 3D Phenotyping in Greenhouse through Neural Radiance Fields	Junhong Zhao et.al.	2403.15981v2	null
2024-03-15	VRHCF: Cross-Source Point Cloud Registration via Voxel Representation and Hierarchical Correspondence Filtering	Guiyu Zhao et.al.	2403.10085v1	link
2024-03-15	MEDPNet: Achieving High-Precision Adaptive Registration for Complex Die Castings	Yu Du et.al.	2403.09996v1	null
2024-03-15	CLOSURE: Fast Quantification of Pose Uncertainty Sets	Yihuai Gao et.al.	2403.09990v1	null
2024-03-13	FastMAC: Stochastic Spectral Sampling of Correspondence Graph	Yifei Zhang et.al.	2403.08770v1	link
2024-03-13	NeRF-Supervised Feature Point Detection and Description	Ali Youssef et.al.	2403.08156v1	link
2024-03-10	PSS-BA: LiDAR Bundle Adjustment with Progressive Spatial Smoothing	Jianping Li et.al.	2403.06124v1	null
2024-03-27	Extend Your Own Correspondences: Unsupervised Distant Point Cloud Registration by Progressive Distance Extension	Quan Liu et.al.	2403.03532v2	link
2024-03-15	RELEAD: Resilient Localization with Enhanced LiDAR Odometry in Adverse Environments	Zhiqiang Chen et.al.	2402.18934v2	null
2024-02-28	PCR-99: A Practical Method for Point Cloud Registration with 99% Outliers	Seong Hun Lee et.al.	2402.16598v2	link
2024-02-23	CLIPPER+: A Fast Maximal Clique Algorithm for Robust Global Registration	Kaveh Fathian et.al.	2402.15464v1	link
2024-02-11	CLIPPER: Robust Data Association without an Initial Guess	Parker C. Lusk et.al.	2402.07284v1	null
2024-02-08	Tightly Coupled Range Inertial Localization on a 3D Prior Map Based on Sliding Window Factor Graph Optimization	Kenji Koide et.al.	2402.05540v1	null
2024-01-16	Registration of algebraic varieties using Riemannian optimization	Florentin Goyens et.al.	2401.08562v1	link
2024-01-09	Iterative Feedback Network for Unsupervised Point Cloud Registration	Yifan Xie et.al.	2401.04357v1	link
2024-01-06	PosDiffNet: Positional Neural Diffusion for Point Cloud Registration in a Large Field of View with Perturbations	Rui She et.al.	2401.03167v1	null
2024-01-04	OptFlow: Fast Optimization-based Scene Flow Estimation without Supervision	Rahul Ahuja et.al.	2401.02550v1	null
2024-01-17	Diff-PCR: Diffusion-Based Correspondence Searching in Doubly Stochastic Matrix Space for Point Cloud Registration	Qianliang Wu et.al.	2401.00436v4	null
2023-12-22	On Partial Optimal Transport: Revising the Infeasibility of Sinkhorn and Efficient Gradient Methods	Anh Duc Nguyen et.al.	2312.13970v2	link
2023-12-20	D3Former: Jointly Learning Repeatable Dense Detectors and Feature-enhanced Descriptors via Saliency-guided Transformer	Junjie Gao et.al.	2312.12970v1	null
2023-12-14	SPEAL: Skeletal Prior Embedded Attention Learning for Cross-Source Point Cloud Registration	Kezheng Xiong et.al.	2312.08664v1	null
2023-12-11	PCRDiffusion: Diffusion Probabilistic Models for Point Cloud Registration	Yue Wu et.al.	2312.06063v1	null
2023-12-05	DiffusionPCR: Diffusion Models for Robust Multi-Step Point Cloud Registration	Zhi Chen et.al.	2312.03053v1	link
2023-12-08	Zero-Shot Point Cloud Registration	Weijie Wang et.al.	2312.03032v2	null
2023-12-05	A Dynamic Network for Efficient Point Cloud Registration	Yang Ai et.al.	2312.02877v1	null
2023-12-05	6D Assembly Pose Estimation by Point Cloud Registration for Robot Manipulation	K. Samarawickrama et.al.	2312.02593v1	link
2023-12-04	Rotation-Invariant Rapid TRISO-Fueled Pebble Identification Based on Feature Matching and Point Cloud Registration	Ming Fang et.al.	2312.02006v1	null
2023-12-27	E2PNet: Event to Point Cloud Registration with Spatio-Temporal Representation Learning	Xiuhong Lin et.al.	2311.18433v2	link
2023-11-15	Nothing Stands Still: A Spatiotemporal Benchmark on 3D Point Cloud Registration Under Large Geometric and Temporal Change	Tao Sun et.al.	2311.09346v1	null
2023-11-02	Transformation Decoupling Strategy based on Screw Theory for Deterministic Point Cloud Registration with Gravity Prior	Xinyi Li et.al.	2311.01432v1	link
2023-11-02	Cross-Modal Information-Guided Network using Contrastive Learning for Point Cloud Registration	Yifan Xie et.al.	2311.01202v1	link
2023-10-29	HDMNet: A Hierarchical Matching Network with Double Attention for Large-scale Outdoor LiDAR Point Cloud Registration	Weiyi Xue et.al.	2310.18874v1	null
2023-10-27	Do we need scan-matching in radar odometry?	Vladimír Kubelka et.al.	2310.18117v1	link
2023-10-26	SE(3) Diffusion Model-based Point Cloud Registration for Robust 6D Object Pose Estimation	Haobo Jiang et.al.	2310.17359v1	null
2023-10-18	DBDNet:Partial-to-Partial Point Cloud Registration with Dual Branches Decoupling	Shiqi Li et.al.	2310.11733v1	null
2023-10-15	OAAFormer: Robust and Efficient Point Cloud Registration Through Overlapping-Aware Attention in Transformer	Junjie Gao et.al.	2310.09817v1	null
2023-10-09	FeatSense -- A Feature-based Registration Algorithm with GPU-accelerated TSDF-Mapping Backend for NVIDIA Jetson Boards	Julian Gaal et.al.	2310.05766v1	link
2023-10-09	Colmap-PCD: An Open-source Tool for Fine Image-to-point cloud Registration	Chunge Bai et.al.	2310.05504v1	link
2023-10-06	Light-LOAM: A Lightweight LiDAR Odometry and Mapping based on Graph-Matching	Shiquan Yi et.al.	2310.04162v1	link
2023-10-05	FreeReg: Image-to-Point Cloud Registration Leveraging Pretrained Diffusion Models and Monocular Depth Estimators	Haiping Wang et.al.	2310.03420v1	link
2023-10-02	COIN-LIO: Complementary Intensity-Augmented LiDAR Inertial Odometry	Patrick Pfreundschuh et.al.	2310.01235v1	link
2023-09-27	Q-REG: End-to-End Trainable Point Cloud Registration with Surface Curvature	Shengze Jin et.al.	2309.16023v1	null
2023-09-27	Partial Transport for Point-Cloud Registration	Yikun Bai et.al.	2309.15787v1	null
2023-09-27	KDD-LOAM: Jointly Learned Keypoint Detector and Descriptors Assisted LiDAR Odometry and Mapping	Renlang Huang et.al.	2309.15394v1	null
2023-09-26	CoFiI2P: Coarse-to-Fine Correspondences for Image-to-Point Cloud Registration	Shuhao Kang et.al.	2309.14660v1	null
2023-09-20	AutoSynth: Learning to Generate 3D Training Data for Object Point Cloud Registration	Zheng Dang et.al.	2309.11170v1	null
2023-09-19	LiDAR-Generated Images Derived Keypoints Assisted Point Cloud Registration Scheme in Odometry Estimation	Haizhou Zhang et.al.	2309.10436v1	link
2023-09-17	Hamiltonian Dynamics Learning from Point Cloud Observations for Nonholonomic Mobile Robot Control	Abdullah Altawaitan et.al.	2309.09163v1	link
2023-09-16	FF-LOGO: Cross-Modality Point Cloud Registration with Feature Filtering and Local to Global Optimization	Nan Ma et.al.	2309.08966v1	null
2023-09-16	Outram: One-shot Global Localization via Triangulated Scene Graph and Global Outlier Pruning	Pengyu Yin et.al.	2309.08914v1	link
2023-09-15	A Ground Segmentation Method Based on Point Cloud Map for Unstructured Roads	Zixuan Li et.al.	2309.08164v1	null
2023-09-15	Fast and Accurate Deep Loop Closing and Relocalization for Reliable LiDAR SLAM	Chenghao Shi et.al.	2309.08086v1	null
2023-09-14	EP2P-Loc: End-to-End 3D Point to 2D Pixel Localization for Large-Scale Visual Localization	Minjung Kim et.al.	2309.07471v1	link
2023-09-12	SGFeat: Salient Geometric Feature for Point Cloud Registration	Qianliang Wu et.al.	2309.06207v1	null
2023-09-01	Point-TTA: Test-Time Adaptation for Point Cloud Registration Using Multitask Meta-Auxiliary Learning	Ahmed Hatem et.al.	2308.16481v2	null
2023-08-21	In-Rack Test Tube Pose Estimation Using RGB-D Data	Hao Chen et.al.	2308.10411v1	null
2023-08-18	DReg-NeRF: Deep Registration for Neural Radiance Fields	Yu Chen et.al.	2308.09386v1	link
2023-08-18	Overlap Bias Matching is Necessary for Point Cloud Registration	Pengcheng Shi et.al.	2308.09364v1	null
2023-08-10	Deep Semantic Graph Matching for Large-scale Outdoor Point Clouds Registration	Shaocong Liu et.al.	2308.05314v1	null
2023-08-09	PointMBF: A Multi-scale Bidirectional Fusion Network for Unsupervised RGB-D Point Cloud Registration	Mingzhi Yuan et.al.	2308.04782v1	link
2023-07-25	GeoTransformer: Fast and Robust Point Cloud Registration with Geometric Transformer	Zheng Qin et.al.	2308.03768v1	link
2023-07-26	One-Nearest Neighborhood Guides Inlier Estimation for Unsupervised Point Cloud Registration	Yongzhe Yuan et.al.	2307.14019v1	null
2023-07-22	Pyramid Semantic Graph-based Global Point Cloud Registration with Low Overlap	Zhijian Qiao et.al.	2307.12116v1	link
2023-09-12	ELiOT : End-to-end Lidar Odometry using Transformer Framework	Daegyu Lee et.al.	2307.11998v4	null
2023-08-08	Density-invariant Features for Distant Point Cloud Registration	Quan Liu et.al.	2307.09788v2	link
2023-07-18	SphereNet: Learning a Noise-Robust and General Descriptor for Point Cloud Registration	Guiyu Zhao et.al.	2307.09351v1	null
2023-07-14	CFI2P: Coarse-to-Fine Cross-Modal Correspondence Learning for Image-to-Point Cloud Registration	Gongxin Yao et.al.	2307.07142v1	null
2023-07-11	Exact Point Cloud Downsampling for Fast and Accurate Global Trajectory Optimization	Kenji Koide et.al.	2307.02948v2	link
2023-07-03	Direct Superpoints Matching for Fast and Robust Point Cloud Registration	Aniket Gupta et.al.	2307.01362v1	link
2023-07-04	A denoised Mean Teacher for domain adaptive point cloud registration	Alexander Bigalke et.al.	2306.14749v2	link
2023-06-20	End-to-end 2D-3D Registration between Image and LiDAR Point Cloud for Vehicle Localization	Guangming Wang et.al.	2306.11346v1	null
2023-06-14	ICET Online Accuracy Characterization for Geometry-Based Laser Scan Matching	Matthew McDermott et.al.	2306.08690v1	link
2023-06-12	Volume-DROID: A Real-Time Implementation of Volumetric Mapping with DROID-SLAM	Peter Stratton et.al.	2306.06850v1	link
2023-06-11	PWR-Align: Leveraging Part-Whole Relationships for Part-wise Rigid Point Cloud Registration in Mixed Reality Applications	Manorama Jha et.al.	2306.06717v1	null
2023-06-07	Robust-DefReg: A Robust Deformable Point Cloud Registration Method based on Graph Convolutional Neural Networks	Sara Monji-Azad et.al.	2306.04701v1	null
2023-05-23	Cross-source Point Cloud Registration: Challenges, Progress and Prospects	Xiaoshui Huang et.al.	2305.13570v1	null
2023-05-19	Efficient and Deterministic Search Strategy Based on Residual Projections for Point Cloud Registration	Xinyi Li et.al.	2305.11716v1	null
2023-05-18	3D Registration with Maximal Cliques	Xiyu Zhang et.al.	2305.10854v1	link
2023-05-05	HD2Reg: Hierarchical Descriptors and Detectors for Point Cloud Registration	Canhui Tang et.al.	2305.03487v1	link
2023-05-08	APR: Online Distant Point Cloud Registration Through Aggregated Point Cloud Reconstruction	Quan Liu et.al.	2305.02893v2	link
2023-04-27	RegHEC: Hand-Eye Calibration via Simultaneous Multi-view Point Clouds Registration of Arbitrary Object	Shiyu Xing et.al.	2304.14092v1	link
2023-04-26	Non-rigid Point Cloud Registration for Middle Ear Diagnostics with Endoscopic Optical Coherence Tomography	Peng Liu et.al.	2304.13618v1	link
2023-04-25	BO-ICP: Initialization of Iterative Closest Point Based on Bayesian Optimization	Harel Biggie et.al.	2304.13114v1	link
2023-04-18	SDFReg: Learning Signed Distance Functions for Point Cloud Registration	Leida Zhang et.al.	2304.08929v1	null
2023-04-12	SiLK -- Simple Learned Keypoints	Pierre Gleize et.al.	2304.06194v1	link
2023-04-11	TT-SDF2PC: Registration of Point Cloud and Compressed SDF Directly in the Memory-Efficient Tensor Train Domain	Alexey I. Boyko et.al.	2304.05342v1	null
2023-04-10	HybridFusion: LiDAR and Vision Cross-Source Point Cloud Fusion	Yu Wang et.al.	2304.04508v1	null
2023-04-09	Self-Supervised Learning of Object Segmentation from Unlabeled RGB-D Videos	Shiyang Lu et.al.	2304.04325v1	null
2023-04-09	DSMNet: Deep High-precision 3D Surface Modeling from Sparse Point Cloud Frames	Changjie Qiu et.al.	2304.04200v1	null
2023-04-02	Robust Multiview Point Cloud Registration with Reliable Pose Graph Initialization and History Reweighting	Haiping Wang et.al.	2304.00467v1	link
2023-03-31	kNN-Res: Residual Neural Network with kNN-Graph coherence for point cloud registration	Muhammad S. Battikh et.al.	2304.00050v1	link
2023-03-31	RDMNet: Reliable Dense Matching Based Point Cloud Registration for Autonomous Driving	Chenghao Shi et.al.	2303.18084v1	null
2023-04-23	HybridPoint: Point Cloud Registration Based on Hybrid Point Sampling and Matching	Yiheng Li et.al.	2303.16526v2	link
2023-03-27	Learnable Graph Matching: A Practical Paradigm for Data Association	Jiawei He et.al.	2303.15414v1	link
2023-03-23	Unsupervised Deep Probabilistic Approach for Partial Point Cloud Registration	Guofeng Mei et.al.	2303.13290v1	link
2023-03-22	RegFormer: An Efficient Projection-Aware Transformer Network for Large-Scale Point Cloud Registration	Jiuming Liu et.al.	2303.12384v1	link
2023-03-17	Deep Graph-based Spatial Consistency for Robust Non-rigid Point Cloud Registration	Zheng Qin et.al.	2303.09950v1	link
2023-03-14	RoCNet: 3D Robust Registration of Point-Clouds using Deep Learning	Karim Slimani et.al.	2303.07963v1	null
2023-03-07	GMCR: Graph-based Maximum Consensus Estimation for Point Cloud Registration	Michael Gentner et.al.	2303.04032v1	null
2023-03-02	Neural Intrinsic Embedding for Non-rigid Point Cloud Matching	Puhua Jiang et.al.	2303.01038v1	null
2023-03-14	A Unified BEV Model for Joint Learning of 3D Local Features and Overlap Estimation	Lin Li et.al.	2302.14511v2	link
2023-02-28	PCR-CG: Point Cloud Registration via Deep Color and Geometry	Yu Zhang et.al.	2302.14418v1	link
2023-02-28	Efficient Implicit Neural Reconstruction Using LiDAR	Dongyu Yan et.al.	2302.14363v1	link
2023-02-25	Accurate Gaussian Process Distance Fields with applications to Echolocation and Mapping	Cedric Le Gentil et.al.	2302.13005v1	null
2023-02-14	Point Cloud Registration for LiDAR and Photogrammetric Data: a Critical Synthesis and Performance Analysis on Classic and Deep Learning Algorithms	Ningli Xu et.al.	2302.07184v1	link

Point Cloud Segmentation

Publish Date	Title	Authors	PDF	Code
2025-07-09	PointVDP: Learning View-Dependent Projection by Fireworks Rays for 3D Point Cloud Segmentation	Yang Chen et.al.	2507.06618v1	null
2025-07-09	Ambiguity-aware Point Cloud Segmentation by Adaptive Margin Contrastive Learning	Yang Chen et.al.	2507.06592v1	null
2025-07-07	All in One: Visual-Description-Guided Unified Point Cloud Segmentation	Zongyan Han et.al.	2507.05211v1	null
2025-06-29	High-quality Pseudo-labeling for Point Cloud Segmentation with Scene-level Annotation	Lunhao Duan et.al.	2506.23227v1	null
2025-06-26	TSDASeg: A Two-Stage Model with Direct Alignment for Interactive Point Cloud Segmentation	Chade Li et.al.	2506.20991v1	null
2025-06-16	SRKD: Towards Efficient 3D Point Cloud Segmentation via Structure- and Relation-aware Knowledge Distillation	Yuqi Li et.al.	2506.17290v1	null
2025-06-11	Enhancing Human-Robot Collaboration: A Sim2Real Domain Adaptation Algorithm for Point Cloud Segmentation in Industrial Environments	Fatemeh Mohammadi Amin et.al.	2506.09552v1	null
2025-06-05	Point Cloud Segmentation of Agricultural Vehicles using 3D Gaussian Splatting	Alfred T. Christiansen et.al.	2506.05009v1	null
2025-06-05	OpenMaskDINO3D : Reasoning 3D Segmentation via Large Language Model	Kunshen Zhang et.al.	2506.04837v1	link
2025-05-25	Staircase Recognition and Location Based on Polarization Vision	Weifeng Kong et.al.	2505.19026v1	null
2025-05-23	Generative Data Augmentation for Object Point Cloud Segmentation	Dekai Zhu et.al.	2505.17783v1	null
2025-05-15	APCoTTA: Continual Test-Time Adaptation for Semantic Segmentation of Airborne LiDAR Point Clouds	Yuan Gao et.al.	2505.09971v1	link
2025-04-26	WLTCL: Wide Field-of-View 3-D LiDAR Truck Compartment Automatic Localization System	Guodong Sun et.al.	2504.18870v1	null
2025-04-16	3D-PointZshotS: Geometry-Aware 3D Point Cloud Zero-Shot Semantic Segmentation Narrowing the Visual-Semantic Gap	Minmin Yang et.al.	2504.12442v1	link
2025-04-09	UAV Position Estimation using a LiDAR-based 3D Object Detection Method	Uthman Olawoye et.al.	2504.07028v1	null
2025-04-08	Turin3D: Evaluating Adaptation Strategies under Label Scarcity in Urban LiDAR Segmentation with Semi-Supervised Techniques	Luca Barco et.al.	2504.05882v1	null
2025-04-12	Robust Unsupervised Domain Adaptation for 3D Point Cloud Segmentation Under Source Adversarial Attacks	Haosheng Li et.al.	2504.01659v3	null
2025-04-12	ProtoGuard-guided PROPEL: Class-Aware Prototype Enhancement and Progressive Labeling for Incremental 3D Point Cloud Segmentation	Haosheng Li et.al.	2504.01648v2	null
2025-03-24	DINO in the Room: Leveraging 2D Foundation Models for 3D Segmentation	Karim Abou Zeid et.al.	2503.18944v1	link
2025-03-21	GeoT: Geometry-guided Instance-dependent Transition Matrix for Semi-supervised Tooth Point Cloud Segmentation	Weihao Yu et.al.	2503.16976v1	link
2025-05-20	Generalized Few-shot 3D Point Cloud Segmentation with Vision-Language Model	Zhaochong An et.al.	2503.16282v2	link
2025-03-19	Depth-Aware Range Image-Based Model for Point Cloud Segmentation	Bike Chen et.al.	2503.14955v1	null
2025-03-18	Deep Unsupervised Segmentation of Log Point Clouds	Fedor Zolotarev et.al.	2503.14244v1	null
2025-03-07	Joint 3D Point Cloud Segmentation using Real-Sim Loop: From Panels to Trees and Branches	Tian Qiu et.al.	2503.05630v1	null
2025-03-05	Label-Efficient LiDAR Semantic Segmentation with 2D-3D Vision Transformer Adapters	Julia Hindel et.al.	2503.03299v1	null
2025-03-01	Explainable LiDAR 3D Point Cloud Segmentation and Clustering for Detecting Airplane-Generated Wind Turbulence	Zhan Qu et.al.	2503.00518v1	null
2025-02-26	PFSD: A Multi-Modal Pedestrian-Focus Scene Dataset for Rich Tasks in Semi-Structured Environments	Yueting Liu et.al.	2502.15342v3	link
2025-02-18	An Experimental Study of SOTA LiDAR Segmentation Models	Bike Chen et.al.	2502.12860v1	null
2025-01-30	Ground Awareness in Deep Learning for Large Outdoor Point Cloud Segmentation	Kevin Qiu et.al.	2501.18246v1	null
2025-01-29	3DSES: an indoor Lidar point cloud segmentation dataset with real and pseudo-labels from a 3D model	Maxime Mérizette et.al.	2501.17534v1	null
2025-01-24	LiDAR-Based Vehicle Detection and Tracking for Autonomous Racing	Marcello Cellina et.al.	2501.14502v1	null
2025-01-06	The 2nd Place Solution from the 3D Semantic Segmentation Track in the 2024 Waymo Open Dataset Challenge	Qing Wu et.al.	2501.05472v1	null
2025-01-03	MRG: A Multi-Robot Manufacturing Digital Scene Generation Method Using Multi-Instance Point Cloud Registration	Songjie Han et.al.	2501.02041v1	null
2025-01-18	Impact of color and mixing proportion of synthetic point clouds on semantic segmentation	Shaojie Zhou et.al.	2412.19145v2	link
2024-12-02	The Bare Necessities: Designing Simple, Effective Open-Vocabulary Scene Graphs	Christina Kassab et.al.	2412.01539v1	null
2024-11-30	Density-aware Global-Local Attention Network for Point Cloud Segmentation	Chade Li et.al.	2412.00489v1	null
2024-11-28	Textured As-Is BIM via GIS-informed Point Cloud Segmentation	Mohamed S. H. Alabassy et.al.	2411.18898v1	null
2024-11-27	Towards Cross-device and Training-free Robotic Grasping in 3D Open World	Weiguang Zhao et.al.	2411.18133v1	null
2024-11-20	BelHouse3D: A Benchmark Dataset for Assessing Occlusion Robustness in 3D Point Cloud Semantic Segmentation	Umamaheswaran Raman Kumar et.al.	2411.13251v1	null
2024-11-13	Biomass phenotyping of oilseed rape through UAV multi-view oblique imaging with 3DGS and SAM model	Yutao Shen et.al.	2411.08453v1	null
2024-11-13	Multiscale Graph Construction Using Non-local Cluster Features	Reina Kaneko et.al.	2411.08371v1	null
2024-10-30	Automated Image-Based Identification and Consistent Classification of Fire Patterns with Quantitative Shape Analysis and Spatial Location Identification	Pengkun Liu et.al.	2410.23105v1	null
2024-11-03	Multimodality Helps Few-Shot 3D Point Cloud Semantic Segmentation	Zhaochong An et.al.	2410.22489v2	link
2024-10-28	Exploring contextual modeling with linear complexity for point cloud segmentation	Yong Xien Chng et.al.	2410.21211v1	null
2024-10-14	Generalizable Humanoid Manipulation with Improved 3D Diffusion Policies	Yanjie Ze et.al.	2410.10803v1	link
2024-10-09	Evaluating the Impact of Point Cloud Colorization on Semantic Segmentation Accuracy	Qinfeng Zhu et.al.	2410.06725v1	null
2024-09-24	Underground Mapping and Localization Based on Ground-Penetrating Radar	Jinchang Zhang et.al.	2409.16446v1	null
2024-09-22	Lidar Panoptic Segmentation in an Open World	Anirudh S Chakravarthy et.al.	2409.14273v1	link
2024-09-03	When 3D Partial Points Meets SAM: Tooth Point Cloud Segmentation with Sparse Labels	Yifan Liu et.al.	2409.01691v1	null
2024-09-03	Efficiently Expanding Receptive Fields: Local Split Attention and Parallel Aggregation for Enhanced Large-scale Point Cloud Semantic Segmentation	Haodong Wang et.al.	2409.01662v1	null
2024-08-29	Towards Modality-agnostic Label-efficient Segmentation with Entropy-Regularized Distribution Alignment	Liyao Tang et.al.	2408.16520v1	link
2024-08-21	GSTran: Joint Geometric and Semantic Coherence for Point Cloud Segmentation	Abiao Li et.al.	2408.11558v1	link
2024-08-02	Trainable Pointwise Decoder Module for Point Cloud Segmentation	Bike Chen et.al.	2408.01548v1	null
2024-07-31	Fine-grained Metrics for Point Cloud Semantic Segmentation	Zhuheng Lu et.al.	2407.21289v1	null
2024-07-19	Scale Disparity of Instances in Interactive Point Cloud Segmentation	Chenrui Han et.al.	2407.14009v1	null
2024-07-18	SegPoint: Segment Any Point Cloud via Large Language Model	Shuting He et.al.	2407.13761v1	null
2024-07-17	Dual-level Adaptive Self-Labeling for Novel Class Discovery in Point Cloud Segmentation	Ruijie Xu et.al.	2407.12489v1	link
2024-07-17	HGL: Hierarchical Geometry Learning for Test-time Adaptation in 3D Point Cloud Segmentation	Tianpei Zou et.al.	2407.12387v1	link
2024-07-17	Serialized Point Mamba: A Serialized Point Cloud Mamba Segmentation Model	Tao Wang et.al.	2407.12319v1	null
2024-07-12	Uplifting Range-View-based 3D Semantic Segmentation in Real-Time with Multi-Sensor Fusion	Shiqi Tan et.al.	2407.09697v1	null
2024-07-01	fVDB: A Deep-Learning Framework for Sparse, Large-Scale, and High-Performance Spatial Intelligence	Francis Williams et.al.	2407.01781v1	null
2024-06-25	Mamba24/8D: Enhancing Global Interaction in Point Clouds via State Space Model	Zhuoyuan Li et.al.	2406.17442v1	null
2024-08-04	Twin Deformable Point Convolutions for Point Cloud Semantic Segmentation in Remote Sensing Scenes	Yong-Qiang Mao et.al.	2405.19735v2	null
2024-05-24	3D Unsupervised Learning by Distilling 2D Open-Vocabulary Segmentation Models for Autonomous Driving	Boyi Sun et.al.	2405.15286v1	link
2024-05-25	Filling Missing Values Matters for Range Image-Based Point Cloud Segmentation	Bike Chen et.al.	2405.10175v2	null
2024-04-16	ECLAIR: A High-Fidelity Aerial LiDAR Dataset for Semantic Segmentation	Iaroslav Melekhov et.al.	2404.10699v1	link
2024-04-04	OpenNeRF: Open Set 3D Neural Scene Segmentation with Pixel-Wise Features and Rendered Novel Views	Francis Engelmann et.al.	2404.03650v1	null
2024-03-28	RiEMann: Near Real-Time SE(3)-Equivariant Robot Manipulation without Point Cloud Segmentation	Chongkai Gao et.al.	2403.19460v1	null
2024-05-30	CurbNet: Curb Detection Framework Based on LiDAR Point Cloud Segmentation	Guoyang Zhao et.al.	2403.16794v2	link
2024-03-18	EffiPerception: an Efficient Framework for Various Perception Tasks	Xinhao Xiang et.al.	2403.12317v1	null
2024-03-11	3DRef: 3D Dataset and Benchmark for Reflection Detection in RGB and Lidar Data	Xiting Zhao et.al.	2403.06538v1	null
2024-03-11	Refining Segmentation On-the-Fly: An Interactive Framework for Point Cloud Semantic Segmentation	Peng Zhang et.al.	2403.06401v1	null
2024-03-03	Region-Transformer: Self-Attention Region Based Class-Agnostic Point Cloud Segmentation	Dipesh Gyawali et.al.	2403.01407v1	null
2024-01-29	Dynamic Prototype Adaptation with Distillation for Few-shot Point Cloud Segmentation	Jie Liu et.al.	2401.16051v1	link
2024-01-19	Symbol as Points: Panoptic Symbol Spotting via Point-based Representation	Wenlong Liu et.al.	2401.10556v1	link
2023-12-29	Multi-modality Affinity Inference for Weakly Supervised 3D Semantic Segmentation	Xiawei Li et.al.	2312.16578v2	link
2023-12-19	Point Cloud Segmentation Using Transfer Learning with RandLA-Net: A Case Study on Urban Areas	Alperen Enes Bayar et.al.	2312.11880v1	null
2023-12-15	T-MAE: Temporal Masked Autoencoders for Point Cloud Representation Learning	Weijie Wei et.al.	2312.10217v1	link
2023-12-14	FAPP: Fast and Adaptive Perception and Planning for UAVs in Dynamic Cluttered Environments	Minghao Lu et.al.	2312.08743v1	null
2023-12-12	Transferring CLIP's Knowledge into Zero-Shot Point Cloud Semantic Segmentation	Yuanbin Wang et.al.	2312.07221v1	null
2023-12-11	Densify Your Labels: Unsupervised Clustering with Bipartite Matching for Weakly Supervised Point Cloud Segmentation	Shaobo Xia et.al.	2312.06799v1	null
2024-01-15	Provable Adversarial Robustness for Group Equivariant Tasks: Graphs, Point Clouds, Molecules, and More	Jan Schuchardt et.al.	2312.02708v2	null
2023-11-24	OneFormer3D: One Transformer for Unified Point Cloud Segmentation	Maxim Kolodiazhnyi et.al.	2311.14405v1	null
2023-11-18	DatasetNeRF: Efficient 3D-aware Data Factory with Generative Radiance Fields	Yu Chi et.al.	2311.12063v1	link
2023-11-10	U3DS$^3$: Unsupervised 3D Semantic Scene Segmentation	Jiaxu Liu et.al.	2311.06018v1	null
2023-11-06	Leveraging Large-Scale Pretrained Vision Foundation Models for Label-Efficient 3D Point Cloud Segmentation	Shichao Dong et.al.	2311.01989v2	null
2023-10-19	2D-3D Interlaced Transformer for Point Cloud Segmentation with Scene-Level Supervision	Cheng-Kun Yang et.al.	2310.12817v1	null
2023-10-11	PointHR: Exploring High-Resolution Architectures for 3D Point Cloud Segmentation	Haibo Qiu et.al.	2310.07743v1	link
2023-09-26	Addressing Data Misalignment in Image-LiDAR Fusion on Point Cloud Segmentation	Wei Jong Yang et.al.	2309.14932v1	null
2023-09-20	Towards Robust Few-shot Point Cloud Semantic Segmentation	Yating Xu et.al.	2309.11228v1	link
2023-09-20	Generalized Few-Shot Point Cloud Segmentation Via Geometric Words	Yating Xu et.al.	2309.11222v1	link
2023-08-29	Compositional Semantic Mix for Domain Adaptation in Point Cloud Segmentation	Cristiano Saltori et.al.	2308.14619v2	link
2023-08-22	Hierarchical Point-based Active Learning for Semi-supervised Point Cloud Semantic Segmentation	Zongyi Xu et.al.	2308.11166v1	link
2023-08-14	Autonomous Point Cloud Segmentation for Power Lines Inspection in Smart Grid	Alexander Kyuroson et.al.	2308.07283v1	null
2023-08-08	Boosting Few-shot 3D Point Cloud Segmentation via Query-Guided Enhancement	Zhenhua Ning et.al.	2308.03177v2	link
2023-07-31	pCTFusion: Point Convolution-Transformer Fusion with Semantic Aware Loss for Outdoor LiDAR Point Cloud Segmentation	Abhishek Kuriyal et.al.	2307.14777v2	link
2023-07-27	Clustering based Point Cloud Representation Learning for 3D Analysis	Tuo Feng et.al.	2307.14605v1	link
2023-07-20	See More and Know More: Zero-shot Point Cloud Segmentation via Multi-modal Visual Data	Yuhang Lu et.al.	2307.10782v1	null
2023-07-14	Achelous: A Fast Unified Water-surface Panoptic Perception Framework based on Fusion of Monocular Camera and 4D mmWave Radar	Runwei Guan et.al.	2307.07102v1	link
2023-07-08	BPNet: Bézier Primitive Segmentation on 3D Point Clouds	Rao Fu et.al.	2307.04013v1	link
2023-06-28	Point2Point: A Framework for Efficient Deep Learning on Hilbert sorted Point Clouds with applications in Spatio-Temporal Occupancy Prediction	Athrva Atul Pandhare et.al.	2306.16306v1	null
2023-05-30	Dynamic Clustering Transformer Network for Point Cloud Segmentation	Dening Lu et.al.	2306.08073v1	null
2023-05-23	Prototype Adaption and Projection for Few- and Zero-shot 3D Point Cloud Semantic Segmentation	Shuting He et.al.	2305.14335v1	link
2023-05-22	Contrastive Predictive Autoencoders for Dynamic Point Cloud Self-Supervised Learning	Xiaoxiao Sheng et.al.	2305.12959v1	null
2023-05-17	Tinto: Multisensor Benchmark for 3D Hyperspectral Point Cloud Segmentation in the Geosciences	Ahmed J. Afifi et.al.	2305.09928v1	null
2023-05-08	OctFormer: Octree-based Transformers for 3D Point Clouds	Peng-Shuai Wang et.al.	2305.03045v2	link
2023-05-22	Urban GeoBIM construction by integrating semantic LiDAR point clouds with as-designed BIM models	Jie Shao et.al.	2304.11719v2	null
2023-04-22	Knowledge Distillation from 3D to Bird's-Eye-View for LiDAR Semantic Segmentation	Feng Jiang et.al.	2304.11393v1	link
2023-06-02	Transformer-Based Visual Segmentation: A Survey	Xiangtai Li et.al.	2304.09854v2	link
2023-04-11	Feature-assisted interactive geometry reconstruction in 3D point clouds using incremental region growing	Attila Szabo et.al.	2304.05109v1	null


Zero-shot

Publish Date	Title	Authors	PDF	Code
2025-07-23	Attention (as Discrete-Time Markov) Chains	Yotam Erel et.al.	2507.17657v1	null
2025-07-23	Who Attacks, and Why? Using LLMs to Identify Negative Campaigning in 18M Tweets across 19 Countries	Victor Hartman et.al.	2507.17636v1	null
2025-07-23	Decoding Consumer Preferences Using Attention-Based Language Models	Joshua Foster et.al.	2507.17564v1	null
2025-07-23	Constructing Ophthalmic MLLM for Positioning-diagnosis Collaboration Through Clinical Cognitive Chain Reasoning	Xinyao Liu et.al.	2507.17539v1	null
2025-07-23	Probing Vision-Language Understanding through the Visual Entailment Task: promises and pitfalls	Elena Pitta et.al.	2507.17467v1	null
2025-07-23	Language-Conditioned Open-Vocabulary Mobile Manipulation with Pretrained Models	Shen Tan et.al.	2507.17379v1	null
2025-07-23	A Conditional Probability Framework for Compositional Zero-shot Learning	Peng Wu et.al.	2507.17377v1	null
2025-07-23	Application of Whisper in Clinical Practice: the Post-Stroke Speech Assessment during a Naming Task	Milena Davudova et.al.	2507.17326v1	null
2025-07-23	Exploring the Potential of LLMs for Serendipity Evaluation in Recommender Systems	Li Kang et.al.	2507.17290v1	null
2025-07-23	PolarAnything: Diffusion-based Polarimetric Image Synthesis	Kailong Zhang et.al.	2507.17268v1	null
2025-07-22	Task-Specific Zero-shot Quantization-Aware Training for Object Detection	Changhao Li et.al.	2507.16782v1	null
2025-07-22	Never Come Up Empty: Adaptive HyDE Retrieval for Improving LLM Developer Support	Fangjian Lei et.al.	2507.16754v1	null
2025-07-22	CMP: A Composable Meta Prompt for SAM-Based Cross-Domain Few-Shot Segmentation	Shuai Chen et.al.	2507.16753v1	null
2025-07-22	SALM: Spatial Audio Language Model with Structured Embeddings for Understanding and Editing	Jinbo Hu et.al.	2507.16724v1	null
2025-07-22	Are Foundation Models All You Need for Zero-shot Face Presentation Attack Detection?	Lazaro Janier Gonzalez-Sole et.al.	2507.16393v1	null
2025-07-22	Detect Any Sound: Open-Vocabulary Sound Event Detection with Multi-Modal Queries	Pengfei Cai et.al.	2507.16343v1	null
2025-07-22	Quality Text, Robust Vision: The Role of Language in Enhancing Visual Robustness of Vision-Language Models	Futa Waseda et.al.	2507.16257v1	null
2025-07-22	LMM4Edit: Benchmarking and Evaluating Multimodal Image Editing with LMMs	Zitong Xu et.al.	2507.16193v1	null
2025-07-22	Characterizing Online Activities Contributing to Suicide Mortality among Youth	Aparna Ananthasubramaniam et.al.	2507.16185v1	null
2025-07-22	PUSA V1.0: Surpassing Wan-I2V with $500 Training Cost by Vectorized Timestep Adaptation	Yaofang Liu et.al.	2507.16116v1	null
2025-07-21	VeriRAG: A Retrieval-Augmented Framework for Automated RTL Testability Repair	Haomin Qi et.al.	2507.15664v1	null
2025-07-21	Smart Eyes for Silent Threats: VLMs and In-Context Learning for THz Imaging	Nicolas Poggi et.al.	2507.15576v1	null
2025-07-21	HOLa: Zero-Shot HOI Detection with Low-Rank Decomposed VLM Feature Adaptation	Qinqian Lei et.al.	2507.15542v1	null
2025-07-21	One Last Attention for Your Vision-Language Model	Liang Chen et.al.	2507.15480v1	null
2025-07-21	PDEformer-2: A Versatile Foundation Model for Two-Dimensional Partial Differential Equations	Zhanhong Ye et.al.	2507.15409v1	null
2025-07-21	Beyond Easy Wins: A Text Hardness-Aware Benchmark for LLM-generated Text Detection	Navid Ayoobi et.al.	2507.15286v1	null
2025-07-21	A2TTS: TTS for Low Resource Indian Languages	Ayush Singh Bhadoriya et.al.	2507.15272v1	null
2025-07-21	FreeCus: Free Lunch Subject-driven Customization in Diffusion Transformers	Yanbing Zhang et.al.	2507.15249v1	null
2025-07-20	Deep Generative Models in Condition and Structural Health Monitoring: Opportunities, Limitations and Future Outlook	Xin Yang et.al.	2507.15026v1	null
2025-07-20	DMOSpeech 2: Reinforcement Learning for Duration Prediction in Metric-Optimized Speech Synthesis	Yinghao Aaron Li et.al.	2507.14988v1	null
2025-07-18	Blind Super Resolution with Reference Images and Implicit Degradation Representation	Huu-Phu Do et.al.	2507.13915v1	null
2025-07-18	SPARQL Query Generation with LLMs: Measuring the Impact of Training Data Memorization and Knowledge Injection	Aleksandr Gashkov et.al.	2507.13859v1	null
2025-07-18	Causal Knowledge Transfer for Multi-Agent Reinforcement Learning in Dynamic Environments	Kathrin Korte et.al.	2507.13846v1	null
2025-07-17	Learning Pluralistic User Preferences through Reinforcement Learning Fine-tuned Summaries	Hyunji Nam et.al.	2507.13579v1	null
2025-07-17	LoRA-Loop: Closing the Synthetic Replay Cycle for Continual VLM Learning	Kaihong Wang et.al.	2507.13568v1	null
2025-07-17	Revisiting Prompt Engineering: A Comprehensive Evaluation for LLM-based Personalized Recommendation	Genki Kusano et.al.	2507.13525v1	null
2025-07-17	Improving Out-of-distribution Human Activity Recognition via IMU-Video Cross-modal Representation Learning	Seyyed Saeid Cheshmi et.al.	2507.13482v1	null
2025-07-17	"PhyWorldBench": A Comprehensive Evaluation of Physical Realism in Text-to-Video Models	Jing Gu et.al.	2507.13428v1	null
2025-07-17	Comparing Apples to Oranges: A Dataset & Analysis of LLM Humour Understanding from Traditional Puns to Topical Jokes	Tyler Loakman et.al.	2507.13335v1	null
2025-07-17	Detecting LLM-generated Code with Subtle Modification by Adversarial Training	Xin Yin et.al.	2507.13123v1	null
2025-07-17	GLAD: Generalizable Tuning for Vision-Language Models	Yuqi Peng et.al.	2507.13089v1	null
2025-07-17	DEMONSTRATE: Zero-shot Language to Robotic Control via Multi-task Demonstration Learning	Rahel Rickenbach et.al.	2507.12855v1	null
2025-07-17	MCoT-RE: Multi-Faceted Chain-of-Thought and Re-Ranking for Training-Free Zero-Shot Composed Image Retrieval	Jeong-Woo Park et.al.	2507.12819v1	null
2025-07-17	Think-Before-Draw: Decomposing Emotion Semantics & Fine-Grained Controllable Expressive Talking Head Generation	Hanlei Shi et.al.	2507.12761v1	null
2025-07-17	osmAG-LLM: Zero-Shot Open-Vocabulary Object Navigation via Semantic Maps and Large Language Models Reasoning	Fujing Xie et.al.	2507.12753v1	null
2025-07-16	Reconstruct, Inpaint, Finetune: Dynamic Novel-view Synthesis from Monocular Videos	Kaihua Chen et.al.	2507.12646v1	null
2025-07-16	Funnel-HOI: Top-Down Perception for Zero-Shot HOI Detection	Sandipan Sarma et.al.	2507.12628v1	null
2025-07-16	Generate to Ground: Multimodal Text Conditioning Boosts Phrase Grounding in Medical Vision-Language Models	Felix Nützel et.al.	2507.12236v1	null
2025-07-16	SAMST: A Transformer framework based on SAM pseudo label filtering for remote sensing semi-supervised semantic segmentation	Jun Yin et.al.	2507.11994v1	null
2025-07-16	Style Composition within Distinct LoRA modules for Traditional Art	Jaehyun Lee et.al.	2507.11986v1	null
2025-07-16	GS-Bias: Global-Spatial Bias Learner for Single-Image Test-Time Adaptation of Vision-Language Models	Zhaohong Huang et.al.	2507.11969v1	null
2025-07-16	Imbalanced Regression Pipeline Recommendation	Juscimara G. Avelino et.al.	2507.11901v1	null
2025-07-16	SynCoGen: Synthesizable 3D Molecule Generation via Joint Reaction and Coordinate Modeling	Andrei Rekesh et.al.	2507.11818v1	null
2025-07-15	AI Wizards at CheckThat! 2025: Enhancing Transformer-Based Embeddings with Sentiment for Subjectivity Detection in News Articles	Matteo Fasulo et.al.	2507.11764v1	null
2025-07-15	Beyond Task-Specific Reasoning: A Unified Conditional Generative Framework for Abstract Visual Reasoning	Fan Shi et.al.	2507.11761v1	null
2025-07-15	Torsional-GFN: a conditional conformation generator for small molecules	Alexandra Volokhova et.al.	2507.11759v1	null
2025-07-15	CRABS: A syntactic-semantic pincer strategy for bounding LLM interpretation of Python notebooks	Meng Li et.al.	2507.11742v1	null
2025-07-15	Towards Depth Foundation Model: Recent Trends in Vision-Based Depth Estimation	Zhen Xu et.al.	2507.11540v1	null
2025-07-15	HUG-VAS: A Hierarchical NURBS-Based Generative Model for Aortic Geometry Synthesis and Controllable Editing	Pan Du et.al.	2507.11474v1	null
2025-07-15	Foundation Models for Logistics: Toward Certifiable, Conversational Planning Interfaces	Yunhao Yang et.al.	2507.11352v1	null
2025-07-15	How Far Have Medical Vision-Language Models Come? A Comprehensive Benchmarking Study	Che Liu et.al.	2507.11200v1	null
2025-07-15	MSA at ImageCLEF 2025 Multimodal Reasoning: Multilingual Multimodal Reasoning With Ensemble Vision Language Models	Seif Ahmed et.al.	2507.11114v1	null
2025-07-15	Bridge Feature Matching and Cross-Modal Alignment with Mutual-filtering for Zero-shot Anomaly Detection	Yuhu Bai et.al.	2507.11003v1	null
2025-07-15	Learning to Tune Like an Expert: Interpretable and Scene-Aware Navigation via MLLM Reasoning and CVAE-Based Adaptation	Yanbo Wang et.al.	2507.11001v1	null
2025-07-15	MalCodeAI: Autonomous Vulnerability Detection and Remediation via Language Agnostic Code Reasoning	Jugal Gajjar et.al.	2507.10898v1	null
2025-07-14	LLM-Guided Agentic Object Detection for Open-World Understanding	Furkan Mumcu et.al.	2507.10844v1	null
2025-07-14	EmbRACE-3K: Embodied Reasoning and Action in Complex Environments	Mingxian Lin et.al.	2507.10548v1	null
2025-07-14	Graph World Model	Tao Feng et.al.	2507.10539v1	null
2025-07-14	Fine-Grained Zero-Shot Object Detection	Hongxu Ma et.al.	2507.10358v1	null
2025-07-14	Prompt Informed Reinforcement Learning for Visual Coverage Path Planning	Venkat Margapuri et.al.	2507.10284v1	null
2025-07-14	Conditional Chemical Language Models are Versatile Tools in Drug Discovery	Lu Zhu et.al.	2507.10273v1	null
2025-07-14	Natural Language-based Assessment of L2 Oral Proficiency using LLMs	Stefano Bannò et.al.	2507.10200v1	null
2025-07-14	DEARLi: Decoupled Enhancement of Recognition and Localization for Semi-supervised Panoptic Segmentation	Ivan Martinović et.al.	2507.10118v1	null
2025-07-14	FIX-CLIP: Dual-Branch Hierarchical Contrastive Learning via Synthetic Captions for Better Understanding of Long Text	Bingchao Wang et.al.	2507.10095v1	null
2025-07-14	MoVieS: Motion-Aware 4D Dynamic View Synthesis in One Second	Chenguo Lin et.al.	2507.10065v1	null
2025-07-14	Automating SPARQL Query Translations between DBpedia and Wikidata	Malte Christian Bartels et.al.	2507.10045v1	null
2025-07-11	Compress Any Segment Anything Model (SAM)	Juntong Fan et.al.	2507.08765v1	null
2025-07-11	NL in the Middle: Code Translation with LLMs and Intermediate Representations	Chi-en Amy Tai et.al.	2507.08627v1	null
2025-07-11	BayesTTA: Continual-Temporal Test-Time Adaptation for Vision-Language Models via Gaussian Discriminant Analysis	Shuang Cui et.al.	2507.08607v1	null
2025-07-11	Unlocking Speech Instruction Data Potential with Query Rewriting	Yonghua Hei et.al.	2507.08603v1	null
2025-07-11	Visual Semantic Description Generation with MLLMs for Image-Text Matching	Junyu Chen et.al.	2507.08590v1	null
2025-07-11	Large Multi-modal Model Cartographic Map Comprehension for Textual Locality Georeferencing	Kalana Wijegunarathna et.al.	2507.08575v1	null
2025-07-11	AbbIE: Autoregressive Block-Based Iterative Encoder for Efficient Sequence Modeling	Preslav Aleksandrov et.al.	2507.08567v1	null
2025-07-11	MIDI-VALLE: Improving Expressive Piano Performance Synthesis Through Neural Codec Language Modelling	Jingjing Tang et.al.	2507.08530v1	null
2025-07-11	SPINT: Spatial Permutation-Invariant Neural Transformer for Consistent Intracortical Motor Decoding	Trung Le et.al.	2507.08402v1	null
2025-07-11	PanMatch: Unleashing the Potential of Large Vision Models for Unified Matching Models	Yongjian Zhang et.al.	2507.08400v1	null
2025-07-10	Impact of Pretraining Word Co-occurrence on Compositional Generalization in Multimodal Models	Helen Qu et.al.	2507.08000v1	null
2025-07-10	MGVQ: Could VQ-VAE Beat VAE? A Generalizable Tokenizer with Multi-group Quantization	Mingkai Jia et.al.	2507.07997v1	null
2025-07-10	CLIP Won't Learn Object-Attribute Binding from Natural Data and Here is Why	Bijay Gurung et.al.	2507.07985v1	null
2025-07-10	SAGE: A Visual Language Model for Anomaly Detection via Fact Enhancement and Entropy-aware Alignment	Guoxin Zang et.al.	2507.07939v1	null
2025-07-10	Lost in Pronunciation: Detecting Chinese Offensive Language Disguised by Phonetic Cloaking Replacement	Haotan Guo et.al.	2507.07640v1	null
2025-07-10	Exploring the Limits of Model Compression in LLMs: A Knowledge Distillation Study on QA Tasks	Joyeeta Datta et.al.	2507.07630v1	null
2025-07-10	LOSC: LiDAR Open-voc Segmentation Consolidator	Nermin Samet et.al.	2507.07605v1	null
2025-07-10	Mix-Geneformer: Unified Representation Learning for Human and Mouse scRNA-seq Data	Yuki Nishio et.al.	2507.07454v1	null
2025-07-10	EscherNet++: Simultaneous Amodal Completion and Scalable View Synthesis through Masked Fine-Tuning and Enhanced Feed-Forward 3D Reconstruction	Xinan Zhang et.al.	2507.07410v1	null
2025-07-10	Phishing Detection in the Gen-AI Era: Quantized LLMs vs Classical Models	Jikesh Thapa et.al.	2507.07406v1	null
2025-07-09	Go to Zero: Towards Zero-shot Motion Generation with Million-scale Data	Ke Fan et.al.	2507.07095v1	null
2025-07-09	Free on the Fly: Enhancing Flexibility in Test-Time Adaptation with Online EM	Qiyuan Dai et.al.	2507.06973v1	null
2025-07-09	MIND: A Multi-agent Framework for Zero-shot Harmful Meme Detection	Ziyan Liu et.al.	2507.06908v1	null
2025-07-09	MADPOT: Medical Anomaly Detection with CLIP Adaptation and Partial Optimal Transport	Mahshid Shiri et.al.	2507.06733v1	null
2025-07-09	CLI-RAG: A Retrieval-Augmented Framework for Clinically Structured and Context Aware Text Generation with LLMs	Garapati Keerthana et.al.	2507.06715v1	null
2025-07-09	Text-promptable Object Counting via Quantity Awareness Enhancement	Miaojing Shi et.al.	2507.06679v1	null
2025-07-09	Few-shot Learning on AMS Circuits and Its Application to Parasitic Capacitance Prediction	Shan Shen et.al.	2507.06538v1	null
2025-07-08	VisioPath: Vision-Language Enhanced Model Predictive Control for Safe Autonomous Navigation in Mixed Traffic	Shanting Wang et.al.	2507.06441v1	null
2025-07-08	Tile-Based ViT Inference with Visual-Cluster Priors for Zero-Shot Multi-Species Plant Identification	Murilo Gustineli et.al.	2507.06093v1	null
2025-07-08	Conditional Multi-Stage Failure Recovery for Embodied Agents	Youmna Farag et.al.	2507.06016v1	null
2025-07-08	From General Relation Patterns to Task-Specific Decision-Making in Continual Multi-Agent Coordination	Chang Yao et.al.	2507.06004v1	null
2025-07-08	DocIE@XLLM25: In-Context Learning for Information Extraction using Fully Synthetic Demonstrations	Nicholas Popovič et.al.	2507.05997v1	null
2025-07-08	Automatic Synthesis of High-Quality Triplet Data for Composed Image Retrieval	Haiwen Li et.al.	2507.05970v1	null
2025-07-09	A Wireless Foundation Model for Multi-Task Prediction	Yucheng Sheng et.al.	2507.05938v2	null
2025-07-08	Differentiable Reward Optimization for LLM based TTS system	Changfeng Gao et.al.	2507.05911v1	null
2025-07-08	Video Event Reasoning and Prediction by Fusing World Knowledge from LLMs with Vision Foundation Models	Léa Dubois et.al.	2507.05822v1	null
2025-07-08	DreamGrasp: Zero-Shot 3D Multi-Object Reconstruction from Partial-View Images for Robotic Manipulation	Young Hun Kim et.al.	2507.05627v1	null
2025-07-07	SenseCF: LLM-Prompted Counterfactuals for Intervention and Sensor Data Augmentation	Shovito Barua Soumma et.al.	2507.05541v1	null
2025-07-07	Modeling Latent Partner Strategies for Adaptive Zero-Shot Human-Agent Collaboration	Benjamin Li et.al.	2507.05244v1	null
2025-07-07	In-Context Learning as an Effective Estimator of Functional Correctness of LLM-Generated Code	Susmita Das et.al.	2507.05200v1	null
2025-07-07	VERITAS: Verification and Explanation of Realness in Images for Transparency in AI Systems	Aadi Srivastava et.al.	2507.05146v1	null
2025-07-07	An Evaluation of Large Language Models on Text Summarization Tasks Using Prompt Engineering Techniques	Walid Mohamed Aly et.al.	2507.05123v1	null
2025-07-07	Multi-modal Representations for Fine-grained Multi-label Critical View of Safety Recognition	Britty Baby et.al.	2507.05007v1	null
2025-07-08	Do We Really Need Specialization? Evaluating Generalist Text Embeddings for Zero-Shot Recommendation and Search	Matteo Attimonelli et.al.	2507.05006v2	null
2025-07-07	Harnessing Pairwise Ranking Prompting Through Sample-Efficient Ranking Distillation	Junru Wu et.al.	2507.04820v1	null
2025-07-07	An analysis of vision-language models for fabric retrieval	Francesco Giuliari et.al.	2507.04735v1	null
2025-07-07	Why We Feel What We Feel: Joint Detection of Emotions and Their Opinion Triggers in E-commerce	Arnav Attri et.al.	2507.04708v1	null
2025-07-07	VectorLLM: Human-like Extraction of Structured Building Contours vis Multimodal LLMs	Tao Zhang et.al.	2507.04664v1	null
2025-07-03	MultiGen: Using Multimodal Generation in Simulation to Learn Multimodal Policies in Real	Renhao Wang et.al.	2507.02864v1	null
2025-07-03	RichControl: Structure- and Appearance-Rich Training-Free Spatial Control for Text-to-Image Generation	Liheng Zhang et.al.	2507.02792v1	null
2025-07-06	KERAP: A Knowledge-Enhanced Reasoning Approach for Accurate Zero-shot Diagnosis Prediction Using Multi-agent LLMs	Yuzhang Xie et.al.	2507.02773v2	null
2025-07-03	DeSTA2.5-Audio: Toward General-Purpose Large Audio Language Model with Self-Generated Cross-Modal Alignment	Ke-Han Lu et.al.	2507.02768v1	null
2025-07-03	DexVLG: Dexterous Vision-Language-Grasp Model at Scale	Jiawei He et.al.	2507.02747v1	null
2025-07-03	Hierarchical Multi-Label Contrastive Learning for Protein-Protein Interaction Prediction Across Organisms	Shiyi Liu et.al.	2507.02724v1	null
2025-07-03	Learning few-step posterior samplers by unfolding and distillation of diffusion models	Charlesquin Kemajou Mbakam et.al.	2507.02686v1	null
2025-07-03	A Matrix Variational Auto-Encoder for Variant Effect Prediction in Pharmacogenes	Antoine Honoré et.al.	2507.02624v1	null
2025-07-03	LLMREI: Automating Requirements Elicitation Interviews with LLMs	Alexander Korn et.al.	2507.02564v1	null
2025-07-03	IGDNet: Zero-Shot Robust Underexposed Image Enhancement via Illumination-Guided and Denoising	Hailong Yan et.al.	2507.02445v1	null
2025-07-02	Reasoning to Edit: Hypothetical Instruction-Based Image Editing with Visual Reasoning	Qingdong He et.al.	2507.01908v1	null
2025-07-02	Towards Foundation Auto-Encoders for Time-Series Anomaly Detection	Gastón García González et.al.	2507.01875v1	null
2025-07-02	MoIRA: Modular Instruction Routing Architecture for Multi-Task Robotics	Dmytro Kuzmenko et.al.	2507.01843v1	null
2025-07-02	RobuSTereo: Robust Zero-Shot Stereo Matching under Adverse Weather	Yuran Wang et.al.	2507.01653v1	null
2025-07-02	Adapting Language Models to Indonesian Local Languages: An Empirical Study of Language Transferability on Zero-Shot Settings	Rifki Afina Putri et.al.	2507.01645v1	null
2025-07-02	Depth Anything at Any Condition	Boyuan Sun et.al.	2507.01634v1	null
2025-07-02	NOCTIS: Novel Object Cyclic Threshold based Instance Segmentation	Max Gandyra et.al.	2507.01463v1	null
2025-07-02	La RoSA: Enhancing LLM Efficiency via Layerwise Rotated Sparse Activation	Kai Liu et.al.	2507.01299v1	null
2025-07-02	AIGVE-MACS: Unified Multi-Aspect Commenting and Scoring Model for AI-Generated Video Evaluation	Xiao Liu et.al.	2507.01255v1	null
2025-07-01	VQ-VLA: Improving Vision-Language-Action Models via Scaling Vector-Quantized Action Tokenizers	Yating Wang et.al.	2507.01016v1	null
2025-06-30	Foundation Models for Zero-Shot Segmentation of Scientific Images without AI-Ready Data	Shubhabrata Mukherjee et.al.	2506.24039v1	null
2025-06-30	Machine Understanding of Scientific Language	Dustin Wright et.al.	2506.23990v1	null
2025-06-30	Leveraging the Potential of Prompt Engineering for Hate Speech Detection in Low-Resource Languages	Ruhina Tabasshum Prome et.al.	2506.23930v1	null
2025-06-30	World4Omni: A Zero-Shot Framework from Image Generation World Model to Robotic Manipulation	Haonan Chen et.al.	2506.23919v1	null
2025-06-30	Interpretable Zero-Shot Learning with Locally-Aligned Vision-Language Model	Shiming Chen et.al.	2506.23822v1	null
2025-06-30	Zero-Shot Contextual Embeddings via Offline Synthetic Corpus Generation	Philip Lippmann et.al.	2506.23662v1	null
2025-06-30	Blending Concepts with Text-to-Image Diffusion Models	Lorenzo Olearo et.al.	2506.23630v1	null
2025-06-30	StackCLIP: Clustering-Driven Stacked Prompt in Zero-Shot Industrial Anomaly Detection	Yanning Hou et.al.	2506.23577v1	null
2025-06-30	AdFair-CLIP: Adversarial Fair Contrastive Language-Image Pre-training for Chest X-rays	Chenlang Yi et.al.	2506.23467v1	null
2025-06-29	Federated Timeline Synthesis: Scalable and Private Methodology For Model Training and Deployment	Pawel Renc et.al.	2506.23358v1	null
2025-06-27	Reinforcement Learning with Physics-Informed Symbolic Program Priors for Zero-Shot Wireless Indoor Navigation	Tao Li et.al.	2506.22365v1	null
2025-06-27	OutDreamer: Video Outpainting with a Diffusion Transformer	Linhao Zhong et.al.	2506.22298v1	null
2025-06-27	Frequency-Semantic Enhanced Variational Autoencoder for Zero-Shot Skeleton-based Action Recognition	Wenhan Wu et.al.	2506.22179v1	null
2025-06-27	Partial CLIP is Enough: Chimera-Seg for Zero-shot Semantic Segmentation	Jialei Chen et.al.	2506.22032v1	null
2025-06-27	SPAZER: Spatial-Semantic Progressive Reasoning Agent for Zero-shot 3D Visual Grounding	Zhao Jin et.al.	2506.21924v1	null
2025-06-27	ZeroReg3D: A Zero-shot Registration Pipeline for 3D Consecutive Histopathology Image Reconstruction	Juming Xiong et.al.	2506.21923v1	null
2025-06-27	Embodied Domain Adaptation for Object Detection	Xiangyu Shi et.al.	2506.21860v1	null
2025-06-30	ProSAM: Enhancing the Robustness of SAM-based Visual Reference Segmentation with Probabilistic Prompts	Xiaoqi Wang et.al.	2506.21835v2	null
2025-06-26	WAFT: Warping-Alone Field Transforms for Optical Flow	Yihan Wang et.al.	2506.21526v1	null
2025-06-26	Lightweight Physics-Informed Zero-Shot Ultrasound Plane Wave Denoising	Hojat Asgariandehkordi et.al.	2506.21499v1	null
2025-06-26	Domain Knowledge-Enhanced LLMs for Fraud and Concept Drift Detection	Ali Şenol et.al.	2506.21443v1	null
2025-06-26	SMMILE: An Expert-Driven Benchmark for Multimodal Medical In-Context Learning	Melanie Rieff et.al.	2506.21355v1	null
2025-06-26	Zero-Shot Learning for Obsolescence Risk Forecasting	Elie Saad et.al.	2506.21240v1	null
2025-06-26	Efficient Skill Discovery via Regret-Aware Optimization	He Zhang et.al.	2506.21044v1	null
2025-06-26	EVA: Mixture-of-Experts Semantic Variant Alignment for Compositional Zero-Shot Learning	Xiao Zhang et.al.	2506.20986v1	null
2025-06-27	DFVEdit: Conditional Delta Flow Vector for Zero-shot Video Editing	Lingling Cai et.al.	2506.20967v2	null
2025-06-26	Consistent Zero-shot 3D Texture Synthesis Using Geometry-aware Diffusion and Temporal Video Models	Donggoo Kang et.al.	2506.20946v1	null
2025-06-25	MultiHuman-Testbench: Benchmarking Image Generation for Multiple Humans	Shubhankar Borse et.al.	2506.20879v1	null
2025-06-25	Uncovering Hidden Violent Tendencies in LLMs: A Demographic Analysis via Behavioral Vignettes	Quintin Myers et.al.	2506.20822v1	null
2025-06-25	Behavior Foundation Model: Towards Next-Generation Whole-Body Control System of Humanoid Robots	Mingqi Yuan et.al.	2506.20487v1	null
2025-06-25	HiWave: Training-Free High-Resolution Image Generation via Wavelet-Based Diffusion Sampling	Tobias Vontobel et.al.	2506.20452v1	null
2025-06-25	Recognizing Surgical Phases Anywhere: Few-Shot Test-time Adaptation and Task-graph Guided Refinement	Kun Yuan et.al.	2506.20254v1	null
2025-06-25	Zero-Shot Attribution for Large Language Models: A Distribution Testing Approach	Clément L. Canonne et.al.	2506.20197v1	null
2025-06-25	An Exploration of ECAPA-TDNN and x-vector Speaker Representations in Zero-shot Multi-speaker TTS	Marie Kunešová et.al.	2506.20190v1	null
2025-06-25	CCRS: A Zero-Shot LLM-as-a-Judge Framework for Comprehensive RAG Evaluation	Aashiq Muhamed et.al.	2506.20128v1	null
2025-06-24	Universal pre-training by iterated random computation	Peter Bloem et.al.	2506.20057v1	null
2025-06-24	TRACED: Transition-aware Regret Approximation with Co-learnability for Environment Design	Geonwoo Cho et.al.	2506.19997v1	null
2025-06-24	MILAAP: Mobile Link Allocation via Attention-based Prediction	Yung-Fu Chen et.al.	2506.19947v1	null
2025-06-26	ReactEMG: Zero-Shot, Low-Latency Intent Detection via sEMG	Runsheng Wang et.al.	2506.19815v2	null
2025-06-24	SAM2-SGP: Enhancing SAM2 for Medical Image Segmentation via Support-Set Guided Prompting	Yang Xing et.al.	2506.19658v1	null
2025-06-24	ChordPrompt: Orchestrating Cross-Modal Prompt Synergy for Multi-Domain Incremental Learning in CLIP	Zhiyuan Wang et.al.	2506.19608v1	null
2025-06-24	Commonsense Generation and Evaluation for Dialogue Systems using Large Language Models	Marcos Estecha-Garitagoitia et.al.	2506.19483v1	null
2025-06-24	Commander-GPT: Dividing and Routing for Multimodal Sarcasm Detection	Yazhou Zhang et.al.	2506.19420v1	null
2025-06-24	Maximal Update Parametrization and Zero-Shot Hyperparameter Transfer for Fourier Neural Operators	Shanda Li et.al.	2506.19396v1	null
2025-06-24	Zero-Shot Parameter Learning of Robot Dynamics Using Bayesian Statistics and Prior Knowledge	Carsten Reiners et.al.	2506.19350v1	null
2025-06-24	Robotic Perception with a Large Tactile-Vision-Language Model for Physical Property Inference	Zexiang Guo et.al.	2506.19303v1	null
2025-06-23	Spiritual-LLM : Gita Inspired Mental Health Therapy In the Era of LLMs	Janak Kapuriya et.al.	2506.19185v1	null
2025-06-23	EEG Foundation Challenge: From Cross-Task to Cross-Subject EEG Decoding	Bruno Aristimunha et.al.	2506.19141v1	null
2025-06-23	Universal Video Temporal Grounding with Generative Multi-modal Large Language Models	Zeqian Li et.al.	2506.18883v1	null
2025-06-23	A Modular Taxonomy for Hate Speech Definitions and Its Impact on Zero-Shot LLM Classification Performance	Matteo Melis et.al.	2506.18576v1	null
2025-06-23	Standard Applicability Judgment and Cross-jurisdictional Reasoning: A RAG-based Framework for Medical Device Compliance	Yu Han et.al.	2506.18511v1	null
2025-06-23	Generalizing Vision-Language Models to Novel Domains: A Comprehensive Survey	Xinyao Li et.al.	2506.18504v1	null
2025-06-23	GraspMAS: Zero-Shot Language-driven Grasp Detection with Multi-Agent System	Quang Nguyen et.al.	2506.18448v1	null
2025-06-23	CPAM: Context-Preserving Adaptive Manipulation for Zero-Shot Real Image Editing	Dinh-Khoi Vo et.al.	2506.18438v1	null
2025-06-23	A Multi-Scale Spatial Attention-Based Zero-Shot Learning Framework for Low-Light Image Enhancement	Muhammad Azeem Aslam et.al.	2506.18323v1	null
2025-06-23	Team LA at SCIDOCA shared task 2025: Citation Discovery via relation-based zero-shot retrieval	Trieu An et.al.	2506.18316v1	null
2025-06-23	GeNeRT: A Physics-Informed Approach to Intelligent Wireless Channel Modeling via Generalizable Neural Ray Tracing	Kejia Bian et.al.	2506.18295v1	null
2025-06-23	Learning Causal Graphs at Scale: A Foundation Model Approach	Naiyu Yin et.al.	2506.18285v1	null
2025-06-23	Emergent Temporal Correspondences from Video Diffusion Transformers	Jisu Nam et.al.	2506.17220v2	link
2025-06-20	Monocular One-Shot Metric-Depth Alignment for RGB-Based Robot Grasping	Teng Guo et.al.	2506.17110v1	null
2025-06-20	Prmpt2Adpt: Prompt-Based Zero-Shot Domain Adaptation for Resource-Constrained Environments	Yasir Ali Farrukh et.al.	2506.16994v1	null
2025-06-20	LunarLoc: Segment-Based Global Localization on the Moon	Annika Thomas et.al.	2506.16940v1	link
2025-06-20	Single-shot thermometry of simulated Bose–Einstein condensates using artificial intelligence	Jack Griffiths et.al.	2506.16925v1	null
2025-06-20	With Limited Data for Multimodal Alignment, Let the STRUCTURE Guide You	Fabian Gröger et.al.	2506.16895v1	null
2025-06-20	AnyTraverse: An off-road traversability framework with VLM and human operator in the loop	Sattwik Sahu et.al.	2506.16826v1	null
2025-06-20	Generalizable Agent Modeling for Agent Collaboration-Competition Adaptation with Multi-Retrieval and Dynamic Generation	Chenxu Wang et.al.	2506.16718v1	link
2025-06-20	LegiGPT: Party Politics and Transport Policy with Large Language Model	Hyunsoo Yun et.al.	2506.16692v1	null
2025-06-19	History-Augmented Vision-Language Models for Frontier-Based Zero-Shot Object Navigation	Mobin Habibpour et.al.	2506.16623v1	null
2025-06-18	Task-Agnostic Experts Composition for Continual Learning	Luigi Quarantiello et.al.	2506.15566v1	null
2025-06-18	Creating User-steerable Projections with Interactive Semantic Mapping	Artur André Oliveira et.al.	2506.15479v1	null
2025-06-18	Zero-Shot Reinforcement Learning Under Partial Observability	Scott Jeen et.al.	2506.15446v1	null
2025-06-18	DeVisE: Behavioral Testing of Medical Large Language Models	Camila Zurdo Tagliabue et.al.	2506.15339v1	null
2025-06-18	A Comparative Study of Task Adaptation Techniques of Large Language Models for Identifying Sustainable Development Goals	Andrea Cadeddu et.al.	2506.15208v1	null
2025-06-18	ReSeDis: A Dataset for Referring-based Object Search across Large-Scale Image Collections	Ziling Huang et.al.	2506.15180v1	null
2025-06-18	DyNaVLM: Zero-Shot Vision-Language Navigation System with Dynamic Viewpoints and Self-Refining Graph Memory	Zihe Ji et.al.	2506.15096v1	null
2025-06-17	From Chat to Checkup: Can Large Language Models Assist in Diabetes Prediction?	Shadman Sakib et.al.	2506.14949v1	link
2025-06-17	BMFM-RNA: An Open Framework for Building and Evaluating Transcriptomic Foundation Models	Bharath Dandala et.al.	2506.14861v1	link
2025-06-17	Revisiting Chain-of-Thought Prompting: Zero-shot Can Be Stronger than Few-shot	Xiang Cheng et.al.	2506.14641v1	null
2025-06-17	VisLanding: Monocular 3D Perception for UAV Safe Landing via Depth-Normal Synergy	Zhuoyue Tan et.al.	2506.14525v1	null
2025-06-17	EVA02-AT: Egocentric Video-Language Understanding with Spatial-Temporal Rotary Positional Embeddings and Symmetric Optimization	Xiaoqi Wang et.al.	2506.14356v1	link
2025-06-17	ClutterDexGrasp: A Sim-to-Real System for General Dexterous Grasping in Cluttered Scenes	Zeyuan Chen et.al.	2506.14317v1	null
2025-06-17	Equivariance Everywhere All At Once: A Recipe for Graph Foundation Models	Ben Finkelshtein et.al.	2506.14291v1	link
2025-06-17	Investigation of Zero-shot Text-to-Speech Models for Enhancing Short-Utterance Speaker Verification	Yiyang Zhao et.al.	2506.14226v1	null
2025-06-17	Interpreting Biomedical VLMs on High-Imbalance Out-of-Distributions: An Insight into BiomedCLIP on Radiology	Nafiz Sadman et.al.	2506.14136v1	link
2025-06-17	Multi-Scale Finetuning for Encoder-based Time Series Foundation Models	Zhongzheng Qiao et.al.	2506.14087v1	null
2025-06-16	An Interdisciplinary Review of Commonsense Reasoning and Intent Detection	Md Nazmus Sakib et.al.	2506.14040v1	null
2025-06-16	Comparison of ConvNeXt and Vision-Language Models for Breast Density Assessment in Screening Mammography	Yusdivia Molina-Román et.al.	2506.13964v1	null
2025-06-16	LeVERB: Humanoid Whole-Body Control with Latent Vision-Language Instruction	Haoru Xue et.al.	2506.13751v1	null
2025-06-16	OTFusion: Bridging Vision-only and Vision-Language Models via Optimal Transport for Transductive Zero-Shot Learning	Qiyu Xu et.al.	2506.13723v1	null
2025-06-16	Abstract, Align, Predict: Zero-Shot Stance Detection via Cognitive Inductive Reasoning	Jun Ma et.al.	2506.13470v1	null
2025-06-16	Zero-Shot Solving of Imaging Inverse Problems via Noise-Refined Likelihood Guided Diffusion Models	Zhen Wang et.al.	2506.13391v1	null
2025-06-16	TR2M: Transferring Monocular Relative Depth to Metric Depth with Language Descriptions and Scale-Oriented Contrast	Beilei Cui et.al.	2506.13387v1	link
2025-06-16	Distinct Computations Emerge From Compositional Curricula in In-Context Learning	Jin Hwa Lee et.al.	2506.13253v1	null
2025-06-16	PRISM2: Unlocking Multi-Modal General Pathology AI with Clinical Dialogue	George Shaikovski et.al.	2506.13063v1	null
2025-06-16	ZipVoice: Fast and High-Quality Zero-Shot Text-to-Speech with Flow Matching	Han Zhu et.al.	2506.13053v1	link
2025-06-16	Knowledge Graph Fusion with Large Language Models for Accurate, Explainable Manufacturing Process Planning	Danny Hoang et.al.	2506.13026v1	null
2025-06-15	Zero-shot denoising via neural compression: Theoretical and algorithmic framework	Ali Zafari et.al.	2506.12693v1	link
2025-06-13	On the Performance of LLMs for Real Estate Appraisal	Margot Geerts et.al.	2506.11812v1	null
2025-06-13	Persona-driven Simulation of Voting Behavior in the European Parliament with Large Language Models	Maximilian Kreutner et.al.	2506.11798v1	null
2025-06-13	Self-supervised Learning of Echocardiographic Video Representations via Online Cluster Distillation	Divyanshu Mishra et.al.	2506.11777v1	link
2025-06-13	ExoStart: Efficient learning for dexterous manipulation with sensorized exoskeleton demonstrations	Zilin Si et.al.	2506.11775v1	null
2025-06-13	Converting Annotated Clinical Cases into Structured Case Report Forms	Pietro Ferrazzi et.al.	2506.11666v1	null
2025-06-13	Prohibited Items Segmentation via Occlusion-aware Bilayer Modeling	Yunhan Ren et.al.	2506.11661v1	link
2025-06-13	OV-MAP : Open-Vocabulary Zero-Shot 3D Instance Segmentation Map for Robots	Juno Kim et.al.	2506.11585v1	null
2025-06-13	Identifying Helpful Context for LLM-based Vulnerability Repair: A Preliminary Study	Gábor Antal et.al.	2506.11561v1	null
2025-06-13	Manager: Aggregating Insights from Unimodal Experts in Two-Tower VLMs and MLLMs	Xiao Xu et.al.	2506.11515v1	null
2025-06-13	Preserving Clusters in Prompt Learning for Unsupervised Domain Adaptation	Tung-Long Vuong et.al.	2506.11493v1	null
2025-06-12	AIR: Zero-shot Generative Model Adaptation with Iterative Refinement	Guimeng Liu et.al.	2506.10895v1	link
2025-06-12	The Diffusion Duality	Subham Sekhar Sahoo et.al.	2506.10892v1	link
2025-06-12	Precise Zero-Shot Pointwise Ranking with LLMs through Post-Aggregated Global Context Information	Kehan Long et.al.	2506.10859v1	link
2025-06-12	Generalist Models in Medical Image Segmentation: A Survey and Performance Comparison with Task-Specific Approaches	Andrea Moglia et.al.	2506.10825v1	null
2025-06-12	Prompts to Summaries: Zero-Shot Language-Guided Video Summarization	Mario Barbara et.al.	2506.10807v1	null
2025-06-12	Neural at ArchEHR-QA 2025: Agentic Prompt Optimization for Evidence-Grounded Clinical Question Answering	Sai Prasanna Teja Reddy Bogireddy et.al.	2506.10751v1	null
2025-06-13	IQE-CLIP: Instance-aware Query Embedding for Zero-/Few-shot Anomaly Detection in Medical Domain	Hong Huang et.al.	2506.10730v2	link
2025-06-12	Beyond Single-User Dialogue: Assessing Multi-User Dialogue State Tracking Capabilities of Large Language Models	Sangmin Song et.al.	2506.10504v1	null
2025-06-12	LLMs Are Not Yet Ready for Deepfake Image Detection	Shahroz Tariq et.al.	2506.10474v1	null
2025-06-12	Using Vision Language Models to Detect Students' Academic Emotion through Facial Expressions	Deliang Wang et.al.	2506.10334v1	null
2025-06-11	Large Language Models for Toxic Language Detection in Low-Resource Balkan Languages	Amel Muminovic et.al.	2506.09992v1	link
2025-06-11	V-JEPA 2: Self-Supervised Video Models Enable Understanding, Prediction and Planning	Mido Assran et.al.	2506.09985v1	link
2025-06-11	Query-Focused Retrieval Heads Improve Long-Context Reasoning and Re-ranking	Wuwei Zhang et.al.	2506.09944v1	link
2025-06-11	Dataset of News Articles with Provenance Metadata for Media Relevance Assessment	Tomas Peterka et.al.	2506.09847v1	null
2025-06-11	Superstudent intelligence in thermodynamics	Rebecca Loubet et.al.	2506.09822v1	null
2025-06-11	Do LLMs Give Psychometrically Plausible Responses in Educational Assessments?	Andreas Säuberli et.al.	2506.09796v1	null
2025-06-11	Accurate and efficient zero-shot 6D pose estimation with frozen foundation models	Andrea Caraffa et.al.	2506.09784v1	null
2025-06-11	ELBO-T2IAlign: A Generic ELBO-Based Method for Calibrating Pixel-level Text-Image Alignment in Diffusion Models	Qin Zhou et.al.	2506.09740v1	null
2025-06-11	CHIP: A multi-sensor dataset for 6D pose estimation of chairs in industrial settings	Mattia Nardon et.al.	2506.09699v1	null
2025-06-11	Geometric flow regularization in latent spaces for smooth dynamics with the efficient variations of curvature	Andrew Gracyk et.al.	2506.09679v1	null
2025-06-10	Efficient Medical Vision-Language Alignment Through Adapting Masked Vision Models	Chenyu Lian et.al.	2506.08990v1	link
2025-06-10	Hyperbolic Dual Feature Augmentation for Open-Environment	Peilin Yu et.al.	2506.08906v1	null
2025-06-10	Advancing STT for Low-Resource Real-World Speech	Flavio D'Intino et.al.	2506.08836v1	null
2025-06-10	Paths to Causality: Finding Informative Subgraphs Within Knowledge Graphs for Knowledge-Based Causal Discovery	Yuni Susanti et.al.	2506.08771v1	link
2025-06-11	AraReasoner: Evaluating Reasoning-Based LLMs for Arabic NLP	Ahmed Hasanaath et.al.	2506.08768v2	null
2025-06-11	ClimateViz: A Benchmark for Statistical Reasoning and Fact Verification on Scientific Charts	Ruiran Su et.al.	2506.08700v2	link
2025-06-10	Orientation Matters: Making 3D Generative Models Orientation-Aligned	Yichong Lu et.al.	2506.08640v1	null
2025-06-10	Dense Retrievers Can Fail on Simple Queries: Revealing The Granularity Dilemma of Embeddings	Liyan Xu et.al.	2506.08592v1	link
2025-06-10	Fairness is Not Silence: Unmasking Vacuous Neutrality in Small Language Models	Sumanth Manduru et.al.	2506.08487v1	null
2025-06-10	Detecting Harmful Memes with Decoupled Understanding and Guided CoT Reasoning	Fengjun Pan et.al.	2506.08477v1	null
2025-06-09	StableMTL: Repurposing Latent Diffusion Models for Multi-Task Learning from Partially Annotated Synthetic Datasets	Anh-Quan Cao et.al.	2506.08013v1	link
2025-06-09	ZeroVO: Visual Odometry with Minimal Assumptions	Lei Lai et.al.	2506.08005v1	null
2025-06-09	CXR-LT 2024: A MICCAI challenge on long-tailed, multi-label, and zero-shot disease classification from chest X-ray	Mingquan Lin et.al.	2506.07984v1	null
2025-06-09	LUCIFER: Language Understanding and Context-Infused Framework for Exploration and Behavior Refinement	Dimitris Panagopoulos et.al.	2506.07915v1	null
2025-06-09	Evaluating Large Language Models on the Frame and Symbol Grounding Problems: A Zero-shot Benchmark	Shoko Oka et.al.	2506.07896v1	link
2025-06-09	Deep Equivariant Multi-Agent Control Barrier Functions	Nikolaos Bousias et.al.	2506.07755v1	null
2025-06-09	Language Embedding Meets Dynamic Graph: A New Exploration for Neural Architecture Representation Learning	Haizhao Jing et.al.	2506.07735v1	null
2025-06-09	Vuyko Mistral: Adapting LLMs for Low-Resource Dialectal Translation	Roman Kyslyi et.al.	2506.07617v1	null
2025-06-09	MIRA: Medical Time Series Foundation Model for Real-World Health Data	Hao Li et.al.	2506.07584v1	null
2025-06-09	Efficient Generation of Diverse Cooperative Agents with World Models	Yi Loo et.al.	2506.07450v1	null
2025-06-06	RecGPT: A Foundation Model for Sequential Recommendation	Yangqin Jiang et.al.	2506.06270v1	link
2025-06-06	Masked Language Models are Good Heterogeneous Graph Generalizers	Jinyu Yang et.al.	2506.06157v1	link
2025-06-06	Let's CONFER: A Dataset for Evaluating Natural Language Inference Models on CONditional InFERence and Presupposition	Tara Azin et.al.	2506.06133v1	null
2025-06-06	Bridging the Gap: In-Context Learning for Modeling Human Disagreement	Benedetta Muscato et.al.	2506.06113v1	null
2025-06-09	Text-to-LoRA: Instant Transformer Adaption	Rujikorn Charakorn et.al.	2506.06105v2	null
2025-06-06	Full Conformal Adaptation of Medical Vision-Language Models	Julio Silva-Rodríguez et.al.	2506.06076v1	null
2025-06-06	Zero-Shot Detection of LLM-Generated Code via Approximated Task Conditioning	Maor Ashkenazi et.al.	2506.06069v1	null
2025-06-06	LightGTS: A Lightweight General Time Series Forecasting Model	Yihang Wang et.al.	2506.06005v1	null
2025-06-06	Improving Long-Range Navigation with Spatially-Enhanced Recurrent Memory via End-to-End Reinforcement Learning	Fan Yang et.al.	2506.05997v1	null
2025-06-06	MOGO: Residual Quantized Hierarchical Causal Transformer for High-Quality and Real-Time 3D Human Motion Generation	Dongjie Fu et.al.	2506.05952v1	null
2025-06-05	ProRefine: Inference-time Prompt Refinement with Textual Feedback	Deepak Pandita et.al.	2506.05305v1	null
2025-06-05	RaySt3R: Predicting Novel Depth Maps for Zero-Shot Object Completion	Bardienus P. Duisterhof et.al.	2506.05285v1	null
2025-06-05	From Play to Replay: Composed Video Retrieval for Temporally Fine-Grained Videos	Animesh Gupta et.al.	2506.05274v1	link
2025-06-05	Can Foundation Models Generalise the Presentation Attack Detection Capabilities on ID Cards?	Juan E. Tapia et.al.	2506.05263v1	null
2025-06-05	Towards Vision-Language-Garment Models For Web Knowledge Garment Understanding and Generation	Jan Ackermann et.al.	2506.05210v1	null
2025-06-05	Fabrica: Dual-Arm Assembly of General Multi-Part Objects via Integrated Planning and Learning	Yunsheng Tian et.al.	2506.05168v1	null
2025-06-05	DiCoRe: Enhancing Zero-shot Event Detection via Divergent-Convergent LLM Reasoning	Tanmay Parekh et.al.	2506.05128v1	null
2025-06-05	Just a Scratch: Enhancing LLM Capabilities for Self-harm Detection through Intent Differentiation and Emoji Interpretation	Soumitra Ghosh et.al.	2506.05073v1	null
2025-06-05	Tuning the Right Foundation Models is What you Need for Partial Label Learning	Kuang He et.al.	2506.05027v1	link
2025-06-05	Structure-Aware Radar-Camera Depth Estimation	Fuyi Zhang et.al.	2506.05008v1	null
2025-06-04	Object-centric 3D Motion Field for Robot Learning from Human Videos	Zhao-Heng Yin et.al.	2506.04227v1	null
2025-06-04	Struct2D: A Perception-Guided Framework for Spatial Reasoning in Large Multimodal Models	Fangrui Zhu et.al.	2506.04220v1	null
2025-06-04	OWMM-Agent: Open World Mobile Manipulation With Multi-modal Agentic Data Synthesis	Junting Chen et.al.	2506.04217v1	link
2025-06-04	MACS: Multi-Agent Reinforcement Learning for Optimization of Crystal Structures	Elena Zamaraeva et.al.	2506.04195v1	null
2025-06-04	Physics-Constrained Flow Matching: Sampling Generative Models with Hard Constraints	Utkarsh Utkarsh et.al.	2506.04171v1	null
2025-06-04	HiFiTTS-2: A Large-Scale High Bandwidth Speech Dataset	Ryan Langman et.al.	2506.04152v1	null
2025-06-04	TextAtari: 100K Frames Game Playing with Language Agents	Wenhao Li et.al.	2506.04098v1	link
2025-06-04	Towards Better Disentanglement in Non-Autoregressive Zero-Shot Expressive Voice Conversion	Seymanur Akti et.al.	2506.04013v1	null
2025-06-04	Vocabulary-free few-shot learning for Vision-Language Models	Maxime Zanella et.al.	2506.04005v1	null
2025-06-04	Kinship in Speech: Leveraging Linguistic Relatedness for Zero-Shot TTS in Indian Languages	Utkarsh Pathak et.al.	2506.03884v1	null
2025-06-03	Native-Resolution Image Synthesis	Zidong Wang et.al.	2506.03131v1	null
2025-06-03	Zero-Shot Time Series Forecasting with Covariates via In-Context Learning	Andreas Auer et.al.	2506.03128v1	null
2025-06-03	Targeted Forgetting of Image Subgroups in CLIP Models	Zeliang Zhang et.al.	2506.03117v1	null
2025-06-03	Zero-Shot Tree Detection and Segmentation from Aerial Forest Imagery	Michelle Chen et.al.	2506.03114v1	link
2025-06-03	FuseLIP: Multimodal Embeddings via Early Fusion of Discrete Tokens	Christian Schlarmann et.al.	2506.03096v1	link
2025-06-03	DFBench: Benchmarking Deepfake Image Detection Capability of Large Multimodal Models	Jiarui Wang et.al.	2506.03007v1	null
2025-06-03	A Multi-Agent Framework for Mitigating Dialect Biases in Privacy Policy Question-Answering Systems	Đorđe Klisura et.al.	2506.02998v1	null
2025-06-04	FlySearch: Exploring how vision-language models explore	Adam Pardyl et.al.	2506.02896v2	link
2025-06-03	DGMO: Training-Free Audio Source Separation through Diffusion-Guided Mask Optimization	Geonyoung Lee et.al.	2506.02858v1	null
2025-06-03	PBR-SR: Mesh PBR Texture Super Resolution from 2D Image Priors	Yujin Chen et.al.	2506.02846v1	null
2025-05-30	Zero-Shot Chinese Character Recognition with Hierarchical Multi-Granularity Image-Text Aligning	Yinglian Zhu et.al.	2505.24837v1	null
2025-05-30	Multilinguality Does not Make Sense: Investigating Factors Behind Zero-Shot Transfer in Sense-Aware Tasks	Roksana Goworek et.al.	2505.24834v1	null
2025-05-30	LGAR: Zero-Shot LLM-Guided Neural Ranking for Abstract Screening in Systematic Literature Reviews	Christian Jaumann et.al.	2505.24757v1	link
2025-05-30	Conformal Prediction for Zero-Shot Models	Julio Silva-Rodríguez et.al.	2505.24693v1	link
2025-05-30	TRIDENT: Enhancing Large Language Model Safety with Tri-Dimensional Diversified Red-Teaming Data Synthesis	Xiaorui Wu et.al.	2505.24672v1	link
2025-05-30	Benchmarking Large Language Models for Cryptanalysis and Mismatched-Generalization	Utsav Maskey et.al.	2505.24621v1	null
2025-05-30	When Harry Meets Superman: The Role of The Interlocutor in Persona-Based Dialogue Generation	Daniela Occhipinti et.al.	2505.24613v1	null
2025-05-30	Improving Language and Modality Transfer in Translation by Character-level Modeling	Ioannis Tsiamas et.al.	2505.24561v1	null
2025-05-30	Can Slow-thinking LLMs Reason Over Time? Empirical Studies in Time Series Forecasting	Jiahao Wang et.al.	2505.24511v1	link
2025-05-30	Advancing Compositional Awareness in CLIP with Efficient Fine-Tuning	Amit Peleg et.al.	2505.24424v1	null
2025-05-29	To Trust Or Not To Trust Your Vision-Language Model's Prediction	Hao Dong et.al.	2505.23745v1	link
2025-05-29	TiRex: Zero-Shot Forecasting Across Long and Short Horizons with Enhanced In-Context Learning	Andreas Auer et.al.	2505.23719v1	link
2025-05-29	AnySplat: Feed-forward 3D Gaussian Splatting from Unconstrained Views	Lihan Jiang et.al.	2505.23716v1	null
2025-05-29	LoLA: Low-Rank Linear Attention With Sparse Caching	Luke McDermott et.al.	2505.23666v1	null
2025-05-29	D-AR: Diffusion via Autoregressive Models	Ziteng Gao et.al.	2505.23660v1	link
2025-05-29	ARC: Argument Representation and Coverage Analysis for Zero-Shot Long Document Summarization with Instruction Following LLMs	Mohamed Elaraby et.al.	2505.23654v1	null
2025-05-29	ZeroSep: Separate Anything in Audio with Zero Training	Chao Huang et.al.	2505.23625v1	null
2025-05-29	Evaluating AI capabilities in detecting conspiracy theories on YouTube	Leonardo La Rocca et.al.	2505.23570v1	link
2025-05-29	Uni-MuMER: Unified Multi-Task Fine-Tuning of Vision-Language Model for Handwritten Mathematical Expression Recognition	Yu Li et.al.	2505.23566v1	link
2025-05-29	Spoken question answering for visual queries	Nimrod Shabtay et.al.	2505.23308v1	null
2025-05-28	Zero-Shot Vision Encoder Grafting via LLM Surrogates	Kaiyu Yue et.al.	2505.22664v1	link
2025-05-28	Learning Composable Chains-of-Thought	Fangcong Yin et.al.	2505.22635v1	null
2025-05-28	ClaimPKG: Enhancing Claim Verification via Pseudo-Subgraph Generation with Lightweight Specialized LLM	Hoang Pham et.al.	2505.22552v1	null
2025-05-28	Multi-MLLM Knowledge Distillation for Out-of-Context News Detection	Yimeng Gu et.al.	2505.22517v1	null
2025-05-28	Zero-Shot 3D Visual Grounding from Vision-Language Models	Rong Li et.al.	2505.22429v1	null
2025-05-29	Logical Consistency is Vital: Neural-Symbolic Information Retrieval for Negative-Constraint Queries	Ganlin Xu et.al.	2505.22299v2	link
2025-05-28	Compensating for Data with Reasoning: Low-Resource Machine Translation with LLMs	Samuel Frontull et.al.	2505.22293v1	null
2025-05-28	Domain Adaptation of Attention Heads for Zero-shot Anomaly Detection	Kiyoon Jeong et.al.	2505.22259v1	null
2025-05-28	3D Question Answering via only 2D Vision-Language Models	Fengyun Wang et.al.	2505.22143v1	null
2025-05-28	Bringing CLIP to the Clinic: Dynamic Soft Labels and Negation-Aware Learning for Medical Analysis	Hanbin Ko et.al.	2505.22079v1	null
2025-05-27	Vision Transformers with Self-Distilled Registers	Yinjie Chen et.al.	2505.21501v1	null
2025-05-27	M3S-UPD: Efficient Multi-Stage Self-Supervised Learning for Fine-Grained Encrypted Traffic Classification with Unknown Pattern Discovery	Yali Yuan et.al.	2505.21462v1	null
2025-05-27	Active-O3: Empowering Multimodal Large Language Models with Active Perception via GRPO	Muzhi Zhu et.al.	2505.21457v1	null
2025-05-27	Leveraging Large Language Models for Bengali Math Word Problem Solving with Chain of Thought Reasoning	Bidyarthi Paul et.al.	2505.21354v1	null
2025-05-27	Breaking the Performance Ceiling in Complex Reinforcement Learning requires Inference Strategies	Felix Chalumeau et.al.	2505.21236v1	null
2025-05-27	Reason-Align-Respond: Aligning LLM Reasoning with Knowledge Graphs for KGQA	Xiangqing Shen et.al.	2505.20971v1	null
2025-05-27	Context-Aware Content Moderation for German Newspaper Comments	Felix Krejca et.al.	2505.20963v1	null
2025-05-27	In Context Learning with Vision Transformers: Case Study	Antony Zhao et.al.	2505.20872v1	null
2025-05-27	Respond to Change with Constancy: Instruction-tuning with LLM for Non-I.I.D. Network Traffic Classification	Xinjie Lin et.al.	2505.20866v1	null
2025-05-27	Cold-Start Recommendation with Knowledge-Guided Retrieval-Augmented Generation	Wooseong Yang et.al.	2505.20773v1	null
2025-05-26	ViTaPEs: Visuotactile Position Encodings for Cross-Modal Alignment in Multimodal Transformers	Fotios Lygerakis et.al.	2505.20032v1	null
2025-05-26	Correlating instruction-tuning (in multimodal models) with vision-language processing (in the brain)	Subba Reddy Oota et.al.	2505.20029v1	link
2025-05-26	ReasonPlan: Unified Scene Prediction and Decision Reasoning for Closed-loop Autonomous Driving	Xueyi Liu et.al.	2505.20024v1	link
2025-05-26	Multimodal Reasoning Agent for Zero-Shot Composed Image Retrieval	Rong-Cheng Tu et.al.	2505.19952v1	null
2025-05-26	Can Visual Encoder Learn to See Arrows?	Naoyuki Terashita et.al.	2505.19944v1	null
2025-05-26	Multi-Timescale Motion-Decoupled Spiking Transformer for Audio-Visual Zero-Shot Learning	Wenrui Li et.al.	2505.19938v1	null
2025-05-26	Zero-Shot Pseudo Labels Generation Using SAM and CLIP for Semi-Supervised Semantic Segmentation	Nagito Saito et.al.	2505.19846v1	null
2025-05-26	MLLM-Guided VLM Fine-Tuning with Joint Inference for Zero-Shot Composed Image Retrieval	Rong-Cheng Tu et.al.	2505.19707v1	null
2025-05-26	Graph Guided Diffusion: Unified Guidance for Conditional Graph Generation	Victor M. Tenorio et.al.	2505.19685v1	null
2025-05-26	Calibrating Pre-trained Language Classifiers on LLM-generated Noisy Labels via Iterative Refinement	Liqin Ye et.al.	2505.19675v1	link
2025-05-23	FDBPL: Faster Distillation-Based Prompt Learning for Region-Aware Vision-Language Models Adaptation	Zherui Zhang et.al.	2505.18053v1	null
2025-05-23	Contrastive Distillation of Emotion Knowledge from LLMs for Zero-Shot Emotion Recognition	Minxue Niu et.al.	2505.18040v1	link
2025-05-23	Clip4Retrofit: Enabling Real-Time Image Labeling on Edge Devices via Cross-Architecture CLIP Distillation	Li Zhong et.al.	2505.18039v1	null
2025-05-23	LLM assisted web application functional requirements generation: A case study of four popular LLMs over a Mess Management System	Rashmi Gupta et.al.	2505.18019v1	null
2025-05-23	Diffusion Classifiers Understand Compositionality, but Conditions Apply	Yujin Jeong et.al.	2505.17955v1	link
2025-05-23	VeriThinker: Learning to Verify Makes Reasoning Model Efficient	Zigeng Chen et.al.	2505.17941v1	link
2025-05-23	AutoMiSeg: Automatic Medical Image Segmentation via Test-Time Adaptation of Foundation Models	Xingjian Li et.al.	2505.17931v1	null
2025-05-23	NeuroTrails: Training with Dynamic Sparse Heads as the Key to Effective Ensembling	Bram Grooten et.al.	2505.17909v1	null
2025-05-23	BLAST: Balanced Sampling Time Series Corpus for Universal Forecasting Models	Zezhi Shao et.al.	2505.17871v1	link
2025-05-23	Discriminating Form and Meaning in Multilingual Models with Minimal-Pair ABX Tasks	Maureen de Seyssel et.al.	2505.17747v1	null
2025-05-22	CoMo: Learning Continuous Latent Motion from Internet Videos for Scalable Robot Learning	Jiange Yang et.al.	2505.17006v1	null
2025-05-22	Native Segmentation Vision Transformers	Guillem Brasó et.al.	2505.16993v1	null
2025-05-22	Know the Ropes: A Heuristic Strategy for LLM-based Multi-Agent System Design	Zhenkun Li et.al.	2505.16979v1	null
2025-05-22	Fixing Data That Hurts Performance: Cascading LLMs to Relabel Hard Negatives for Robust Information Retrieval	Nandan Thakur et.al.	2505.16967v1	null
2025-05-22	UAV See, UGV Do: Aerial Imagery and Virtual Teach Enabling Zero-Shot Ground Vehicle Repeat	Desiree Fisker et.al.	2505.16912v1	null
2025-05-22	T2I-ConBench: Text-to-Image Benchmark for Continual Post-training	Zhehao Huang et.al.	2505.16875v1	null
2025-05-22	Walk&Retrieve: Simple Yet Effective Zero-shot Retrieval-Augmented Generation via Knowledge Graph Walks	Martin Böckling et.al.	2505.16849v1	link
2025-05-22	LLM-Based Emulation of the Radio Resource Control Layer: Towards AI-Native RAN Protocols	Ziming Liu et.al.	2505.16821v1	null
2025-05-22	TRIM: Achieving Extreme Sparsity with Targeted Row-wise Iterative Metric-driven Pruning	Florentin Beck et.al.	2505.16743v1	link
2025-05-23	EZ-VC: Easy Zero-shot Any-to-Any Voice Conversion	Advait Joglekar et.al.	2505.16691v2	null
2025-05-21	Exploring The Visual Feature Space for Multimodal Neural Decoding	Weihao Xia et.al.	2505.15755v1	null
2025-05-21	From Grounding to Manipulation: Case Studies of Foundation Model Integration in Embodied Robotic Systems	Xiuchao Sui et.al.	2505.15685v1	link
2025-05-21	Exploring the Limits of Vision-Language-Action Manipulations in Cross-task Generalization	Jiaming Zhou et.al.	2505.15660v1	link
2025-05-21	Prompt Tuning Vision Language Models with Margin Regularizer for Few-Shot Learning under Distribution Shifts	Debarshi Brahma et.al.	2505.15506v1	link
2025-05-21	On the Generalization vs Fidelity Paradox in Knowledge Distillation	Suhas Kamasetty Ramesh et.al.	2505.15442v1	link
2025-05-21	Prosody-Adaptable Audio Codecs for Zero-Shot Voice Conversion via In-Context Learning	Junchuan Zhao et.al.	2505.15402v1	null
2025-05-21	Expanding Zero-Shot Object Counting with Rich Prompts	Huilin Zhu et.al.	2505.15398v1	null
2025-05-21	RAZER: Robust Accelerated Zero-Shot 3D Open-Vocabulary Panoptic Reconstruction with Spatio-Temporal Aggregation	Naman Patel et.al.	2505.15373v1	null
2025-05-21	Towards Zero-Shot Differential Morphing Attack Detection with Multimodal Large Language Models	Ria Shekhawat et.al.	2505.15332v1	null
2025-05-21	AgentThink: A Unified Framework for Tool-Augmented Chain-of-Thought Reasoning in Vision-Language Models for Autonomous Driving	Kangan Qian et.al.	2505.15298v1	null
2025-05-20	SAFEPATH: Preventing Harmful Reasoning in Chain-of-Thought via Early Alignment	Wonje Jeung et.al.	2505.14667v1	null
2025-05-20	Void in Language Models	Mani Shemiranifar et.al.	2505.14467v1	link
2025-05-20	Empowering LLMs in Task-Oriented Dialogues: A Domain-Independent Multi-Agent Framework and Fine-Tuning Strategy	Zihao Feng et.al.	2505.14299v1	null
2025-05-20	FuxiMT: Sparsifying Large Language Models for Chinese-Centric Multilingual Machine Translation	Shaolin Zhu et.al.	2505.14256v1	null
2025-05-20	UniVG-R1: Reasoning Guided Universal Visual Grounding with Reinforcement Learning	Sule Bai et.al.	2505.14231v1	null
2025-05-20	Beginning with You: Perceptual-Initialization Improves Vision-Language Representation and Alignment	Yang Hu et.al.	2505.14204v1	null
2025-05-20	LMP: Leveraging Motion Prior in Zero-Shot Video Generation with Diffusion Transformer	Changgu Chen et.al.	2505.14167v1	null
2025-05-20	Breaking Language Barriers or Reinforcing Bias? A Study of Gender and Racial Disparities in Multilingual Contrastive Vision Language Models	Zahraa Al Sahili et.al.	2505.14160v1	null
2025-05-20	AudSemThinker: Enhancing Audio-Language Models through Reasoning over Semantics of Sound	Gijs Wijngaard et.al.	2505.14142v1	link
2025-05-20	SeamlessEdit: Background Noise Aware Zero-Shot Speech Editing with in-Context Enhancement	Kuan-Yu Chen et.al.	2505.14066v1	null
2025-05-19	GraspMolmo: Generalizable Task-Oriented Grasping via Large-Scale Synthetic Data Generation	Abhay Deshpande et.al.	2505.13441v1	null
2025-05-19	FEALLM: Advancing Facial Emotion Analysis in Multimodal Large Language Models with Emotional Synergy and Reasoning	Zhuozhao Hu et.al.	2505.13419v1	link
2025-05-19	From Local Details to Global Context: Advancing Vision-Language Models with Attention-Based Selection	Lincan Cai et.al.	2505.13233v1	link
2025-05-20	StarFT: Robust Fine-tuning of Zero-shot Models via Spuriosity Alignment	Younghyun Kim et.al.	2505.13232v2	link
2025-05-19	True Zero-Shot Inference of Dynamical Systems Preserving Long-Term Statistics	Christoph Jürgen Hemmer et.al.	2505.13192v1	null
2025-05-19	Efficient Speech Language Modeling via Energy Distance in Continuous Latent Space	Zhengrui Ma et.al.	2505.13181v1	link
2025-05-19	A Case Study of Cross-Lingual Zero-Shot Generalization for Classical Languages in LLMs	V. S. D. S. Mahesh Akavarapu et.al.	2505.13173v1	link
2025-05-19	Zero-Shot Adaptation of Behavioral Foundation Models to Unseen Dynamics	Maksim Bobrin et.al.	2505.13150v1	link
2025-05-20	Zero-Shot Iterative Formalization and Planning in Partially Observable Environments	Liancheng Gong et.al.	2505.13126v2	link
2025-05-19	$μ$ PC: Scaling Predictive Coding to 100+ Layer Networks	Francesco Innocenti et.al.	2505.13124v1	link
2025-05-16	SurgPose: Generalisable Surgical Instrument Pose Estimation using Zero-Shot Learning and Stereo Vision	Utsav Rai et.al.	2505.11439v1	null
2025-05-16	Patho-R1: A Multimodal Reinforcement Learning-Based Pathology Expert Reasoner	Wenchuan Zhang et.al.	2505.11404v1	link
2025-05-16	Learning Multimodal AI Algorithms for Amplifying Limited User Input into High-dimensional Control Space	Ali Rabiee et.al.	2505.11366v1	link
2025-05-16	LegoSLM: Connecting LLM with Speech Encoder using CTC Posteriors	Rao Ma et.al.	2505.11352v1	null
2025-05-16	Context parroting: A simple but tough-to-beat baseline for foundation models in scientific machine learning	Yuanzhao Zhang et.al.	2505.11349v1	null
2025-05-16	Benchmarking Critical Questions Generation: A Challenging Reasoning Task for Large Language Models	Blanca Calvo Figueras et.al.	2505.11341v1	null
2025-05-19	Massive-STEPS: Massive Semantic Trajectories for Understanding POI Check-ins -- Dataset and Benchmarks	Wilson Wongso et.al.	2505.11239v2	link
2025-05-16	Feasibility with Language Models for Open-World Compositional Zero-Shot Learning	Jae Myung Kim et.al.	2505.11181v1	null
2025-05-16	Foundation Time-Series AI Model for Realized Volatility Forecasting	Anubha Goel et.al.	2505.11163v1	null
2025-05-16	$\mathcal{A}LLM4ADD$ : Unlocking the Capabilities of Audio Large Language Models for Audio Deepfake Detection	Hao Gu et.al.	2505.11079v1	null
2025-05-15	Depth Anything with Any Prior	Zehan Wang et.al.	2505.10565v1	null
2025-05-15	NVSPolicy: Adaptive Novel-View Synthesis for Generalizable Language-Conditioned Policy Learning	Le Shi et.al.	2505.10359v1	null
2025-05-15	MSCI: Addressing CLIP's Inherent Limitations for Compositional Zero-Shot Learning	Yue Wang et.al.	2505.10289v1	link
2025-05-15	Comparing LLM Text Annotation Skills: A Study on Human Rights Violations in Social Media Data	Poli Apollinaire Nemkova et.al.	2505.10260v1	link
2025-05-15	MMRL++: Parameter-Efficient and Interaction-Aware Representation Learning for Vision-Language Models	Yuncheng Guo et.al.	2505.10088v1	link
2025-05-15	Advanced Crash Causation Analysis for Freeway Safety: A Large Language Model Approach to Identifying Key Contributing Factors	Ahmed S. Abdelrahman et.al.	2505.09949v1	null
2025-05-14	Achieving Tokenizer Flexibility in Language Models through Heuristic Adaptation and Supertoken Learning	Shaurya Sharthak et.al.	2505.09738v1	link
2025-05-14	Unfettered Forceful Skill Acquisition with Physical Reasoning and Coordinate Frame Labeling	William Xie et.al.	2505.09731v1	null
2025-05-14	Denoising and Alignment: Rethinking Domain Generalization for Multimodal Face Anti-Spoofing	Yingjie Ma et.al.	2505.09484v1	null
2025-05-14	Endo-CLIP: Progressive Self-Supervised Pre-training on Raw Colonoscopy Records	Yili He et.al.	2505.09435v1	null
2025-05-14	MAKE: Multi-Aspect Knowledge-Enhanced Vision-Language Pretraining for Zero-shot Dermatological Assessment	Siyuan Yan et.al.	2505.09372v1	link
2025-05-14	Marigold: Affordable Adaptation of Diffusion-Based Image Generators for Image Analysis	Bingxin Ke et.al.	2505.09358v1	link
2025-05-14	MetaUAS: Universal Anomaly Segmentation with One-Prompt Meta-Learning	Bin-Bin Gao et.al.	2505.09265v1	null
2025-05-14	Zero-Shot Multi-modal Large Language Model v.s. Supervised Deep Learning: A Comparative Study on CT-Based Intracranial Hemorrhage Subtyping	Yinuo Wang et.al.	2505.09252v1	link
2025-05-14	Zero-shot Quantization: A Comprehensive Survey	Minjun Kim et.al.	2505.09188v1	null
2025-05-14	A Comparative Review of RNA Language Models	He Wang et.al.	2505.09087v1	null
2025-05-14	Human-like Cognitive Generalization for Large Models via Brain-in-the-loop Supervision	Jiaxuan Chen et.al.	2505.09085v1	null
2025-05-13	For GPT-4 as with Humans: Information Structure Predicts Acceptability of Long-Distance Dependencies	Nicole Cuneo et.al.	2505.09005v1	null
2025-05-13	SPAT: Sensitivity-based Multihead-attention Pruning on Time Series Forecasting Models	Suhan Guo et.al.	2505.08768v1	null
2025-05-13	NavDP: Learning Sim-to-Real Navigation Diffusion Policy with Privileged Information Guidance	Wenzhe Cai et.al.	2505.08712v1	null
2025-05-13	LLM-based Prompt Ensemble for Reliable Medical Entity Recognition from EHRs	K M Sajjadul Islam et.al.	2505.08704v1	null
2025-05-13	Augmented Reality for RObots (ARRO): Pointing Visuomotor Policies Towards Visual Robustness	Reihaneh Mirjalili et.al.	2505.08627v1	null
2025-05-13	Boosting Zero-shot Stereo Matching using Large-scale Mixed Images Sources in the Real World	Yuran Wang et.al.	2505.08607v1	null
2025-05-13	From Seeing to Doing: Bridging Reasoning and Decision for Robotic Manipulation	Yifu Yuan et.al.	2505.08548v1	link
2025-05-13	LCES: Zero-shot Automated Essay Scoring via Pairwise Comparisons Using Large Language Models	Takumi Shibata et.al.	2505.08498v1	null
2025-05-13	Large Language Models Meet Stance Detection: A Survey of Tasks, Methods, Applications, Challenges and Future Directions	Lata Pangtey et.al.	2505.08464v1	null
2025-05-13	Zero-Shot Sim-to-Real Reinforcement Learning for Fruit Harvesting	Emlyn Williams et.al.	2505.08458v1	null
2025-05-13	Visual Image Reconstruction from Brain Activity via Latent Representation	Yukiyasu Kamitani et.al.	2505.08429v1	null
2025-05-12	Beyond CLIP Generalization: Against Forward&Backward Forgetting Adapter for Continual Learning of Vision-Language Models	Songlin Dong et.al.	2505.07690v1	null
2025-05-12	Multimodal Survival Modeling in the Age of Foundation Models	Steven Song et.al.	2505.07683v1	link
2025-05-12	TACOS: Temporally-aligned Audio CaptiOnS for Language-Audio Pretraining	Paul Primus et.al.	2505.07609v1	null
2025-05-12	L-SWAG: Layer-Sample Wise Activation with Gradients information for Zero-Shot NAS on Vision Transformers	Sofia Casarin et.al.	2505.07300v1	null
2025-05-12	SAS-Bench: A Fine-Grained Benchmark for Evaluating Short Answer Scoring with Large Language Models	Peichao Lai et.al.	2505.07247v1	link
2025-05-11	A Vision-Language Foundation Model for Leaf Disease Identification	Khang Nguyen Quoc et.al.	2505.07019v1	link
2025-05-11	BridgeIV: Bridging Customized Image and Video Generation through Test-Time Autoregressive Identity Propagation	Panwen Hu et.al.	2505.06985v1	null
2025-05-11	Towards Artificial General or Personalized Intelligence? A Survey on Foundation Models for Personalized Federated Intelligence	Yu Qiao et.al.	2505.06907v1	null
2025-05-11	Image Classification Using a Diffusion Model as a Pre-Training Model	Kosuke Ukita et.al.	2505.06890v1	null
2025-05-10	Learning Graph Representation of Agent Diffuser	Youcef Djenouri et.al.	2505.06761v1	link
2025-05-09	Adapting a Segmentation Foundation Model for Medical Image Classification	Pengfei Gu et.al.	2505.06217v1	null
2025-05-09	Neuro-Symbolic Concepts	Jiayuan Mao et.al.	2505.06191v1	null
2025-05-09	MM-Skin: Enhancing Dermatology Vision-Language Model with an Image-Text Dataset Derived from Textbooks	Wenqi Zeng et.al.	2505.06152v1	link
2025-05-09	Can Prompting LLMs Unlock Hate Speech Detection across Languages? A Zero-shot and Few-shot Study	Faeze Ghorbanpour et.al.	2505.06149v1	null
2025-05-09	ELA-ZSON: Efficient Layout-Aware Zero-Shot Object Navigation Agent with Hierarchical Planning	Jiawei Hou et.al.	2505.06131v1	null
2025-05-12	LLMs Outperform Experts on Challenging Biology Benchmarks	Lennart Justen et.al.	2505.06108v2	null
2025-05-09	3D CAVLA: Leveraging Depth and 3D Context to Generalize Vision Language Action Models for Unseen Tasks	Vineet Bhat et.al.	2505.05800v1	null
2025-05-09	Towards Embodiment Scaling Laws in Robot Locomotion	Bo Ai et.al.	2505.05753v1	null
2025-05-08	scDrugMap: Benchmarking Large Foundation Models for Drug Response Prediction	Qing Wang et.al.	2505.05612v1	link
2025-05-08	KG-HTC: Integrating Knowledge Graphs into LLMs for Effective Zero-shot Hierarchical Text Classification	Qianbo Zang et.al.	2505.05583v1	link
2025-05-08	Mogao: An Omni Foundation Model for Interleaved Multi-Modal Generation	Chao Liao et.al.	2505.05472v1	null
2025-05-08	Hearing and Seeing Through CLIP: A Framework for Self-Supervised Sound Source Localization	Sooyoung Park et.al.	2505.05343v1	link
2025-05-09	FlexSpeech: Towards Stable, Controllable and Expressive Text-to-Speech	Linhan Ma et.al.	2505.05159v2	null
2025-05-08	CacheFL: Efficient Federated Cache Model Fine-Tuning for Vision-Language Models	Mengjun Yi et.al.	2505.05130v1	null
2025-05-08	Reliably Bounding False Positives: A Zero-Shot Machine-Generated Text Detection Framework via Multiscaled Conformal Prediction	Xiaowei Zhu et.al.	2505.05084v1	null
2025-05-08	FG-CLIP: Fine-Grained Visual and Textual Alignment	Chunyu Xie et.al.	2505.05071v1	link
2025-05-08	Performance Evaluation of Large Language Models in Bangla Consumer Health Query Summarization	Ajwad Abrar et.al.	2505.05070v1	null
2025-05-08	Split Matching for Inductive Zero-shot Semantic Segmentation	Jialei Chen et.al.	2505.05023v1	null
2025-05-08	The Pitfalls of Growing Group Complexity: LLMs and Social Choice-Based Aggregation for Group Recommendations	Cedric Waterschoot et.al.	2505.05016v1	null
2025-05-08	SpatialPrompting: Keyframe-driven Zero-Shot Spatial Reasoning with Off-the-Shelf Multimodal Large Language Models	Shun Taguchi et.al.	2505.04911v1	null
2025-05-07	Implicitly Aligning Humans and Autonomous Agents through Shared Task Abstractions	Stéphane Aroca-Ouellette et.al.	2505.04579v1	link
2025-05-07	Benchmarking LLMs' Swarm intelligence	Kai Ruan et.al.	2505.04364v1	link
2025-05-07	Neural Representational Consistency Emerges from Probabilistic Neural-Behavioral Representation Alignment	Yu Zhu et.al.	2505.04331v1	link
2025-05-07	Unmasking the Canvas: A Dynamic Benchmark for Image Generation Jailbreaking and LLM Content Safety	Variath Madhupal Gautham Nair et.al.	2505.04146v1	null
2025-05-07	Advancing Zero-shot Text-to-Speech Intelligibility across Diverse Domains via Preference Alignment	Xueyao Zhang et.al.	2505.04113v1	null
2025-05-06	Can Large Language Models Predict Parallel Code Performance?	Gregory Bolet et.al.	2505.03988v1	null
2025-05-06	Frog Soup: Zero-Shot, In-Context, and Sample-Efficient Frogger Agents	Xiang Li et.al.	2505.03947v1	link
2025-05-06	Fill the Gap: Quantifying and Reducing the Modality Gap in Image-Text Representation Learning	François Role et.al.	2505.03703v1	null
2025-05-06	CaRaFFusion: Improving 2D Semantic Segmentation with Camera-Radar Point Cloud Fusion and Zero-Shot Image Inpainting	Huawei Sun et.al.	2505.03679v1	null
2025-05-07	Breaking Annotation Barriers: Generalized Video Quality Assessment via Ranking-based Self-Supervision	Linhan Cao et.al.	2505.03631v2	link
2025-05-06	CXR-AD: Component X-ray Image Dataset for Industrial Anomaly Detection	Haoyu Bai et.al.	2505.03412v1	null
2025-05-06	Interpretable Zero-shot Learning with Infinite Class Concepts	Zihan Ye et.al.	2505.03361v1	null
2025-05-06	From Word to Sentence: A Large-Scale Multi-Instance Dataset for Open-Set Aerial Detection	Guoting Wei et.al.	2505.03334v1	null
2025-05-06	GraspVLA: a Grasping Foundation Model Pre-trained on Billion-scale Synthetic Action Data	Shengliang Deng et.al.	2505.03233v1	null
2025-05-06	Not All Parameters Matter: Masking Diffusion Models for Enhancing Generation Ability	Lei Wang et.al.	2505.03097v1	link
2025-05-05	Leveraging Protein Language Model Embeddings for Catalytic Turnover Prediction of Adenylate Kinase Orthologs in a Low-Data Regime	Duncan F. Muir et.al.	2505.03066v1	link
2025-05-05	Sim2Real Transfer for Vision-Based Grasp Verification	Pau Amargant et.al.	2505.03046v1	link
2025-05-05	Advancing Generalizable Tumor Segmentation with Anomaly-Aware Open-Vocabulary Attention Maps and Frozen Foundation Diffusion Models	Yankai Jiang et.al.	2505.02753v1	link
2025-05-06	Knowledge Graphs for Enhancing Large Language Models in Entity Disambiguation	Gerard Pons et.al.	2505.02737v2	null
2025-05-06	VGLD: Visually-Guided Linguistic Disambiguation for Monocular Depth Scale Recovery	Bojin Wu et.al.	2505.02704v2	link
2025-05-05	Tevatron 2.0: Unified Document Retrieval Toolkit across Scale, Language, and Modality	Xueguang Ma et.al.	2505.02466v1	link
2025-05-05	Recent Advances in Out-of-Distribution Detection with CLIP-Like Models: A Survey	Chaohua Li et.al.	2505.02448v1	null
2025-05-05	JTCSE: Joint Tensor-Modulus Constraints and Cross-Attention for Unsupervised Contrastive Learning of Sentence Embeddings	Tianyu Zong et.al.	2505.02366v1	link
2025-05-05	Advancing Email Spam Detection: Leveraging Zero-Shot Learning and Large Language Models	Ghazaleh Shirvani et.al.	2505.02362v1	link
2025-05-05	TeDA: Boosting Vision-Lanuage Models for Zero-Shot 3D Object Retrieval via Testing-time Distribution Alignment	Zhichuan Wang et.al.	2505.02325v1	link
2025-05-05	From Course to Skill: Evaluating LLM Performance in Curricular Analytics	Zhen Xu et.al.	2505.02324v1	link
2025-05-04	Compositional Image-Text Matching and Retrieval by Grounding Entities	Madhukar Reddy Vongala et.al.	2505.02278v1	null
2025-05-02	FalconWing: An Open-Source Platform for Ultra-Light Fixed-Wing Aircraft Research	Yan Miao et.al.	2505.01383v1	null
2025-05-05	Helping Large Language Models Protect Themselves: An Enhanced Filtering and Summarization System	Sheikh Samit Muhaimin et.al.	2505.01315v2	null
2025-05-02	CAV-MAE Sync: Improving Contrastive Audio-Visual Mask Autoencoders via Fine-Grained Alignment	Edson Araujo et.al.	2505.01237v1	link
2025-05-02	Zero-Shot Document-Level Biomedical Relation Extraction via Scenario-based Prompt Design in Two-Stage with LLM	Lei Zhao et.al.	2505.01077v1	null
2025-05-02	Multi-agents based User Values Mining for Recommendation	Lijian Chen et.al.	2505.00981v1	null
2025-05-01	ICQuant: Index Coding enables Low-bit LLM Quantization	Xinlin Li et.al.	2505.00850v1	null
2025-05-01	HMCF: A Human-in-the-loop Multi-Robot Collaboration Framework Based on Large Language Models	Zhaoxing Li et.al.	2505.00820v1	null
2025-05-01	Constructing an Optimal Behavior Basis for the Option Keyboard	Lucas N. Alegre et.al.	2505.00787v1	null
2025-05-01	Reasoning Capabilities and Invariability of Large Language Models	Alessandro Raganato et.al.	2505.00776v1	link
2025-05-01	Investigating Task Arithmetic for Zero-Shot Information Retrieval	Marco Braga et.al.	2505.00649v1	link
2025-05-01	Voice Cloning: Comprehensive Survey	Hussam Azzuni et.al.	2505.00579v1	null
2025-05-01	AI-Driven High-Resolution Cell Segmentation and Quantitative Analysis	Shuang Zhang et.al.	2505.00578v1	null
2025-05-01	DeCo: Task Decomposition and Skill Composition for Zero-Shot Generalization in Long-Horizon 3D Manipulation	Zixuan Chen et.al.	2505.00527v1	null
2025-05-01	Leveraging Pretrained Diffusion Models for Zero-Shot Part Assembly	Ruiyuan Zhang et.al.	2505.00426v1	null
2025-05-01	Perceptual Implications of Automatic Anonymization in Pathological Speech	Soroosh Tayebi Arasteh et.al.	2505.00409v1	null
2025-04-30	Investigating Zero-Shot Diagnostic Pathology in Vision-Language Models with Efficient Prompt Design	Vasudev Sharma et.al.	2505.00134v1	null
2025-04-30	Common3D: Self-Supervised Learning of 3D Morphable Models for Common Objects in Neural Feature Space	Leonhard Sommer et.al.	2504.21749v1	link
2025-04-30	Precision Where It Matters: A Novel Spike Aware Mixed-Precision Quantization Strategy for LLaMA-based Language Models	Lucas Maisonnave et.al.	2504.21553v1	null
2025-04-30	Synergy-CLIP: Extending CLIP with Multi-modal Integration for Robust Representation Learning	Sangyeon Cho et.al.	2504.21375v1	null
2025-04-30	Zero-Shot Super-Resolution from Unstructured Data Using a Transformer-Based Neural Operator for Urban Micrometeorology	Yuki Yasuda et.al.	2504.21361v1	link
2025-04-30	An Evaluation of a Visual Question Answering Strategy for Zero-shot Facial Expression Recognition in Still Images	Modesto Castrillón-Santana et.al.	2504.21309v1	null
2025-04-29	Graph Synthetic Out-of-Distribution Exposure with Large Language Models	Haoyan Xu et.al.	2504.21198v1	null
2025-04-29	Small or Large? Zero-Shot or Finetuned? Guiding Language Model Choice for Specialized Applications in Healthcare	Lovedeep Gondara et.al.	2504.21191v1	null
2025-04-29	GLIP-OOD: Zero-Shot Graph OOD Detection with Foundation Model	Haoyan Xu et.al.	2504.21186v1	null
2025-04-29	Efficient LLMs with AMP: Attention Heads and MLP Pruning	Leandro Giusti Mugnaini et.al.	2504.21174v1	null
2025-04-30	Trace-of-Thought Prompting: Investigating Prompt-Based Knowledge Distillation Through Question Decomposition	Tyler McDonald et.al.	2504.20946v2	null
2025-04-29	An Empirical Study on the Capability of LLMs in Decomposing Bug Reports	Zhiyuan Chen et.al.	2504.20911v1	null
2025-04-29	JaccDiv: A Metric and Benchmark for Quantifying Diversity of Generated Marketing Text in the Music Industry	Anum Afzal et.al.	2504.20849v1	null
2025-04-29	Using LLMs in Generating Design Rationale for Software Architecture Decisions	Xiyu Zhou et.al.	2504.20781v1	link
2025-04-29	In-Context Edit: Enabling Instructional Image Editing with In-Context Generation in Large Scale Diffusion Transformer	Zechuan Zhang et.al.	2504.20690v1	null
2025-04-29	Revisiting the MIMIC-IV Benchmark: Experiments Using Language Models for Electronic Health Records	Jesus Lovon et.al.	2504.20547v1	null
2025-04-29	MuRAL: A Multi-Resident Ambient Sensor Dataset Annotated with Natural Language for Activities of Daily Living	Xi Chen et.al.	2504.20505v1	null
2025-04-29	Fane at SemEval-2025 Task 10: Zero-Shot Entity Framing with Large Language Models	Enfa Fane et.al.	2504.20469v1	link
2025-04-29	Plant Disease Detection through Multimodal Large Language Models and Convolutional Neural Networks	Konstantinos I. Roumeliotis et.al.	2504.20419v1	null
2025-04-29	FourierSpecNet: Neural Collision Operator Approximation Inspired by the Fourier Spectral Method for Solving the Boltzmann Equation	Jae Yong Lee et.al.	2504.20408v1	null
2025-04-28	AutoJudge: Judge Decoding Without Manual Annotation	Roman Garipov et.al.	2504.20039v1	null
2025-04-28	DeeCLIP: A Robust and Generalizable Transformer-Based Framework for Detecting AI-Generated Images	Mamadou Keita et.al.	2504.19876v1	link
2025-04-28	NORA: A Small Open-Sourced Generalist Vision Language Action Model for Embodied Tasks	Chia-Yu Hung et.al.	2504.19854v1	null
2025-04-28	Foundation Model-Driven Framework for Human-Object Interaction Prediction with Segmentation Mask Integration	Juhan Park et.al.	2504.19847v1	null
2025-04-28	EcoWikiRS: Learning Ecological Representation of Satellite Images from Weak Supervision with Species Observations and Wikipedia	Valerie Zermatten et.al.	2504.19742v1	null
2025-04-28	Interactive Discovery and Exploration of Visual Bias in Generative Text-to-Image Models	Johannes Eschner et.al.	2504.19703v1	null
2025-04-28	SynergyAmodal: Deocclude Anything with Text Control	Xinyang Li et.al.	2504.19506v1	null
2025-04-28	Masked Point-Entity Contrast for Open-Vocabulary 3D Scene Understanding	Yan Wang et.al.	2504.19500v1	null
2025-04-28	EarthMapper: Visual Autoregressive Models for Controllable Bidirectional Satellite-Map Translation	Zhe Dong et.al.	2504.19432v1	null
2025-04-27	From Inductive to Deductive: LLMs-Based Qualitative Data Analysis in Requirements Engineering	Syed Tauhid Ullah Shah et.al.	2504.19384v1	link
2025-04-25	RSFR: A Coarse-to-Fine Reconstruction Framework for Diffusion Tensor Cardiac MRI with Semantic-Aware Refinement	Jiahao Huang et.al.	2504.18520v1	null
2025-04-25	Action-Minimization Meets Generative Modeling: Efficient Transition Path Sampling with the Onsager-Machlup Functional	Sanjeev Raja et.al.	2504.18506v1	null
2025-04-25	Unsupervised Visual Chain-of-Thought Reasoning via Preference Optimization	Kesen Zhao et.al.	2504.18397v1	link
2025-04-25	Leveraging Decoder Architectures for Learned Sparse Retrieval	Jingfen Qiao et.al.	2504.18151v1	null
2025-04-25	PropRAG: Guiding Retrieval with Beam Search over Proposition Paths	Jingjin Wang et.al.	2504.18070v1	null
2025-04-25	From Mapping to Composing: A Two-Stage Framework for Zero-shot Composed Image Retrieval	Yabing Wang et.al.	2504.17990v1	null
2025-04-24	Optimism, Expectation, or Sarcasm? Multi-Class Hope Speech Detection in Spanish and English	Sabur Butt et.al.	2504.17974v1	null
2025-04-24	The Fourth Monocular Depth Estimation Challenge	Anton Obukhov et.al.	2504.17787v1	null
2025-04-24	Beyond Labels: Zero-Shot Diabetic Foot Ulcer Wound Segmentation with Self-attention Diffusion Models and the Potential for Text-Guided Customization	Abderrachid Hamrani et.al.	2504.17628v1	null
2025-04-24	StereoMamba: Real-time and Robust Intraoperative Stereo Disparity Estimation via Long-range Spatial Dependencies	Xu Wang et.al.	2504.17401v1	null
2025-04-24	Physics-based super-resolved simulation of 3D elastic wave propagation adopting scalable Diffusion Transformer	Hugo Gabrielidis et.al.	2504.17308v1	null
2025-04-24	Demonstrating Berkeley Humanoid Lite: An Open-source, Accessible, and Customizable 3D-printed Humanoid Robot	Yufeng Chi et.al.	2504.17249v1	null
2025-04-24	Visual and textual prompts for enhancing emotion recognition in video	Zhifeng Wang et.al.	2504.17224v1	null
2025-04-23	Tokenization Matters: Improving Zero-Shot NER for Indic Languages	Priyaranjan Pattnayak et.al.	2504.16977v1	null
2025-04-23	Procedural Dataset Generation for Zero-Shot Stereo Matching	David Yan et.al.	2504.16930v1	null
2025-04-23	Zero-shot Sim-to-Real Transfer for Reinforcement Learning-based Visual Servoing of Soft Continuum Arms	Hsin-Jung Yang et.al.	2504.16916v1	null
2025-04-23	Exploring zero-shot structure-based protein fitness prediction	Arnav Sharma et.al.	2504.16886v1	null
2025-04-23	Improving Significant Wave Height Prediction Using Chronos Models	Yilin Zhai et.al.	2504.16834v1	null
2025-04-23	Decoupled Global-Local Alignment for Improving Compositional Understanding	Xiaoxing Hu et.al.	2504.16801v1	null
2025-04-23	FrogDogNet: Fourier frequency Retained visual prompt Output Guidance for Domain Generalization of CLIP in Remote Sensing	Hariseetharam Gunduboina et.al.	2504.16433v1	null
2025-04-24	Can Large Language Models Help Multimodal Language Analysis? MMLA: A Comprehensive Benchmark	Hanlei Zhang et.al.	2504.16427v2	link
2025-04-23	Less is More: Enhancing Structured Multi-Agent Reasoning via Quality-Guided Distillation	Jiahao Yuan et.al.	2504.16408v1	link
2025-04-22	CLIRudit: Cross-Lingual Information Retrieval of Scientific Documents	Francisco Valentini et.al.	2504.16264v1	null
2025-04-22	W-PCA Based Gradient-Free Proxy for Efficient Search of Lightweight Language Models	Shang Wang et.al.	2504.15983v1	link
2025-04-22	FreeGraftor: Training-Free Cross-Image Feature Grafting for Subject-Driven Text-to-Image Generation	Zebin Yao et.al.	2504.15958v1	link
2025-04-23	Language Models to Support Multi-Label Classification of Industrial Data	Waleed Abdeen et.al.	2504.15922v2	null
2025-04-22	Structure-Preserving Zero-Shot Image Editing via Stage-Wise Latent Injection in Diffusion Models	Dasol Jeong et.al.	2504.15723v1	null
2025-04-22	ZeroSlide: Is Zero-Shot Classification Adequate for Lifelong Learning in Whole-Slide Image Analysis in the Era of Pathology Vision-Language Foundation Models?	Doanh C. Bui et.al.	2504.15627v1	null
2025-04-22	Research on Navigation Methods Based on LLMs	Anlong Zhang et.al.	2504.15600v1	null
2025-04-22	LLM-based Semantic Augmentation for Harmful Content Detection	Elyas Meguellati et.al.	2504.15548v1	null
2025-04-21	From Reviews to Dialogues: Active Synthesis for Zero-Shot LLM-based Conversational Recommender System	Rohan Surana et.al.	2504.15476v1	null
2025-04-21	Manifold Induced Biases for Zero-shot and Few-shot Detection of Generated Images	Jonathan Brokman et.al.	2504.15470v1	link
2025-04-21	Tell Me What You Know About Sexism: Expert-LLM Interaction Strategies and Co-Created Definitions for Zero-Shot Sexism Detection	Myrthe Reuver et.al.	2504.15392v1	link
2025-04-21	Leveraging Language Models for Automated Patient Record Linkage	Mohammad Beheshti et.al.	2504.15261v1	null
2025-04-21	Zero-Shot, But at What Cost? Unveiling the Hidden Overhead of MILS's LLM-CLIP Framework for Image Captioning	Yassir Benhammou et.al.	2504.15199v1	null
2025-04-21	Think2SQL: Reinforce LLM Reasoning Capabilities for Text2SQL	Simone Papicchio et.al.	2504.15077v1	null
2025-04-22	Text-to-Decision Agent: Learning Generalist Policies from Natural Language Supervision	Shilin Zhang et.al.	2504.15046v2	null
2025-04-21	GenCLIP: Generalizing CLIP Prompts for Zero-shot Anomaly Detection	Donghyeong Kim et.al.	2504.14919v1	null
2025-04-21	Aligning Beam with Imbalanced Multi-modality: A Generative Federated Learning Approach	Jiahui Liang et.al.	2504.14835v1	null
2025-04-20	Med-2D SegNet: A Light Weight Deep Neural Network for Medical 2D Image Segmentation	Md. Sanaullah Chowdhury et.al.	2504.14715v1

Jianqiuer/Awesome6DPoseEstimation
