Usage instructions: here
Table of Contents
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2024-09-18 | Towards Global Localization using Multi-Modal Object-Instance Re-Identification | Aneesh Chavan et.al. | 2409.12002 | null |
2024-09-17 | Obfuscation Based Privacy Preserving Representations are Recoverable Using Neighborhood Information | Kunal Chelani et.al. | 2409.11536 | null |
2024-09-20 | TISIS : Trajectory Indexing for SImilarity Search | Sara Jarrad et.al. | 2409.11301 | null |
2024-09-17 | Improving the Efficiency of Visually Augmented Language Models | Paula Ontalvilla et.al. | 2409.11148 | null |
2024-09-17 | HGSLoc: 3DGS-based Heuristic Camera Pose Refinement | Zhongyan Niu et.al. | 2409.10925 | null |
2024-09-16 | SOLVR: Submap Oriented LiDAR-Visual Re-Localisation | Joshua Knights et.al. | 2409.10247 | null |
2024-09-16 | Garment Attribute Manipulation with Multi-level Attention | Vittorio Casula et.al. | 2409.10206 | null |
2024-09-16 | Algorithmic Behaviors Across Regions: A Geolocation Audit of YouTube Search for COVID-19 Misinformation between the United States and South Africa | Hayoung Jung et.al. | 2409.10168 | null |
2024-09-14 | Evaluating Pre-trained Convolutional Neural Networks and Foundation Models as Feature Extractors for Content-based Medical Image Retrieval | Amirreza Mahbod et.al. | 2409.09430 | link |
2024-09-12 | Structured Pruning for Efficient Visual Place Recognition | Oliver Grainge et.al. | 2409.07834 | null |
2024-09-10 | GeoCalib: Learning Single-image Calibration with Geometric Optimization | Alexander Veicht et.al. | 2409.06704 | link |
2024-09-10 | Weakly-supervised Camera Localization by Ground-to-satellite Image Registration | Yujiao Shi et.al. | 2409.06471 | link |
2024-09-10 | A Cross-Font Image Retrieval Network for Recognizing Undeciphered Oracle Bone Inscriptions | Zhicong Wu et.al. | 2409.06381 | null |
2024-09-09 | Referring Expression Generation in Visually Grounded Dialogue with Discourse-aware Comprehension Guiding | Bram Willemsen et.al. | 2409.05721 | link |
2024-09-09 | Open-World Dynamic Prompt and Continual Visual Representation Learning | Youngeun Kim et.al. | 2409.05312 | null |
2024-09-12 | Training-free ZS-CIR via Weighted Modality Fusion and Similarity | Ren-Di Wu et.al. | 2409.04918 | null |
2024-09-12 | Zero-Shot Whole Slide Image Retrieval in Histopathology Using Embeddings of Foundation Models | Saghir Alfasly et.al. | 2409.04631 | null |
2024-09-06 | Reprojection Errors as Prompts for Efficient Scene Coordinate Regression | Ting-Ru Liu et.al. | 2409.04178 | null |
2024-09-06 | Matched Filtering based LiDAR Place Recognition for Urban and Natural Environments | Therese Joseph et.al. | 2409.03998 | null |
2024-09-04 | Design and Evaluation of Camera-Centric Mobile Crowdsourcing Applications | Abby Stylianou et.al. | 2409.03012 | null |
2024-09-04 | NUDGE: Lightweight Non-Parametric Fine-Tuning of Embeddings for Retrieval | Sepanta Zeighami et.al. | 2409.02343 | link |
2024-09-03 | Optimizing CLIP Models for Image Retrieval with Maintained Joint-Embedding Alignment | Konstantin Schall et.al. | 2409.01936 | link |
2024-09-02 | A Review of Image Retrieval Techniques: Data Augmentation and Adversarial Learning Approaches | Kim Jinwoo et.al. | 2409.01219 | null |
2024-09-02 | Evidential Transformers for Improved Image Retrieval | Danilo Dordevic et.al. | 2409.01082 | null |
2024-09-05 | EgoHDM: An Online Egocentric-Inertial Human Motion Capture, Localization, and Dense Mapping System | Bonan Liu et.al. | 2409.00343 | null |
2024-09-04 | Augmented Reality without Borders: Achieving Precise Localization Without Maps | Albert Gassol Puigjaner et.al. | 2408.17373 | null |
2024-09-02 | RISSOLE: Parameter-efficient Diffusion Models via Block-wise Generation and Retrieval-Guidance | Avideep Mukherjee et.al. | 2408.17095 | null |
2024-08-29 | A compact neuromorphic system for ultra energy-efficient, on-device robot localization | Adam D. Hines et.al. | 2408.16754 | link |
2024-08-29 | UAV-Based Human Body Detector Selection and Fusion for Geolocated Saliency Map Generation | Piotr Rudol et.al. | 2408.16501 | null |
2024-08-29 | Rethinking Sparse Lexical Representations for Image Retrieval in the Age of Rising Multi-Modal Large Language Models | Kengo Nakata et.al. | 2408.16296 | null |
2024-08-28 | Temporal Attention for Cross-View Sequential Image Localization | Dong Yuan et.al. | 2408.15569 | null |
2024-08-27 | Snap and Diagnose: An Advanced Multimodal Retrieval System for Identifying Plant Diseases in the Wild | Tianqi Wei et.al. | 2408.14723 | null |
2024-08-25 | LowCLIP: Adapting the CLIP Model Architecture for Low-Resource Languages in Multimodal Image Retrieval Task | Ali Asgarov et.al. | 2408.13909 | null |
2024-08-15 | Cross-Modal Denoising: A Novel Training Paradigm for Enhancing Speech-Image Retrieval | Lifeng Zhou et.al. | 2408.13705 | null |
2024-08-15 | Coarse-to-fine Alignment Makes Better Speech-image Retrieval | Lifeng Zhou et.al. | 2408.13119 | null |
2024-08-22 | Geolocation Representation from Large Language Models are Generic Enhancers for Spatio-Temporal Learning | Junlin He et.al. | 2408.12116 | null |
2024-08-21 | FUSELOC: Fusing Global and Local Descriptors to Disambiguate 2D-3D Matching in Visual Localization | Son Tung Nguyen et.al. | 2408.12037 | link |
2024-08-21 | Visual Localization in 3D Maps: Comparing Point Cloud, Mesh, and NeRF Representations | Lintong Zhang et.al. | 2408.11966 | null |
2024-08-21 | UniFashion: A Unified Vision-Language Model for Multimodal Fashion Retrieval and Generation | Xiangyu Zhao et.al. | 2408.11305 | link |
2024-08-20 | GSLoc: Efficient Camera Pose Refinement via 3D Gaussian Splatting | Changkun Liu et.al. | 2408.11085 | null |
2024-08-19 | BrewCLIP: A Bifurcated Representation Learning Framework for Audio-Visual Retrieval | Zhenyu Lu et.al. | 2408.10383 | null |
2024-08-19 | Pose-GuideNet: Automatic Scanning Guidance for Fetal Head Ultrasound from Pose Estimation | Qianhui Men et.al. | 2408.09931 | null |
2024-08-23 | Fashion Image-to-Image Translation for Complementary Item Retrieval | Matteo Attimonelli et.al. | 2408.09847 | null |
2024-08-20 | MambaLoc: Efficient Camera Localisation via State Space Model | Jialu Wang et.al. | 2408.09680 | null |
2024-08-18 | Image-Based Geolocation Using Large Vision-Language Models | Yi Liu et.al. | 2408.09474 | null |
2024-08-15 | DM2RM: Dual-Mode Multimodal Ranking for Target Objects and Receptacles Based on Open-Vocabulary Instructions | Ryosuke Korekata et.al. | 2408.07910 | null |
2024-08-13 | PSM: Learning Probabilistic Embeddings for Multi-scale Zero-Shot Soundscape Mapping | Subash Khanal et.al. | 2408.07050 | null |
2024-08-13 | Cross-View Geolocalization and Disaster Mapping with Street-View and VHR Satellite Imagery: A Case Study of Hurricane IAN | Hao Li et.al. | 2408.06761 | link |
2024-08-13 | A Miniature Vision-Based Localization System for Indoor Blimps | Shicong Ma et.al. | 2408.06648 | null |
2024-08-10 | Cross-view image geo-localization with Panorama-BEV Co-Retrieval Network | Junyan Ye et.al. | 2408.05475 | link |
2024-08-09 | Spherical World-Locking for Audio-Visual Localization in Egocentric Videos | Heeseung Yun et.al. | 2408.05364 | null |
2024-08-06 | AMES: Asymmetric and Memory-Efficient Similarity Estimation for Instance-level Retrieval | Pavel Suma et.al. | 2408.03282 | null |
2024-08-06 | Stacking fault segregation imaging with analytical field ion microscopy | F. F. Morgado et.al. | 2408.03167 | null |
2024-08-05 | GAReT: Cross-view Video Geolocalization with Adapters and Auto-Regressive Transformers | Manu S Pillai et.al. | 2408.02840 | link |
2024-08-05 | CMR-Agent: Learning a Cross-Modal Agent for Iterative Image-to-Point Cloud Registration | Gongxin Yao et.al. | 2408.02394 | null |
2024-08-02 | On Validation of Search & Retrieval of Tissue Images in Digital Pathology | H. R. Tizhoosh et.al. | 2408.01570 | null |
2024-07-31 | VIPeR: Visual Incremental Place Recognition with Adaptive Mining and Lifelong Learning | Yuhang Ming et.al. | 2407.21416 | null |
2024-07-30 | Re-localization acceleration with Medoid Silhouette Clustering | Hongyi Zhang et.al. | 2407.20749 | null |
2024-07-26 | From 2D to 3D: AISG-SLA Visual Localization Challenge | Jialin Gao et.al. | 2407.18590 | null |
2024-07-24 | Revolutionizing Text-to-Image Retrieval as Autoregressive Token-to-Voken Generation | Yongqi Li et.al. | 2407.17274 | null |
2024-07-24 | Pose Estimation from Camera Images for Underwater Inspection | Luyuan Peng et.al. | 2407.16961 | null |
2024-07-23 | Masks and Manuscripts: Advancing Medical Pre-training with End-to-End Masking and Narrative Structuring | Shreyank N Gowda et.al. | 2407.16264 | null |
2024-07-22 | Leveraging Large Language Models to Geolocate Linguistic Variations in Social Media Posts | Davide Savarro et.al. | 2407.16047 | link |
2024-07-22 | RADA: Robust and Accurate Feature Learning with Domain Adaptation | Jingtai He et.al. | 2407.15791 | null |
2024-07-19 | Double-Layer Soft Data Fusion for Indoor Robot WiFi-Visual Localization | Yuehua Ding et.al. | 2407.14643 | null |
2024-07-18 | Enhancing Worldwide Image Geolocation by Ensembling Satellite-Based Ground-Level Attribute Predictors | Michael J. Bianco et.al. | 2407.13862 | null |
2024-07-18 | Visual Haystacks: Answering Harder Questions About Sets of Images | Tsung-Han Wu et.al. | 2407.13766 | link |
2024-07-17 | Towards Revisiting Visual Place Recognition for Joining Submaps in Multimap SLAM | Markus Weißflog et.al. | 2407.12408 | null |
2024-07-17 | GV-Bench: Benchmarking Local Feature Matching for Geometric Verification of Long-term Loop Closure Detection | Jingwen Yu et.al. | 2407.11736 | link |
2024-07-16 | EndoFinder: Online Image Retrieval for Explainable Colorectal Polyp Diagnosis | Ruijie Yang et.al. | 2407.11401 | null |
2024-07-15 | No Train, all Gain: Self-Supervised Gradients Improve Deep Frozen Representations | Walter Simoncini et.al. | 2407.10964 | link |
2024-07-15 | DINO Pre-training for Vision-based End-to-end Autonomous Driving | Shubham Juneja et.al. | 2407.10803 | null |
2024-07-15 | Addressing Image Hallucination in Text-to-Image Generation through Factual Image Retrieval | Youngsun Lim et.al. | 2407.10683 | null |
2024-07-15 | General algorithm of assigning raster features to vector maps at any resolution or scale | Nan Xu et.al. | 2407.10599 | null |
2024-07-15 | An evaluation of CNN models and data augmentation techniques in hierarchical localization of mobile robots | J. J. Cabrera et.al. | 2407.10596 | link |
2024-07-15 | An experimental evaluation of Siamese Neural Networks for robot localization using omnidirectional imaging in indoor environments | J. J. Cabrera et.al. | 2407.10536 | null |
2024-07-13 | IoT-LM: Large Multisensory Language Models for the Internet of Things | Shentong Mo et.al. | 2407.09801 | link |
2024-07-12 | Are They the Same Picture? Adapting Concept Bottleneck Models for Human-AI Collaboration in Image Retrieval | Vaibhav Balloli et.al. | 2407.08908 | link |
2024-07-11 | Improving Visual Place Recognition Based Robot Navigation Through Verification of Localization Estimates | Owen Claxton et.al. | 2407.08162 | link |
2024-07-12 | Lifelong Histopathology Whole Slide Image Retrieval via Distance Consistency Rehearsal | Xinyu Zhu et.al. | 2407.08153 | null |
2024-07-10 | Geospecific View Generation -- Geometry-Context Aware High-resolution Ground View Inference from Satellite Views | Ningli Xu et.al. | 2407.08061 | null |
2024-07-09 | LVLM-empowered Multi-modal Representation Learning for Visual Place Recognition | Teng Wang et.al. | 2407.06730 | null |
2024-07-09 | CEIA: CLIP-Based Event-Image Alignment for Open-World Event-Based Understanding | Wenhao Xu et.al. | 2407.06611 | null |
2024-07-08 | Geospatial Trajectory Generation via Efficient Abduction: Deployment for Independent Testing | Divyagna Bavikadi et.al. | 2407.06447 | null |
2024-07-08 | Tile Compression and Embeddings for Multi-Label Classification in GeoLifeCLEF 2024 | Anthony Miyaguchi et.al. | 2407.06326 | link |
2024-07-08 | Pseudo-triplet Guided Few-shot Composed Image Retrieval | Bohan Hou et.al. | 2407.06001 | null |
2024-07-09 | HyCIR: Boosting Zero-Shot Composed Image Retrieval with Synthetic Labels | Yingying Jiang et.al. | 2407.05795 | null |
2024-07-06 | Granular Privacy Control for Geolocation with Vision Language Models | Ethan Mendes et.al. | 2407.04952 | link |
2024-07-05 | Elevating All Zero-Shot Sketch-Based Image Retrieval Through Multimodal Prompt Learning | Mainak Singha et.al. | 2407.04207 | link |
2024-07-04 | Exploring Diachronic and Diatopic Changes in Dialect Continua: Tasks, Datasets and Challenges | Melis Çelikkol et.al. | 2407.04010 | null |
2024-07-04 | Visualizing Dialogues: Enhancing Image Selection through Dialogue Understanding with Large Language Models | Chang-Sheng Kao et.al. | 2407.03615 | link |
2024-07-04 | A Comprehensive Analysis of Real-World Accelerometer Data Quality in a Global Smartphone-based Seismic Network | Yawen Zhang et.al. | 2407.03570 | null |
2024-07-03 | Celeb-FBI: A Benchmark Dataset on Human Full Body Images and Age, Gender, Height and Weight Estimation using Deep Learning Approach | Pronay Debnath et.al. | 2407.03486 | null |
2024-07-02 | Close, But Not There: Boosting Geographic Distance Sensitivity in Visual Place Recognition | Sergio Izquierdo et.al. | 2407.02422 | link |
2024-07-01 | Freeview Sketching: View-Aware Fine-Grained Sketch-Based Image Retrieval | Aneeshan Sain et.al. | 2407.01810 | null |
2024-07-01 | Cross-Modal Attention Alignment Network with Auxiliary Text Description for zero-shot sketch-based image retrieval | Hanwen Su et.al. | 2407.00979 | null |
2024-07-01 | Dynamically Modulating Visual Place Recognition Sequence Length For Minimum Acceptable Performance Scenarios | Connor Malone et.al. | 2407.00863 | null |
2024-06-28 | Into the Unknown: Generating Geospatial Descriptions for New Environments | Tzuf Paz-Argaman et.al. | 2406.19967 | null |
2024-06-27 | PathAlign: A vision-language model for whole slide images in histopathology | Faruk Ahmed et.al. | 2406.19578 | null |
2024-07-05 | 360 in the Wild: Dataset for Depth Prediction and View Synthesis | Kibaek Park et.al. | 2406.18898 | null |
2024-06-27 | Zero-shot Composed Image Retrieval Considering Query-target Relationship Leveraging Masked Image-text Pairs | Huaying Zhang et.al. | 2406.18836 | null |
2024-06-26 | WV-Net: A foundation model for SAR WV-mode satellite imagery trained using contrastive self-supervised learning on 10 million images | Yannik Glaser et.al. | 2406.18765 | null |
2024-06-26 | View-Invariant Pixelwise Anomaly Detection in Multi-object Scenes with Adaptive View Synthesis | Subin Varghese et.al. | 2406.18012 | null |
2024-06-25 | Tell Me Where You Are: Multimodal LLMs Meet Place Recognition | Zonglin Lyu et.al. | 2406.17520 | null |
2024-06-23 | Breaking the Frame: Image Retrieval by Visual Overlap Prediction | Tong Wei et.al. | 2406.16204 | link |
2024-06-21 | Routes to a building or a room suited to the specific needs of users | Stéphanie Jean-Daubias et.al. | 2406.14923 | null |
2024-06-19 | Towards a multimodal framework for remote sensing image change retrieval and captioning | Roger Ferrod et.al. | 2406.13424 | link |
2024-06-19 | CLIP-Branches: Interactive Fine-Tuning for Text-Image Retrieval | Christian Lülf et.al. | 2406.13322 | link |
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2024-09-18 | Vista3D: Unravel the 3D Darkside of a Single Image | Qiuhong Shen et.al. | 2409.12193 | link |
2024-09-18 | LEMON: Localized Editing with Mesh Optimization and Neural Shaders | Furkan Mert Algan et.al. | 2409.12024 | null |
2024-09-18 | Intraoperative Registration by Cross-Modal Inverse Neural Rendering | Maximilian Fehrentz et.al. | 2409.11983 | null |
2024-09-18 | SRIF: Semantic Shape Registration Empowered by Diffusion-based Image Morphing and Flow Estimation | Mingze Sun et.al. | 2409.11682 | null |
2024-09-18 | Gradient-Driven 3D Segmentation and Affordance Transfer in Gaussian Splatting Using 2D Masks | Joji Joseph et.al. | 2409.11681 | link |
2024-09-17 | RenderWorld: World Model with Self-Supervised 3D Label | Ziyang Yan et.al. | 2409.11356 | null |
2024-09-17 | GS-Net: Generalizable Plug-and-Play 3D Gaussian Splatting Module | Yichen Zhang et.al. | 2409.11307 | null |
2024-09-17 | SplatFields: Neural Gaussian Splats for Sparse 3D and 4D Reconstruction | Marko Mihajlovic et.al. | 2409.11211 | null |
2024-09-17 | GLC-SLAM: Gaussian Splatting SLAM with Efficient Loop Closure | Ziheng Xu et.al. | 2409.10982 | null |
2024-09-17 | HGSLoc: 3DGS-based Heuristic Camera Pose Refinement | Zhongyan Niu et.al. | 2409.10925 | null |
2024-09-16 | Phys3DGS: Physically-based 3D Gaussian Splatting for Inverse Rendering | Euntae Choi et.al. | 2409.10335 | null |
2024-09-16 | BEINGS: Bayesian Embodied Image-goal Navigation with Gaussian Splatting | Wugang Meng et.al. | 2409.10216 | link |
2024-09-16 | SplatSim: Zero-Shot Sim2Real Transfer of RGB Manipulation Policies Using Gaussian Splatting | Mohammad Nomaan Qureshi et.al. | 2409.10161 | null |
2024-09-16 | Adaptive Segmentation-Based Initialization for Steered Mixture of Experts Image Regression | Yi-Hsin Li et.al. | 2409.10101 | null |
2024-09-16 | DENSER: 3D Gaussians Splatting for Scene Reconstruction of Dynamic Urban Environments | Mahmud A. Mohamad et.al. | 2409.10041 | link |
2024-09-15 | SAFER-Splat: A Control Barrier Function for Safe Navigation with Online Gaussian Splatting Maps | Timothy Chen et.al. | 2409.09868 | null |
2024-09-15 | MesonGS: Post-training Compression of 3D Gaussians via Efficient Attribute Transformation | Shuzhao Xie et.al. | 2409.09756 | null |
2024-09-14 | GEVO: Memory-Efficient Monocular Visual Odometry Using Gaussians | Dasong Gao et.al. | 2409.09295 | null |
2024-09-17 | A Diffusion Approach to Radiance Field Relighting using Multi-Illumination Synthesis | Yohan Poirier-Ginter et.al. | 2409.08947 | null |
2024-09-13 | AdR-Gaussian: Accelerating Gaussian Splatting with Adaptive Radius | Xinzhe Wang et.al. | 2409.08669 | null |
2024-09-13 | Dense Point Clouds Matter: Dust-GS for Scene Reconstruction from Sparse Viewpoints | Shan Chen et.al. | 2409.08613 | null |
2024-09-13 | CSS: Overcoming Pose and Scene Challenges in Crowd-Sourced 3D Gaussian Splatting | Runze Chen et.al. | 2409.08562 | null |
2024-09-12 | Robust Dual Gaussian Splatting for Immersive Human-centric Volumetric Videos | Yuheng Jiang et.al. | 2409.08353 | null |
2024-09-12 | FlashSplat: 2D to 3D Gaussian Splatting Segmentation Solved Optimally | Qiuhong Shen et.al. | 2409.08270 | link |
2024-09-12 | Thermal3D-GS: Physics-induced 3D Gaussians for Thermal Infrared Novel-view Synthesis | Qian Chen et.al. | 2409.08042 | link |
2024-09-12 | SwinGS: Sliding Window Gaussian Splatting for Volumetric Video Streaming with Arbitrary Length | Bangya Liu et.al. | 2409.07759 | null |
2024-09-11 | FaVoR: Features via Voxel Rendering for Camera Relocalization | Vincenzo Polizzi et.al. | 2409.07571 | null |
2024-09-11 | Self-Evolving Depth-Supervised 3D Gaussian Splatting from Rendered Stereo Pairs | Sadra Safadoust et.al. | 2409.07456 | null |
2024-09-11 | Hi3D: Pursuing High-Resolution Image-to-3D Generation with Video Diffusion Models | Haibo Yang et.al. | 2409.07452 | link |
2024-09-11 | Instant Facial Gaussians Translator for Relightable and Interactable Facial Rendering | Dafei Qin et.al. | 2409.07441 | null |
2024-09-11 | Single-View 3D Reconstruction via SO(2)-Equivariant Gaussian Sculpting Networks | Ruihan Xu et.al. | 2409.07245 | null |
2024-09-11 | ThermalGaussian: Thermal 3D Gaussian Splatting | Rongfeng Lu et.al. | 2409.07200 | null |
2024-09-10 | gsplat: An Open-Source Library for Gaussian Splatting | Vickie Ye et.al. | 2409.06765 | link |
2024-09-10 | GigaGS: Scaling up Planar-Based 3D Gaussians for Large Scene Surface Reconstruction | Junyi Chen et.al. | 2409.06685 | null |
2024-09-10 | Sources of Uncertainty in 3D Scene Reconstruction | Marcus Klasson et.al. | 2409.06407 | link |
2024-09-09 | Online 3D reconstruction and dense tracking in endoscopic videos | Michel Hayoz et.al. | 2409.06037 | link |
2024-09-09 | GASP: Gaussian Splatting for Physic-Based Simulations | Piotr Borycki et.al. | 2409.05819 | link |
2024-09-09 | Lagrangian Hashing for Compressed Neural Field Representations | Shrisudhan Govindarajan et.al. | 2409.05334 | null |
2024-09-12 | DreamMapping: High-Fidelity Text-to-3D Generation via Variational Distribution Mapping | Zeyu Cai et.al. | 2409.05099 | null |
2024-09-08 | GS-PT: Exploiting 3D Gaussian Splatting for Comprehensive Point Cloud Understanding via Self-supervised Learning | Keyi Liu et.al. | 2409.04963 | null |
2024-09-11 | Fisheye-GS: Lightweight and Extensible Gaussian Splatting Module for Fisheye Cameras | Zimu Liao et.al. | 2409.04751 | link |
2024-09-06 | GST: Precise 3D Human Body from a Single Image with Gaussian Splatting Transformers | Lorenza Prospero et.al. | 2409.04196 | null |
2024-09-06 | 3D-GP-LMVIC: Learning-based Multi-View Image Coding with 3D Gaussian Geometric Priors | Yujun Huang et.al. | 2409.04013 | link |
2024-09-05 | View-Invariant Policy Learning via Zero-Shot Novel View Synthesis | Stephen Tian et.al. | 2409.03685 | null |
2024-09-05 | LM-Gaussian: Boost Sparse-view 3D Gaussian Splatting with Large Model Priors | Hanyang Yu et.al. | 2409.03456 | null |
2024-09-05 | Optimizing 3D Gaussian Splatting for Sparse Viewpoint Scene Reconstruction | Shen Chen et.al. | 2409.03213 | null |
2024-09-04 | Human-VDM: Learning Single-Image 3D Human Gaussian Splatting from Video Diffusion Models | Zhibin Liu et.al. | 2409.02851 | link |
2024-09-04 | Object Gaussian for Monocular 6D Pose Estimation from Sparse Views | Luqing Luo et.al. | 2409.02581 | null |
2024-09-04 | GGS: Generalizable Gaussian Splatting for Lane Switching in Autonomous Driving | Huasong Han et.al. | 2409.02382 | null |
2024-09-03 | DynOMo: Online Point Tracking by Dynamic Online Monocular Gaussian Reconstruction | Jenny Seidenschwarz et.al. | 2409.02104 | null |
2024-09-03 | PRoGS: Progressive Rendering of Gaussian Splats | Brent Zoomers et.al. | 2409.01761 | null |
2024-09-03 | GaussianPU: A Hybrid 2D-3D Upsampling Framework for Enhancing Color Point Clouds via 3D Gaussian Splatting | Zixuan Guo et.al. | 2409.01581 | null |
2024-09-02 | Free-DyGS: Camera-Pose-Free Scene Reconstruction based on Gaussian Splatting for Dynamic Surgical Videos | Qian Li et.al. | 2409.01003 | null |
2024-09-06 | 3D Gaussian Splatting for Large-scale 3D Surface Reconstruction from Aerial Images | YuanZheng Wu et.al. | 2409.00381 | null |
2024-08-31 | UDGS-SLAM : UniDepth Assisted Gaussian Splatting for Monocular SLAM | Mostafa Mansour et.al. | 2409.00362 | null |
2024-08-30 | OG-Mapping: Octree-based Structured 3D Gaussians for Online Dense Mapping | Meng Wang et.al. | 2408.17223 | null |
2024-08-30 | 2DGH: 2D Gaussian-Hermite Splatting for High-quality Rendering and Better Geometry Reconstruction | Ruihan Yu et.al. | 2408.16982 | null |
2024-08-29 | ReconX: Reconstruct Any Scene from Sparse Views with Video Diffusion Model | Fangfu Liu et.al. | 2408.16767 | null |
2024-08-29 | OmniRe: Omni Urban Scene Reconstruction | Ziyu Chen et.al. | 2408.16760 | null |
2024-08-28 | Towards Realistic Example-based Modeling via 3D Gaussian Stitching | Xinyu Gao et.al. | 2408.15708 | null |
2024-09-05 | G-Style: Stylized Gaussian Splatting | Áron Samuel Kovács et.al. | 2408.15695 | null |
2024-08-27 | Drone-assisted Road Gaussian Splatting with Cross-view Uncertainty | Saining Zhang et.al. | 2408.15242 | link |
2024-08-27 | Learning-based Multi-View Stereo: A Survey | Fangjinhua Wang et.al. | 2408.15235 | null |
2024-08-27 | Robo-GS: A Physics Consistent Spatial-Temporal Model for Robotic Arm with Hybrid Representation | Haozhe Lou et.al. | 2408.14873 | null |
2024-08-27 | LapisGS: Layered Progressive 3D Gaussian Splatting for Adaptive Streaming | Yuang Shi et.al. | 2408.14823 | null |
2024-08-26 | Avatar Concept Slider: Manipulate Concepts In Your Human Avatar With Fine-grained Control | Yixuan He et.al. | 2408.13995 | null |
2024-08-26 | DynaSurfGS: Dynamic Surface Reconstruction with Planar-based Gaussian Splatting | Weiwei Cai et.al. | 2408.13972 | link |
2024-08-27 | Splatt3R: Zero-shot Gaussian Splatting from Uncalibrated Image Pairs | Brandon Smart et.al. | 2408.13912 | null |
2024-08-25 | TranSplat: Generalizable 3D Gaussian Splatting from Sparse Multi-View Images with Transformers | Chuanrui Zhang et.al. | 2408.13770 | null |
2024-08-25 | SceneDreamer360: Text-Driven 3D-Consistent Scene Generation with Panoramic Gaussian Splatting | Wenrui Li et.al. | 2408.13711 | link |
2024-08-23 | BiGS: Bidirectional Gaussian Primitives for Relightable 3D Gaussian Splatting | Zhenyuan Liu et.al. | 2408.13370 | null |
2024-08-23 | S4D: Streaming 4D Real-World Reconstruction with Gaussians and 3D Control Points | Bing He et.al. | 2408.13036 | link |
2024-08-23 | FLoD: Integrating Flexible Level of Detail into 3D Gaussian Splatting for Customizable Rendering | Yunji Seo et.al. | 2408.12894 | null |
2024-08-26 | GSFusion: Online RGB-D Mapping Where Gaussian Splatting Meets TSDF Fusion | Jiaxin Wei et.al. | 2408.12677 | link |
2024-08-22 | Subsurface Scattering for 3D Gaussian Splatting | Jan-Niklas Dihlmann et.al. | 2408.12282 | null |
2024-08-21 | Robust 3D Gaussian Splatting for Novel View Synthesis in Presence of Distractors | Paul Ungermann et.al. | 2408.11697 | link |
2024-08-22 | DeRainGS: Gaussian Splatting for Enhanced Scene Reconstruction in Rainy Environments | Shuhong Liu et.al. | 2408.11540 | null |
2024-08-21 | GaussianOcc: Fully Self-supervised and Efficient 3D Occupancy Estimation with Gaussian Splatting | Wanshui Gan et.al. | 2408.11447 | link |
2024-08-27 | Pano2Room: Novel View Synthesis from a Single Indoor Panorama | Guo Pu et.al. | 2408.11413 | link |
2024-08-20 | GSLoc: Efficient Camera Pose Refinement via 3D Gaussian Splatting | Changkun Liu et.al. | 2408.11085 | null |
2024-08-20 | ShapeSplat: A Large-scale Dataset of Gaussian Splats and Their Self-Supervised Pretraining | Qi Ma et.al. | 2408.10906 | null |
2024-08-20 | DEGAS: Detailed Expressions on Full-Body Gaussian Avatars | Zhijing Shao et.al. | 2408.10588 | null |
2024-08-20 | LoopSplat: Loop Closure by Registering 3D Gaussian Splats | Liyuan Zhu et.al. | 2408.10154 | link |
2024-08-19 | Implicit Gaussian Splatting with Efficient Multi-Level Tri-Plane Representation | Minye Wu et.al. | 2408.10041 | null |
2024-08-19 | SG-GS: Photo-realistic Animatable Human Avatars with Semantically-Guided Gaussian Splatting | Haoyu Zhao et.al. | 2408.09665 | null |
2024-08-20 | CHASE: 3D-Consistent Human Avatars with Sparse Inputs via Gaussian Splatting and Contrastive Learning | Haoyu Zhao et.al. | 2408.09663 | null |
2024-08-20 | Gaussian in the Dark: Real-Time View Synthesis From Inconsistent Dark Images Using Gaussian Splatting | Sheng Ye et.al. | 2408.09130 | link |
2024-08-16 | Correspondence-Guided SfM-Free 3D Gaussian Splatting for NVS | Wei Sun et.al. | 2408.08723 | null |
2024-08-16 | GS-ID: Illumination Decomposition on Gaussian Splatting via Diffusion Prior and Parametric Light Source Optimization | Kang Du et.al. | 2408.08524 | link |
2024-08-15 | WaterSplatting: Fast Underwater 3D Scene Reconstruction Using Gaussian Splatting | Huapeng Li et.al. | 2408.08206 | null |
2024-08-19 | FlashGS: Efficient 3D Gaussian Splatting for Large-scale and High-resolution Rendering | Guofeng Feng et.al. | 2408.07967 | link |
2024-08-14 | Progressive Radiance Distillation for Inverse Rendering with Gaussian Splatting | Keyang Ye et.al. | 2408.07595 | null |
2024-08-14 | 3D Gaussian Editing with A Single Image | Guan Luo et.al. | 2408.07540 | null |
2024-08-13 | SpectralGaussians: Semantic, spectral 3D Gaussian splatting for multi-spectral scene representation, visualization and analysis | Saptarshi Neil Sinha et.al. | 2408.06975 | null |
2024-08-13 | MAIR++: Improving Multi-view Attention Inverse Rendering with Implicit Lighting Representation | JunYong Choi et.al. | 2408.06707 | null |
2024-08-13 | HDRGS: High Dynamic Range Gaussian Splatting | Jiahao Wu et.al. | 2408.06543 | link |
2024-08-12 | Mipmap-GS: Let Gaussians Deform with Scale-specific Mipmap for Anti-aliasing Rendering | Jiameng Li et.al. | 2408.06286 | link |
2024-08-12 | Developing Smart MAVs for Autonomous Inspection in GPS-denied Constructions | Paoqiang Pan et.al. | 2408.06030 | null |
2024-08-12 | HeadGAP: Few-shot 3D Head Avatar via Generalizable Gaussian Priors | Xiaozheng Zheng et.al. | 2408.06019 | null |
2024-08-21 | Visual SLAM with 3D Gaussian Primitives and Depth Priors Enabling Novel View Synthesis | Zhongche Qu et.al. | 2408.05635 | null |
2024-08-09 | DreamCouple: Exploring High Quality Text-to-3D Generation Via Rectified Flow | Hangyu Li et.al. | 2408.05008 | null |
2024-08-14 | Self-augmented Gaussian Splatting with Structure-aware Masks for Sparse-view 3D Reconstruction | Lingbei Meng et.al. | 2408.04831 | null |
2024-08-06 | LumiGauss: High-Fidelity Outdoor Relighting with 2D Gaussian Splatting | Joanna Kaleta et.al. | 2408.04474 | link |
2024-08-08 | A Review of 3D Reconstruction Techniques for Deformable Tissues in Robotic Surgery | Mengya Xu et.al. | 2408.04426 | link |
2024-08-08 | InstantStyleGaussian: Efficient Art Style Transfer with 3D Gaussian Splatting | Xin-Yi Yu et.al. | 2408.04249 | null |
2024-08-07 | Towards Real-Time Gaussian Splatting: Accelerating 3DGS through Photometric SLAM | Yan Song Hu et.al. | 2408.03825 | null |
2024-08-07 | Compact 3D Gaussian Splatting for Static and Dynamic Radiance Fields | Joo Chan Lee et.al. | 2408.03822 | null |
2024-08-07 | 3iGS: Factorised Tensorial Illumination for 3D Gaussian Splatting | Zhe Jun Tang et.al. | 2408.03753 | link |
2024-08-07 | PRTGS: Precomputed Radiance Transfer of Gaussian Splats for Real-Time High-Quality Relighting | Yijia Guo et.al. | 2408.03538 | null |
2024-08-02 | A General Framework to Boost 3D GS Initialization for Text-to-3D Generation by Lexical Richness | Lutao Jiang et.al. | 2408.01269 | null |
2024-08-02 | Reality Fusion: Robust Real-time Immersive Mobile Robot Teleoperation with Volumetric Visual Data Fusion | Ke Li et.al. | 2408.01225 | link |
2024-08-07 | IG-SLAM: Instant Gaussian SLAM | F. Aykut Sarikamis et.al. | 2408.01126 | null |
2024-08-01 | LoopSparseGS: Loop Based Sparse-View Friendly Gaussian Splatting | Zhenyu Bao et.al. | 2408.00254 | null |
2024-07-31 | S-SYNTH: Knowledge-Based, Synthetic Generation of Skin Images | Andrea Kim et.al. | 2408.00191 | link |
2024-07-31 | Localized Gaussian Splatting Editing with Contextual Awareness | Hanyuan Xiao et.al. | 2408.00083 | null |
2024-07-31 | Expressive Whole-Body 3D Gaussian Avatar | Gyeongsik Moon et.al. | 2407.21686 | null |
2024-07-30 | A Comparative Study of Neural Surface Reconstruction for Scientific Visualization | Siyuan Yao et.al. | 2407.20868 | null |
2024-07-30 | SceneTeller: Language-to-3D Scene Generation | Başak Melis Öcal et.al. | 2407.20727 | null |
2024-07-29 | Registering Neural 4D Gaussians for Endoscopic Surgery | Yiming Huang et.al. | 2407.20213 | null |
2024-07-29 | Radiance Fields for Robotic Teleoperation | Maximum Wilder-Smith et.al. | 2407.20194 | link |
2024-07-26 | ScalingGaussian: Enhancing 3D Content Creation with Generative Gaussian Splatting | Shen Chen et.al. | 2407.19035 | null |
2024-07-25 | GaussianSR: High Fidelity 2D Gaussian Splatting for Arbitrary-Scale Image Super-Resolution | Jintong Hu et.al. | 2407.18046 | null |
2024-07-24 | 3D Gaussian Splatting: Survey, Technologies, Challenges, and Opportunities | Yanqi Bao et.al. | 2407.17418 | link |
2024-07-29 | DHGS: Decoupled Hybrid Gaussian Splatting for Driving Scene | Xi Shi et.al. | 2407.16600 | null |
2024-07-23 | HDRSplat: Gaussian Splatting for High Dynamic Range 3D Scene Reconstruction from Raw Images | Shreyas Singh et.al. | 2407.16503 | link |
2024-07-23 | Integrating Meshes and 3D Gaussians for Indoor Scene Reconstruction with SAM Mask Guidance | Jiyeop Kim et.al. | 2407.16173 | null |
2024-07-22 | 6DGS: 6D Pose Estimation from a Single Image and a 3D Gaussian Splatting Model | Matteo Bortolon et.al. | 2407.15484 | null |
2024-07-22 | Enhancement of 3D Gaussian Splatting using Raw Mesh for Photorealistic Recreation of Architectures | Ruizhe Wang et.al. | 2407.15435 | null |
2024-07-21 | HoloDreamer: Holistic 3D Panoramic World Generation from Text Descriptions | Haiyang Zhou et.al. | 2407.15187 | null |
2024-07-20 | Realistic Surgical Image Dataset Generation Based On 3D Gaussian Splatting | Tianle Zeng et.al. | 2407.14846 | null |
2024-07-19 | A Benchmark for Gaussian Splatting Compression and Quality Assessment Study | Qi Yang et.al. | 2407.14197 | link |
2024-07-19 | GaussianBeV: 3D Gaussian Representation meets Perception Models for BeV Segmentation | Florian Chabot et.al. | 2407.14108 | null |
2024-07-19 | DirectL: Efficient Radiance Fields Rendering for 3D Light Field Displays | Zongyuan Yang et.al. | 2407.14053 | null |
2024-07-18 | MaRINeR: Enhancing Novel Views by Matching Rendered Images with Nearby References | Lukas Bösiger et.al. | 2407.13745 | link |
2024-07-20 | Connecting Consistency Distillation to Score Distillation for Text-to-3D Generation | Zongrui Li et.al. | 2407.13584 | link |
2024-07-18 | EaDeblur-GS: Event assisted 3D Deblur Reconstruction with Gaussian Splatting | Yuchen Weng et.al. | 2407.13520 | null |
2024-07-17 | Generalizable Human Gaussians for Sparse View Synthesis | Youngjoong Kwon et.al. | 2407.12777 | link |
2024-07-17 | Splatfacto-W: A Nerfstudio Implementation of Gaussian Splatting for Unconstrained Photo Collections | Congrong Xu et.al. | 2407.12306 | null |
2024-07-16 | MVG-Splatting: Multi-View Guided Gaussian Splatting with Adaptive Quantile-Based Geometric Consistency Densification | Zhuoxiao Li et.al. | 2407.11840 | null |
2024-07-16 | Click-Gaussian: Interactive Segmentation to Any 3D Gaussians | Seokhun Choi et.al. | 2407.11793 | null |
2024-07-16 | SlingBAG: Sliding ball adaptive growth algorithm with differentiable radiation enables super-efficient iterative 3D photoacoustic image reconstruction | Shuang Li et.al. | 2407.11781 | null |
2024-07-16 | I |
Gwangtak Bae et.al. | 2407.11347 | null |
2024-07-16 | Ev-GS: Event-based Gaussian splatting for Efficient and Accurate Radiance Field Rendering | Jingqian Wu et.al. | 2407.11343 | null |
2024-07-16 | Gaussian Splatting LK | Liuyue Xie et.al. | 2407.11309 | null |
2024-07-15 | iHuman: Instant Animatable Digital Humans From Monocular Videos | Pramish Paudel et.al. | 2407.11174 | link |
2024-07-15 | Scaling 3D Reasoning with LMMs to Large Robot Mission Environments Using Datagraphs | W. J. Meijer et.al. | 2407.10743 | null |
2024-07-15 | Interactive Rendering of Relightable and Animatable Gaussian Avatars | Youyi Zhan et.al. | 2407.10707 | null |
2024-07-15 | ConTEXTure: Consistent Multiview Images to Texture | Jaehoon Ahn et.al. | 2407.10558 | null |
2024-07-16 | RecGS: Removing Water Caustic with Recurrent Gaussian Splatting | Tianyi Zhang et.al. | 2407.10318 | null |
2024-07-14 | 3DEgo: 3D Editing on the Go! | Umar Khalid et.al. | 2407.10102 | null |
2024-07-14 | SpikeGS: 3D Gaussian Splatting from Spike Streams with High-Speed Camera Motion | Jiyuan Zhang et.al. | 2407.10062 | null |
2024-07-13 | Self-supervised 3D Point Cloud Completion via Multi-view Adversarial Learning | Lintai Wu et.al. | 2407.09786 | link |
2024-07-13 | Textured-GS: Gaussian Splatting with Spatially Defined Color and Opacity | Zhentao Huang et.al. | 2407.09733 | null |
2024-07-12 | StyleSplat: 3D Object Style Transfer with Gaussian Splatting | Sahil Jain et.al. | 2407.09473 | null |
2024-07-11 | WildGaussians: 3D Gaussian Splatting in the Wild | Jonas Kulhanek et.al. | 2407.08447 | null |
2024-07-11 | Survey on Fundamental Deep Learning 3D Reconstruction Techniques | Yonge Bai et.al. | 2407.08137 | null |
2024-07-10 | Synthetic to Authentic: Transferring Realism to 3D Face Renderings for Boosting Face Recognition | Parsa Rahimi et.al. | 2407.07627 | null |
2024-07-10 | DuInNet: Dual-Modality Feature Interaction for Point Cloud Completion | Xinpu Liu et.al. | 2407.07374 | null |
2024-07-10 | MIGS: Multi-Identity Gaussian Splatting via Tensor Decomposition | Aggelina Chatziagapi et.al. | 2407.07284 | null |
2024-07-09 | Reference-based Controllable Scene Stylization with Gaussian Splatting | Yiqun Mei et.al. | 2407.07220 | null |
2024-07-10 | 3D Gaussian Ray Tracing: Fast Tracing of Particle Scenes | Nicolas Moenne-Loccoz et.al. | 2407.07090 | null |
2024-07-09 | HumanRefiner: Benchmarking Abnormal Human Generation and Refining with Coarse-to-fine Pose-Reversible Guidance | Guian Fang et.al. | 2407.06937 | link |
2024-07-07 | PICA: Physics-Integrated Clothed Avatar | Bo Peng et.al. | 2407.05324 | null |
2024-07-07 | GaussReg: Fast 3D Registration with Gaussian Splatting | Jiahao Chang et.al. | 2407.05254 | null |
2024-07-06 | SurgicalGaussian: Deformable 3D Gaussians for High-Fidelity Surgical Scene Reconstruction | Weixing Xie et.al. | 2407.05023 | null |
2024-07-05 | Gaussian Eigen Models for Human Heads | Wojciech Zielonka et.al. | 2407.04545 | null |
2024-07-12 | Segment Any 4D Gaussians | Shengxiang Ji et.al. | 2407.04504 | null |
2024-07-10 | GSD: View-Guided Gaussian Splatting Diffusion for 3D Reconstruction | Yuxuan Mu et.al. | 2407.04237 | null |
2024-07-04 | CRiM-GS: Continuous Rigid Motion-Aware Gaussian Splatting from Motion Blur Images | Junghe Lee et.al. | 2407.03923 | null |
2024-07-04 | PFGS: High Fidelity Point Cloud Rendering via Feature Splatting | Jiaxu Wang et.al. | 2407.03857 | link |
2024-07-04 | SpikeGS: Reconstruct 3D scene via fast-moving bio-inspired sensors | Yijia Guo et.al. | 2407.03771 | null |
2024-07-13 | VEGS: View Extrapolation of Urban Scenes in 3D Gaussian Splatting using Learned Priors | Sungwon Hwang et.al. | 2407.02945 | link |
2024-07-03 | Free-SurGS: SfM-Free 3D Gaussian Splatting for Surgical Scene Reconstruction | Jiaxin Guo et.al. | 2407.02918 | link |
2024-07-04 | AutoSplat: Constrained Gaussian Splatting for Autonomous Driving Scene Reconstruction | Mustafa Khan et.al. | 2407.02598 | null |
2024-07-02 | ScaleDreamer: Scalable Text-to-3D Synthesis with Asynchronous Score Distillation | Zhiyuan Ma et.al. | 2407.02040 | link |
2024-07-02 | TrAME: Trajectory-Anchored Multi-View Editing for Text-Guided 3D Gaussian Splatting Manipulation | Chaofan Luo et.al. | 2407.02034 | null |
2024-07-02 | Image-GS: Content-Adaptive Image Representation via 2D Gaussians | Yunxiang Zhang et.al. | 2407.01866 | null |
2024-07-01 | DRAGON: Drone and Ground Gaussian Splatting for 3D Building Reconstruction | Yujin Ham et.al. | 2407.01761 | null |
2024-07-01 | GaussianStego: A Generalizable Stenography Pipeline for Generative 3D Gaussians Splatting | Chenxin Li et.al. | 2407.01301 | null |
2024-07-01 | EndoSparse: Real-Time Sparse View Synthesis of Endoscopic Scenes using Gaussian Splatting | Chenxin Li et.al. | 2407.01029 | null |
2024-07-02 | RTGS: Enabling Real-Time Gaussian Splatting on Mobile Devices Using Efficiency-Guided Pruning and Foveated Rendering | Weikai Lin et.al. | 2407.00435 | link |
2024-06-29 | OccFusion: Rendering Occluded Humans with Generative Diffusion Priors | Adam Sun et.al. | 2407.00316 | null |
2024-06-28 | SpotlessSplats: Ignoring Distractors in 3D Gaussian Splatting | Sara Sabour et.al. | 2406.20055 | null |
2024-06-28 | EgoGaussian: Dynamic Scene Understanding from Egocentric Video with 3D Gaussian Splatting | Daiwei Zhang et.al. | 2406.19811 | null |
2024-06-27 | Lightweight Predictive 3D Gaussian Splats | Junli Cao et.al. | 2406.19434 | link |
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2024-09-16 | P2U-SLAM: A Monocular Wide-FoV SLAM System Based on Point Uncertainty and Pose Uncertainty | Yufan Zhang et.al. | 2409.10143 | null |
2024-09-13 | SLIM: Scalable and Lightweight LiDAR Mapping in Urban Environments | Zehuan Yu et.al. | 2409.08681 | null |
2024-09-13 | Intelligent LiDAR Navigation: Leveraging External Information and Semantic Maps with LLM as Copilot | Fujing Xie et.al. | 2409.08493 | link |
2024-09-20 | EPRecon: An Efficient Framework for Real-Time Panoptic 3D Reconstruction from Monocular Video | Zhen Zhou et.al. | 2409.01807 | link |
2024-09-02 | Robust Vehicle Localization and Tracking in Rain using Street Maps | Yu Xiang Tan et.al. | 2409.01038 | link |
2024-08-25 | COMPOSE: Comprehensive Portrait Shadow Editing | Andrew Hou et.al. | 2408.13922 | null |
2024-08-26 | GSFusion: Online RGB-D Mapping Where Gaussian Splatting Meets TSDF Fusion | Jiaxin Wei et.al. | 2408.12677 | link |
2024-08-21 | Visual Localization in 3D Maps: Comparing Point Cloud, Mesh, and NeRF Representations | Lintong Zhang et.al. | 2408.11966 | null |
2024-08-10 | TOPGN: Real-time Transparent Obstacle Detection using Lidar Point Cloud Intensity for Autonomous Robot Navigation | Kasun Weerakoon et.al. | 2408.05608 | null |
2024-08-07 | Dual-Modeling Decouple Distillation for Unsupervised Anomaly Detection | Xinyue Liu et.al. | 2408.03888 | null |
2024-08-01 | Enhancing Online Road Network Perception and Reasoning with Standard Definition Maps | Hengyuan Zhang et.al. | 2408.01471 | null |
2024-07-29 | A flexible framework for accurate LiDAR odometry, map manipulation, and localization | José Luis Blanco-Claraco et.al. | 2407.20465 | link |
2024-07-17 | Towards Revisiting Visual Place Recognition for Joining Submaps in Multimap SLAM | Markus Weißflog et.al. | 2407.12408 | null |
2024-06-27 | Efficient and Distributed Large-Scale 3D Map Registration using Tomographic Features | Halil Utku Unlu et.al. | 2406.19461 | link |
2024-07-21 | Voxel Map to Occupancy Map Conversion Using Free Space Projection for Efficient Map Representation for Aerial and Ground Robots | Scott Fredriksson et.al. | 2406.07270 | link |
2024-09-18 | RiskMap: A Unified Driving Context Representation for Autonomous Motion Planning in Urban Driving Environment | Ren Xin et.al. | 2406.04451 | null |
2024-06-04 | Multi-Scale Direction-Aware Network for Infrared Small Target Detection | Jinmiao Zhao et.al. | 2406.02037 | null |
2024-06-23 | W-Net: A Facial Feature-Guided Face Super-Resolution Network | Hao Liu et.al. | 2406.00676 | null |
2024-05-27 | CudaSIFT-SLAM: multiple-map visual SLAM for full procedure mapping in real human endoscopy | Richard Elvira et.al. | 2405.16932 | null |
2024-05-27 | Estimating Depth of Monocular Panoramic Image with Teacher-Student Model Fusing Equirectangular and Spherical Representations | Jingguo Liu et.al. | 2405.16858 | null |
2024-05-26 | Splat-SLAM: Globally Optimized RGB-only SLAM with 3D Gaussians | Erik Sandström et.al. | 2405.16544 | link |
2024-05-22 | Waverider: Leveraging Hierarchical, Multi-Resolution Maps for Efficient and Reactive Obstacle Avoidance | Victor Reijgwart et.al. | 2405.13617 | null |
2024-05-20 | RHAML: Rendezvous-based Hierarchical Architecture for Mutual Localization | Gaoming Chen et.al. | 2405.11726 | null |
2024-05-15 | Eulerian-Lagrangian Fluid Simulation on Particle Flow Maps | Junwei Zhou et.al. | 2405.09672 | link |
2024-05-09 | RoboHop: Segment-based Topological Map Representation for Open-World Visual Navigation | Sourav Garg et.al. | 2405.05792 | null |
2024-05-06 | 3D LiDAR Mapping in Dynamic Environments Using a 4D Implicit Neural Representation | Xingguang Zhong et.al. | 2405.03388 | link |
2024-05-14 | Multipath-based SLAM with Cooperation and Map Fusion in MIMO Systems | Erik Leitinger et.al. | 2405.02126 | null |
2024-04-29 | Mesh-based Photorealistic and Real-time 3D Mapping for Robust Visual Perception of Autonomous Underwater Vehicle | Jungwoo Lee et.al. | 2404.18395 | null |
2024-04-22 | "Where am I?" Scene Retrieval with Language | Jiaqi Chen et.al. | 2404.14565 | null |
2024-04-29 | Clio: Real-time Task-Driven Open-Set 3D Scene Graphs | Dominic Maggio et.al. | 2404.13696 | link |
2024-04-20 | AMMUNet: Multi-Scale Attention Map Merging for Remote Sensing Image Segmentation | Yang Yang et.al. | 2404.13408 | link |
2024-04-15 | Map-Relative Pose Regression for Visual Re-Localization | Shuai Chen et.al. | 2404.09884 | link |
2024-04-14 | VRS-NeRF: Visual Relocalization with Sparse Neural Radiance Field | Fei Xue et.al. | 2404.09271 | link |
2024-06-10 | PRISM-TopoMap: Online Topological Mapping with Place Recognition and Scan Matching | Kirill Muravyev et.al. | 2404.01674 | link |
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2024-08-28 | SLAM2REF: Advancing Long-Term Mapping with 3D LiDAR and Reference Map Integration for Precise 6-DoF Trajectory Estimation and Map Extension | Miguel Arturo Vega Torres et.al. | 2408.15948 | link |
2024-07-20 | Hybrid PHD-PMB Trajectory Smoothing Using Backward Simulation | Yuxuan Xia et.al. | 2407.14806 | null |
2024-07-17 | GV-Bench: Benchmarking Local Feature Matching for Geometric Verification of Long-term Loop Closure Detection | Jingwen Yu et.al. | 2407.11736 | link |
2024-07-14 | GLIM: 3D Range-Inertial Localization and Mapping with GPU-Accelerated Scan Matching Factors | Kenji Koide et.al. | 2407.10344 | link |
2024-07-28 | Ego-to-Exo: Interfacing Third Person Visuals from Egocentric Views in Real-time for Improved ROV Teleoperation | Adnan Abdullah et.al. | 2407.00848 | null |
2024-06-28 | Multi-UAVs end-to-end Distributed Trajectory Generation over Point Cloud Data | Antonio Marino et.al. | 2406.19742 | null |
2024-07-04 | ESI-GAL: EEG Source Imaging-based Kinematics Parameter Estimation for Grasp and Lift Task | Anant Jain et.al. | 2406.11500 | null |
2024-06-17 | SWCF-Net: Similarity-weighted Convolution and Local-global Fusion for Efficient Large-scale Point Cloud Semantic Segmentation | Zhenchao Lin et.al. | 2406.11441 | link |
2024-05-26 | Multi-Modal UAV Detection, Classification and Tracking Algorithm -- Technical Report for CVPR 2024 UG2 Challenge | Tianchen Deng et.al. | 2405.16464 | link |
2024-04-26 | SLAM for Indoor Mapping of Wide Area Construction Environments | Vincent Ress et.al. | 2404.17215 | null |
2024-04-19 | DeeperHistReg: Robust Whole Slide Images Registration Framework | Marek Wodzinski et.al. | 2404.14434 | link |
2024-04-26 | RegWSI: Whole Slide Image Registration using Combined Deep Feature- and Intensity-Based Methods: Winner of the ACROBAT 2023 Challenge | Marek Wodzinski et.al. | 2404.13108 | null |
2024-07-16 | RetailOpt: Opt-In, Easy-to-Deploy Trajectory Estimation from Smartphone Motion Data and Retail Facility Information | Ryo Yonetani et.al. | 2404.12548 | null |
2024-04-06 | Evaluation and Optimization of Positional Accuracy for Maritime Positioning Systems | Atilla Alpay Nalcaci et.al. | 2404.04593 | null |
2024-04-05 | Towards introspective loop closure in 4D radar SLAM | Maximilian Hilger et.al. | 2404.03940 | null |
2024-03-09 | Laser-to-Vehicle Extrinsic Calibration in Low-Observability Scenarios for Subsea Mapping | Thomas Hitchcox et.al. | 2402.14993 | null |
2024-02-22 | Secure Navigation using Landmark-based Localization in a GPS-denied Environment | Ganesh Sapkota et.al. | 2402.14280 | null |
2024-02-09 | Continuous-Time Radar-Inertial and Lidar-Inertial Odometry using a Gaussian Process Motion Prior | Keenan Burnett et.al. | 2402.06174 | link |
2024-01-26 | On the detection of alpha emission from a low-voltage DC deuterium discharge with palladium electrodes | Erik P. Ziehm et.al. | 2402.05117 | null |
2024-02-06 | MMAUD: A Comprehensive Multi-Modal Anti-UAV Dataset for Modern Miniature Drone Threats | Shenghai Yuan et.al. | 2402.03706 | link |
2024-02-01 | Continuous-time Trajectory Estimation: A Comparative Study Between Gaussian Process and Spline-based Approaches | Jacob Johnson et.al. | 2402.00399 | null |
2024-01-30 | ATPPNet: Attention based Temporal Point cloud Prediction Network | Kaustab Pal et.al. | 2401.17399 | null |
2024-01-23 | A BFF-Based Attention Mechanism for Trajectory Estimation in mmWave MIMO Communications | Mohammad Shamsesalehi et.al. | 2401.13059 | null |
2024-01-05 | Partition-based Nonrigid Registration for 3D Face Model | Yuping Ye et.al. | 2401.02607 | null |
Publish Date | Title | Authors | Code | |
---|---|---|---|---|
2024-09-06 | Matched Filtering based LiDAR Place Recognition for Urban and Natural Environments | Therese Joseph et.al. | 2409.03998 | null |
2024-08-26 | Narrowing your FOV with SOLiD: Spatially Organized and Lightweight Global Descriptor for FOV-constrained LiDAR Place Recognition | Hogyun Kim et.al. | 2408.07330 | link |
2024-07-30 | SALSA: Swift Adaptive Lightweight Self-Attention for Enhanced LiDAR Place Recognition | Raktim Gautam Goswami et.al. | 2407.08260 | link |
2024-03-21 | VXP: Voxel-Cross-Pixel Large-scale Image-LiDAR Place Recognition | Yun-Jin Li et.al. | 2403.14594 | null |
2024-08-30 | Evaluation and Deployment of LiDAR-based Place Recognition in Dense Forests | Haedam Oh et.al. | 2403.14326 | null |
2024-03-04 | Map-aided annotation for pole base detection | Benjamin Missaoui et.al. | 2403.01868 | null |
2024-02-25 | VOLoc: Visual Place Recognition by Querying Compressed Lidar Map | Xudong Cai et.al. | 2402.15961 | link |
2024-03-19 | HeLiPR: Heterogeneous LiDAR Dataset for inter-LiDAR Place Recognition under Spatiotemporal Variations | Minwoo Jung et.al. | 2309.14590 | null |
2023-11-23 | Pose-Graph Attentional Graph Neural Network for Lidar Place Recognition | Milad Ramezani et.al. | 2309.00168 | link |
2023-08-24 | VNI-Net: Vector Neurons-based Rotation-Invariant Descriptor for LiDAR Place Recognition | Gengxuan Tian et.al. | 2308.12870 | null |
2023-11-29 | GeoAdapt: Self-Supervised Test-Time Adaptation in LiDAR Place Recognition Using Geometric Priors | Joshua Knights et.al. | 2308.04638 | null |
2023-05-29 | TReR: A Lightweight Transformer Re-Ranking Approach for 3D LiDAR Place Recognition | Tiago Barros et.al. | 2305.18013 | null |
2023-06-14 | CCL: Continual Contrastive Learning for LiDAR Place Recognition | Jiafeng Cui et.al. | 2303.13952 | link |
2023-03-02 | Wild-Places: A Large-Scale Dataset for Lidar Place Recognition in Unstructured Natural Environments | Joshua Knights et.al. | 2211.12732 | link |
2022-10-25 | MidasTouch: Monte-Carlo inference over distributions across sliding touch | Sudharshan Suresh et.al. | 2210.14210 | link |
2023-03-06 | Spectral Geometric Verification: Re-Ranking Point Cloud Retrieval for Metric Localization | Kavisha Vidanapathirana et.al. | 2210.04432 | link |
2023-07-12 | Uncertainty-Aware Lidar Place Recognition in Novel Environments | Keita Mason et.al. | 2210.01361 | link |
2022-11-29 | InCloud: Incremental Learning for Point Cloud Place Recognition | Joshua Knights et.al. | 2203.00807 | link |
2021-12-27 | MinkLoc3D-SI: 3D LiDAR place recognition with sparse convolutions, spherical coordinates, and intensity | Kamil Żywanowski et.al. | 2112.06539 | link |
2022-01-16 | BVMatch: Lidar-based Place Recognition Using Bird's-eye View Images | Lun Luo et.al. | 2109.00317 | link |
2022-08-28 | RPR-Net: A Point Cloud-based Rotation-aware Large Scale Place Recognition Network | Zhaoxin Fan et.al. | 2108.12790 | null |
2023-01-04 | AttDLNet: Attention-based DL Network for 3D LiDAR Place Recognition | Tiago Barros et.al. | 2106.09637 | link |
2020-07-17 | DH3D: Deep Hierarchical 3D Descriptors for Robust Large-Scale 6DoF Relocalization | Juan Du et.al. | 2007.09217 | link |