File Download

There are no files associated with this item.

  Links for fulltext
     (May Require Subscription)
Supplementary

Article: iBTC: An Image-Assisting Binary and Triangle Combined Descriptor for Place Recognition by Fusing LiDAR and Camera Measurements

TitleiBTC: An Image-Assisting Binary and Triangle Combined Descriptor for Place Recognition by Fusing LiDAR and Camera Measurements
Authors
Keywordsdescriptor
LiDAR SLAM
loop detection
multimodal
Place recognition
visual SLAM
Issue Date1-Dec-2024
PublisherInstitute of Electrical and Electronics Engineers
Citation
IEEE Robotics and Automation Letters, 2024, v. 9, n. 12, p. 10858-10865 How to Cite?
Abstract

In this work, we introduce a novel multimodal descriptor, the image-assisting binary and triangle combined (iBTC) descriptor, which fuses LiDAR (Light Detection and Ranging) and camera measurements for 3D place recognition. The inherent invariance of a triangle to rigid transformations inspires us to design triangle-based descriptors. We first extract distinct 3D key points from both LiDAR and camera measurements and organize them into triplets to form triangles. By utilizing the lengths of the sides of these triangles, we can create triangle descriptors, enabling the rapid retrieval of similar triangles from a database. By encoding the geometric and visual details at the triangle vertices into binary descriptors, we augment the triangle descriptors with richer local information. This enrichment process empowers our descriptors to reject mis-matched triangle pairs. Consequently, the remaining matched triangle pairs yield accurate loop closure place indices and relative poses. In our experiments, we conduct a thorough comparison of our proposed method with several SOTA methods across public and self-collected datasets. The results demonstrate that our method exhibits superior performance in place recognition and overcomes the limitations associated with the unimodal methods like BTC, RING++, ORB-DBoW2, and NetVLAD. Additionally, we perform a time cost benchmark experiment and the result indicates that our method's time consumption is reasonable, compared with baseline methods.


Persistent Identifierhttp://hdl.handle.net/10722/354543
ISSN
2023 Impact Factor: 4.6
2023 SCImago Journal Rankings: 2.119

 

DC FieldValueLanguage
dc.contributor.authorZou, Zuhao-
dc.contributor.authorZheng, Chunran-
dc.contributor.authorYuan, Chongjian-
dc.contributor.authorZhou, Shunbo-
dc.contributor.authorXue, Kaiwen-
dc.contributor.authorZhang, Fu-
dc.date.accessioned2025-02-13T00:35:14Z-
dc.date.available2025-02-13T00:35:14Z-
dc.date.issued2024-12-01-
dc.identifier.citationIEEE Robotics and Automation Letters, 2024, v. 9, n. 12, p. 10858-10865-
dc.identifier.issn2377-3766-
dc.identifier.urihttp://hdl.handle.net/10722/354543-
dc.description.abstract<p>In this work, we introduce a novel multimodal descriptor, the image-assisting binary and triangle combined (iBTC) descriptor, which fuses LiDAR (Light Detection and Ranging) and camera measurements for 3D place recognition. The inherent invariance of a triangle to rigid transformations inspires us to design triangle-based descriptors. We first extract distinct 3D key points from both LiDAR and camera measurements and organize them into triplets to form triangles. By utilizing the lengths of the sides of these triangles, we can create triangle descriptors, enabling the rapid retrieval of similar triangles from a database. By encoding the geometric and visual details at the triangle vertices into binary descriptors, we augment the triangle descriptors with richer local information. This enrichment process empowers our descriptors to reject mis-matched triangle pairs. Consequently, the remaining matched triangle pairs yield accurate loop closure place indices and relative poses. In our experiments, we conduct a thorough comparison of our proposed method with several SOTA methods across public and self-collected datasets. The results demonstrate that our method exhibits superior performance in place recognition and overcomes the limitations associated with the unimodal methods like BTC, RING++, ORB-DBoW2, and NetVLAD. Additionally, we perform a time cost benchmark experiment and the result indicates that our method's time consumption is reasonable, compared with baseline methods.<br></p>-
dc.languageeng-
dc.publisherInstitute of Electrical and Electronics Engineers-
dc.relation.ispartofIEEE Robotics and Automation Letters-
dc.subjectdescriptor-
dc.subjectLiDAR SLAM-
dc.subjectloop detection-
dc.subjectmultimodal-
dc.subjectPlace recognition-
dc.subjectvisual SLAM-
dc.titleiBTC: An Image-Assisting Binary and Triangle Combined Descriptor for Place Recognition by Fusing LiDAR and Camera Measurements -
dc.typeArticle-
dc.identifier.doi10.1109/LRA.2024.3483032-
dc.identifier.scopuseid_2-s2.0-85207338911-
dc.identifier.volume9-
dc.identifier.issue12-
dc.identifier.spage10858-
dc.identifier.epage10865-
dc.identifier.eissn2377-3766-
dc.identifier.issnl2377-3766-

Export via OAI-PMH Interface in XML Formats


OR


Export to Other Non-XML Formats