Conference Paper: Prototype-Voxel Contrastive Learning for LiDAR Point Cloud Panoptic Segmentation
Title | Prototype-Voxel Contrastive Learning for LiDAR Point Cloud Panoptic Segmentation |
---|---|
Authors | Liu, Minzhe; Zhou, Qiang; Zhao, Hengshuang; Li, Jianing; Du, Yuan; Keutzer, Kurt; Du, Li; Zhang, Shanghang |
Issue Date | 2022 |
Citation | Proceedings - IEEE International Conference on Robotics and Automation, 2022, p. 9243-9250 |
Abstract | LiDAR point cloud panoptic segmentation, comprising both semantic and instance segmentation, plays a critical role in fine-grained scene understanding for autonomous driving. Existing 3D voxelized approaches either rely on 3D sparse convolution, which captures only local scene structure, or add an extra, time-consuming PointNet branch to capture global feature structure. To address these limitations, we propose an end-to-end Prototype-Voxel Contrastive Learning (PVCL) framework for learning stable and discriminative semantic representations, which includes voxel-level and prototype-level contrastive learning (CL). The voxel-level CL decreases intra-class distance and increases inter-class distance among sample representations, while the prototype-level CL further reduces the dependence of CL on negative sampling and avoids the influence of outliers from the same class, making PVCL more effective for outdoor point cloud panoptic segmentation. Extensive experiments on the public point cloud panoptic segmentation datasets Semantic-KITTI and nuScenes, including evaluations and ablation studies, demonstrate that PVCL outperforms the state of the art. Our approach ranked first on the public leaderboard of Semantic-KITTI at the time of submission, surpassing the published second-ranked method, EfficientLPS, by 1.7% in PQ. |
Persistent Identifier | http://hdl.handle.net/10722/333550 |
ISSN | 1050-4729 (2023 SCImago Journal Rankings: 1.620) |
ISI Accession Number ID | WOS:000941277601135 |
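The abstract describes the two contrastive terms only at a high level. As an illustration only (not the authors' implementation, whose details are in the paper), a minimal NumPy sketch of a supervised voxel-level contrastive loss (same-class voxels as positives, different-class voxels as negatives) and a prototype-level loss (each voxel pulled toward its class-mean prototype, with other prototypes as the only negatives) might look like this; all function names and the temperature value are assumptions for the sketch:

```python
import numpy as np

def l2_normalize(x, eps=1e-8):
    # Project embeddings onto the unit sphere so dot products are cosine similarities.
    return x / (np.linalg.norm(x, axis=-1, keepdims=True) + eps)

def voxel_contrastive_loss(feats, labels, temperature=0.1):
    """Supervised InfoNCE over voxel embeddings: voxels of the same class
    are positives; all other voxels serve as negatives."""
    z = l2_normalize(feats)
    n = len(labels)
    sim = z @ z.T / temperature
    not_self = ~np.eye(n, dtype=bool)
    pos = (labels[:, None] == labels[None, :]) & not_self
    # Log-softmax over all other voxels (self excluded from the denominator).
    logits = sim - sim.max(axis=1, keepdims=True)
    denom = (np.exp(logits) * not_self).sum(axis=1, keepdims=True)
    log_prob = logits - np.log(denom)
    pos_counts = pos.sum(axis=1)
    valid = pos_counts > 0  # skip voxels with no same-class partner in the batch
    return (-(log_prob * pos).sum(axis=1)[valid] / pos_counts[valid]).mean()

def prototype_contrastive_loss(feats, labels, temperature=0.1):
    """Prototype-level CL: contrast each voxel against class-mean prototypes,
    removing the need for per-sample negative mining and damping outliers."""
    z = l2_normalize(feats)
    classes = np.unique(labels)
    protos = l2_normalize(np.stack([z[labels == c].mean(axis=0) for c in classes]))
    sim = z @ protos.T / temperature
    target = np.searchsorted(classes, labels)  # index of each voxel's own prototype
    logits = sim - sim.max(axis=1, keepdims=True)
    log_prob = logits - np.log(np.exp(logits).sum(axis=1, keepdims=True))
    return -log_prob[np.arange(len(labels)), target].mean()
```

Because the prototype loss contrasts against one vector per class rather than against every other voxel, its cost grows with the number of classes rather than the batch size, which matches the abstract's claim of reduced dependence on negative sampling.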
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Liu, Minzhe | - |
dc.contributor.author | Zhou, Qiang | - |
dc.contributor.author | Zhao, Hengshuang | - |
dc.contributor.author | Li, Jianing | - |
dc.contributor.author | Du, Yuan | - |
dc.contributor.author | Keutzer, Kurt | - |
dc.contributor.author | Du, Li | - |
dc.contributor.author | Zhang, Shanghang | - |
dc.date.accessioned | 2023-10-06T05:20:24Z | - |
dc.date.available | 2023-10-06T05:20:24Z | - |
dc.date.issued | 2022 | - |
dc.identifier.citation | Proceedings - IEEE International Conference on Robotics and Automation, 2022, p. 9243-9250 | - |
dc.identifier.issn | 1050-4729 | - |
dc.identifier.uri | http://hdl.handle.net/10722/333550 | - |
dc.description.abstract | LiDAR point cloud panoptic segmentation, comprising both semantic and instance segmentation, plays a critical role in fine-grained scene understanding for autonomous driving. Existing 3D voxelized approaches either rely on 3D sparse convolution, which captures only local scene structure, or add an extra, time-consuming PointNet branch to capture global feature structure. To address these limitations, we propose an end-to-end Prototype-Voxel Contrastive Learning (PVCL) framework for learning stable and discriminative semantic representations, which includes voxel-level and prototype-level contrastive learning (CL). The voxel-level CL decreases intra-class distance and increases inter-class distance among sample representations, while the prototype-level CL further reduces the dependence of CL on negative sampling and avoids the influence of outliers from the same class, making PVCL more effective for outdoor point cloud panoptic segmentation. Extensive experiments on the public point cloud panoptic segmentation datasets Semantic-KITTI and nuScenes, including evaluations and ablation studies, demonstrate that PVCL outperforms the state of the art. Our approach ranked first on the public leaderboard of Semantic-KITTI at the time of submission, surpassing the published second-ranked method, EfficientLPS, by 1.7% in PQ. | -
dc.language | eng | - |
dc.relation.ispartof | Proceedings - IEEE International Conference on Robotics and Automation | - |
dc.title | Prototype-Voxel Contrastive Learning for LiDAR Point Cloud Panoptic Segmentation | - |
dc.type | Conference_Paper | - |
dc.description.nature | link_to_subscribed_fulltext | - |
dc.identifier.doi | 10.1109/ICRA46639.2022.9811638 | - |
dc.identifier.scopus | eid_2-s2.0-85136335837 | - |
dc.identifier.spage | 9243 | - |
dc.identifier.epage | 9250 | - |
dc.identifier.isi | WOS:000941277601135 | - |