File Download

There are no files associated with this item.

  Links for fulltext
     (May Require Subscription)
Supplementary

Conference Paper: Stratified Transformer for 3D Point Cloud Segmentation

TitleStratified Transformer for 3D Point Cloud Segmentation
Authors
Keywords3D from multi-view and sensors
grouping and shape analysis
Scene analysis and understanding
Segmentation
Issue Date2022
Citation
Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2022, v. 2022-June, p. 8490-8499 How to Cite?
Abstract3D point cloud segmentation has made tremendous progress in recent years. Most current methods focus on aggregating local features, but fail to directly model long-range dependencies. In this paper, we propose Stratified Transformer that is able to capture long-range contexts and demonstrates strong generalization ability and high performance. Specifically, we first put forward a novel key sampling strategy. For each query point, we sample nearby points densely and distant points sparsely as its keys in a stratified way, which enables the model to enlarge the effective receptive field and enjoy long-range contexts at a low computational cost. Also, to combat the challenges posed by irregular point arrangements, we propose first-layer point embedding to aggregate local information, which facilitates convergence and boosts performance. Besides, we adopt contextual relative position encoding to adaptively capture position information. Finally, a memory-efficient implementation is introduced to overcome the issue of varying point numbers in each window. Extensive experiments demonstrate the effectiveness and superiority of our method on S3DIS, ScanNetv2 and ShapeNetPart datasets. Code is available at https://github.com/dvlab-research/Stratified-Transformer.
Persistent Identifierhttp://hdl.handle.net/10722/333542
ISSN
2023 SCImago Journal Rankings: 10.331
ISI Accession Number ID

 

DC FieldValueLanguage
dc.contributor.authorLai, Xin-
dc.contributor.authorLiu, Jianhui-
dc.contributor.authorJiang, Li-
dc.contributor.authorWang, Liwei-
dc.contributor.authorZhao, Hengshuang-
dc.contributor.authorLiu, Shu-
dc.contributor.authorQi, Xiaojuan-
dc.contributor.authorJia, Jiaya-
dc.date.accessioned2023-10-06T05:20:19Z-
dc.date.available2023-10-06T05:20:19Z-
dc.date.issued2022-
dc.identifier.citationProceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2022, v. 2022-June, p. 8490-8499-
dc.identifier.issn1063-6919-
dc.identifier.urihttp://hdl.handle.net/10722/333542-
dc.description.abstract3D point cloud segmentation has made tremendous progress in recent years. Most current methods focus on aggregating local features, but fail to directly model long-range dependencies. In this paper, we propose Stratified Transformer that is able to capture long-range contexts and demonstrates strong generalization ability and high performance. Specifically, we first put forward a novel key sampling strategy. For each query point, we sample nearby points densely and distant points sparsely as its keys in a stratified way, which enables the model to enlarge the effective receptive field and enjoy long-range contexts at a low computational cost. Also, to combat the challenges posed by irregular point arrangements, we propose first-layer point embedding to aggregate local information, which facilitates convergence and boosts performance. Besides, we adopt contextual relative position encoding to adaptively capture position information. Finally, a memory-efficient implementation is introduced to overcome the issue of varying point numbers in each window. Extensive experiments demonstrate the effectiveness and superiority of our method on S3DIS, ScanNetv2 and ShapeNetPart datasets. Code is available at https://github.com/dvlab-research/Stratified-Transformer.-
dc.languageeng-
dc.relation.ispartofProceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition-
dc.subject3D from multi-view and sensors-
dc.subjectgrouping and shape analysis-
dc.subjectScene analysis and understanding-
dc.subjectSegmentation-
dc.titleStratified Transformer for 3D Point Cloud Segmentation-
dc.typeConference_Paper-
dc.description.naturelink_to_subscribed_fulltext-
dc.identifier.doi10.1109/CVPR52688.2022.00831-
dc.identifier.scopuseid_2-s2.0-85132707307-
dc.identifier.volume2022-June-
dc.identifier.spage8490-
dc.identifier.epage8499-
dc.identifier.isiWOS:000870759101053-

Export via OAI-PMH Interface in XML Formats


OR


Export to Other Non-XML Formats