File Download

There are no files associated with this item.

  Links for fulltext
     (May Require Subscription)
Supplementary

Article: Transformer3D-Det: Improving 3D Object Detection by Vote Refinement

TitleTransformer3D-Det: Improving 3D Object Detection by Vote Refinement
Authors
Keywords3D object detection
neural network
Point cloud
transformer
Issue Date2021
Citation
IEEE Transactions on Circuits and Systems for Video Technology, 2021, v. 31, n. 12, p. 4735-4746 How to Cite?
AbstractVoting-based methods (e.g., VoteNet) have achieved promising results for 3D object detection. However, the simple voting operation in VoteNet may lead to less accurate voting results that are far away from the true object centers. In this work, we propose a simple but effective 3D object detection method called Transformer3D-Det (T3D), in which we additionally introduce a transformer based vote refinement module to refine the voting results of VoteNet and can thus significantly improve the 3D object detection performance. Specifically, our T3D framework consists of three modules: a vote generation module, a vote refinement module, and a bounding box generation module. Given an input point cloud, we first utilize the vote generation module to generate multiple coarse vote clusters. Then, the clustered coarse votes will be refined by using our transformer based vote refinement module to produce more accurate and meaningful votes. Finally, the bounding box generation module takes the refined vote clusters as the input and generates the final detection result for the input point cloud. To alleviate the impact of inaccurate votes, we also propose a new non-vote loss function to train our T3D. As a result, our T3D framework can achieve better 3D object detection performance. Comprehensive experiments on two benchmark datasets ScanNetV2 and SUN RGB-D demonstrate the effectiveness of our T3D framework for 3D object detection.
Persistent Identifierhttp://hdl.handle.net/10722/321957
ISSN
2021 Impact Factor: 5.859
2020 SCImago Journal Rankings: 0.873
ISI Accession Number ID

 

DC FieldValueLanguage
dc.contributor.authorZhao, Lichen-
dc.contributor.authorGuo, Jinyang-
dc.contributor.authorXu, Dong-
dc.contributor.authorSheng, Lu-
dc.date.accessioned2022-11-03T02:22:37Z-
dc.date.available2022-11-03T02:22:37Z-
dc.date.issued2021-
dc.identifier.citationIEEE Transactions on Circuits and Systems for Video Technology, 2021, v. 31, n. 12, p. 4735-4746-
dc.identifier.issn1051-8215-
dc.identifier.urihttp://hdl.handle.net/10722/321957-
dc.description.abstractVoting-based methods (e.g., VoteNet) have achieved promising results for 3D object detection. However, the simple voting operation in VoteNet may lead to less accurate voting results that are far away from the true object centers. In this work, we propose a simple but effective 3D object detection method called Transformer3D-Det (T3D), in which we additionally introduce a transformer based vote refinement module to refine the voting results of VoteNet and can thus significantly improve the 3D object detection performance. Specifically, our T3D framework consists of three modules: a vote generation module, a vote refinement module, and a bounding box generation module. Given an input point cloud, we first utilize the vote generation module to generate multiple coarse vote clusters. Then, the clustered coarse votes will be refined by using our transformer based vote refinement module to produce more accurate and meaningful votes. Finally, the bounding box generation module takes the refined vote clusters as the input and generates the final detection result for the input point cloud. To alleviate the impact of inaccurate votes, we also propose a new non-vote loss function to train our T3D. As a result, our T3D framework can achieve better 3D object detection performance. Comprehensive experiments on two benchmark datasets ScanNetV2 and SUN RGB-D demonstrate the effectiveness of our T3D framework for 3D object detection.-
dc.languageeng-
dc.relation.ispartofIEEE Transactions on Circuits and Systems for Video Technology-
dc.subject3D object detection-
dc.subjectneural network-
dc.subjectPoint cloud-
dc.subjecttransformer-
dc.titleTransformer3D-Det: Improving 3D Object Detection by Vote Refinement-
dc.typeArticle-
dc.description.naturelink_to_subscribed_fulltext-
dc.identifier.doi10.1109/TCSVT.2021.3102025-
dc.identifier.scopuseid_2-s2.0-85112624645-
dc.identifier.volume31-
dc.identifier.issue12-
dc.identifier.spage4735-
dc.identifier.epage4746-
dc.identifier.eissn1558-2205-
dc.identifier.isiWOS:000725812500018-

Export via OAI-PMH Interface in XML Formats


OR


Export to Other Non-XML Formats