Article: APSNet: Toward Adaptive Point Sampling for Efficient 3D Action Recognition

Title: APSNet: Toward Adaptive Point Sampling for Efficient 3D Action Recognition
Authors: Liu, Jiaheng; Guo, Jinyang; Xu, Dong
Keywords: 3D action recognition; Accuracy-efficiency trade-off; Point cloud
Issue Date: 2022
Citation: IEEE Transactions on Image Processing, 2022, v. 31, p. 5287-5302
Abstract: Observing that it is still challenging to deploy 3D action recognition methods in real-world scenarios, in this work we investigate the accuracy-efficiency trade-off for 3D action recognition. We first introduce a simple and efficient backbone network for 3D action recognition, which directly extracts geometry and motion representations from raw point cloud videos through a set of simple operations (i.e., coordinate offset generation and mini-PointNet). Based on this backbone, we propose an end-to-end optimized network called the adaptive point sampling network (APSNet) to achieve the accuracy-efficiency trade-off, which consists of three stages: a coarse feature extraction stage, a decision-making stage, and a fine feature extraction stage. In APSNet, we adaptively decide the optimal resolution (i.e., the optimal number of points) for each pair of frames of any input point cloud video under a given computational complexity constraint. Comprehensive experiments on multiple benchmark datasets demonstrate the effectiveness and efficiency of our newly proposed APSNet for 3D action recognition.
Persistent Identifier: http://hdl.handle.net/10722/322001
ISSN: 1057-7149
2021 Impact Factor: 11.041
2020 SCImago Journal Rankings: 1.778
ISI Accession Number ID: WOS:000842776300002
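
This record links only to the subscribed full text and includes no code. As a reading aid, here is a minimal PyTorch sketch of the three-stage pipeline the abstract describes (coarse feature extraction, a per-frame-pair resolution decision, fine feature extraction). Every module name, feature size, and the candidate resolution set below is an illustrative assumption, not the authors' implementation; in particular, the paper learns the sampling decision end to end, whereas this sketch uses a hard argmax and random subsampling.

# Illustrative sketch only: module names, feature sizes, and the candidate
# resolutions are assumptions; this is not the authors' released code.
import torch
import torch.nn as nn

class MiniPointNet(nn.Module):
    # Shared MLP + max-pooling, a stand-in for the paper's "mini-PointNet".
    def __init__(self, in_dim=6, out_dim=128):
        super().__init__()
        self.mlp = nn.Sequential(
            nn.Conv1d(in_dim, 64, 1), nn.ReLU(),
            nn.Conv1d(64, out_dim, 1), nn.ReLU(),
        )

    def forward(self, pts):  # pts: (batch, channels, num_points)
        return self.mlp(pts).max(dim=-1).values  # (batch, out_dim)

class APSNetSketch(nn.Module):
    # Coarse features -> per-frame-pair resolution decision -> fine features.
    def __init__(self, resolutions=(128, 256, 512), feat_dim=128, num_classes=20):
        super().__init__()
        self.resolutions = resolutions
        # Input channels: xyz of frame t plus the offsets to frame t+1, a crude
        # stand-in for the paper's "coordinate offset generation".
        self.coarse = MiniPointNet(in_dim=6, out_dim=feat_dim)  # stage 1
        self.decider = nn.Linear(feat_dim, len(resolutions))    # stage 2
        self.fine = MiniPointNet(in_dim=6, out_dim=feat_dim)    # stage 3
        self.classifier = nn.Linear(feat_dim, num_classes)

    def forward(self, frame_pairs):  # list of (batch, 6, N) tensors
        feats = []
        for pts in frame_pairs:
            coarse_feat = self.coarse(pts)
            # The paper optimizes this decision end to end under a compute
            # budget; a hard argmax is used here purely for illustration.
            choice = int(self.decider(coarse_feat).argmax(dim=-1)[0])
            n = self.resolutions[choice]
            idx = torch.randperm(pts.shape[-1])[:n]  # random, not learned, sampling
            feats.append(self.fine(pts[:, :, idx]))
        video_feat = torch.stack(feats).mean(dim=0)  # simple temporal pooling
        return self.classifier(video_feat)

# Toy usage: 4 frame pairs, batch of 2, 1024 points per frame pair.
model = APSNetSketch()
pairs = [torch.randn(2, 6, 1024) for _ in range(4)]
print(model(pairs).shape)  # torch.Size([2, 20])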

 

DC Field: Value
dc.contributor.author: Liu, Jiaheng
dc.contributor.author: Guo, Jinyang
dc.contributor.author: Xu, Dong
dc.date.accessioned: 2022-11-03T02:22:56Z
dc.date.available: 2022-11-03T02:22:56Z
dc.date.issued: 2022
dc.identifier.citation: IEEE Transactions on Image Processing, 2022, v. 31, p. 5287-5302
dc.identifier.issn: 1057-7149
dc.identifier.uri: http://hdl.handle.net/10722/322001
dc.language: eng
dc.relation.ispartof: IEEE Transactions on Image Processing
dc.subject: 3D action recognition
dc.subject: Accuracy-efficiency trade-off
dc.subject: Point cloud
dc.title: APSNet: Toward Adaptive Point Sampling for Efficient 3D Action Recognition
dc.type: Article
dc.description.nature: link_to_subscribed_fulltext
dc.identifier.doi: 10.1109/TIP.2022.3193290
dc.identifier.pmid: 35901004
dc.identifier.scopus: eid_2-s2.0-85135763469
dc.identifier.volume: 31
dc.identifier.spage: 5287
dc.identifier.epage: 5302
dc.identifier.eissn: 1941-0042
dc.identifier.isi: WOS:000842776300002
