File Download
There are no files associated with this item.
Links for fulltext
(May Require Subscription)
- Publisher Website: 10.1109/CVPR52688.2022.00298
- Scopus: eid_2-s2.0-85141766468
- WOS: WOS:000867754203022
- Find via

Supplementary
- Citations:
- Appears in Collections:
Conference Paper: Revisiting Skeleton-based Action Recognition
| Title | Revisiting Skeleton-based Action Recognition |
|---|---|
| Authors | |
| Keywords | Action and event recognition Video analysis and understanding |
| Issue Date | 2022 |
| Citation | Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2022, v. 2022-June, p. 2959-2968 How to Cite? |
| Abstract | Human skeleton, as a compact representation of human action, has received increasing attention in recent years. Many skeleton-based action recognition methods adopt GCNs to extract features on top of human skeletons. Despite the positive results shown in these attempts, GCN-based methods are subject to limitations in robustness, interoperability, and scalability. In this work, we propose PoseConv3D, a new approach to skeleton-based action recognition. PoseConv3D relies on a 3D heatmap volume instead of a graph sequence as the base representation of human skeletons. Compared to GCN-based methods, PoseConv3D is more effective in learning spatiotemporal features, more robust against pose estimation noises, and generalizes better in cross-dataset settings. Also, PoseConv3D can handle multiple-person scenarios without additional computation costs. The hierarchical features can be easily integrated with other modalities at early fusion stages, providing a great design space to boost the performance. PoseConv3D achieves the state-of-the-art on five of six standard skeleton-based action recognition benchmarks. Once fused with other modalities, it achieves the state-of-the-art on all eight multi-modality action recognition benchmarks. Code has been made available at: https://github.com/kennymckormick/pyskl. |
| Persistent Identifier | http://hdl.handle.net/10722/352321 |
| ISSN | 2023 SCImago Journal Rankings: 10.331 |
| ISI Accession Number ID |
| DC Field | Value | Language |
|---|---|---|
| dc.contributor.author | Duan, Haodong | - |
| dc.contributor.author | Zhao, Yue | - |
| dc.contributor.author | Chen, Kai | - |
| dc.contributor.author | Lin, Dahua | - |
| dc.contributor.author | Dai, Bo | - |
| dc.date.accessioned | 2024-12-16T03:58:15Z | - |
| dc.date.available | 2024-12-16T03:58:15Z | - |
| dc.date.issued | 2022 | - |
| dc.identifier.citation | Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2022, v. 2022-June, p. 2959-2968 | - |
| dc.identifier.issn | 1063-6919 | - |
| dc.identifier.uri | http://hdl.handle.net/10722/352321 | - |
| dc.description.abstract | Human skeleton, as a compact representation of human action, has received increasing attention in recent years. Many skeleton-based action recognition methods adopt GCNs to extract features on top of human skeletons. Despite the positive results shown in these attempts, GCN-based methods are subject to limitations in robustness, interoperability, and scalability. In this work, we propose PoseConv3D, a new approach to skeleton-based action recognition. PoseConv3D relies on a 3D heatmap volume instead of a graph sequence as the base representation of human skeletons. Compared to GCN-based methods, PoseConv3D is more effective in learning spatiotemporal features, more robust against pose estimation noises, and generalizes better in cross-dataset settings. Also, PoseConv3D can handle multiple-person scenarios without additional computation costs. The hierarchical features can be easily integrated with other modalities at early fusion stages, providing a great design space to boost the performance. PoseConv3D achieves the state-of-the-art on five of six standard skeleton-based action recognition benchmarks. Once fused with other modalities, it achieves the state-of-the-art on all eight multi-modality action recognition benchmarks. Code has been made available at: https://github.com/kennymckormick/pyskl. | - |
| dc.language | eng | - |
| dc.relation.ispartof | Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition | - |
| dc.subject | Action and event recognition | - |
| dc.subject | Video analysis and understanding | - |
| dc.title | Revisiting Skeleton-based Action Recognition | - |
| dc.type | Conference_Paper | - |
| dc.description.nature | link_to_subscribed_fulltext | - |
| dc.identifier.doi | 10.1109/CVPR52688.2022.00298 | - |
| dc.identifier.scopus | eid_2-s2.0-85141766468 | - |
| dc.identifier.volume | 2022-June | - |
| dc.identifier.spage | 2959 | - |
| dc.identifier.epage | 2968 | - |
| dc.identifier.isi | WOS:000867754203022 | - |
