File Download
There are no files associated with this item.
Links for fulltext
(May Require Subscription)
- Publisher Website: 10.1016/j.neucom.2025.131247
- Scopus: eid_2-s2.0-105013497826
- Find via

Supplementary
-
Citations:
- Scopus: 0
- Appears in Collections:
Article: Optimizing the temporal adjacency matrix for 3D human pose estimation through clustering
| Title | Optimizing the temporal adjacency matrix for 3D human pose estimation through clustering |
|---|---|
| Authors | |
| Keywords | 3D human pose estimation Clustering Temporal adjacency matrix |
| Issue Date | 7-Nov-2025 |
| Publisher | Elsevier |
| Citation | Neurocomputing, 2025, v. 653 How to Cite? |
| Abstract | With the booming development of multimedia technologies related to 3D images and videos, 3D human pose estimation has gained increasing attention. In 3D human pose estimation, exploring the temporal relationships between human joints is essential. Transformer-based methods for modeling temporal relationships leverage the temporal adjacency matrix within the self-attention mechanism as a key component for capturing these connections. We define temporal features that occupy a small portion of all temporal features and are distant from other temporal features in the high-dimensional space as noisy features. In the temporal adjacency matrix of self-attention, noisy features can interfere with the computation of other features, and thus their correlation with other features should be eliminated. However, existing methods overlook this issue. To address this issue, we propose a DBSCAN-based clustering module to detect noisy temporal features and an adjacency matrix masking mechanism to suppress their influence. First, we cluster the input temporal features to obtain noisy features and clustering results. Then, we eliminate the correlations of noisy features on other temporal features within the adjacency matrix and reduce the correlations between different classes. Extensive experiments on the Human3.6M and MPI-INF-3DHP datasets, using state-of-the-art methods as benchmarks, demonstrate that our approach achieves improvements of up to 6.94 % Mean Per Joint Position Error (MPJPE) compared to the original methods with ground–truth input. |
| Persistent Identifier | http://hdl.handle.net/10722/362390 |
| ISSN | 2023 Impact Factor: 5.5 2023 SCImago Journal Rankings: 1.815 |
| DC Field | Value | Language |
|---|---|---|
| dc.contributor.author | Wang, Yingfeng | - |
| dc.contributor.author | Li, Muyu | - |
| dc.contributor.author | Meng, Nan | - |
| dc.contributor.author | Xu, Min | - |
| dc.date.accessioned | 2025-09-23T00:31:11Z | - |
| dc.date.available | 2025-09-23T00:31:11Z | - |
| dc.date.issued | 2025-11-07 | - |
| dc.identifier.citation | Neurocomputing, 2025, v. 653 | - |
| dc.identifier.issn | 0925-2312 | - |
| dc.identifier.uri | http://hdl.handle.net/10722/362390 | - |
| dc.description.abstract | With the booming development of multimedia technologies related to 3D images and videos, 3D human pose estimation has gained increasing attention. In 3D human pose estimation, exploring the temporal relationships between human joints is essential. Transformer-based methods for modeling temporal relationships leverage the temporal adjacency matrix within the self-attention mechanism as a key component for capturing these connections. We define temporal features that occupy a small portion of all temporal features and are distant from other temporal features in the high-dimensional space as noisy features. In the temporal adjacency matrix of self-attention, noisy features can interfere with the computation of other features, and thus their correlation with other features should be eliminated. However, existing methods overlook this issue. To address this issue, we propose a DBSCAN-based clustering module to detect noisy temporal features and an adjacency matrix masking mechanism to suppress their influence. First, we cluster the input temporal features to obtain noisy features and clustering results. Then, we eliminate the correlations of noisy features on other temporal features within the adjacency matrix and reduce the correlations between different classes. Extensive experiments on the Human3.6M and MPI-INF-3DHP datasets, using state-of-the-art methods as benchmarks, demonstrate that our approach achieves improvements of up to 6.94 % Mean Per Joint Position Error (MPJPE) compared to the original methods with ground–truth input. | - |
| dc.language | eng | - |
| dc.publisher | Elsevier | - |
| dc.relation.ispartof | Neurocomputing | - |
| dc.subject | 3D human pose estimation | - |
| dc.subject | Clustering | - |
| dc.subject | Temporal adjacency matrix | - |
| dc.title | Optimizing the temporal adjacency matrix for 3D human pose estimation through clustering | - |
| dc.type | Article | - |
| dc.identifier.doi | 10.1016/j.neucom.2025.131247 | - |
| dc.identifier.scopus | eid_2-s2.0-105013497826 | - |
| dc.identifier.volume | 653 | - |
| dc.identifier.eissn | 1872-8286 | - |
| dc.identifier.issnl | 0925-2312 | - |
