File Download

There are no files associated with this item.

  Links for fulltext
     (May Require Subscription)
Supplementary

Article: Optimizing the temporal adjacency matrix for 3D human pose estimation through clustering

TitleOptimizing the temporal adjacency matrix for 3D human pose estimation through clustering
Authors
Keywords3D human pose estimation
Clustering
Temporal adjacency matrix
Issue Date7-Nov-2025
PublisherElsevier
Citation
Neurocomputing, 2025, v. 653 How to Cite?
AbstractWith the booming development of multimedia technologies related to 3D images and videos, 3D human pose estimation has gained increasing attention. In 3D human pose estimation, exploring the temporal relationships between human joints is essential. Transformer-based methods for modeling temporal relationships leverage the temporal adjacency matrix within the self-attention mechanism as a key component for capturing these connections. We define temporal features that occupy a small portion of all temporal features and are distant from other temporal features in the high-dimensional space as noisy features. In the temporal adjacency matrix of self-attention, noisy features can interfere with the computation of other features, and thus their correlation with other features should be eliminated. However, existing methods overlook this issue. To address this issue, we propose a DBSCAN-based clustering module to detect noisy temporal features and an adjacency matrix masking mechanism to suppress their influence. First, we cluster the input temporal features to obtain noisy features and clustering results. Then, we eliminate the correlations of noisy features on other temporal features within the adjacency matrix and reduce the correlations between different classes. Extensive experiments on the Human3.6M and MPI-INF-3DHP datasets, using state-of-the-art methods as benchmarks, demonstrate that our approach achieves improvements of up to 6.94 % Mean Per Joint Position Error (MPJPE) compared to the original methods with ground–truth input.
Persistent Identifierhttp://hdl.handle.net/10722/362390
ISSN
2023 Impact Factor: 5.5
2023 SCImago Journal Rankings: 1.815

 

DC FieldValueLanguage
dc.contributor.authorWang, Yingfeng-
dc.contributor.authorLi, Muyu-
dc.contributor.authorMeng, Nan-
dc.contributor.authorXu, Min-
dc.date.accessioned2025-09-23T00:31:11Z-
dc.date.available2025-09-23T00:31:11Z-
dc.date.issued2025-11-07-
dc.identifier.citationNeurocomputing, 2025, v. 653-
dc.identifier.issn0925-2312-
dc.identifier.urihttp://hdl.handle.net/10722/362390-
dc.description.abstractWith the booming development of multimedia technologies related to 3D images and videos, 3D human pose estimation has gained increasing attention. In 3D human pose estimation, exploring the temporal relationships between human joints is essential. Transformer-based methods for modeling temporal relationships leverage the temporal adjacency matrix within the self-attention mechanism as a key component for capturing these connections. We define temporal features that occupy a small portion of all temporal features and are distant from other temporal features in the high-dimensional space as noisy features. In the temporal adjacency matrix of self-attention, noisy features can interfere with the computation of other features, and thus their correlation with other features should be eliminated. However, existing methods overlook this issue. To address this issue, we propose a DBSCAN-based clustering module to detect noisy temporal features and an adjacency matrix masking mechanism to suppress their influence. First, we cluster the input temporal features to obtain noisy features and clustering results. Then, we eliminate the correlations of noisy features on other temporal features within the adjacency matrix and reduce the correlations between different classes. Extensive experiments on the Human3.6M and MPI-INF-3DHP datasets, using state-of-the-art methods as benchmarks, demonstrate that our approach achieves improvements of up to 6.94 % Mean Per Joint Position Error (MPJPE) compared to the original methods with ground–truth input.-
dc.languageeng-
dc.publisherElsevier-
dc.relation.ispartofNeurocomputing-
dc.subject3D human pose estimation-
dc.subjectClustering-
dc.subjectTemporal adjacency matrix-
dc.titleOptimizing the temporal adjacency matrix for 3D human pose estimation through clustering-
dc.typeArticle-
dc.identifier.doi10.1016/j.neucom.2025.131247-
dc.identifier.scopuseid_2-s2.0-105013497826-
dc.identifier.volume653-
dc.identifier.eissn1872-8286-
dc.identifier.issnl0925-2312-

Export via OAI-PMH Interface in XML Formats


OR


Export to Other Non-XML Formats