File Download

There are no files associated with this item.

  Links for fulltext
     (May Require Subscription)
Supplementary

Article: HRHPE: FRoI guides heterogeneous relationship representation learning for precise head pose estimation

TitleHRHPE: FRoI guides heterogeneous relationship representation learning for precise head pose estimation
Authors
KeywordsComputer vision
Facial regions of interest
Head pose estimation
Heterogeneous relationship
Transformer
Issue Date28-May-2025
PublisherElsevier
Citation
Neurocomputing, 2025, v. 647 How to Cite?
Abstract

The ability to effectively detect head positions has raised concerns in the field of computer vision. However, head pose estimation (HPE) is prone to problems, such as extreme angles, occlusion, and lighting. To effectively address this critical missing information gap in the field of HPE, we propose a heterogeneous relationship guided learning method for the representation of transannular layers that effectively captures and uses “Facial Regions of Interest” (FRoI) and heterogeneous relationships through FRoI morphable modeling. Two key observations are revealed: 1) the saliency of the facial regions of interest; and 2) the heterogeneous relationships between the adjacent postures. On this basis, three modules are proposed, namely Region Feature Generation (RFG), Hierarchical Structure Modeling (HSM) and Heterogeneous Relationship Mining (HRM). In particular, we introduce a regional attention mechanism in the RFG, which assigns a higher weighting to FRoI. In HSM, the concept of the “Rugby style” design is proposed as a model for the cross-layer structure. In HRM, we use the Transformer to explore the interdependencies between facial regions and semantic relationships in rotation of the head. Experiments with three real HPE datasets (300 W_LP, AFLW2000 and BIWI) show that our HRHPE is more efficient than the state-of-the-art methods.


Persistent Identifierhttp://hdl.handle.net/10722/357743
ISSN
2023 Impact Factor: 5.5
2023 SCImago Journal Rankings: 1.815
ISI Accession Number ID

 

DC FieldValueLanguage
dc.contributor.authorLiu, Hai-
dc.contributor.authorQian, Shijia-
dc.contributor.authorLiu, Tingting-
dc.contributor.authorCao, Zelin-
dc.contributor.authorWang, Minhong-
dc.contributor.authorJu, Jianping-
dc.contributor.authorZhang, Zhaoli-
dc.date.accessioned2025-07-22T03:14:39Z-
dc.date.available2025-07-22T03:14:39Z-
dc.date.issued2025-05-28-
dc.identifier.citationNeurocomputing, 2025, v. 647-
dc.identifier.issn0925-2312-
dc.identifier.urihttp://hdl.handle.net/10722/357743-
dc.description.abstract<p>The ability to effectively detect head positions has raised concerns in the field of computer vision. However, head pose estimation (HPE) is prone to problems, such as extreme angles, occlusion, and lighting. To effectively address this critical missing information gap in the field of HPE, we propose a heterogeneous relationship guided learning method for the representation of transannular layers that effectively captures and uses “Facial Regions of Interest” (FRoI) and heterogeneous relationships through FRoI morphable modeling. Two key observations are revealed: 1) the saliency of the facial regions of interest; and 2) the heterogeneous relationships between the adjacent postures. On this basis, three modules are proposed, namely Region Feature Generation (RFG), Hierarchical Structure Modeling (HSM) and Heterogeneous Relationship Mining (HRM). In particular, we introduce a regional attention mechanism in the RFG, which assigns a higher weighting to FRoI. In HSM, the concept of the “Rugby style” design is proposed as a model for the cross-layer structure. In HRM, we use the Transformer to explore the interdependencies between facial regions and semantic relationships in rotation of the head. Experiments with three real HPE datasets (300 W_LP, AFLW2000 and BIWI) show that our HRHPE is more efficient than the state-of-the-art methods.</p>-
dc.languageeng-
dc.publisherElsevier-
dc.relation.ispartofNeurocomputing-
dc.rightsThis work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.-
dc.subjectComputer vision-
dc.subjectFacial regions of interest-
dc.subjectHead pose estimation-
dc.subjectHeterogeneous relationship-
dc.subjectTransformer-
dc.titleHRHPE: FRoI guides heterogeneous relationship representation learning for precise head pose estimation-
dc.typeArticle-
dc.identifier.doi10.1016/j.neucom.2025.130623-
dc.identifier.scopuseid_2-s2.0-105007742004-
dc.identifier.volume647-
dc.identifier.eissn1872-8286-
dc.identifier.isiWOS:001517300500001-
dc.identifier.issnl0925-2312-

Export via OAI-PMH Interface in XML Formats


OR


Export to Other Non-XML Formats