File Download
There are no files associated with this item.
Links for fulltext
(May Require Subscription)
- Publisher Website: 10.1109/ICAIBD64986.2025.11081970
- Scopus: eid_2-s2.0-105012753598
Supplementary
-
Citations:
- Scopus: 0
- Appears in Collections:
Conference Paper: RHPENet: Robust Head Pose Estimation by Learning Heterogeneous Relationships from Facial Region Cues in Human Robot Interaction
| Title | RHPENet: Robust Head Pose Estimation by Learning Heterogeneous Relationships from Facial Region Cues in Human Robot Interaction |
|---|---|
| Authors | |
| Keywords | Facial regions of interest Head pose estimation Heterogeneous relationship Robot vision Transformer |
| Issue Date | 1-May-2025 |
| Abstract | Head pose estimation (HPE) techniques frequently encounter difficulties when handling extreme angles, occlusions, and uneven lighting conditions. In this paper, we present a novel heterogeneous relationship learning framework designed to mitigate these limitations by exploiting facial regions of interest (FRoI) and their complex interdependencies. Our approach stems from two fundamental discoveries: first, the critical importance of FRoI for pose determination, and second, the heterogeneous relationship between neighboring postures. The proposed architecture consists of three main modules: region feature generator (RFG), hierarchical structural representation (HSR), and cross-relation aggregator (CRA). The RFG incorporates an adaptive attention mechanism that prioritizes diagnostically significant facial zones. Within the HSR component, we implement a novel "Rugby-style" cross-level connectivity pattern to enhance feature integration. The CRA employs Transformer-based techniques to uncover both spatial and angular dependencies. Comprehensive evaluations conducted on major HPE benchmarks (300W_LP, AFLW2000, and BIWI) demonstrate that our RHPENet model consistently outperforms existing approaches. |
| Persistent Identifier | http://hdl.handle.net/10722/362428 |
| DC Field | Value | Language |
|---|---|---|
| dc.contributor.author | Liu, Tingting | - |
| dc.contributor.author | Qian, Shijia | - |
| dc.contributor.author | Liu, Hai | - |
| dc.contributor.author | Wang, Minhong | - |
| dc.contributor.author | Yang, Bing | - |
| dc.contributor.author | Li, You-Fu | - |
| dc.date.accessioned | 2025-09-24T00:51:28Z | - |
| dc.date.available | 2025-09-24T00:51:28Z | - |
| dc.date.issued | 2025-05-01 | - |
| dc.identifier.uri | http://hdl.handle.net/10722/362428 | - |
| dc.description.abstract | <p>Head pose estimation (HPE) techniques frequently encounter difficulties when handling extreme angles, occlusions, and uneven lighting conditions. In this paper, we present a novel heterogeneous relationship learning framework designed to mitigate these limitations by exploiting facial regions of interest (FRoI) and their complex interdependencies. Our approach stems from two fundamental discoveries: first, the critical importance of FRoI for pose determination, and second, the heterogeneous relationship between neighboring postures. The proposed architecture consists of three main modules: region feature generator (RFG), hierarchical structural representation (HSR), and cross-relation aggregator (CRA). The RFG incorporates an adaptive attention mechanism that prioritizes diagnostically significant facial zones. Within the HSR component, we implement a novel "Rugby-style" cross-level connectivity pattern to enhance feature integration. The CRA employs Transformer-based techniques to uncover both spatial and angular dependencies. Comprehensive evaluations conducted on major HPE benchmarks (300W_LP, AFLW2000, and BIWI) demonstrate that our RHPENet model consistently outperforms existing approaches.<br></p> | - |
| dc.language | eng | - |
| dc.relation.ispartof | 2025 8th International Conference on Artificial Intelligence and Big Data (ICAIBD) (23/05/2025-26/05/2025, Chengdu, Sichuan) | - |
| dc.subject | Facial regions of interest | - |
| dc.subject | Head pose estimation | - |
| dc.subject | Heterogeneous relationship | - |
| dc.subject | Robot vision | - |
| dc.subject | Transformer | - |
| dc.title | RHPENet: Robust Head Pose Estimation by Learning Heterogeneous Relationships from Facial Region Cues in Human Robot Interaction | - |
| dc.type | Conference_Paper | - |
| dc.identifier.doi | 10.1109/ICAIBD64986.2025.11081970 | - |
| dc.identifier.scopus | eid_2-s2.0-105012753598 | - |
| dc.identifier.spage | 639 | - |
| dc.identifier.epage | 645 | - |
