기본 정보
연구 분야
프로젝트
발행물
구성원
article|
gold
·인용수 0
·2026
Human pose estimation based on graph neural network: survey
Ramesh Kumar Lama, SeongKi Kim
IF 6.1Journal of King Saud University - Computer and Information Sciences
초록

Abstract Human pose estimation is a fundamental task in computer vision with widespread applications in human–computer interaction, sports analytics, and healthcare. While convolutional neural networks (CNNs) and Transformers have achieved notable success, they often struggle to capture structured body relationships, handle occlusions, and generalize effectively across diverse environments. Graph Neural Networks (GNNs), which represent human poses as structured graphs, offer a compelling alternative by explicitly modeling spatial and temporal dependencies among body joints. This survey provides a comprehensive review of GNN-based pose estimation approaches, encompassing spatial GCNs, spatiotemporal models, graph–Transformer hybrids, and hypergraph frameworks. We analyze these methods along key dimensions, including graph construction, learning paradigms, attention mechanisms, and computational efficiency, using standard benchmarks such as Human3.6 M, COCO, and MPI-INF-3DHP. Our review identifies several emerging trends and critical limitations. These include high computational cost, limited generalization to unconstrained scenarios, and inconsistent evaluation protocols. To advance the field, we outline future research directions, such as hybrid GNN–Transformer architectures, lightweight models for edge deployment, multi-modal fusion, and self-supervised learning strategies aimed at reducing annotation dependency and improving cross-domain robustness.

키워드
PoseConvolutional neural networkGraphArtificial neural networkGeneralizationDeep learningFeature learningDependency (UML)Knowledge graph
타입
article
IF / 인용수
6.1 / 0
게재 연도
2026