Pose focus transformer meet inter-part relation

Indexed by：

Scopus

Abstract：

Human pose estimation in crowded scenes is a challenging task. Due to overlap and occlusion, it is difficult to infer pose clues from individual keypoints. We propose PFFormer, a new transformer-based approach that treats pose estimation as a hierarchical set prediction problem: it first focuses on human windows and coarsely predicts whole-body poses globally within them. In PFFormer, we design a Windows Clustering Transformer (WCT), which reorganizes the image windows by filtering the attentive windows and fusing the inattentive ones, allowing the transformer to focus on the important regions while reducing interference from the complex background; a global transformer then compensates for the resulting loss of information. We then partition the learned body pose into a set of structural parts and apply the Inter-Part Relation Module (IPRM) to capture the correlation between multiple parts. These full-body poses and component features are refined at a finer level through the Part-to-Joint Decoder (PJD). Extensive experiments show that PFFormer performs favorably against its counterparts on challenging datasets, including COCO2017, CrowdPose, and OCHuman. Its performance in crowded scenes, in particular, demonstrates the robustness of the proposed method to occlusion. © 2023 Elsevier Ltd

Keyword：

Crowded scene; Human pose estimation; Inter-part relation; Transformer

Affiliations：

[ 1 ] [Luo Y.]College of Computer Science and Technology, Huaqiao University, Xiamen, 361021, China
[ 2 ] [Luo Y.]Xiamen Key Laboratory of Computer Vision and Pattern Recognition, Huaqiao University, Xiamen, 361021, China
[ 3 ] [Lin H.]College of Computer Science and Technology, Huaqiao University, Xiamen, 361021, China
[ 4 ] [Lin H.]Xiamen Key Laboratory of Computer Vision and Pattern Recognition, Huaqiao University, Xiamen, 361021, China
[ 5 ] [Huang W.]Maynooth International Engineering College, Fuzhou University, Fuzhou, 350108, China
[ 6 ] [Wang Y.]College of Computer Science and Technology, Huaqiao University, Xiamen, 361021, China
[ 7 ] [Wang Y.]Xiamen Key Laboratory of Computer Vision and Pattern Recognition, Huaqiao University, Xiamen, 361021, China
[ 8 ] [Du J.]College of Computer Science and Technology, Huaqiao University, Xiamen, 361021, China
[ 9 ] [Du J.]Xiamen Key Laboratory of Computer Vision and Pattern Recognition, Huaqiao University, Xiamen, 361021, China
[ 10 ] [Guo J.-M.]Department of Electrical Engineering, National Taiwan University of Science and Technology, Taipei, 10607, China

Related Articles：

Pose focus transformer meet inter-part relation
2023，EXPERT SYSTEMS WITH APPLICATIONS
CTHPose: An Efficient and Effective CNN-Transformer Hybrid Network for Human Pose Estimation
2024，PATTERN RECOGNITION AND COMPUTER VISION, PRCV 2023, PT V
Human Pose Estimation Method Based on Flexible Model and Deep Learning
2018，2nd International Conference on Computer Science and Application Engineering (CSAE)
3D Human pose estimation from video via multi-scale multi-level spatial temporal features
2024，MULTIMEDIA TOOLS AND APPLICATIONS
Skeleton-based 3D human pose estimation with low-resolution infrared array sensor using attention based CNN-BiGRU
2023，INTERNATIONAL JOURNAL OF MACHINE LEARNING AND CYBERNETICS

Source ：

Expert Systems with Applications

ISSN： 0957-4174

Year： 2024

Volume： 240

Impact Factor： 7.500 (JCR@2023)

CAS Journal Grade：2

ESI Highly Cited Papers on the List： 0
