• Complex
  • Title
  • Keyword
  • Abstract
  • Scholars
  • Journal
  • ISSN
  • Conference
成果搜索

author:

Li, Jun (Li, Jun.) [1] | Bi, Yuquan (Bi, Yuquan.) [2] | Wang, Sumei (Wang, Sumei.) [3] | Li, Qiming (Li, Qiming.) [4]

Indexed by:

EI Scopus SCIE

Abstract:

High resolution and strong semantic representation are both vital for feature extraction networks of pedestrian detection. The existing high-resolution network (HRNet) has presented a promising performance for pedestrian detection. However, we observed that it still has some significant shortcomings for heavily occluded and small-scale pedestrians. In this paper, we propose to address the shortcomings by extracting semantic and spatial context from HRNet. Specifically, we propose a Context-aware Feature Representation Learning Module (CFRL-Module), which combines a Multi-scale Feature Context Extraction Parallel Block for Convolution and Self-attention (CEPCA-Block) with two parallel paths and an Equivalent FFN (EFFN) Block. The core CEPCA-Block adopts a parallel design to integrate convolution and multi-head self-attention (MHSA) with low parameter computational cost, which can obtain the deep semantic context by convolution path and precise context by MHSA path. Furthermore, to overcome the inefficiency of global MHSA in high-resolution pedestrian detection, we propose a novel local window MHSA, which can significantly reduce memory consumption but barely affect the detection performance. Cascading the proposed CFRL-Module with the anchor-free detection head constitutes our Context-aware Feature Representation Learning Anchor-Free Network (CFRLA-Net). The proposed CFRLA-Net can catch a high-level understanding of the heavily occluded and small-scale pedestrian instances based on HRNet, which can effectively solve the limitation of the insufficient feature extraction ability of HRNet for the hard samples. Experimental results show that CFRLA-Net achieves state-of-the-art performance on CityPersons, Caltech, and CrowdHuman benchmarks.

Keyword:

anchor-free context HRNet occluded and small-scale pedestrians Pedestrian detection self-attention

Community:

  • [ 1 ] [Li, Jun]Fuzhou Univ, Dept Adv Mfg, Quanzhou 362200, Fujian, Peoples R China
  • [ 2 ] [Bi, Yuquan]Fuzhou Univ, Dept Adv Mfg, Quanzhou 362200, Fujian, Peoples R China
  • [ 3 ] [Li, Qiming]Fuzhou Univ, Dept Adv Mfg, Quanzhou 362200, Fujian, Peoples R China
  • [ 4 ] [Li, Jun]Chinese Acad Sci, Quanzhou Inst Equipment Mfg, Haixi Inst, Lab Robot & Intelligent Syst, Quanzhou 362216, Fujian, Peoples R China
  • [ 5 ] [Li, Qiming]Chinese Acad Sci, Quanzhou Inst Equipment Mfg, Haixi Inst, Lab Robot & Intelligent Syst, Quanzhou 362216, Fujian, Peoples R China
  • [ 6 ] [Wang, Sumei]Hong Kong Polytech Univ, Dept Civil & Environm Engn, Hong Kong, Peoples R China

Reprint 's Address:

Show more details

Version:

Related Keywords:

Related Article:

Source :

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY

ISSN: 1051-8215

Year: 2023

Issue: 9

Volume: 33

Page: 4948-4961

8 . 3

JCR@2023

8 . 3 0 0

JCR@2023

JCR Journal Grade:1

CAS Journal Grade:1

Cited Count:

WoS CC Cited Count: 3

SCOPUS Cited Count: 4

ESI Highly Cited Papers on the List: 0 Unfold All

WanFang Cited Count:

Chinese Cited Count:

30 Days PV: 0

Online/Total:213/10034449
Address:FZU Library(No.2 Xuyuan Road, Fuzhou, Fujian, PRC Post Code:350116) Contact Us:0591-22865326
Copyright:FZU Library Technical Support:Beijing Aegean Software Co., Ltd. 闽ICP备05005463号-1