• Complex
  • Title
  • Keyword
  • Abstract
  • Scholars
  • Journal
  • ISSN
  • Conference
成果搜索

author:

Li, Jun (Li, Jun.) [1] | Bi, Yuquan (Bi, Yuquan.) [2] | Wang, Sumei (Wang, Sumei.) [3] | Li, Qiming (Li, Qiming.) [4]

Indexed by:

EI

Abstract:

High resolution and strong semantic representation are both vital for feature extraction networks of pedestrian detection. The existing high-resolution network (HRNet) has presented a promising performance for pedestrian detection. However, we observed that it still has some significant shortcomings for heavily occluded and small-scale pedestrians. In this paper, we propose to address the shortcomings by extracting semantic and spatial context from HRNet. Specifically, we propose a Context-aware Feature Representation Learning Module (CFRL-Module), which combines a Multi-scale Feature Context Extraction Parallel Block for Convolution and Self-attention (CEPCA-Block) with two parallel paths and an Equivalent FFN (EFFN) Block. The core CEPCA-Block adopts a parallel design to integrate convolution and multi-head self-attention (MHSA) with low parameter computational cost, which can obtain the deep semantic context by convolution path and precise context by MHSA path. Furthermore, to overcome the inefficiency of global MHSA in high-resolution pedestrian detection, we propose a novel local window MHSA, which can significantly reduce memory consumption but barely affect the detection performance. Cascading the proposed CFRL-Module with the anchor-free detection head constitutes our Context-aware Feature Representation Learning Anchor-Free Network (CFRLA-Net). The proposed CFRLA-Net can catch a high-level understanding of the heavily occluded and small-scale pedestrian instances based on HRNet, which can effectively solve the limitation of the insufficient feature extraction ability of HRNet for the hard samples. Experimental results show that CFRLA-Net achieves state-of-the-art performance on CityPersons, Caltech, and CrowdHuman benchmarks. © 1991-2012 IEEE.

Keyword:

Benchmarking Convolution Extraction Feature extraction Semantics

Community:

  • [ 1 ] [Li, Jun]Fuzhou University, Department of Advanced Manufacturing, Quanzhou; 362200, China
  • [ 2 ] [Li, Jun]Quanzhou Institute of Equipment Manufacturing, Haixi Institute, Chinese Academy of Sciences, Laboratory of Robotics and Intelligent Systems, Quanzhou; 362216, China
  • [ 3 ] [Bi, Yuquan]Fuzhou University, Department of Advanced Manufacturing, Quanzhou; 362200, China
  • [ 4 ] [Wang, Sumei]The Hong Kong Polytechnic University, Department of Civil and Environmental Engineering, Hong Kong
  • [ 5 ] [Li, Qiming]Fuzhou University, Department of Advanced Manufacturing, Quanzhou; 362200, China
  • [ 6 ] [Li, Qiming]Quanzhou Institute of Equipment Manufacturing, Haixi Institute, Chinese Academy of Sciences, Laboratory of Robotics and Intelligent Systems, Quanzhou; 362216, China

Reprint 's Address:

Email:

Show more details

Related Keywords:

Related Article:

Source :

IEEE Transactions on Circuits and Systems for Video Technology

ISSN: 1051-8215

Year: 2023

Issue: 9

Volume: 33

Page: 4948-4961

8 . 3

JCR@2023

8 . 3 0 0

JCR@2023

JCR Journal Grade:1

CAS Journal Grade:1

Cited Count:

WoS CC Cited Count:

SCOPUS Cited Count: 1

ESI Highly Cited Papers on the List: 0 Unfold All

WanFang Cited Count:

Chinese Cited Count:

30 Days PV: 0

Affiliated Colleges:

Online/Total:302/10870162
Address:FZU Library(No.2 Xuyuan Road, Fuzhou, Fujian, PRC Post Code:350116) Contact Us:0591-22865326
Copyright:FZU Library Technical Support:Beijing Aegean Software Co., Ltd. 闽ICP备05005463号-1