• Complex
  • Title
  • Keyword
  • Abstract
  • Scholars
  • Journal
  • ISSN
  • Conference
成果搜索

author:

Cai, Qi (Cai, Qi.) [1] | Chen, Zhifeng (Chen, Zhifeng.) [2] | Wu, Dapeng Oliver (Wu, Dapeng Oliver.) [3] | Liu, Shan (Liu, Shan.) [4] | Li, Xiang (Li, Xiang.) [5]

Indexed by:

EI

Abstract:

Occupying the most significant portion of global data traffic, video is being generated in almost every aspect of our life. Because of its huge volume, we are depending much more heavily on machine intelligence based analysis. In the meantime, video coding technology has been continuously improved for better compression efficiency. However, the state-of-the-art video coding standards, such as H.265/HEVC and versatile video coding (VVC), are still designed assuming that the compressed video will be watched by a human later. Such a design is not optimal when the compressed video will be used by computer vision applications. While the human visual system (HVS) is consistently sensitive to the content with high contrast, the impact of pixels on computer vision algorithms is task driven. For example, because of the different categories of objects used to train detection algorithms, the influence of the same image content on those detectors also varies. Therefore, human oriented video coding strategies may not be optimal when the compressed signal is further processed by algorithms, as the encoder is unaware of the task specific information. In this article, taking object detection as an example, we propose a novel video coding strategy for computer vision. By protecting the information according to its importance for an object detector rather than for the human visual system, our proposed method has the potential to achieve a better object detection performance with the same bandwidth. The main contributions of our paper are: 1) the modeling of the relationship between object detection accuracy and bit rate; 2) a back propagation based method to analyze the influence of each pixel on the detection of target objects; 3) an object detection oriented bit allocation and codec control parameter determination scheme; 4) an evaluation metric to compare the impact of video coding strategies on a given object detector over a predefined range of bit rate. Experimental results demonstrate that our proposed algorithm can better preserve the video content vital for object detection than state-of-the-art video coding schemes. © 1991-2012 IEEE.

Keyword:

Backpropagation Computer vision Image coding Image compression Object detection Object recognition Pixels Video signal processing

Community:

  • [ 1 ] [Cai, Qi]Department of Electrical and Computer Engineering, University of Florida, Gainesville; FL; 32608, United States
  • [ 2 ] [Chen, Zhifeng]Department of Physics and Information Engineering, Fuzhou University, Fuzhou; 350108, China
  • [ 3 ] [Wu, Dapeng Oliver]Department of Electrical and Computer Engineering, University of Florida, Gainesville; FL; 32608, United States
  • [ 4 ] [Liu, Shan]Media Laboratory, Tencent America, Palo Alto; CA; 94306, United States
  • [ 5 ] [Li, Xiang]Media Laboratory, Tencent America, Palo Alto; CA; 94306, United States

Reprint 's Address:

Email:

Show more details

Related Keywords:

Related Article:

Source :

IEEE Transactions on Circuits and Systems for Video Technology

ISSN: 1051-8215

Year: 2021

Issue: 12

Volume: 31

Page: 4924-4937

5 . 8 5 9

JCR@2021

8 . 3 0 0

JCR@2023

ESI HC Threshold:105

JCR Journal Grade:1

CAS Journal Grade:2

Cited Count:

WoS CC Cited Count: 0

SCOPUS Cited Count: 20

ESI Highly Cited Papers on the List: 0 Unfold All

WanFang Cited Count:

Chinese Cited Count:

30 Days PV: 1

Affiliated Colleges:

Online/Total:189/10046310
Address:FZU Library(No.2 Xuyuan Road, Fuzhou, Fujian, PRC Post Code:350116) Contact Us:0591-22865326
Copyright:FZU Library Technical Support:Beijing Aegean Software Co., Ltd. 闽ICP备05005463号-1