• Complex
  • Title
  • Keyword
  • Abstract
  • Scholars
  • Journal
  • ISSN
  • Conference
成果搜索

author:

Cai, Qi (Cai, Qi.) [1] | Chen, Zhifeng (Chen, Zhifeng.) [2] | Wu, Dapeng Oliver (Wu, Dapeng Oliver.) [3] | Liu, Shan (Liu, Shan.) [4] | Li, Xiang (Li, Xiang.) [5]

Indexed by:

EI SCIE

Abstract:

Occupying the most significant portion of global data traffic, video is being generated in almost every aspect of our life. Because of its huge volume, we are depending much more heavily on machine intelligence based analysis. In the meantime, video coding technology has been continuously improved for better compression efficiency. However, the state-of-the-art video coding standards, such as H.265/HEVC and versatile video coding (VVC), are still designed assuming that the compressed video will be watched by a human later. Such a design is not optimal when the compressed video will be used by computer vision applications. While the human visual system (HVS) is consistently sensitive to the content with high contrast, the impact of pixels on computer vision algorithms is task driven. For example, because of the different categories of objects used to train detection algorithms, the influence of the same image content on those detectors also varies. Therefore, human oriented video coding strategies may not be optimal when the compressed signal is further processed by algorithms, as the encoder is unaware of the task specific information. In this article, taking object detection as an example, we propose a novel video coding strategy for computer vision. By protecting the information according to its importance for an object detector rather than for the human visual system, our proposed method has the potential to achieve a better object detection performance with the same bandwidth. The main contributions of our paper are: 1) the modeling of the relationship between object detection accuracy and bit rate; 2) a back propagation based method to analyze the influence of each pixel on the detection of target objects; 3) an object detection oriented bit allocation and codec control parameter determination scheme; 4) an evaluation metric to compare the impact of video coding strategies on a given object detector over a predefined range of bit rate. Experimental results demonstrate that our proposed algorithm can better preserve the video content vital for object detection than state-of-the-art video coding schemes.

Keyword:

bit allocation Bit rate Codecs detection accuracy modeling Detectors Encoding HEVC object detection Object detection pixel-level impact on detection Video coding Visualization

Community:

  • [ 1 ] [Cai, Qi]Univ Florida, Dept Elect & Comp Engn, Gainesville, FL 32608 USA
  • [ 2 ] [Wu, Dapeng Oliver]Univ Florida, Dept Elect & Comp Engn, Gainesville, FL 32608 USA
  • [ 3 ] [Chen, Zhifeng]Fuzhou Univ, Dept Phys & Informat Engn, Fuzhou 350108, Peoples R China
  • [ 4 ] [Liu, Shan]Tencent Amer, Media Lab, Palo Alto, CA 94306 USA
  • [ 5 ] [Li, Xiang]Tencent Amer, Media Lab, Palo Alto, CA 94306 USA

Reprint 's Address:

  • 陈志峰

    [Chen, Zhifeng]Fuzhou Univ, Dept Phys & Informat Engn, Fuzhou 350108, Peoples R China

Show more details

Version:

Related Keywords:

Related Article:

Source :

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY

ISSN: 1051-8215

Year: 2021

Issue: 12

Volume: 31

Page: 4924-4937

5 . 8 5 9

JCR@2021

8 . 3 0 0

JCR@2023

ESI Discipline: ENGINEERING;

ESI HC Threshold:105

JCR Journal Grade:1

CAS Journal Grade:2

Cited Count:

WoS CC Cited Count: 10

SCOPUS Cited Count: 20

ESI Highly Cited Papers on the List: 0 Unfold All

WanFang Cited Count:

Chinese Cited Count:

30 Days PV: 1

Online/Total:194/10039179
Address:FZU Library(No.2 Xuyuan Road, Fuzhou, Fujian, PRC Post Code:350116) Contact Us:0591-22865326
Copyright:FZU Library Technical Support:Beijing Aegean Software Co., Ltd. 闽ICP备05005463号-1