• Complex
  • Title
  • Keyword
  • Abstract
  • Scholars
  • Journal
  • ISSN
  • Conference
成果搜索

author:

Tang, Hui (Tang, Hui.) [1] | Zhou, Yuanbo (Zhou, Yuanbo.) [2] | Chen, Yuanbin (Chen, Yuanbin.) [3] | Zhang, Xinlin (Zhang, Xinlin.) [4] | Xue, Yuyang (Xue, Yuyang.) [5] | Lin, Xiaoyong (Lin, Xiaoyong.) [6] | Dai, Xinwei (Dai, Xinwei.) [7] | Qiu, Xintao (Qiu, Xintao.) [8] | Gao, Qinquan (Gao, Qinquan.) [9] (Scholars:高钦泉) | Tong, Tong (Tong, Tong.) [10] (Scholars:童同)

Indexed by:

EI Scopus SCIE

Abstract:

Image colorization has a wide range of applications, but it remains a challenging task due to it is an inherently ill-posed problem with multi -modal uncertainty. The advancement of deep learning techniques has provided extensive avenues for addressing image colorization. However, current works mainly suffer from two problems: inaccurate colorization leading to biased color tones (e.g., cool or warm bias) and undersaturation of images. Existing Transformer-based methods can produce impressive results, but they often come with high training costs and may result in color overflow effects. In this paper, we propose a two-stage image colorization strategy based on a color codebook. Clustering methods in the three-dimensional CIE Lab color space is proposed to integrate brightness information so that the colors in the codebook can be lifelike. In the first stage, we treat the colorization task as a classification problem based on a color codebook, and a highquality codebook is advantageous for enhancing color classification accuracy. In the second stage, different from the traditional Transformer-based method, a pyramid-type Transformer structure is used to extract rich image features to refine the colors, which can solve potential color bands, color errors and color overflow. In addition, the parameters and FLOPs are significantly smaller than other traditional Transformer-based methods. Extensive experiments demonstrate that our method outperforms state -of -the -art approaches. On the ImageNet validation set, the achieved values are 4.60, 25.23, 0.19, and 39.82 in terms of FID, PSNR, LPIPS, and CF, respectively. On the COCO-Stuff validation set, the achieved values are 5.62, 25.15, 0.19, and 36.25 in terms of FID, PSNR, LPIPS, and CF, respectively. The codes are available at https://github.com/Tanghui2000/Twostage_Image_Colorization_via_Color_Codebook.

Keyword:

Color classification Deep convolutional neural networks Image colorization Transformer

Community:

  • [ 1 ] [Tang, Hui]Fuzhou Univ, Coll Phys & Informat Engn, Fuzhou, Peoples R China
  • [ 2 ] [Zhou, Yuanbo]Fuzhou Univ, Coll Phys & Informat Engn, Fuzhou, Peoples R China
  • [ 3 ] [Chen, Yuanbin]Fuzhou Univ, Coll Phys & Informat Engn, Fuzhou, Peoples R China
  • [ 4 ] [Zhang, Xinlin]Fuzhou Univ, Coll Phys & Informat Engn, Fuzhou, Peoples R China
  • [ 5 ] [Dai, Xinwei]Fuzhou Univ, Coll Phys & Informat Engn, Fuzhou, Peoples R China
  • [ 6 ] [Qiu, Xintao]Fuzhou Univ, Coll Phys & Informat Engn, Fuzhou, Peoples R China
  • [ 7 ] [Gao, Qinquan]Fuzhou Univ, Coll Phys & Informat Engn, Fuzhou, Peoples R China
  • [ 8 ] [Tong, Tong]Fuzhou Univ, Coll Phys & Informat Engn, Fuzhou, Peoples R China
  • [ 9 ] [Tang, Hui]Fuzhou Univ, Fujian Key Lab Med Instrumentat & Pharmaceut Techn, Fuzhou, Peoples R China
  • [ 10 ] [Zhou, Yuanbo]Fuzhou Univ, Fujian Key Lab Med Instrumentat & Pharmaceut Techn, Fuzhou, Peoples R China
  • [ 11 ] [Chen, Yuanbin]Fuzhou Univ, Fujian Key Lab Med Instrumentat & Pharmaceut Techn, Fuzhou, Peoples R China
  • [ 12 ] [Zhang, Xinlin]Fuzhou Univ, Fujian Key Lab Med Instrumentat & Pharmaceut Techn, Fuzhou, Peoples R China
  • [ 13 ] [Dai, Xinwei]Fuzhou Univ, Fujian Key Lab Med Instrumentat & Pharmaceut Techn, Fuzhou, Peoples R China
  • [ 14 ] [Qiu, Xintao]Fuzhou Univ, Fujian Key Lab Med Instrumentat & Pharmaceut Techn, Fuzhou, Peoples R China
  • [ 15 ] [Gao, Qinquan]Fuzhou Univ, Fujian Key Lab Med Instrumentat & Pharmaceut Techn, Fuzhou, Peoples R China
  • [ 16 ] [Tong, Tong]Fuzhou Univ, Fujian Key Lab Med Instrumentat & Pharmaceut Techn, Fuzhou, Peoples R China
  • [ 17 ] [Gao, Qinquan]Imperial Vis Technol, Fuzhou, Peoples R China
  • [ 18 ] [Tong, Tong]Imperial Vis Technol, Fuzhou, Peoples R China
  • [ 19 ] [Xue, Yuyang]Univ Edinburgh, Edinburgh, Scotland
  • [ 20 ] [Lin, Xiaoyong]Xiamen Univ Technol, Fujian Prov Key Lab Network Audiovisual Applicat I, Xiamen, Peoples R China

Reprint 's Address:

  • [Gao, Qinquan]Fuzhou Univ, Coll Phys & Informat Engn, Fuzhou, Peoples R China;;

Show more details

Version:

Related Keywords:

Source :

EXPERT SYSTEMS WITH APPLICATIONS

ISSN: 0957-4174

Year: 2024

Volume: 250

7 . 5 0 0

JCR@2023

Cited Count:

WoS CC Cited Count:

SCOPUS Cited Count: 2

ESI Highly Cited Papers on the List: 0 Unfold All

WanFang Cited Count:

Chinese Cited Count:

30 Days PV: 0

Online/Total:267/10048954
Address:FZU Library(No.2 Xuyuan Road, Fuzhou, Fujian, PRC Post Code:350116) Contact Us:0591-22865326
Copyright:FZU Library Technical Support:Beijing Aegean Software Co., Ltd. 闽ICP备05005463号-1