多域字符距离感知的场景文本图像超分辨率重建 - Details

author：

Huang, J.-Y. (Huang, J.-Y..) ^[1] | Chen, H.-H. (Chen, H.-H..) ^[2] | Wang, J.-B. (Wang, J.-B..) ^[3] | Chen, P.-P. (Chen, P.-P..) ^[4] (Scholars：陈平平) | Lin, Z.-J. (Lin, Z.-J..) ^[5] (Scholars：林志坚)

Indexed by：

Scopus

Abstract：

Scene　text　image　super-resolution　(STISR)　aims　to　enhance　the　resolution　and　legibility　of　text　in　low-resolution　images.　In　cases　of　spatial　deformation　or　low-resolution　text　images,　the　lack　of　details　in　text　regions　and　the　difficulty　in　aligning　semantic　cues　and　visual　features　with　character　position　make　it　difficult　to　recognize　text　effectively.　In　order　to　address　these　challenges,　this　paper　proposes　a　perceiving　multi-domain　character　distance　for　scene　text　image　super-resolution　method　(PMDC),　which　improves　the　image　text　region　and　edge　texture　details.　Firsly,　the　visual　and　semantic　features　are　extracted　by　using　the　asymmetric　convolution　module　along　with　the　semantic　prior　module.　Then　the　enhanced　position　coding　is　obtained　by　the　character　distance　perception　module　to　perceive　the　distance　change　and　semantic　similarity　between　characters.　Finally,　the　guiding　cues　and　visual　features　are　combined　to　restructure　the　pixels　and　generate　a　super-resolution　text　image.　In　comparison　to　TATT,　experimental　results　from　the　public　dataset　TextZoom　showed　an　increase　of　0.11　dB　in　the　fidelity　of　the　peak　signal-to-noise　ratio　index.　This　improvement　effectively　enhances　the　clarity　of　the　text　area　and　the　detailed　edge　texture.　Additionally,　the　recognition　accuracy　was　improved　by　1.4%,　which　effectively　enhances　the　readability　of　the　text　image.　©　2024　Chinese　Institute　of　Electronics.　All　rights　reserved.

Keyword：

attention mechanism computer vision feature information association scene text images super-resolution

Community：

[ 1 ] [Huang J.-Y.]College of Physics and Information Engineering, Fuzhou University, Fujian, Fuzhou, 350108, China
[ 2 ] [Chen H.-H.]College of Physics and Information Engineering, Fuzhou University, Fujian, Fuzhou, 350108, China
[ 3 ] [Wang J.-B.]College of Physics and Information Engineering, Fuzhou University, Fujian, Fuzhou, 350108, China
[ 4 ] [Chen P.-P.]College of Physics and Information Engineering, Fuzhou University, Fujian, Fuzhou, 350108, China
[ 5 ] [Lin Z.-J.]College of Physics and Information Engineering, Fuzhou University, Fujian, Fuzhou, 350108, China

Reprint 's Address：

Email：

Show more details

Related Keywords：

Residual Triplet Attention Network for Single-Image Super-Resolution
2021，ELECTRONICS
An Attention and Wavelet Based Spatial-Temporal Graph Neural Network for Traffic Flow and Speed Prediction
2022，Mathematics
Stock Price Prediction Using CNN-BiLSTM-Attention Model
2023，Mathematics
Efficient Optimized YOLOv8 Model with Extended Vision
2024，SENSORS
BiGA-YOLO: A Lightweight Object Detection Network Based on YOLOv5 for Autonomous Driving
2023，ELECTRONICS

Source ：

电子学报

ISSN： 0372-2112

Year： 2024

Issue： 7

Volume： 52

Page： 2262-2270

Cited Count：

WoS CC Cited Count：

SCOPUS Cited Count： 1

ESI Highly Cited Papers on the List： 0 Unfold All

WanFang Cited Count：

Chinese Cited Count：

30 Days PV： 1

Affiliated Colleges：

物理与信息工程学院、微电子学院本学院/部未明确归属的数据

Get Fulltext

DOI Library Discovery Baidu Scholar Search SCOPUS

Type
Departments

All Years Choose Year From to