Author:

Zhou, Yuanbo [1] | Zhang, Xinlin [2] | Deng, Wei [3] | Wang, Tao [4] | Tan, Tao [5] | Gao, Qinquan [6] | Tong, Tong [7]

Indexed by:

EI

Abstract:

Although diffusion prior-based single-image super-resolution has demonstrated remarkable reconstruction capabilities, its potential for stereo image super-resolution remains underexplored. One significant challenge lies in the inherent stochasticity of diffusion models, which makes it difficult to ensure that the generated left and right images exhibit high semantic and texture consistency. This poses a considerable obstacle to advancing research in this field. We therefore introduce DiffSteISR, a pioneering framework for reconstructing real-world stereo images. DiffSteISR uses the powerful prior knowledge embedded in a pre-trained text-to-image model to efficiently recover the texture details lost in low-resolution stereo images. Specifically, DiffSteISR implements a time-aware stereo cross attention with temperature adapter (TASCATA) to guide the diffusion process, ensuring that the generated left and right views exhibit high texture consistency, thereby reducing the disparity error between the super-resolved images and the ground truth (GT) images. Additionally, a stereo omni attention control network (SOA ControlNet) is proposed to enhance the consistency of the super-resolved images with the GT images in the pixel, perceptual, and distribution spaces. Finally, DiffSteISR incorporates a stereo semantic extractor (SSE) to capture viewpoint-specific soft semantic information and shared hard tag semantic information, thereby improving the semantic accuracy and consistency of the generated left and right images. Extensive experimental results demonstrate that DiffSteISR accurately reconstructs natural and precise textures from low-resolution stereo images while maintaining high semantic and texture consistency between the left and right views. © 2025 Elsevier B.V.

Keyword:

Image enhancement; Image reconstruction; Image texture; Stereo image processing; Stochastic models; Stochastic systems

Affiliations:

  • [ 1 ] [Zhou, Yuanbo]Fuzhou University, Fuzhou; 350108, China
  • [ 2 ] [Zhang, Xinlin]Fuzhou University, Fuzhou; 350108, China
  • [ 3 ] [Deng, Wei]Imperial Vision Technology, Fuzhou; 350002, China
  • [ 4 ] [Wang, Tao]Fuzhou University, Fuzhou; 350108, China
  • [ 5 ] [Tan, Tao]Macao Polytechnic University, 999078, China
  • [ 6 ] [Gao, Qinquan]Fuzhou University, Fuzhou; 350108, China
  • [ 7 ] [Gao, Qinquan]Imperial Vision Technology, Fuzhou; 350002, China
  • [ 8 ] [Tong, Tong]Fuzhou University, Fuzhou; 350108, China
  • [ 9 ] [Tong, Tong]Imperial Vision Technology, Fuzhou; 350002, China

Source:

Neurocomputing

ISSN: 0925-2312

Year: 2025

Volume: 623

JCR@2023: 5.500
