Computer Science > Computer Vision and Pattern Recognition

arXiv:2308.03262 (cs)

[Submitted on 7 Aug 2023]

Title:A Benchmark for Chinese-English Scene Text Image Super-resolution

Authors:Jianqi Ma, Zhetong Liang, Wangmeng Xiang, Xi Yang, Lei Zhang


Abstract: Scene Text Image Super-resolution (STISR) aims to recover high-resolution (HR) scene text images with visually pleasant and readable text content from a given low-resolution (LR) input. Most existing works focus on recovering English text, which has relatively simple character structures, while little work has been done on the more challenging Chinese text, with its diverse and complex character structures. In this paper, we propose a real-world Chinese-English benchmark dataset, namely Real-CE, for the task of STISR, with an emphasis on restoring structurally complex Chinese characters. The benchmark provides 1,935/783 real-world LR-HR text image pairs (containing 33,789 text lines in total) for training/testing in 2× and 4× zooming modes, complemented by detailed annotations, including detection boxes and text transcripts. Moreover, we design an edge-aware learning method, which provides structural supervision in the image and feature domains, to effectively reconstruct the dense structures of Chinese characters. We conduct experiments on the proposed Real-CE benchmark and evaluate existing STISR models with and without our edge-aware loss. The benchmark, including data and source code, is available at this https URL.
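
The abstract does not spell out the edge-aware loss, so the following is only a rough illustration of the general idea of adding edge-map supervision in the image domain. It is a generic Sobel-based sketch, not the paper's formulation: the function names, the use of a Sobel operator, the L1 distance, and the `edge_weight` parameter are all assumptions made for illustration, and the feature-domain supervision mentioned in the abstract is not covered.

```python
# Illustrative sketch only: a generic Sobel-based edge loss on grayscale images.
# This is NOT the Real-CE paper's loss; every name and design choice here is assumed.

SOBEL_X = ((-1, 0, 1), (-2, 0, 2), (-1, 0, 1))
SOBEL_Y = ((-1, -2, -1), (0, 0, 0), (1, 2, 1))


def sobel_magnitude(img):
    """Return the gradient-magnitude map of a 2D grayscale image (list of rows).

    Only interior pixels are computed, so the output is (H-2) x (W-2).
    """
    h, w = len(img), len(img[0])
    out = []
    for y in range(1, h - 1):
        row = []
        for x in range(1, w - 1):
            gx = gy = 0.0
            for dy in range(3):
                for dx in range(3):
                    p = img[y + dy - 1][x + dx - 1]
                    gx += SOBEL_X[dy][dx] * p
                    gy += SOBEL_Y[dy][dx] * p
            row.append((gx * gx + gy * gy) ** 0.5)
        out.append(row)
    return out


def l1(a, b):
    """Mean absolute difference between two equally sized 2D maps."""
    n = len(a) * len(a[0])
    return sum(abs(p - q) for ra, rb in zip(a, b) for p, q in zip(ra, rb)) / n


def edge_aware_loss(sr, hr, edge_weight=1.0):
    """Pixel L1 loss plus a weighted L1 loss between Sobel edge maps."""
    return l1(sr, hr) + edge_weight * l1(sobel_magnitude(sr), sobel_magnitude(hr))


if __name__ == "__main__":
    # A sharp vertical stroke edge (HR) versus a fully blurred, flat estimate (SR).
    hr = [[0.0, 0.0, 1.0, 1.0] for _ in range(4)]
    sr = [[0.5] * 4 for _ in range(4)]
    print(edge_aware_loss(sr, hr))
```

In this toy case the flat estimate is penalized mostly through the edge term, since it has no gradients where the reference has a strong stroke boundary. That is the intuition behind supervising structure in addition to pixel values.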

Comments:	Accepted by ICCV2023
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2308.03262 [cs.CV]
	(or arXiv:2308.03262v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2308.03262

Submission history

From: Jianqi Ma
[v1] Mon, 7 Aug 2023 02:57:48 UTC (9,957 KB)
