A New Language-Independent Deep CNN for Scene Text Detection and Style Transfer in Social Media Images

Cited by: 7
Authors
Shivakumara, Palaiahnakote [1]
Banerjee, Ayan [2]
Pal, Umapada [2]
Nandanwar, Lokesh [1]
Lu, Tong [3]
Liu, Cheng-Lin [4, 5]
Affiliations
[1] Univ Malaya, Fac Comp Sci & Informat Technol, Dept Comp Syst & Technol, Kuala Lumpur 50603, Malaysia
[2] Indian Stat Inst, Kolkata 700108, India
[3] Nanjing Univ, Dept Comp Sci & Technol, Nanjing 210093, Peoples R China
[4] Chinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit, Beijing 100190, Peoples R China
[5] Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing 100049, Peoples R China
Keywords
Text detection; style transfer; deep learning; EfficientNet; social media images
DOI
10.1109/TIP.2023.3287038
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Detecting text in social media images and transferring its style is challenging, because different social media platforms degrade image quality and natural scenes contain text in arbitrary languages. This paper presents a novel end-to-end model for text detection and text style transfer in social media images. The key notion of the proposed work is to find dominant information, such as fine details in the degraded images (social media images), and then restore the structure of character information. To that end, we first introduce a novel idea of extracting gradients from the frequency domain of the input image to reduce the adverse effect of different social media, which outputs text candidate points. The text candidates are then connected into components and used for text detection via a UNet++-like network with an EfficientNet backbone (EffiUNet++). Next, to deal with the style transfer issue, we devise a generative model comprising a target encoder and style parameter networks (TESP-Net), which generates the target characters by leveraging the recognition results from the first stage. Specifically, a series of residual mappings and a position attention module are devised to improve the shape and structure of the generated characters. The whole model is trained end-to-end to optimize performance. Experiments on our social media dataset and on benchmark datasets for natural scene text detection and text style transfer show that the proposed model outperforms existing text detection and style transfer methods in multilingual and cross-language scenarios.
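The abstract does not specify how gradients are extracted from the frequency domain, so the following is only a minimal sketch of one generic way to do it, assuming the Fourier derivative property (multiplying the spectrum by i·ω) and a simple magnitude threshold for picking candidate points. The function names, the threshold rule, and the dependency-free naive DFT are all illustrative assumptions, not the authors' method.

```python
import cmath
import math

def dft(signal, inverse=False):
    """Naive O(n^2) discrete Fourier transform, kept dependency-free."""
    n = len(signal)
    sign = 1 if inverse else -1
    out = []
    for k in range(n):
        acc = 0j
        for t, value in enumerate(signal):
            acc += value * cmath.exp(sign * 2j * math.pi * k * t / n)
        out.append(acc / n if inverse else acc)
    return out

def spectral_derivative(signal):
    """Differentiate a periodic 1-D signal by scaling its spectrum by i*omega."""
    n = len(signal)
    scaled = []
    for k, coeff in enumerate(dft(signal)):
        # Map bin index k to a signed frequency; zero the Nyquist bin for even n.
        freq = k if k < n / 2 else k - n
        if n % 2 == 0 and k == n // 2:
            freq = 0
        scaled.append(coeff * 1j * 2 * math.pi * freq / n)
    return [value.real for value in dft(scaled, inverse=True)]

def gradient_magnitude(image):
    """Per-pixel gradient magnitude from row-wise and column-wise derivatives."""
    rows, cols = len(image), len(image[0])
    gx = [spectral_derivative(row) for row in image]
    columns = [[image[r][c] for r in range(rows)] for c in range(cols)]
    gy_t = [spectral_derivative(col) for col in columns]  # indexed [col][row]
    return [[math.hypot(gx[r][c], gy_t[c][r]) for c in range(cols)]
            for r in range(rows)]

def candidate_points(image, threshold):
    """Hypothetical selection rule: keep pixels whose magnitude exceeds threshold."""
    mag = gradient_magnitude(image)
    return [(r, c) for r, row in enumerate(mag)
            for c, m in enumerate(row) if m > threshold]
```

For real images, a library FFT would replace the naive `dft` here, and the threshold would need tuning; the sketch only shows how a frequency-domain gradient can yield a sparse set of candidate pixels.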
Pages: 3552-3566
Number of pages: 15