AENet: Image Retrieval of Kazakh Handwritten Documents Based on Attention Mechanism and Feature Aggregation

Authors
Chen, Gang [1]
Xu, Xuebin [1]
Wang, Jiaoyan [1]
Mamat, Hornisa [1]
Ubul, Kurban [1]
Affiliations
[1] Xinjiang Univ, Sch Comp Sci & Technol, Xinjiang Key Lab Multilingual Informat Technol, Urumqi 830046, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
offline handwriting; Kazakh language; image retrieval; attention mechanism; feature representation
DOI
10.1002/tee.24122
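Chinese Library Classification number
TM [Electrical engineering]; TN [Electronic technology, communication technology]
Discipline classification code
0808; 0809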
Abstract
Kazakh is one of the languages of multilingual China and is widely spoken in some areas of Xinjiang. In written Kazakh, however, several characters are joined together to form a continuous word with a unique shape and complex structural relationships. This paper explores a solution for offline image retrieval of handwritten Kazakh words. The task is challenging: because relevant datasets are lacking and Kazakh has a special writing morphology, traditional text image retrieval algorithms often struggle to achieve satisfactory results on writing styles that are varied and whose characters adhere to one another. A dataset of offline Kazakh handwritten document images was therefore created for this paper; it contains 300 pages of document images with 20 500 words. A new model called 'AENet' is then proposed. The model uses an attention mechanism to focus more finely on focal regions of handwritten word images, such as centers, inflection points, and contours, and to capture important local features at different scales. Spatial pyramid pooling fusion, feature aggregation, encoding operations, and feature dimensionality reduction and reconstruction are used to build more representative features, from local to global, that capture the overall information in the word images. Experimental evaluation on the Kazak-80, Zilla-64, and HWDB1.1-375 datasets verifies that the method significantly improves the mAP for image retrieval of handwritten words and is especially applicable to scripts with adhering characters, such as Kazakh. © 2024 Institute of Electrical Engineers of Japan and Wiley Periodicals LLC.
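The abstract reports results as mAP (mean average precision) without defining it. The following is a minimal sketch of the standard retrieval metric, not the authors' evaluation code. The function names are hypothetical, and it assumes each query's ranked list covers the whole gallery, so every relevant item appears somewhere in the list.

```python
from typing import Sequence


def average_precision(relevance: Sequence[bool]) -> float:
    """Average precision for one query.

    relevance[i] is True when the item at rank i + 1 is a correct match.
    Assumption: the list ranks the entire gallery, so the number of True
    entries equals the number of relevant items for the query.
    """
    hits = 0
    precision_sum = 0.0
    for rank, is_relevant in enumerate(relevance, start=1):
        if is_relevant:
            hits += 1
            # Precision measured at the rank of each relevant item.
            precision_sum += hits / rank
    return precision_sum / hits if hits else 0.0


def mean_average_precision(ranked_lists: Sequence[Sequence[bool]]) -> float:
    """Mean of the per-query average precision values."""
    if not ranked_lists:
        return 0.0
    return sum(average_precision(r) for r in ranked_lists) / len(ranked_lists)
```

For a query whose ranked results are relevant at ranks 1 and 3, the average precision is (1/1 + 2/3) / 2; mAP averages this value over all query word images.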
Pages: 1640-1651
Number of pages: 12
Related papers
50 in total
  • [41] Research on Massive Image Retrieval Method of Mobile Terminal Based on Weighted Aggregation Depth Feature
    Zhao, Guotao
    Ding, Jie
    WIRELESS COMMUNICATIONS & MOBILE COMPUTING, 2022, 2022
  • [42] ESA: External Space Attention Aggregation for Image-Text Retrieval
    Zhu, Hongguang
    Zhang, Chunjie
    Wei, Yunchao
    Huang, Shujuan
    Zhao, Yao
    IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 33 (10) : 6131 - 6143
  • [43] QDFA: Query-Dependent Feature Aggregation for Medical Image Retrieval
    Huang, Yonggang
    Ma, Dianfu
    Zhang, Jun
    Zhao, Yongwang
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2012, E95D (01) : 275 - 279
  • [44] Automatic Image Matting with Attention Mechanism and Feature Fusion
    Wang X.
    Wang Q.
    Yang G.
    Guo X.
    Wang, Qiqi (wangqiqi@tust.edu.cn), 2020, Institute of Computing Technology (32): : 1473 - 1483
  • [45] Connected Component Based Word Spotting on Persian Handwritten image documents
    Mobarakeh, M. Iranpour
    Yarmohammadi, H.
    INTERNATIONAL JOURNAL OF NONLINEAR ANALYSIS AND APPLICATIONS, 2019, 10 (02): : 11 - 21
  • [46] Deep Feature Aggregation and Image Re-Ranking With Heat Diffusion for Image Retrieval
    Pang, Shanmin
    Ma, Jin
    Xue, Jianru
    Zhu, Jihua
    Ordonez, Vicente
    IEEE TRANSACTIONS ON MULTIMEDIA, 2019, 21 (06) : 1513 - 1523
  • [47] Plaid fabric image retrieval based on deep local and global features with attention mechanism
    Zhang, Xiaoting
    Zhao, Pengyu
    Pan, Ruru
    Gao, Weidong
    TEXTILE RESEARCH JOURNAL, 2025,
  • [48] Image retrieval based on visual attention model
    Achary, Satrajit
    Devi, M. R. Vimala
    INTERNATIONAL CONFERENCE ON COMMUNICATION TECHNOLOGY AND SYSTEM DESIGN 2011, 2012, 30 : 542 - 545
  • [49] VISUAL ATTENTION FOR CONTENT BASED IMAGE RETRIEVAL
    Papushoy, Alex
    Bors, Adrian G.
    2015 IEEE INTERNATIONAL CONFERENCE ON IMAGE PROCESSING (ICIP), 2015, : 971 - 975
  • [50] Image Retrieval Based on HSV Feature and Edge Direction Feature
    Dong, Yanxue
    PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON ADVANCES IN MECHANICAL ENGINEERING AND INDUSTRIAL INFORMATICS, 2015, 15 : 1002 - 1007