AENet: Image Retrieval of Kazakh Handwritten Documents Based on Attention Mechanism and Feature Aggregation

被引:0
|
作者
Chen, Gang [1 ]
Xu, Xuebin [1 ]
Wang, Jiaoyan [1 ]
Mamat, Hornisa [1 ]
Ubul, Kurban [1 ]
机构
[1] Xinjiang Univ, Sch Comp Sci & Technol, Xinjiang Key Lab Multilingual Informat Technol, Urumqi 830046, Peoples R China
基金
中国国家自然科学基金;
关键词
offline handwriting; Kazakh language; image retrieval; attention mechanism; feature representation;
D O I
10.1002/tee.24122
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Kazakh is one of the multilingual languages of China and is widely spoken in some areas of Xinjiang, China. However, due to the fact that Kazakh is a language in which several characters are glued together to form a continuous word with a unique shape and complex structural combinations of relationships. This paper explores a solution for offline image retrieval of handwritten Kazakh words, which is a challenging task because, due to the lack of relevant datasets and the special writing morphology of the Kazakh language, traditional text image retrieval algorithms often struggle to achieve satisfactory results when dealing with writing styles that are varied and adherent to the language. Therefore, a dataset of offline Kazakh handwritten document images was created in this paper. The dataset contains 300 pages of document images with 20 500 words. Then, a new model called the 'AENet' is proposed. The model utilizes an attention mechanism to focus more finely on focal regions such as centers, inflection points, and contours of handwritten word images and to capture important local features from different scales. Fusion space pyramid pooling, feature aggregation, encoding operations, and feature downscaling and reconstruction are used to extract and reconstruct more representative feature representations from local to global to capture the overall information in the word images. Through experimental evaluation on Kazak-80, Zilla-64, and HWDB1.1-375 datasets, it is verified that the method significantly improves the mAP for image retrieval of handwritten words, which is especially applicable to adhesive languages like Kazakh. (c) 2024 Institute of Electrical Engineers of Japan and Wiley Periodicals LLC.
引用
收藏
页码:1640 / 1651
页数:12
相关论文
共 50 条
  • [21] Clothing image segmentation method based on feature learning and attention mechanism
    Gu M.
    Liu J.
    Li L.
    Cui L.
    Fangzhi Xuebao/Journal of Textile Research, 2022, 43 (11): : 163 - 171
  • [22] Image Retrieval Based on Saliency Attention
    Wen, Zhenkun
    Gao, Jinhua
    Luo, Ruijie
    Wu, Huisi
    FOUNDATIONS OF INTELLIGENT SYSTEMS (ISKE 2013), 2014, 277 : 177 - 188
  • [23] Image retrieval based on feature element
    Li, Qing
    Zhang, Yu-Jin
    Dianzi Yu Xinxi Xuebao/Journal of Electronics and Information Technology, 2003, 25 (12):
  • [24] Image retrieval based on the texture feature
    Multimedia Technology Institute, Xidian University, Xi'an 710071, China
    不详
    Guangdianzi Jiguang, 2008, 2 (230-232):
  • [25] Feature Based Image Retrieval Algorithm
    Nimi, P. U.
    Tripti, C.
    ADVANCES IN COMPUTING AND COMMUNICATIONS, PT 4, 2011, 193 : 46 - 55
  • [26] Dual-Attention-Based Feature Aggregation Network for Infrared and Visible Image Fusion
    Tang, Zhimin
    Xiao, Guobao
    Guo, Junwen
    Wang, Shiping
    Ma, Jiayi
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2023, 72
  • [27] Handwritten Chinese Character Recognition Based on Attention Mechanism
    Huang Wanrong
    He Kai
    Liu Kun
    Gao Shengnan
    LASER & OPTOELECTRONICS PROGRESS, 2020, 57 (08)
  • [28] Image Retrieval Based on Shape Feature and Color Feature
    Liu, Jun-ling
    Zhao, Hong-Wei
    Zhao, Hao-yu
    Chen, Chong-xu
    MATERIAL AND MANUFACTURING TECHNOLOGY II, PTS 1 AND 2, 2012, 341-342 : 560 - +
  • [29] Research on Graph Feature Data Aggregation Algorithm Based on Graph Convolution and Attention Mechanism
    Lei, Wenhan
    Liu, Xinyuan
    Ye, Lin
    Hu, Tao
    Gong, Lei
    Luo, Junxia
    2024 4TH INTERNATIONAL CONFERENCE ON ELECTRONIC MATERIALS AND INFORMATION ENGINEERING, EMIE 2024, 2024, : 146 - 150
  • [30] Radar Signal Recognition Based on CNN With a Hybrid Attention Mechanism and Skip Feature Aggregation
    Guo, Yuanpu
    Sun, Haixin
    Liu, Hui
    Deng, Zhenmiao
    IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2024, 73 : 1 - 13