AENet: Image Retrieval of Kazakh Handwritten Documents Based on Attention Mechanism and Feature Aggregation

Authors
Chen, Gang [1]
Xu, Xuebin [1]
Wang, Jiaoyan [1]
Mamat, Hornisa [1]
Ubul, Kurban [1]
Affiliation
[1] Xinjiang Univ, Sch Comp Sci & Technol, Xinjiang Key Lab Multilingual Informat Technol, Urumqi 830046, Peoples R China
Funding
National Natural Science Foundation of China
Keywords
offline handwriting; Kazakh language; image retrieval; attention mechanism; feature representation
DOI
10.1002/tee.24122
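Chinese Library Classification
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification codes
0808; 0809
Abstract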
Kazakh is one of the many languages used in China and is widely spoken in some areas of Xinjiang. In written Kazakh, several characters are joined together to form a continuous word with a unique shape and complex structural relationships. This paper explores a solution for offline image retrieval of handwritten Kazakh words. The task is challenging: because relevant datasets are lacking and Kazakh has a distinctive writing morphology, traditional text image retrieval algorithms often struggle to achieve satisfactory results on writing styles that are varied and in which characters adhere to one another. Therefore, a dataset of offline Kazakh handwritten document images was created in this paper. The dataset contains 300 pages of document images with 20,500 words. Then, a new model called 'AENet' is proposed. The model uses an attention mechanism to focus more finely on key regions of handwritten word images, such as centers, inflection points, and contours, and to capture important local features at different scales. Fused spatial pyramid pooling, feature aggregation, encoding operations, and feature dimensionality reduction and reconstruction are then used to build more representative features, from local to global, that capture the overall information in the word images. Experimental evaluation on the Kazak-80, Zilla-64, and HWDB1.1-375 datasets verifies that the method significantly improves the mAP for image retrieval of handwritten words, and that it is especially applicable to languages with joined-up (adhesive) scripts such as Kazakh. © 2024 Institute of Electrical Engineers of Japan and Wiley Periodicals LLC.
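The abstract reports results as mAP (mean average precision) but does not give the evaluation protocol. As a rough illustration only, the sketch below computes mAP in its usual retrieval form, assuming each query returns a ranking of the whole gallery and that a retrieved image counts as relevant when its word label equals the query's label. The function names and the toy labels are hypothetical and are not taken from the paper.

```python
from typing import Hashable, Sequence, Tuple


def average_precision(ranked_labels: Sequence[Hashable], query_label: Hashable) -> float:
    """AP for one query: mean of precision@k over the ranks k that hold a relevant item.

    Assumption: ranked_labels covers the whole gallery, so every relevant item
    appears somewhere in the ranking.
    """
    hits = 0
    precision_sum = 0.0
    for rank, label in enumerate(ranked_labels, start=1):
        if label == query_label:
            hits += 1
            precision_sum += hits / rank  # precision at this rank
    return precision_sum / hits if hits else 0.0


def mean_average_precision(queries: Sequence[Tuple[Hashable, Sequence[Hashable]]]) -> float:
    """mAP: the average of per-query AP over (query_label, ranked_labels) pairs."""
    if not queries:
        return 0.0
    total = sum(average_precision(ranked, label) for label, ranked in queries)
    return total / len(queries)


# Toy example with made-up word labels (not data from the paper).
toy_queries = [
    ("word_a", ["word_a", "word_b", "word_a"]),
    ("word_b", ["word_a", "word_b"]),
]
toy_map = mean_average_precision(toy_queries)
```

A ranking that places every relevant image first gives an AP of 1.0 for that query, so the score rewards ranking quality and not only the presence of matches.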
Pages: 1640-1651
Page count: 12