Manuscripts Image Retrieval Using Deep Learning Incorporating a Variety of Fusion Levels

Cited by: 6
Authors
Khayyat, Manal M. [1, 2]
Elrefaei, Lamiaa A. [1, 3]
Affiliations
[1] King Abdulaziz Univ, Fac Comp & Informat Technol, Dept Comp Sci, Jeddah 21589, Saudi Arabia
[2] Umm Al Qura Univ, Preparatory Year Joint Med Track, Dept Comp Sci, Mecca 21955, Saudi Arabia
[3] Benha Univ, Dept Elect Engn, Fac Engn Shoubra, Cairo 11629, Egypt
Keywords
Image retrieval; Feature extraction; Machine learning; Visualization; Image segmentation; Image color analysis; Semantics; fusion methods; similarity measurement; deep learning (DL); convolutional neural networks (CNN); long short-term memory (LSTM); DECISION LEVEL; SCORE; CLASSIFICATION; RECOGNITION; INFORMATION; FEATURES; MODEL; REPRESENTATION; ATTENTION; NETWORKS;
DOI
10.1109/ACCESS.2020.3010882
Chinese Library Classification
TP [Automation technology, computer technology];
Subject classification code
0812;
Abstract
The instantaneous search and retrieval of the images most relevant to a specific query image is a desirable application for all digital libraries. Automatic extraction and classification according to the most distinguishable features is a crucial step in detecting similarities among images successfully. This study introduces a novel approach that utilizes a fusion model for classifying and retrieving images of historical Arabic manuscripts. To accomplish this goal, the images are first classified according to their deep learning visual features, extracted using a pre-trained convolutional neural network. Then, the texts written in the manuscript images are extracted and pre-processed so that the images can be classified according to their textual features using an optimized bidirectional LSTM deep learning model with attention and batch normalization layers. Finally, the visual and textual deep learning models are fused at three different fusion levels: decision level, feature level, and score level. The score-level fusion model yielded a considerable improvement over each model used individually. Extensive experimentation and evaluation of the proposed fusion method on the collected ancient Arabic manuscripts dataset demonstrated its robustness compared with other state-of-the-art methods, recording 99% classification accuracy and 98% mean accuracy on top-10 image retrieval.
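As a rough illustration of what score-level fusion means in general, the sketch below blends the per-class scores of a visual model and a textual model before picking a class. The weighted-sum rule, the equal default weight, the function names, and the example scores are all assumptions made for illustration; the abstract does not state which fusion rule or weights the authors used.

```python
from typing import List, Sequence


def score_level_fusion(visual_scores: Sequence[float],
                       textual_scores: Sequence[float],
                       visual_weight: float = 0.5) -> List[float]:
    """Blend two per-class score vectors with a weighted sum.

    Assumption: a weighted sum is one common score-level fusion rule;
    it is not confirmed to be the rule used in the paper.
    """
    if len(visual_scores) != len(textual_scores):
        raise ValueError("both models must score the same set of classes")
    if not 0.0 <= visual_weight <= 1.0:
        raise ValueError("visual_weight must lie in [0, 1]")
    textual_weight = 1.0 - visual_weight
    return [visual_weight * v + textual_weight * t
            for v, t in zip(visual_scores, textual_scores)]


def predict_class(scores: Sequence[float]) -> int:
    """Return the index of the highest score."""
    return max(range(len(scores)), key=lambda i: scores[i])


# Hypothetical softmax outputs for one manuscript image over three classes.
visual = [0.50, 0.40, 0.10]   # visual model slightly prefers class 0
textual = [0.20, 0.70, 0.10]  # textual model prefers class 1

fused = score_level_fusion(visual, textual)
print(predict_class(fused))  # prints 1
```

In this toy case the two models disagree, and the fused scores favor the class that the more confident model supports. By contrast, decision-level fusion would combine only the predicted labels, and feature-level fusion would concatenate the extracted feature vectors before classification.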
Pages: 136460-136486
Number of pages: 27
Related papers
50 in total
  • [1] Deep reinforcement learning approach for manuscripts image classification and retrieval
    Khayyat, Manal M.
    Elrefaei, Lamiaa A.
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2022, 81 (11) : 15395 - 15417
  • [2] Deep reinforcement learning approach for manuscripts image classification and retrieval
    Manal M. Khayyat
    Lamiaa A. Elrefaei
    [J]. Multimedia Tools and Applications, 2022, 81 : 15395 - 15417
  • [3] A Hashing Image Retrieval Method Based on Deep Learning and Local Feature Fusion
    Nie, Yi-Liang
    Du, Ji-Xiang
    Fan, Wen-Tao
    [J]. INTELLIGENT COMPUTING THEORIES AND APPLICATION, ICIC 2017, PT I, 2017, 10361 : 200 - 210
  • [4] Visible and Infrared Image Fusion Using Deep Learning
    Zhang, Xingchen
    Demiris, Yiannis
    [J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 45 (08) : 10535 - 10554
  • [5] Content based image retrieval using deep learning process
    R. Rani Saritha
    Varghese Paul
    P. Ganesh Kumar
    [J]. Cluster Computing, 2019, 22 : 4187 - 4200
  • [6] Content Based Image Retrieval Approach using Deep Learning
    Abdel-Nabi, Heba
    Al-Naymat, Ghazi
    Awajan, Arafat
    [J]. 2019 2ND INTERNATIONAL CONFERENCE ON NEW TRENDS IN COMPUTING SCIENCES (ICTCS), 2019, : 170 - 177
  • [7] Image Retrieval Using Latent Feature Learning By Deep Architecture
    Garg, Nishu
    Nikhitha, P.
    Tripathy, B. K.
    [J]. 2014 IEEE INTERNATIONAL CONFERENCE ON COMPUTATIONAL INTELLIGENCE AND COMPUTING RESEARCH (IEEE ICCIC), 2014, : 663 - 666
  • [8] Bird Image Retrieval and Recognition Using a Deep Learning Platform
    Huang, Yo-Ping
    Basanta, Haobijam
    [J]. IEEE ACCESS, 2019, 7 : 66980 - 66989
  • [9] Content based image retrieval using deep learning process
    Saritha, R. Rani
    Paul, Varghese
    Kumar, P. Ganesh
    [J]. CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS, 2019, 22 (02): : S4187 - S4200
  • [10] Fusion of Deep Learning and Compressed Domain Features for Content-Based Image Retrieval
    Liu, Peizhong
    Guo, Jing-Ming
    Wu, Chi-Yi
    Cai, Danlin
    [J]. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2017, 26 (12) : 5706 - 5717