Text Extraction with Optimal Bi-LSTM

被引:0
|
作者
Nayef, Bahera H. [1 ]
Abdullah, Siti Norul Huda Sheikh [2 ]
Sulaiman, Rossilawati [2 ]
Saeed, Ashwaq Mukred [3 ]
机构
[1] Ibn Khaldun Univ Coll, Comp Tech Engn Dept, Baghdad 10011, Iraq
[2] Univ Kebangsaan Malaysia, Fac Informat Sci & Technol, Bangi 43600, Selangor, Malaysia
[3] Xiamen Univ Malaysia, Sch Elect Engn & Artificial Intelligence, Sepang 43900, Malaysia
来源
CMC-COMPUTERS MATERIALS & CONTINUA | 2023年 / 76卷 / 03期
关键词
Deep neural network; text features; dual max-pooling; concatenating convolution neural networks; bidirectional long short memory; text connector characteristics; NEURAL-NETWORKS; RECOGNITION;
D O I
10.32604/cmc.2023.039528
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Text extraction from images using the traditional techniques of image collecting, and pattern recognition using machine learning consume time due to the amount of extracted features from the images. Deep Neural Networks introduce effective solutions to extract text features from images using a few techniques and the ability to train large datasets of images with significant results. This study proposes using Dual Maxpooling and concatenating convolution Neural Networks (CNN) layers with the activation functions Relu and the Optimized Leaky Relu (OLRelu). The proposed method works by dividing the word image into slices that contain characters. Then pass them to deep learning layers to extract feature maps and reform the predicted words. Bidirectional Short Memory (BiLSTM) layers extract more compelling features and link the time sequence from forward and backward directions during the training phase. The Connectionist Temporal Classification (CTC) function calcifies the training and validation loss rates. In addition to decoding the extracted feature to reform characters again and linking them according to their time sequence. The proposed model performance is evaluated using training and validation loss errors on the Mjsynth and Integrated Argument Mining Tasks (IAM) datasets. The result of IAM was 2.09% for the average loss errors with the proposed dual Maxpooling and OLRelu. In the Mjsynth dataset, the best validation loss rate shrunk to 2.2% by applying concatenating CNN layers, and Relu.
引用
收藏
页码:3548 / 3566
页数:19
相关论文
共 50 条
  • [1] Entity Relationship Extraction Based on Bi-LSTM and Attention Mechanism
    Wei, Ming
    Xu, Zhipeng
    Hu, Jiwei
    [J]. PROCEEDINGS OF 2021 2ND INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND INFORMATION SYSTEMS (ICAIIS '21), 2021,
  • [2] Text multi-label sentiment analysis based on Bi-LSTM
    Hu, Junlin
    Kang, Xin
    Nishide, Shun
    Ren, Fuji
    [J]. PROCEEDINGS OF 2019 6TH IEEE INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND INTELLIGENCE SYSTEMS (CCIS), 2019, : 16 - 20
  • [3] BI-LSTM Based Encoding and GAN for Text-to-Image Synthesis
    Talasila, Vamsidhar
    Narasingarao, M. R.
    [J]. SENSING AND IMAGING, 2022, 23 (01):
  • [4] BI-LSTM Based Encoding and GAN for Text-to-Image Synthesis
    Vamsidhar Talasila
    M. R. Narasingarao
    [J]. Sensing and Imaging, 2022, 23
  • [5] A Bi-LSTM mention hypergraph model with encoding schema for mention extraction
    Lin, Jerry Chun-Wei
    Shao, Yinan
    Zhou, Yujie
    Pirouz, Matin
    Chen, Hsing-Chung
    [J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2019, 85 : 175 - 181
  • [6] Named Entity Recognition for Biomedical Patent Text using Bi-LSTM Variants
    Saad, Farag
    [J]. IIWAS2019: THE 21ST INTERNATIONAL CONFERENCE ON INFORMATION INTEGRATION AND WEB-BASED APPLICATIONS & SERVICES, 2019, : 617 - 621
  • [7] Identifying Financial Text Causality with Bi-LSTM and Two-way CNN
    Zhang, Shunxiang
    Zhang, Zhenjiang
    Zhu, Guangli
    Zhao, Tong
    Huang, Ju
    [J]. Data Analysis and Knowledge Discovery, 2022, 6 (07) : 118 - 127
  • [8] Sentiment Analysis of Short Informal Text by Tuning BERT - Bi-LSTM Model
    Agrawal, Shreyas
    Dutta, Sumanto
    Patra, Bidyut Kr
    [J]. IEEE EUROCON 2021 - 19TH INTERNATIONAL CONFERENCE ON SMART TECHNOLOGIES, 2021, : 98 - 102
  • [9] Leveraging Biomedical Resources in Bi-LSTM for Drug-Drug Interaction Extraction
    Xu, Bo
    Shi, Xiufeng
    Zhao, Zhehuan
    Zheng, Wei
    [J]. IEEE ACCESS, 2018, 6 : 33432 - 33439
  • [10] Construction of an Adverse Drug Reaction Extraction Model Based on Bi-LSTM and CRF
    Zhu, Xiaoxiao
    Yang, Zunqi
    Liu, Jing
    [J]. Data Analysis and Knowledge Discovery, 2019, 3 (02) : 90 - 97