Text Extraction with Optimal Bi-LSTM

被引：0

作者：

Nayef, Bahera H. ^{[1
]}

Abdullah, Siti Norul Huda Sheikh ^{[2
]}

Sulaiman, Rossilawati ^{[2
]}

Saeed, Ashwaq Mukred ^{[3
]}

机构：

[1] Ibn Khaldun Univ Coll, Comp Tech Engn Dept, Baghdad 10011, Iraq

[2] Univ Kebangsaan Malaysia, Fac Informat Sci & Technol, Bangi 43600, Selangor, Malaysia

[3] Xiamen Univ Malaysia, Sch Elect Engn & Artificial Intelligence, Sepang 43900, Malaysia

来源：

CMC-COMPUTERS MATERIALS & CONTINUA | 2023年 / 76卷 / 03期

关键词：

Deep neural network; text features; dual max-pooling; concatenating convolution neural networks; bidirectional long short memory; text connector characteristics; NEURAL-NETWORKS; RECOGNITION;

D O I：

10.32604/cmc.2023.039528

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Text extraction from images using the traditional techniques of image collecting, and pattern recognition using machine learning consume time due to the amount of extracted features from the images. Deep Neural Networks introduce effective solutions to extract text features from images using a few techniques and the ability to train large datasets of images with significant results. This study proposes using Dual Maxpooling and concatenating convolution Neural Networks (CNN) layers with the activation functions Relu and the Optimized Leaky Relu (OLRelu). The proposed method works by dividing the word image into slices that contain characters. Then pass them to deep learning layers to extract feature maps and reform the predicted words. Bidirectional Short Memory (BiLSTM) layers extract more compelling features and link the time sequence from forward and backward directions during the training phase. The Connectionist Temporal Classification (CTC) function calcifies the training and validation loss rates. In addition to decoding the extracted feature to reform characters again and linking them according to their time sequence. The proposed model performance is evaluated using training and validation loss errors on the Mjsynth and Integrated Argument Mining Tasks (IAM) datasets. The result of IAM was 2.09% for the average loss errors with the proposed dual Maxpooling and OLRelu. In the Mjsynth dataset, the best validation loss rate shrunk to 2.2% by applying concatenating CNN layers, and Relu.

引用

页码：3548 / 3566

页数：19

共 50 条

[1] Entity Relationship Extraction Based on Bi-LSTM and Attention Mechanism
Wei, Ming
Xu, Zhipeng
Hu, Jiwei
[J]. PROCEEDINGS OF 2021 2ND INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE AND INFORMATION SYSTEMS (ICAIIS '21), 2021,
[2] Text multi-label sentiment analysis based on Bi-LSTM
Hu, Junlin
Kang, Xin
Nishide, Shun
Ren, Fuji
[J]. PROCEEDINGS OF 2019 6TH IEEE INTERNATIONAL CONFERENCE ON CLOUD COMPUTING AND INTELLIGENCE SYSTEMS (CCIS), 2019, : 16 - 20
[3] BI-LSTM Based Encoding and GAN for Text-to-Image Synthesis
Talasila, Vamsidhar
Narasingarao, M. R.
[J]. SENSING AND IMAGING, 2022, 23 (01):
[4] BI-LSTM Based Encoding and GAN for Text-to-Image Synthesis
Vamsidhar Talasila
M. R. Narasingarao
[J]. Sensing and Imaging, 2022, 23
[5] A Bi-LSTM mention hypergraph model with encoding schema for mention extraction
Lin, Jerry Chun-Wei
Shao, Yinan
Zhou, Yujie
Pirouz, Matin
Chen, Hsing-Chung
[J]. ENGINEERING APPLICATIONS OF ARTIFICIAL INTELLIGENCE, 2019, 85 : 175 - 181
[6] Named Entity Recognition for Biomedical Patent Text using Bi-LSTM Variants
Saad, Farag
[J]. IIWAS2019: THE 21ST INTERNATIONAL CONFERENCE ON INFORMATION INTEGRATION AND WEB-BASED APPLICATIONS & SERVICES, 2019, : 617 - 621
[7] Identifying Financial Text Causality with Bi-LSTM and Two-way CNN
Zhang, Shunxiang
Zhang, Zhenjiang
Zhu, Guangli
Zhao, Tong
Huang, Ju
[J]. Data Analysis and Knowledge Discovery, 2022, 6 (07) : 118 - 127
[8] Sentiment Analysis of Short Informal Text by Tuning BERT - Bi-LSTM Model
Agrawal, Shreyas
Dutta, Sumanto
Patra, Bidyut Kr
[J]. IEEE EUROCON 2021 - 19TH INTERNATIONAL CONFERENCE ON SMART TECHNOLOGIES, 2021, : 98 - 102
[9] Leveraging Biomedical Resources in Bi-LSTM for Drug-Drug Interaction Extraction
Xu, Bo
Shi, Xiufeng
Zhao, Zhehuan
Zheng, Wei
[J]. IEEE ACCESS, 2018, 6 : 33432 - 33439
[10] Construction of an Adverse Drug Reaction Extraction Model Based on Bi-LSTM and CRF
Zhu, Xiaoxiao
Yang, Zunqi
Liu, Jing
[J]. Data Analysis and Knowledge Discovery, 2019, 3 (02) : 90 - 97

← 1 2 3 4 5 →