Intelligent Text Mining Model for English Language Using Deep Neural Network

被引:0
|
作者
Singh, Shashi Pal [1 ]
Kumar, Ajai [1 ]
Darbari, Hemant [1 ]
Kaur, Balvinder [2 ]
Tiwari, Kanchan [2 ]
Joshi, Nisheeth [2 ]
机构
[1] Ctr Dev Adv Comp, AAIG, Pune, Maharashtra, India
[2] Banasthali Vidyapith, Vanasthali, Rajasthan, India
关键词
Text processing; Sentence processing; Deep neural network (DNN); Recurrent Neural Network (RNN); Summarization;
D O I
10.1007/978-3-319-63645-0_54
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Today there exist various sources that provide information in very massive amount to serve the demand over the internet, which creates huge collection of heterogeneous data. Thus existing data can be categorized as unstructured and structured data. In this paper we propose an idea of a tool which intelligently preprocesses the unstructured data by segmenting the whole document into number of sentences, using deep learning concepts with word2vec [11] and a Recurrent Neural Network [13]. At the beginning step we use word2vec which was introduced by Tomas Mikolov with his team at Google, to generate vectors of the inputted text content which will be further forwarded to Recurrent Neural Network. RNN takes this series of vectors as input and trained Data Cleaning Recurrent Neural Network model will perform preprocessing task (including cleaning of missing, grammatically incorrect, misspelled data) to produce structured results, which then passed into automatic summarization module to generate desired summary.
引用
收藏
页码:473 / 486
页数:14
相关论文
共 50 条
  • [1] Irony detection using neural network language model, psycholinguistic features and text mining
    Ravi, Kumar
    Ravi, Vadlamani
    [J]. PROCEEDINGS OF 2018 IEEE 17TH INTERNATIONAL CONFERENCE ON COGNITIVE INFORMATICS & COGNITIVE COMPUTING (ICCI*CC 2018), 2018, : 254 - 260
  • [2] Document classification using a deep neural network in text mining
    Lee, Bo-Hui
    Lee, Su-Jin
    Choi, Yong-Seok
    [J]. KOREAN JOURNAL OF APPLIED STATISTICS, 2020, 33 (05) : 615 - 625
  • [3] Convolution Neural Network for Text Mining and Natural Language Processing
    Widiastuti, N., I
    [J]. 2ND INTERNATIONAL CONFERENCE ON INFORMATICS, ENGINEERING, SCIENCE, AND TECHNOLOGY (INCITEST 2019), 2019, 662
  • [4] A Novel Text Mining Algorithm based on Deep Neural Network
    Sheng, Xiaobao
    Wu, Xin
    Luo, Yimin
    [J]. 2016 INTERNATIONAL CONFERENCE ON INVENTIVE COMPUTATION TECHNOLOGIES (ICICT), VOL 2, 2016, : 617 - 622
  • [5] Web text mining using a hybrid intelligent system based on KDT, expert system and neural network
    Fukuda, FH
    Passos, ELP
    Pacheco, MA
    Neto, LB
    Valério, J
    De Roberto, V
    Antonio, ER
    Chiganer, L
    [J]. DATA MINING II, 2000, 2 : 363 - 372
  • [6] Speech Recognition Model for Assamese Language Using Deep Neural Network
    Singh, Moirangthem Tiken
    Barman, Partha Pratim
    Gogoi, Rupjyoti
    [J]. 2018 INTERNATIONAL CONFERENCE ON RECENT INNOVATIONS IN ELECTRICAL, ELECTRONICS & COMMUNICATION ENGINEERING (ICRIEECE 2018), 2018, : 2722 - 2727
  • [7] An Intelligent Vehicle Price Estimation Approach Using a Deep Neural Network Model
    Alnajim, Thuraya
    Alshahrani, Nouf
    Asiri, Omar
    [J]. WORLD ELECTRIC VEHICLE JOURNAL, 2024, 15 (08):
  • [8] A Deep Convolutional Neural Network Model for Intelligent Discrimination between Coal and Rocks in Coal Mining Face
    Si, Lei
    Xiong, Xiangxiang
    Wang, Zhongbin
    Tan, Chao
    [J]. MATHEMATICAL PROBLEMS IN ENGINEERING, 2020, 2020
  • [9] Improving Acoustic Model for English ASR System using Deep Neural Network
    Quoc Bao Nguyen
    Tat Thang Vu
    Chi Mai Luong
    [J]. 2015 IEEE RIVF INTERNATIONAL CONFERENCE ON COMPUTING & COMMUNICATION TECHNOLOGIES - RESEARCH, INNOVATION, AND VISION FOR THE FUTURE (RIVF), 2015, : 25 - 29
  • [10] Deep Neural Network Models for Paraphrased Text Classification in the Arabic Language
    Mahmoud, Adnen
    Zrigui, Mounir
    [J]. NATURAL LANGUAGE PROCESSING AND INFORMATION SYSTEMS (NLDB 2019), 2019, 11608 : 3 - 16