Automatic Text Summarization in Natural Language Processing

被引:0
|
作者
Desai, M. R. [1 ]
Gachhinakatti, Bhagyashree [1 ]
Balaganur, Pooja [1 ]
Rajeshwari, Y. [1 ]
Rathod, Laxmi [1 ]
机构
[1] BLDEAs VP Dr PG Halakatti Coll Engn & Technol, Dept Comp Sci & Engn, Vijayapur, India
关键词
Natural Language Processing; Text Summarization; Sentence Ranking; Word Frequency; ROUGE;
D O I
10.1109/ICMNWC52512.2021.9688499
中图分类号
TN [电子技术、通信技术];
学科分类号
0809 ;
摘要
Text summarization is one of the challenging and engrossing problems in the field of Natural Language Processing. The goal of Text Summarization is to read, understand and extract meaningful information from the unstructured data. The idea behind summarization is to get concise and precise summary from the original document. We used Sentence Ranking method based on Word Frequency to determine the important sentences in original text. The Summarizer uses three features to determine important Sentence in a document. The Term Frequency, similarity with the Title and Sentence score are the features considered. Values are assigned to each word and a sentence in a document and create a score for each word and a sentence. Normalize the scores to avoid raw scores that skew the results. The Scores of top 'N' sentences are taken for summarization. Text cleaning, Sentence tokenization, Word tokenization, Normalized Word frequency table generation, and Summarization are steps used in the Sentence Ranking Method. The method is implemented on WikiHow dataset. The results of proposed model are evaluated with ROUGE toolkit which has three evaluation metrics- Precision, Recall and F-Score.
引用
收藏
页数:5
相关论文
共 50 条
  • [1] Automatic Video summarization with Timestamps using natural language processing text fusion
    Emad, Ahmed
    Bassel, Fady
    Refaat, Mark
    Abdelhamed, Mohamed
    Shorim, Nada
    AbdelRaouf, Ashraf
    [J]. 2021 IEEE 11TH ANNUAL COMPUTING AND COMMUNICATION WORKSHOP AND CONFERENCE (CCWC), 2021, : 60 - 66
  • [2] A SUMMARIZATION METHOD AUTOMATIC TEXT THROUGH STATISTICAL DATA AND NATURAL LANGUAGE PROCESSING
    de Souza, Osvaldo
    Tabosa, Hamilton Rodrigues
    de Oliveira, Davi Martins
    de Souza Oliveira, Mayra Helena
    [J]. INFORMACAO & SOCIEDADE-ESTUDOS, 2017, 27 (03) : 307 - 320
  • [3] Optimizing Text Summarization with Sentence Clustering and Natural Language Processing
    Edress, Zahir
    Ortakci, Yasin
    [J]. International Journal of Advanced Computer Science and Applications, 2024, 15 (10) : 1123 - 1132
  • [4] Two-Level Text Summarization with Natural Language Processing
    Hande, Rupali
    Sidhwani, Avinash
    Sidhwani, Deepesh
    Shiv, Monil
    Kewalramani, Divesh
    [J]. SECOND INTERNATIONAL CONFERENCE ON COMPUTER NETWORKS AND COMMUNICATION TECHNOLOGIES, ICCNCT 2019, 2020, 44 : 285 - 291
  • [5] Natural Language Processing (NLP) based Text Summarization - A Survey
    Awasthi, Ishitva
    Gupta, Kuntal
    Bhogal, Prabjot Singh
    Anand, Sahejpreet Singh
    Soni, Piyush Kumar
    [J]. PROCEEDINGS OF THE 6TH INTERNATIONAL CONFERENCE ON INVENTIVE COMPUTATION TECHNOLOGIES (ICICT 2021), 2021, : 1310 - 1317
  • [6] Automatic Text Summarization of News Articles in Serbian Language
    Kosmajac, Dijana
    Keselj, Vlado
    [J]. 2019 18TH INTERNATIONAL SYMPOSIUM INFOTEH-JAHORINA (INFOTEH), 2019,
  • [7] Automatic Text Summarization for Indonesian Language Using TextTeaser
    Gunawan, D.
    Pasaribu, A.
    Rahmat, R. F.
    Budiarto, R.
    [J]. IAES INTERNATIONAL CONFERENCE ON ELECTRICAL ENGINEERING, COMPUTER SCIENCE AND INFORMATICS, 2017, 190
  • [8] An Associative Concept Dictionary for Natural Language Processing: Text Summarization and Word Sense Disambiguation
    Okamoto, Jun
    Ishizaki, Shun
    [J]. JOURNAL OF COGNITIVE SCIENCE, 2011, 12 (03) : 259 - 276
  • [9] Related Blogs' Summarization With Natural Language Processing
    Baliyan, Niyati
    Sharma, Aarti
    [J]. COMPUTER JOURNAL, 2021, 64 (03): : 347 - 357
  • [10] Automatic Text Summarization
    Soumya, S.
    Kumar, Geethu S.
    Naseem, Rasia
    Mohan, Saumya
    [J]. COMPUTATIONAL INTELLIGENCE AND INFORMATION TECHNOLOGY, 2011, 250 : 787 - 789