An Efficient Machine Learning-based Text Summarization in the Malayalam Language

被引:0
|
作者
Haroon, Rosna P. [1 ]
Abdul Gafur, M. [2 ]
Barakkath Nisha, U. [3 ]
机构
[1] APJ Abdul Kalam Technol Univ, Dept CSE, Ilahia Coll Engn & Technol, Thiruvananthapuram, Kerala, India
[2] APJ Abdul Kalam Technol Univ, Ilahia Coll Engn & Technol, Thiruvananthapuram, Kerala, India
[3] Sri Krishna Coll Engn & Technol, Dept IT, Coimbatore, Tamil Nadu, India
关键词
Malayalam Text Summarization; Supervised Machine Learning; SVM; Text Mining; Sentence Extraction; Summary Generation;
D O I
10.3837/tiis.2022.06.001
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Automatic text summarization is a procedure that packs enormous content into a more limited book that incorporates significant data. Malayalam is one of the toughest languages utilized in certain areas of India, most normally in Kerala and in Lakshadweep. Natural language processing in the Malayalam language is relatively low due to the complexity of the language as well as the scarcity of available resources. In this paper, a way is proposed to deal with the text summarization process in Malayalam documents by training a model based on the Support Vector Machine classification algorithm. Different features of the text are taken into account for training the machine so that the system can output the most important data from the input text. The classifier can classify the most important, important, average, and least significant sentences into separate classes and based on this, the machine will be able to create a summary of the input document. The user can select a compression ratio so that the system will output that much fraction of the summary. The model performance is measured by using different genres of Malayalam documents as well as documents from the same domain. The model is evaluated by considering content evaluation measures precision, recall, F score, and relative utility. Obtained precision and recall value shows that the model is trustable and found to be more relevant compared to the other summarizers.
引用
收藏
页码:1778 / 1799
页数:22
相关论文
共 50 条
  • [21] Arabic natural language processing and machine learning-based systems
    Larabi Marie-Sainte S.
    Alalyani N.
    Alotaibi S.
    Ghouzali S.
    Abunadi I.
    [J]. IEEE Access, 2019, 7 : 7011 - 7020
  • [22] Arabic Natural Language Processing and Machine Learning-Based Systems
    Marie-Sainte, Souad Larabi
    Alalyani, Nada
    Alotaibi, Sihaam
    Ghouzali, Sanaa
    Abunadi, Ibrahim
    [J]. IEEE ACCESS, 2019, 7 : 7011 - 7020
  • [23] Text Chunker for Malayalam using Memory-Based Learning
    Raj, Rekha C. T.
    Raj, Reghu P. C.
    [J]. 2015 INTERNATIONAL CONFERENCE ON CONTROL COMMUNICATION & COMPUTING INDIA (ICCC), 2015, : 595 - 599
  • [24] Social-sum-Mal: A Dataset for Abstractive Text Summarization in Malayalam
    Rahul, Raj M.
    Pankaj, Dhanya S
    [J]. ACM Transactions on Asian and Low-Resource Language Information Processing, 2024, 23 (11)
  • [25] Personalized Text Content Summarizer for Mobile Learning: An Automatic Text Summarization System with Relevance Based Language Model
    Yang, Guangbing
    Wen, Dunwei
    Kinshuk
    Chen, Nian-Shing
    Sutinen, Erkki
    [J]. 2012 IEEE FOURTH INTERNATIONAL CONFERENCE ON TECHNOLOGY FOR EDUCATION (T4E), 2012, : 90 - 97
  • [26] Attention based Abstractive Summarization of Malayalam Document
    Nambiar, Sindhya K.
    Peter, David S.
    Idicula, Sumam Mary
    [J]. AI IN COMPUTATIONAL LINGUISTICS, 2021, 189 : 250 - 257
  • [27] A Machine Learning-Based Protocol for Efficient Routing in Opportunistic Networks
    Sharma, Deepak K.
    Dhurandher, Sanjay K.
    Woungang, Isaac
    Srivastava, Rohit K.
    Mohananey, Anhad
    Rodrigues, Joel J. P. C.
    [J]. IEEE SYSTEMS JOURNAL, 2018, 12 (03): : 2207 - 2213
  • [28] A Machine Learning-Based Approach for Efficient Cloud Service Selection
    Gandhi, Uttam
    Bothera, Abhi
    Garg, Neha
    Neeraj
    Gupta, Indrajeet
    [J]. Communications in Computer and Information Science, 2022, 1528 CCIS : 626 - 632
  • [29] An Efficient Design of a Machine Learning-Based Elderly Fall Detector
    Nguyen, L. P.
    Saleh, M.
    Jeannes, R. Le Bouquin
    [J]. INTERNET OF THINGS (IOT) TECHNOLOGIES FOR HEALTHCARE, HEALTHYIOT 2017, 2018, 225 : 34 - 41
  • [30] Machine learning-based energy efficient technologies for smart grid
    Yao, Rui
    Li, Jun
    Zuo, Baofeng
    Hu, Jianli
    [J]. INTERNATIONAL TRANSACTIONS ON ELECTRICAL ENERGY SYSTEMS, 2021, 31 (09):