An Automatic Summarizer for a Low-Resourced Language

被引:1
|
作者
Pattnaik, Sagarika [1 ]
Nayak, Ajit Kumar [2 ]
机构
[1] SOA Univ, Dept CSE, ITER, Bhubaneswar, India
[2] SOA Univ, Dept CS & IT, ITER, Bhubaneswar, India
关键词
NLP; Text summarization; Extractive; Abstractive; F score;
D O I
10.1007/978-981-15-1081-6_24
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the current scenario with the availability of huge volumes of information has given rise to the quench for auto summarizers. Our paper proposes a simple auto summarizer for text document in Odia language, a language that is computationally impoverished. It is a statistical-based extractive text summarizer and does a shallow approach. The summarizer also considers some linguistic features in the process. The sentences for the summary are extracted on the basis of their significant values. The program-generated summary is evaluated against human-generated summaries on the basis of F score values and has got a performance score of 66.92% giving a clear gist of the input text.
引用
收藏
页码:285 / 295
页数:11
相关论文
共 50 条
  • [1] Analysis of Automatic Evaluation Metric on Low-Resourced Language: BERTScore vs BLEU Score
    Datta, Goutam
    Joshi, Nisheeth
    Gupta, Kusum
    [J]. SPEECH AND COMPUTER, SPECOM 2022, 2022, 13721 : 155 - 162
  • [2] Performance of Recent Large Language Models for a Low-Resourced Language
    Jayakody, Ravindu
    Dias, Gihan
    [J]. 2024 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING, IALP 2024, 2024, : 162 - 167
  • [3] A Spell Checker for a Low-resourced and Morphologically Rich Language
    Octaviano, Manolito, Jr.
    Borra, Allan
    [J]. TENCON 2017 - 2017 IEEE REGION 10 CONFERENCE, 2017, : 1853 - 1856
  • [4] Gramatika: A Grammar Checker for the Low-Resourced Filipino Language
    Go, Matthew Phillip
    Nocon, Nicco
    Borra, Allan
    [J]. TENCON 2017 - 2017 IEEE REGION 10 CONFERENCE, 2017, : 471 - 475
  • [5] A Need Finding Study with Low-Resourced Language Content Creators
    Nigatu, Hellina Hailu
    Canny, John
    Chasins, Sarah
    [J]. PROCEEDINGS OF THE 4TH AFRICAN CONFERENCE FOR HUMAN COMPUTER INTERACTION, AFRICHI 2023, 2023, : 1 - 4
  • [6] A First LVCSR System for Luxembourgish, a Low-Resourced European Language
    Adda-Decker, Martine
    Lamel, Lori
    Adda, Gilles
    Lavergne, Thomas
    [J]. HUMAN LANGUAGE TECHNOLOGY CHALLENGES FOR COMPUTER SCIENCE AND LINGUISTICS, 2014, 8387 : 479 - 490
  • [7] Diabetes in low-resourced countries
    Ashwal, Eran
    Hadar, Eran
    Hod, Moshe
    [J]. BEST PRACTICE & RESEARCH CLINICAL OBSTETRICS & GYNAECOLOGY, 2015, 29 (01) : 91 - 101
  • [8] Common latent representation learning for low-resourced spoken language identification
    Chen, Chen
    Bu, Yulin
    Chen, Yong
    Chen, Deyun
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 83 (12) : 34515 - 34535
  • [9] AN INVESTIGATION INTO LANGUAGE MODEL DATA AUGMENTATION FOR LOW-RESOURCED STT AND KWS
    Huang, Guangpu
    da Silva, Thiago Fraga
    Lamel, Lori
    Gauvain, Jean-Luc
    Gorin, Arseniy
    Laurent, Antoine
    Lileikyte, Rasa
    Messouadi, Abdel
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 5790 - 5794
  • [10] Common latent representation learning for low-resourced spoken language identification
    Chen, Chen
    Bu, Yulin
    Chen, Yong
    Chen, Deyun
    [J]. Multimedia Tools and Applications, 2024, 83 (12) : 34515 - 34535