Query-based Summarization for Indonesian News Articles

被引:0
|
作者
Annisa, Dininta [1 ]
Khodra, Masayu Leylia [1 ]
机构
[1] Inst Teknol Bandung, Sch Elect Engn & Informat, Bandung, Indonesia
关键词
summarization; query; Indonesian; ROUGE-2; news articles; MMR;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Query-based summarization not only finds important information, but also relevant information to query given by users. This research aims to determine the best technique and configurations of query-based summarizer for Indonesian news articles. We develop two techniques adapted from the first rank summarizers on Document Understanding Conferences which are using Wordnet-based and word cluster-based sentence scoring. We also compared three sentence selection methods i.e.: Maximal Marginal Relevance (MMR), Integer Linear Programming (ILP), and dynamic programming. Based on evaluation using ROUGE-2, the first technique which is adapted from NUS summarizer has the best recall of 0.1232, while the second one (which is adapted from IIIT Hyderabad summarizer) 0.1555. Both techniques give the best result when using MMR as sentence selection method.
引用
收藏
页数:6
相关论文
共 50 条
  • [31] QMOS: Query-based multi-documents opinion-oriented summarization
    Abdi, Asad
    Shamsuddin, Siti Mariyam
    Aliguliyev, Ramiz M.
    [J]. INFORMATION PROCESSING & MANAGEMENT, 2018, 54 (02) : 318 - 338
  • [32] QMSum: A New Benchmark for Query-based Multi-domain Meeting Summarization
    Zhong, Ming
    Yin, Da
    Yu, Tao
    Zaidi, Ahmad
    Mutuma, Mutethia
    Jha, Rahul
    Awadallah, Ahmed Hassan
    Celikyilmaz, Asli
    Liu, Yang
    Qiu, Xipeng
    Radev, Dragomir
    [J]. 2021 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL-HLT 2021), 2021, : 5905 - 5921
  • [33] CoMSum and SIBERT: A Dataset and Neural Model for Query-Based Multi-document Summarization
    Kulkarni, Sayali
    Chammas, Sheide
    Zhu, Wan
    Sha, Fei
    Ie, Eugene
    [J]. DOCUMENT ANALYSIS AND RECOGNITION - ICDAR 2021, PT II, 2021, 12822 : 84 - 98
  • [34] Query-Based Automatic Multi-document Summarization Extraction Method for Web Pages
    He, Qi
    Hao, Hong-Wei
    Yin, Xu-Cheng
    [J]. PROCEEDINGS OF THE 2011 2ND INTERNATIONAL CONGRESS ON COMPUTER APPLICATIONS AND COMPUTATIONAL SCIENCE, VOL 1, 2012, 144 : 107 - 112
  • [35] A Summarization Dataset of Slovak News Articles
    Suppa, Marek
    Adamec, Jergus
    [J]. PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), 2020, : 6725 - 6730
  • [36] Temporal Summarization for Online News Articles
    Kumar, Kishore M.
    Kumar, Sanampudi Suresh
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON POWER, CONTROL, SIGNALS AND INSTRUMENTATION ENGINEERING (ICPCSI), 2017, : 1376 - 1379
  • [37] Automatic Text Summarization of News Articles
    Sethi, Prakhar
    Sonawane, Sameer
    Khanwalker, Saumitra
    Keskar, R. B.
    [J]. 2017 INTERNATIONAL CONFERENCE ON BIG DATA, IOT AND DATA SCIENCE (BID), 2017, : 23 - 29
  • [38] Ontology-based text summarization for business news articles
    Wu, CW
    Liu, CL
    [J]. COMPUTERS AND THEIR APPLICATIONS, 2003, : 389 - 392
  • [39] MLASK: Multimodal Summarization of Video-based News Articles
    Krubinski, Mateusz
    Pecina, Pavel
    [J]. 17TH CONFERENCE OF THE EUROPEAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, EACL 2023, 2023, : 910 - 924
  • [40] Query-based multi-documents summarization using linguistic knowledge and content word expansion
    Abdi, Asad
    Idris, Norisma
    Alguliyev, Rasim M.
    Aliguliyev, Ramiz M.
    [J]. SOFT COMPUTING, 2017, 21 (07) : 1785 - 1801