Determining Features of News Headline in Malay News Document

Cited by: 2
Authors
Noah, Shahrul Azman Mohd [1]
Ali, Nazlena Mohamad [1]
Hasan, Mohd Sabri [1]
Affiliation
[1] Univ Kebangsaan Malaysia, Fak Teknol & Sains Maklumat, Bangi, Malaysia
Keywords
headline; natural language processing; Malay news; text summarization; Malay corpus
DOI
10.17576/gema-2018-1802-11
Chinese Library Classification number
H [Language, Writing]
Discipline classification code
05
Abstract
Headline summarization is an automated text summarization technique that can reduce information overload in retrieval systems and ease the user's cognitive burden when searching for and selecting relevant documents among large collections. This study discusses the process of determining the features of the Malay language system in news-genre documents. The methodology starts with an analysis of a corpus of Malay news documents. The corpus contains 140 core news items selected from two mainstream news databases in Malaysia, Berita Harian and Utusan Malaysia. The selection criteria were: core news categories; a length of 50 to 250 words; publication years from 2007 to 2012; and the economy, crime, education and sports genres. Three Malay linguistic experts manually produced a headline summary for each news document. The experts had to comply with three conditions: extractive summarization, the select-words-in-order word selection technique, and morphological changes to words. The experimental results identify three characteristics: first, the first two sentences are the important sentences; second, the sentence that contains a potential acronym definition is chosen as the most important sentence; and third, the ideal size of a headline summary is six words. Taking these features into account allows a headline summary to be generated automatically, much as a human would produce it.
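The three characteristics reported in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. The sentence splitter, the acronym pattern (`ACRONYM_DEF`), the small stopword list (`STOPWORDS`) and the function names are all hypothetical choices made for illustration. The sketch also assumes that the acronym-bearing sentence is looked for among the first two sentences, which the abstract does not state.

```python
import re

# Assumption: a "potential acronym definition" looks like "(UKM)",
# i.e. two or more capital letters inside parentheses.
ACRONYM_DEF = re.compile(r"\(([A-Z]{2,})\)")

# Assumption: a tiny illustrative list of Malay function words to skip.
STOPWORDS = {"yang", "dan", "di", "ke", "dari", "untuk", "pada", "itu", "ini", "dengan"}


def split_sentences(text):
    """Naive splitter: break after '.', '!' or '?' followed by whitespace."""
    parts = re.split(r"(?<=[.!?])\s+", text.strip())
    return [p.strip() for p in parts if p.strip()]


def pick_sentence(sentences):
    """Features 1 and 2: look only at the first two sentences and prefer
    the one that contains a potential acronym definition."""
    lead = sentences[:2]
    for sentence in lead:
        if ACRONYM_DEF.search(sentence):
            return sentence
    return lead[0] if lead else ""


def make_headline(text, max_words=6):
    """Feature 3: keep at most six words, selected in their original order."""
    sentence = pick_sentence(split_sentences(text))
    # Keep hyphenated forms such as "kanak-kanak" as a single token.
    tokens = re.findall(r"\w+(?:-\w+)*", sentence)
    kept = [t for t in tokens if t.lower() not in STOPWORDS]
    return " ".join(kept[:max_words])


if __name__ == "__main__":
    news = (
        "Universiti Kebangsaan Malaysia (UKM) menerima geran penyelidikan "
        "baharu hari ini. Geran itu bernilai RM5 juta. "
        "Ia akan digunakan selama tiga tahun."
    )
    print(make_headline(news))
```

The sketch only truncates after removing function words. The morphological changes to words that the experts were allowed to make are not modelled here.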
Pages: 154-167
Number of pages: 14