Coherent arrangement of sentences extracted from multiple newspaper articles

Cited by: 0
Authors
Okazaki, N
Matsuo, Y
Ishizuka, M
Affiliations
[1] Univ Tokyo, Grad Sch Informat Sci & Technol, Bunkyo Ku, Tokyo 1138656, Japan
[2] AIST, Cyber Assist Res Ctr, Koto Ku, Tokyo 1350064, Japan
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Multi-document summarization addresses the information overload problem by providing a condensed text for a number of documents. Most multi-document summarization systems use extraction techniques (e.g., important sentence extraction) and compile a summary from the selected information. However, sentences gathered from multiple sources are not organized as a comprehensible text. It is therefore important to consider the ordering of extracted sentences in order to reconstruct discourse structure in a summary. We propose a novel method to plan a coherent arrangement of sentences extracted from multiple newspaper articles. The results of our experiment show that sentence reordering has a discernible effect on summary readability. The results also show a significant improvement in sentence arrangement compared to former methods.
Pages: 882-891
Page count: 10