Ranking Sentences for Keyphrase Extraction: A Relational Data Mining Approach

被引:5
|
作者
Ceci, Michelangelo [1 ]
Loglisci, Corrado [1 ]
Macchia, Lucrezia [1 ]
机构
[1] Univ Bari Aldo Moro, Dipartimento Informat, Bari, Italy
关键词
Document summarization; Ranking; Relational data mining; EMERGING PATTERNS;
D O I
10.1016/j.procs.2014.10.011
中图分类号
G25 [图书馆学、图书馆事业]; G35 [情报学、情报工作];
学科分类号
1205 ; 120501 ;
摘要
Document summarization involves reducing a text document into a short set of phrases or sentences that convey the main meaning of the text. In digital libraries, summaries can be used as concise descriptions which the user can read for a rapid comprehension of the retrieved documents. Most of the existing approaches rely on the classification algorithms which tend to generate "crisp" summaries, where the phrases are considered equally relevant and no information on their degree of importance or factor of significance is provided. Motivated by this, we present a probabilistic relational data mining method to model preference relations on sentences of document images. Preference relations are then used to rank the sentences which will form the final summary. We empirically evaluate the method on real document images. (C) 2014 The Authors. Published by Elsevier B.V.
引用
收藏
页码:52 / 59
页数:8
相关论文
共 50 条
  • [1] A Ranking Approach to Keyphrase Extraction
    Jiang, Xin
    Hu, Yunhua
    Li, Hang
    [J]. PROCEEDINGS 32ND ANNUAL INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2009, : 756 - 757
  • [2] Keyphrase Extraction with Sequential Pattern Mining
    Wang, Qingren
    Sheng, Victor S.
    Wu, Xindong
    [J]. THIRTY-FIRST AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, 2017, : 5003 - 5004
  • [3] A Wordification Approach to Relational Data Mining
    Perovsek, Matic
    Vavpetic, Anze
    Cestnik, Bojan
    Lavra, Nada
    [J]. DISCOVERY SCIENCE, 2013, 8140 : 141 - 154
  • [4] ISKE: An unsupervised automatic keyphrase extraction approach using the iterated sentences based on graph method
    Chi, Ling
    Hu, Liang
    [J]. KNOWLEDGE-BASED SYSTEMS, 2021, 223
  • [5] Turkish Keyphrase Extraction Using Multi-Criterion Ranking
    Ozdemir, Bahadir
    Cicekli, Ilyas
    [J]. 2009 24TH INTERNATIONAL SYMPOSIUM ON COMPUTER AND INFORMATION SCIENCES, 2009, : 268 - 272
  • [6] A Relational Approach to Sensor Network Data Mining
    Esposito, Floriana
    Basile, Teresa M. A.
    Di Mauro, Nicola
    Ferilli, Stefano
    [J]. INFORMATION RETRIEVAL AND MINING IN DISTRIBUTED ENVIRONMENTS, 2010, 324 : 163 - 181
  • [7] Efficient sequential pattern mining with wildcards for keyphrase extraction
    Xie, Fei
    Wu, Xindong
    Zhu, Xingquan
    [J]. KNOWLEDGE-BASED SYSTEMS, 2017, 115 : 27 - 39
  • [8] An Improved Approach to Bengali Keyphrase Extraction
    Sarkar, Kamal
    [J]. 2014 FOURTH INTERNATIONAL CONFERENCE OF EMERGING APPLICATIONS OF INFORMATION TECHNOLOGY (EAIT), 2014, : 283 - 288
  • [9] Keyphrase Extraction Using Sequential Pattern Mining and Entropy
    Wang, Qingren
    Sheng, Victor S.
    Hu, Chenyi
    [J]. 2017 IEEE INTERNATIONAL CONFERENCE ON BIG KNOWLEDGE (IEEE ICBK 2017), 2017, : 88 - 95
  • [10] Application of Grey Relational Clustering and Data Mining In Information Extraction
    Qu Zhiming
    Wang Xiaoli
    [J]. ISBIM: 2008 INTERNATIONAL SEMINAR ON BUSINESS AND INFORMATION MANAGEMENT, VOL 2, 2009, : 3 - +