Ranking Sentences for Keyphrase Extraction: A Relational Data Mining Approach

被引:5
|
作者
Ceci, Michelangelo [1 ]
Loglisci, Corrado [1 ]
Macchia, Lucrezia [1 ]
机构
[1] Univ Bari Aldo Moro, Dipartimento Informat, Bari, Italy
关键词
Document summarization; Ranking; Relational data mining; EMERGING PATTERNS;
D O I
10.1016/j.procs.2014.10.011
中图分类号
G25 [图书馆学、图书馆事业]; G35 [情报学、情报工作];
学科分类号
1205 ; 120501 ;
摘要
Document summarization involves reducing a text document into a short set of phrases or sentences that convey the main meaning of the text. In digital libraries, summaries can be used as concise descriptions which the user can read for a rapid comprehension of the retrieved documents. Most of the existing approaches rely on the classification algorithms which tend to generate "crisp" summaries, where the phrases are considered equally relevant and no information on their degree of importance or factor of significance is provided. Motivated by this, we present a probabilistic relational data mining method to model preference relations on sentences of document images. Preference relations are then used to rank the sentences which will form the final summary. We empirically evaluate the method on real document images. (C) 2014 The Authors. Published by Elsevier B.V.
引用
收藏
页码:52 / 59
页数:8
相关论文
共 50 条
  • [31] Data mining for fuzzy relational data servers
    Velmisov, A. P.
    Stetsko, A. A.
    Yarushkina, N. G.
    [J]. THEORETICAL ADVANCES AND APPLICATIONS OF FUZZY LOGIC AND SOFT COMPUTING, 2007, 42 : 220 - +
  • [32] KP-Rank: a semantic-based unsupervised approach for keyphrase extraction from text data
    Aman, Muhammad
    Abdulkadir, Said Jadid
    Aziz, Izzatdin Abdul
    Alhussian, Hitham
    Ullah, Israr
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (08) : 12469 - 12506
  • [33] KP-Rank: a semantic-based unsupervised approach for keyphrase extraction from text data
    Muhammad Aman
    Said Jadid Abdulkadir
    Izzatdin Abdul Aziz
    Hitham Alhussian
    Israr Ullah
    [J]. Multimedia Tools and Applications, 2021, 80 : 12469 - 12506
  • [34] RAKE-PMI AUTOMATED KEYPHRASE EXTRACTION An unsupervised approach for automated extraction of keyphrases
    Gupta, Somya
    Mittal, Namita
    Kumar, Alok
    [J]. PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON INFORMATICS AND ANALYTICS (ICIA' 16), 2016,
  • [35] A data-mining approach to preference-based data ranking founded on contextual information
    Miele, Antonio
    Quintarelli, Elisa
    Rabosio, Emanuele
    Tanca, Letizia
    [J]. INFORMATION SYSTEMS, 2013, 38 (04) : 524 - 544
  • [36] RAKE-PMI Automated keyphrase extraction: An unsupervised approach for automated extraction of keyphrases
    [J]. 2016, Association for Computing Machinery, 2 Penn Plaza, Suite 701, New York, NY 10121-0701, United States (25-26-August-2016):
  • [37] Relational peculiarity oriented data mining
    Zhong, N
    Liu, CN
    Yao, YY
    Ohshima, M
    Huang, MX
    Huang, JJ
    [J]. FOURTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2004, : 575 - 578
  • [38] Mining semiconductor manufacturing data for productivity improvement - an integrated relational database approach
    Dabbas, RM
    Chen, HN
    [J]. COMPUTERS IN INDUSTRY, 2001, 45 (01) : 29 - 44
  • [39] A SYSTEMATIC-APPROACH TO LEIBNIZ THEORY OF RELATIONS AND RELATIONAL SENTENCES
    MUGNAI, M
    [J]. TOPOI-AN INTERNATIONAL REVIEW OF PHILOSOPHY, 1990, 9 (01): : 61 - 81
  • [40] An intelligent approach to data extraction and task identification for process mining
    Li, Jiexun
    Wang, Harry Jiannan
    Bai, Xue
    [J]. INFORMATION SYSTEMS FRONTIERS, 2015, 17 (06) : 1195 - 1208