Keyword extraction from emails

Cited by: 17
Authors
Lahiri, S. [1]
Mihalcea, R. [1]
Lai, P.-H. [2]
Affiliations
[1] Univ Michigan, Ann Arbor, MI 48109 USA
[2] Samsung Res Amer, Richardson, TX 75082 USA
Funding
National Science Foundation (USA)
Keywords
Electronic mail
DOI
10.1017/S1351324916000231
Chinese Library Classification
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
Emails constitute an important genre of online communication. Many of us are often faced with the daunting task of sifting through increasingly large amounts of emails on a daily basis. Keywords extracted from emails can help us combat such information overload by allowing a systematic exploration of the topics contained in emails. Existing literature on keyword extraction has not covered the email genre, and no human-annotated gold standard datasets are currently available. In this paper, we introduce a new dataset for keyword extraction from emails, and evaluate supervised and unsupervised methods for keyword extraction from emails. The results obtained with our supervised keyword extraction system (38.99% F-score) improve over the results obtained with the best performing systems participating in the SemEval 2010 keyword extraction task.
Pages: 295-317
Page count: 23
Related papers
50 in total
  • [1] Building a Dataset for Summarization and Keyword Extraction from Emails
    Loza, Vanessa
    Lahiri, Shibamouli
    Mihalcea, Rada
    Lai, Po-Hsiang
    [J]. LREC 2014 - NINTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2014, : 2441 - 2446
  • [2] Structured Data Extraction from Emails
    Mahlawi, Ashraf Q.
    Sasi, Sreela
    [J]. 2017 INTERNATIONAL CONFERENCE ON NETWORKS & ADVANCES IN COMPUTATIONAL TECHNOLOGIES (NETACT), 2017, : 323 - 328
  • [3] Keyword Extraction from Bengali News
    Showrov, Md Imran Hossain
    Sobhan, Masrur
    [J]. 2019 5TH INTERNATIONAL CONFERENCE ON ADVANCES IN ELECTRICAL ENGINEERING (ICAEE), 2019, : 658 - 662
  • [4] Keyword extraction from abstracts and titles
    Bhowmik, Rekha
    [J]. PROCEEDINGS IEEE SOUTHEASTCON 2008, VOLS 1 AND 2, 2008, : 610 - 617
  • [5] Keyword extraction from Arabic legal texts
    Rammal, Mahmoud
    Bahsoun, Zeinab
    Jabbour, Mona Al Achkar
    [J]. INTERACTIVE TECHNOLOGY AND SMART EDUCATION, 2015, 12 (01) : 62 - 71
  • [6] Contrastive Keyword Extraction from Versioned Documents
    Eder, Lukas
    Campos, Ricardo
    Jatowt, Adam
    [J]. PROCEEDINGS OF THE 32ND ACM INTERNATIONAL CONFERENCE ON INFORMATION AND KNOWLEDGE MANAGEMENT, CIKM 2023, 2023, : 5026 - 5030
  • [7] Feature Extraction and Classification of Spam Emails
    Hassan, Muhammad Ali
    Mtetwa, Nhamo
    [J]. 2018 5TH INTERNATIONAL CONFERENCE ON SOFT COMPUTING & MACHINE INTELLIGENCE (ISCMI), 2018, : 93 - 98
  • [8] Large-Scale Information Extraction from Emails with Data Constraints
    Gupta, Rajeev
    Kondapally, Ranganath
    Guha, Siddharth
    [J]. BIG DATA ANALYTICS (BDA 2019), 2019, 11932 : 124 - 139
  • [9] Unsupervised Keyword Extraction from Polish Legal Texts
    Jungiewicz, Michal
    Lopuszynski, Michal
    [J]. Advances in Natural Language Processing, 2014, 8686 : 65 - 70
  • [10] Keyword extraction from social media via AHP
    Ramay, Waheed Yousuf
    Xu Cheng-Yin
    Illahi, Inam
    [J]. HUMAN SYSTEMS MANAGEMENT, 2018, 37 (04) : 459 - 464