Entity and Event Topic Extraction from Podcast Episode Title and Description Using Entity Linking

被引:0
|
作者
Siagian, Christian [1 ]
Shabbeer, Amina [2 ]
机构
[1] Amazon, Los Angeles, CA 90064 USA
[2] Amazon, San Francisco, CA USA
关键词
Natural Language Understanding; topic extraction; entity linking;
D O I
10.1145/3543873.3587648
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
To improve Amazon Music podcast services and customer engagements, we introduce Entity-Linked Topic Extraction (ELTE) to identify well-known entity and event topics from podcast episodes. An entity can be a person, organization, work-of-art, etc., while an event, such as the Opioid epidemic, occurs at specific point(s) in time. ELTE first extracts key-phrases from episode title and description metadata. It then uses entity linking to canonicalize them against Wikipedia knowledge base (KB), ensuring that the topics exist in the real world. ELTE also models NIL-predictions for entity or event topics that are not in the KB, as well as topics that are not of entity or event type. To test the model, we construct a podcast topic database of 1166 episodes from various categories. Each episode comes with a Wiki-link annotated main topic or NIL-prediction. ELTE produces the best overall Exact Match EM score of .84, with by-far the best EM of .89 among the entity or event type episodes, as well as NIL-predictions for episodes without entity or event main topic (EM score of .86).
引用
收藏
页码:768 / 772
页数:5
相关论文
共 50 条
  • [41] Improving Entity Linking Performance using Frame Semantics
    Nural, Mustafa V.
    Miller, John A.
    Arpinar, I. Budak
    2013 IEEE SEVENTH INTERNATIONAL CONFERENCE ON SEMANTIC COMPUTING (ICSC 2013), 2013, : 56 - 63
  • [42] Using graph distances for named-entity linking
    Blanco, Roi
    Boldi, Paolo
    Marino, Andrea
    SCIENCE OF COMPUTER PROGRAMMING, 2016, 130 : 24 - 36
  • [43] Entity Sentiment Extraction Using Text Ranking
    O'Neil, John
    SIGIR 2012: PROCEEDINGS OF THE 35TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, 2012, : 1024 - 1024
  • [44] Entity-Centric Topic Extraction and Exploration: A Network-Based Approach
    Spitz, Andreas
    Gertz, Michael
    ADVANCES IN INFORMATION RETRIEVAL (ECIR 2018), 2018, 10772 : 3 - 15
  • [45] litewi: A combined term extraction and entity linking method for eliciting educational ontologies from textbooks
    Conde, Angel
    Larranaga, Mikel
    Arruarte, Ana
    Elorriaga, Jon A.
    Roth, Dan
    JOURNAL OF THE ASSOCIATION FOR INFORMATION SCIENCE AND TECHNOLOGY, 2016, 67 (02) : 380 - 399
  • [46] Building a Multimodal Entity Linking Dataset From Tweets
    Adjali, Omar
    Besancon, Romaric
    Ferret, Olivier
    Le Borgne, Herve
    Grau, Brigitte
    PROCEEDINGS OF THE 12TH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION (LREC 2020), 2020, : 4285 - 4292
  • [47] LAAP: Learning the Argument of An Entity with Event Prompts for document-level event extraction
    Xu, Jinghan
    Yang, Cheng
    Kang, Xiaojun
    NEUROCOMPUTING, 2025, 613
  • [48] Entity Extraction from Portuguese Legal Documents Using Distant Supervision
    Navarezi, Lucas M.
    Sakiyama, Kenzo
    Rodrigues, Lucas S.
    Robaldo, Caio M. O.
    Lobato, Gustavo R.
    Vilela, Paulo A.
    Matsubara, Edson T.
    Fernandes, Eraldo R.
    COMPUTATIONAL PROCESSING OF THE PORTUGUESE LANGUAGE, PROPOR 2022, 2022, 13208 : 166 - 176
  • [49] Visual Description Augmented Integration Network for Multimodal Entity and Relation Extraction
    Zuo, Min
    Wang, Yingjun
    Dong, Wei
    Zhang, Qingchuan
    Cai, Yuanyuan
    Kong, Jianlei
    APPLIED SCIENCES-BASEL, 2023, 13 (10):
  • [50] Syntax grounded graph convolutional network for joint entity and event extraction
    Zhang, Junchi
    He, Qi
    Zhang, Yue
    NEUROCOMPUTING, 2021, 422 : 118 - 128