Entity and Event Topic Extraction from Podcast Episode Title and Description Using Entity Linking

被引:0
|
作者
Siagian, Christian [1 ]
Shabbeer, Amina [2 ]
机构
[1] Amazon, Los Angeles, CA 90064 USA
[2] Amazon, San Francisco, CA USA
关键词
Natural Language Understanding; topic extraction; entity linking;
D O I
10.1145/3543873.3587648
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
To improve Amazon Music podcast services and customer engagements, we introduce Entity-Linked Topic Extraction (ELTE) to identify well-known entity and event topics from podcast episodes. An entity can be a person, organization, work-of-art, etc., while an event, such as the Opioid epidemic, occurs at specific point(s) in time. ELTE first extracts key-phrases from episode title and description metadata. It then uses entity linking to canonicalize them against Wikipedia knowledge base (KB), ensuring that the topics exist in the real world. ELTE also models NIL-predictions for entity or event topics that are not in the KB, as well as topics that are not of entity or event type. To test the model, we construct a podcast topic database of 1166 episodes from various categories. Each episode comes with a Wiki-link annotated main topic or NIL-prediction. ELTE produces the best overall Exact Match EM score of .84, with by-far the best EM of .89 among the entity or event type episodes, as well as NIL-predictions for episodes without entity or event main topic (EM score of .86).
引用
收藏
页码:768 / 772
页数:5
相关论文
共 50 条
  • [21] Entity Linking in 40 Languages Using MAG
    Moussallem, Diego
    Usbeck, Ricardo
    Roeder, Michael
    Ngomo, Axel-Cyrille Ngonga
    SEMANTIC WEB: ESWC 2018 SATELLITE EVENTS, 2018, 11155 : 176 - 181
  • [22] Chinese Social Media Entity Linking Based on Effective Context with Topic Semantics
    Ma, Chengfang
    Sha, Ying
    Tan, Jianlong
    Guo, Li
    Peng, Huailiang
    2019 IEEE 43RD ANNUAL COMPUTER SOFTWARE AND APPLICATIONS CONFERENCE (COMPSAC), VOL 1, 2019, : 386 - 395
  • [23] Entity network prediction using multitype topic models
    Shiozaki, Hitohiro
    Eguchi, Koji
    Ohkawa, Takenao
    ADVANCES IN KNOWLEDGE DISCOVERY AND DATA MINING, PROCEEDINGS, 2008, 5012 : 705 - +
  • [24] Entity Network Prediction Using Multitype Topic Models
    Shiozaki, Hitohiro
    Eguchi, Koji
    Ohkawa, Takenao
    IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, 2008, E91D (11) : 2589 - 2598
  • [25] Entity Linking from Microblogs to Knowledge Base Using ListNet Algorithm
    Wang, Yan
    Luo, Cheng
    Li, Xin
    Liu, Yiqun
    Zhang, Min
    Ma, Shaoping
    NATURAL LANGUAGE PROCESSING AND CHINESE COMPUTING, NLPCC 2013, 2013, 400 : 277 - 287
  • [26] Statistical Entity Extraction From the Web
    Nie, Zaiqing
    Wen, Ji-Rong
    Ma, Wei-Ying
    PROCEEDINGS OF THE IEEE, 2012, 100 (09) : 2675 - 2687
  • [27] Entity Extraction from the Web with WebKnox
    Urbansky, David
    Feldmann, Marius
    Thom, James A.
    Schill, Alexander
    ADVANCES IN INTELLIGENT WEB MASTERING-2, PROCEEDINGS, 2010, 67 : 209 - +
  • [28] Named-Entity Techniques for Terrorism Event Extraction and Classification
    Inyaem, Uraiwan
    Meesad, Phayung
    Haruechaiyasak, Choochart
    2009 EIGHTH INTERNATIONAL SYMPOSIUM ON NATURAL LANGUAGE PROCESSING, PROCEEDINGS, 2009, : 175 - +
  • [29] Using Local Grammar for Entity Extraction from Clinical Reports
    Ghoulam, Aicha
    Barigou, Fatiha
    Belalem, Ghalem
    Meziane, Farid
    INTERNATIONAL JOURNAL OF INTERACTIVE MULTIMEDIA AND ARTIFICIAL INTELLIGENCE, 2015, 3 (03): : 16 - 24
  • [30] Semantic entity detection using description graphs
    Giró, X
    Marqués, F
    DIGITAL MEDIA: PROCESSING MULTIMEDIA INTERACTIVE SERVICES, 2003, : 39 - 42