Entity and Event Topic Extraction from Podcast Episode Title and Description Using Entity Linking

被引:0
|
作者
Siagian, Christian [1 ]
Shabbeer, Amina [2 ]
机构
[1] Amazon, Los Angeles, CA 90064 USA
[2] Amazon, San Francisco, CA USA
关键词
Natural Language Understanding; topic extraction; entity linking;
D O I
10.1145/3543873.3587648
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
To improve Amazon Music podcast services and customer engagements, we introduce Entity-Linked Topic Extraction (ELTE) to identify well-known entity and event topics from podcast episodes. An entity can be a person, organization, work-of-art, etc., while an event, such as the Opioid epidemic, occurs at specific point(s) in time. ELTE first extracts key-phrases from episode title and description metadata. It then uses entity linking to canonicalize them against Wikipedia knowledge base (KB), ensuring that the topics exist in the real world. ELTE also models NIL-predictions for entity or event topics that are not in the KB, as well as topics that are not of entity or event type. To test the model, we construct a podcast topic database of 1166 episodes from various categories. Each episode comes with a Wiki-link annotated main topic or NIL-prediction. ELTE produces the best overall Exact Match EM score of .84, with by-far the best EM of .89 among the entity or event type episodes, as well as NIL-predictions for episodes without entity or event main topic (EM score of .86).
引用
收藏
页码:768 / 772
页数:5
相关论文
共 50 条
  • [31] Title Named Entity Recognition using Wikipedia and Abbreviation Generation
    Park, Youngmin
    Kang, Sangwoo
    Seo, Jungyun
    2014 INTERNATIONAL CONFERENCE ON BIG DATA AND SMART COMPUTING (BIGCOMP), 2014, : 169 - 172
  • [32] Non-Entity Event Argument Extraction on Structural Representation
    Liu, Yiting
    Li, Peifeng
    2017 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP), 2017, : 306 - 309
  • [33] Joint Entity and Event Extraction with Generative Adversarial Imitation Learning
    Zhang, Tongtao
    Ji, Heng
    Sil, Avirup
    DATA INTELLIGENCE, 2019, 1 (02) : 99 - 120
  • [34] Entity and attribute extraction of terrorism event based on text corpus
    Cao W.-B.
    Wu Z.-F.
    Yang T.
    Fan Y.-R.
    Cao, Wen-Bin (490838330@qq.com), 1600, Science Press (42): : 500 - 508
  • [35] Globally normalized neural model for joint entity and event extraction
    Zhang, Junchi
    Huang, Wenzhi
    Ji, Donghong
    Ren, Yafeng
    INFORMATION PROCESSING & MANAGEMENT, 2021, 58 (05)
  • [36] Entity Ranking from Annotated Text Collections Using Multitype Topic Models
    Shiozaki, Hitohiro
    Eguchi, Koji
    FOCUSED ACCESS TO XML DOCUMENTS, 2008, 4862 : 279 - +
  • [37] Joint Entity and Event Extraction with Generative Adversarial Imitation Learning
    Tongtao Zhang
    Heng Ji
    Avirup Sil
    Data Intelligence, 2019, (02) : 99 - 120
  • [38] Semantic Annotation of Web of Things Using Entity Linking
    Nadim, Ismail
    El Ghayam, Yassine
    Sadiq, Abdelalim
    INTERNATIONAL JOURNAL OF BUSINESS ANALYTICS, 2020, 7 (04) : 1 - 13
  • [39] Graph based Tweet Entity Linking using DBpedia
    Kalloubi, Fahd
    Nfaoui, El Habib
    El Beqqali, Omar
    2014 IEEE/ACS 11TH INTERNATIONAL CONFERENCE ON COMPUTER SYSTEMS AND APPLICATIONS (AICCSA), 2014, : 501 - 506
  • [40] Improving Entity Linking using Surface Form Refinement
    Charton, Eric
    Meurs, Marie-Jean
    Jean-Louis, Ludovic
    Gagnon, Michel
    LREC 2014 - NINTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2014, : 4609 - 4615