Using Latent Dirichlet Allocation to Incorporate Domain Knowledge For Topic Transition Detection

被引:0
|
作者
Zhu, Xiaodan [1 ]
He, Xuming [1 ]
Munteanu, Cosmin [1 ]
Penn, Gerald [1 ]
机构
[1] Univ Toronto, Dept Comp Sci, Toronto, ON, Canada
关键词
slides transition detection; boundary detection;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper studies automatic detection of topic transitions for recorded presentations. This can be achieved by matching slide content with presentation transcripts directly with some similarity metrics. Such literal matching, however, misses domain-specific knowledge and is sensitive to speech recognition errors. In this paper, we incorporate relevant written materials, e.g., textbooks for lectures, which convey semantic relationships, in particular domain-specific relationships, between words. To this end, we train latent Dirichlet allocation (LDA) models on these materials and measure the similarity between slides and transcripts in the acquired hidden-topic space. This similarity is then combined with literal matchings. Experiments show that the proposed approach reduces the errors in slide transition detection by 17-41% on manual transcripts and 27-37% on automatic transcripts.
引用
收藏
页码:2442 / 2445
页数:4
相关论文
共 50 条
  • [31] Mental Disorder Detection and Measurement using Latent Dirichlet Allocation and SentiWordNet
    Tai, Chih-Hua
    Tan, Zheng-Han
    Lin, Yung-Sheng
    Chang, Yue-Shan
    [J]. 2015 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN, AND CYBERNETICS (SMC 2015): BIG DATA ANALYTICS FOR HUMAN-CENTRIC SYSTEMS, 2015, : 1215 - 1220
  • [32] Language Model Adaptation Based on Topic Probability of Latent Dirichlet Allocation
    Jeon, Hyung-Bae
    Lee, Soo-Young
    [J]. ETRI JOURNAL, 2016, 38 (03) : 487 - 493
  • [33] Topic Modeling of Online Accommodation Reviews via Latent Dirichlet Allocation
    Sutherland, Ian
    Sim, Youngseok
    Lee, Seul Ki
    Byun, Jaemun
    Kiatkawsin, Kiattipoom
    [J]. SUSTAINABILITY, 2020, 12 (05) : 1 - 15
  • [34] iLDA: An interactive latent Dirichlet allocation model to improve topic quality
    Liu, Yezheng
    Du, Fei
    Sun, Jianshan
    Jiang, Yuanchun
    [J]. JOURNAL OF INFORMATION SCIENCE, 2020, 46 (01) : 23 - 40
  • [35] Detection of Reference Topics and Suggestions using Latent Dirichlet Allocation (LDA)
    Basuki, Setio
    Azhar, Yufis
    Minarno, Agus Eko
    Aditya, Christian Sri Kusuma
    Sumadi, Fauzi Dwi Setiawan
    Ramadhan, Ardiansah Ilham
    [J]. PROCEEDINGS OF 2019 12TH INTERNATIONAL CONFERENCE ON INFORMATION & COMMUNICATION TECHNOLOGY AND SYSTEM (ICTS), 2019, : 16 - 20
  • [36] Product Aspect Detection in Customer Complaints by Using Latent Dirichlet Allocation
    Atici, Birkan
    Omurca, Sevinc Ilhan
    Ekinci, Ekin
    [J]. 2017 INTERNATIONAL CONFERENCE ON COMPUTER SCIENCE AND ENGINEERING (UBMK), 2017, : 250 - 254
  • [37] Latent Dirichlet allocation (LDA) for topic modeling of the CFPB consumer complaints
    Bastani, Kaveh
    Namavari, Hamed
    Shaffer, Jeffrey
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2019, 127 : 256 - 271
  • [38] Latent Dirichlet allocation (LDA) and topic modeling: models, applications, a survey
    Jelodar, Hamed
    Wang, Yongli
    Yuan, Chi
    Feng, Xia
    Jiang, Xiahui
    Li, Yanchao
    Zhao, Liang
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2019, 78 (11) : 15169 - 15211
  • [39] Latent Dirichlet allocation (LDA) and topic modeling: models, applications, a survey
    Hamed Jelodar
    Yongli Wang
    Chi Yuan
    Xia Feng
    Xiahui Jiang
    Yanchao Li
    Liang Zhao
    [J]. Multimedia Tools and Applications, 2019, 78 : 15169 - 15211
  • [40] Analysis of the impact of investor sentiment on stock price using the latent dirichlet allocation topic model
    Chen, Meilan
    Guo, Zhiying
    Abbass, Kashif
    Huang, Wenfeng
    [J]. FRONTIERS IN ENVIRONMENTAL SCIENCE, 2022, 10