Research on Chinese Audio and Text Alignment Algorithm Based on AIC-FCM and Doc2Vec

被引:0
|
作者
Chen, Keliang [1 ]
Huang, Jianming [1 ]
Cui, Yansong [1 ]
Ren, Weizheng [1 ]
机构
[1] Beijing Univ Posts & Telecommun, Sch Elect Engn, 10 Xitucheng Rd, Beijing 100876, Peoples R China
基金
中国国家自然科学基金;
关键词
Audio and text alignment; fuzzy C-means clustering algorithm; akaike information criterion; Doc2vec; dual threshold endpoint detection; MENTION HYPERGRAPH; WORD2VEC; MODEL;
D O I
10.1145/3532852
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
"Audiobook" is a multimedia-based reading technology that has emerged in recent years. Realizing the alignment of e-book text and book audio is the most important part of its processing. This article describes an audio and text alignment algorithm using deep learning and neural network technology to improve the efficiency and quality of audiobook production. The algorithm first uses dual-threshold endpoint detection technology to segment long audio into short audio with sentence dimensions and recognizes it as short text. The threshold is calculated by AIC-FCM optimized based on simulated annealing genetic algorithm. Then the algorithm uses Doc2vec optimized by the threshold prediction method based on the average length of the short text to calculate the text similarity. Finally, proofread and output the text sequence and audio segment aligned in the time dimension to meet the needs of audiobook production. Experiments show that compared to traditional audio and text alignment algorithms, the proposed algorithm is closer to the ideal segmentation result in long audio segmentation, and the alignment effect is basically the same as Doc2vec and the time complexity is reduced by about 35%.
引用
收藏
页数:22
相关论文
共 41 条
  • [1] Chinese Text Keyword Extraction Based on Doc2vec And TextRank
    Wang, Wei
    Li, Xiangshun
    Yu, Sheng
    [J]. PROCEEDINGS OF THE 32ND 2020 CHINESE CONTROL AND DECISION CONFERENCE (CCDC 2020), 2020, : 369 - 373
  • [2] Chinese abstraction algorithm combining Doc2Vec and TextRank
    Mou, Jinjun
    Xiong, Zhibin
    [J]. BASIC & CLINICAL PHARMACOLOGY & TOXICOLOGY, 2019, 125 : 149 - 149
  • [3] Research on detection methods based on Doc2vec abnormal comments
    Chang, Wenbing
    Xu, Zhenzhong
    Zhou, Shenghan
    Cao, Wen
    [J]. FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2018, 86 : 656 - 662
  • [4] A Study of the Chinese spam Classification with Doc2vec and CNN
    Gong, Hechen
    You, Fucheng
    Wang, Shaomei
    [J]. 2019 INTERNATIONAL CONFERENCE ON ADVANCED ELECTRONIC MATERIALS, COMPUTERS AND MATERIALS ENGINEERING (AEMCME 2019), 2019, 563
  • [5] Sentiment Analysis on Chinese Hotel Reviews with Doc2Vec and Classifiers
    Shuai, Qianjun
    Huang, Yamei
    Jin, Libiao
    Pang, Long
    [J]. PROCEEDINGS OF 2018 IEEE 3RD ADVANCED INFORMATION TECHNOLOGY, ELECTRONIC AND AUTOMATION CONTROL CONFERENCE (IAEAC 2018), 2018, : 1171 - 1174
  • [6] Key word extraction for short text via word2vec, doc2vec, and textrank
    Li, Jun
    Huang, Guimin
    Fan, Chunli
    Sun, Zhenglin
    Zhu, Hongtao
    [J]. TURKISH JOURNAL OF ELECTRICAL ENGINEERING AND COMPUTER SCIENCES, 2019, 27 (03) : 1794 - 1805
  • [7] Semi-supervised Turkish Text Categorization with Word2Vec, Doc2Vec and FastText Algorithms
    Erdinc, Hakki Yagiz
    Guran, Aysun
    [J]. 2019 27TH SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2019,
  • [8] Compressed Firmware Classification Based on Extra Trees and Doc2Vec
    Qiu, Jing
    Geng, Xiaoxu
    Sun, Guanglu
    [J]. SCIENTIFIC PROGRAMMING, 2021, 2021
  • [9] Recommendation method for academic journal submission based on doc2vec and XGBoost
    Huang ZhengWei
    Min JinTao
    Yang YanNi
    Huang Jin
    Tian Ye
    [J]. Scientometrics, 2022, 127 : 2381 - 2394
  • [10] Recommendation method for academic journal submission based on doc2vec and XGBoost
    Huang Zhengwei
    Min Jintao
    Yang Yanni
    Huang Jin
    Tian Ye
    [J]. SCIENTOMETRICS, 2022, 127 (05) : 2381 - 2394