Kernel-Based Lip Shape Clustering with Phoneme Recognition for Real-Time Voice Driven Talking Face

被引:0
|
作者
Shih, Po-Yi [1 ]
Wang, Jhing-Fa [1 ]
Chen, Zong-You [1 ]
机构
[1] Natl Cheng Kung Univ, Dept Elect Engn, Tainan 70101, Taiwan
关键词
Real-time; Voice-driven; Kernel-based; Lip shape clustering; Phoneme recognition;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This work describes a real-time voice driven method using which a speaker's lip shape is synchronized with the corresponding speech signal, for a low bandwidth mobile devices. Phoneme recognition is generally regarded as an important task in the operation of a real-time lip-sync system. In this work, the use of the kernel-based lip shape clustering algorithm is inspired based on one-class support vector machines (SVM). A set of speaker who has similar lip shape is clustered and a cluster-dependent vowel phoneme is then constructed for each cluster. We use sum of absolute difference (SAD) as vowel lip shape likelihood to cluster into categories. Then adjust the source and destination pictures of lip shape in the transparent level using alpha blending for lip-sync animation. We find that this method outperforms conventional CHMM method in phoneme error rate (PER), 8.78% and 32.25%, respectively.
引用
收藏
页码:516 / 523
页数:8
相关论文
共 50 条
  • [1] Real-time lip-synch face animation driven by human voice
    Huang, FJ
    Chen, TH
    [J]. 1998 IEEE SECOND WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING, 1998, : 352 - 357
  • [2] Real-time face synthesis driven by voice
    Huang, Y
    Ding, XQ
    Gu, BN
    Shum, HY
    [J]. CAD/GRAPHICS '2001: PROCEEDINGS OF THE SEVENTH INTERNATIONAL CONFERENCE ON COMPUTER AIDED DESIGN AND COMPUTER GRAPHICS, VOLS 1 AND 2, 2001, : 393 - 398
  • [3] SVM-based phoneme classification and lip shape refinement in real-time lip-synch system
    Ko, Hanseok
    Han, David K.
    [J]. INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2006, 20 (07) : 1029 - 1051
  • [4] Achieving real-time lip synch via SVM-based phoneme classification and lip shape refinement
    Kim, T
    Kang, Y
    Ko, H
    [J]. FOURTH IEEE INTERNATIONAL CONFERENCE ON MULTIMODAL INTERFACES, PROCEEDINGS, 2002, : 299 - 304
  • [5] Computational Acceleration of Real-Time Kernel-Based Tracking System
    Pandey, Manoj
    Ubhi, J. S.
    Raju, Kota Solomon
    [J]. JOURNAL OF CIRCUITS SYSTEMS AND COMPUTERS, 2016, 25 (04)
  • [6] A fuzzy kernel-based method for real-time network intrusion detection
    Petrovskiy, M
    [J]. INNOVATIVE INTERNET COMMUNITY SYSTEMS, 2003, 2877 : 189 - 200
  • [7] Kernel-based online learning for real-time voltage control in distribution networks
    Cupelli, Lisette
    Esteban, Alejandro
    Ponci, Ferdinanda
    Monti, Antonello
    [J]. IET SMART GRID, 2020, 3 (05) : 638 - 645
  • [8] A Real-time Accompaniment System Based on Sung Voice Recognition
    Luo, Li
    Lu, Peng-Fei
    Wang, Zeng-Fu
    [J]. 19TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION, VOLS 1-6, 2008, : 531 - 534
  • [9] Web-Based Real-Time Gesture Recognition with Voice
    Pralhad, Ghadekar Premanand
    Abhishek, S.
    Kachare, Tejas
    Deshpande, Om
    Chounde, Rushikesh
    Tapadiya, Prachi
    [J]. INFORMATION, COMMUNICATION AND COMPUTING TECHNOLOGY (ICICCT 2021), 2021, 1417 : 119 - 131
  • [10] Web-Based Real-Time Gesture Recognition with Voice
    Pralhad, Ghadekar Premanand
    Abhishek, S.
    Kachare, Tejas
    Deshpande, Om
    Chounde, Rushikesh
    Tapadiya, Prachi
    [J]. Communications in Computer and Information Science, 2021, 1417 CCIS : 119 - 131