Kernel-Based Lip Shape Clustering with Phoneme Recognition for Real-Time Voice Driven Talking Face

Cited by: 0
Authors
Shih, Po-Yi [1]
Wang, Jhing-Fa [1]
Chen, Zong-You [1]
Affiliation
[1] Natl Cheng Kung Univ, Dept Elect Engn, Tainan 70101, Taiwan
Keywords
Real-time; Voice-driven; Kernel-based; Lip shape clustering; Phoneme recognition;
DOI
Not available
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
This work describes a real-time voice-driven method by which a speaker's lip shape is synchronized with the corresponding speech signal, intended for low-bandwidth mobile devices. Phoneme recognition is generally regarded as an important task in the operation of a real-time lip-sync system. The kernel-based lip shape clustering algorithm used in this work is inspired by one-class support vector machines (SVM). Speakers who have similar lip shapes are clustered, and a cluster-dependent vowel phoneme is then constructed for each cluster. We use the sum of absolute differences (SAD) as the vowel lip shape likelihood for clustering into categories. The transparency levels of the source and destination lip shape pictures are then adjusted using alpha blending to produce the lip-sync animation. We find that this method outperforms the conventional CHMM method in phoneme error rate (PER): 8.78% versus 32.25%, respectively.
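The two image operations named in the abstract, SAD and alpha blending, can be illustrated with a minimal sketch. This is not the authors' implementation: the function names (`sad`, `nearest_template`, `alpha_blend`, `transition_frames`), the representation of lip images as nested lists of grayscale values, and the nearest-template assignment are all assumptions made for illustration. In particular, the nearest-template step is a simple stand-in and does not reproduce the kernel-based, one-class-SVM-inspired clustering described in the paper.

```python
def sad(a, b):
    """Sum of absolute differences between two equally sized grayscale images."""
    if len(a) != len(b) or any(len(ra) != len(rb) for ra, rb in zip(a, b)):
        raise ValueError("images must have the same dimensions")
    return sum(abs(pa - pb) for ra, rb in zip(a, b) for pa, pb in zip(ra, rb))


def nearest_template(lip, templates):
    """Return the label of the template with the smallest SAD to `lip`.

    Hypothetical stand-in for cluster assignment: a lower SAD is treated
    as a higher lip shape likelihood.
    """
    return min(templates, key=lambda label: sad(lip, templates[label]))


def alpha_blend(src, dst, alpha):
    """Per-pixel blend: (1 - alpha) * src + alpha * dst, with alpha in [0, 1]."""
    if not 0.0 <= alpha <= 1.0:
        raise ValueError("alpha must be in [0, 1]")
    return [
        [round((1 - alpha) * ps + alpha * pd) for ps, pd in zip(rs, rd)]
        for rs, rd in zip(src, dst)
    ]


def transition_frames(src, dst, steps):
    """Frames that fade from the source lip image to the destination lip image."""
    return [alpha_blend(src, dst, i / steps) for i in range(steps + 1)]


# Toy 2x2 "lip images" (assumed data, for illustration only).
closed_lips = [[0, 0], [0, 0]]
open_lips = [[100, 100], [100, 100]]
templates = {"a": open_lips, "m": closed_lips}

observed = [[90, 90], [90, 100]]
label = nearest_template(observed, templates)
frames = transition_frames(closed_lips, open_lips, 4)
```

In this sketch, raising `steps` gives a smoother transition between two lip shapes at the cost of more frames to render, which is the trade-off a real-time system on a low-bandwidth device would need to balance.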
Pages: 516-523
Number of pages: 8