Estimation of speaking speed for faster face detection in video-footage

被引:1
|
作者
Ikeda, O [1 ]
机构
[1] Takushoku Univ, Fac Engn, Hachioji, Tokyo 1930985, Japan
关键词
D O I
10.1109/ICME.2005.1521455
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We previously reported a face detection system based on color segmentation using HSV. It was shown that the color is more effective than other colors not only in accurate segmentation but also in effective extraction of facial features. The first is crucial for detection and the latter for recognition. When it comes to video footages of news program, sound often accompanies the video and persons express themselves by moving facial parts while speaking. In this paper we improve the face detection in speed using both sound and video in a combined way. First, the rate of syllables spoken is estimated from the sound. Next, for a beginning short video clip of each new scene, a differential image is formed with the frame distance corresponding to the rate to find mouth and eyes. This enables us to reduce the number of sampling points for segmentation to a great degree and to enhance the reliability of the detection. Also music is discriminated from speaking by the estimation. These contribute to much faster detection of face.
引用
下载
收藏
页码:442 / 445
页数:4
相关论文
共 50 条
  • [1] Automated 3D thorax model generation using handheld video-footage
    Nadine Dussel
    Reinhard Fuchs
    Andreas W. Reske
    Thomas Neumuth
    International Journal of Computer Assisted Radiology and Surgery, 2022, 17 : 1707 - 1716
  • [2] Automated 3D thorax model generation using handheld video-footage
    Dussel, Nadine
    Fuchs, Reinhard
    Reske, Andreas W.
    Neumuth, Thomas
    INTERNATIONAL JOURNAL OF COMPUTER ASSISTED RADIOLOGY AND SURGERY, 2022, 17 (09) : 1707 - 1716
  • [3] Segmentation of faces in video footage using HSV color for face detection and image retrieval
    Ikeda, O
    2003 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOL 3, PROCEEDINGS, 2003, : 913 - 916
  • [4] Speaking video summary based on face detection in moving region
    Jianqiang, Huang, 1600, Bentham Science Publishers B.V., P.O. Box 294, Bussum, 1400 AG, Netherlands (08):
  • [5] Face analysis in video: face detection and tracking with pose estimation
    Mliki, Hazar
    Hammami, Mohamed
    INTERNATIONAL JOURNAL OF BIOMETRICS, 2018, 10 (02) : 121 - 141
  • [6] Crowd Violence Detection from Video Footage
    Gkountakos, Konstantinos
    Ioannidis, Konstantinos
    Tsikrika, Theodora
    Vrochidis, Stefanos
    Kompatsiaris, Ioannis
    2021 INTERNATIONAL CONFERENCE ON CONTENT-BASED MULTIMEDIA INDEXING (CBMI), 2021, : 231 - 234
  • [7] Robust speaking face identification for video analysis
    Wu, Yi
    Hu, Wei
    Wang, Tao
    Zhang, Yimin
    Cheng, Jian
    Lu, Hanqing
    ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2007, 2007, 4810 : 665 - +
  • [8] Is Faster Better? A Study of Video Playback Speed
    Lang, David
    Chen, Guanling
    Mirzaei, Kathy
    Paepcke, Andreas
    LAK20: THE TENTH INTERNATIONAL CONFERENCE ON LEARNING ANALYTICS & KNOWLEDGE, 2020, : 260 - 269
  • [9] Human face super-resolution on poor quality surveillance video footage
    Muhammad Farooq
    Matthew N. Dailey
    Arif Mahmood
    Jednipat Moonrinta
    Mongkol Ekpanyapong
    Neural Computing and Applications, 2021, 33 : 13505 - 13523
  • [10] Human face super-resolution on poor quality surveillance video footage
    Farooq, Muhammad
    Dailey, Matthew N.
    Mahmood, Arif
    Moonrinta, Jednipat
    Ekpanyapong, Mongkol
    NEURAL COMPUTING & APPLICATIONS, 2021, 33 (20): : 13505 - 13523