Estimation of speaking speed for faster face detection in video-footage

被引：1

作者：

Ikeda, O ^{[1
]}

机构：

[1] Takushoku Univ, Fac Engn, Hachioji, Tokyo 1930985, Japan

来源：

2005 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO (ICME), VOLS 1 AND 2 | 2005年

关键词：

D O I：

10.1109/ICME.2005.1521455

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We previously reported a face detection system based on color segmentation using HSV. It was shown that the color is more effective than other colors not only in accurate segmentation but also in effective extraction of facial features. The first is crucial for detection and the latter for recognition. When it comes to video footages of news program, sound often accompanies the video and persons express themselves by moving facial parts while speaking. In this paper we improve the face detection in speed using both sound and video in a combined way. First, the rate of syllables spoken is estimated from the sound. Next, for a beginning short video clip of each new scene, a differential image is formed with the frame distance corresponding to the rate to find mouth and eyes. This enables us to reduce the number of sampling points for segmentation to a great degree and to enhance the reliability of the detection. Also music is discriminated from speaking by the estimation. These contribute to much faster detection of face.

引用

下载

页码：442 / 445

页数：4

共 50 条

[1] Automated 3D thorax model generation using handheld video-footage
Nadine Dussel
Reinhard Fuchs
Andreas W. Reske
Thomas Neumuth
International Journal of Computer Assisted Radiology and Surgery, 2022, 17 : 1707 - 1716
[2] Automated 3D thorax model generation using handheld video-footage
Dussel, Nadine
Fuchs, Reinhard
Reske, Andreas W.
Neumuth, Thomas
INTERNATIONAL JOURNAL OF COMPUTER ASSISTED RADIOLOGY AND SURGERY, 2022, 17 (09) : 1707 - 1716
[3] Segmentation of faces in video footage using HSV color for face detection and image retrieval
Ikeda, O
2003 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOL 3, PROCEEDINGS, 2003, : 913 - 916
[4] Speaking video summary based on face detection in moving region
Jianqiang, Huang, 1600, Bentham Science Publishers B.V., P.O. Box 294, Bussum, 1400 AG, Netherlands (08):
[5] Face analysis in video: face detection and tracking with pose estimation
Mliki, Hazar
Hammami, Mohamed
INTERNATIONAL JOURNAL OF BIOMETRICS, 2018, 10 (02) : 121 - 141
[6] Crowd Violence Detection from Video Footage
Gkountakos, Konstantinos
Ioannidis, Konstantinos
Tsikrika, Theodora
Vrochidis, Stefanos
Kompatsiaris, Ioannis
2021 INTERNATIONAL CONFERENCE ON CONTENT-BASED MULTIMEDIA INDEXING (CBMI), 2021, : 231 - 234
[7] Robust speaking face identification for video analysis
Wu, Yi
Hu, Wei
Wang, Tao
Zhang, Yimin
Cheng, Jian
Lu, Hanqing
ADVANCES IN MULTIMEDIA INFORMATION PROCESSING - PCM 2007, 2007, 4810 : 665 - +
[8] Is Faster Better? A Study of Video Playback Speed
Lang, David
Chen, Guanling
Mirzaei, Kathy
Paepcke, Andreas
LAK20: THE TENTH INTERNATIONAL CONFERENCE ON LEARNING ANALYTICS & KNOWLEDGE, 2020, : 260 - 269
[9] Human face super-resolution on poor quality surveillance video footage
Muhammad Farooq
Matthew N. Dailey
Arif Mahmood
Jednipat Moonrinta
Mongkol Ekpanyapong
Neural Computing and Applications, 2021, 33 : 13505 - 13523
[10] Human face super-resolution on poor quality surveillance video footage
Farooq, Muhammad
Dailey, Matthew N.
Mahmood, Arif
Moonrinta, Jednipat
Ekpanyapong, Mongkol
NEURAL COMPUTING & APPLICATIONS, 2021, 33 (20): : 13505 - 13523

← 1 2 3 4 5 →