Face Retrieval in Large-Scale News Video Datasets

被引：10

作者：

Thanh Duc Ngo ^{[1
]}

Hung Thanh Vu ^{[2
]}

Duy-Dinh Le ^{[3
]}

Satoh, Shin'ichi ^{[3
]}

机构：

[1] Grad Univ Adv Studies SOKENDAI, Dept Informat, Hayama, Kanagawa 2400115, Japan

[2] Univ Sci, Ho Chi Minh City, Vietnam

[3] Natl Inst Informat, Tokyo 1018430, Japan

来源：

IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS | 2013年 / E96D卷 / 08期

关键词：

face-track extraction; face-track matching; large-scale; news video; RECOGNITION;

D O I：

10.1587/transinf.E96.D.1811

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

Face retrieval in news video has been identified as a challenging task due to the huge variations in the visual appearance of the human face. Although several approaches have been proposed to deal with this problem, their extremely high computational cost limits their scalability to large-scale video datasets that may contain millions of faces of hundreds of characters. In this paper, we introduce approaches for face retrieval that are scalable to such datasets while maintaining competitive performances with state-of-the-art approaches. To utilize the variability of face appearances in video, we use a set of face images called face-track to represent the appearance of a character in a video shot. Our first proposal is an approach for extracting face-tracks. We use a point tracker to explore the connections between detected faces belonging to the same character and then group them into one face-track. We present techniques to make the approach robust against common problems caused by flash lights, partial occlusions, and scattered appearances of characters in news videos. In the second proposal, we introduce an efficient approach to match face-tracks for retrieval. Instead of using all the faces in the face-tracks to compute their similarity, our approach obtains a representative face for each face-track. The representative face is computed from faces that are sampled from the original face-track. As a result, we significantly reduce the computational cost of face-track matching while taking into account the variability of faces in face-tracks to achieve high matching accuracy. Experiments are conducted on two face-track datasets extracted from real-world news videos, of such scales that have never been considered in the literature. One dataset contains 1,497 face-tracks of 41 characters extracted from 370 hours of TRECVID videos. The other dataset provides 5,567 face-tracks of 111 characters observed from a television news program (NHK News 7) over 11 years. We make both datasets publically accessible by the research community. The experimental results show that our proposed approaches achieved a remarkable balance between accuracy and efficiency.

引用

页码：1811 / 1825

页数：15

共 50 条

[1] Face Retrieval on Large-Scale Video Data
Herrmann, Christian
Beyerer, Juergen
[J]. 2015 12TH CONFERENCE ON COMPUTER AND ROBOT VISION CRV 2015, 2015, : 192 - 199
[2] Analyzing large-scale news video databases to support knowledge visualization and intuitive retrieval
Luo, Hangzai
Fan, Jianping
Yang, Jing
Ribarsky, William
Satoh, Shinichi
[J]. VAST: IEEE SYMPOSIUM ON VISUAL ANALYTICS SCIENCE AND TECHNOLOGY 2007, PROCEEDINGS, 2007, : 107 - +
[3] A Compression Hashing Scheme for Large-scale Face Retrieval
Li, Jiayong
Ng, Wing W. Y.
Tian, Xing
[J]. 2018 8TH INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND TECHNOLOGY (ICIST 2018), 2018, : 245 - 251
[4] Large-Scale Video Retrieval Using Image Queries
Araujo, Andre
Girod, Bernd
[J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2018, 28 (06) : 1406 - 1420
[5] DLSTM Approach to Video Modeling with Hashing for Large-Scale Video Retrieval
Zhuang, Naifan
Ye, Jun
Hua, Kien A.
[J]. 2016 23RD INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2016, : 3222 - 3227
[6] Attention-Based Video Hashing for Large-Scale Video Retrieval
Wang, Yingxin
Nie, Xiushan
Shi, Yang
Zhou, Xin
Yin, Yilong
[J]. IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS, 2021, 13 (03) : 491 - 502
[7] A Lightweight Framework for Fast Image Retrieval on Large-Scale Image Datasets
Chen, Renhai
Li, Wenwen
Rao, Guozheng
Feng, Zhiyong
[J]. 2020 9TH IEEE NON-VOLATILE MEMORY SYSTEMS AND APPLICATIONS SYMPOSIUM (NVMSA 2020), 2020, : 42 - 47
[8] Knowledge Mining and Visualization on News Webpages and Large-Scale News Video Database
Luo, Hangzai
Yang, Jiahang
Zhou, Aoying
Fan, Jianping
Hu, Tianming
[J]. 2008 IFIP INTERNATIONAL CONFERENCE ON NETWORK AND PARALLEL COMPUTING, PROCEEDINGS, 2008, : 452 - +
[9] Topic threading for structuring a large-scale news video archive
Ide, I
Mo, H
Katayama, N
Satoh, S
[J]. IMAGE AND VIDEO RETRIEVAL, PROCEEDINGS, 2004, 3115 : 123 - 131
[10] Exploring large-scale video news via interactive visualization
Luo, Hangzai
Fan, Jianping
Yang, Jing
Ribarsky, William
Satoh, Shin'ichi
[J]. VAST 2006: IEEE SYMPOSIUM ON VISUAL ANALYTICS SCIENCE AND TECHNOLOGY, PROCEEDINGS, 2006, : 75 - +

← 1 2 3 4 5 →