Face Retrieval in Large-Scale News Video Datasets

Cited by: 10
Authors
Thanh Duc Ngo [1]
Hung Thanh Vu [2]
Duy-Dinh Le [3]
Shin'ichi Satoh [3]
Affiliations
[1] Grad Univ Adv Studies SOKENDAI, Dept Informat, Hayama, Kanagawa 2400115, Japan
[2] Univ Sci, Ho Chi Minh City, Vietnam
[3] Natl Inst Informat, Tokyo 1018430, Japan
Source
Keywords
face-track extraction; face-track matching; large-scale; news video; RECOGNITION
DOI
10.1587/transinf.E96.D.1811
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Face retrieval in news video has been identified as a challenging task due to the huge variations in the visual appearance of the human face. Although several approaches have been proposed to deal with this problem, their extremely high computational cost limits their scalability to large-scale video datasets that may contain millions of faces of hundreds of characters. In this paper, we introduce approaches for face retrieval that are scalable to such datasets while maintaining performance competitive with state-of-the-art approaches. To utilize the variability of face appearances in video, we use a set of face images, called a face-track, to represent the appearance of a character in a video shot. Our first proposal is an approach for extracting face-tracks. We use a point tracker to explore the connections between detected faces belonging to the same character and then group them into one face-track. We present techniques to make the approach robust against common problems caused by flash lights, partial occlusions, and scattered appearances of characters in news videos. In the second proposal, we introduce an efficient approach to match face-tracks for retrieval. Instead of using all the faces in the face-tracks to compute their similarity, our approach obtains a representative face for each face-track. The representative face is computed from faces that are sampled from the original face-track. As a result, we significantly reduce the computational cost of face-track matching while taking into account the variability of faces in face-tracks to achieve high matching accuracy. Experiments are conducted on two face-track datasets extracted from real-world news videos, at scales that have not previously been considered in the literature. One dataset contains 1,497 face-tracks of 41 characters extracted from 370 hours of TRECVID videos. The other dataset provides 5,567 face-tracks of 111 characters observed from a television news program (NHK News 7) over 11 years. We make both datasets publicly accessible to the research community. The experimental results show that our proposed approaches achieved a remarkable balance between accuracy and efficiency.
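The representative-face idea in the abstract can be illustrated with a minimal sketch. The abstract does not say how faces are sampled, how the representative face is computed, or which distance is used, so the following are assumptions made only for illustration: each face is a plain feature vector, sampling is uniform at random, the representative face is the mean of the sampled vectors, and tracks are compared by Euclidean distance. The function names (`sample_faces`, `representative_face`, `rank_tracks`) are hypothetical and do not come from the paper.

```python
import math
import random


def sample_faces(track, k, seed=0):
    """Return up to k feature vectors from a face-track.

    Uniform random sampling is an assumption; the abstract only says
    that faces are "sampled from the original face-track".
    """
    if not track:
        raise ValueError("face-track must contain at least one face")
    if len(track) <= k:
        return list(track)
    return random.Random(seed).sample(track, k)


def representative_face(track, k=4, seed=0):
    """Average the sampled vectors into one vector (assumed: the mean)."""
    sampled = sample_faces(track, k, seed)
    dim = len(sampled[0])
    return [sum(face[i] for face in sampled) / len(sampled) for i in range(dim)]


def euclidean(a, b):
    """Euclidean distance between two equal-length vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def rank_tracks(query_track, database, k=4):
    """Rank database face-tracks by distance to the query's representative face.

    `database` maps a track name to its list of feature vectors.
    """
    query_rep = representative_face(query_track, k)
    scored = [
        (euclidean(query_rep, representative_face(track, k)), name)
        for name, track in database.items()
    ]
    return [name for _, name in sorted(scored)]


if __name__ == "__main__":
    database = {
        "track_a": [[0.0, 0.0], [0.2, 0.0], [0.0, 0.2]],
        "track_b": [[5.0, 5.0], [5.2, 5.0]],
    }
    query = [[0.1, 0.1], [0.0, 0.1]]
    print(rank_tracks(query, database))
```

Under these assumptions, comparing two face-tracks costs one distance computation between their representative vectors, whereas comparing every face in one track with every face in the other grows with the product of the two track sizes.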
Pages: 1811-1825
Number of pages: 15