Audio-Visual Twins Database

被引:0
|
作者
Li, Jing [1 ]
Zhang, Li [1 ]
Guo, Dong [2 ]
Zhuo, Shaojie [3 ]
Sim, Terence [1 ]
机构
[1] Natl Univ Singapore, Sch Comp, Singapore 117548, Singapore
[2] Facebook, Menlo Pk, CA USA
[3] Qualcomm, Toronto, ON, Canada
关键词
RECOGNITION;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Identical twins pose an interesting challenge for recognition systems due to their similar appearance. Although various biometrics have been proposed for the problem, existing works are quite limited due to the difficulty of obtaining a twins database. To encourage the methods for twins recognition and make a fair comparison of them by using the same database, we collected an audio-visual twins database at the Sixth Mojiang International Twins Festival held on 1 May 2010, China. Our database contains 39 pairs of twins in total, including Chinese, American and Russian subjects. This database contains several face images, facial motion videos and audio records for each subject. In this paper, we describe the collection procedure, organization of the database, and usage method of the database. We also show our experiments on face verification, facial motion verification and speaker verification for twins to provide usage examples of the database.
引用
收藏
页码:493 / 500
页数:8
相关论文
共 50 条
  • [1] An audio-visual speech recognition with a new mandarin audio-visual database
    Liao, Wen-Yuan
    Pao, Tsang-Long
    Chen, Yu-Te
    Chang, Tsun-Wei
    [J]. INT CONF ON CYBERNETICS AND INFORMATION TECHNOLOGIES, SYSTEMS AND APPLICATIONS/INT CONF ON COMPUTING, COMMUNICATIONS AND CONTROL TECHNOLOGIES, VOL 1, 2007, : 19 - +
  • [2] A Turkish Audio-Visual Emotional Database
    Onder, Onur
    Zhalehpour, Sara
    Erdem, Cigdem Eroglu
    [J]. 2013 21ST SIGNAL PROCESSING AND COMMUNICATIONS APPLICATIONS CONFERENCE (SIU), 2013,
  • [3] SUTAV: A Turkish Audio-Visual Database
    Topkaya, Ibrahim Saygin
    Erdogan, Hakan
    [J]. LREC 2012 - EIGHTH INTERNATIONAL CONFERENCE ON LANGUAGE RESOURCES AND EVALUATION, 2012, : 2334 - 2337
  • [4] Development of an audio-visual database system for human identification
    Bargale, CB
    Chaudhuri, S
    Bhattacharyya, P
    [J]. AUDIO- AND VIDEO-BASED BIOMETRIC PERSON AUTHENTICATION, 1997, 1206 : 345 - 352
  • [5] METHODS AND CHALLENGES FOR CREATING AN EMOTIONAL AUDIO-VISUAL DATABASE
    Pandharipande, Meghna A.
    Chakraborty, Rupayan
    Kopparapu, Sunil Kumar
    [J]. 2017 20TH CONFERENCE OF THE ORIENTAL CHAPTER OF THE INTERNATIONAL COORDINATING COMMITTEE ON SPEECH DATABASES AND SPEECH I/O SYSTEMS AND ASSESSMENT (O-COCOSDA), 2017, : 183 - 188
  • [6] An audio-visual database for evaluating person tracking algorithms
    Krinidis, M
    Stamou, G
    Teutsch, H
    Spors, S
    Nikolaidis, N
    Rabenstein, R
    Pitas, L
    [J]. 2005 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOLS 1-5: SPEECH PROCESSING, 2005, : 237 - 240
  • [7] A New Audio-Visual Database to Represent Urban Path
    Qing, Ji
    [J]. INFORMATION AND BUSINESS INTELLIGENCE, PT I, 2012, 267 : 713 - 719
  • [8] CHEAVD: a Chinese natural emotional audio-visual database
    Li, Ya
    Tao, Jianhua
    Chao, Linlin
    Bao, Wei
    Liu, Yazhu
    [J]. JOURNAL OF AMBIENT INTELLIGENCE AND HUMANIZED COMPUTING, 2017, 8 (06) : 913 - 924
  • [9] BUILDING A CHINESE NATURAL EMOTIONAL AUDIO-VISUAL DATABASE
    Bao, Wei
    Li, Ya
    Gu, Mingliang
    Yang, Minghao
    Li, Hao
    Chao, Linlin
    Tao, Jianhua
    [J]. 2014 12TH INTERNATIONAL CONFERENCE ON SIGNAL PROCESSING (ICSP), 2014, : 583 - 587
  • [10] An audio-visual distance for audio-visual speech vector quantization
    Girin, L
    Foucher, E
    Feng, G
    [J]. 1998 IEEE SECOND WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING, 1998, : 523 - 528