Verifying Human Users in Speech-Based Interactions

被引:0
|
作者
Shirali-Shahreza, Sajad [1 ]
Ganjali, Yashar [1 ]
Balakrishnan, Ravin [1 ]
机构
[1] Univ Toronto, Dept Comp Sci, Toronto, ON M5S 1A1, Canada
关键词
Accessibility; CAPTCHA; Speech Recognition; Speech Synthesis; VERIFICATION;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Verifying that a live human is interacting with an automated speech based system is needed in some applications such as biometric authentication. In this paper, we present a method to verify that the user is human. Simply stated, our method asks the user to repeat a sentence. The reply is analyzed to verify that it is the requested sentence and said by a human, not a speech synthesis system. Our method is taking advantage of both speech synthesizer and speech recognizer limitations to detect computer programs, which is new, and potentially more accessible, way to develop CAPTCHA systems. Using an acoustic model trained on voices of over 1000 users, our system can verify the user's answer with 98% accuracy and with 80% success in distinguishing humans from computers.
引用
收藏
页码:1596 / 1599
页数:4
相关论文
共 50 条
  • [1] Speech-Based Interface For Visually Impaired Users
    Huang, Yi-Chin
    Tsai, Cheng-Hung
    [J]. IEEE 20TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS / IEEE 16TH INTERNATIONAL CONFERENCE ON SMART CITY / IEEE 4TH INTERNATIONAL CONFERENCE ON DATA SCIENCE AND SYSTEMS (HPCC/SMARTCITY/DSS), 2018, : 1223 - 1228
  • [2] Speech-based Web Navigation for Limited Mobility Users
    Radostev, Vasiliy
    Berger, Serge
    Tabrizi, Justin
    Kamyshev, Pasha
    Suzuki, Hisami
    [J]. INTERSPEECH 2019, 2019, : 976 - 977
  • [3] Employers' Speech-Based First Impressions of Cochlear Implant Users
    Freeman, Valerie
    [J]. JOURNAL OF DEAF STUDIES AND DEAF EDUCATION, 2022, : 246 - 253
  • [4] An investigation of speech-based human emotion recognition
    Wang, YJ
    Guan, L
    [J]. 2004 IEEE 6TH WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING, 2004, : 15 - 18
  • [5] Introduction to the Special Issue on Multimodal Processing in Speech-Based Interactions
    Meng, Helen
    Oviatt, Sharon
    Potamianos, Gerasimos
    Rigoll, Gerhard
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2009, 17 (03): : 409 - 410
  • [6] Speech-based services
    Furman, DS
    Cosky, MJ
    Thomson, DL
    O'Brien, SA
    Sumner, EE
    [J]. BELL LABS TECHNICAL JOURNAL, 1999, 4 (02) : 88 - 97
  • [7] Anticipation in speech-based human-machine interfaces
    Ondas, Stanislav
    Juhar, Jozef
    Kiktova, Eva
    Zimmermann, Julius
    [J]. 2018 9TH IEEE INTERNATIONAL CONFERENCE ON COGNITIVE INFOCOMMUNICATIONS (COGINFOCOM), 2018, : 117 - 121
  • [8] Challenges in speech-based human-computer interfaces
    Minker, Wolfgang
    Pittermann, Johannes
    Pittermann, Angela
    Strauss, Petra-Maria
    Buehler, Dirk
    [J]. INTERNATIONAL JOURNAL OF SPEECH TECHNOLOGY, 2007, 10 (2-3) : 109 - 119
  • [9] Factors affecting users' choice of words in speech-based interaction with public technology
    Baber C.
    Johnson G.I.
    Cleaver D.
    [J]. International Journal of Speech Technology, 1997, 2 (1) : 45 - 59
  • [10] Accessible Speech-Based and Multimodal Media Center Interface for Users with Physical Disabilities
    Turunen, Markku
    Hakulinen, Jaakko
    Melto, Aleksi
    Hella, Juho
    Laivo, Tuuli
    Rajaniemi, Juha-Pekka
    Makinen, Erno
    Soronen, Hannu
    Hansen, Mervi
    Pakarinen, Santtu
    Heimonen, Tomi
    Rantala, Jussi
    Valkama, Pellervo
    Miettinen, Toni
    Raisamo, Roope
    [J]. DEVELOPMENT OF MULTIMODAL INTERFACES: ACTIVE LISTING AND SYNCHRONY, 2010, 5967 : 66 - +