DuIVA: An Intelligent Voice Assistant for Hands-free and Eyes-free Voice Interaction with the Baidu Maps App

Cited by: 7
Authors
Huang, Jizhou [1]
Wang, Haifeng [1]
Ding, Shiqiang [1]
Wang, Shaolei [1]
Affiliations
[1] Baidu Inc, Beijing, People's Republic of China
Keywords
Intelligent voice assistant; voice interaction; hands-free; eyes-free; user-to-app interaction; task-oriented dialogue; Baidu Maps
DOI
10.1145/3534678.3539030
Chinese Library Classification (CLC) number
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Abstract
Mobile map apps such as the Baidu Maps app have become a ubiquitous and essential tool for users to find optimal routes and get turn-by-turn navigation services while driving. However, interacting with such apps while driving through visual-manual interaction modality inevitably causes driver distraction, due to the highly conspicuous nature of the time-sharing, multi-tasking behavior of the driver. In this paper, we present our efforts and findings of a 4-year longitudinal study on designing and implementing DuIVA, which is an intelligent voice assistant (IVA) embedded in the Baidu Maps app for hands-free, eyes-free human-to-app interaction in a fully voice-controlled manner. Specifically, DuIVA is designed to enable users to control the functionalities of Baidu Maps (e.g., navigation and location search) through voice interaction, rather than visual-manual interaction, which minimizes driver distraction and promotes safe driving by allowing the driver to keep "eyes on the road and hands on the wheel" while interacting with the Baidu Maps app. DuIVA has already been deployed in production at Baidu Maps since November 2017, which facilitates a better interaction modality with the Baidu Maps app and improves the accessibility and usability of the app by providing users with in-app voice activation, natural language queries, and multi-round dialogue. As of December 31, 2021, over 530 million users have used DuIVA, which demonstrates that DuIVA is an industrial-grade and production-proven solution for in-app intelligent voice assistants.
Pages: 3040-3050
Page count: 11
Related papers
44 in total
  • [21] VERSE: Bridging Screen Readers and Voice Assistants for Enhanced Eyes-Free Web Search
    Vtyurina, Alexander
    Fourney, Adam
    Morris, Meredith Ringel
    Findlater, Leah
    White, Ryen W.
    ASSETS'19: THE 21ST INTERNATIONAL ACM SIGACCESS CONFERENCE ON COMPUTERS AND ACCESSIBILITY, 2019, : 414 - 426
  • [22] Intentional Voice Command Detection for Completely Hands-Free Speech Interface in Home Environments
    Obuchi, Yasunari
    Togami, Masahito
    Sumiyoshi, Takashi
    INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5, 2008, : 119 - 122
  • [23] VoiceDraw: A Hands-Free Voice-Driven Drawing Application for People with Motor Impairments
    Harada, Susumu
    Wobbrock, Jacob O.
    Landay, James A.
    ASSETS'07: PROCEEDINGS OF THE NINTH INTERNATIONAL ACM SIGACCESS CONFERENCE ON COMPUTERS AND ACCESSIBILITY, 2007, : 27 - 34
  • [24] A System Architecture for Hands-Free UAV Drone Control Using Intuitive Voice Commands
    Landau, Megan
    van Delden, Sebastian
    COMPANION OF THE 2017 ACM/IEEE INTERNATIONAL CONFERENCE ON HUMAN-ROBOT INTERACTION (HRI'17), 2017, : 181 - 182
  • [25] Health and Fitness Apps for Hands-Free Voice-Activated Assistants: Content Analysis
    Chung, Arlene E.
    Griffin, Ashley C.
    Selezneva, Dasha
    Gotz, David
    JMIR MHEALTH AND UHEALTH, 2018, 6 (09):
  • [26] A Hands-Free Approach With Voice to Text and Generative Artificial Intelligence: Streamlining Radiology Reporting
    Young, Austin
    Wang, Katherine E.
    Jin, Michael X.
    Avilla, Kian
    Gilotra, Kevin
    Nguyen, Pamela
    Ros, Pablo R.
    JOURNAL OF THE AMERICAN COLLEGE OF RADIOLOGY, 2025, 22 (02) : 200 - 203
  • [27] MULTI-CHANNEL NOISE REDUCTION FOR HANDS-FREE VOICE COMMUNICATION ON MOBILE PHONES
    Jin, Wenyu
    Taghizadeh, Mohammad J.
    Chen, Kainan
    Xiao, Wei
    2017 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP), 2017, : 506 - 510
  • [28] Preference in Voice Commands and Gesture Controls With Hands-Free Augmented Reality With Novel Users
    Korkiakoski, Mikko
    Alavesa, Paula
    Kostakos, Panos
    IEEE PERVASIVE COMPUTING, 2024, 23 (01) : 18 - 26
  • [29] EyeSayCorrect: Eye Gaze and Voice Based Hands-free Text Correction for Mobile Devices
    Zhao, Maozheng
    Huang, Henry
    Li, Zhi
    Liu, Rui
    Cui, Wenzhe
    Toshniwal, Kajal
    Goel, Ananya
    Wang, Andrew
    Zhao, Xia
    Rashidian, Sina
    Baig, Furqan
    Phi, Khiem
    Zhai, Shumin
    Ramakrishnan, I. V.
    Wang, Fusheng
    Bi, Xiaojun
    IUI'22: 27TH INTERNATIONAL CONFERENCE ON INTELLIGENT USER INTERFACES, 2022, : 470 - 482
  • [30] Affine Projection Versoria Algorithm for Robust Adaptive Echo Cancellation in Hands-Free Voice Communications
    Huang, Fuyi
    Zhang, Jiashu
    Zhang, Sheng
    IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2018, 67 (12) : 11924 - 11935