DuIVA: An Intelligent Voice Assistant for Hands-free and Eyes-free Voice Interaction with the Baidu Maps App

被引:7
|
作者
Huang, Jizhou [1 ]
Wang, Haifeng [1 ]
Ding, Shiqiang [1 ]
Wang, Shaolei [1 ]
机构
[1] Baidu Inc, Beijing, Peoples R China
关键词
Intelligent voice assistant; voice interaction; hands-free; eyes-free; user-to-app interaction; task-oriented dialogue; Baidu Maps;
D O I
10.1145/3534678.3539030
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Mobile map apps such as the Baidu Maps app have become a ubiquitous and essential tool for users to find optimal routes and get turn-by-turn navigation services while driving. However, interacting with such apps while driving through visual-manual interaction modality inevitably causes driver distraction, due to the highly conspicuous nature of the time-sharing, multi-tasking behavior of the driver. In this paper, we present our efforts and findings of a 4-year longitudinal study on designing and implementing DuIVA, which is an intelligent voice assistant (IVA) embedded in the Baidu Maps app for hands-free, eyes-free human-to-app interaction in a fully voice-controlled manner. Specifically, DuIVA is designed to enable users to control the functionalities of Baidu Maps (e.g., navigation and location search) through voice interaction, rather than visual-manual interaction, which minimizes driver distraction and promotes safe driving by allowing the driver to keep "eyes on the road and hands on the wheel" while interacting with the Baidu Maps app. DuIVA has already been deployed in production at Baidu Maps since November 2017, which facilitates a better interaction modality with the Baidu Maps app and improves the accessibility and usability of the app by providing users with in-app voice activation, natural language queries, and multi-round dialogue. As of December 31, 2021, over 530 million users have used DuIVA, which demonstrates that DuIVA is an industrial-grade and production-proven solution for in-app intelligent voice assistants.
引用
收藏
页码:3040 / 3050
页数:11
相关论文
共 44 条
  • [31] Hands-free speech after surgical voice rehabilitation with a Provox® voice prosthesis: experience with the Provox FreeHands HME tracheostoma valve® system
    K. J. Lorenz
    K. Groll
    A. H. Ackerstaff
    F. J. M. Hilgers
    H. Maier
    European Archives of Oto-Rhino-Laryngology, 2007, 264 : 151 - 157
  • [32] Hands-free speech after surgical voice rehabilitation with a Provox® voice prosthesis:: experience with the Provox FreeHands HME tracheostoma valve® system
    Lorenz, K. J.
    Groll, K.
    Ackerstaff, A. H.
    Hilgers, F. J. M.
    Maier, H.
    EUROPEAN ARCHIVES OF OTO-RHINO-LARYNGOLOGY, 2007, 264 (02) : 151 - 157
  • [33] Whoosh: Non-Voice Acoustics for Low-Cost, Hands-Free, and Rapid Input on Smartwatches
    Reyes, Gabriel
    Zhang, Dingtian
    Ghosh, Sarthak
    Shah, Pratik
    Wu, Jason
    Parnami, Aman
    Bercik, Bailey
    Starner, Thad
    Abowd, Gregory D.
    Edwards, W. Keith
    ISWC'16 - PROCEEDINGS OF THE 2016 ACM INTERNATIONAL SYMPOSIUM ON WEARABLE COMPUTERS, 2016, : 120 - 127
  • [34] Tell Me Where To Go: Voice-Controlled Hands-Free Locomotion for Virtual Reality Systems
    Hombeck, Jan
    Voigt, Henrik
    Heggemann, Timo
    Datta, Rabi R.
    Lawonn, Kai
    2023 IEEE CONFERENCE VIRTUAL REALITY AND 3D USER INTERFACES, VR, 2023, : 123 - 134
  • [35] Development of a Hands-free Electrolarynx for Obtaining a Human-like Voice using the LPC Residual Wave
    Takeuchi, Masaki
    Soejima, Yutaro
    Ahn, Jaesol
    Lee, Kunhak
    Takaki, Ken
    Ifukube, Tohru
    Yabu, Ken-Ichiro
    Takamichi, Shinnosuke
    Sekino, Masaki
    IEEJ Transactions on Fundamentals and Materials, 2022, 142 (09): : 390 - 396
  • [36] All Birds Must Fly: The Experience of Multimodal Hands-free Gaming with Gaze and Nonverbal Voice Synchronization
    Hedeshy, Ramin
    Kumar, Chandan
    Lauer, Mike
    Staab, Stefen
    PROCEEDINGS OF THE 2022 INTERNATIONAL CONFERENCE ON MULTIMODAL INTERACTION, ICMI 2022, 2022, : 278 - 287
  • [37] Development of a hands-free electrolarynx for obtaining a human-like voice using the LPC residual wave
    Takeuchi, Masaki
    Soejima, Yutaro
    Ahn, Jaesol
    Lee, Kunhak
    Takaki, Ken
    Ifukube, Tohru
    Yabu, Ken-Ichiro
    Takamichi, Shinnosuke
    Sekino, Masaki
    ELECTRICAL ENGINEERING IN JAPAN, 2022, 215 (04)
  • [38] A frequency-domain nonlinear echo processing algorithm for high quality hands-free voice communication devices
    Qingyun Wang
    Xin Chen
    Ruiyu Liang
    Haicheng Liu
    Multimedia Tools and Applications, 2021, 80 : 10777 - 10796
  • [39] A frequency-domain nonlinear echo processing algorithm for high quality hands-free voice communication devices
    Wang, Qingyun
    Chen, Xin
    Liang, Ruiyu
    Liu, Haicheng
    MULTIMEDIA TOOLS AND APPLICATIONS, 2021, 80 (07) : 10777 - 10796
  • [40] Telecytology and Hands-Free Digital Voice Communication for High Volume Rapid On-Site Evaluation: An Workflow Optimization
    Lin, Oscar
    Rudomina, Dorota
    Feratovic, Rusmir
    Sirintrapun, Joe
    LABORATORY INVESTIGATION, 2017, 97 : 105A - 105A