DuIVA: An Intelligent Voice Assistant for Hands-free and Eyes-free Voice Interaction with the Baidu Maps App

Cited by: 7
Authors
Huang, Jizhou [1]
Wang, Haifeng [1]
Ding, Shiqiang [1]
Wang, Shaolei [1]
Affiliation
[1] Baidu Inc., Beijing, People's Republic of China
Keywords
Intelligent voice assistant; voice interaction; hands-free; eyes-free; user-to-app interaction; task-oriented dialogue; Baidu Maps
DOI
10.1145/3534678.3539030
Chinese Library Classification
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Mobile map apps such as the Baidu Maps app have become a ubiquitous and essential tool for users to find optimal routes and get turn-by-turn navigation services while driving. However, interacting with such apps through a visual-manual interaction modality while driving inevitably causes driver distraction, due to the highly conspicuous nature of the time-sharing, multi-tasking behavior of the driver. In this paper, we present our efforts and findings from a 4-year longitudinal study on designing and implementing DuIVA, an intelligent voice assistant (IVA) embedded in the Baidu Maps app for hands-free, eyes-free human-to-app interaction in a fully voice-controlled manner. Specifically, DuIVA is designed to enable users to control the functionalities of Baidu Maps (e.g., navigation and location search) through voice interaction rather than visual-manual interaction, which minimizes driver distraction and promotes safe driving by allowing the driver to keep "eyes on the road and hands on the wheel" while interacting with the Baidu Maps app. DuIVA has been deployed in production at Baidu Maps since November 2017. It facilitates a better interaction modality with the Baidu Maps app and improves the accessibility and usability of the app by providing users with in-app voice activation, natural language queries, and multi-round dialogue. As of December 31, 2021, over 530 million users have used DuIVA, which demonstrates that DuIVA is an industrial-grade and production-proven solution for in-app intelligent voice assistants.
Pages: 3040-3050
Page count: 11