SpeakerSense: Energy Efficient Unobtrusive Speaker Identification on Mobile Phones

被引:0
|
作者
Lu, Hong [1 ]
Brush, A. J. Bernheim [1 ]
Priyantha, Bodhi [1 ]
Karlson, Amy K. [1 ]
Liu, Jie [1 ]
机构
[1] Microsoft Res, Redmond, WA 98052 USA
来源
PERVASIVE COMPUTING | 2011年 / 6696卷
关键词
Continuous audio sensing; mobile phones; speaker identification; energy efficiency; heterogeneous multi-processor hardware;
D O I
暂无
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Automatically identifying the person you are talking with using continuous audio sensing has the potential to enable many pervasive computing applications from memory assistance to annotating life logging data. However, a number of challenges, including energy efficiency and training data acquisition, must be addressed before unobtrusive audio sensing is practical on mobile devices. We built SpeakerSense, a speaker identification prototype that uses a heterogeneous multi-processor hardware architecture that splits computation between a low power processor and the phone's application processor to enable continuous background sensing with minimal power requirements. Using SpeakerSense, we benchmarked several system parameters (sampling rate, GMM complexity, smoothing window size, and amount of training data needed) to identify thresholds that balance computation cost with performance. We also investigated channel compensation methods that make it feasible to acquire training data from phone calls and an automatic segmentation method for training speaker models based on one-to-one conversations.
引用
收藏
页码:188 / 205
页数:18
相关论文
共 50 条
  • [21] WISS, a Speaker Identification System for Mobile Robots
    Grondin, Francois
    Michaud, Francois
    2012 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION (ICRA), 2012, : 1817 - 1822
  • [22] Identification system based on color sequence and mobile phones
    Castro Garrido, Pilar
    Matas Miraz, Guillermo
    Bellido Outeirino, Jose
    Luque Ruiz, Irene
    Angel Gomez-Nieto, Miguel
    JOURNAL OF AMBIENT INTELLIGENCE AND SMART ENVIRONMENTS, 2012, 4 (04) : 287 - 303
  • [23] A framework for fast and secure packaging identification on mobile phones
    Diephuis, Maurits
    Voloshynovskiy, Sviatoslav
    Holotyak, Taras
    Standardo, Nabil
    Keel, Bruno
    MEDIA WATERMARKING, SECURITY, AND FORENSICS 2014, 2014, 9028
  • [24] Energy requirements of mobile phones and sensor technologies in mobile health applications
    Kreuzer, J.
    Diemer, R.
    Huber, T.
    4TH EUROPEAN CONFERENCE OF THE INTERNATIONAL FEDERATION FOR MEDICAL AND BIOLOGICAL ENGINEERING, 2009, 22 (1-3): : 1042 - 1045
  • [26] Dynamical modeling and experimental validation of a micro-speaker with corrugated diaphragm for mobile phones
    Chao, Paul C. -P.
    Wang, I-Ting
    MICROSYSTEM TECHNOLOGIES-MICRO-AND NANOSYSTEMS-INFORMATION STORAGE AND PROCESSING SYSTEMS, 2007, 13 (8-10): : 1241 - 1252
  • [27] Analysis of a dynamic speaker in mobile phones by considering mechanical, electrical, and magnetic coupling effects
    Hwang, GY
    Kim, KT
    Chung, SU
    Hwang, SM
    Kang, BS
    Hwang, IC
    JOURNAL OF APPLIED PHYSICS, 2002, 91 (10) : 6979 - 6981
  • [28] Dynamical modeling and experimental validation of a micro-speaker with corrugated diaphragm for mobile phones
    Paul C.-P. Chao
    I-Ting Wang
    Microsystem Technologies, 2007, 13 : 1241 - 1252
  • [29] Speaker Model Clustering for Efficient Speaker Identification in Large Population Applications
    Apsingekar, Vijendra Raj
    De Leon, Phillip L.
    IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2009, 17 (04): : 848 - 853
  • [30] Energy Onset Times for Speaker Identification
    Quatieri, T. F.
    Jankowski, C. R., Jr.
    Reynolds, D. A.
    IEEE SIGNAL PROCESSING LETTERS, 1994, 1 (11) : 160 - 162