SpeakerSense: Energy Efficient Unobtrusive Speaker Identification on Mobile Phones

Authors
Lu, Hong [1]
Brush, A. J. Bernheim [1]
Priyantha, Bodhi [1]
Karlson, Amy K. [1]
Liu, Jie [1]
Affiliations
[1] Microsoft Res, Redmond, WA 98052 USA
Source
PERVASIVE COMPUTING | 2011 / Vol. 6696
Keywords
Continuous audio sensing; mobile phones; speaker identification; energy efficiency; heterogeneous multi-processor hardware
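DOI
Not available
Chinese Library Classification number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract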
Automatically identifying the person you are talking with using continuous audio sensing has the potential to enable many pervasive computing applications from memory assistance to annotating life logging data. However, a number of challenges, including energy efficiency and training data acquisition, must be addressed before unobtrusive audio sensing is practical on mobile devices. We built SpeakerSense, a speaker identification prototype that uses a heterogeneous multi-processor hardware architecture that splits computation between a low power processor and the phone's application processor to enable continuous background sensing with minimal power requirements. Using SpeakerSense, we benchmarked several system parameters (sampling rate, GMM complexity, smoothing window size, and amount of training data needed) to identify thresholds that balance computation cost with performance. We also investigated channel compensation methods that make it feasible to acquire training data from phone calls and an automatic segmentation method for training speaker models based on one-to-one conversations.
Pages: 188-205
Page count: 18