Human-Machine Interaction Personalization: a Review on Gender and Emotion Recognition Through Speech Analysis

被引:0
|
作者
La Mura, Monica [1 ]
Lamberti, Patrizia [1 ]
机构
[1] Univ Salerno, Dept Informat & Elect Engn & Appl Math, Fisciano, Italy
关键词
HMI; gender recognition; emotion recognition; speech analysis;
D O I
10.1109/metroind4.0iot48571.2020.9138203
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The increasing spread of pervasive technology has led to the fast development of human-centered connected systems, such as cloud-based voice services, assisted driving systems, domotics control systems, personal digital assistants. The user interacts with these systems by speaking to an artificial intelligence, which interprets the speaker's requests and takes decision accordingly. In such scenario, the real-time collection of personal information from the speaker's voice is a key function to develop in order to offer personalized services. Gender is part of the basic information needed to customize the user experience. Furthermore, knowledge about the sex of the speaker also proves useful in automatic speaker recognition and voice-based identity recognition systems, since it restricts the search space to individuals of one gender, thus speeding up the system response. Therefore, gender recognition techniques through speech analysis have largely attracted the researchers' attention. Speech analysis is usually performed by extracting some features from the speech signal that can be affected by additional factors other than the gender: emotional state of the speaker, for example, is conveyed in the speech by altering some parameters that take part to the gender recognition process. At the same time, the outcome of emotion recognition systems based on speech analysis can be affected by the speaker's gender. This paper briefly summarizes the techniques used to perform gender recognition through speech analysis and proposes a practice to take gender into account in emotion recognition methods.
引用
收藏
页码:319 / 323
页数:5
相关论文
共 50 条
  • [21] Data On Emotional Learning And Human-Machine Interaction Plenary Speech
    Esposito, Anna
    [J]. 2014 5th IEEE Conference on Cognitive Infocommunications (CogInfoCom), 2014, : 11 - 11
  • [22] BLIND SPEECH SEPARATION FOR ROBOTS WITH INTELLIGENT HUMAN-MACHINE INTERACTION
    Huang Yulei Ding Zhizhong Dai Lirong* Chen Xiaoping* (Department of Communication Engineering
    [J]. Journal of Electronics(China), 2012, (Z2) : 286 - 293
  • [23] BLIND SPEECH SEPARATION FOR ROBOTS WITH INTELLIGENT HUMAN-MACHINE INTERACTION
    Huang Yulei Ding Zhizhong Dai Lirong Chen Xiaoping Department of Communication Engineering Hefei University of Technology Hefei China Department of Electronic Engineering and Information Science University of Science and Technology of China Hefei China
    [J]. Journal of Electronics(China)., 2012, 29(Z2) (China) - 293
  • [24] Human-Machine Interaction Speech Corpus from the ROBIN project
    Pais, Vasile
    Ion, Radu
    Avram, Andrei-Marius
    Irimia, Elena
    Mititelu, Verginica Barbu
    Mitrofan, Maria
    [J]. 2021 INTERNATIONAL CONFERENCE ON SPEECH TECHNOLOGY AND HUMAN-COMPUTER DIALOGUE (SPED), 2021, : 91 - 96
  • [25] Speech emotion recognition using machine learning - A systematic review
    Madanian, Samaneh
    Chen, Talen
    Adeleye, Olayinka
    Templeton, John Michael
    Poellabauer, Christian
    Parry, Dave
    Schneidere, Sandra L.
    [J]. INTELLIGENT SYSTEMS WITH APPLICATIONS, 2023, 20
  • [26] End-to-End Myanmar Speech Recognition with Human-Machine Cooperation
    Wang, Faliang
    Yang, Yiling
    Yang, Jian
    [J]. 2022 INTERNATIONAL CONFERENCE ON ASIAN LANGUAGE PROCESSING (IALP 2022), 2022, : 156 - 161
  • [27] Cultural Dimension in Emotion Recognition for Human Machine Interaction
    Quiros-Ramirez, Maria Alejandra
    Onisawa, Takehisa
    [J]. 2014 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC), 2014, : 711 - 716
  • [28] Human-machine interaction compensating imperfection in facial expression recognition
    Sato, Mie
    Shang Yuyi
    Kasuga, Masao
    [J]. TENCON 2005 - 2005 IEEE REGION 10 CONFERENCE, VOLS 1-5, 2006, : 2598 - +
  • [29] A machine emotion transfer model for intelligent human-machine interaction based on group division
    Xiao, Guorong
    Ma, Yunju
    Liu, Cheng
    Jiang, Dazhi
    [J]. MECHANICAL SYSTEMS AND SIGNAL PROCESSING, 2020, 142
  • [30] Deep Multimodal Emotion Recognition on Human Speech: A Review
    Koromilas, Panagiotis
    Giannakopoulos, Theodoros
    [J]. APPLIED SCIENCES-BASEL, 2021, 11 (17):