Live Demonstration: Cloud-based Audio-Visual Speech Enhancement in Multimodal Hearing-aids

被引:0
|
作者
Bishnu, Abhijeet [1 ]
Gupta, Ankit [2 ]
Gogate, Mandar [3 ]
Dashtipour, Kia [3 ]
Arslan, Tughrul [1 ]
Adeel, Ahsan [4 ]
Hussain, Amir [3 ]
Sellathurai, Mathini [2 ]
Ratnarajah, Tharmalingam [1 ]
机构
[1] Univ Edinburgh, Sch Engn, Edinburgh, Midlothian, Scotland
[2] Heriot Watt Watt Univ, Sch Engn & Phys Sci, Edinburgh, Midlothian, Scotland
[3] Edinburgh Napier Univ, Sch Comp, Edinburgh, Midlothian, Scotland
[4] Univ Wolverhampton, Sch Math & Comp Sci, Wolverhampton, England
基金
英国工程与自然科学研究理事会;
关键词
D O I
10.1109/ISCAS46773.2023.10182060
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
引用
收藏
页数:1
相关论文
共 50 条
  • [1] 5G-IoT Cloud based Demonstration of Real-Time Audio-Visual Speech Enhancement for Multimodal Hearing-aids
    Gupta, Ankit
    Bishnu, Abhijeet
    Gogate, Mandar
    Dashtipour, Kia
    Arslan, Tughrul
    Adeel, Ahsan
    Hussain, Amir
    Ratnarajah, Tharmalingam
    Sellathurai, Mathini
    [J]. INTERSPEECH 2023, 2023, : 686 - 687
  • [2] An Audio-Visual Speech Enhancement System Based on 3D Image Features: An Application in Hearing Aids
    Chung, Yu-Ching
    Han, Ji-Yan
    Wang, Bo-Sin
    Zheng, Wei-Zhong
    Shen, Kung-Yao
    Lai, Ying-Hui
    [J]. 2023 ASIA PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE, APSIPA ASC, 2023, : 1131 - 1137
  • [3] Audio-visual speech recognition based on joint training with audio-visual speech enhancement for robust speech recognition
    Hwang, Jung-Wook
    Park, Jeongkyun
    Park, Rae-Hong
    Park, Hyung-Min
    [J]. APPLIED ACOUSTICS, 2023, 211
  • [4] Canonical cortical graph neural networks and its application for speech enhancement in audio-visual hearing aids
    Passos, Leandro A.
    Papa, Joao Paulo
    Hussain, Amir
    Adeel, Ahsan
    [J]. NEUROCOMPUTING, 2023, 527 : 196 - 203
  • [5] Inventory-Based Audio-Visual Speech Enhancement
    Kolossa, Dorothea
    Nickel, Robert
    Zeiler, Steffen
    Martin, Rainer
    [J]. 13TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2012 (INTERSPEECH 2012), VOLS 1-3, 2012, : 586 - 589
  • [6] Lite Audio-Visual Speech Enhancement
    Chuang, Shang-Yi
    Tsao, Yu
    Lo, Chen-Chou
    Wang, Hsin-Min
    [J]. INTERSPEECH 2020, 2020, : 1131 - 1135
  • [7] Audio-visual enhancement of speech in noise
    Girin, L
    Schwartz, JL
    Feng, G
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2001, 109 (06): : 3007 - 3020
  • [8] Audio-Visual Speech Codecs: Rethinking Audio-Visual Speech Enhancement by Re-Synthesis
    Yang, Karren
    Markovic, Dejan
    Krenn, Steven
    Agrawal, Vasu
    Richard, Alexander
    [J]. 2022 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2022, : 8217 - 8227
  • [9] Lip landmark-based audio-visual speech enhancement with multimodal feature fusion network
    Li, Yangke
    Zhang, Xinman
    [J]. NEUROCOMPUTING, 2023, 549
  • [10] Audio-Visual Speech Enhancement Using Multimodal Deep Convolutional Neural Networks
    Hou, Jen-Cheng
    Wang, Syu-Siang
    Lai, Ying-Hui
    Tsao, Yu
    Chang, Hsiu-Wen
    Wang, Hsin-Min
    [J]. IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE, 2018, 2 (02): : 117 - 128