Multi-party audio conferencing based on a simpler MCU and client-side echo cancellation

被引:0
|
作者
Li, Junlin [1 ]
He, Li-wei [2 ]
Florencio, Dinei [2 ]
机构
[1] Georgia Inst Technol, Sch Elect & Comp Engn, Atlanta, GA 30332 USA
[2] Microsoft Res, Redmond, WA 98052 USA
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Traditional multiparty audio conferencing uses a star-shaped topology where all the clients connect to a central MCU (Multipoint Control Unit). The MCU mixes the signals from the speakers, encodes it, and sends back the encoded signal to each client. To prevent the speakers from hearing their own voices, the MCU has to produce and encode a different mixed signal for each speaker. As a result, the CPU load on the MCU increases proportionally to the number of speakers in the conference. In this paper, we introduce a new conferencing architecture, where the MCU produces a single encoded signal sum of all received signals and each client is responsible for removing its own signal if necessary. This architecture can substantially reduce CPU load on the MCU. The major challenge, however, is that the client's original speech is non-linearly distorted by the MCU encoding process. Simply subtracting the original speech from the mixed signal would produce an echo-like distortion. We solve that problem using a novel algorithm which completely removes the echo with minimal artifacts. Mean Opinion Score (MOS) results imply that the proposed algorithm works well, making the proposed multiparty audio conferencing architecture promising.
引用
收藏
页码:84 / +
页数:2
相关论文
共 4 条
  • [1] Investigating the effect of number of interlocutors on the quality of experience for multi-party audio conferencing
    Skowronek, Janto
    Raake, Alexander
    [J]. 12TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2011 (INTERSPEECH 2011), VOLS 1-5, 2011, : 836 - 839
  • [2] The Design and Implementation of Acoustic Echo Cancellation Subsystem for Multi-party Videoconferencing Systems
    Deng, Xu
    Bao, Congxiao
    Li, Xing
    [J]. 2013 INTERNATIONAL CONFERENCE ON TECHNOLOGICAL ADVANCES IN ELECTRICAL, ELECTRONICS AND COMPUTER ENGINEERING (TAEECE), 2013, : 152 - 156
  • [3] Controllable multi-party audio/video collaboration based on multicast over CERNET
    Zhang, Xuan
    Li, Xing
    Li, Chongrong
    [J]. TECHNOLOGIES FOR E-LEARNING AND DIGITAL ENTERTAINMENT, PROCEEDINGS, 2007, 4469 : 819 - +
  • [4] Blockchain-based secure deduplication of encrypted data supporting client-side semantically secure encryption without trusted third party
    Qin, Guiyun
    Li, Limin
    Liu, Pengtao
    Hu, Chengyu
    Guo, Shanqing
    [J]. TRANSACTIONS ON EMERGING TELECOMMUNICATIONS TECHNOLOGIES, 2024, 35 (04)