Multi-stream segmentation of meetings

Cited by: 0
Authors
Dielmann, A [1]
Renals, S [1]
Affiliation
[1] Univ Edinburgh, Ctr Speech Technol Res, Edinburgh EH8 9LW, Midlothian, Scotland
Keywords
DOI
Not available
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
This paper investigates the automatic segmentation of meetings into a sequence of group actions or phases. Our work is based on a corpus of multiparty meetings collected in a meeting room instrumented with video cameras, lapel microphones and a microphone array. We have extracted a set of feature streams, in this case extracted from the audio data, based on speaker turns, prosody and a transcript of what was spoken. We have related these signals to the higher level semantic categories via a multistream statistical model based on dynamic Bayesian networks (DBNs). We report on a set of experiments in which different DBN architectures are compared, together with the different feature streams. The resultant system has an action error rate of 9%.
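The record does not define the action error rate. A minimal sketch, assuming it is computed like word error rate (substitutions, deletions and insertions divided by the number of reference actions), could look as follows. The function name and the action labels are hypothetical and are not taken from the paper.

```python
from typing import Sequence


def action_error_rate(reference: Sequence[str], hypothesis: Sequence[str]) -> float:
    """Return (substitutions + deletions + insertions) / len(reference).

    Assumption: the metric mirrors word error rate, using a Levenshtein
    alignment between the reference and recognised action sequences.
    """
    n, m = len(reference), len(hypothesis)
    if n == 0:
        raise ValueError("reference sequence must not be empty")

    # prev[j] holds the edit distance between reference[:i-1] and hypothesis[:j].
    prev = list(range(m + 1))
    for i in range(1, n + 1):
        curr = [i] + [0] * m
        for j in range(1, m + 1):
            cost = 0 if reference[i - 1] == hypothesis[j - 1] else 1
            curr[j] = min(
                prev[j] + 1,         # deletion of a reference action
                curr[j - 1] + 1,     # insertion of a spurious action
                prev[j - 1] + cost,  # match or substitution
            )
        prev = curr
    return prev[m] / n


# Hypothetical action labels, for illustration only.
reference = ["monologue", "discussion", "presentation", "discussion"]
hypothesis = ["monologue", "discussion", "discussion"]
print(action_error_rate(reference, hypothesis))  # 0.25 (one deletion out of four)
```

Under this assumed definition, the reported 9% would correspond to roughly one error per eleven reference actions.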
Pages: 167-170
Page count: 4