Multi-stream segmentation of meetings

Cited by: 0

Authors:

Dielmann, A [1]

Renals, S [1]

Affiliations:

[1] Univ Edinburgh, Ctr Speech Technol Res, Edinburgh EH8 9LW, Midlothian, Scotland

Source:

2004 IEEE 6TH WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING | 2004

Keywords:

DOI:

Not available

Chinese Library Classification number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

This paper investigates the automatic segmentation of meetings into a sequence of group actions or phases. Our work is based on a corpus of multiparty meetings collected in a meeting room instrumented with video cameras, lapel microphones and a microphone array. We have extracted a set of feature streams, in this case extracted from the audio data, based on speaker turns, prosody and a transcript of what was spoken. We have related these signals to the higher level semantic categories via a multistream statistical model based on dynamic Bayesian networks (DBNs). We report on a set of experiments in which different DBN architectures are compared, together with the different feature streams. The resultant system has an action error rate of 9%.


Pages: 167-170

Page count: 4

