Automatic Estimation of Presentation Skills Using Speech, Slides and Gestures

Cited by: 3

Authors:

Hanani, Abualsoud [1]

Al-Amleh, Mohammad [1]

Bazbus, Waseem [1]

Salameh, Saleem [1]

Affiliation:

[1] Birzeit Univ, Birzeit, Palestine

Source:

SPEECH AND COMPUTER, SPECOM 2017 | 2017 / Vol. 10458

Keywords:

Presentation skills; Audio features; Gesture; Slides features; Multi-Modality;

DOI:

10.1007/978-3-319-66429-3_17

Chinese Library Classification (CLC) number:

O42 [Acoustics];

Discipline classification code:

070206 ; 082403 ;

Abstract:

This paper proposes a system that automatically estimates oral presentation skills using multimodal techniques. It is based on a set of features from three sources: audio, gestures, and PowerPoint slides. Machine learning techniques are used to classify each presentation into two classes (high vs. low quality) and into three classes (low, average, and high quality). Around 448 multimodal recordings from the MLA'14 dataset were used to train and evaluate three different 2-class and 3-class classifiers. Classifiers were evaluated on each feature type independently and on all features combined. The best accuracy for the 2-class systems is 90.1%, achieved by an SVM trained on audio features; the best for the 3-class systems is 75%, achieved by a random forest trained on slide features. Combining the three feature types into one vector improves the accuracy of all systems by around 5%.
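The idea of combining the three feature types into one vector can be illustrated with a minimal sketch of feature-level (early) fusion. This is not the authors' implementation: the function names, the toy feature values, and the per-column z-score normalization are all assumptions made for illustration, and the classifier training step (SVM or random forest) is left out.

```python
from statistics import mean, pstdev


def zscore_columns(rows):
    """Standardize each column to zero mean and unit variance.

    Constant columns are mapped to 0.0 to avoid division by zero.
    The normalization step itself is an assumption, not taken from the abstract.
    """
    scaled_columns = []
    for col in zip(*rows):
        mu = mean(col)
        sigma = pstdev(col)
        scaled_columns.append(
            [(v - mu) / sigma if sigma > 0 else 0.0 for v in col]
        )
    return [list(row) for row in zip(*scaled_columns)]


def fuse_features(audio, gesture, slides):
    """Early fusion: normalize each modality, then concatenate per presentation."""
    if not (len(audio) == len(gesture) == len(slides)):
        raise ValueError("each modality needs one row per presentation")
    parts = [zscore_columns(m) for m in (audio, gesture, slides)]
    return [a + g + s for a, g, s in zip(*parts)]


# Hypothetical toy values: three presentations, one row each per modality.
audio = [[120.0, 0.30], [180.0, 0.10], [150.0, 0.20]]   # e.g. pitch, pause ratio
gesture = [[4.0], [9.0], [6.0]]                          # e.g. hand movement count
slides = [[12.0, 35.0], [8.0, 60.0], [10.0, 45.0]]       # e.g. slide count, words per slide

fused = fuse_features(audio, gesture, slides)
print(len(fused), len(fused[0]))  # 3 presentations, 5 fused features each
```

Each fused row could then be passed to any standard classifier, together with a 2-class or 3-class quality label.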


Pages: 182-191

Number of pages: 10
