Automatic Estimation of Presentation Skills Using Speech, Slides and Gestures

被引:3
|
作者
Hanani, Abualsoud [1 ]
Al-Amleh, Mohammad [1 ]
Bazbus, Waseem [1 ]
Salameh, Saleem [1 ]
机构
[1] Birzeit Univ, Birzeit, Palestine
来源
关键词
Presentation skills; Audio features; Gesture; Slides features; Multi-Modality;
D O I
10.1007/978-3-319-66429-3_17
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This paper proposes an automatic system which uses multimodal techniques for automatically estimating oral presentation skills. It is based on a set of features from three sources; audio, gesture and power-point slides. Machine learning techniques are used to classify each presentation into two classes (high vs. low) and into three classes; low, average, and high-quality presentation. Around 448 Multimodal recordings of the MLA'14 dataset were used for training and evaluating three different 2-class and 3-class classifiers. Classifiers were evaluated for each feature type independently and for all features combined together. The best accuracy of the 2-class systems is 90.1% achieved by SVM trained on audio features and 75% for 3-class systems achieved by random forest trained on slides features. Combining three feature types into one vector improves all systems accuracy by around 5%.
引用
收藏
页码:182 / 191
页数:10
相关论文
共 50 条
  • [1] Automatic synchronization of speech transcript and slides in presentation
    Chen, Y
    Heng, WJ
    [J]. PROCEEDINGS OF THE 2003 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOL II: COMMUNICATIONS-MULTIMEDIA SYSTEMS & APPLICATIONS, 2003, : 568 - 571
  • [2] Automatic Coloring of Terms in Presentation Slides Using Word Vectors
    Yagura, Tomoyuki
    Hochin, Teruhisa
    Nomiya, Hiroki
    [J]. 2019 20TH IEEE/ACIS INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, ARTIFICIAL INTELLIGENCE, NETWORKING AND PARALLEL/DISTRIBUTED COMPUTING (SNPD), 2019, : 233 - 238
  • [3] Smart Presentation System using Hand Gestures and Indonesian Speech Command
    Wardhany, Vivien Arief
    Kurnia, Muhammad Hendrick
    Sukaridhoto, Sritrusta
    Sudarsono, Amang
    Pramadihanto, Dadet
    [J]. 2015 INTERNATIONAL ELECTRONICS SYMPOSIUM (IES), 2015, : 68 - 72
  • [4] Dynamic Language Model Adaptation Using Presentation Slides for Lecture Speech Recognition
    Yamazaki, Hiroki
    Iwano, Koji
    Shinoda, Koichi
    Furui, Sadaoki
    Yokota, Haruo
    [J]. INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, 2007, : 89 - 92
  • [5] Automatic Era: Presentation slides from Academic Paper
    Bhandare, Anuja A.
    Awati, Chetan J.
    Kharade, Sonam S.
    [J]. 2016 INTERNATIONAL CONFERENCE ON AUTOMATIC CONTROL AND DYNAMIC OPTIMIZATION TECHNIQUES (ICACDOT), 2016, : 809 - 814
  • [6] PRESENTATION SLIDES USING NASA TECHNOLOGY
    BIRD, J
    [J]. DATA PROCESSING, 1986, 28 (01): : 28 - 29
  • [7] Automatic Organization and Generation of Presentation Slides for E- Learning
    Sathiyamurthy, K.
    Geetha, T. V.
    [J]. INTERNATIONAL JOURNAL OF DISTANCE EDUCATION TECHNOLOGIES, 2012, 10 (03) : 35 - 52
  • [8] A novel approach to automatic detection of presentation slides in educational videos
    Zhao, Baoquan
    Lin, Shujin
    Qi, Xin
    Wang, Ruomei
    Luo, Xiaonan
    [J]. NEURAL COMPUTING & APPLICATIONS, 2018, 29 (05): : 1369 - 1382
  • [9] Automatic Quality Assessment of Speech-Driven Synthesized Gestures
    He, Zhiyuan
    [J]. INTERNATIONAL JOURNAL OF COMPUTER GAMES TECHNOLOGY, 2022, 2022
  • [10] MAPPING GESTURES TO SPEECH USING THE KINECT
    Muttena, Sanjivi
    Sriram, S.
    Shiva, R.
    [J]. 2014 International Conference on Science Engineering and Management Research (ICSEMR), 2014,