AFExplorer: Visual analysis and interactive selection of audio features

被引:0
|
作者
Wang, Lei [1 ]
Sun, Guodao [1 ]
Wang, Yunchao [1 ]
Ma, Ji [1 ]
Zhao, Xiaomin [1 ]
Liang, Ronghua [1 ]
机构
[1] College of Computer Science, Zhejiang University of Technology, Hangzhou,310023, China
基金
中国国家自然科学基金;
关键词
Frequency domain analysis - Audio acoustics - Audio systems - Feature Selection - Quality control - Time domain analysis - Visualization;
D O I
暂无
中图分类号
学科分类号
摘要
Acoustic quality detection is vital in the manufactured products quality control field since it represents the conditions of machines or products. Recent work employed machine learning models in manufactured audio data to detect anomalous patterns. A major challenge is how to select applicable audio features to meliorate model's accuracy and precision. To relax this challenge, we extract and analyze three audio feature types including Time Domain Feature, Frequency Domain Feature, and Cepstrum Feature to help identify the potential linear and non-linear relationships. In addition, we design a visual analysis system, namely AFExplorer, to assist data scientists in extracting audio features and selecting potential feature combinations. AFExplorer integrates four main views to present detailed distribution and relevance of the audio features, which helps users observe the impact of features visually in the feature selection. We perform the case study with AFExplore according to the ToyADMOS and MIMII Dataset to demonstrate the usability and effectiveness of the proposed system. © 2022 The Author(s)
引用
收藏
页码:47 / 55
相关论文
共 50 条
  • [1] AFExplorer: Visual analysis and interactive selection of audio features
    Wang, Lei
    Sun, Guodao
    Wang, Yunchao
    Ma, Ji
    Zhao, Xiaomin
    Liang, Ronghua
    VISUAL INFORMATICS, 2022, 6 (01): : 47 - 55
  • [2] Interactive selection of visual features through reinforcement learning
    Jodogne, S
    Piater, JH
    RESEARCH AND DEVELOPMENT IN INTELLIGENT SYSTEMS XXI, 2005, : 285 - 298
  • [3] EXTRACTING AUDIO-VISUAL FEATURES FOR EMOTION RECOGNITION THROUGH ACTIVE FEATURE SELECTION
    Haider, Fasih
    Pollak, Senja
    Albert, Pierre
    Luz, Saturnino
    2019 7TH IEEE GLOBAL CONFERENCE ON SIGNAL AND INFORMATION PROCESSING (IEEE GLOBALSIP), 2019,
  • [4] Analysis of Correlation between Audio and Visual Speech Features for Clean Audio Feature Prediction in Noise
    Almajai, Ibrahim
    Milner, Ben
    Darch, Jonathan
    INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, 2006, : 2470 - 2473
  • [5] Fusing audio and visual features of speech
    Pan, H
    Liang, ZP
    Huang, TS
    2000 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING, VOL III, PROCEEDINGS, 2000, : 214 - 217
  • [6] Semantic analysis based on fusion of audio/visual features for soccer video
    Wang, Zengkai
    PROCEEDINGS OF THE 10TH INTERNATIONAL CONFERENCE OF INFORMATION AND COMMUNICATION TECHNOLOGY, 2021, 183 : 563 - 571
  • [7] Analysis of lip geometric features for audio-visual speech recognition
    Kaynak, MN
    Zhi, Q
    Cheok, AD
    Sengupta, K
    Han, Z
    Chung, KC
    IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART A-SYSTEMS AND HUMANS, 2004, 34 (04): : 564 - 570
  • [8] Audio-visual speaker identification based on the use of dynamic audio and visual features
    Fox, N
    Reilly, RB
    AUDIO-BASED AND VIDEO-BASED BIOMETRIC PERSON AUTHENTICATION, PROCEEDINGS, 2003, 2688 : 743 - 751
  • [9] Market potential for interactive audio-visual media
    Leurdijk, A
    Limonard, S
    First International Conference on Automated Production of Cross Media Content for Multi-channel Distribution, Proceedings, 2005, : 163 - 170
  • [10] XY Domain: An Interactive Audio-Visual Map
    Parallel, Charlotte
    Moore, Tony
    JUNCTURES-THE JOURNAL FOR THEMATIC DIALOGUE, 2016, (17): : 9 - 14