pyAudioAnalysis: An Open-Source Python']Python Library for Audio Signal Analysis

被引:255
|
作者
Giannakopoulos, Theodoros [1 ]
机构
[1] NCSR Demokritos, Computat Intelligence Lab, Inst Informat & Telecommun, Athens 15310, Greece
来源
PLOS ONE | 2015年 / 10卷 / 12期
基金
欧盟地平线“2020”;
关键词
D O I
10.1371/journal.pone.0144610
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Audio information plays a rather important role in the increasing digital content that is available today, resulting in a need for methodologies that automatically analyze such content: audio event recognition for home automations and surveillance systems, speech recognition, music information retrieval, multimodal analysis (e.g. audio-visual analysis of online videos for content-based recommendation), etc. This paper presents pyAudioAnalysis, an open-source Python library that provides a wide range of audio analysis procedures including: feature extraction, classification of audio signals, supervised and unsupervised segmentation and content visualization. pyAudioAnalysis is licensed under the Apache License and is available at GitHub (https://github.com/tyiannak/pyAudioAnalysis/). Here we present the theoretical background behind the wide range of the implemented methodologies, along with evaluation metrics for some of the methods. pyAudioAnalysis has been already used in several audio analysis research applications: smart-home functionalities through audio event detection, speech emotion recognition, depression classification based on audiovisual features, music segmentation, multimodal content-based movie recommendation and health applications (e.g. monitoring eating habits). The feedback provided from all these particular audio applications has led to practical enhancement of the library.
引用
收藏
页数:17
相关论文
共 50 条
  • [41] QuTiP: An open-source Python']Python framework for the dynamics of open quantum systems
    Johansson, J. R.
    Nation, P. D.
    Nori, Franco
    [J]. COMPUTER PHYSICS COMMUNICATIONS, 2012, 183 (08) : 1760 - 1772
  • [42] Sleep: An Open-Source Python']Python Software for Visualization, Analysis, and Staging of Sleep Data
    Combrisson, Etienne
    Vallat, Raphael
    Eichenlaub, Jean-Baptiste
    O'Reilly, Christian
    Lajnef, Tarek
    Guillot, Aymeric
    Ruby, Perrine M.
    Jerbi, Karim
    [J]. FRONTIERS IN NEUROINFORMATICS, 2017, 11
  • [43] RamanSPy: An Open-Source Python']Python Package for Integrative Raman Spectroscopy Data Analysis
    Georgiev, Dimitar
    Pedersen, Simon Vilms
    Xie, Ruoxiao
    Fernandez-Galiana, Alvaro
    Stevens, Molly M.
    Barahona, Mauricio
    [J]. ANALYTICAL CHEMISTRY, 2024, 96 (21) : 8492 - 8500
  • [44] PyOMA2: An Open-Source Python']Python Software for Operational Modal Analysis
    Pasca, Dag Pasquale
    Margoni, Diego Federico
    Rosso, Marco Martino
    Aloisio, Angelo
    [J]. PROCEEDINGS OF THE 10TH INTERNATIONAL OPERATIONAL MODAL ANALYSIS CONFERENCE, VOL 1, IOMAC 2024, 2024, 514 : 423 - 434
  • [45] Kudi: A free open-source python library for the analysis of properties along reaction paths
    Stefan Vogt-Geisse
    [J]. Journal of Molecular Modeling, 2016, 22
  • [46] PyLandStats: An open-source Python']Pythonic library to compute landscape metrics
    Bosch, Marti
    [J]. PLOS ONE, 2019, 14 (12):
  • [47] DiffraPy: An open-source Python']Python software for seismic diffraction imaging
    Zakarewicz, Guilherme
    Maciel, Susanne
    Biloti, Ricardo
    [J]. COMPUTERS & GEOSCIENCES, 2024, 186
  • [48] SigMT: An open-source Python']Python package for magnetotelluric data processing
    Ajithabh, K. S.
    Patro, Prasanta K.
    [J]. COMPUTERS & GEOSCIENCES, 2023, 171
  • [49] An Open-Source Automated Peptide Synthesizer Based on Arduino and Python']Python
    Gali, Hariprasad
    [J]. SLAS TECHNOLOGY, 2017, 22 (05): : 493 - 499
  • [50] Decisi-o-rama: An open-source Python']Python library for multi-attribute value/ utility decision analysis
    Chacon-Hurtado, J. C.
    Scholten, L.
    [J]. ENVIRONMENTAL MODELLING & SOFTWARE, 2021, 135