Fusion of acoustic and deep features for pig cough sound recognition

被引:22
|
作者
Shen, Weizheng [1 ]
Ji, Nan [1 ]
Yin, Yanling [1 ]
Dai, Baisheng [1 ]
Tu, Ding [2 ]
Sun, Baihui [1 ,3 ]
Hou, Handan [4 ]
Kou, Shengli [5 ]
Zhao, Yize [6 ]
机构
[1] Northeast Agr Univ, Sch Elect Engn & Informat, Harbin 150030, Peoples R China
[2] Guangxi Univ Sci & Technol, Tus Coll Digit, Liuzhou 545000, Peoples R China
[3] Heilongjiang Acad Agr Machinery Sci, Mudanjiang Branch, Mudanjiang 157000, Peoples R China
[4] Harbin Finance Univ, Sch Comp Sci, Harbin 150030, Peoples R China
[5] Northeast Agr Univ, Sch Elect Engn & Informat, Harbin, Peoples R China
[6] Univ Calif Irvine, Donald Bren Sch Informat & Comp Sci, Dept Comp Sci, Irvine, CA USA
基金
中国国家自然科学基金;
关键词
Pig cough; Feature fusion; Time-frequency representations; Convolutional neural networks; CLASSIFICATION; ENHANCEMENT;
D O I
10.1016/j.compag.2022.106994
中图分类号
S [农业科学];
学科分类号
09 ;
摘要
The recognition of pig cough sound is a prerequisite for early warning of respiratory diseases in pig houses, which is essential for detecting animal welfare and predicting productivity. With respect to pig cough recognition, it is a highly crucial step to create representative pig sound characteristics. To this end, this paper proposed a feature fusion method by combining acoustic and deep features from audio segments. First, a set of acoustic features from different domains were extracted from sound signals, and recursive feature elimination based on random forest (RF-RFE) was adopted to conduct feature selection. Second, time-frequency representations (TFRs) involving constant-Q transform (CQT) and short-time Fourier transform (STFT) were employed to extract visual features from a fine-tuned convolutional neural network (CNN) model. Finally, the ensemble of the two kinds of features was fed into support vector machine (SVM) by early fusion to identify pig cough sounds. This work investigated the performance of the proposed acoustic and deep features fusion, which achieved 97.35% accuracy for pig cough recognition. The results provide further evidence for the effectiveness of combining acoustic and deep spectrum features as a robust feature representation for pig cough recognition.
引用
收藏
页数:7
相关论文
共 50 条
  • [1] An investigation of fusion strategies for boosting pig cough sound recognition
    Yin, Yanling
    Ji, Nan
    Wang, Xipeng
    Shen, Weizheng
    Dai, Baisheng
    Kou, Shengli
    Liang, Chen
    [J]. COMPUTERS AND ELECTRONICS IN AGRICULTURE, 2023, 205
  • [2] Recognition of Pig Cough Sound Based on Deep Belief Nets
    [J]. 2018, Chinese Society of Agricultural Machinery (49):
  • [3] DNN-HMM based acoustic model for continuous pig cough sound recognition
    Zhao, Jian
    Li, Xuan
    Liu, Wanghong
    Gao, Yun
    Lei, Minggang
    Tan, Hequn
    Yang, Di
    [J]. INTERNATIONAL JOURNAL OF AGRICULTURAL AND BIOLOGICAL ENGINEERING, 2020, 13 (03) : 186 - 193
  • [4] Heterogeneous fusion of biometric and deep physiological features for accurate porcine cough recognition
    Wang, Buyu
    Qi, Jingwei
    An, Xiaoping
    Wang, Yuan
    [J]. PLOS ONE, 2024, 19 (02):
  • [5] Investigation of acoustic and visual features for pig cough classification
    Ji, Nan
    Shen, Weizheng
    Yin, Yanling
    Bao, Jun
    Dai, Baisheng
    Hou, Handan
    Kou, Shengli
    Zhao, Yize
    [J]. BIOSYSTEMS ENGINEERING, 2022, 219 : 281 - 293
  • [6] Deep fusion framework for speech command recognition using acoustic and linguistic features
    Sunakshi Mehra
    Seba Susan
    [J]. Multimedia Tools and Applications, 2023, 82 : 38667 - 38691
  • [7] Deep fusion framework for speech command recognition using acoustic and linguistic features
    Mehra, Sunakshi
    Susan, Seba
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2023, 82 (25) : 38667 - 38691
  • [8] Recognition Method of Pig Cough Based on Deep Neural Network
    Shen, Mingxia
    Wang, Mengyu
    Liu, Longshen
    Chen, Jia
    Tai, Meng
    Zhang, Wei
    [J]. Nongye Jixie Xuebao/Transactions of the Chinese Society for Agricultural Machinery, 2022, 53 (05): : 257 - 266
  • [9] Fusion of acoustic and tokenization features for speaker recognition
    Tong, Rong
    Ma, Bin
    Lee, Kong-Aik
    You, Changhuai
    Zhu, Donglai
    Kinnunen, Tomi
    Sun, Hanwu
    Dong, Minghui
    Chng, Eng-Siong
    Li, Haizhou
    [J]. CHINESE SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, 2006, 4274 : 566 - +
  • [10] Research on Improved DenseNets Pig Cough Sound Recognition Model Based on SENets
    Song, Hang
    Zhao, Bin
    Hu, Jun
    Sun, Haonan
    Zhou, Zheng
    [J]. ELECTRONICS, 2022, 11 (21)