Research on Algorithm of Segment and Classification of audio in Broadcast

被引:0
|
作者
Sun, Yuhang [1 ]
Zhou, Jian [1 ]
机构
[1] Commun Univ China, Coll Informat Engn, Beijing, Peoples R China
关键词
component; segment; classification; features; maximum likelihood criterion; low computational complexity; SPEECH/MUSIC;
D O I
10.1109/ISCID.2017.11
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
There are two common classes of audio, speech and music, in broadcast. An algorithm of segment and classification of audio designed in this paper will be applied to audio equalizer in the digital audio broadcasting transmitter. The algorithm described is based on zero crossing rate and energy that are able to separated non-silence segments from audio, and on Modified low energy ratio (MLER) and maximum likelihood criterion, which provides the ability to distinguish the two classes, speech and music. Modified low energy ratio is obtained on the basis of energy resulting in low computational complexity, leading to runs easily in real time. Experiment results to data show performance with the accuracy rate of over 96%.
引用
收藏
页码:316 / 319
页数:4
相关论文
共 50 条
  • [1] Classification of audio events in broadcast news
    Liu, Z
    Huang, Q
    [J]. 1998 IEEE SECOND WORKSHOP ON MULTIMEDIA SIGNAL PROCESSING, 1998, : 364 - 369
  • [2] Audio segmentation, classification and clustering in a broadcast news task
    Meinedo, H
    Neto, J
    [J]. 2003 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, VOL II, PROCEEDINGS: SPEECH II; INDUSTRY TECHNOLOGY TRACKS; DESIGN & IMPLEMENTATION OF SIGNAL PROCESSING SYSTEMS; NEURAL NETWORKS FOR SIGNAL PROCESSING, 2003, : 5 - 8
  • [3] Broadcast News Audio Classification using SVM Binary Trees
    Vavrek, Jozef
    Vozarikova, Eva
    Pleva, Matus
    Juhar, Jozef
    [J]. 2012 35TH INTERNATIONAL CONFERENCE ON TELECOMMUNICATIONS AND SIGNAL PROCESSING (TSP), 2012, : 469 - 473
  • [4] GMM with modified weight applied into audio segment classification
    Zhang Lei
    Zhao Yi
    Xiang Xuezhi
    [J]. MATERIALS, MECHATRONICS AND AUTOMATION, PTS 1-3, 2011, 467-469 : 692 - 697
  • [5] Audio Coding algorithm for one-segment broadcasting
    Suzuki, Masanao
    Ota, Yasuji
    Itoh, Takashi
    [J]. FUJITSU SCIENTIFIC & TECHNICAL JOURNAL, 2008, 44 (03): : 367 - 373
  • [6] CLASSIFICATION OF BROADCAST NEWS AUDIO DATA EMPLOYING BINARY DECISION ARCHITECTURE
    Vavrek, Jozef
    Fecilak, Peter
    Juhar, Jozef
    Cizmar, Anton
    [J]. COMPUTING AND INFORMATICS, 2017, 36 (04) : 857 - 886
  • [7] Advances in unsupervised audio classification and segmentation for the broadcast news and NGSW corpora
    Huang, RQ
    Hansen, JHL
    [J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2006, 14 (03): : 907 - 919
  • [8] Research on Digital Audio Watermark Technology for Broadcast Information Security
    Wang, Xiuli
    Wang, Xuan
    Liu, Shouxun
    [J]. PROCEEDINGS OF 2012 INTERNATIONAL CONFERENCE ON IMAGE ANALYSIS AND SIGNAL PROCESSING, 2012, : 230 - 235
  • [9] Algorithm for real-time comparison of audio streams for broadcast supervision
    Lorkiewicz, Mateusz
    Stankowski, Jakub
    Klimaszewski, Krzysztof
    [J]. 2018 25TH INTERNATIONAL CONFERENCE ON SYSTEMS, SIGNALS AND IMAGE PROCESSING (IWSSIP), 2018,
  • [10] SOUND OF BROADCAST AUDIO
    SMALL, E
    [J]. DB-SOUND ENGINEERING MAGAZINE, 1978, 12 (09): : 26 - 27