Audio-Driven Multimedia Content Authentication as a Service

被引:0
|
作者
Vryzas, Nikolaos [1 ]
Katsaounidou, Anastasia [1 ]
Kotsakis, Rigas [1 ]
Dimoulas, Charalampos [1 ]
Kalliris, George [1 ]
机构
[1] Aristotle Univ Thessaloniki, Dept Journalism & Mass Media, Lab Elect Media, Thessaloniki, Greece
来源
关键词
DIGITAL AUDIO; IDENTIFICATION;
D O I
暂无
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
In the current paper, we present a framework for providing supervisory tools for multimedia Content Authentication As A Service (CAAAS). A double compression method for discontinuity detection in audio signals is implemented and integrated in the provided web service. The user can upload audio/video content or provide links and thereafter, a feature vector is extracted from the audio modality of the selected content for the investigation of discontinuities of the signal via the proposed algorithms. Several visualizations are returned to the user, indicating possible points of forgery in the audio/visual file. Moreover, an audio tampering detection methodology by unsupervised clustering of short-window non-vocal segments, in order to identify differentiations of the acoustic environment of speech signals is presented and evaluated.
引用
收藏
页数:8
相关论文
共 50 条
  • [1] An audio-driven dancing avatar
    Ofli, Ferda
    Demir, Yasemin
    Yemez, Yucel
    Erzin, Engin
    Tekalp, A. Murat
    Balci, Koray
    Kizoglu, Idil
    Akarun, Lale
    Canton-Ferrer, Cristian
    Tilmanne, Joelle
    Bozkurt, Elif
    Erdem, A. Tanju
    [J]. JOURNAL ON MULTIMODAL USER INTERFACES, 2008, 2 (02) : 93 - 103
  • [2] An audio-driven dancing avatar
    Ferda Ofli
    Yasemin Demir
    Yücel Yemez
    Engin Erzin
    A. Murat Tekalp
    Koray Balcı
    İdil Kızoğlu
    Lale Akarun
    Cristian Canton-Ferrer
    Joëlle Tilmanne
    Elif Bozkurt
    A. Tanju Erdem
    [J]. Journal on Multimodal User Interfaces, 2008, 2 : 93 - 103
  • [3] Photorealistic Audio-driven Video Portraits
    Wen, Xin
    Wang, Miao
    Richardt, Christian
    Chen, Ze-Yin
    Hu, Shi-Min
    [J]. IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2020, 26 (12) : 3457 - 3466
  • [4] Audio-Driven Laughter Behavior Controller
    Ding, Yu
    Huang, Jing
    Pelachaud, Catherine
    [J]. IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2017, 8 (04) : 546 - 558
  • [5] Audio-Driven Emotional Video Portraits
    Ji, Xinya
    Zhou, Hang
    Wang, Kaisiyuan
    Wu, Wayne
    Loy, Chen Change
    Cao, Xun
    Xu, Feng
    [J]. 2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 14075 - 14084
  • [6] Efficient audio-driven multimedia indexing through similarity-based speech / music discrimination
    Nikolaos Tsipas
    Lazaros Vrysis
    Charalampos Dimoulas
    George Papanikolaou
    [J]. Multimedia Tools and Applications, 2017, 76 : 25603 - 25621
  • [7] Efficient audio-driven multimedia indexing through similarity-based speech/music discrimination
    Tsipas, Nikolaos
    Vrysis, Lazaros
    Dimoulas, Charalampos
    Papanikolaou, George
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2017, 76 (24) : 25603 - 25621
  • [8] Audio-Driven Talking Face Generation: A Review
    Liu, Shiguang
    [J]. JOURNAL OF THE AUDIO ENGINEERING SOCIETY, 2023, 71 (7-8): : 408 - 419
  • [9] Audio-Driven Talking Video Frame Restoration
    Cheng, Harry
    Guo, Yangyang
    Yin, Jianhua
    Chen, Haonan
    Wang, Jiafang
    Nie, Liqiang
    [J]. IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 4110 - 4122
  • [10] Audio-Driven Deformation Flow for Effective Lip Reading
    Feng, Dalu
    Yang, Shuang
    Shan, Shiguang
    Chen, Xilin
    [J]. 2022 26TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2022, : 274 - 280