Audio-Driven Multimedia Content Authentication as a Service

被引：0

作者：

Vryzas, Nikolaos ^{[1
]}

Katsaounidou, Anastasia ^{[1
]}

Kotsakis, Rigas ^{[1
]}

Dimoulas, Charalampos ^{[1
]}

Kalliris, George ^{[1
]}

机构：

[1] Aristotle Univ Thessaloniki, Dept Journalism & Mass Media, Lab Elect Media, Thessaloniki, Greece

来源：

146TH AES CONVENTION | 2019年

关键词：

DIGITAL AUDIO; IDENTIFICATION;

D O I：

暂无

中图分类号：

O42 [声学];

学科分类号：

070206 ; 082403 ;

摘要：

In the current paper, we present a framework for providing supervisory tools for multimedia Content Authentication As A Service (CAAAS). A double compression method for discontinuity detection in audio signals is implemented and integrated in the provided web service. The user can upload audio/video content or provide links and thereafter, a feature vector is extracted from the audio modality of the selected content for the investigation of discontinuities of the signal via the proposed algorithms. Several visualizations are returned to the user, indicating possible points of forgery in the audio/visual file. Moreover, an audio tampering detection methodology by unsupervised clustering of short-window non-vocal segments, in order to identify differentiations of the acoustic environment of speech signals is presented and evaluated.

引用

页数：8

共 50 条

[1] An audio-driven dancing avatar
Ofli, Ferda
Demir, Yasemin
Yemez, Yucel
Erzin, Engin
Tekalp, A. Murat
Balci, Koray
Kizoglu, Idil
Akarun, Lale
Canton-Ferrer, Cristian
Tilmanne, Joelle
Bozkurt, Elif
Erdem, A. Tanju
JOURNAL ON MULTIMODAL USER INTERFACES, 2008, 2 (02) : 93 - 103
[2] An audio-driven dancing avatar
Ferda Ofli
Yasemin Demir
Yücel Yemez
Engin Erzin
A. Murat Tekalp
Koray Balcı
İdil Kızoğlu
Lale Akarun
Cristian Canton-Ferrer
Joëlle Tilmanne
Elif Bozkurt
A. Tanju Erdem
Journal on Multimodal User Interfaces, 2008, 2 : 93 - 103
[3] Photorealistic Audio-driven Video Portraits
Wen, Xin
Wang, Miao
Richardt, Christian
Chen, Ze-Yin
Hu, Shi-Min
IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2020, 26 (12) : 3457 - 3466
[4] Audio-Driven Laughter Behavior Controller
Ding, Yu
Huang, Jing
Pelachaud, Catherine
IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2017, 8 (04) : 546 - 558
[5] Audio-Driven Emotional Video Portraits
Ji, Xinya
Zhou, Hang
Wang, Kaisiyuan
Wu, Wayne
Loy, Chen Change
Cao, Xun
Xu, Feng
2021 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION, CVPR 2021, 2021, : 14075 - 14084
[6] Efficient audio-driven multimedia indexing through similarity-based speech / music discrimination
Nikolaos Tsipas
Lazaros Vrysis
Charalampos Dimoulas
George Papanikolaou
Multimedia Tools and Applications, 2017, 76 : 25603 - 25621
[7] Efficient audio-driven multimedia indexing through similarity-based speech/music discrimination
Tsipas, Nikolaos
Vrysis, Lazaros
Dimoulas, Charalampos
Papanikolaou, George
MULTIMEDIA TOOLS AND APPLICATIONS, 2017, 76 (24) : 25603 - 25621
[8] Audio-Driven Talking Face Generation: A Review
Liu, Shiguang
JOURNAL OF THE AUDIO ENGINEERING SOCIETY, 2023, 71 (7-8): : 408 - 419
[9] Audio-Driven Talking Video Frame Restoration
Cheng, Harry
Guo, Yangyang
Yin, Jianhua
Chen, Haonan
Wang, Jiafang
Nie, Liqiang
IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 26 : 4110 - 4122
[10] Audio-Driven Facial Animation with Deep Learning: A Survey
Jiang, Diqiong
Chang, Jian
You, Lihua
Bian, Shaojun
Kosk, Robert
Maguire, Greg
INFORMATION, 2024, 15 (11)

← 1 2 3 4 5 →