CNN-Based Multi-Modal Camera Model Identification on Video Sequences

被引：8

作者：

Dal Cortivo, Davide ^{[1
]}

Mandelli, Sara ^{[1
]}

Bestagini, Paolo ^{[1
]}

Tubaro, Stefano ^{[1
]}

机构：

[1] Politecn Milan, Dipartimento Elettron Informaz & Bioingn, I-20133 Milan, Italy

来源：

JOURNAL OF IMAGING | 2021年 / 7卷 / 08期

关键词：

camera model identification; video forensics; audio forensics; convolutional neural networks;

D O I：

10.3390/jimaging7080135

中图分类号：

TB8 [摄影技术];

学科分类号：

0804 ;

摘要：

Identifying the source camera of images and videos has gained significant importance in multimedia forensics. It allows tracing back data to their creator, thus enabling to solve copyright infringement cases and expose the authors of hideous crimes. In this paper, we focus on the problem of camera model identification for video sequences, that is, given a video under analysis, detecting the camera model used for its acquisition. To this purpose, we develop two different CNN-based camera model identification methods, working in a novel multi-modal scenario. Differently from mono-modal methods, which use only the visual or audio information from the investigated video to tackle the identification task, the proposed multi-modal methods jointly exploit audio and visual information. We test our proposed methodologies on the well-known Vision dataset, which collects almost 2000 video sequences belonging to different devices. Experiments are performed, considering native videos directly acquired by their acquisition devices and videos uploaded on social media platforms, such as YouTube and WhatsApp. The achieved results show that the proposed multi-modal approaches significantly outperform their mono-modal counterparts, representing a valuable strategy for the tackled problem and opening future research to even more challenging scenarios.

引用

页数：20

共 50 条

[41] A Novel Deep Multi-Modal Feature Fusion Method for Celebrity Video Identification
Chen, Jianrong
Yang, Li
Xu, Yuanyuan
Huo, Jing
Shi, Yinghuan
Gao, Yang
PROCEEDINGS OF THE 27TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA (MM'19), 2019, : 2535 - 2538
[42] Multi-modal face tracking in multi-camera environments
Kang, HB
Cho, SH
COMPUTER ANALYSIS OF IMAGES AND PATTERNS, PROCEEDINGS, 2005, 3691 : 814 - 821
[43] CNN Based Yeast Cell Segmentation in Multi-Modal Fluorescent Microscopy Data
Aydin, Ali Selman
Dubey, Abhinandan
Dovrat, Daniel
Aharoni, Amir
Shilkrot, Roy
2017 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW), 2017, : 753 - 759
[44] Multi-modal humor segment prediction in video
Yang, Zekun
Nakashima, Yuta
Takemura, Haruo
MULTIMEDIA SYSTEMS, 2023, 29 (04) : 2389 - 2398
[45] A Multi-modal System for Video Semantic Understanding
Lv, Zhengwei
Lei, Tao
Liang, Xiao
Shi, Zhizhong
Liu, Duoxing
CCKS 2021 - EVALUATION TRACK, 2022, 1553 : 34 - 43
[46] Hierarchically multi-modal indexing of soccer video
Liu, Yuchi
Wu, Lingda
Lei, Zhen
Xie, Yuxiang
12TH INTERNATIONAL MULTI-MEDIA MODELLING CONFERENCE PROCEEDINGS, 2006, : 393 - 396
[47] Lightweight multi-modal emotion recognition model based on modal generation
Liu, Peisong
Che, Manqiang
Luo, Jiangchuan
2022 9TH INTERNATIONAL FORUM ON ELECTRICAL ENGINEERING AND AUTOMATION, IFEEA, 2022, : 430 - 435
[48] Multi-modal Dependency Tree for Video Captioning
Zhao, Wentian
Wu, Xinxiao
Luo, Jiebo
ADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 34 (NEURIPS 2021), 2021, 34
[49] Multi-modal Laughter Recognition in Video Conversations
Escalera, Sergio
Puertas, Eloi
Radeva, Petia
Pujol, Oriol
2009 IEEE COMPUTER SOCIETY CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPR WORKSHOPS 2009), VOLS 1 AND 2, 2009, : 869 - 874
[50] Multi-modal human identification system
Ivanov, Y
WACV 2005: SEVENTH IEEE WORKSHOP ON APPLICATIONS OF COMPUTER VISION, PROCEEDINGS, 2005, : 164 - 170

← 1 2 3 4 5 →