CNN-Based Multi-Modal Camera Model Identification on Video Sequences

Cited by: 8
Authors
Dal Cortivo, Davide [1]
Mandelli, Sara [1]
Bestagini, Paolo [1]
Tubaro, Stefano [1]
Affiliations
[1] Politecn Milan, Dipartimento Elettron Informaz & Bioingn, I-20133 Milan, Italy
Keywords
camera model identification; video forensics; audio forensics; convolutional neural networks
DOI
10.3390/jimaging7080135
Chinese Library Classification (CLC) number
TB8 [Photographic technology]
Discipline classification code
0804
Abstract
Identifying the source camera of images and videos has gained significant importance in multimedia forensics. It allows data to be traced back to their creator, which helps resolve copyright infringement cases and expose the perpetrators of heinous crimes. In this paper, we focus on camera model identification for video sequences: given a video under analysis, the goal is to detect the camera model used to acquire it. To this end, we develop two CNN-based camera model identification methods that work in a novel multi-modal scenario. Unlike mono-modal methods, which use only the visual or only the audio information from the investigated video, the proposed multi-modal methods jointly exploit audio and visual information. We test the proposed methods on the well-known Vision dataset, which collects almost 2000 video sequences from different devices. Experiments consider both native videos, taken directly from their acquisition devices, and videos uploaded to social media platforms such as YouTube and WhatsApp. The results show that the proposed multi-modal approaches significantly outperform their mono-modal counterparts, making them a valuable strategy for the problem and opening future research to even more challenging scenarios.
Pages: 20