CNN-Based Multi-Modal Camera Model Identification on Video Sequences

被引:8
|
作者
Dal Cortivo, Davide [1 ]
Mandelli, Sara [1 ]
Bestagini, Paolo [1 ]
Tubaro, Stefano [1 ]
机构
[1] Politecn Milan, Dipartimento Elettron Informaz & Bioingn, I-20133 Milan, Italy
关键词
camera model identification; video forensics; audio forensics; convolutional neural networks;
D O I
10.3390/jimaging7080135
中图分类号
TB8 [摄影技术];
学科分类号
0804 ;
摘要
Identifying the source camera of images and videos has gained significant importance in multimedia forensics. It allows tracing back data to their creator, thus enabling to solve copyright infringement cases and expose the authors of hideous crimes. In this paper, we focus on the problem of camera model identification for video sequences, that is, given a video under analysis, detecting the camera model used for its acquisition. To this purpose, we develop two different CNN-based camera model identification methods, working in a novel multi-modal scenario. Differently from mono-modal methods, which use only the visual or audio information from the investigated video to tackle the identification task, the proposed multi-modal methods jointly exploit audio and visual information. We test our proposed methodologies on the well-known Vision dataset, which collects almost 2000 video sequences belonging to different devices. Experiments are performed, considering native videos directly acquired by their acquisition devices and videos uploaded on social media platforms, such as YouTube and WhatsApp. The achieved results show that the proposed multi-modal approaches significantly outperform their mono-modal counterparts, representing a valuable strategy for the tackled problem and opening future research to even more challenging scenarios.
引用
收藏
页数:20
相关论文
共 50 条
  • [31] Automated Multi-Modal Video Editing for Ads Video
    Lin, Qin
    Pang, Nuo
    Hong, Zhiying
    PROCEEDINGS OF THE 29TH ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA, MM 2021, 2021, : 4823 - 4827
  • [32] Cross-modal learning with multi-modal model for video action recognition based on adaptive weight training
    Zhou, Qingguo
    Hou, Yufeng
    Zhou, Rui
    Li, Yan
    Wang, Jinqiang
    Wu, Zhen
    Li, Hung-Wei
    Weng, Tien-Hsiung
    CONNECTION SCIENCE, 2024, 36 (01)
  • [33] Multi-Modal CNN Features Fusion for Emotion Recognition: A Modified Xception Model
    Shahzad, H. M.
    Bhatti, Sohail Masood
    Jaffar, Arfan
    Rashid, Muhammad
    Akram, Sheeraz
    IEEE ACCESS, 2023, 11 : 94281 - 94289
  • [34] CNN-based algorithm for drusen identification
    Checco, Paolo
    Corinto, Fernando
    2006 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS, VOLS 1-11, PROCEEDINGS, 2006, : 2181 - +
  • [35] Multi-Modal Multi-Action Video Recognition
    Shi, Zhensheng
    Liang, Ju
    Li, Qianqian
    Zheng, Haiyong
    Gu, Zhaorui
    Dong, Junyu
    Zheng, Bing
    2021 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2021), 2021, : 13658 - 13667
  • [36] Multi-modal multi-view video coding based on correlation analysis
    Jiang, Gang-Yi
    Zhang, Yun
    Yu, Mei
    Jisuanji Xuebao/Chinese Journal of Computers, 2007, 30 (12): : 2205 - 2211
  • [37] Multi-modal information augmented model for micro-video recommendation
    Huo Y.
    Jin B.
    Liao Z.
    Zhejiang Daxue Xuebao (Gongxue Ban)/Journal of Zhejiang University (Engineering Science), 2024, 58 (06): : 1142 - 1152
  • [38] CNN-based fish iris identification
    Schraml, Rudolf
    Wimmer, Georg
    Hofbauer, Heinz
    Jalilian, Ehsaneddin
    Bekkozhayeva, Dinara
    Cisar, Petr
    Uhl, Andreas
    2022 30TH EUROPEAN SIGNAL PROCESSING CONFERENCE (EUSIPCO 2022), 2022, : 628 - 632
  • [39] A CNN-based vortex identification method
    Liang Deng
    Yueqing Wang
    Yang Liu
    Fang Wang
    Sikun Li
    Jie Liu
    Journal of Visualization, 2019, 22 : 65 - 78
  • [40] A CNN-based vortex identification method
    Deng, Liang
    Wang, Yueqing
    Liu, Yang
    Wang, Fang
    Li, Sikun
    Liu, Jie
    JOURNAL OF VISUALIZATION, 2019, 22 (01) : 65 - 78