Deep Learning-Based Context-Aware Video Content Analysis on IoT Devices

被引:5
|
作者
Gad, Gad [1 ]
Gad, Eyad [2 ]
Cengiz, Korhan [3 ,4 ]
Fadlullah, Zubair [1 ,5 ]
Mokhtar, Bassem [3 ,6 ]
机构
[1] Lakehead Univ, Dept Comp Sci, Thunder Bay, ON P7B 5E1, Canada
[2] Nile Univ, Sch Engn & Appl Sci, Giza 12677, Egypt
[3] Univ Fujairah, Coll Informat Technol, Fujairah 1207, U Arab Emirates
[4] Trakya Univ, Dept Elect Elect Engn, TR-22030 Edirne, Turkey
[5] Thunder Bay Reg Hlth Res Inst TBRHRI, Thunder Bay, ON P7B 7A5, Canada
[6] Alexandria Univ, Fac Engn, Dept Elect Engn, Alexandria 21544, Egypt
关键词
video content analysis; LSTM; transformer-based model; video captioning; Internet of Things (IoT);
D O I
10.3390/electronics11111785
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Integrating machine learning with the Internet of Things (IoT) enables many useful applications. For IoT applications that incorporate video content analysis (VCA), deep learning models are usually used due to their capacity to encode the high-dimensional spatial and temporal representations of videos. However, limited energy and computation resources present a major challenge. Video captioning is one type of VCA that describes a video with a sentence or a set of sentences. This work proposes an IoT-based deep learning-based framework for video captioning that can (1) Mine large open-domain video-to-text datasets to extract video-caption pairs that belong to a particular domain. (2) Preprocess the selected video-caption pairs including reducing the complexity of the captions' language model to improve performance. (3) Propose two deep learning models: A transformer-based model and an LSTM-based model. Hyperparameter tuning is performed to select the best hyperparameters. Models are evaluated in terms of accuracy and inference time on different platforms. The presented framework generates captions in standard sentence templates to facilitate extracting information in later stages of the analysis. The two developed deep learning models offer a trade-off between accuracy and speed. While the transformer-based model yields a high accuracy of 97%, the LSTM-based model achieves near real-time inference.
引用
收藏
页数:15
相关论文
共 50 条
  • [21] Research on context-aware group recommendation based on deep learning
    Xu, Haibo
    Jiang, Chengshun
    [J]. NEURAL COMPUTING & APPLICATIONS, 2020, 32 (06): : 1745 - 1754
  • [22] DeepCARSKit: A deep learning based context-aware recommendation library
    Zheng, Yong
    [J]. SOFTWARE IMPACTS, 2022, 13
  • [23] Lightweight Architecture for IoT Devices with Context-aware Autonomous Control
    Serra, Bruno
    Gomes, Luis
    Vale, Zita
    [J]. 2019 IEEE WIRELESS COMMUNICATIONS AND NETWORKING CONFERENCE WORKSHOP (WCNCW), 2019,
  • [24] The effectiveness of a meaningful learning-based evaluation model for context-aware mobile learning
    Huang, Yueh-Min
    Chiu, Po-Sheng
    [J]. BRITISH JOURNAL OF EDUCATIONAL TECHNOLOGY, 2015, 46 (02) : 437 - 447
  • [25] A Generic IoT Architecture for Ubiquitous Context-Aware Learning
    Shapsough, Salsabeel Y.
    Zualkernan, Imran A.
    [J]. IEEE TRANSACTIONS ON LEARNING TECHNOLOGIES, 2020, 13 (03): : 449 - 464
  • [26] DLSDHMS: Design of a deep learning-based analysis model for secure and distributed hospital management using context-aware sidechains
    Reddy, Vonteru Srikanth
    Debasis, Kumar
    [J]. HELIYON, 2023, 9 (11)
  • [27] Deep learning-based human motion recognition for predictive context-aware human-robot collaboration
    Wang, Peng
    Liu, Hongyi
    Wang, Lihui
    Gao, Robert X.
    [J]. CIRP ANNALS-MANUFACTURING TECHNOLOGY, 2018, 67 (01) : 17 - 20
  • [28] A context-aware encryption protocol suite for edge computing-based IoT devices
    Dar, Zaineb
    Ahmad, Adnan
    Khan, Farrukh Aslam
    Zeshan, Furkh
    Iqbal, Razi
    Sherazi, Hafiz Husnain Raza
    Bashir, Ali Kashif
    [J]. JOURNAL OF SUPERCOMPUTING, 2020, 76 (04): : 2548 - 2567
  • [29] A context-aware encryption protocol suite for edge computing-based IoT devices
    Zaineb Dar
    Adnan Ahmad
    Farrukh Aslam Khan
    Furkh Zeshan
    Razi Iqbal
    Hafiz Husnain Raza Sherazi
    Ali Kashif Bashir
    [J]. The Journal of Supercomputing, 2020, 76 : 2548 - 2567
  • [30] An IoT Analytics Embodied Agent Model based on Context-Aware Machine Learning
    Nascimento, Nathalia
    Alencar, Paulo
    Lucena, Carlos
    Cowan, Donald
    [J]. 2018 IEEE INTERNATIONAL CONFERENCE ON BIG DATA (BIG DATA), 2018, : 5170 - 5175