Deep Learning-Based Context-Aware Video Content Analysis on IoT Devices

被引:5
|
作者
Gad, Gad [1 ]
Gad, Eyad [2 ]
Cengiz, Korhan [3 ,4 ]
Fadlullah, Zubair [1 ,5 ]
Mokhtar, Bassem [3 ,6 ]
机构
[1] Lakehead Univ, Dept Comp Sci, Thunder Bay, ON P7B 5E1, Canada
[2] Nile Univ, Sch Engn & Appl Sci, Giza 12677, Egypt
[3] Univ Fujairah, Coll Informat Technol, Fujairah 1207, U Arab Emirates
[4] Trakya Univ, Dept Elect Elect Engn, TR-22030 Edirne, Turkey
[5] Thunder Bay Reg Hlth Res Inst TBRHRI, Thunder Bay, ON P7B 7A5, Canada
[6] Alexandria Univ, Fac Engn, Dept Elect Engn, Alexandria 21544, Egypt
关键词
video content analysis; LSTM; transformer-based model; video captioning; Internet of Things (IoT);
D O I
10.3390/electronics11111785
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Integrating machine learning with the Internet of Things (IoT) enables many useful applications. For IoT applications that incorporate video content analysis (VCA), deep learning models are usually used due to their capacity to encode the high-dimensional spatial and temporal representations of videos. However, limited energy and computation resources present a major challenge. Video captioning is one type of VCA that describes a video with a sentence or a set of sentences. This work proposes an IoT-based deep learning-based framework for video captioning that can (1) Mine large open-domain video-to-text datasets to extract video-caption pairs that belong to a particular domain. (2) Preprocess the selected video-caption pairs including reducing the complexity of the captions' language model to improve performance. (3) Propose two deep learning models: A transformer-based model and an LSTM-based model. Hyperparameter tuning is performed to select the best hyperparameters. Models are evaluated in terms of accuracy and inference time on different platforms. The presented framework generates captions in standard sentence templates to facilitate extracting information in later stages of the analysis. The two developed deep learning models offer a trade-off between accuracy and speed. While the transformer-based model yields a high accuracy of 97%, the LSTM-based model achieves near real-time inference.
引用
收藏
页数:15
相关论文
共 50 条
  • [41] Support Context-Aware IoT Content Request in Information Centric Networks
    Dong, Lijun
    Wang, Guoqiang
    [J]. 2016 25TH WIRELESS AND OPTICAL COMMUNICATION CONFERENCE (WOCC), 2016,
  • [42] Context-Aware Cognitive Radio Using Deep Learning
    Paisana, Francisco
    Selim, Ahmed
    Kist, Maicon
    Alvarez, Pedro
    Tallon, Justin
    Bluemm, Christian
    Puschmann, Andre
    DaSilva, Luiz
    [J]. 2017 IEEE INTERNATIONAL SYMPOSIUM ON DYNAMIC SPECTRUM ACCESS NETWORKS (IEEE DYSPAN), 2017,
  • [43] An IoT Platform with Monitoring Robot Applying CNN-Based Context-Aware Learning
    Shin, Moonsun
    Paik, Woojin
    Kim, Byungcheol
    Hwang, Seonmin
    [J]. SENSORS, 2019, 19 (11)
  • [44] Context-Aware Hybrid Encoding for Privacy-Preserving Computation in IoT Devices
    Khalili, Hossein
    Chien, Hao-Jen
    Hass, Amin
    Sehatbakhsh, Nader
    [J]. IEEE INTERNET OF THINGS JOURNAL, 2024, 11 (01) : 1054 - 1064
  • [45] An IoT Framework Based on SDN and NFV for Context-Aware Security
    Ong, Arlyn Verina
    Peradilla, Marnel
    [J]. 12TH INTERNATIONAL CONFERENCE ON UBIQUITOUS AND FUTURE NETWORKS (ICUFN 2021), 2021, : 167 - 172
  • [46] Towards Proactive Context-Aware IoT Environments by Means of Federated Learning
    Rentero-Trejo, Ruben
    Flores-Martin, Daniel
    Galan-Jimenez, Jaime
    Garcia-Alonso, Jose
    Manuel Murillo, Juan
    Berrocal, Javier
    [J]. ICWE 2021 WORKSHOPS, ICWE 2021 INTERNATIONAL WORKSHOPS, 2022, 1508 : 27 - 38
  • [47] Using Federated Learning to Achieve Proactive Context-Aware IoT Environments
    Rentero-Trejo, Ruben
    Flores-Martin, Daniel
    Galan-Jimenez, Jaime
    Garcia-Alonso, Jose
    Manuel Murillo, Juan
    Berrocal, Javier
    [J]. JOURNAL OF WEB ENGINEERING, 2022, 21 (01): : 53 - 74
  • [48] Context-Aware Mixed Reality: A Learning-Based Framework for Semantic-Level Interaction
    Chen, L.
    Tang, W.
    John, N. W.
    Wan, T. R.
    Zhang, J. J.
    [J]. COMPUTER GRAPHICS FORUM, 2020, 39 (01) : 484 - 496
  • [49] Real-time Context-aware learning System for IoT Applications
    Das, Bhaskar
    Almhana, Jalal
    Karim, Lutful
    [J]. 2022 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM 2022), 2022, : 305 - 310
  • [50] Context-Aware User Profiling and Multimedia Content Classification for Smart Devices
    Otebolaku, Abayomi Moradeyo
    Andrade, Maria Teresa
    [J]. 2014 28TH INTERNATIONAL CONFERENCE ON ADVANCED INFORMATION NETWORKING AND APPLICATIONS WORKSHOPS (WAINA), 2014, : 560 - 565