Multi-modal features and correlation incorporated Naive Bayes classifier for a semantic-enriched lecture video retrieval system

被引:9
|
作者
Poornima, N. [1 ]
Saleena, B. [1 ]
机构
[1] VIT, Sch Comp Sci & Engn, Chennai Campus, Madras, Tamil Nadu, India
来源
IMAGING SCIENCE JOURNAL | 2018年 / 66卷 / 05期
关键词
Key frame; keywords; image texture; GOM; tesseract; SEGMENTATION;
D O I
10.1080/13682199.2017.1419549
中图分类号
TB8 [摄影技术];
学科分类号
0804 ;
摘要
With the advancement of science and technology and the development of internet sources, e-learning draws huge attention as it is capable of providing classroom lectures in the form of videos. Here, a method has been proposed for effective retrieval of the lecture videos, which employs the correlated naive Bayes (CNB) classifier. In this proposed method, Optical Character Recognition uses the tesseract classifier and GOM (Gabor Ordinal Measure) to extract the textual features and the image texture from the key frames. K-means clustering clusters the features and the classifier retrieves the relevant video. Experimentation has been done in the MATLAB and the parameters such as precision, recall, and F measure of the CNB are compared over the other methods such as K-Nearest Neighbour and naive Bayes (NB) classifiers. The CNB classifier achieves a better precision, recall, and F measure rate of 0.9366, 0.9511, and 0.9426, respectively.
引用
下载
收藏
页码:263 / 277
页数:15
相关论文
共 12 条
  • [1] A multi-modal system for the retrieval of semantic video events
    Amir, A
    Basu, S
    Iyengar, G
    Lin, CY
    Naphade, M
    Smith, JR
    Srinivasan, S
    Tseng, B
    COMPUTER VISION AND IMAGE UNDERSTANDING, 2004, 96 (02) : 216 - 236
  • [2] Multi-modal Language Models for Lecture Video Retrieval
    Chen, Huizhong
    Cooper, Matthew
    Joshi, Dhiraj
    Girod, Bernd
    PROCEEDINGS OF THE 2014 ACM CONFERENCE ON MULTIMEDIA (MM'14), 2014, : 1081 - 1084
  • [3] A Multi-modal System for Video Semantic Understanding
    Lv, Zhengwei
    Lei, Tao
    Liang, Xiao
    Shi, Zhizhong
    Liu, Duoxing
    CCKS 2021 - EVALUATION TRACK, 2022, 1553 : 34 - 43
  • [4] Deep Multi-Modal Hashing With Semantic Enhancement for Multi-Label Micro-Video Retrieval
    Jing, Peiguang
    Sun, Haoyi
    Nie, Liqiang
    Li, Yun
    Su, Yuting
    IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2024, 36 (10) : 5080 - 5091
  • [5] An Intelligent Advertisement Short Video Production System via Multi-Modal Retrieval
    Wei, Yanheng
    Huang, Lianghua
    Zhang, Yanhao
    Zheng, Yun
    Pan, Pan
    PROCEEDINGS OF THE 45TH INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL (SIGIR '22), 2022, : 3368 - 3372
  • [6] A multi-modal lecture video indexing and retrieval framework with multi-scale residual attention network and multi-similarity computation
    Debnath, A.
    Rao, K. Sreenivasa
    Das, Partha P.
    SIGNAL IMAGE AND VIDEO PROCESSING, 2024, 18 (03) : 1993 - 2006
  • [7] A multi-modal lecture video indexing and retrieval framework with multi-scale residual attention network and multi-similarity computation
    A. Debnath
    K. Sreenivasa Rao
    Partha P. Das
    Signal, Image and Video Processing, 2024, 18 : 1993 - 2006
  • [8] Fashion Focus: Multi-modal Retrieval System for Video Commodity Localization in E-commerce
    Zhang, Yanhao
    Wang, Qiang
    Pan, Pan
    Zheng, Yun
    Da, Cheng
    Sun, Siyang
    Xu, Yinghui
    THIRTY-FIFTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, THIRTY-THIRD CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE AND THE ELEVENTH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2021, 35 : 16127 - 16128
  • [9] Using Multi-Modal Semantic Association Rules to fuse keywords and visual features automatically for Web image retrieval
    He, Ruhan
    Xiong, Naixue
    Yang, Laurence T.
    Park, Jong Hyuk
    INFORMATION FUSION, 2011, 12 (03) : 223 - 230
  • [10] Cross-Modal Learning Based on Semantic Correlation and Multi-Task Learning for Text-Video Retrieval
    Wu, Xiaoyu
    Wang, Tiantian
    Wang, Shengjin
    ELECTRONICS, 2020, 9 (12) : 1 - 17