An Intelligent Retrieval Method for Audio and Video Content: Deep Learning Technology Based on Artificial Intelligence

Cited by: 0
Author
Sun, Maojin [1]
Affiliation
[1] CEICloud Data Storage Technol Beijing Co Ltd, Beijing 101111, Peoples R China
Source
IEEE ACCESS | 2024 / Vol. 12
Keywords
Feature extraction; Accuracy; 5G mobile communication; Deep learning; Visualization; Audio-visual systems; Information retrieval; Information systems; Audio-video content retrieval; deep learning; feature extraction; cross-modal retrieval; intelligent retrieval;
DOI
10.1109/ACCESS.2024.3450920
Chinese Library Classification
TP [Automation technology, computer technology];
Discipline classification code
0812;
Abstract
To address the challenges of efficient intelligent retrieval and cross-modal analysis brought by the surge in audio-video data, this study proposes an intelligent retrieval method for audio-video content based on deep learning techniques, aimed at improving retrieval efficiency and accuracy. This method extracts audio features using the Visual Geometry Group Network (VGG) and employs an adaptive clustering keyframe extraction algorithm (SKM) to extract video features. By integrating cross-learning within an embedding network, it enhances retrieval efficiency and accuracy. The test results on the CMU-MOSEI dataset demonstrate that our method outperforms traditional models such as Principal Component Analysis (PCA), Canonical Correlation Analysis (CCA), and state-of-the-art deep learning models like Deep Canonical Correlation Analysis (DCCA) and Domain-Adversarial Neural Network (DANN) in multimodal data processing and real-world retrieval tasks. In video processing, the average fidelity is 0.693, and the average compression ratio is 0.936, representing improvements of 30.75% and 7.09%, respectively, compared to traditional methods. Through the application of deep learning technology, this study not only optimizes the processing of single modalities but also enhances the handling of cross-modal data through a cross-learning framework.
Pages: 123430 - 123446
Page count: 17