Fish behavior recognition based on an audio-visual multimodal interactive fusion network

被引:0
|
作者
Yang, Yuxin [1 ,2 ,3 ,4 ]
Yu, Hong [1 ,2 ,3 ,4 ]
Zhang, Xin [1 ,2 ,3 ,4 ]
Zhang, Peng [1 ,2 ,3 ,4 ]
Tu, Wan [1 ,2 ,3 ,4 ]
Gu, Lishuai [1 ,2 ,3 ,4 ]
机构
[1] College of Information Engineering, Dalian Ocean University, Liaoning, Dalian,116023, China
[2] Dalian Key Laboratory of Smart Fisheries, Liaoning, Dalian,116023, China
[3] Key Laboratory of Environment Controlled Aquaculture (Dalian Ocean University), Liaoning, Dalian,116023, China
[4] Liaoning Provincial Key of Marine Information Technology, Liaoning, Dalian,116023, China
关键词
Compendex;
D O I
10.1016/j.aquaeng.2024.102471
中图分类号
学科分类号
摘要
引用
收藏
相关论文
共 50 条
  • [1] Audio-Visual Fusion Network Based on Conformer for Multimodal Emotion Recognition
    Guo, Peini
    Chen, Zhengyan
    Li, Yidi
    Liu, Hong
    [J]. ARTIFICIAL INTELLIGENCE, CICAI 2022, PT II, 2022, 13605 : 315 - 326
  • [2] Multimodal Attentive Fusion Network for audio-visual event recognition
    Brousmiche, Mathilde
    Rouat, Jean
    Dupont, Stephane
    [J]. INFORMATION FUSION, 2022, 85 : 52 - 59
  • [3] Multimodal Sparse Transformer Network for Audio-Visual Speech Recognition
    Song, Qiya
    Sun, Bin
    Li, Shutao
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 34 (12) : 10028 - 10038
  • [4] Fuzzy-Neural-Network Based Audio-Visual Fusion for Speech Recognition
    Wu, Gin-Der
    Tsai, Hao-Shu
    [J]. 2019 1ST INTERNATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE IN INFORMATION AND COMMUNICATION (ICAIIC 2019), 2019, : 210 - 214
  • [5] Audio-Visual Action Recognition Using Transformer Fusion Network
    Kim, Jun-Hwa
    Won, Chee Sun
    [J]. APPLIED SCIENCES-BASEL, 2024, 14 (03):
  • [6] Audio-Visual Fusion Based on Interactive Attention for Person Verification
    Jing, Xuebin
    He, Liang
    Song, Zhida
    Wang, Shaolei
    [J]. SENSORS, 2023, 23 (24)
  • [7] Multimodal Deep Convolutional Neural Network for Audio-Visual Emotion Recognition
    Zhang, Shiqing
    Zhang, Shiliang
    Huang, Tiejun
    Gao, Wen
    [J]. ICMR'16: PROCEEDINGS OF THE 2016 ACM INTERNATIONAL CONFERENCE ON MULTIMEDIA RETRIEVAL, 2016, : 281 - 284
  • [8] Lip landmark-based audio-visual speech enhancement with multimodal feature fusion network
    Li, Yangke
    Zhang, Xinman
    [J]. NEUROCOMPUTING, 2023, 549
  • [9] Continuous Phoneme Recognition based on Audio-Visual Modality Fusion
    Richter, Julius
    Liebold, Jeanine
    Gerkamnn, Timo
    [J]. 2022 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN), 2022,
  • [10] Robust Audio-Visual Speech Recognition Based on Hybrid Fusion
    Liu, Hong
    Li, Wenhao
    Yang, Bing
    [J]. 2020 25TH INTERNATIONAL CONFERENCE ON PATTERN RECOGNITION (ICPR), 2021, : 7580 - 7586