Multi-modal hierarchical fusion network for fine-grained paper classification

被引:0
|
作者
Tan Yue
Yong Li
Jiedong Qin
Zonghai Hu
机构
[1] Beijing University of Posts and Telecommunications,School of Electronic Engineering, Beijing Key Laboratory of Work Safety Intelligent Monitoring
来源
关键词
Information fusion; Multi-modal information processing; Natural language processing application; Paper classification;
D O I
暂无
中图分类号
学科分类号
摘要
Because huge amount of scientific papers have been published at an accelerating rate, it is beneficial to do intelligent paper classification, especially fine-grained classification. However, existing natural language processing techniques are mostly coarse-grained. Some characteristics of fine-grained scientific paper classification needs special attention. One is that the number of data may well be quite limited. Number of papers in the lower level sub-fields inevitably becomes less. Meanwhile, emerging sub-fields with new discoveries will have few papers, nevertheless these sub-fields can be important. Furthermore, fine-grained labeling of scientific papers requires high expertise and is time consuming. Another aspect of scientific papers is that they contain multi-modal information. To address the above two issues, we propose a multi-modal hierarchical fusion network (MHFNet) for fine-grained paper classification. We treat paper abstract features, image features, and paper title features as three modalities. The MobileNetV2 model and the ALBERT model are combined in the proposed model to encode multi-modal information. Comparison results with baseline methods on both sufficiently large datasets and number-limited datasets show improvements, even more on number-limited datasets.
引用
收藏
页码:31527 / 31543
页数:16
相关论文
共 50 条
  • [1] Multi-modal hierarchical fusion network for fine-grained paper classification
    Yue, Tan
    Li, Yong
    Qin, Jiedong
    Hu, Zonghai
    [J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2024, 83 (11) : 31527 - 31543
  • [2] MKTformer: Fine-grained Meter Classification Based on Multi-modal Knowledge Transfer
    Zheng, Zhaoye
    Zhang, Ke
    Shi, Chaojun
    Zheng, Fei
    [J]. 2023 ASIA PACIFIC SIGNAL AND INFORMATION PROCESSING ASSOCIATION ANNUAL SUMMIT AND CONFERENCE, APSIPA ASC, 2023, : 1564 - 1570
  • [3] Fine-Grained Image Classification Based on Multi-Modal Features and Enhanced Alignment
    Han, Jing
    Zhang, Tianpeng
    Lyu, Xueqiang
    [J]. Beijing Youdian Daxue Xuebao/Journal of Beijing University of Posts and Telecommunications, 2024, 47 (04): : 130 - 135
  • [4] Multi-Modal Domain Adaptation for Fine-Grained Action Recognition
    Munro, Jonathan
    Damen, Dima
    [J]. 2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR), 2020, : 119 - 129
  • [5] Multi-Modal Domain Adaptation for Fine-grained Action Recognition
    Munro, Jonathan
    Damen, Dima
    [J]. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION WORKSHOPS (ICCVW), 2019, : 3723 - 3726
  • [6] Automatic Fine-Grained BIM element classification using Multi-Modal deep learning (MMDL)
    Liu, Hao
    Gan, Vincent J. L.
    Cheng, Jack C. P.
    Zhou, Shanjing
    [J]. ADVANCED ENGINEERING INFORMATICS, 2024, 61
  • [7] Complemental Attention Multi-Feature Fusion Network for Fine-Grained Classification
    Miao, Zhuang
    Zhao, Xun
    Wang, Jiabao
    Li, Yang
    Li, Hang
    [J]. IEEE SIGNAL PROCESSING LETTERS, 2021, 28 : 1983 - 1987
  • [8] A Multi-modal Approach to Fine-grained Opinion Mining on Video Reviews
    Marrese-Taylor, Edison
    Rodriguez-Opazo, Cristian
    Balazs, Jorge A.
    Gould, Stephen
    Matsuo, Yutaka
    [J]. PROCEEDINGS OF THE SECOND GRAND CHALLENGE AND WORKSHOP ON MULTIMODAL LANGUAGE (CHALLENGE-HML), VOL 1, 2020, : 8 - 18
  • [9] Fine-grained Activities Recognition with Coarse-grained Labeled Multi-modal Data
    Hu, Zhizhang
    Yu, Tong
    Zhang, Yue
    Pan, Shijia
    [J]. UBICOMP/ISWC '20 ADJUNCT: PROCEEDINGS OF THE 2020 ACM INTERNATIONAL JOINT CONFERENCE ON PERVASIVE AND UBIQUITOUS COMPUTING AND PROCEEDINGS OF THE 2020 ACM INTERNATIONAL SYMPOSIUM ON WEARABLE COMPUTERS, 2020, : 644 - 649
  • [10] Multi Fine-Grained Fusion Network for Depression Detection
    Zhou, Li
    Liu, Zhenyu
    Li, Yutong
    Duan, Yuchi
    Yu, Huimin
    Hu, Bin
    [J]. ACM Transactions on Multimedia Computing, Communications and Applications, 2024, 20 (08)