TranSQ: Transformer-Based Semantic Query for Medical Report Generation

被引:4
|
作者
Kong, Ming [1 ]
Huang, Zhengxing [1 ]
Kuang, Kun [1 ,2 ]
Zhu, Qiang [1 ]
Wu, Fei [1 ,2 ,3 ,4 ]
机构
[1] Zhejiang Univ, Inst Artificial Intelligence, Hangzhou, Peoples R China
[2] Key Lab Corneal Dis Res Zhejiang Prov, Hangzhou, Peoples R China
[3] Zhejiang Univ, Shanghai Inst Adv Study, Shanghai, Peoples R China
[4] Shanghai AI Lab, Shanghai, Peoples R China
关键词
Medical report generation; Transformer; Auxiliary diagnosis; Deep learning;
D O I
10.1007/978-3-031-16452-1_58
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Medical report generation, which aims at automatically generating coherent reports with multiple sentences for the given medical images, has received growing research interest due to its tremendous potential in facilitating clinical workflow and improving health services. Due to the highly patterned nature of medical reports, each sentence can be viewed as the description of an image observation with a specific purpose. To this end, this study proposes a novel Transformer-based Semantic Query (TranSQ) model that treats the medical report generation as a direct set prediction problem. Specifically, our model generates a set of semantic features to match plausible clinical concerns and compose the report with sentence retrieval and selection. Experimental results on two prevailing radiology report datasets, i.e., IU X-Ray and MIMIC-CXR, demonstrate that our model outperforms state-of-the-art models on the generation task in terms of both language generation effectiveness and clinical efficacy, which highlights the utility of our approach in generating medical reports with topics of clinical concern as well as sentence-level visual-semantic attention mappings. The source code is available at https://github.com/zjukongming/TranSQ.
引用
收藏
页码:610 / 620
页数:11
相关论文
共 50 条
  • [1] Simulating doctors' thinking logic for chest X-ray report generation via Transformer-based Semantic Query learning
    Gao, Danyang
    Kong, Ming
    Zhao, Yongrui
    Huang, Jing
    Huang, Zhengxing
    Kuang, Kun
    Wu, Fei
    Zhu, Qiang
    MEDICAL IMAGE ANALYSIS, 2024, 91
  • [2] SeTransformer: A Transformer-Based Code Semantic Parser for Code Comment Generation
    Li, Zheng
    Wu, Yonghao
    Peng, Bin
    Chen, Xiang
    Sun, Zeyu
    Liu, Yong
    Paul, Doyle
    IEEE TRANSACTIONS ON RELIABILITY, 2023, 72 (01) : 258 - 273
  • [3] Hierarchical Transformer-based Query by Multiple Documents
    Huang, Zhiqi
    Naseri, Shahrzad
    Bonab, Hamed
    Sarwar, Sheikh Muhammad
    Allan, James
    PROCEEDINGS OF THE 2023 ACM SIGIR INTERNATIONAL CONFERENCE ON THE THEORY OF INFORMATION RETRIEVAL, ICTIR 2023, 2023, : 105 - 115
  • [4] A Medical Semantic-Assisted Transformer for Radiographic Report Generation
    Wang, Zhanyu
    Tang, Mingkang
    Wang, Lei
    Li, Xiu
    Zhou, Luping
    MEDICAL IMAGE COMPUTING AND COMPUTER ASSISTED INTERVENTION, MICCAI 2022, PT III, 2022, 13433 : 655 - 664
  • [5] Incorporating Medical Knowledge to Transformer-based Language Models for Medical Dialogue Generation
    Naseem, Usman
    Bandi, Ajay
    Raza, Shaina
    Rashid, Junaid
    Chakravarthi, Bharathi Raja
    PROCEEDINGS OF THE 21ST WORKSHOP ON BIOMEDICAL LANGUAGE PROCESSING (BIONLP 2022), 2022, : 110 - 115
  • [6] TransRSS: Transformer-based Radar Semantic Segmentation
    Zou, Hao
    Xie, Zhen
    Ou, Jiarong
    Gao, Yutao
    2023 IEEE INTERNATIONAL CONFERENCE ON ROBOTICS AND AUTOMATION, ICRA, 2023, : 6965 - 6972
  • [7] BertSRC: transformer-based semantic relation classification
    Lee, Yeawon
    Son, Jinseok
    Song, Min
    BMC MEDICAL INFORMATICS AND DECISION MAKING, 2022, 22 (01)
  • [8] BertSRC: transformer-based semantic relation classification
    Yeawon Lee
    Jinseok Son
    Min Song
    BMC Medical Informatics and Decision Making, 22
  • [9] Anchor DETR: Query Design for Transformer-Based Object Detection
    Wang, Yingming
    Zhang, Xiangyu
    Yang, Tong
    Sun, Jian
    THIRTY-SIXTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE / THIRTY-FOURTH CONFERENCE ON INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE / THE TWELVETH SYMPOSIUM ON EDUCATIONAL ADVANCES IN ARTIFICIAL INTELLIGENCE, 2022, : 2567 - 2575
  • [10] Transformer-Based Semantic Segmentation for Recycling Materials in Construction
    Wang, Xin
    Han, Wei
    Mo, Sicheng
    Cai, Ting
    Gong, Yijing
    Li, Yin
    Zhu, Zhenhua
    COMPUTING IN CIVIL ENGINEERING 2023-DATA, SENSING, AND ANALYTICS, 2024, : 25 - 33