Multilingual Simplification of Medical Texts

被引:0
|
作者
Joseph, Sebastian [1 ]
Kazanas, Kathryn [1 ]
Reina, Keziah [1 ]
Ramanathan, Vishnesh J. [2 ]
Xu, Wei [2 ]
Wallace, Byron C. [3 ]
Li, Junyi Jessy [1 ]
机构
[1] Univ Texas Austin, Austin, TX 78712 USA
[2] Georgia Inst Technol, Atlanta, GA USA
[3] Northeastern Univ, Boston, MA USA
基金
美国国家科学基金会; 美国国家卫生研究院;
关键词
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Automated text simplification aims to produce simple versions of complex texts. This task is especially useful in the medical domain, where the latest medical findings are typically communicated via complex, technical articles. This creates barriers for laypeople seeking access to up-to-date medical findings, consequently impeding progress on health literacy. Most existing work on medical text simplification has focused on monolingual settings, with the result that such evidence would be available only in just one language (most often, English). This work addresses this limitation via multilingual simplification, i.e., directly simplifying complex texts into simplified texts in multiple languages. We introduce MULTICOCHRANE, the first sentence-aligned multilingual text simplification dataset for the medical domain in four languages: English, Spanish, French, and Farsi. We evaluate fine-tuned and zero-shot models across these languages with extensive human assessments and analyses. Although models can generate viable simplified texts, we identify several outstanding challenges that this dataset might be used to address.
引用
收藏
页码:16662 / 16692
页数:31
相关论文
共 50 条
  • [1] Paragraph-level Simplification of Medical Texts
    Devaraj, Ashwin
    Marshall, Iain J.
    Wallace, Byron C.
    Li, Junyi Jessy
    2021 CONFERENCE OF THE NORTH AMERICAN CHAPTER OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS: HUMAN LANGUAGE TECHNOLOGIES (NAACL-HLT 2021), 2021, : 4972 - 4984
  • [2] Paragraph-level Simplification of Medical Texts
    Devaraj, Ashwin
    Marshall, Iain J.
    Wallace, Byron C.
    Li, Junyi Jessy
    NAACL-HLT 2021 - 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Proceedings of the Conference, 2021, : 4972 - 4984
  • [3] Paragraph-level simplification of medical texts
    Devaraj, Ashwin
    Wallace, Byron C.
    Marshall, Iain J.
    Li, Junyi Jessy
    arXiv, 2021,
  • [4] Assessing AI Simplification of Medical Texts: Readability and Content Fidelity
    Picton, Bryce
    Andalib, Saman
    Spina, Aidin
    Camp, Brandon
    Solomon, Sean S.
    Liang, Jason
    Chen, Patrick M.
    Chen, Jefferson W.
    Hsu, Frank P.
    Oh, Michael Y.
    INTERNATIONAL JOURNAL OF MEDICAL INFORMATICS, 2025, 195
  • [5] Creation of a parallel corpora from comparable corpora for the simplification of medical texts in French
    Cardon, Remi
    Grabar, Natalia
    TRAITEMENT AUTOMATIQUE DES LANGUES, 2020, 61 (02): : 15 - 39
  • [6] Exceptional Texts on the Multilingual Web
    Brelstaff, Gavin
    Chessa, Francesca
    WWW'15 COMPANION: PROCEEDINGS OF THE 24TH INTERNATIONAL CONFERENCE ON WORLD WIDE WEB, 2015, : 847 - 851
  • [7] Compression of multilingual aligned texts
    Conley, Ehud S.
    Klein, Shmuel T.
    DCC 2006: DATA COMPRESSION CONFERENCE, PROCEEDINGS, 2006, : 442 - 442
  • [8] Med-EASi: Finely Annotated Dataset and Models for Controllable Simplification of Medical Texts
    Basu, Chandrayee
    Vasu, Rosni
    Yasunaga, Michihiro
    Yang, Qian
    THIRTY-SEVENTH AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE, VOL 37 NO 12, 2023, : 14093 - 14101
  • [9] Automatic Simplification of Lithuanian Administrative Texts
    Mandravickaite, Justina
    Rimkiene, Egle
    Kapkan, Danguole Kotryna
    Kalinauskaite, Danguole
    Krilavicius, Tomas
    ALGORITHMS, 2024, 17 (11)
  • [10] The concordance of multilingual legal texts at the WTO
    Condon, Bradly J.
    JOURNAL OF MULTILINGUAL AND MULTICULTURAL DEVELOPMENT, 2012, 33 (06) : 525 - 538