Anonymisation Models for Text Data: State of the Art, Challenges and Future Directions

被引:0
|
作者
Lison, Pierre [1 ]
Pilan, Ildiko [1 ]
Sanchez, David [2 ]
Batet, Montserrat [2 ]
Ovrelid, Lilja [3 ]
机构
[1] Norwegian Comp Ctr, Oslo, Norway
[2] Univ Rovira & Virgili, CYBERCAT, UNESCO Chair Data Privacy, Tarragona, Spain
[3] Univ Oslo, Language Technol Grp, Oslo, Norway
关键词
DE-IDENTIFICATION; PRIVACY PROTECTION; INFORMATION; SURROGATES; REDACTION; RELEASE;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This position paper investigates the problem of automated text anonymisation, which is a pre-requisite for secure sharing of documents containing sensitive information about individuals. We summarise the key concepts behind text anonymisation and provide a review of current approaches. Anonymisation methods have so far been developed in two fields with little mutual interaction, namely natural language processing and privacy-preserving data publishing. Based on a case study, we outline the benefits and limitations of these approaches and discuss a number of open challenges, such as (1) how to account for multiple types of semantic inferences, (2) how to strike a balance between disclosure risk and data utility and (3) how to evaluate the quality of the resulting anonymisation. We lay out a case for moving beyond sequence labelling models and incorporate explicit measures of disclosure risk into the text anonymisation process.
引用
收藏
页码:4188 / 4203
页数:16
相关论文
共 50 条
  • [21] Cyber resilience in industrial networks: A state of the art, challenges, and future directions
    Alrumaih, Thuraya N. I.
    Alenazi, Mohammed J. F.
    AlSowaygh, Nouf A.
    Humayed, Abdulmalik A.
    Alablani, Ibtihal A.
    JOURNAL OF KING SAUD UNIVERSITY-COMPUTER AND INFORMATION SCIENCES, 2023, 35 (09)
  • [22] Challenges, Opportunities and Future Directions of Smart Manufacturing: A State of Art Review
    Phuyal, Sudip
    Bista, Diwakar
    Bista, Rabindra
    SUSTAINABLE FUTURES, 2020, 2
  • [23] Natural Language Generation for Visualizations: State of the Art, Challenges and Future Directions
    Hoque, E.
    Islam, M. Saidul
    COMPUTER GRAPHICS FORUM, 2024,
  • [24] IoT Forensics: A State-of-the-Art Review, Challenges and Future Directions
    Alenezi, Ahmed
    Atlam, Hany F.
    Alsagri, Reem
    Alassafi, Madini O.
    Wills, Gary B.
    PROCEEDINGS OF THE 4TH INTERNATIONAL CONFERENCE ON COMPLEXITY, FUTURE INFORMATION SYSTEMS AND RISK (COMPLEXIS), 2019, : 106 - 115
  • [25] Knowledge Discovery and interactive Data Mining in Bioinformatics - State-of-the-Art, future challenges and research directions
    Holzinger, Andreas
    Dehmer, Matthias
    Jurisica, Igor
    BMC BIOINFORMATICS, 2014, 15
  • [26] Knowledge Discovery and interactive Data Mining in Bioinformatics - State-of-the-Art, future challenges and research directions
    Andreas Holzinger
    Matthias Dehmer
    Igor Jurisica
    BMC Bioinformatics, 15
  • [27] Data science for engineering design: State of the art and future directions
    Chiarello, Filippo
    Belingheri, Paola
    Fantoni, Gualtiero
    COMPUTERS IN INDUSTRY, 2021, 129
  • [28] Computational intelligence approaches for classification of medical data: State-of-the-art, future challenges and research directions
    Kalantari, Ali
    Kamsin, Amirrudin
    Shamshirband, Shahaboddin
    Gani, Abdullah
    Alinejad-Rokny, Hamid
    Chronopoulos, Anthony T.
    NEUROCOMPUTING, 2018, 276 : 2 - 22
  • [29] Global archaeomagnetic data: The state of the art and future challenges
    Brown, Maxwell C.
    Hervé, Gwenaël
    Korte, Monika
    Genevey, Agnès
    Physics of the Earth and Planetary Interiors, 2021, 318
  • [30] Global archaeomagnetic data: The state of the art and future challenges
    Brown, Maxwell C.
    Herve, Gwenael
    Korte, Monika
    Genevey, Agnes
    PHYSICS OF THE EARTH AND PLANETARY INTERIORS, 2021, 318