Anonymisation Models for Text Data: State of the Art, Challenges and Future Directions

被引:0
|
作者
Lison, Pierre [1 ]
Pilan, Ildiko [1 ]
Sanchez, David [2 ]
Batet, Montserrat [2 ]
Ovrelid, Lilja [3 ]
机构
[1] Norwegian Comp Ctr, Oslo, Norway
[2] Univ Rovira & Virgili, CYBERCAT, UNESCO Chair Data Privacy, Tarragona, Spain
[3] Univ Oslo, Language Technol Grp, Oslo, Norway
关键词
DE-IDENTIFICATION; PRIVACY PROTECTION; INFORMATION; SURROGATES; REDACTION; RELEASE;
D O I
暂无
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This position paper investigates the problem of automated text anonymisation, which is a pre-requisite for secure sharing of documents containing sensitive information about individuals. We summarise the key concepts behind text anonymisation and provide a review of current approaches. Anonymisation methods have so far been developed in two fields with little mutual interaction, namely natural language processing and privacy-preserving data publishing. Based on a case study, we outline the benefits and limitations of these approaches and discuss a number of open challenges, such as (1) how to account for multiple types of semantic inferences, (2) how to strike a balance between disclosure risk and data utility and (3) how to evaluate the quality of the resulting anonymisation. We lay out a case for moving beyond sequence labelling models and incorporate explicit measures of disclosure risk into the text anonymisation process.
引用
收藏
页码:4188 / 4203
页数:16
相关论文
共 50 条
  • [1] Vehicular Clouds: State of the Art, Challenges and Future Directions
    Aloqaily, Moayad
    Kantarci, Burak
    Mouftah, Hussein T.
    2015 IEEE JORDAN CONFERENCE ON APPLIED ELECTRICAL ENGINEERING AND COMPUTING TECHNOLOGIES (AEECT), 2015,
  • [2] A review of operations research models in invasive species management: state of the art, challenges, and future directions
    İ. Esra Büyüktahtakın
    Robert G. Haight
    Annals of Operations Research, 2018, 271 : 357 - 403
  • [3] Implementation of the zooplankton functional response in plankton models: State of the art, recent challenges and future directions
    Morozov, Andrew
    Poggiale, Jean-Christophe
    Cordoleani, Flora
    PROGRESS IN OCEANOGRAPHY, 2012, 103 : 80 - 91
  • [4] A review of operations research models in invasive species management: state of the art, challenges, and future directions
    Buyuktahtakin, I. Esra
    Haight, Robert G.
    ANNALS OF OPERATIONS RESEARCH, 2018, 271 (02) : 357 - 403
  • [5] Current state of the art, challenges and future directions for audio watermarking
    Swanson, MD
    Zhu, B
    Tewfik, AH
    IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA COMPUTING AND SYSTEMS, PROCEEDINGS VOL 1, 1999, : 19 - 24
  • [6] Securing Outsourced VNFs: Challenges, State of the Art, and Future Directions
    Marku, Enio
    Biczok, Gergely
    Boyd, Colin
    IEEE COMMUNICATIONS MAGAZINE, 2020, 58 (07) : 72 - 77
  • [7] Bioinspired Soft Robotics: State of the Art, Challenges, and Future Directions
    Maxwell Hammond
    Venanzio Cichella
    Caterina Lamuta
    Current Robotics Reports, 2023, 4 (3): : 65 - 80
  • [8] Modelling forest ecosystems: state of the art, challenges, and future directions
    Landsberg, J
    CANADIAN JOURNAL OF FOREST RESEARCH, 2003, 33 (03) : 385 - 397
  • [9] Text Mining: Challenges and Future Directions
    Akilan, A.
    2015 2ND INTERNATIONAL CONFERENCE ON ELECTRONICS AND COMMUNICATION SYSTEMS (ICECS), 2015, : 1679 - 1683
  • [10] Data Warehouse State of the art and future challenges
    El Moukhi, Nawfal
    El Azami, Ikram
    Mouloudi, Aziz
    2015 INTERNATIONAL CONFERENCE ON CLOUD TECHNOLOGIES AND APPLICATIONS (CLOUDTECH 15), 2015, : 306 - 311