Review: Privacy-Preservation in the Context of Natural Language Processing

Data privacy is one of the highly discussed issues in recent years as we encounter data breaches and privacy scandals often. This raises a lot of concerns about the ways the data is acquired and the potential information leaks. Especially in the field of Artificial Intelligence (AI), the widely usin...

Descripción completa

Guardado en:
Detalles Bibliográficos
Autores principales: Darshini Mahendran, Changqing Luo, Bridget T. Mcinnes
Formato: article
Lenguaje:EN
Publicado: IEEE 2021
Materias:
Acceso en línea:https://doaj.org/article/7f94d320d3ff44129ad0a37f7dcb7b12
Etiquetas: Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
id oai:doaj.org-article:7f94d320d3ff44129ad0a37f7dcb7b12
record_format dspace
spelling oai:doaj.org-article:7f94d320d3ff44129ad0a37f7dcb7b122021-11-18T00:04:30ZReview: Privacy-Preservation in the Context of Natural Language Processing2169-353610.1109/ACCESS.2021.3124163https://doaj.org/article/7f94d320d3ff44129ad0a37f7dcb7b122021-01-01T00:00:00Zhttps://ieeexplore.ieee.org/document/9592788/https://doaj.org/toc/2169-3536Data privacy is one of the highly discussed issues in recent years as we encounter data breaches and privacy scandals often. This raises a lot of concerns about the ways the data is acquired and the potential information leaks. Especially in the field of Artificial Intelligence (AI), the widely using of AI models aggravates the vulnerability of user privacy because a considerable portion of user data that AI models used is represented in natural language. In the past few years, many researchers have proposed NLP-based methods to address these data privacy challenges. To the best of our knowledge, this is the first interdisciplinary review discussing privacy preservation in the context of NLP. In this paper, we present a comprehensive review of previous research conducted to gather techniques and challenges of building and testing privacy-preserving systems in the context of Natural Language Processing (NLP). We group the different works under four categories: 1) Data privacy in the medical domain, 2) Privacy preservation in the technology domain, 3) Analysis of privacy policies, and 4) Privacy leaks detection in the text representation. This review compares the contributions and pitfalls of the various privacy violation detection and prevention works done using NLP techniques to help guide a path ahead.Darshini MahendranChangqing LuoBridget T. McinnesIEEEarticleData privacynatural language processingprivacy preservationprivacy policyElectrical engineering. Electronics. Nuclear engineeringTK1-9971ENIEEE Access, Vol 9, Pp 147600-147612 (2021)
institution DOAJ
collection DOAJ
language EN
topic Data privacy
natural language processing
privacy preservation
privacy policy
Electrical engineering. Electronics. Nuclear engineering
TK1-9971
spellingShingle Data privacy
natural language processing
privacy preservation
privacy policy
Electrical engineering. Electronics. Nuclear engineering
TK1-9971
Darshini Mahendran
Changqing Luo
Bridget T. Mcinnes
Review: Privacy-Preservation in the Context of Natural Language Processing
description Data privacy is one of the highly discussed issues in recent years as we encounter data breaches and privacy scandals often. This raises a lot of concerns about the ways the data is acquired and the potential information leaks. Especially in the field of Artificial Intelligence (AI), the widely using of AI models aggravates the vulnerability of user privacy because a considerable portion of user data that AI models used is represented in natural language. In the past few years, many researchers have proposed NLP-based methods to address these data privacy challenges. To the best of our knowledge, this is the first interdisciplinary review discussing privacy preservation in the context of NLP. In this paper, we present a comprehensive review of previous research conducted to gather techniques and challenges of building and testing privacy-preserving systems in the context of Natural Language Processing (NLP). We group the different works under four categories: 1) Data privacy in the medical domain, 2) Privacy preservation in the technology domain, 3) Analysis of privacy policies, and 4) Privacy leaks detection in the text representation. This review compares the contributions and pitfalls of the various privacy violation detection and prevention works done using NLP techniques to help guide a path ahead.
format article
author Darshini Mahendran
Changqing Luo
Bridget T. Mcinnes
author_facet Darshini Mahendran
Changqing Luo
Bridget T. Mcinnes
author_sort Darshini Mahendran
title Review: Privacy-Preservation in the Context of Natural Language Processing
title_short Review: Privacy-Preservation in the Context of Natural Language Processing
title_full Review: Privacy-Preservation in the Context of Natural Language Processing
title_fullStr Review: Privacy-Preservation in the Context of Natural Language Processing
title_full_unstemmed Review: Privacy-Preservation in the Context of Natural Language Processing
title_sort review: privacy-preservation in the context of natural language processing
publisher IEEE
publishDate 2021
url https://doaj.org/article/7f94d320d3ff44129ad0a37f7dcb7b12
work_keys_str_mv AT darshinimahendran reviewprivacypreservationinthecontextofnaturallanguageprocessing
AT changqingluo reviewprivacypreservationinthecontextofnaturallanguageprocessing
AT bridgettmcinnes reviewprivacypreservationinthecontextofnaturallanguageprocessing
_version_ 1718425232463626240