Sharing data in the form of text is important for a wide range of activities but it also raises a concern about privacy when sharing data that could be sensitive. Automated text anonymization is a solution for removing all the sensitive information from documents. However, this is a challenging task due to the unstructured form of textual data and the ambiguity of natural language. In this work, we present our implementation of an automated anonymization system, built in a modular structure, for documents written in Portuguese. Four different methods of anonymization are evaluated and compared. Two methods replace the sensitive information by artificial labels: suppression and tagging. The other two methods replace the information by textua...
The benefits of technological development have to be used and enjoyed, and it also includes the lega...
While vast amounts of personal data are shared daily on public online platforms and used by companie...
Text data collections enable the deployment of artificial intelligence algorithms for novel tasks. S...
Sharing data in the form of text is important for a wide range of activities but it also raises a co...
We present a novel benchmark and associated evaluation metrics for assessing the performance of text...
Despite its undeniable advantages, the exponential growth of data analytic capabilities implies a si...
Proceedings of: The International Conference on Knowledge Discovery and Information Retrieval, Octob...
AbstractOrganizations often have a dilemma in relation to their documents: ensure confidentiality of...
Publisher Copyright: © 2022 Copyright for this paper by its authors.The EU General Data Protection R...
O Processamento de Linguagem Natural (PLN) teve uma evolução explosiva nos últimos 5 anos, princip...
Data sharing is a central aspect of judicial systems. The openly accessible documents can make the j...
National audienceIn order to ease research data sharing and scientific comparison, researchers need ...
International audienceThis paper presents two anonymisation methods to process an SMS corpus. The fi...
Netanos (Named Entity-based Text ANonymization for Open Science) is a natural language processing so...
In this work, we propose a bootstrapping strategy so as to enrich anonymization models for Catalan m...
The benefits of technological development have to be used and enjoyed, and it also includes the lega...
While vast amounts of personal data are shared daily on public online platforms and used by companie...
Text data collections enable the deployment of artificial intelligence algorithms for novel tasks. S...
Sharing data in the form of text is important for a wide range of activities but it also raises a co...
We present a novel benchmark and associated evaluation metrics for assessing the performance of text...
Despite its undeniable advantages, the exponential growth of data analytic capabilities implies a si...
Proceedings of: The International Conference on Knowledge Discovery and Information Retrieval, Octob...
AbstractOrganizations often have a dilemma in relation to their documents: ensure confidentiality of...
Publisher Copyright: © 2022 Copyright for this paper by its authors.The EU General Data Protection R...
O Processamento de Linguagem Natural (PLN) teve uma evolução explosiva nos últimos 5 anos, princip...
Data sharing is a central aspect of judicial systems. The openly accessible documents can make the j...
National audienceIn order to ease research data sharing and scientific comparison, researchers need ...
International audienceThis paper presents two anonymisation methods to process an SMS corpus. The fi...
Netanos (Named Entity-based Text ANonymization for Open Science) is a natural language processing so...
In this work, we propose a bootstrapping strategy so as to enrich anonymization models for Catalan m...
The benefits of technological development have to be used and enjoyed, and it also includes the lega...
While vast amounts of personal data are shared daily on public online platforms and used by companie...
Text data collections enable the deployment of artificial intelligence algorithms for novel tasks. S...