Summary: The presence of personally identifiable information (PII) in natural language portions of electronic health records (EHRs) constrains their broad reuse. Despite continuous improvements in automated detection of PII, residual identifiers require manual validation and correction. Here, we describe an automated de-identification system that employs an ensemble architecture, incorporating attention-based deep-learning models and rule-based methods, supported by heuristics for detecting PII in EHR data. Detected identifiers are then transformed into plausible, though fictional, surrogates to further obfuscate any leaked identifier. Our approach outperforms existing tools, with a recall of 0.992 and precision of 0.979 on the i2b2 2014 da...
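As a rough illustration of the rule-based detection and surrogate-replacement steps mentioned in the summary, the following minimal Python sketch flags two identifier types with regular expressions and swaps each match for a fictional stand-in. The pattern set, the fixed surrogate values, and the function names `detect` and `replace_with_surrogates` are assumptions made for this example and are not taken from the described system, which also relies on attention-based models and heuristics that are not shown here.

```python
import re

# Hypothetical rule set: illustrative patterns only, not the system's actual rules.
PATTERNS = {
    "DATE": re.compile(r"\b\d{1,2}/\d{1,2}/\d{4}\b"),
    "PHONE": re.compile(r"\b\d{3}-\d{3}-\d{4}\b"),
}

# Hypothetical fixed surrogates; a real system would generate varied, plausible values.
SURROGATES = {"DATE": "01/01/2000", "PHONE": "555-000-0000"}


def detect(text):
    """Return sorted (start, end, label) spans for every rule match."""
    spans = []
    for label, pattern in PATTERNS.items():
        for match in pattern.finditer(text):
            spans.append((match.start(), match.end(), label))
    return sorted(spans)


def replace_with_surrogates(text, spans):
    """Rebuild the text, substituting a surrogate for each detected span."""
    pieces = []
    cursor = 0
    for start, end, label in spans:
        if start < cursor:
            continue  # skip a span that overlaps one already replaced
        pieces.append(text[cursor:start])
        pieces.append(SURROGATES[label])
        cursor = end
    pieces.append(text[cursor:])
    return "".join(pieces)


note = "Seen on 03/14/2012, call 617-555-0199."
spans = detect(note)
deidentified = replace_with_surrogates(note, spans)
print(deidentified)
```

Replacing matches with realistic-looking surrogates, rather than a visible mask, means that an identifier the detector misses is harder to tell apart from the fictional values around it.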
In recent years, the need to de-identify privacy-sensitive information within Electronic Health Re...
An abundance of electronic health records (EHR) is produced every day within healthcare. The records...
Medical data is an important part of modern medicine. However, with the rapid increase in the amount...
The widespread adoption of Electronic Health Records (EHRs) means an unprecedented amount of patient...
Objective: Evaluate the effectiveness and robustness of Anonym, a tool for de-identifying free-text ...
Background: Text-based patient medical records are a vital resource in medical research. In order to...
One broad goal of biomedical informatics is to generate fully-synthetic, faithfully representative e...
Background: Electronic health records (EHRs) provide enormous potential for health research but also p...
Medical health records often contain clinical investigation results and critical information regard...
A recent promise to access unstructured clinical data from electronic health records on large-scale ...
A major hurdle in the development of natural language processing (NLP) methods for Electronic Health...
Objective: Patient notes in electronic health records (EHRs) may contain critical information for me...