Sensitive data is normally required to develop rule-based or train machine learning-based models for de-identifying electronic health record (EHR) clinical notes; and this presents important problems for patient privacy. In this study, we add non-sensitive public datasets to EHR training data; (i) scientific medical text and (ii) Wikipedia word vectors. The data, all in Swedish, is used to train a deep learning model using recurrent neural networks. Tests on pseudonymized Swedish EHR clinical notes showed improved precision and recall from 55.62% and 80.02% with the base EHR embedding layer, to 85.01% and 87.15% when Wikipedia word vectors are added. These results suggest that non-sensitive text from the general domain can be used to train ...
The widespread adoption of Electronic Health Records (EHRs) means an unprecedented amount of patient...
The widespread adoption of Electronic Health Records (EHRs) means an unprecedented amount of patient...
Neural Network (NN) architectures are used more and more to model large amounts of data, such as tex...
Sensitive data is normally required to develop rule-based or train machine learning-based models for...
An abundance of electronic health records (EHR) is produced every day within healthcare. The records...
Background Text mining and natural language processing of clinical text, such as notes from electron...
A major hurdle in the development of natural language processing (NLP) methods for Electronic Health...
One broad goal of biomedical informatics is to generate fully-synthetic, faithfully representative e...
Electronic health records (EHRs) are a rich source of information for medical research and public he...
In the last years, the need to de-identify privacy-sensitive information within Electronic Health Re...
In the last years, the need to de-identify privacy-sensitive information within Electronic Health Re...
markdownabstractThe use of electronic health records (EHRs) has grown rapidly in the last decade. Th...
Medical data is an important part of modern medicine. However, with the rapid increase in the amount...
The impact of de-identification on data quality and, in particular, utility for developing models fo...
Electronic health records (EHR) contain a lot of valuable information about individual patients and ...
The widespread adoption of Electronic Health Records (EHRs) means an unprecedented amount of patient...
The widespread adoption of Electronic Health Records (EHRs) means an unprecedented amount of patient...
Neural Network (NN) architectures are used more and more to model large amounts of data, such as tex...
Sensitive data is normally required to develop rule-based or train machine learning-based models for...
An abundance of electronic health records (EHR) is produced every day within healthcare. The records...
Background Text mining and natural language processing of clinical text, such as notes from electron...
A major hurdle in the development of natural language processing (NLP) methods for Electronic Health...
One broad goal of biomedical informatics is to generate fully-synthetic, faithfully representative e...
Electronic health records (EHRs) are a rich source of information for medical research and public he...
In the last years, the need to de-identify privacy-sensitive information within Electronic Health Re...
In the last years, the need to de-identify privacy-sensitive information within Electronic Health Re...
markdownabstractThe use of electronic health records (EHRs) has grown rapidly in the last decade. Th...
Medical data is an important part of modern medicine. However, with the rapid increase in the amount...
The impact of de-identification on data quality and, in particular, utility for developing models fo...
Electronic health records (EHR) contain a lot of valuable information about individual patients and ...
The widespread adoption of Electronic Health Records (EHRs) means an unprecedented amount of patient...
The widespread adoption of Electronic Health Records (EHRs) means an unprecedented amount of patient...
Neural Network (NN) architectures are used more and more to model large amounts of data, such as tex...