International audienceIn sensitive domains, the sharing of corpora is restricted due to confidentiality, copyrights, or trade secrets. Automatic text generation can help alleviate these issues by producing synthetic texts that mimic the linguistic properties of real documents while preserving confidentiality. In this study, we assess the usability of synthetic corpus as a substitute training corpus for clinical information extraction. Our goal is to automatically produce a clinical case corpus annotated with clinical entities and to evaluate it for a named entity recognition (NER) task. We use two auto-regressive neural models partially or fully trained on generic French texts and fine-tuned on clinical cases to produce a corpus of syntheti...
One of the central tasks of medical text analysis is to extract and structure meaningful information...
Disorder named entity recognition (DNER) is a fundamental task of biomedical natural language proces...
In the Big Data era, there is an increasing need to fully exploit and analyze the huge quantity of i...
International audienceIn sensitive domains, the sharing of corpora is restricted due to confidential...
A major hurdle in the development of natural language processing (NLP) methods for Electronic Health...
International audienceA vast amount of crucial information about patients resides solely in unstruct...
Background: Named entity recognition (NER) systems are commonly built using supervised methods that ...
The health and life science domains are well known for their wealth of named entities found in large...
Electronic health record systems are ubiquitous and the majority of patients’ data are now being col...
The health and life science domains are well known for their wealth of named entities found in large...
Medical texts such as radiology reports or electronic health records are a powerful source of data f...
AbstractBackgroundTo facilitate research applying Natural Language Processing to clinical documents,...
International audienceOBJECTIVE:We aimed to enhance the performance of a supervised model for clinic...
Funder: EPSRC Healtex Feasibility Funding (grant EP/N027280/1): "Towards Shareable Data in Clinical ...
One of the central tasks of medical text analysis is to extract and structure meaningful information...
Disorder named entity recognition (DNER) is a fundamental task of biomedical natural language proces...
In the Big Data era, there is an increasing need to fully exploit and analyze the huge quantity of i...
International audienceIn sensitive domains, the sharing of corpora is restricted due to confidential...
A major hurdle in the development of natural language processing (NLP) methods for Electronic Health...
International audienceA vast amount of crucial information about patients resides solely in unstruct...
Background: Named entity recognition (NER) systems are commonly built using supervised methods that ...
The health and life science domains are well known for their wealth of named entities found in large...
Electronic health record systems are ubiquitous and the majority of patients’ data are now being col...
The health and life science domains are well known for their wealth of named entities found in large...
Medical texts such as radiology reports or electronic health records are a powerful source of data f...
AbstractBackgroundTo facilitate research applying Natural Language Processing to clinical documents,...
International audienceOBJECTIVE:We aimed to enhance the performance of a supervised model for clinic...
Funder: EPSRC Healtex Feasibility Funding (grant EP/N027280/1): "Towards Shareable Data in Clinical ...
One of the central tasks of medical text analysis is to extract and structure meaningful information...
Disorder named entity recognition (DNER) is a fundamental task of biomedical natural language proces...
In the Big Data era, there is an increasing need to fully exploit and analyze the huge quantity of i...