The state of the art in natural language processing is based on transformer models that are pre-trained on general-domain corpora and enable efficient transfer learning across a wide variety of downstream tasks, even with limited data sets. However, the performance of these models degrades significantly in specialized, sector-specific domains. This is problematic in the Italian legal context, because the language of generic open-source corpora (e.g., Wikipedia and news articles) differs markedly from legal language, which can be cryptic, Latin-based, and full of domain-specific formulaic expressions. In this paper, we introduce ITALIAN-LEGAL-BERT, a model obtained by additional pre-training of the Italian BERT model on Italian civil law corpora. It ach...
The use of corpus linguistics for technical translations has largely been advocated by scholars over...
Labelling factual information on the token level in legal cases requires legal expertise and is time...
Automatic identification of rhetorical roles can help in many downstream applications of legal docum...
This paper presents the result of the research project Lexdatafication that aims to model the legal...
Language models have proven to be very useful when adapted to specific domains...
The study presents a corpus-based analysis of lexico-grammatical features of Italian legal lay-langu...
Transformer-based architectures have in recent years advanced state-of-the-art performance in Natura...
The aim of this chapter is to provide an overview of the interdisciplinary academic field of lega...
BERT has achieved impressive performance in several NLP tasks. However, there has been limited inves...
Following the implementation of South Tyrol’s Statute of Autonomy, the public administrations of the...