textNatural language pipelines consist of various natural language algorithms that use the annotations of a previous algorithm to compute more annotations. These algorithms tend to be expensive in terms of computational power. Therefore it is advantageous to parallelize them in order to reduce the time necessary to analyze a large document collection. The goal of this project was to develop a new framework to encapsulate algorithms such that they may be used as part of a pipeline without any additional work. The framework consists of a custom-built data structure called Slab which implements type safety and functional transparency to integrate itself into the Scala programming language. Because of this integration, it is possible to use Spa...
The Intelcomp NLP pipeline can be defined as a collection of tools that apply the requested transfor...
In recent years there has been a growing interest in the commercial deployment of NLP technologies. ...
International audienceCommon Crawl is a considerably large, heterogeneous multilingual corpus compri...
Spark NLP is a Natural Language Processing (NLP) library built on top of Apache Spark ML. It provide...
This thesis is intended to deal with questions related to the processing of naturally occurring text...
International audienceNatural Language Processing (NLP) of textual data is usually broken down into ...
Natural Language Processing continues to grow in popularity in a range of research and commercial ap...
We classify and review current approaches to software infrastructure for research, development and d...
This paper describes a conceptual framework that enables online NLP pipelined applications to solve ...
Tanl (Natural Language Text Analytics) is a suite of tools for text analytics based on the software ...
Natural Language Processing (NLP)is an important research direction, since it addresses the needs of...
The vision of Grid technology has motivated a huge effort in developing a Grid architecture which w...
In this paper, we describe Teanga, a linked data based platform for natural language processing (NLP...
In this paper, we describe Teanga, a linked data based platform for natural language processing (NLP...
The field of natural language processing (aka NLP) is an intersection of the study of linguistics, c...
The Intelcomp NLP pipeline can be defined as a collection of tools that apply the requested transfor...
In recent years there has been a growing interest in the commercial deployment of NLP technologies. ...
International audienceCommon Crawl is a considerably large, heterogeneous multilingual corpus compri...
Spark NLP is a Natural Language Processing (NLP) library built on top of Apache Spark ML. It provide...
This thesis is intended to deal with questions related to the processing of naturally occurring text...
International audienceNatural Language Processing (NLP) of textual data is usually broken down into ...
Natural Language Processing continues to grow in popularity in a range of research and commercial ap...
We classify and review current approaches to software infrastructure for research, development and d...
This paper describes a conceptual framework that enables online NLP pipelined applications to solve ...
Tanl (Natural Language Text Analytics) is a suite of tools for text analytics based on the software ...
Natural Language Processing (NLP)is an important research direction, since it addresses the needs of...
The vision of Grid technology has motivated a huge effort in developing a Grid architecture which w...
In this paper, we describe Teanga, a linked data based platform for natural language processing (NLP...
In this paper, we describe Teanga, a linked data based platform for natural language processing (NLP...
The field of natural language processing (aka NLP) is an intersection of the study of linguistics, c...
The Intelcomp NLP pipeline can be defined as a collection of tools that apply the requested transfor...
In recent years there has been a growing interest in the commercial deployment of NLP technologies. ...
International audienceCommon Crawl is a considerably large, heterogeneous multilingual corpus compri...