International audienceDuring this internship, we worked on improving an open domain question answering system. We addressed the document selection part which is structured as a text ranking task. The first step was to explore classical methods, also known as sparse retrievers, and to test those algorithms on our evaluation dataset. These methods produced only minor differences in performance. The next step was to employ deep language models, namely the BERT based architectures. A variety of techniques and designs were considered. First, we tackled the lack of data to train such models on the French language, followed by the definition of the problem (classification or ranking), and finally, we addressed the problem of limited text length in...
International audienceThis paper describes the EQueR-EVALDA Evaluation Campaign, the French evaluati...
Standard bag-of-words term-matching techniques in document retrieval fail to exploit rich semantic i...
The complexity for being at the forefront regarding information retrieval systems are constantly inc...
International audienceDuring this internship, we worked on improving an open domain question answeri...
International audienceThis chapter is dedicated to factual question answering, i.e. extracting preci...
Title from PDF of title page viewed June 14, 2021Thesis advisor: Yugyung LeeVitaIncludes bibliograph...
This paper proposes to tackle Question Answering on a specific domain by developing a multi-tier sys...
World Wide Web search engines process millions of queries per day from users all over the world. Eff...
Natural Language Understanding is one of the most challenging objectives of Artificial Intelligence....
Although traditional search engines can retrieval thousands or millions of web links related to inpu...
Open-domain question answering (OpenQA) is an essential but challenging task in natural language pro...
Having computer systems capable of answering questions has been a goal within Natural Language Proce...
This thesis describes advancements in question answering along three general directions: model archi...
People rely on data to understand the world and inform their decision-making. However, effective acc...
We review the EfficientQA competition from NeurIPS 2020. The competition focused on open-domain ques...
International audienceThis paper describes the EQueR-EVALDA Evaluation Campaign, the French evaluati...
Standard bag-of-words term-matching techniques in document retrieval fail to exploit rich semantic i...
The complexity for being at the forefront regarding information retrieval systems are constantly inc...
International audienceDuring this internship, we worked on improving an open domain question answeri...
International audienceThis chapter is dedicated to factual question answering, i.e. extracting preci...
Title from PDF of title page viewed June 14, 2021Thesis advisor: Yugyung LeeVitaIncludes bibliograph...
This paper proposes to tackle Question Answering on a specific domain by developing a multi-tier sys...
World Wide Web search engines process millions of queries per day from users all over the world. Eff...
Natural Language Understanding is one of the most challenging objectives of Artificial Intelligence....
Although traditional search engines can retrieval thousands or millions of web links related to inpu...
Open-domain question answering (OpenQA) is an essential but challenging task in natural language pro...
Having computer systems capable of answering questions has been a goal within Natural Language Proce...
This thesis describes advancements in question answering along three general directions: model archi...
People rely on data to understand the world and inform their decision-making. However, effective acc...
We review the EfficientQA competition from NeurIPS 2020. The competition focused on open-domain ques...
International audienceThis paper describes the EQueR-EVALDA Evaluation Campaign, the French evaluati...
Standard bag-of-words term-matching techniques in document retrieval fail to exploit rich semantic i...
The complexity for being at the forefront regarding information retrieval systems are constantly inc...