Standardized approaches to relevance classification in information retrieval use generative statistical models to identify the presence or absence of certain topics that might make a document relevant to the searcher. These approaches have been used to better predict relevance on the basis of what the document is “about”, rather than a simple-minded analysis of the bag of words contained within the document. In more recent times, this idea has been extended by using pre-trained deep learning models and text representations, such as GloVe or BERT. These use an external corpus as a knowledge-base that conditions the model to help predict what a document is about. This paper adopts a hybrid approach that leverages the structure of knowledge em...
The Internet is a remarkably complex technical system. Its rapid growth has also brought technical i...
This thesis presents a series of conceptual and empirical developments on the ranking and retrieval ...
In this thesis, we study methods to leverage information from fully-structured knowledge bases (KBs...
Standardized approaches to relevance classification in information retrieval use generative statisti...
Standardized approaches to relevance classification in information retrieval use generative statisti...
This paper presents a deep semantic simi-larity model (DSSM), a special type of deep neural networks...
This paper presents a deep semantic simi-larity model (DSSM), a special type of deep neural networks...
Text documents can be described by a number of abstract concepts such as semantic category, writing ...
The proliferation of deliverable knowledge on the web, along with the rapidly increasing number of a...
International audienceThe extraction and the disambiguation of knowledge guided by textual resources...
Measuring relevance of documents with respect to a user’s query is at the heart of information retri...
International audienceThe extraction and the disambiguation of knowledge guided by textual resources...
Standard bag-of-words term-matching techniques in document retrieval fail to exploit rich semantic i...
The discovery of useful data for a given problem is of primary importance since data scientists usua...
Document classification or categorization with algorithms is a well-known problem in information sci...
The Internet is a remarkably complex technical system. Its rapid growth has also brought technical i...
This thesis presents a series of conceptual and empirical developments on the ranking and retrieval ...
In this thesis, we study methods to leverage information from fully-structured knowledge bases (KBs...
Standardized approaches to relevance classification in information retrieval use generative statisti...
Standardized approaches to relevance classification in information retrieval use generative statisti...
This paper presents a deep semantic simi-larity model (DSSM), a special type of deep neural networks...
This paper presents a deep semantic simi-larity model (DSSM), a special type of deep neural networks...
Text documents can be described by a number of abstract concepts such as semantic category, writing ...
The proliferation of deliverable knowledge on the web, along with the rapidly increasing number of a...
International audienceThe extraction and the disambiguation of knowledge guided by textual resources...
Measuring relevance of documents with respect to a user’s query is at the heart of information retri...
International audienceThe extraction and the disambiguation of knowledge guided by textual resources...
Standard bag-of-words term-matching techniques in document retrieval fail to exploit rich semantic i...
The discovery of useful data for a given problem is of primary importance since data scientists usua...
Document classification or categorization with algorithms is a well-known problem in information sci...
The Internet is a remarkably complex technical system. Its rapid growth has also brought technical i...
This thesis presents a series of conceptual and empirical developments on the ranking and retrieval ...
In this thesis, we study methods to leverage information from fully-structured knowledge bases (KBs...