We assess the suitability of word embeddings for practical information retrieval scenarios. Thus, we assume that users issue ad-hoc short queries where we return the first twenty retrieved documents after applying a boolean matching operation between the query and the documents. We compare the performance of several techniques that leverage word embeddings in the retrieval models to compute the similarity between the query and the documents, namely word centroid similarity, paragraph vectors, Word Mover’s distance, as well as our novel inverse document frequency (IDF) re-weighted word centroid similarity. We evaluate the performance using the ranking metrics mean average precision, mean reciprocal rank, and normalized discounted cumulative ...
International audienceThis paper focuses on question retrieval which is a crucial and tricky task in...
Document embeddings, created with methods ranging from simple heuristics to statistical and deep mod...
The need for an efficient method to find the furthermost appropriate document corresponding to a par...
We assess the suitability of word embeddings for practical information retrieval scenarios. Thus, we...
Recent advances in neural language models have contributed new methods for learning distributed vect...
Measuring the semantic similarity of texts has a vital role in various tasks from the field of natur...
We consider the following problem: given neural language models (embeddings) each of which is traine...
© 2017 IEEE. Query expansion has been widely used to select additional words that are related to the...
Measuring the semantic similarity of texts has a vital role in various tasks from the field of natur...
In the vector space model for information retrieval, term vectors are pair-wise orthogonal, that is,...
In this paper we implement a document retrieval system using the Lucene tool and we conduct some exp...
Evaluating semantic similarity of texts is a task that assumes paramount importance in real-world ap...
International audienceTraditional Information Retrieval (IR) models are based on bag-of-words paradi...
Determining semantic similarity between texts is important in many tasks in information retrieval su...
Word embeddings are useful in many tasks in Natural Language Processing and Information Retrieval, s...
International audienceThis paper focuses on question retrieval which is a crucial and tricky task in...
Document embeddings, created with methods ranging from simple heuristics to statistical and deep mod...
The need for an efficient method to find the furthermost appropriate document corresponding to a par...
We assess the suitability of word embeddings for practical information retrieval scenarios. Thus, we...
Recent advances in neural language models have contributed new methods for learning distributed vect...
Measuring the semantic similarity of texts has a vital role in various tasks from the field of natur...
We consider the following problem: given neural language models (embeddings) each of which is traine...
© 2017 IEEE. Query expansion has been widely used to select additional words that are related to the...
Measuring the semantic similarity of texts has a vital role in various tasks from the field of natur...
In the vector space model for information retrieval, term vectors are pair-wise orthogonal, that is,...
In this paper we implement a document retrieval system using the Lucene tool and we conduct some exp...
Evaluating semantic similarity of texts is a task that assumes paramount importance in real-world ap...
International audienceTraditional Information Retrieval (IR) models are based on bag-of-words paradi...
Determining semantic similarity between texts is important in many tasks in information retrieval su...
Word embeddings are useful in many tasks in Natural Language Processing and Information Retrieval, s...
International audienceThis paper focuses on question retrieval which is a crucial and tricky task in...
Document embeddings, created with methods ranging from simple heuristics to statistical and deep mod...
The need for an efficient method to find the furthermost appropriate document corresponding to a par...