We assess the suitability of word embeddings for practical information retrieval scenarios. Thus, we assume that users issue ad-hoc short queries where we return the first twenty retrieved documents after applying a boolean matching operation between the query and the documents. We compare the performance of several techniques that leverage word embeddings in the retrieval models to compute the similarity between the query and the documents, namely word centroid similarity, paragraph vectors, Word Mover’s distance, as well as our novel inverse document frequency (IDF) re-weighted word centroid similarity. We evaluate the performance using the ranking metrics mean average precision, mean reciprocal rank, and normalized discounted cumulative ...
We propose a method named WordRank to extract a few salient words from the target document and then ...
International audienceTraditional Information Retrieval (IR) models are based on bag-of-words paradi...
© 2017 IEEE. Query expansion has been widely used to select additional words that are related to the...
We assess the suitability of word embeddings for practical information retrieval scenarios. Thus, we...
Measuring the semantic similarity of texts has a vital role in various tasks from the field of natur...
Measuring the semantic similarity of texts has a vital role in various tasks from the field of natur...
The need for an efficient method to find the furthermost appropriate document corresponding to a par...
Abstract. Measuring the similarity between documents and queries has been extensively studied in inf...
Evaluating semantic similarity of texts is a task that assumes paramount importance in real-world ap...
Measuring document similarity is important in order to find documents which are similar to a given q...
Document similarity search is to find documents similar to a given query document and return a ranke...
International audienceWord Embeddings (WE) have recently imposed themselves as a standard for repres...
Determining semantic similarity between texts is important in many tasks in information retrieval su...
Particularly, information retrieval resultsas documents are typically too extensive.Consequently, a ...
Top-k words selection is a technique used to detect and return the k most similar words to a given w...
We propose a method named WordRank to extract a few salient words from the target document and then ...
International audienceTraditional Information Retrieval (IR) models are based on bag-of-words paradi...
© 2017 IEEE. Query expansion has been widely used to select additional words that are related to the...
We assess the suitability of word embeddings for practical information retrieval scenarios. Thus, we...
Measuring the semantic similarity of texts has a vital role in various tasks from the field of natur...
Measuring the semantic similarity of texts has a vital role in various tasks from the field of natur...
The need for an efficient method to find the furthermost appropriate document corresponding to a par...
Abstract. Measuring the similarity between documents and queries has been extensively studied in inf...
Evaluating semantic similarity of texts is a task that assumes paramount importance in real-world ap...
Measuring document similarity is important in order to find documents which are similar to a given q...
Document similarity search is to find documents similar to a given query document and return a ranke...
International audienceWord Embeddings (WE) have recently imposed themselves as a standard for repres...
Determining semantic similarity between texts is important in many tasks in information retrieval su...
Particularly, information retrieval resultsas documents are typically too extensive.Consequently, a ...
Top-k words selection is a technique used to detect and return the k most similar words to a given w...
We propose a method named WordRank to extract a few salient words from the target document and then ...
International audienceTraditional Information Retrieval (IR) models are based on bag-of-words paradi...
© 2017 IEEE. Query expansion has been widely used to select additional words that are related to the...