International audienceTop-k keyword and top-k document extraction are very popular text analysis techniques. Top-k keywords and documents are often computed on-the-fly, but they exploit weighted vocabularies that are costly to build. To compare competing weighting schemes and database implementations, benchmarking is customary. To the best of our knowledge, no benchmark currently addresses these problems. Hence, in this paper, we present T²K², a top-k keywords and documents benchmark, and its decision support-oriented evolution T²K²D². Both benchmarks feature a real tweet dataset and queries with various complexities and selectivities. They help evaluate weighting schemes and database implementations in terms of computing performance. To il...
A lot of improvement has gone in the area of information retrieval. But, still improvements can be d...
Keyword search is the most popular technique for querying large tree-structured datasets, often of u...
While there have been several studies related to the effect of term weighting on classification accu...
International audienceTop-k keyword and top-k document extraction are very popular text analysis tec...
International audienceInformation retrieval from textual data focuses on the construction of vocabul...
International audienceExtracting top-k keywords and documents using weighting schemes are popular te...
This thesis focuses on top-k document retrieval. The study of such query evaluation method is motiva...
Abstract: This paper is about data extraction from top-k web pages, which explain top k occurrences ...
The research on optimization of top-k SPARQL query would largely benefit from the establishment of a...
International audienceThe general problem of answering top-k queries can be modeled using lists of d...
Abstract—The development of search engines is taking at a very fast rate. A lot of algorithms have b...
Abstract. The research on optimization of top-k SPARQL query would largely benefit from the establis...
Abstract. The research on optimization of top-k SPARQL query would largely benefit from the establis...
textabstractThe TPC-D benchmark was developed almost 20 years ago, and even though its current exist...
Text search engines return a set of k documents ranked by similarity to a query. Typically, document...
A lot of improvement has gone in the area of information retrieval. But, still improvements can be d...
Keyword search is the most popular technique for querying large tree-structured datasets, often of u...
While there have been several studies related to the effect of term weighting on classification accu...
International audienceTop-k keyword and top-k document extraction are very popular text analysis tec...
International audienceInformation retrieval from textual data focuses on the construction of vocabul...
International audienceExtracting top-k keywords and documents using weighting schemes are popular te...
This thesis focuses on top-k document retrieval. The study of such query evaluation method is motiva...
Abstract: This paper is about data extraction from top-k web pages, which explain top k occurrences ...
The research on optimization of top-k SPARQL query would largely benefit from the establishment of a...
International audienceThe general problem of answering top-k queries can be modeled using lists of d...
Abstract—The development of search engines is taking at a very fast rate. A lot of algorithms have b...
Abstract. The research on optimization of top-k SPARQL query would largely benefit from the establis...
Abstract. The research on optimization of top-k SPARQL query would largely benefit from the establis...
textabstractThe TPC-D benchmark was developed almost 20 years ago, and even though its current exist...
Text search engines return a set of k documents ranked by similarity to a query. Typically, document...
A lot of improvement has gone in the area of information retrieval. But, still improvements can be d...
Keyword search is the most popular technique for querying large tree-structured datasets, often of u...
While there have been several studies related to the effect of term weighting on classification accu...