We develop a new method to rank the degree of similarity between Boolean expressions, contrast it with other known methods, and describe its implementation. Our method reduces time and space complexity from exponential to polynomial in the number of Boolean terms. Index Terms - Boolean query, information retrieval, ranking, resource discovery, similarity measure. 1 Introduction Most library information systems let users make Boolean queries against their database. Internet resource discovery systems, such as WAIS [1] and our Indie [2], also support Boolean queries. Frequently, users find it convenient if the retrieval system returns the answers to their queries in a ranked order. This paper develops an efficient algorithm to rank the simi...
The National Library of Medicine's (NLM) IRVIS project has been evaluating “similarity ranking”...
Abstract: Query clustering is a task that groups similar queries automatically without using predet...
Dataset search engines help scientists to find research datasets for scientific experiments. Current...
As the number of Internet servers increases rapidly, it becomes difficult to determine the relevant ...
In Information Retrieval (IR), Data Mining (DM), and Machine Learning (ML), similarity measures have...
An information retrieval (IR) process begins when a user enters a query into the system. Queries are...
We propose a way of measuring the similarity of a Boolean vector to a given set of Boolean vectors, ...
AbstractWe propose a way of measuring the similarity of a Boolean vector to a given set of Boolean v...
Metric databases are databases where a metric distance function is defined for pairs of database obj...
Abstract. To automatically retrieve documents or images from a database, retrieval systems use simil...
We present a framework for discovering sets of web queries having similar latent needs, called searc...
Many application scenarios, e.g., marketing analysis, sensor networks, and medical and biological ap...
We present a framework for discovering sets of web queries having similar latent needs, called searc...
A similarity query is to find from a collection of items those that are similar to a given query ite...
Semantic Similarity measures between words plays an important role in information retrieval, natural...
The National Library of Medicine's (NLM) IRVIS project has been evaluating “similarity ranking”...
Abstract: Query clustering is a task that groups similar queries automatically without using predet...
Dataset search engines help scientists to find research datasets for scientific experiments. Current...
As the number of Internet servers increases rapidly, it becomes difficult to determine the relevant ...
In Information Retrieval (IR), Data Mining (DM), and Machine Learning (ML), similarity measures have...
An information retrieval (IR) process begins when a user enters a query into the system. Queries are...
We propose a way of measuring the similarity of a Boolean vector to a given set of Boolean vectors, ...
AbstractWe propose a way of measuring the similarity of a Boolean vector to a given set of Boolean v...
Metric databases are databases where a metric distance function is defined for pairs of database obj...
Abstract. To automatically retrieve documents or images from a database, retrieval systems use simil...
We present a framework for discovering sets of web queries having similar latent needs, called searc...
Many application scenarios, e.g., marketing analysis, sensor networks, and medical and biological ap...
We present a framework for discovering sets of web queries having similar latent needs, called searc...
A similarity query is to find from a collection of items those that are similar to a given query ite...
Semantic Similarity measures between words plays an important role in information retrieval, natural...
The National Library of Medicine's (NLM) IRVIS project has been evaluating “similarity ranking”...
Abstract: Query clustering is a task that groups similar queries automatically without using predet...
Dataset search engines help scientists to find research datasets for scientific experiments. Current...