Similarity queries are fundamental operations that are used extensively in many modern applications, whereas disk ar-rays are powerful storage media of increasing importance. The basic trade-off in similarity query processing in such a system is that increased parallelism leads to higher resource consumptions and low throughput, whereas low parallelism leads to higher response times. Here, we propose a tech-nique which is based on a careful investigation of the cur-rently available data in order to exploit parallelism up to a point, retaining low response times during query processing. The underlying access method is a variation of the R*-tree, which is distributed among the components of a disk array, whereas the system is simulated using ...
Data analysts spend more than 80% of time on data cleaning and integration in the whole process of d...
Abstract. Query-by-content, by means of similarity search, is a funda-mental operation for applicati...
Despite many research efforts, similarity queries are still poorly supported by current systems. We ...
A similarity query is to find from a collection of items those that are similar to a given query ite...
Several methods exists for performing similarity searches quickly using metric indexing. However, mo...
In a multiple disk environment it is desirable to have techniques for efficient parallel execution o...
This work examines the possibilities of employing highly parallel architectures in database systems,...
Metric databases are databases where a metric distance function is defined for pairs of database obj...
Most similarity search techniques map the data objects into some high-dimensional feature space. The...
Most similarity search techniques map the data objects into some high-dimensional feature space. The...
Advances in processor and network technologies have catalyzed the growth of data intensive applicati...
Today, a myriad of data sources, from the Internet to business operations to scientific instruments,...
Abstract. Similarity queries searching for the most similar objects in a database compared to a give...
Due to the increasing complexity of current digital data, similarity search has become a fundamental...
Similarity search is important for many data-intensive applications to identify a set of similar obj...
Data analysts spend more than 80% of time on data cleaning and integration in the whole process of d...
Abstract. Query-by-content, by means of similarity search, is a funda-mental operation for applicati...
Despite many research efforts, similarity queries are still poorly supported by current systems. We ...
A similarity query is to find from a collection of items those that are similar to a given query ite...
Several methods exists for performing similarity searches quickly using metric indexing. However, mo...
In a multiple disk environment it is desirable to have techniques for efficient parallel execution o...
This work examines the possibilities of employing highly parallel architectures in database systems,...
Metric databases are databases where a metric distance function is defined for pairs of database obj...
Most similarity search techniques map the data objects into some high-dimensional feature space. The...
Most similarity search techniques map the data objects into some high-dimensional feature space. The...
Advances in processor and network technologies have catalyzed the growth of data intensive applicati...
Today, a myriad of data sources, from the Internet to business operations to scientific instruments,...
Abstract. Similarity queries searching for the most similar objects in a database compared to a give...
Due to the increasing complexity of current digital data, similarity search has become a fundamental...
Similarity search is important for many data-intensive applications to identify a set of similar obj...
Data analysts spend more than 80% of time on data cleaning and integration in the whole process of d...
Abstract. Query-by-content, by means of similarity search, is a funda-mental operation for applicati...
Despite many research efforts, similarity queries are still poorly supported by current systems. We ...