International audienceExisting systems dealing with the increasing volume of data series cannot guarantee interactive response times, even for fundamental tasks such as similarity search. Therefore, it is necessary to develop analytic approaches that support exploration and decision making by providing progressive results, before the final and exact ones have been computed. Prior works lack both efficiency and accuracy when applied to large-scale data series collections. We present and experimentally evaluate ProS, a new probabilistic learning-based method that provides quality guarantees for progressive Nearest Neighbor (NN) query answering. We develop our method for k-NN queries and demonstrate how it can be applied with the two most popu...
Abstract: The k-Nearest Neighbor (k-NN) is very simple and powerful approach to conceptually approxi...
Similarity search is one of the most studied research fields in data mining. Given a query data poin...
Many real applications consume data that is intrinsically uncertain, noisy and error-prone. In this ...
International audienceExisting systems dealing with the increasing volume of data series cannot guar...
Existing systems dealing with the increasing volume of data series cannot guarantee interactive resp...
International audienceExisting systems dealing with the increasing volume of data series cannot guar...
International audienceTime series data are increasing at a dramatic rate, yet their analysis remains...
International audienceWe present a progressive algorithm for approximate k-nearest neighbor search. ...
International audienceWe present PANENE, a progressive algorithm for approximate nearest neighbor in...
As databases increasingly integrate different types of information such as time-series, multimedia a...
We introduce a new probabilistic proximity search algorithm for range and K-nearest neighbor (K-NN) ...
In this paper, we present continuous research on data analysis based on our previous work on similar...
This paper presents two metrics for the Nearest Neighbor Classifier that share the property of being...
Abstract The nearest neighbour (NN) classification rule is usuallychosen in a large number of patter...
AbstractMany methods have been proposed to measure the similarity between time series data sets, eac...
Abstract: The k-Nearest Neighbor (k-NN) is very simple and powerful approach to conceptually approxi...
Similarity search is one of the most studied research fields in data mining. Given a query data poin...
Many real applications consume data that is intrinsically uncertain, noisy and error-prone. In this ...
International audienceExisting systems dealing with the increasing volume of data series cannot guar...
Existing systems dealing with the increasing volume of data series cannot guarantee interactive resp...
International audienceExisting systems dealing with the increasing volume of data series cannot guar...
International audienceTime series data are increasing at a dramatic rate, yet their analysis remains...
International audienceWe present a progressive algorithm for approximate k-nearest neighbor search. ...
International audienceWe present PANENE, a progressive algorithm for approximate nearest neighbor in...
As databases increasingly integrate different types of information such as time-series, multimedia a...
We introduce a new probabilistic proximity search algorithm for range and K-nearest neighbor (K-NN) ...
In this paper, we present continuous research on data analysis based on our previous work on similar...
This paper presents two metrics for the Nearest Neighbor Classifier that share the property of being...
Abstract The nearest neighbour (NN) classification rule is usuallychosen in a large number of patter...
AbstractMany methods have been proposed to measure the similarity between time series data sets, eac...
Abstract: The k-Nearest Neighbor (k-NN) is very simple and powerful approach to conceptually approxi...
Similarity search is one of the most studied research fields in data mining. Given a query data poin...
Many real applications consume data that is intrinsically uncertain, noisy and error-prone. In this ...