The effective grouping, or partitioning, of semistructured data is of fundamental importance when providing support for queries. Partitions allow items within the data set that share common structural properties to be identified efficiently. This allows queries that make use of these properties, such as branching path expressions, to be accelerated. Here, we evaluate the effectiveness of several partitioning techniques by establishing the number of partitions that each scheme can identify over a given data set. In particular, we explore the use of parameterised indexes, based upon the notion of forward and backward bisimilarity, as a means of partitioning semistructured data; demonstrating that even restricted instances of such indexes can ...
During the last decade, stability-based measures became popular in order to validate the results of ...
Dividing a set S $\mathcal{S} = \{x_i=(x_1^{(i)}+\dots+x_n^{(i)})^T \in \mathbb{R}^n:i=1,\dots,m\}$ ...
When a phenomenon is described by a parametric model and multiple datasets are available, a key prob...
The effective grouping, or partitioning, of semistructured data is of fundamental importance when pr...
Over the last decades, a great variety of data mining techniques have been developed to reach goals ...
Growing user expectations of anywhere, anytime access to information require new types of data repre...
Thesis: M. Eng., Massachusetts Institute of Technology, Department of Electrical Engineering and Com...
Problems in data analysis often require the unsupervised partitioning of a data set into clusters. M...
The purpose of discussed optimal valid partitioning (OVP) methods is uncovering of ordinal or conti...
Graphical models are one of the most prominent frameworks to model complex systems and efficiently q...
We prove a theorem on partitioning point sets in Ed (d fixed) and give an efficient construction of ...
A process of similar data items into groups is called data clustering. Partitioning a Data Set into ...
When a phenomenon is described by a parametric model and multiple datasets are available, a key prob...
This book summarizes the state-of-the-art in partitional clustering. Clustering, the unsupervised cl...
When a phenomenon is described by a parametric model and multiple datasets are available, a key prob...
During the last decade, stability-based measures became popular in order to validate the results of ...
Dividing a set S $\mathcal{S} = \{x_i=(x_1^{(i)}+\dots+x_n^{(i)})^T \in \mathbb{R}^n:i=1,\dots,m\}$ ...
When a phenomenon is described by a parametric model and multiple datasets are available, a key prob...
The effective grouping, or partitioning, of semistructured data is of fundamental importance when pr...
Over the last decades, a great variety of data mining techniques have been developed to reach goals ...
Growing user expectations of anywhere, anytime access to information require new types of data repre...
Thesis: M. Eng., Massachusetts Institute of Technology, Department of Electrical Engineering and Com...
Problems in data analysis often require the unsupervised partitioning of a data set into clusters. M...
The purpose of discussed optimal valid partitioning (OVP) methods is uncovering of ordinal or conti...
Graphical models are one of the most prominent frameworks to model complex systems and efficiently q...
We prove a theorem on partitioning point sets in Ed (d fixed) and give an efficient construction of ...
A process of similar data items into groups is called data clustering. Partitioning a Data Set into ...
When a phenomenon is described by a parametric model and multiple datasets are available, a key prob...
This book summarizes the state-of-the-art in partitional clustering. Clustering, the unsupervised cl...
When a phenomenon is described by a parametric model and multiple datasets are available, a key prob...
During the last decade, stability-based measures became popular in order to validate the results of ...
Dividing a set S $\mathcal{S} = \{x_i=(x_1^{(i)}+\dots+x_n^{(i)})^T \in \mathbb{R}^n:i=1,\dots,m\}$ ...
When a phenomenon is described by a parametric model and multiple datasets are available, a key prob...