this paper we argue that one can store semistructured data in relational format, by exploiting the regularities inherent in existing semistructured data instances. "Most" of the data will be stored in relational format: the outliers, and possible future insertions, will be still stored in a self-describing way. We propose to use data mining techniques to extract a "good" relational schema for a given semistructured data instance. Our algorithm accepts a variety of input parameters, such as maximum number of relations allowed, maximum number of attributes per relation, and, optionally, a collection of queries on the semistructured data for which the relational storage has to be optimized. Experimental results on the DBLP ...
We propose to use object-relational database management systems to store and manage semi-structured ...
JRelix is a relational database implementation that supports not only traditional relational algebra...
One example of semistructured data sources is the World Wide Web (WWW). In the semistructured world,...
Nowadays, relational databases have become the de facto standard to store large quantities of data. ...
Data is typically complex and relational. Therefore, the development of relational data mining metho...
Semistructured data is one of the new challenging research areas in the database community. We belie...
We introduce relational redescription mining, that is, the task of finding two structurally differen...
University of Technology, Sydney. Faculty of Information Technology.NO FULL TEXT AVAILABLE. Access i...
A major obstacle to fully integrated deployment of many data mining algorithms is the assumption tha...
One fundamental limitation of classical statistical modeling is the assumption that data is represen...
We discuss the use of database methods for data mining. Recently impressive results have been achiev...
Traditional database management requires design and ensures declarativity. In the context of semistr...
As data management applications grow more complex, they are likely to need efficient distributed que...
Motivated by an analogy with matrix factorization, we introduce the problem of factorizing relationa...
In the field of machine learning, methods for learning from single-table data have received much mor...
We propose to use object-relational database management systems to store and manage semi-structured ...
JRelix is a relational database implementation that supports not only traditional relational algebra...
One example of semistructured data sources is the World Wide Web (WWW). In the semistructured world,...
Nowadays, relational databases have become the de facto standard to store large quantities of data. ...
Data is typically complex and relational. Therefore, the development of relational data mining metho...
Semistructured data is one of the new challenging research areas in the database community. We belie...
We introduce relational redescription mining, that is, the task of finding two structurally differen...
University of Technology, Sydney. Faculty of Information Technology.NO FULL TEXT AVAILABLE. Access i...
A major obstacle to fully integrated deployment of many data mining algorithms is the assumption tha...
One fundamental limitation of classical statistical modeling is the assumption that data is represen...
We discuss the use of database methods for data mining. Recently impressive results have been achiev...
Traditional database management requires design and ensures declarativity. In the context of semistr...
As data management applications grow more complex, they are likely to need efficient distributed que...
Motivated by an analogy with matrix factorization, we introduce the problem of factorizing relationa...
In the field of machine learning, methods for learning from single-table data have received much mor...
We propose to use object-relational database management systems to store and manage semi-structured ...
JRelix is a relational database implementation that supports not only traditional relational algebra...
One example of semistructured data sources is the World Wide Web (WWW). In the semistructured world,...