In many scientific tasks we are interested in discovering whether there exist any correlations in our data. This raises many questions, such as how to reliably and interpretably measure correlation between a multivariate set of attributes, how to do so without having to make assumptions on distribution of the data or the type of correlation, and, how to efficiently discover the top-most reliably correlated attribute sets from data. In this paper we answer these questions for discovery tasks in categorical data. In particular, we propose a corrected-for-chance, consistent, and efficient estimator for normalized total correlation, by which we obtain a reliable, naturally interpretable, non-parametric measure for correlation over multivaria...
Correlation determination brings out relationships in data that had not been seen before and it is i...
Statistical data mining refers to methods for identifying and validating interesting patterns from a...
Recently, there has been considerable interest in computing strongly correlated pairs in large datab...
In many scientific tasks we are interested in discovering whether there exist any correlations in ou...
textMachine learning, data mining, and statistical methods work by representing real-world objects i...
We introduce a method to learn a hierarchy of successively more abstract represen-tations of complex...
This paper addresses some of the foundational issues associated with discovering the best few corre-...
Recently, there has been considerable interest in efficiently computing strongly correlated pairs in...
Given a set of data objects, correlation computing refers to the problem of efficiently finding grou...
In this thesis, we develop novel methods for correlation analysis in multivariate data, with a speci...
We study the problem of mining correlated patterns. Correlated patterns have advantages over associa...
When can reliable inference be drawn in fue "Big Data" context? This paper presents a framework for ...
Existing research on mining quantitative databases mainly focuses on mining associations. However, m...
© Springer International Publishing AG 2016. There are different types of correlation patterns betwe...
Time-series data analysis is essential in many modern applications, such as financial markets, senso...
Correlation determination brings out relationships in data that had not been seen before and it is i...
Statistical data mining refers to methods for identifying and validating interesting patterns from a...
Recently, there has been considerable interest in computing strongly correlated pairs in large datab...
In many scientific tasks we are interested in discovering whether there exist any correlations in ou...
textMachine learning, data mining, and statistical methods work by representing real-world objects i...
We introduce a method to learn a hierarchy of successively more abstract represen-tations of complex...
This paper addresses some of the foundational issues associated with discovering the best few corre-...
Recently, there has been considerable interest in efficiently computing strongly correlated pairs in...
Given a set of data objects, correlation computing refers to the problem of efficiently finding grou...
In this thesis, we develop novel methods for correlation analysis in multivariate data, with a speci...
We study the problem of mining correlated patterns. Correlated patterns have advantages over associa...
When can reliable inference be drawn in fue "Big Data" context? This paper presents a framework for ...
Existing research on mining quantitative databases mainly focuses on mining associations. However, m...
© Springer International Publishing AG 2016. There are different types of correlation patterns betwe...
Time-series data analysis is essential in many modern applications, such as financial markets, senso...
Correlation determination brings out relationships in data that had not been seen before and it is i...
Statistical data mining refers to methods for identifying and validating interesting patterns from a...
Recently, there has been considerable interest in computing strongly correlated pairs in large datab...