An important topic in machine-learning / data-mining is that of analyzing binary datasets. A binary dataset consists of a subset of n-vectors (observations) with binary components, each of which has an associated binary outcome (the class of the observation). Clearly, the set of n-vectors and their outcomes represent a partially defined Boolean function. The central problem of machine-learning / data-mining, the so-called classification problem, consists in finding an "extension" of the partially defined Boolean function closely approximating a hidden ("target") function. Various methods have been developed to solve this and related problems, such as identifying misclassified observations, revealing irrelevant and/or redundant variables, et...
This thesis studies the generalization behavior of algorithms in Sample Compression Settings. It ext...
Based on mixture models, we present a Bayesian method (called BClass) to classify biological entitie...
We consider data sets that consist of n-dimensional binary vectors representing positive and negativ...
We devise a feature selection method in terms of a follow-out utility of a special classification pr...
Motivation: Combinatorial effects, in which several variables jointly influence an output or respons...
International audienceWe are designing new data mining techniques on boolean contexts to identify a ...
Studied are differences of two approaches targeted to reveal latent variables in binary data. These ...
We present a new process of analyzing data to determine critical at-tributes in a classification pro...
The attention towards binary data coding increased consistently in the last decade due to several re...
Motivation: Combinatorial effects, in which several variables jointly influence an output or respons...
Boolean implications (if-then rules) provide a conceptually simple, uniform and highly scalable way ...
Boolean implications (if-then rules) provide a conceptually simple, uniform and highly scalable way ...
This paper develops an alternative method for gene selection that combines model based clustering an...
Finding methods to increase the complexity of the Boolean discriminant functions and to stay within ...
Motivation. Binary classification is a common problem in many types of research including clinical a...
This thesis studies the generalization behavior of algorithms in Sample Compression Settings. It ext...
Based on mixture models, we present a Bayesian method (called BClass) to classify biological entitie...
We consider data sets that consist of n-dimensional binary vectors representing positive and negativ...
We devise a feature selection method in terms of a follow-out utility of a special classification pr...
Motivation: Combinatorial effects, in which several variables jointly influence an output or respons...
International audienceWe are designing new data mining techniques on boolean contexts to identify a ...
Studied are differences of two approaches targeted to reveal latent variables in binary data. These ...
We present a new process of analyzing data to determine critical at-tributes in a classification pro...
The attention towards binary data coding increased consistently in the last decade due to several re...
Motivation: Combinatorial effects, in which several variables jointly influence an output or respons...
Boolean implications (if-then rules) provide a conceptually simple, uniform and highly scalable way ...
Boolean implications (if-then rules) provide a conceptually simple, uniform and highly scalable way ...
This paper develops an alternative method for gene selection that combines model based clustering an...
Finding methods to increase the complexity of the Boolean discriminant functions and to stay within ...
Motivation. Binary classification is a common problem in many types of research including clinical a...
This thesis studies the generalization behavior of algorithms in Sample Compression Settings. It ext...
Based on mixture models, we present a Bayesian method (called BClass) to classify biological entitie...
We consider data sets that consist of n-dimensional binary vectors representing positive and negativ...