This dissertation studies the incorporation of monotonicity constraints as a type of domain knowledge into a data mining process. Monotonicity constraints are enforced at two stages¿data preparation and data modeling. The main contributions of the research are a novel procedure to test the degree of monotonicity of a real data set, a greedy algorithm to transform non-monotone into monotone data, and extended and novel approaches for building monotone decision models. The results from simulation and real case studies show that enforcing monotonicity can considerably improve knowledge discovery and facilitate the decision-making process for end-users by deriving more accurate, stable and plausible decision models
Growing popularity of the Internet and innovative storage technology have caused a true data explosi...
Abstract. Generator of hypotheses is a new method for data mining. It makes possible to classify the...
Longitudinal datasets contain repeated measurements of the same variables at different points in tim...
This thesis describes a number of new data mining algorithms which were the result of our research i...
The objective of data mining is the extraction of knowledge from databases. In practice, one often e...
textabstractThe monotonicity property is ubiquitous in our lives and it appears in different roles: ...
textabstractThe monotonicity constraint is a common side condition imposed on modeling problems as d...
Abstract. We present a simple Data Mining Logic (DML) that can express common data mining tasks, lik...
In many application areas of machine learning, prior knowledge concerning the monotonicity of relati...
In many real world applications classification models are required to be in line with domain knowled...
The assessment of knowledge derived from databases depends on many factors. Decision makers often ne...
textabstractFor classification problems with ordinal attributes very often the class attribute shoul...
In many data mining applications, it is a priori known that the target function should satisfy certa...
Machine learning algorithms (learners) are typically expected to produce monotone learning curves, m...
Many data mining algorithms do not make use of existing domain knowledge when constructing their mod...
Growing popularity of the Internet and innovative storage technology have caused a true data explosi...
Abstract. Generator of hypotheses is a new method for data mining. It makes possible to classify the...
Longitudinal datasets contain repeated measurements of the same variables at different points in tim...
This thesis describes a number of new data mining algorithms which were the result of our research i...
The objective of data mining is the extraction of knowledge from databases. In practice, one often e...
textabstractThe monotonicity property is ubiquitous in our lives and it appears in different roles: ...
textabstractThe monotonicity constraint is a common side condition imposed on modeling problems as d...
Abstract. We present a simple Data Mining Logic (DML) that can express common data mining tasks, lik...
In many application areas of machine learning, prior knowledge concerning the monotonicity of relati...
In many real world applications classification models are required to be in line with domain knowled...
The assessment of knowledge derived from databases depends on many factors. Decision makers often ne...
textabstractFor classification problems with ordinal attributes very often the class attribute shoul...
In many data mining applications, it is a priori known that the target function should satisfy certa...
Machine learning algorithms (learners) are typically expected to produce monotone learning curves, m...
Many data mining algorithms do not make use of existing domain knowledge when constructing their mod...
Growing popularity of the Internet and innovative storage technology have caused a true data explosi...
Abstract. Generator of hypotheses is a new method for data mining. It makes possible to classify the...
Longitudinal datasets contain repeated measurements of the same variables at different points in tim...