Abstract. We consider multisplitting of numerical value ranges, a task that is encountered as a discretization step preceding induction and also embedded into learning algorithms. We are interested in finding the partition that optimizes the value of a given attribute evaluation function. For most commonly used evaluation functions this task takes quadratic time in the number of potential cut points in the numerical range. Hence, it is a potential bottleneck in data mining algorithms. We present two techniques that speed up the optimal multisplitting task. The first one aims at discarding cut point candidates in a quick linear-time preprocessing scan before embarking on the actual search. We generalize the definition of boundary points by F...
AbstractIn this paper we present a dynamic programming algorithm for finding optimal elimination tre...
Constraint Satisfaction and Optimization are important areas of Artificial Intelligence. However, i...
Multi-task learning has gained popularity due to the advantages it provides with respect to resource...
Often in supervised learning numerical attributes require special treatment and do not fit the learn...
Numerical data poses a problem to symbolic learning methods, since numerical value ranges inherently...
The efficiency of the otherwise expedient decision tree learning can be impaired in processing data-...
Data mining is the process of extracting informative patterns from data stored in a database or data...
Recently, multiresolution analysis and wavelets have had a great impact on computer algorithm design...
Many common approaches to detecting changepoints, for example based on statistical criteria such as ...
An important subproblem in supervised tasks such as decision tree induction and subgroup discovery i...
Given is a problem sequence and a probability distribution (the bias) on programs computing solution...
Maximization of submodular functions on a ground set is a NP-hard combinatorial optimization problem...
We explore in this paper efficient algorithmic solutions to robustsubspace segmentation. We propose ...
Learning general functional dependencies between arbitrary input and output spaces is one of the key...
We consider the problem of approximating a signal P with another signal F consisting of a few piecew...
AbstractIn this paper we present a dynamic programming algorithm for finding optimal elimination tre...
Constraint Satisfaction and Optimization are important areas of Artificial Intelligence. However, i...
Multi-task learning has gained popularity due to the advantages it provides with respect to resource...
Often in supervised learning numerical attributes require special treatment and do not fit the learn...
Numerical data poses a problem to symbolic learning methods, since numerical value ranges inherently...
The efficiency of the otherwise expedient decision tree learning can be impaired in processing data-...
Data mining is the process of extracting informative patterns from data stored in a database or data...
Recently, multiresolution analysis and wavelets have had a great impact on computer algorithm design...
Many common approaches to detecting changepoints, for example based on statistical criteria such as ...
An important subproblem in supervised tasks such as decision tree induction and subgroup discovery i...
Given is a problem sequence and a probability distribution (the bias) on programs computing solution...
Maximization of submodular functions on a ground set is a NP-hard combinatorial optimization problem...
We explore in this paper efficient algorithmic solutions to robustsubspace segmentation. We propose ...
Learning general functional dependencies between arbitrary input and output spaces is one of the key...
We consider the problem of approximating a signal P with another signal F consisting of a few piecew...
AbstractIn this paper we present a dynamic programming algorithm for finding optimal elimination tre...
Constraint Satisfaction and Optimization are important areas of Artificial Intelligence. However, i...
Multi-task learning has gained popularity due to the advantages it provides with respect to resource...