This paper introduces a novel method for obtaining increased predictive performance from transparent models in situations where production input vectors are available when building the model. First, labeled training data is used to build a powerful opaque model, called an oracle. Second, the oracle is applied to production instances, generating predicted target values, which are used as labels. Finally, these newly labeled instances are utilized, in different combinations with normal training data, when inducing a transparent model. Experimental results, on 26 UCI data sets, show that the use of oracle coaches significantly improves predictive performance, compared to standard model induction. Most importantly, both accuracy and AUC results...
Knowledge discovered in a database must be represented in a form that is easy to understand. Small, ...
Associated research group: Critical Systems Research GroupThe oracle--a judge of the correctness of ...
Abstract. We evaluate the power of decision tables as a hypothesis space for supervised learning alg...
This paper introduces a novel method for obtaining increased predictive performance from transparent...
In many real-world scenarios, predictive modelsneed to be interpretable, thus ruling out many machin...
In real-world scenarios, interpretable models are often required to explain predictions, and to allo...
Abstract—Some data mining problems require predictivemodels to be not only accurate but also compreh...
Abstract—Random forest is an often used ensemble technique, renowned for its high predictive perform...
Abstract—The primary goal of predictive modeling is to achieve high accuracy when the model is appli...
In this study, the task of obtaining accurate andcomprehensible concept descriptions of a specific s...
Nowadays, crowdsourcing is being widely used to collect training data for solving classification pro...
We look at a specific aspect of model interpretability: models often need to be constrained in size ...
In many real-life problems, obtaining labelled data can be a very expensive and laborious task, whil...
Classification is an important data mining problem. Given a training database of records, each tagge...
Transparency has become a key desideratum of machine learning. Properties such as interpretability o...
Knowledge discovered in a database must be represented in a form that is easy to understand. Small, ...
Associated research group: Critical Systems Research GroupThe oracle--a judge of the correctness of ...
Abstract. We evaluate the power of decision tables as a hypothesis space for supervised learning alg...
This paper introduces a novel method for obtaining increased predictive performance from transparent...
In many real-world scenarios, predictive modelsneed to be interpretable, thus ruling out many machin...
In real-world scenarios, interpretable models are often required to explain predictions, and to allo...
Abstract—Some data mining problems require predictivemodels to be not only accurate but also compreh...
Abstract—Random forest is an often used ensemble technique, renowned for its high predictive perform...
Abstract—The primary goal of predictive modeling is to achieve high accuracy when the model is appli...
In this study, the task of obtaining accurate andcomprehensible concept descriptions of a specific s...
Nowadays, crowdsourcing is being widely used to collect training data for solving classification pro...
We look at a specific aspect of model interpretability: models often need to be constrained in size ...
In many real-life problems, obtaining labelled data can be a very expensive and laborious task, whil...
Classification is an important data mining problem. Given a training database of records, each tagge...
Transparency has become a key desideratum of machine learning. Properties such as interpretability o...
Knowledge discovered in a database must be represented in a form that is easy to understand. Small, ...
Associated research group: Critical Systems Research GroupThe oracle--a judge of the correctness of ...
Abstract. We evaluate the power of decision tables as a hypothesis space for supervised learning alg...