Exponentially increasing data volumes, coupled with new modes of analysis have created significant new opportunities for data scientists. However, the stochastic nature of many data science techniques results in tradeoffs between costs and accuracy. For example, machine learning algorithms can be trained iteratively and indefinitely with diminishing returns in terms of accuracy. In this paper we explore the cost-accuracy tradeoff through three representative examples: we vary the number of models in an ensemble, the number of epochs used to train a machine learning model, and the amount of data used to train a machine learning model. We highlight the feasibility and benefits of being able to measure, quantify, and predict cost accuracy trad...
How to assess the performance of machine learning algorithms is a problem of increasing interest an...
International audienceStrong empirical evidence that one machine-learning algorithm A outperforms an...
Practitioners of data mining and machine learning have long observed that the imbalance of classes i...
Exponentially increasing data volumes, coupled with new modes of analysis have created significant n...
The last several years have seen the emergence of datasets of an unprecedented scale, and solving va...
Classification is a well-studied problem in machine learning and data mining. Classifier performance...
Applying machine learning to real problems is non-trivial because many important steps are needed to...
m-Estimation represents a broad class of estimators, including least-squares and maximum likelihood,...
In many machine learning domains, misclassification costs are different for different examples, in t...
The capability of effectively quantifying the uncertainty associated to a given prediction is an imp...
When the amount of data is reasonably small, we can usually fit this data to a simple model and use ...
This work proposes a way to align statistical modeling with decision making. We provide a method tha...
classification, quantification, cost quantification, text mining This paper promotes a new task for ...
Abstract. How to assess the performance of machine learning algorithms is a problem of increasing in...
Estimating quantities is an important everyday task. We analyzed the performance of various estimati...
How to assess the performance of machine learning algorithms is a problem of increasing interest an...
International audienceStrong empirical evidence that one machine-learning algorithm A outperforms an...
Practitioners of data mining and machine learning have long observed that the imbalance of classes i...
Exponentially increasing data volumes, coupled with new modes of analysis have created significant n...
The last several years have seen the emergence of datasets of an unprecedented scale, and solving va...
Classification is a well-studied problem in machine learning and data mining. Classifier performance...
Applying machine learning to real problems is non-trivial because many important steps are needed to...
m-Estimation represents a broad class of estimators, including least-squares and maximum likelihood,...
In many machine learning domains, misclassification costs are different for different examples, in t...
The capability of effectively quantifying the uncertainty associated to a given prediction is an imp...
When the amount of data is reasonably small, we can usually fit this data to a simple model and use ...
This work proposes a way to align statistical modeling with decision making. We provide a method tha...
classification, quantification, cost quantification, text mining This paper promotes a new task for ...
Abstract. How to assess the performance of machine learning algorithms is a problem of increasing in...
Estimating quantities is an important everyday task. We analyzed the performance of various estimati...
How to assess the performance of machine learning algorithms is a problem of increasing interest an...
International audienceStrong empirical evidence that one machine-learning algorithm A outperforms an...
Practitioners of data mining and machine learning have long observed that the imbalance of classes i...