The need for low bias algorithms in classification learning from large data sets

  • Brain, Damien
  • Webb, Geoffrey I.
Open PDF
Publication date
January 2002
Publisher
Springer Science and Business Media LLC

Abstract

This paper reviews the appropriateness for application to large data sets of standard machine learning algorithms, which were mainly developed in the context of small data sets. Sampling and parallelisation have proved useful means for reducing computation time when learning from large data sets. However, such methods assume that algorithms that were designed for use with what are now considered small data sets are also fundamentally suitable for large data sets. It is plausible that optimal learning from large data sets requires a different type of algorithm to optimal learning from small data sets. This paper investigates one respect in which data set size may affect the requirements of a learning algorithm — the bias plus variance ...

Extracted data

Loading...

Related items

The need for low bias algorithms in classification learning from large data sets
  • Damien Brain
  • Geoffrey I. Webb
January 2002

Abstract. This paper reviews the appropriateness for application to large data sets of standard mach...

Learning from large data : bias, variance, sampling, and learning curves
  • Brain, Damien.
January 2003

One of the fundamental machine learning tasks is that of predictive classification. Given that organ...

Fast machine learning algorithms for large data
  • Choudhury, A.
January 2002

Traditional machine learning has been largely concerned with developing techniques for small or mode...

We use cookies to provide a better user experience.