We consider an online learning problem (classification or prediction) involving disparate sources of sequentially arriving data, whereby a user over time learns the best set of data sources to use in constructing the classifier by exploiting their similarity. We first show that, when (1) the similarity information among data sources is known, and (2) data from different sources can be acquired without cost, then a judicious selection of data from different sources can effectively enlarge the training sample size compared to using a single data source, thereby improving the rate and performance of learning; this is achieved by bounding the classification error of the resulting classifier. We then relax assumption (1) and characterize the lo...
We consider the problem of choosing, sequentially, a map which assigns elements of a set A to a few ...
Designing online algorithms with machine learning predictions is a recent technique beyond the worst...
With an ever increasing demand on large scale data, difficulties exist in terms of processing and ut...
We consider online similarity prediction problems over networked data. We begin by relat-ing this ta...
A challenge for mining large-scale streaming data overlooked by most existing studies on online lear...
Relative similarity learning, as an important learning scheme for information retrieval, aims to lea...
Abstract. With the increasing volume of data in the world, the best approach for learning from this ...
We consider the problem of choosing, sequentially, a map which assigns elements of a set A to a few ...
We consider situations where training data is abundant and computing resources are comparatively sca...
Deep metric learning aims to learn a deep embedding that can capture the semantic similarity of data...
Learning, prediction and identification has been a main topic of interest in science and engineering...
Machine learning algorithms are now being deployed in practically all areas of our lives. Part of th...
Abstract—For many data mining and machine learning tasks, the quality of a similarity measure is the...
Learning, prediction and identification has been a main topic of interest in science and engineering...
The speed with which a learning algorithm converges as it is presented with more data is a central p...
We consider the problem of choosing, sequentially, a map which assigns elements of a set A to a few ...
Designing online algorithms with machine learning predictions is a recent technique beyond the worst...
With an ever increasing demand on large scale data, difficulties exist in terms of processing and ut...
We consider online similarity prediction problems over networked data. We begin by relat-ing this ta...
A challenge for mining large-scale streaming data overlooked by most existing studies on online lear...
Relative similarity learning, as an important learning scheme for information retrieval, aims to lea...
Abstract. With the increasing volume of data in the world, the best approach for learning from this ...
We consider the problem of choosing, sequentially, a map which assigns elements of a set A to a few ...
We consider situations where training data is abundant and computing resources are comparatively sca...
Deep metric learning aims to learn a deep embedding that can capture the semantic similarity of data...
Learning, prediction and identification has been a main topic of interest in science and engineering...
Machine learning algorithms are now being deployed in practically all areas of our lives. Part of th...
Abstract—For many data mining and machine learning tasks, the quality of a similarity measure is the...
Learning, prediction and identification has been a main topic of interest in science and engineering...
The speed with which a learning algorithm converges as it is presented with more data is a central p...
We consider the problem of choosing, sequentially, a map which assigns elements of a set A to a few ...
Designing online algorithms with machine learning predictions is a recent technique beyond the worst...
With an ever increasing demand on large scale data, difficulties exist in terms of processing and ut...