Kernel ridge regression (KRR) is a popular scheme for non-linear non-parametric learning. However, existing implementations of KRR require that all the data is stored in the main memory, which severely limits the use of KRR in contexts where data size far exceeds the memory size. Such applications are increasingly common in data mining, bioinformatics, and control. A powerful paradigm for computing on data sets that are too large for memory is the streaming model of computation, where we process one data sample at a time, discarding each sample before moving on to the next one. In this paper, we propose StreaMRAK - a streaming version of KRR. StreaMRAK improves on existing KRR schemes by dividing the problem into several levels of resolutio...
Nonparametric inference techniques provide promising tools for probabilistic reasoning in high-dime...
Kernel methods consistently outperformed previous generations of learning techniques. They provide a...
One approach to improving the running time of kernel-based machine learning methods is to build a sm...
The field of streaming algorithms has enjoyed a deal of focus from the theoretical computer science ...
International audienceLarge-scale kernel ridge regression (KRR) is limited by the need to store a la...
Most studies on graph classification focus on designing fast and effective kernels. Several fast sub...
Kernel methods play a central role in machine learning and statistics, but algorithms for such metho...
This thesis studies clustering problems on data streams, specifically with applications to metric sp...
Streaming algorithms must process a large quantity of small updates quickly to allow queries about t...
This electronic version was submitted by the student author. The certified thesis is available in th...
Nowadays, streaming data analysis has become a relevant area of research in machine learning. Most o...
With an immense growth in data, there is a great need for training and testing machine learning mode...
Kernel methods are nonparametric feature extraction techniques that attempt to boost the learning ca...
Traditional machine learning has been largely concerned with developing techniques for small or mode...
Data streams have emerged as a natural computational model for numerous applications of big data pro...
Nonparametric inference techniques provide promising tools for probabilistic reasoning in high-dime...
Kernel methods consistently outperformed previous generations of learning techniques. They provide a...
One approach to improving the running time of kernel-based machine learning methods is to build a sm...
The field of streaming algorithms has enjoyed a deal of focus from the theoretical computer science ...
International audienceLarge-scale kernel ridge regression (KRR) is limited by the need to store a la...
Most studies on graph classification focus on designing fast and effective kernels. Several fast sub...
Kernel methods play a central role in machine learning and statistics, but algorithms for such metho...
This thesis studies clustering problems on data streams, specifically with applications to metric sp...
Streaming algorithms must process a large quantity of small updates quickly to allow queries about t...
This electronic version was submitted by the student author. The certified thesis is available in th...
Nowadays, streaming data analysis has become a relevant area of research in machine learning. Most o...
With an immense growth in data, there is a great need for training and testing machine learning mode...
Kernel methods are nonparametric feature extraction techniques that attempt to boost the learning ca...
Traditional machine learning has been largely concerned with developing techniques for small or mode...
Data streams have emerged as a natural computational model for numerous applications of big data pro...
Nonparametric inference techniques provide promising tools for probabilistic reasoning in high-dime...
Kernel methods consistently outperformed previous generations of learning techniques. They provide a...
One approach to improving the running time of kernel-based machine learning methods is to build a sm...