We develop and apply a novel framework which is designed to extract information in the form of a positive definite kernel matrix from possibly crude, noisy, incomplete, inconsistent dissimilarity information between pairs of objects, obtainable in a variety of contexts. Any positive definite kernel defines a consistent set of distances, and the fitted kernel provides a set of coordinates in Euclidean space which attempt to respect the information available, while controlling for complexity of the kernel. The resulting set of coordinates are highly appropriate for visualization and as input to classification and clustering algorithms. The framework is formulated in terms of a class of optimization problems which can be solved efficiently usi...
Modern machine learning techniques are proving to be extremely valuable for the analysis of data in ...
This paper studies a new framework for learning a predictor in the presence of multiple kernel funct...
Complex biological data generated from various experiments are stored in diverse data types in multi...
In recent years, more and more high-throughput data sources useful for protein complex prediction ha...
For many biomedical modelling tasks a number of different types of data may influence predictions ma...
Determining protein sequence similarity is an important task for protein classification and homology...
For many biomedical modelling tasks a number of different types of data may influence predictions ma...
This thesis addresses the problem of finding robust, fast and precise learning methods for noisy, in...
Motivation: Various approaches based on features extracted from protein sequences and often machine ...
A key issue in supervised protein classification is the representation of input sequences of amino a...
Motivation: Building an accurate protein classification system depends critically upon choosing a go...
We present a framework for efficient extrapolation of reduced rank approximations, graph kernels, an...
Biological sequence classification (such as protein remote homology detection) solely based on seque...
This paper brings together two strands of machine learning of increasing importance: kernel methods ...
In this thesis we address three fundamental problems in computer vision using kernel methods. We fir...
Modern machine learning techniques are proving to be extremely valuable for the analysis of data in ...
This paper studies a new framework for learning a predictor in the presence of multiple kernel funct...
Complex biological data generated from various experiments are stored in diverse data types in multi...
In recent years, more and more high-throughput data sources useful for protein complex prediction ha...
For many biomedical modelling tasks a number of different types of data may influence predictions ma...
Determining protein sequence similarity is an important task for protein classification and homology...
For many biomedical modelling tasks a number of different types of data may influence predictions ma...
This thesis addresses the problem of finding robust, fast and precise learning methods for noisy, in...
Motivation: Various approaches based on features extracted from protein sequences and often machine ...
A key issue in supervised protein classification is the representation of input sequences of amino a...
Motivation: Building an accurate protein classification system depends critically upon choosing a go...
We present a framework for efficient extrapolation of reduced rank approximations, graph kernels, an...
Biological sequence classification (such as protein remote homology detection) solely based on seque...
This paper brings together two strands of machine learning of increasing importance: kernel methods ...
In this thesis we address three fundamental problems in computer vision using kernel methods. We fir...
Modern machine learning techniques are proving to be extremely valuable for the analysis of data in ...
This paper studies a new framework for learning a predictor in the presence of multiple kernel funct...
Complex biological data generated from various experiments are stored in diverse data types in multi...