We present Random Partition Kernels, a new class of kernels derived by demonstrating a natural connection between random partitions of objects and kernels between those objects. We show how the construction can be used to create kernels from methods that would not normally be viewed as random partitions, such as Random Forest. To demonstrate the potential of this method, we propose two new kernels, the Random Forest Kernel and the Fast Cluster Kernel, and show that these kernels consistently outperform standard kernels on problems involving real-world datasets. Finally, we show how the form of these kernels lend themselves to a natural approximation that is appropriate for certain big data problems, allowing $O(N)$ inference in methods such...
One approach to improving the running time of kernel-based machine learning methods is to build a sm...
In early 20’s, the first person to notice the link between Random Forests (RF)and Kernel Method...
This paper investigates data dependent kernels that are derived directly from data. This has been an...
We present Random Partition Kernels, a new class of kernels derived by demonstrating a natu-ral conn...
We introduce the Mondrian kernel, a fast random feature approximation to the Laplace kernel. It is s...
We introduce the Mondrian kernel, a fast $\textit{random feature}$ approximation to the Laplace kern...
A random forest is a popular machine learning ensemble method that has proven successful in solving ...
Big Data is one of the major challenges of statistical science and has numerous consequences from a...
Big Data is one of the major challenges of statistical science and has numerous consequences from al...
International audienceBased on decision trees combined with aggregation and bootstrap ideas, random ...
International audienceLeveraging on recent random matrix advances in the performance analysis of ker...
Kernel methods and neural networks are two important schemes in the supervised learning field. The t...
To accelerate the training of kernel machines, we propose to map the input data to a randomized low-...
A random forest is a popular machine learning ensemble method that has proven successful in solving ...
We propose a new probabilistic graph kernel. It is defined by the set of frequent subtrees generated...
One approach to improving the running time of kernel-based machine learning methods is to build a sm...
In early 20’s, the first person to notice the link between Random Forests (RF)and Kernel Method...
This paper investigates data dependent kernels that are derived directly from data. This has been an...
We present Random Partition Kernels, a new class of kernels derived by demonstrating a natu-ral conn...
We introduce the Mondrian kernel, a fast random feature approximation to the Laplace kernel. It is s...
We introduce the Mondrian kernel, a fast $\textit{random feature}$ approximation to the Laplace kern...
A random forest is a popular machine learning ensemble method that has proven successful in solving ...
Big Data is one of the major challenges of statistical science and has numerous consequences from a...
Big Data is one of the major challenges of statistical science and has numerous consequences from al...
International audienceBased on decision trees combined with aggregation and bootstrap ideas, random ...
International audienceLeveraging on recent random matrix advances in the performance analysis of ker...
Kernel methods and neural networks are two important schemes in the supervised learning field. The t...
To accelerate the training of kernel machines, we propose to map the input data to a randomized low-...
A random forest is a popular machine learning ensemble method that has proven successful in solving ...
We propose a new probabilistic graph kernel. It is defined by the set of frequent subtrees generated...
One approach to improving the running time of kernel-based machine learning methods is to build a sm...
In early 20’s, the first person to notice the link between Random Forests (RF)and Kernel Method...
This paper investigates data dependent kernels that are derived directly from data. This has been an...