International audienceWe study in this paper flat and hierarchical classification strategies in the context of large-scale taxonomies. To this end, we first propose a multiclass, hierarchi-cal data dependent bound on the generalization error of classifiers deployed in large-scale taxonomies. This bound provides an explanation to several empirical results reported in the literature, related to the performance of flat and hierarchical classifiers. We then introduce another type of bound targeting the approximation error of a family of classifiers, and derive from it features used in a meta-classifier to decide which nodes to prune (or flatten) in a large-scale taxonomy. We finally illustrate the theoretical developments through several experi...
In this work we implement and evaluate a methodology to classify multi-labeled web documents into la...
Abstract—Recent large-scale hierarchical classification tasks typically have tens of thousands of cl...
This thesis addresses three major types of classi?cation problems in data mining: 1) ?at classi?cati...
International audienceWe study in this paper flat and hierarchical classification strategies in the ...
In the era of Big Data, we need efficient and scalable machine learning algorithms which can perform...
International audienceIn the context of supervised learning, the training data for large-scale hiera...
Poster paper 0344International audienceWhile multi-class categorization of documents has been of res...
International audienceGoing beyond the traditional text classification, involving a few tens of clas...
International audienceIn many of the large-scale physical and social complex systems phenomena fat-t...
Abstract — Large-scale classification taxonomies have thousands of classes, deep hierarchies and ske...
University of Minnesota Ph.D. dissertation. January 2009. Major: Statistics. Advisor: Xiaotong Shen....
Text documents in the web are in hierarchy, increase in the content, information grows over the year...
We study the problem of hierarchical classification when labels corresponding to partial and/or mult...
Traditional flat classification methods (e.g., binary, multiclass, and multi-label classification) s...
International audienceExtreme classification task where the number of classes is very large has rece...
In this work we implement and evaluate a methodology to classify multi-labeled web documents into la...
Abstract—Recent large-scale hierarchical classification tasks typically have tens of thousands of cl...
This thesis addresses three major types of classi?cation problems in data mining: 1) ?at classi?cati...
International audienceWe study in this paper flat and hierarchical classification strategies in the ...
In the era of Big Data, we need efficient and scalable machine learning algorithms which can perform...
International audienceIn the context of supervised learning, the training data for large-scale hiera...
Poster paper 0344International audienceWhile multi-class categorization of documents has been of res...
International audienceGoing beyond the traditional text classification, involving a few tens of clas...
International audienceIn many of the large-scale physical and social complex systems phenomena fat-t...
Abstract — Large-scale classification taxonomies have thousands of classes, deep hierarchies and ske...
University of Minnesota Ph.D. dissertation. January 2009. Major: Statistics. Advisor: Xiaotong Shen....
Text documents in the web are in hierarchy, increase in the content, information grows over the year...
We study the problem of hierarchical classification when labels corresponding to partial and/or mult...
Traditional flat classification methods (e.g., binary, multiclass, and multi-label classification) s...
International audienceExtreme classification task where the number of classes is very large has rece...
In this work we implement and evaluate a methodology to classify multi-labeled web documents into la...
Abstract—Recent large-scale hierarchical classification tasks typically have tens of thousands of cl...
This thesis addresses three major types of classi?cation problems in data mining: 1) ?at classi?cati...