Recent works have shown the relevance of constraint programming to tackle data mining tasks. This thesis follows this approach and addresses motif discovery in sequential data. We focus in particular, in the case of classified sequences, on the search for motifs that best fit each individual class. We propose a language of constraints over matrix domains to model such problems. The language assumes a preprocessing of the data set (e.g., by pre-computing the locations of each character in each sequence) and views a motif as the choice of a sub-matrix (i.e., characters, sequences, and locations). We introduce different matrix constraints (compatibility of locations with the database, class covering, location-based character ordering common to...
Given an input sequence of data, a motif is a repeating pattern, possibly interspersed with `dont ca...
International audience—Sequential pattern mining under various constraints is a challenging data min...
International audience—Sequential pattern mining under various constraints is a challenging data min...
Recent works have shown the relevance of constraint programming to tackle data mining tasks. This th...
Recent works have shown the relevance of constraint programming to tackle data mining tasks. This th...
Des travaux récents ont montré l’intérêt de la programmation par contraintes pour la fouille de donn...
International audienceConsiderable effort has been invested over the years in ad-hoc algorithms for ...
Itemset and pattern mining has numerous applications ranging from Marketing to Bioinformatics. We in...
Itemset and pattern mining has numerous applications ranging from Marketing to Bioinformatics. We in...
Short paperInternational audienceThis paper addresses the discovery of discriminative nary motifs in...
Abstract Background Discovering approximately repeated patterns, or motifs, in biological sequences ...
Pattern mining is a significant field of Knowledge Discovery inDatabases. This thesis deals with the...
Pattern mining is a significant field of Knowledge Discovery inDatabases. This thesis deals with the...
In this paper, we describe an algorithm for the localization of structured models, i.e. sequences of...
In this paper we describe an algorithm for the localization of structured models, i.e. sequences of ...
Given an input sequence of data, a motif is a repeating pattern, possibly interspersed with `dont ca...
International audience—Sequential pattern mining under various constraints is a challenging data min...
International audience—Sequential pattern mining under various constraints is a challenging data min...
Recent works have shown the relevance of constraint programming to tackle data mining tasks. This th...
Recent works have shown the relevance of constraint programming to tackle data mining tasks. This th...
Des travaux récents ont montré l’intérêt de la programmation par contraintes pour la fouille de donn...
International audienceConsiderable effort has been invested over the years in ad-hoc algorithms for ...
Itemset and pattern mining has numerous applications ranging from Marketing to Bioinformatics. We in...
Itemset and pattern mining has numerous applications ranging from Marketing to Bioinformatics. We in...
Short paperInternational audienceThis paper addresses the discovery of discriminative nary motifs in...
Abstract Background Discovering approximately repeated patterns, or motifs, in biological sequences ...
Pattern mining is a significant field of Knowledge Discovery inDatabases. This thesis deals with the...
Pattern mining is a significant field of Knowledge Discovery inDatabases. This thesis deals with the...
In this paper, we describe an algorithm for the localization of structured models, i.e. sequences of...
In this paper we describe an algorithm for the localization of structured models, i.e. sequences of ...
Given an input sequence of data, a motif is a repeating pattern, possibly interspersed with `dont ca...
International audience—Sequential pattern mining under various constraints is a challenging data min...
International audience—Sequential pattern mining under various constraints is a challenging data min...