We investigate in this paper the problem of mining disjunctive emerging patterns in high-dimensional biomedical datasets. Disjunctive emerging patterns are sets of features that are very frequent among samples of a target class, cases in a case–control study, for example, and are very rare among all other samples. We, for the very first time, demonstrate that this problem can be solved using minimal transversals in a hypergraph. We propose a new divide-and-conquer algorithm that enables us to efficiently compute disjunctive emerging patterns in parallel and distributed environments. We conducted experiments using real-world microarray gene expression datasets to assess the performance of our approach. Our results show that our approach is m...
This paper surveys approaches to the discovery of patterns in biosequences and places these approach...
Background Mining frequent gene regulation sequential patterns in time course microarray datasets is...
Describing and capturing significant differences between two classes of data is an important data mi...
We focus, in this paper, on the computational challenges of identifying disjunctive Boolean patterns...
The mining of changes or differences or other comparative patterns from a pair of datasets is an int...
Discriminative pattern mining is one of the most important techniques in data mining. This challengi...
Abstract—Discriminative patterns can provide valuable insights into data sets with class labels, tha...
This paper introduces a new type of discriminative subgraph pattern called breaker emerging subgraph...
The emergence of automated high-throughput sequencing technologies has resulted in a huge increase o...
In this tutorial chapter, we review basics about frequent pattern mining algorithms, including items...
Abstract. An increasing number of biomedical tasks, such as pattern-based biclustering, require the ...
In recent years, we have seen a rapid increase in the available DNA and protein data coming from var...
International audienceEmerging patterns are patterns of great interest for discovering information f...
Abstract Searching for interesting common subgraphs in graph data is a well-studied problem in data ...
Pattern discovery in biological sequences (e.g., DNA se-quences) is one of the most challenging task...
This paper surveys approaches to the discovery of patterns in biosequences and places these approach...
Background Mining frequent gene regulation sequential patterns in time course microarray datasets is...
Describing and capturing significant differences between two classes of data is an important data mi...
We focus, in this paper, on the computational challenges of identifying disjunctive Boolean patterns...
The mining of changes or differences or other comparative patterns from a pair of datasets is an int...
Discriminative pattern mining is one of the most important techniques in data mining. This challengi...
Abstract—Discriminative patterns can provide valuable insights into data sets with class labels, tha...
This paper introduces a new type of discriminative subgraph pattern called breaker emerging subgraph...
The emergence of automated high-throughput sequencing technologies has resulted in a huge increase o...
In this tutorial chapter, we review basics about frequent pattern mining algorithms, including items...
Abstract. An increasing number of biomedical tasks, such as pattern-based biclustering, require the ...
In recent years, we have seen a rapid increase in the available DNA and protein data coming from var...
International audienceEmerging patterns are patterns of great interest for discovering information f...
Abstract Searching for interesting common subgraphs in graph data is a well-studied problem in data ...
Pattern discovery in biological sequences (e.g., DNA se-quences) is one of the most challenging task...
This paper surveys approaches to the discovery of patterns in biosequences and places these approach...
Background Mining frequent gene regulation sequential patterns in time course microarray datasets is...
Describing and capturing significant differences between two classes of data is an important data mi...