Philosophiae Doctor - PhD
Summary: The expressed sequence tag (EST) database is a rich and fast-growing source of data for gene expression analysis and drug discovery. Clustering of raw EST data is a necessary step for further analysis and one of the most challenging problems of modern computational biology. A few systems have been designed for this purpose, and a few more are currently under development. These systems are reviewed in the "Literature and software review". Different strategies of supervised and unsupervised clustering are discussed, as well as sequence comparison techniques, such as those based on alignment or oligonucleotide composition. Potential bottlenecks are analysed and the computational complexity of EST clustering is estimated...
Background: An important problem in computational biology is the automatic det...
We present a method for evaluating the suitability of different string dissimilarity measures and cl...
Motivation: Similarity clustering of next-generation sequences (NGS) is an important computational pro...
BACKGROUND: The continuous flow of EST data remains one of the richest sources for discoveries in mo...
The article describes two new clustering algorithms for DNA nucleotide sequences, summarizes the res...
Expressed sequence tags, abbreviated ESTs, are DNA molecules experimentally derived from expressed p...
We present a fast algorithm for sequence clustering and searching which works with large sequence da...
EST clustering is a simple, yet effective method to discover all the genes present in a variety of s...
Background: Expressed sequence tags (ESTs) are single pass reads from randomly selected cDNA clones. ...
One of the fundamental components of large-scale gene discovery projects is that of clustering of ex...
Motivation: Efficient clustering is important for handling the large amount of available EST sequenc...
Our work involves developing an intelligent, time- and memory-efficient parallel clustering algorith...
Clustering is a key step in the processing of Expressed Sequence Tags (ESTs). The primary goal of cl...