Within this paper we are proposing and testing a new strategy for detection and measurement of similarity between sequences of proteins. Our approach has its roots in computational linguistics and the related techniques for quantifying and comparing content in strings of characters. The pairwise comparison of proteins relies on the content regularities expected to uniquely characterize each sequence. These regularities are captured by n-gram based modelling techniques and exploited by cross-entropy related measures. In this new attempt to incorporate theoretical ideas from computational linguistics into the field of bioinformatics, we experimented using two implementations having always as ultimate goal the development of practical, computa...
Advances in sequencing technologies led to rapid increase in the number and diversity of biological ...
In bioinformatics and computational biology, methods for biological sequence comparison play the mos...
Quantification of similarities between protein sequences or DNA/RNA strands is a (sub-)task that is ...
Within this paper we are proposing and testing a new strategy for detection and measurement of simil...
Abstract. Within this paper we are proposing and testing a new strategy for detection and measuremen...
Abstract Background Many proposed statistical measures can efficiently compare protein sequence to f...
Background: DNA sequence analysis is an important research topic in bioinformatics. Evaluating the s...
AbstractIn this article, a novel approach for extracting features from protein sequences is proposed...
In this thesis we examine issues surrounding the development of software that predicts the function ...
Abstract Motivation: Distance measures built on the notion of text compression have b...
There exist many computational methods for finding similarity in gene sequence, finding suitable met...
AbstractThis paper presents a method for classifying a large and mixed set of uncharacterized sequen...
Abstract Background Classification of protein sequences is a central problem in computational biolog...
Protein sequence classification is a challenging problem. We are attempting to use the Mutual Inform...
The increasing number of sequenced genomes motivates the use of evolutionary patterns to detect gene...
Advances in sequencing technologies led to rapid increase in the number and diversity of biological ...
In bioinformatics and computational biology, methods for biological sequence comparison play the mos...
Quantification of similarities between protein sequences or DNA/RNA strands is a (sub-)task that is ...
Within this paper we are proposing and testing a new strategy for detection and measurement of simil...
Abstract. Within this paper we are proposing and testing a new strategy for detection and measuremen...
Abstract Background Many proposed statistical measures can efficiently compare protein sequence to f...
Background: DNA sequence analysis is an important research topic in bioinformatics. Evaluating the s...
AbstractIn this article, a novel approach for extracting features from protein sequences is proposed...
In this thesis we examine issues surrounding the development of software that predicts the function ...
Abstract Motivation: Distance measures built on the notion of text compression have b...
There exist many computational methods for finding similarity in gene sequence, finding suitable met...
AbstractThis paper presents a method for classifying a large and mixed set of uncharacterized sequen...
Abstract Background Classification of protein sequences is a central problem in computational biolog...
Protein sequence classification is a challenging problem. We are attempting to use the Mutual Inform...
The increasing number of sequenced genomes motivates the use of evolutionary patterns to detect gene...
Advances in sequencing technologies led to rapid increase in the number and diversity of biological ...
In bioinformatics and computational biology, methods for biological sequence comparison play the mos...
Quantification of similarities between protein sequences or DNA/RNA strands is a (sub-)task that is ...