The concept of "words" in continuous languages devoid of blanks is introduced and an operational definition of words given. With this novel concept nucleotide sequences become object for linguistic analysis. The typical word size of the nucleotide language is found to be 3 to 5 (tri- to pentamers). Different genomes have distinct vocabularies. Comparison of these vocabularies can serve as a basis for revealing functional and evolutionary relatedness of sequences
In this paper, we are concerned with analysing formal linguistic properties of DNA sequences in whic...
This article investigates aspects of similarity between complete sequences of mitochondrial DNA by d...
This thesis tries to present different approaches of analysis of genomic data and classification of ...
The concept of "words" in continuous languages devoid of blanks is introduced and an operational def...
The linguistic approach to the analysis of nucleotide sequences reveals a powerful tool for a number...
One of the critical requirements of data analysis involving large DNA sequences is an effective stat...
Biological macromolecules have many features that resemble modern languages. Thus, linguistic approa...
This tutorial was one of eight tutorials selected to be presented at the Third International Confere...
A new algorithm is presented for vocabulary analysis (word detection) in texts of human origin. It p...
A study of the relation between a structure of symbol sequences and the meaning of them encrypted in...
A new algorithm is presented for vocabulary analysis (word detection) in texts of human origin. It p...
The comparison of sound sequences (words, morphemes) constitutes the core of many techniques and met...
The main part of the thesis is concerned with large-scale studies of codon usage in completely seque...
Abstract: Symbolic sequence decomposition into a set of consecutive, distinct subsequences (mers) is...
A current barrier for successful rational drug design is the lack of understanding of the structure ...
In this paper, we are concerned with analysing formal linguistic properties of DNA sequences in whic...
This article investigates aspects of similarity between complete sequences of mitochondrial DNA by d...
This thesis tries to present different approaches of analysis of genomic data and classification of ...
The concept of "words" in continuous languages devoid of blanks is introduced and an operational def...
The linguistic approach to the analysis of nucleotide sequences reveals a powerful tool for a number...
One of the critical requirements of data analysis involving large DNA sequences is an effective stat...
Biological macromolecules have many features that resemble modern languages. Thus, linguistic approa...
This tutorial was one of eight tutorials selected to be presented at the Third International Confere...
A new algorithm is presented for vocabulary analysis (word detection) in texts of human origin. It p...
A study of the relation between a structure of symbol sequences and the meaning of them encrypted in...
A new algorithm is presented for vocabulary analysis (word detection) in texts of human origin. It p...
The comparison of sound sequences (words, morphemes) constitutes the core of many techniques and met...
The main part of the thesis is concerned with large-scale studies of codon usage in completely seque...
Abstract: Symbolic sequence decomposition into a set of consecutive, distinct subsequences (mers) is...
A current barrier for successful rational drug design is the lack of understanding of the structure ...
In this paper, we are concerned with analysing formal linguistic properties of DNA sequences in whic...
This article investigates aspects of similarity between complete sequences of mitochondrial DNA by d...
This thesis tries to present different approaches of analysis of genomic data and classification of ...