This master thesis describes theoretical knowledge of biological sequences, principles entropy rate estimates and possibilities of compression of DNA sequences using the substitution methods. Thesis includes practical application of the compression algorithm and practical estimation of entropy
A comprehensive data base is analyzed to determine the Shannon information content of a protein sequ...
Rapid advancements in research in the field of DNA sequence discovery has led to a vast range of com...
The ever increasing growth of the production of high-throughput sequencing data poses a serious chal...
The increasing volume of biological data requires finding new ways to save these data in genetic ban...
This paper introduces a novel algorithm for biological sequence compression that makes use of both s...
Data compression at its base is concerned with how information is organized in data. Understanding t...
The multivariate entropy distance (MED) method is a new highly efficient and accurate gene identific...
A new simple method is found for efficient and accurate identification of coding sequences in prokar...
As the usage of technology increases rapidly today, the amount of data created also increases expone...
The purpose of this project is to compare the complexities of different species\u27 mitochondrial ge...
A detailed entropy analysis by the recent novelty of 'lumping' is performed in some DNA sequences. O...
DNA sequencing is the process of determining the ordered sequence of the four nucleotide bases in a ...
Abstract — Biological sequences from different species are called orthologs if they evolved from a s...
Many of the same modeling methods used in natural languages, specifically Markov models and HMM\u27s...
V diplomskem delu smo na kratko opisali lastnosti in značilnosti zaporedij deoksiribonukleinske kisl...
A comprehensive data base is analyzed to determine the Shannon information content of a protein sequ...
Rapid advancements in research in the field of DNA sequence discovery has led to a vast range of com...
The ever increasing growth of the production of high-throughput sequencing data poses a serious chal...
The increasing volume of biological data requires finding new ways to save these data in genetic ban...
This paper introduces a novel algorithm for biological sequence compression that makes use of both s...
Data compression at its base is concerned with how information is organized in data. Understanding t...
The multivariate entropy distance (MED) method is a new highly efficient and accurate gene identific...
A new simple method is found for efficient and accurate identification of coding sequences in prokar...
As the usage of technology increases rapidly today, the amount of data created also increases expone...
The purpose of this project is to compare the complexities of different species\u27 mitochondrial ge...
A detailed entropy analysis by the recent novelty of 'lumping' is performed in some DNA sequences. O...
DNA sequencing is the process of determining the ordered sequence of the four nucleotide bases in a ...
Abstract — Biological sequences from different species are called orthologs if they evolved from a s...
Many of the same modeling methods used in natural languages, specifically Markov models and HMM\u27s...
V diplomskem delu smo na kratko opisali lastnosti in značilnosti zaporedij deoksiribonukleinske kisl...
A comprehensive data base is analyzed to determine the Shannon information content of a protein sequ...
Rapid advancements in research in the field of DNA sequence discovery has led to a vast range of com...
The ever increasing growth of the production of high-throughput sequencing data poses a serious chal...