Advancements in high-throughput DNA sequencing technologies and ambitious goals for their use are resulting in the generation of a deluge of unannotated sequenced genomes. This makes computational tools that can aid in annotation increasingly valuable. Here, we provide a detailed exploration of the utility as well as the limitations of average mutual information (AMI) in several steps of genome annotation. For a genomic sequence, AMI is a measure of the information a base contains about the base separated by a fixed lag. A profile is constructed by calculating AMI at multiple lags. In addition to traditional AMI, we employ two AMI variants: expanded AMI and expanded-adjusted AMI, both of which preserve some granular detail discarded by AMI. F...
The Human Genome Project and advances in DNA sequencing technologies have revolutionized the identif...
In the wake of advanced DNA sequencing technology, a large number of bacterial, animal, and plant ge...
High-throughput genome sequencing projects produce a huge amount of raw biological data on a daily b...
One of the important steps in the annotation of genomes is the identification of regions in the geno...
Background: Occult organizational structures in DNA sequences may hold the key to understanding fun...
In the last ten years, numerous complete and almost complete genome sequences have been made availab...
About this book * Cutting-edge genome analysis methods from leading bioinformaticians An accurate de...
Large amounts of genome sequence data are available and much more will become available in the near ...
One basic problem in the analysis of DNA sequences is the recognition of protein-coding genes. Compu...
DNA sequencing is used to read the nucleotides composing the genetic material that forms individual ...
Machine learning enables a computer to learn a relationship between two assumingly related types of ...
AbstractWe report here the use of the mutual information theory for the certification of annotated r...
Evolutionary distance measures provide a means of identifying and organizing related organisms by co...
Science and engineering rely on the accumulation and dissemination of knowledge to make di...
Background: In the post-genomic era several methods of computational genomics are emerging to unders...
The Human Genome Project and advances in DNA sequencing technologies have revolutionized the identif...
In the wake of advanced DNA sequencing technology, a large number of bacterial, animal, and plant ge...
High-throughput genome sequencing projects produce a huge amount of raw biological data on a daily b...
One of the important steps in the annotation of genomes is the identification of regions in the geno...
Background: Occult organizational structures in DNA sequences may hold the key to understanding fun...
In the last ten years, numerous complete and almost complete genome sequences have been made availab...
About this book * Cutting-edge genome analysis methods from leading bioinformaticians An accurate de...
Large amounts of genome sequence data are available and much more will become available in the near ...
One basic problem in the analysis of DNA sequences is the recognition of protein-coding genes. Compu...
DNA sequencing is used to read the nucleotides composing the genetic material that forms individual ...
Machine learning enables a computer to learn a relationship between two assumingly related types of ...
AbstractWe report here the use of the mutual information theory for the certification of annotated r...
Evolutionary distance measures provide a means of identifying and organizing related organisms by co...
Science and engineering rely on the accumulation and dissemination of knowledge to make di...
Background: In the post-genomic era several methods of computational genomics are emerging to unders...
The Human Genome Project and advances in DNA sequencing technologies have revolutionized the identif...
In the wake of advanced DNA sequencing technology, a large number of bacterial, animal, and plant ge...
High-throughput genome sequencing projects produce a huge amount of raw biological data on a daily b...