Motivation: It is known that most genomic regions of special inter-est, e.g., horizontally acquired sequences, genomic islands, etc., have distinct word (m-mer) compositions. Most of the earlier work along this direction, addressed di- and tri-nucleotide compositions. We present an approach that can be applied to analyze composi-tions of any given word size. The method, called the centroid ap-proach, can reveal compositionally distinct regions in genomic se-quences, for any given word size. Results: We applied our method to 50 bacterial genomes and demonstrated its ability to identify embedded sequences of varying lengths from distantly related organisms. We also investigated the genetic makeup of the regions identified as compositionally d...
Similarity Plot (S-plot) is a Windows-based application for large-scale comparisons and 2-dimensiona...
Abstract Background Data mining in large DNA sequences is a major challenge in microbial genomics an...
94 p.Thesis (Ph.D.)--University of Illinois at Urbana-Champaign, 1999.This work presents three new a...
Motivation: It is known that most genomic regions of special interest, e.g. horizontally acquired se...
Motivation: It is known that the protein-coding regions exhibit a lower degree of compositional vari...
Recent developments in genomic and proteomic sequencing technologies haverevolutionized research in ...
Bacterial genomes vary extensively in terms of both gene content and gene sequence. This plasticity ...
The main part of the thesis is concerned with large-scale studies of codon usage in completely seque...
Bacterial genomes vary extensively in terms of both gene content and gene sequence. This plasticity ...
Bacterial genomes vary extensively in terms of both gene content and gene sequence. This plasticity ...
Bacterial genomes vary extensively in terms of both gene content and gene sequence. This plasticity ...
At present the genomes of many organisms have been sequenced, meaning that their nucleotide structur...
94 p.Thesis (Ph.D.)--University of Illinois at Urbana-Champaign, 1999.This work presents three new a...
BackgroundWhereas genome sequencing has given us high-resolution pictures of many different species ...
Numerous sequencing projects have unveiled partial and full microbial genomes. The data produced far...
Similarity Plot (S-plot) is a Windows-based application for large-scale comparisons and 2-dimensiona...
Abstract Background Data mining in large DNA sequences is a major challenge in microbial genomics an...
94 p.Thesis (Ph.D.)--University of Illinois at Urbana-Champaign, 1999.This work presents three new a...
Motivation: It is known that most genomic regions of special interest, e.g. horizontally acquired se...
Motivation: It is known that the protein-coding regions exhibit a lower degree of compositional vari...
Recent developments in genomic and proteomic sequencing technologies haverevolutionized research in ...
Bacterial genomes vary extensively in terms of both gene content and gene sequence. This plasticity ...
The main part of the thesis is concerned with large-scale studies of codon usage in completely seque...
Bacterial genomes vary extensively in terms of both gene content and gene sequence. This plasticity ...
Bacterial genomes vary extensively in terms of both gene content and gene sequence. This plasticity ...
Bacterial genomes vary extensively in terms of both gene content and gene sequence. This plasticity ...
At present the genomes of many organisms have been sequenced, meaning that their nucleotide structur...
94 p.Thesis (Ph.D.)--University of Illinois at Urbana-Champaign, 1999.This work presents three new a...
BackgroundWhereas genome sequencing has given us high-resolution pictures of many different species ...
Numerous sequencing projects have unveiled partial and full microbial genomes. The data produced far...
Similarity Plot (S-plot) is a Windows-based application for large-scale comparisons and 2-dimensiona...
Abstract Background Data mining in large DNA sequences is a major challenge in microbial genomics an...
94 p.Thesis (Ph.D.)--University of Illinois at Urbana-Champaign, 1999.This work presents three new a...