Genomic DNA is fragmented into segments using the Jensen-Shannon divergence. Use of this criterion results in the fragments being entropically homogeneous to within a predefined level of statistical significance. Application of this procedure is made to complete genomes of organisms from archaebacteria, eubacteria, and eukaryotes. The distribution of fragment lengths in bacterial and primitive eukaryotic DNAs shows two distinct regimes of power-law scaling. The characteristic length separating these two regimes appears to be an intrinsic property of the sequence rather than a finite-size artifact, and is independent of the significance level used in segmenting a given genome. Fragment length distributions obtained in the segmentation of the...
Background: Segmental duplication is widely held to be an important mode of genome growth and evolut...
Genome evolution is shaped by a multitude of mutational processes, including point mutations, insert...
Motivation: DNA sequences can be represented by sequences of four symbols, but it is often useful to...
Genomic DNA is fragmented into segments using the Jensen-Shannon divergence. Use of this criterion r...
Heterogeneous DNA sequences can be partitioned into homogeneous domains that are comprised of the fo...
Eukaryotic genomes display segmental patterns of variation in various properties, including GC conte...
By using the Jensen-Shannon divergence, genomic DNA can be divided into compositionally distinct dom...
Conserved, ultraconserved and other classes of constrained elements (collectively referred as CNEs h...
International audienceSince the sequencing of large genomes, many statistical features of their sequ...
Conserved, ultraconserved and other classes of constrained elements (collectively referred as CNEs h...
We proposed a new index of the classification of organisms (cells) based on the appearance frequency...
Abstract.—The amplified fragment length polymorphism (AFLP) technique is being increasingly used in ...
Motivation: DNA segmentation, i.e. the partitioning of DNA in compositionally homogeneous segments, ...
Repeats or Transposable Elements (TEs) are highly repeated sequence stretches, present in virtually ...
We introduce Markov models for segmentation of symbolic sequences, extending a segmentation procedur...
Background: Segmental duplication is widely held to be an important mode of genome growth and evolut...
Genome evolution is shaped by a multitude of mutational processes, including point mutations, insert...
Motivation: DNA sequences can be represented by sequences of four symbols, but it is often useful to...
Genomic DNA is fragmented into segments using the Jensen-Shannon divergence. Use of this criterion r...
Heterogeneous DNA sequences can be partitioned into homogeneous domains that are comprised of the fo...
Eukaryotic genomes display segmental patterns of variation in various properties, including GC conte...
By using the Jensen-Shannon divergence, genomic DNA can be divided into compositionally distinct dom...
Conserved, ultraconserved and other classes of constrained elements (collectively referred as CNEs h...
International audienceSince the sequencing of large genomes, many statistical features of their sequ...
Conserved, ultraconserved and other classes of constrained elements (collectively referred as CNEs h...
We proposed a new index of the classification of organisms (cells) based on the appearance frequency...
Abstract.—The amplified fragment length polymorphism (AFLP) technique is being increasingly used in ...
Motivation: DNA segmentation, i.e. the partitioning of DNA in compositionally homogeneous segments, ...
Repeats or Transposable Elements (TEs) are highly repeated sequence stretches, present in virtually ...
We introduce Markov models for segmentation of symbolic sequences, extending a segmentation procedur...
Background: Segmental duplication is widely held to be an important mode of genome growth and evolut...
Genome evolution is shaped by a multitude of mutational processes, including point mutations, insert...
Motivation: DNA sequences can be represented by sequences of four symbols, but it is often useful to...