The majority of the human genome consists of repeated sequences. An important type of repeated sequence common in the human genome is the tandem repeat, in which identical copies appear next to each other. For example, in the sequence AGTCTGTGC, TGTG is a tandem repeat that may be generated from AGTCTGC by a tandem duplication of length 2. In this work, we investigate the possibility of generating a large number of sequences from a seed, i.e., a small initial string, by tandem duplications of bounded length. We study the capacity of such a system, a notion that quantifies the system’s generating power. Our results include exact capacity values for certain tandem duplication string systems. In addition, motivated by the role of DNA sequences in exp...
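To make the duplication operation concrete, the following is a minimal Python sketch of a single tandem duplication and of enumerating the strings reachable from a seed by duplications of bounded length. The function names `tandem_duplicate` and `descendants` and their parameters are hypothetical, introduced only for illustration; this is not the authors' method, and the brute-force enumeration is practical only for very short strings.

```python
def tandem_duplicate(s: str, start: int, length: int) -> str:
    """Return s with the block s[start:start+length] copied right after itself."""
    if length <= 0 or start < 0 or start + length > len(s):
        raise ValueError("duplicated block must lie inside the string")
    block = s[start:start + length]
    return s[:start + length] + block + s[start + length:]


def descendants(seed: str, max_dup_len: int, target_len: int) -> set:
    """Strings of length target_len reachable from seed by tandem
    duplications whose length is at most max_dup_len (brute force)."""
    frontier = {seed}
    reached = set()
    while frontier:
        next_frontier = set()
        for s in frontier:
            if len(s) == target_len:
                reached.add(s)
                continue
            # Try every block length and every position for the next duplication.
            for k in range(1, max_dup_len + 1):
                for i in range(len(s) - k + 1):
                    t = tandem_duplicate(s, i, k)
                    if len(t) <= target_len:
                        next_frontier.add(t)
        frontier = next_frontier
    return reached


# Example from the abstract: duplicating "TG" (length 2) in AGTCTGC.
print(tandem_duplicate("AGTCTGC", 4, 2))  # AGTCTGTGC
```

Counting how the size of `descendants(seed, k, n)` grows with `n` gives an informal feel for the generating power that the capacity is meant to quantify, although the sketch does not compute the capacity itself.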
Mutation processes such as point mutation, insertion, deletion, and duplication (including tandem a...
The ability to store data in the DNA of a living organism has applications in a variety of areas inc...
The genomes of many species are dominated by short sequences repeated consecutively called t...
The majority of the human genome consists of repeated sequences. An important type of repeated seque...
The majority of the human genome consists of repeated sequences. An important type of repeats commo...
It is known that the majority of the human genome consists of duplicated sequences. Furthermore, it ...
It is known that the majority of the human genome consists of repeated sequences. Furthermore, it is...
It is known that the majority of the human genome consists of repeated sequences. Furthermore, it is...
We study random string-duplication systems, called Pólya string models, motivated by certain random ...
The set of all $ q $-ary strings that do not contain repeated substrings of length $ \leqslant\! 3 $...
We study random string-duplication systems, which we call Pólya string models. These are motivated b...
Duplication mutations play a critical role in the generation of biological sequences. Simultaneousl...
In computational biology, tandem duplication is an important biological phenomenon which can occur e...
We consider a new type of language defined by a word through iterative factor duplications, ...
Erwin Chargaff in 1950 made an experimental observation that the count of A is equal to the count of...