Abstract We describe a novel algorithm for information recovery from DNA sequences by using a digital filter. This work proposes a three-part algorithm to decide the k-mer or q-gram word density. Employing a finite impulse response digital filter, one can calculate the sequence's k-mer or q-gram word density. Further principal component analysis is used on word density distribution to analyze the dissimilarity between sequences. A dissimilarity matrix is thus formed and shows the appearance of cluster formation. This cluster formation is constructed based on the alignment-free sequence method. Furthermore, the clusters are used to build phylogenetic relations. The cluster algorithm is in good agreement with alignment-based algorithms. The p...
DNAsequence decomposition into k-mers and their frequency counting, defines a mapping of a sequence ...
In this work we seek clusters of genomic words in human DNA by studying their inter-word lag distrib...
DNA sequence decomposition into k-mers (substrings of length k) and their frequency counting, define...
With the discovery of new DNAs, a fundamental problem arising is how to categorize those DNA sequenc...
The article describes two new clustering algorithms for DNA nucleotide sequences, summarizes the res...
The article describes two new clustering algorithms for DNA nucleotide sequences, summarizes the res...
The article describes two new clustering algorithms for DNA nucleotide sequences, summarizes the res...
The article describes two new clustering algorithms for DNA nucleotide sequences, summarizes the res...
Information theory is a branch of mathematics that overlaps with communications, biology, and medica...
K-mer frequency statistics of biological sequences is a very important and important problem in biol...
A novel distance method for sequence classification and intraspecie phylogeny reconstruction is prop...
A novel distance method for sequence classification and intraspecie phylogeny reconstruction is prop...
The applications of machine learning algorithms to the analysis of data sets of DNA sequences are ve...
In genome analysis, k-mer-based comparison methods have become standard tools. However, even though ...
DNAsequence decomposition into k-mers and their frequency counting, defines a mapping of a sequence ...
DNAsequence decomposition into k-mers and their frequency counting, defines a mapping of a sequence ...
In this work we seek clusters of genomic words in human DNA by studying their inter-word lag distrib...
DNA sequence decomposition into k-mers (substrings of length k) and their frequency counting, define...
With the discovery of new DNAs, a fundamental problem arising is how to categorize those DNA sequenc...
The article describes two new clustering algorithms for DNA nucleotide sequences, summarizes the res...
The article describes two new clustering algorithms for DNA nucleotide sequences, summarizes the res...
The article describes two new clustering algorithms for DNA nucleotide sequences, summarizes the res...
The article describes two new clustering algorithms for DNA nucleotide sequences, summarizes the res...
Information theory is a branch of mathematics that overlaps with communications, biology, and medica...
K-mer frequency statistics of biological sequences is a very important and important problem in biol...
A novel distance method for sequence classification and intraspecie phylogeny reconstruction is prop...
A novel distance method for sequence classification and intraspecie phylogeny reconstruction is prop...
The applications of machine learning algorithms to the analysis of data sets of DNA sequences are ve...
In genome analysis, k-mer-based comparison methods have become standard tools. However, even though ...
DNAsequence decomposition into k-mers and their frequency counting, defines a mapping of a sequence ...
DNAsequence decomposition into k-mers and their frequency counting, defines a mapping of a sequence ...
In this work we seek clusters of genomic words in human DNA by studying their inter-word lag distrib...
DNA sequence decomposition into k-mers (substrings of length k) and their frequency counting, define...