A new algorithm is presented for vocabulary analysis (word detection) in texts of human origin. It performs at 60%–70% overall accuracy and greater than 80% accuracy for longer words, and approximately 85% sensitivity on Alice in Wonderland , a considerable improvement on previous methods. When applied to protein sequences, it detects short sequences analogous to words in human texts, i.e. intolerant to changes in spelling (mutation), and relatively context-independent in their meaning (function). Some of these are homonyms of up to 7 amino acids, which can assume different structures in different proteins. Others are ultra-conserved stretches of up to 18 amino acids within proteins of less than 40% overall identity, reflecting extreme cons...
Over the past two decades, many ingenious efforts have been made in protein remote homology detectio...
The enormous growth of biomolecular databases makes it increasingly important to have fast and autom...
The enormous growth of biomolecular databases makes it increasingly important to have fast and autom...
A new algorithm is presented for vocabulary analysis (word detection) in texts of human origin. It p...
A new algorithm is presented for vocabulary analysis (word detection) in texts of human origin. It p...
The amino acid sequences of proteins determine their three-dimensional structures and functions. How...
The amino acid sequences of proteins determine their three-dimensional structures and functions. How...
AbstractProtein structure and function information is coded in amino acid sequences. However, the re...
A current barrier for successful rational drug design is the lack of understanding of the structure ...
A current barrier for successful rational drug design is the lack of understanding of the structure ...
One of the fundamental challenges in computational biology is the identification of evolutionarily r...
Over the past two decades, many ingenious efforts have been made in protein remote homology detectio...
A palindrome is a set of characters that reads the same forwards and backwards. Since the discovery ...
Gene and protein sequence analyses, central components of studies in modem biology are easily amen...
A palindrome is a set of characters that reads the same forwards and backwards. Since the discovery ...
Over the past two decades, many ingenious efforts have been made in protein remote homology detectio...
The enormous growth of biomolecular databases makes it increasingly important to have fast and autom...
The enormous growth of biomolecular databases makes it increasingly important to have fast and autom...
A new algorithm is presented for vocabulary analysis (word detection) in texts of human origin. It p...
A new algorithm is presented for vocabulary analysis (word detection) in texts of human origin. It p...
The amino acid sequences of proteins determine their three-dimensional structures and functions. How...
The amino acid sequences of proteins determine their three-dimensional structures and functions. How...
AbstractProtein structure and function information is coded in amino acid sequences. However, the re...
A current barrier for successful rational drug design is the lack of understanding of the structure ...
A current barrier for successful rational drug design is the lack of understanding of the structure ...
One of the fundamental challenges in computational biology is the identification of evolutionarily r...
Over the past two decades, many ingenious efforts have been made in protein remote homology detectio...
A palindrome is a set of characters that reads the same forwards and backwards. Since the discovery ...
Gene and protein sequence analyses, central components of studies in modem biology are easily amen...
A palindrome is a set of characters that reads the same forwards and backwards. Since the discovery ...
Over the past two decades, many ingenious efforts have been made in protein remote homology detectio...
The enormous growth of biomolecular databases makes it increasingly important to have fast and autom...
The enormous growth of biomolecular databases makes it increasingly important to have fast and autom...