Abstract—Since the introduction of next generation sequencing there is a demand for sophisticated methods to classify proteins based on sequence data. The approaches for this task typically involve the alignment among raw sequences and the extraction of discrete high level feature from the protein sequences for recognition. In this paper, we employ two different machine learning methods to perform the task, i.e. Hidden Markov Model and Random Forests. Profile Hidden Markov Models are built from multiple alignment of raw sequence data and amino acid emission and transition parameters are estimated for a given alignment and effectively harness the power of aligning a test protein to a model built form many proteins. Random Forests on the othe...
Hidden Markov model (HMM) techniques are used to model families of biological sequences. A smooth an...
Protein fold recognition is an important step towards solving protein function and tertiary structur...
In this paper, we address the problem of identifying protein functionality using the information con...
Detecting similarity in biological sequences is a key element to understanding the mechanisms of lif...
models in biological sequence analysis The vast increase of data in biology has meant that many aspe...
Abstract: Computational analysis of proteins can be used for structure prediction of newly identifie...
Plötz T, Fink GA. Pattern recognition methods for advanced stochastic protein sequence analysis usin...
Accurately predicting phosphorylation sites in proteins is an important issue in postgenomics, for w...
This tutorial was one of eight tutorials selected to be presented at the Third International Confere...
Three-dimensional protein structures can be divided into classes in which proteins demonstrate high ...
Rigorous computation methods are needed to unleash the power hidden in the DNA and protein sequences...
Data mining is a process that uses a variety of data analysis tools to discover patterns and relatio...
Genome sequencing projects are advancing at a staggering pace and are daily producing large amounts ...
We apply Hidden Markov Models (HMMs) to the problem of statistical modeling and multiple alignment o...
Hidden Markov Models (HMMs) became recently important and popular among bioinformatics researchers, ...
Hidden Markov model (HMM) techniques are used to model families of biological sequences. A smooth an...
Protein fold recognition is an important step towards solving protein function and tertiary structur...
In this paper, we address the problem of identifying protein functionality using the information con...
Detecting similarity in biological sequences is a key element to understanding the mechanisms of lif...
models in biological sequence analysis The vast increase of data in biology has meant that many aspe...
Abstract: Computational analysis of proteins can be used for structure prediction of newly identifie...
Plötz T, Fink GA. Pattern recognition methods for advanced stochastic protein sequence analysis usin...
Accurately predicting phosphorylation sites in proteins is an important issue in postgenomics, for w...
This tutorial was one of eight tutorials selected to be presented at the Third International Confere...
Three-dimensional protein structures can be divided into classes in which proteins demonstrate high ...
Rigorous computation methods are needed to unleash the power hidden in the DNA and protein sequences...
Data mining is a process that uses a variety of data analysis tools to discover patterns and relatio...
Genome sequencing projects are advancing at a staggering pace and are daily producing large amounts ...
We apply Hidden Markov Models (HMMs) to the problem of statistical modeling and multiple alignment o...
Hidden Markov Models (HMMs) became recently important and popular among bioinformatics researchers, ...
Hidden Markov model (HMM) techniques are used to model families of biological sequences. A smooth an...
Protein fold recognition is an important step towards solving protein function and tertiary structur...
In this paper, we address the problem of identifying protein functionality using the information con...