Abstract — This paper presents a new algorithm based on the segment and combine paradigm, for automatic classification of biological sequences. It classifies sequences by aggregating the information about their subsequences predicted by a classifier derived by machine learning from a random sample of training subsequences. This generic approach is combined with decision tree based ensemble methods, scalable both with respect to sample size and vocabulary size. The method is applied to three families of problems: DNA sequence recognition, splice junction detection, and gene regulon prediction. With respect to standard approaches based on n-grams, it appears competitive in terms of accuracy, flexibility, and scalability. The paper also highli...
DNA is the building block of life, which contains encoded genetic instructions for building living o...
In this dissertation we analyze biological sequences using two proposed methods of characterization....
Abstract:- Biological data mining has become an important research area in recent years due to the e...
peer reviewedThis paper presents a new algorithm based on the segment and combine paradigm, for auto...
In recent years we have witnessed an exponential increase in the amount of biological information, e...
In this work we consider an inference task that biologists are very good at: deciphering biological ...
Many open problems in bioinformatics involve elucidating underlying functional signals in biological...
this paper we present a novel methodology for sequence classification, based on sequential pattern m...
Doctor of PhilosophyDepartment of Computing and Information SciencesDoina CarageaRecent advancements...
Inductive learning methods, such as neural networks and decision trees, have become a popular approa...
This chapter focuses on the use of ensembles of classifiers in Bioinformatics. Due to the complex re...
Abstract The study proposes a novel model for DNA sequence classification that combines machine lear...
The applications of machine learning algorithms to the analysis of data sets of DNA sequences are ve...
Many open problems in bioinformatics involve elucidating underlying functional signals in biological...
This paper presents an application of methods from the machine learning domain to solving the task o...
DNA is the building block of life, which contains encoded genetic instructions for building living o...
In this dissertation we analyze biological sequences using two proposed methods of characterization....
Abstract:- Biological data mining has become an important research area in recent years due to the e...
peer reviewedThis paper presents a new algorithm based on the segment and combine paradigm, for auto...
In recent years we have witnessed an exponential increase in the amount of biological information, e...
In this work we consider an inference task that biologists are very good at: deciphering biological ...
Many open problems in bioinformatics involve elucidating underlying functional signals in biological...
this paper we present a novel methodology for sequence classification, based on sequential pattern m...
Doctor of PhilosophyDepartment of Computing and Information SciencesDoina CarageaRecent advancements...
Inductive learning methods, such as neural networks and decision trees, have become a popular approa...
This chapter focuses on the use of ensembles of classifiers in Bioinformatics. Due to the complex re...
Abstract The study proposes a novel model for DNA sequence classification that combines machine lear...
The applications of machine learning algorithms to the analysis of data sets of DNA sequences are ve...
Many open problems in bioinformatics involve elucidating underlying functional signals in biological...
This paper presents an application of methods from the machine learning domain to solving the task o...
DNA is the building block of life, which contains encoded genetic instructions for building living o...
In this dissertation we analyze biological sequences using two proposed methods of characterization....
Abstract:- Biological data mining has become an important research area in recent years due to the e...