The principal topic of this work is the application of data mining techniques, in particular machine learning, to the discovery of knowledge in a protein database. The first chapter presents general background: section 1.1 gives an overview of the methodology of a data mining project and its main algorithms, and section 1.2 introduces proteins and their supporting file formats. The chapter concludes with section 1.3, which defines the main problem this work intends to address: determining whether an amino acid is exposed or buried in a protein, in a discrete (i.e., non-continuous) way, for five exposure levels: 2%, 10%, 20%, 25% and 30%. In the second chapter, following closely the CRISP-DM me...
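The exposed/buried labelling described in the first abstract can be sketched as follows. This is only an illustration, not the author's method: it assumes each residue already has a relative solvent accessibility value expressed in percent, and it assumes a residue counts as exposed when that value is greater than or equal to the level. The function name `label_exposure` and the constant `EXPOSURE_LEVELS` are hypothetical; only the five levels come from the abstract.

```python
# Exposure levels (in percent) quoted in the abstract.
EXPOSURE_LEVELS = (2, 10, 20, 25, 30)


def label_exposure(relative_accessibility, levels=EXPOSURE_LEVELS):
    """Return one 'exposed'/'buried' label per exposure level.

    relative_accessibility: relative solvent accessibility of one residue,
    in percent (assumed to be computed elsewhere).
    Assumption: value >= level means 'exposed'; the source does not say
    whether the comparison is strict.
    """
    if relative_accessibility < 0:
        raise ValueError("relative accessibility cannot be negative")
    return {
        level: "exposed" if relative_accessibility >= level else "buried"
        for level in levels
    }


if __name__ == "__main__":
    # A residue at 22% is exposed at the 2%, 10% and 20% levels only.
    for level, label in label_exposure(22.0).items():
        print(f"{level:>2}%: {label}")
```

Treating each level as a separate binary label turns the task into five independent two-class classification problems, which is one plausible reading of "in a discrete way".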
Protein interactions mediate all biological systems in a cell; understanding their interactions me...
Mining biological data is an emergent area at the intersection between bioinformatics and data minin...
Data mining of protein databases poses special challenges because many protein databases are non-rel...
Dissertation presented to obtain the degree of Doctor in Biochemistry, Structural Biochemistry, from the U...
Computerized applications are employed all around the world, and an enormous amount of data is collected...
Proteins are composed of twenty different types of amino acids, small organic molecules with differ...
The goal of this thesis is to develop a computational method based on machine learning techniques fo...
Includes bibliographical references (leaves 75-80). Proteins are organic compounds made up of chains ...
In this thesis, the author aims to improve the accuracy of protein structural prediction...
This thesis concerns two areas of bioinformatics related by their role in protein structure and func...
Classification is a data mining task that has been useful in several application areas, particularly...
The most significant impediment for protein structure prediction is the inadequacy of conformation s...
Proteins perform a wide variety of biological functions. Knowledge of the three-dim...
The property of proteins of binding to one another in a highly specific manner, forming com...