Data mining of protein databases poses special challenges because many protein databases are non-relational, whereas most data mining and machine learning algorithms assume that the input is a relational table that can also be represented as an ARFF file. We developed a method to restructure protein databases so that they become amenable to a range of data mining and machine learning tools. Our restructuring method enabled us to apply both decision tree and support vector machine (SVM) classifiers to a pancreatic protein database. The SVM classifier that used both GO terms and PFAM families to characterize proteins gave over 73% accuracy in predicting whether a protein is involved in pancreatic cancer.
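As a rough illustration of the kind of restructuring described above, the sketch below flattens set-valued protein annotations into one binary attribute per GO term or PFAM family and emits an ARFF file. It is not the authors' method: the record layout, the field names (`go`, `pfam`, `cancer`), the helper `to_arff`, and the toy proteins are all assumptions made for this example.

```python
# Hypothetical toy records: identifiers, annotations and labels are invented
# for illustration and are not taken from the pancreatic protein database.
records = [
    {"id": "P1", "go": {"GO:0005515", "GO:0006915"}, "pfam": {"PF00069"}, "cancer": True},
    {"id": "P2", "go": {"GO:0005515"}, "pfam": set(), "cancer": False},
    {"id": "P3", "go": set(), "pfam": {"PF00069", "PF07714"}, "cancer": True},
]


def to_arff(records, relation="proteins"):
    """Flatten set-valued annotations into one binary column per term."""
    go_terms = sorted({t for r in records for t in r["go"]})
    pfam_ids = sorted({f for r in records for f in r["pfam"]})

    lines = [f"@RELATION {relation}", ""]
    for name in go_terms + pfam_ids:
        # Quote attribute names because GO identifiers contain a colon.
        lines.append(f"@ATTRIBUTE '{name}' {{0,1}}")
    lines.append("@ATTRIBUTE cancer {yes,no}")
    lines += ["", "@DATA"]

    for r in records:
        # 1 if the protein carries the annotation, 0 otherwise.
        row = [str(int(t in r["go"])) for t in go_terms]
        row += [str(int(f in r["pfam"])) for f in pfam_ids]
        row.append("yes" if r["cancer"] else "no")
        lines.append(",".join(row))
    return "\n".join(lines) + "\n"


arff_text = to_arff(records)
print(arff_text)
```

Each protein becomes one fixed-width row, which is the shape that decision tree and SVM implementations generally expect. A real database would produce many sparse columns, so a sparse ARFF encoding or feature selection may be worth considering.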
The Pancreatic Expression Database (PED, http://www.pancreasexpression.org) continues to be a major ...
The principal topic of this work is the application of data mining techniques, in particular of mach...
An early diagnosis of cancer is crucial to improving the survival rate and prolonging the lives of p...
This paper considers two types of protein data. First, data about protein function described in a nu...
Pancreatic cancer (PC) is a highly malignant tumor derived from pancreas tissue and is one of the le...
This report presents an approach to predict pancreatic cancer using Support Vector Machine Classific...
Biocuration in the omics sciences has become paramount, as research in these fields rapidly evolves ...
Background: We present an effective, rapid, systematic data mining approach for identifying ...
Supervised learning methods are used when one wants to construct a classifier. To use such a method,...
The proliferation of biological databases and the easy access enabled by the Internet are having a be...
Copyright information: Taken from "Pancreatic Expression database: a generic model f...
Pancreatic cancer is one of the most fatal types of cancer because it is difficult to diagnose ...
Copyright © 2015 Fei Yuan et al. This is an open access article distributed under the Creative Common...