In this paper we study the problem of classifying chemical com-pound datasets. We present a sub-structure-based classifica-tion algorithm that decouples the sub-structure discovery pro-cess from the classification model construction and uses frequent subgraph discovery algorithms to find all topological and geo-metric sub-structures present in the dataset. The advantage of our approach is that during classification model construction, all relevant sub-structures are available allowing the classifier to intelligently select the most discriminating ones. The computa-tional scalability is ensured by the use of highly efficient frequent subgraph discovery algorithms coupled with aggressive feature selection. Our experimental evaluation on eight...
Database for small organic chemical molecules usually contain millions of structures. The screening ...
Over the years, frequent itemset discovery algorithms have been used to find interesting patterns in...
In recent years a number of methods were invented in the data mining/machine learning field that hav...
In this paper we study the problem of classifying chemical compound datasets. We present a sub-stru...
Abstract—Computational techniques that build models to correctly assign chemical compounds to variou...
In this paper we study the problem of classifying chemical compound datasets. We present an algorit...
Structured data represented in the form of graphs arises in several fields of the science and the g...
In recent years the development of computational techniques that build models to correctly assign ch...
In recent years the development of computational techniques that build models to correctly assign ch...
The discovery of the relationships between chemical structure and biological function is central to ...
INTRODUCTION Substructures and subgraphs of chemical structures are becoming increasingly important...
As data mining techniques are being increasingly applied tonon-traditional domains, existing approac...
Metrics for structured data have received an increasing interest in the machine learning community. ...
Substructure mining is a well-established technique used frequently in drug discovery. Its aim is to...
konstanz.de Molecular substructure mining is currently an intensively studied research area. In this...
Database for small organic chemical molecules usually contain millions of structures. The screening ...
Over the years, frequent itemset discovery algorithms have been used to find interesting patterns in...
In recent years a number of methods were invented in the data mining/machine learning field that hav...
In this paper we study the problem of classifying chemical compound datasets. We present a sub-stru...
Abstract—Computational techniques that build models to correctly assign chemical compounds to variou...
In this paper we study the problem of classifying chemical compound datasets. We present an algorit...
Structured data represented in the form of graphs arises in several fields of the science and the g...
In recent years the development of computational techniques that build models to correctly assign ch...
In recent years the development of computational techniques that build models to correctly assign ch...
The discovery of the relationships between chemical structure and biological function is central to ...
INTRODUCTION Substructures and subgraphs of chemical structures are becoming increasingly important...
As data mining techniques are being increasingly applied tonon-traditional domains, existing approac...
Metrics for structured data have received an increasing interest in the machine learning community. ...
Substructure mining is a well-established technique used frequently in drug discovery. Its aim is to...
konstanz.de Molecular substructure mining is currently an intensively studied research area. In this...
Database for small organic chemical molecules usually contain millions of structures. The screening ...
Over the years, frequent itemset discovery algorithms have been used to find interesting patterns in...
In recent years a number of methods were invented in the data mining/machine learning field that hav...