Recent applications of recurrent neural networks (RNN) enable training models that sample the chemical space. In this study we train RNN with molecular string representations (SMILES) with a subset of the enumerated database GDB-13 (975 million molecules). We show that a model trained with 1 million structures (0.1% of the database) reproduces 68.9% of the entire database after training, when sampling 2 billion molecules. We also developed a method to assess the quality of the training process using negative log-likelihood plots. Furthermore, we use a mathematical model based on the “coupon collector problem” that compares the trained model to an upper bound and thus we are able to quantify how much it has learned. We also suggest that this...
Chemical autoencoders are attractive models as they combine chemical space navigation with possibili...
Machine learning (ML) and Artificial Intelligence (AI) have had a renaissance during the last few ye...
Chemical autoencoders are attractive models as they combine chemical space navigation with possibili...
Abstract Recent applications of recurrent neural networks (RNN) enable training models that sample t...
Recent applications of recurrent neural networks (RNN) enable training models that sample the chemic...
Recent applications of Recurrent Neural Networks enable training models that sample the chemical spa...
Recurrent Neural Networks (RNNs) trained with a set of molecules represented as unique (canonical) S...
A Recurrent Neural Network (RNN) trained with a set of molecules represented as SMILES strings can g...
This directory contains sets of molecules used to train chemical language models in the paper, "Lear...
Chemical space is a concept to organize molecular diversity by postulating that different molecules ...
Deep generative models of molecules have grown immensely in popularity, trained on relevant datasets...
The number of 'small' molecules that may be of interest to chemical biologists - chemical space - is...
Eine wichtige Aufgabe in der Pharmakologie, Toxikologie und in der Arz- neimittelentwicklung ist die...
Models based on machine learning can enable accurate and fast molecular property predictions, which ...
Generative artificial intelligence models present a fresh approach to chemogenomics and de novo drug...
Chemical autoencoders are attractive models as they combine chemical space navigation with possibili...
Machine learning (ML) and Artificial Intelligence (AI) have had a renaissance during the last few ye...
Chemical autoencoders are attractive models as they combine chemical space navigation with possibili...
Abstract Recent applications of recurrent neural networks (RNN) enable training models that sample t...
Recent applications of recurrent neural networks (RNN) enable training models that sample the chemic...
Recent applications of Recurrent Neural Networks enable training models that sample the chemical spa...
Recurrent Neural Networks (RNNs) trained with a set of molecules represented as unique (canonical) S...
A Recurrent Neural Network (RNN) trained with a set of molecules represented as SMILES strings can g...
This directory contains sets of molecules used to train chemical language models in the paper, "Lear...
Chemical space is a concept to organize molecular diversity by postulating that different molecules ...
Deep generative models of molecules have grown immensely in popularity, trained on relevant datasets...
The number of 'small' molecules that may be of interest to chemical biologists - chemical space - is...
Eine wichtige Aufgabe in der Pharmakologie, Toxikologie und in der Arz- neimittelentwicklung ist die...
Models based on machine learning can enable accurate and fast molecular property predictions, which ...
Generative artificial intelligence models present a fresh approach to chemogenomics and de novo drug...
Chemical autoencoders are attractive models as they combine chemical space navigation with possibili...
Machine learning (ML) and Artificial Intelligence (AI) have had a renaissance during the last few ye...
Chemical autoencoders are attractive models as they combine chemical space navigation with possibili...