<p>1. Approximately 3.7 million (3,700,103) PubChem compounds were used in training the fragment vectors. Five million compounds (from CID 1 to 5,000,000) were downloaded in SMILES format from PubChem Download Service<sup>1</sup> for this purpose. Inorganic part of salts, charges on certain atoms were neutralized, components of mixtures were split and only one chemical from sets of duplicates were kept. Approximately 100,000 chemicals were held out for future needs.</p> <p>2. Hansen<sup>2</sup> and Bursi<sup>3</sup> Ames mutagenicity benchmark datasets were used. The data was preprocessed in the same way as the PubChem chemicals, except in the case of duplicates, the compound with the highest mutagenicity value was retained, leaving...
The availability of suitable diverse fragment- and lead-oriented screening compounds is key for the ...
Most libraries for fragment-based drug discovery are restricted to 1,000–10,000 compounds, but over ...
In this work, we introduce an automated, efficient, and elegant model to combine all pieces of evide...
This article describes an unsupervised machine learning method for computing distributed vector repr...
The first step in hit optimisation is the identification of the pharmacophore, which is normally ach...
The 4.5 million organic molecules with up to 20 non-hydrogen atoms in PubChem were analyzed using th...
ABSTRACT: Using a benchmark Ames mutagenicity data set, we evaluated the performance of molecular fi...
Description of the files Fragments from libraries: allFragments_annotations.sdf: SDF file contai...
*S Supporting Information ABSTRACT: Analyzing the chemical space coverage in commercial fragment scr...
© 2018 Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim This is, to our knowledge, the most comprehensiv...
Fragment-based drug discovery (FBDD) is well suited for discovering both drug leads and chemical pro...
peer reviewedAbstract Compound (or chemical) databases are an invaluable resource for many scientifi...
Mutagenicity is one of the most important end points of toxicity. Due to high cost and laboriousness...
Fragment-based approaches have now become an important component of the drug discovery process. At t...
For a database of 826 chemicals tested for carcinogenicity, we fragmented the structural formula of ...
The availability of suitable diverse fragment- and lead-oriented screening compounds is key for the ...
Most libraries for fragment-based drug discovery are restricted to 1,000–10,000 compounds, but over ...
In this work, we introduce an automated, efficient, and elegant model to combine all pieces of evide...
This article describes an unsupervised machine learning method for computing distributed vector repr...
The first step in hit optimisation is the identification of the pharmacophore, which is normally ach...
The 4.5 million organic molecules with up to 20 non-hydrogen atoms in PubChem were analyzed using th...
ABSTRACT: Using a benchmark Ames mutagenicity data set, we evaluated the performance of molecular fi...
Description of the files Fragments from libraries: allFragments_annotations.sdf: SDF file contai...
*S Supporting Information ABSTRACT: Analyzing the chemical space coverage in commercial fragment scr...
© 2018 Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim This is, to our knowledge, the most comprehensiv...
Fragment-based drug discovery (FBDD) is well suited for discovering both drug leads and chemical pro...
peer reviewedAbstract Compound (or chemical) databases are an invaluable resource for many scientifi...
Mutagenicity is one of the most important end points of toxicity. Due to high cost and laboriousness...
Fragment-based approaches have now become an important component of the drug discovery process. At t...
For a database of 826 chemicals tested for carcinogenicity, we fragmented the structural formula of ...
The availability of suitable diverse fragment- and lead-oriented screening compounds is key for the ...
Most libraries for fragment-based drug discovery are restricted to 1,000–10,000 compounds, but over ...
In this work, we introduce an automated, efficient, and elegant model to combine all pieces of evide...