Molecular dynamics (MD) simulations present a data-mining challenge, given that they can generate a considerable amount of data but often rely on limited or biased human interpretation to examine their information content. By not asking the right questions of MD data we may miss critical information hidden within it. We combine dimensionality reduction (UMAP) and unsupervised hierarchical clustering (HDBSCAN) to quantitatively characterize prevalent coordination environments of chemical species within MD data. By focusing on local coordination, we significantly reduce the amount of data to be analyzed by extracting all distinct molecular formulas within a given coordination sphere. We then efficiently combine UMAP and HDBSCAN with alignment...
The reshuffling mobility of molecular building blocks in self-assembled micelles is a determinant ke...
This article describes an unsupervised machine learning method for computing distributed vector repr...
We propose an approach for summarizing the output of long simulations of complex systems, affording ...
Unsupervised learning is becoming an essential tool to analyze the increasingly large amounts of dat...
Automated analyses of the outcome of a simulation have been an important part of atomistic modeling ...
Data mining techniques depend strongly on how the data are represented and how distance between samp...
From simple clustering techniques to more sophisticated neural networks, the use of machine learning...
ABSTRACT: Given the large number of crystal structures and NMR ensembles that have been solved to da...
Most of the current understanding of structure–property relations at the molecular and the supramole...
We present an unsupervised data processing workflow that is specifically designed to obtain a fast c...
Molecular dynamics (MD) simulation is the workhorse of various scientific domains but is limited by ...
Thesis (Master's)--University of Washington, 2021Understanding molecules and molecular interactions ...
Data mining techniques depend strongly on how the data are represented and how distance between samp...
Essential Dynamics (ED) is a powerful tool for analyzing molecular dynamics (MD) simulations and it ...
Abstract In many complex molecular systems, the macroscopic ensemble’s properties are...
The reshuffling mobility of molecular building blocks in self-assembled micelles is a determinant ke...
This article describes an unsupervised machine learning method for computing distributed vector repr...
We propose an approach for summarizing the output of long simulations of complex systems, affording ...
Unsupervised learning is becoming an essential tool to analyze the increasingly large amounts of dat...
Automated analyses of the outcome of a simulation have been an important part of atomistic modeling ...
Data mining techniques depend strongly on how the data are represented and how distance between samp...
From simple clustering techniques to more sophisticated neural networks, the use of machine learning...
ABSTRACT: Given the large number of crystal structures and NMR ensembles that have been solved to da...
Most of the current understanding of structure–property relations at the molecular and the supramole...
We present an unsupervised data processing workflow that is specifically designed to obtain a fast c...
Molecular dynamics (MD) simulation is the workhorse of various scientific domains but is limited by ...
Thesis (Master's)--University of Washington, 2021Understanding molecules and molecular interactions ...
Data mining techniques depend strongly on how the data are represented and how distance between samp...
Essential Dynamics (ED) is a powerful tool for analyzing molecular dynamics (MD) simulations and it ...
Abstract In many complex molecular systems, the macroscopic ensemble’s properties are...
The reshuffling mobility of molecular building blocks in self-assembled micelles is a determinant ke...
This article describes an unsupervised machine learning method for computing distributed vector repr...
We propose an approach for summarizing the output of long simulations of complex systems, affording ...