Explainable and interpretable unsupervised machine learning helps one to understand the underlying structure of data. We introduce an ensemble analysis of machine learning models to consolidate their interpretation. Its application shows that restricted Boltzmann machines compress consistently into a few bits the information stored in a sequence of five amino acids at the start or end of α-helices or β-sheets. The weights learned by the machines reveal unexpected properties of the amino acids and the secondary structure of proteins: (i) His and Thr have a negligible contribution to the amphiphilic pattern of α-helices; (ii) there is a class of α-helices particularly rich in Ala at their end; (iii) Pro occupies most often slots otherwise occ...
Background Identification of the tertiary structure (3D structure) of a protein is a fundamental pr...
Prediction of structural classes of proteins has been pursued using various features of proteins suc...
The field of machine learning, which aims to develop computer algorithms that improve with experienc...
Explainable and interpretable unsupervised machine learning helps one to understand the underlying s...
AbstractThere are several possibilities for definition and derivation of sequence patterns associate...
Restricted Boltzmann machines (RBM) are graphical models that learn jointly a probability distributi...
The secondary and tertiary structure of a protein has a primary role in determining its function. Ev...
Machine learning (ML) is ubiquitous in bioinformatics, due to its versatility. One of the most cruci...
Many of the central questions in bioinformatics relate to protein structure and function. We are mai...
Recent advances in sequencing and synthesis technologies have sparked extraordinary growth in large-...
Many of the central questions in bioinformatics relate to protein structure and function. We are mai...
A goal of unsupervised machine learning is to disentangle representations of complex high-dimensiona...
The ubiquitous availability of genome sequencing data explains the popularity of machine learning-ba...
An important open problem in molecular biology is how to use computational methods to understand the...
International audienceStatistical models for families of evolutionary related proteins have recently...
Background Identification of the tertiary structure (3D structure) of a protein is a fundamental pr...
Prediction of structural classes of proteins has been pursued using various features of proteins suc...
The field of machine learning, which aims to develop computer algorithms that improve with experienc...
Explainable and interpretable unsupervised machine learning helps one to understand the underlying s...
AbstractThere are several possibilities for definition and derivation of sequence patterns associate...
Restricted Boltzmann machines (RBM) are graphical models that learn jointly a probability distributi...
The secondary and tertiary structure of a protein has a primary role in determining its function. Ev...
Machine learning (ML) is ubiquitous in bioinformatics, due to its versatility. One of the most cruci...
Many of the central questions in bioinformatics relate to protein structure and function. We are mai...
Recent advances in sequencing and synthesis technologies have sparked extraordinary growth in large-...
Many of the central questions in bioinformatics relate to protein structure and function. We are mai...
A goal of unsupervised machine learning is to disentangle representations of complex high-dimensiona...
The ubiquitous availability of genome sequencing data explains the popularity of machine learning-ba...
An important open problem in molecular biology is how to use computational methods to understand the...
International audienceStatistical models for families of evolutionary related proteins have recently...
Background Identification of the tertiary structure (3D structure) of a protein is a fundamental pr...
Prediction of structural classes of proteins has been pursued using various features of proteins suc...
The field of machine learning, which aims to develop computer algorithms that improve with experienc...