Although convolutional neural networks (CNNs) have been applied to a variety of computational genomics problems, there remains a large gap in our understanding of how they build representations of regulatory genomic sequences. Here we perform systematic experiments on synthetic sequences to reveal how CNN architecture, specifically convolutional filter size and max-pooling, influences the extent that sequence motif representations are learned by first layer filters. We find that CNNs designed to foster hierarchical representation learning of sequence motifs – assembling partial features into whole features in deeper layers – tend to learn distributed representations, i.e. partial motifs. On the other hand, CNNs that are designed to limit th...
A central challenge in population genetics is the detection of genomic footprints of selection. As m...
The detection of regulatory sequences in DNA is a challenging problem, especially when considered in...
In metagenomic analyses the rapid and accurate identification of DNA sequences is important. This is...
Although convolutional neural networks (CNNs) have been applied to a variety of computational genomi...
ABSTRACT Deep convolutional neural networks (CNNs) trained on regulatory genomic sequences tend to b...
Deep convolutional networks trained on regulatory genomic sequences tend to learn distributed repres...
A common goal in the convolutional neural network (CNN) modeling of genomic data is to discover spec...
Thesis: S.M., Massachusetts Institute of Technology, Department of Electrical Engineering and Comput...
Over the past decade, neural networks have been successful at making predictions from biological seq...
High-throughput sequencing (HTS) has led to many breakthroughs in basic and translational biology re...
International audienceThe growing number of annotated biological sequences available makes it possib...
Transcription factors (TFs) bind DNA by recognizing specific sequence motifs, typically of length 6-...
DNA sequences are the basic data type that is processed to perform a generic study of biological dat...
Convolutional neural networks (CNNs) have achieved significant advancements in biological sequence a...
Over the past decade, neural networks have been successful at making predictions from biological seq...
A central challenge in population genetics is the detection of genomic footprints of selection. As m...
The detection of regulatory sequences in DNA is a challenging problem, especially when considered in...
In metagenomic analyses the rapid and accurate identification of DNA sequences is important. This is...
Although convolutional neural networks (CNNs) have been applied to a variety of computational genomi...
ABSTRACT Deep convolutional neural networks (CNNs) trained on regulatory genomic sequences tend to b...
Deep convolutional networks trained on regulatory genomic sequences tend to learn distributed repres...
A common goal in the convolutional neural network (CNN) modeling of genomic data is to discover spec...
Thesis: S.M., Massachusetts Institute of Technology, Department of Electrical Engineering and Comput...
Over the past decade, neural networks have been successful at making predictions from biological seq...
High-throughput sequencing (HTS) has led to many breakthroughs in basic and translational biology re...
International audienceThe growing number of annotated biological sequences available makes it possib...
Transcription factors (TFs) bind DNA by recognizing specific sequence motifs, typically of length 6-...
DNA sequences are the basic data type that is processed to perform a generic study of biological dat...
Convolutional neural networks (CNNs) have achieved significant advancements in biological sequence a...
Over the past decade, neural networks have been successful at making predictions from biological seq...
A central challenge in population genetics is the detection of genomic footprints of selection. As m...
The detection of regulatory sequences in DNA is a challenging problem, especially when considered in...
In metagenomic analyses the rapid and accurate identification of DNA sequences is important. This is...