Doctor of PhilosophyDepartment of Computing and Information SciencesDoina CarageaRecent advancements in biological sciences have resulted in the availability of large amounts of sequence data (DNA and protein sequences). Biological sequence data can be annotated using machine learning techniques, but most learning algorithms require data to be represented by a vector of features. In the absence of biologically informative features, k-mers generated using a sliding window-based approach are commonly used to represent biological sequences. A larger k value typically results in better features; however, the number of k-mer features is exponential in k, and many k-mers are not informative. Feature selection is widely used to reduce the dim...
Many open problems in bioinformatics involve elucidating underlying functional signals in biological...
International audienceFeature extraction is an unavoidable task, especially in the critical step of ...
Most existing methods for sequence-based classification use exhaustive feature generation, employing...
Doctor of PhilosophyDepartment of Computing and Information SciencesDoina CarageaRecent advancements...
DNA sequence decomposition into k-mers (substrings of length k) and their frequency counting, define...
DNA sequence decomposition into k-mers (substrings of length k) and their frequency counting, define...
Background: Many open problems in bioinformatics involve elucidating underlying functional signals i...
DNAsequence decomposition into k-mers and their frequency counting, defines a mapping of a sequence ...
DNAsequence decomposition into k-mers and their frequency counting, defines a mapping of a sequence ...
<div><p>Background</p><p>Many open problems in bioinformatics involve elucidating underlying functio...
Many open problems in bioinformatics involve elucidating underlying functional signals in biological...
International audienceFeature extraction is an unavoidable task, especially in the critical step of ...
International audienceFeature extraction is an unavoidable task, especially in the critical step of ...
International audienceFeature extraction is an unavoidable task, especially in the critical step of ...
Recent advances in next-generation sequencing technologies have resulted in an exponential increase ...
Many open problems in bioinformatics involve elucidating underlying functional signals in biological...
International audienceFeature extraction is an unavoidable task, especially in the critical step of ...
Most existing methods for sequence-based classification use exhaustive feature generation, employing...
Doctor of PhilosophyDepartment of Computing and Information SciencesDoina CarageaRecent advancements...
DNA sequence decomposition into k-mers (substrings of length k) and their frequency counting, define...
DNA sequence decomposition into k-mers (substrings of length k) and their frequency counting, define...
Background: Many open problems in bioinformatics involve elucidating underlying functional signals i...
DNAsequence decomposition into k-mers and their frequency counting, defines a mapping of a sequence ...
DNAsequence decomposition into k-mers and their frequency counting, defines a mapping of a sequence ...
<div><p>Background</p><p>Many open problems in bioinformatics involve elucidating underlying functio...
Many open problems in bioinformatics involve elucidating underlying functional signals in biological...
International audienceFeature extraction is an unavoidable task, especially in the critical step of ...
International audienceFeature extraction is an unavoidable task, especially in the critical step of ...
International audienceFeature extraction is an unavoidable task, especially in the critical step of ...
Recent advances in next-generation sequencing technologies have resulted in an exponential increase ...
Many open problems in bioinformatics involve elucidating underlying functional signals in biological...
International audienceFeature extraction is an unavoidable task, especially in the critical step of ...
Most existing methods for sequence-based classification use exhaustive feature generation, employing...