We introduce a new representation and feature extraction method for biological sequences. We call the general representation bio-vectors (BioVec), with protein-vectors (ProtVec) for proteins (amino-acid sequences) and gene-vectors (GeneVec) for gene sequences. The representation can be widely used in applications of deep learning in proteomics and genomics. In the present paper, we focus on protein-vectors, which can be used in a wide array of bioinformatics investigations such as family classification, protein visualization, structure prediction, disordered protein identification, and protein-protein interaction prediction. In this method, we adopt artificial neural network approaches and represent a protein sequence with ...
A broad and simple definition of `language' is a set of sequences constructed from a finite set of s...
A new protein fold recognition method is described which is both fast and reliable. The method uses ...
Protein-Protein Interactions (PPIs) are a crucial mechanism underpinning the function of the cell. S...
Many life activities and key functions in organisms are maintained by different types of proteinS...
Modern sequencing initiatives have uncovered a large amount of protein sequence data. The exponentia...
A number of protein sequences are found and added to the database, but their functional properties are ...
We propose a feature vector approach to characterize the variation in large data sets of biological ...
This capstone project examines the performance of existing embedding-based alignment-free methods f...
The function of any protein depends directly on its secondary and tertiary structure. Proteins can f...
The classical sequence-structure-function paradigm for proteins illustrates that the amino acid sequ...
To classify proteins into functional families based on their primary sequences, existing classificat...
Protein sequences of the same family typically share common patterns which imply their structural fu...