The EcoGene project involves the examination of Escherichia coli K-12 DNA sequences and accompanying annotation in the public databases in order to refine the representation and prediction of the entire set of E. coli K-12 chromosomally encoded protein sequences. The results of this ongoing effort have been deposited in the SWISSPROT protein sequence database as sequencing of the E. coli genome has progressed to completion in recent years. Through this continuing research, we have discovered that the prediction of low molecular weight (small) proteins, arbitrarily defined as protein sequences ≤ 150 amino acids (aa) in length, is problematic and requires special attention. We describe the small protein subset of EcoGene and the approach...
The number of gene products available for structural and functional study is increasing at an unprec...
The function of proteins is often mediated by short linear segments of their amino acid sequence, ca...
We present the sequence of 176 kilobases of the Escherichia coli K-12 genome, from katG at 89.2 to a...
Motivation: Accurate prediction of genes encoding small proteins (on the order of 50 amino acids or ...
ABSTRACT Small proteins consisting of 50 or fewer amino acids have been identified as regulators of ...
Genome sequences are available for increasing numbers of organisms. The proteomes (protein complemen...
In the past, short protein-coding genes were often disregarded by genome annotation pipelines. Trans...
The ensemble of proteins in a bacterial species holds relevance for understanding the biochemical and met...
Proteins with molecular weights of <25 kDa are involved in major biological processes such as riboso...
searched Escherichia coli genome offers the opportunity to explore the value of using protein famili...
Forty-two protein spots of observed Mr 6–15 kDa were resolved by two-dimensional gel electrophoresis...
Much attention is being paid to protein databases as an important information source for proteome re...