Abstract Background The ability to efficiently search and filter datasets depends on access to high quality metadata. While most biomedical repositories require data submitters to provide a minimal set of metadata, some such as the Gene Expression Omnibus (GEO) allows users to specify additional metadata in the form of textual key-value pairs (e.g. sex: female). However, since there is no structured vocabulary to guide the submitter regarding the metadata terms to use, consequently, the 44,000,000+ key-value pairs in GEO suffer from numerous quality issues including redundancy, heterogeneity, inconsistency, and incompleteness. Such issues hinder the ability of scientists to hone in on datasets that meet their requirements and point to a nee...
Genomic data are growing at unprecedented pace, along with new protocols, update polices, formats an...
One of the key challenges of microarray studies is to derive biological insights from the unpreceden...
While there exists an abundance of open biomedical data, the lack of high-quality metadata makes it ...
Background: The ability to efficiently search and filter datasets depends on access to high quality ...
A crucial and limiting factor in data reuse is the lack of accurate, structured, and complete descri...
There is a great deal of interest in analyzing very large data sets in the biomedical sciences. This...
Clustering is a long-standing problem in computer science and is applied in virtually any scientific...
<p>In biomedicine, good metadata is crucial to finding experimental datasets, to understand how expe...
The Gene Expression Omnibus (GEO) contains more than two million digital samples from functional gen...
The Gene Expression Omnibus (GEO) contains more than two million digital samples from functional gen...
Applications of clustering algorithms in biomedical research are ubiquitous, with typical examples i...
<p>Current approaches to metadata discovery are dependent on manual curations which are time consumi...
Abstract—Dealing with data means to group information into a set of categories either in order to le...
The massive volumes of data in biological sequence databases provide a remarkable resource for large...
Conference proceeding from the Fifth IASTED Conference on Computational Intelligence, August 2010, p...
Genomic data are growing at unprecedented pace, along with new protocols, update polices, formats an...
One of the key challenges of microarray studies is to derive biological insights from the unpreceden...
While there exists an abundance of open biomedical data, the lack of high-quality metadata makes it ...
Background: The ability to efficiently search and filter datasets depends on access to high quality ...
A crucial and limiting factor in data reuse is the lack of accurate, structured, and complete descri...
There is a great deal of interest in analyzing very large data sets in the biomedical sciences. This...
Clustering is a long-standing problem in computer science and is applied in virtually any scientific...
<p>In biomedicine, good metadata is crucial to finding experimental datasets, to understand how expe...
The Gene Expression Omnibus (GEO) contains more than two million digital samples from functional gen...
The Gene Expression Omnibus (GEO) contains more than two million digital samples from functional gen...
Applications of clustering algorithms in biomedical research are ubiquitous, with typical examples i...
<p>Current approaches to metadata discovery are dependent on manual curations which are time consumi...
Abstract—Dealing with data means to group information into a set of categories either in order to le...
The massive volumes of data in biological sequence databases provide a remarkable resource for large...
Conference proceeding from the Fifth IASTED Conference on Computational Intelligence, August 2010, p...
Genomic data are growing at unprecedented pace, along with new protocols, update polices, formats an...
One of the key challenges of microarray studies is to derive biological insights from the unpreceden...
While there exists an abundance of open biomedical data, the lack of high-quality metadata makes it ...