Bioinformatics: GEOmetadb: powerful alternative search engine for the Gene Expression Omnibus
Abstract
The NCBI Gene Expression Omnibus (GEO) represents the largest public repository of microarray data. However, finding data in GEO can be challenging. We have developed GEOmetadb in an attempt to make querying the GEO metadata both easier and more powerful. All GEO metadata records as well as the relationships between them are parsed and stored in a local MySQL database. A powerful, flexible web search interface with several convenient utilities provides query capabilities not available via NCBI tools. In addition, a Bioconductor package, GEOmetadb that utilizes an SQLite export of the entire GEOmetadb database is also available, rendering the entire GEO database accessible with the full power of SQL-based queries from within R.