AN OPEN ACCESS DATABASE FOR GENE REGULATORY ELEMENT AND POLYMORPHISM ANNOTATION
The Open REGulatory ANNOtation database (ORegAnno) is an open database for the curation of known regulatory elements from scientific literature.
Annotation is collected from users worldwide for various biological assays and is automatically cross-referenced against PubMED,
Entrez Gene, EnsEMBL,
dbSNP, the eVOC: Cell type ontology,
and the Taxonomy database, where appropriate, with information
regarding the original experimentation performed (evidence).
ORegAnno further provides an open validation process for all regulatory annotation in the public
domain. Assigned validators receive notification of new records in the database and are able to cross-reference the citation to ensure record
integrity. Validators have the ability to modify any record (deprecating the old record and creating a new one) if an error is found.
Further, any contributor to the database can comment on any annotation by marking errors, or adding special reports into function as they see fit. These features
of ORegAnno ensure that the collection is of the highest quality and uniquely provides a dynamic view of our changing understanding of gene regulation
in the various genomes. As a first step, we recommend reading through our Help page.
The ORegAnno data and web application are all LGPL open-source to encourage the development
and maintenance of the database to new information and experimentation techniques. Please use our current citation information when referring to ORegAnno data in publication. We encourage interested contributors to send email to the ORegAnno
mailing list at oreganno@bcgsc.ca or to visit the mailing-list archives.
NEWS
|
MOST RECENTLY ANNOTATED PUBLICATIONS
|
October 17th, 2007 Warning: there are persistent case sensitivity issues with the Boolean search features. A fix will be availability shortly. |
September 21st, 2007 There will be an interruption of ORegAnno service on the 21st as the BCGSC systems undergo maintenance |
July 26th, 2007 A set of NRSF/REST binding sites identified using ChipSeq by Caltech/Stanford has been added [more] |
|
|
More news...
|
|
Lin CJ et al., NF-1C, Sp1, and Sp3 are essential for transcription of the human gene for P450c17 (steroid 17alpha-hydroxylase/17,20 lyase) in human adrenal NCI-H295A cells. Mol Endocrinol 2001
Wang XB et al., Two SNPs in the promoter region of the CTLA-4 gene affect binding of transcription factors and are associated with human myasthenia gravis. J Intern Med 2008
Li S et al., Nuclear factor I and mammary gland factor (STAT5) play a critical role in regulating rat whey acidic protein gene expression in transgenic mice. Mol Cell Biol 1995
Amemiya K et al., Adjacent nuclear factor-1 and activator protein binding sites in the enhancer of the neurotropic JC virus. A common characteristic of many brain-specific genes. J Biol Chem 1992
Mink S et al., A mammary cell-specific enhancer in mouse mammary tumor virus DNA is composed of multiple regulatory elements including binding sites for CTF/NFI and a novel transcription factor, mammary cell-activating factor. Mol Cell Biol 1992
|
INCORPORATED DATASETS
ORegAnno provides the ability to incorporate well-established datasets and provide
relevant citation and ongoing maintenance functionality for these sets. For more information read the help pages on datasets or our citation page.
| AVAILABLE NOW |
 |
|
| AVAILABLE NOW |
 |
|
| AVAILABLE NOW |
 |
|
| AVAILABLE NOW |
 |
|
| AVAILABLE NOW |
 |
|
| AVAILABLE NOW |
 |
|
| AVAILABLE NOW |
 |
|
| AVAILABLE NOW |
 |
|
| AVAILABLE NOW |
 |
|
| AVAILABLE NOW |
 |
|
| AVAILABLE NOW |
 |
|
| AVAILABLE NOW |
 |
|
| AVAILABLE NOW |
 |
|
| AVAILABLE NOW |
 |
|
| AVAILABLE NOW |
 |
|
| AVAILABLE NOW |
 |
|
WHY ANOTHER GENE REGULATION DATABASE?
Many databases and datasets exist that annotate regulatory elements in general or from niche areas of gene regulation.
We provide a list on the Other Resources page.
While these databases provide insight into gene regulation, certain challenges exist with each.
In all cases, the data are static or maintained internally, and few or no mechanisms exist in which a community of researchers can
add value to the annotations. Also, in some cases, like TRANSFAC, the latest data does not exists in the public domain.
In some cases, like TRRD, the data are difficult
to find or access in any convenient way. In MPromDB, the data
are limited to human, mouse, and rat, and are not accessible programatically. EPD, SCPD, PlantCARE, HemoPDB, and OPD are examples of databases that target only certain types of promoters. In dbQSNP, existing functional variation is ignored, despite it contributing new insight into candidate promoter polymorphisms.
In every database, negative controls are disregarded; no information is supplied on regions searched, or instances where annotation was unsuccessfully tested for transcription-affecting properties. This is problematic when the development of sophisticated computational biology
techniques depends on being able to access this information. To address these issues we have created an open annotation and open source database initiative that promotes
the development of a unified resource for this information. ORegAnno uses advanced programming techniques and APIs suchs as Hibernate, Lucene,
JAAS, MD5, Axis Web Services, Struts, and JSPs to deliver its data either graphically through this website or programatically through Web Services to any location in
the world.
|