Overview

The gene association files ingested from GO Consortium members are shown in the table below. Files are in the GO Annotation File (GAF) format and are compressed with the UNIX gzip utility. Please see the upstream resource information for further details on each annotation set. Errors or omissions in annotations should be reported to the GO Helpdesk.
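
As an illustration of working with these files, the following is a minimal sketch of reading a gzip-compressed GAF file in Python and counting its annotation rows. It assumes the usual GAF conventions: tab-separated columns, and header or comment lines beginning with "!". The function names and the sample rows are made up for the example and are not part of any GO tooling.

```python
import gzip
import io


def iter_gaf_rows(lines):
    """Yield each annotation row as a list of tab-separated fields."""
    for line in lines:
        line = line.rstrip("\n")
        # Skip blank lines and header/comment lines, which start with "!".
        if not line or line.startswith("!"):
            continue
        yield line.split("\t")


def count_annotations(path_or_file):
    """Count annotation rows in a gzip-compressed GAF file.

    Accepts a filesystem path or an open binary file object.
    """
    with gzip.open(path_or_file, "rt", encoding="utf-8") as handle:
        return sum(1 for _ in iter_gaf_rows(handle))


# Made-up, shortened rows used only to exercise the functions.
sample_text = (
    "!gaf-version: 2.2\n"
    "EXAMPLE\tEX:0001\tgeneA\t\tGO:0008150\n"
    "EXAMPLE\tEX:0002\tgeneB\t\tGO:0003674\n"
)
sample_gz = gzip.compress(sample_text.encode("utf-8"))
sample_count = count_annotations(io.BytesIO(sample_gz))
print(sample_count)
```

For a downloaded file, the same function could be called with a local path, for example `count_annotations("mgi.gaf.gz")`, where the local filename is an assumption about how the download was saved.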


Filtered Files

These files are taxon-specific and reflect the work of specific projects, primarily the model organism database groups, to provide comprehensive, non-redundant annotation files for their organism. All the files in this table have been filtered using the annotation file QC pipeline. A major component of the filtering is the requirement that annotations for particular taxon IDs can only be included in the association files provided by specific projects; the current list of authoritative groups and major model organisms can be found below.
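
The taxon rule described above can be sketched roughly as follows. This is not the QC pipeline's actual code: the `AUTHORITY` mapping, the function name, and the choice to look only at the first taxon listed in a row are assumptions made for illustration. The sketch does rely on the GAF convention that the taxon is the 13th column, written as "taxon:" followed by an NCBI taxon ID.

```python
# Illustrative mapping from taxon ID to the group assumed to be
# authoritative for it. The real list is maintained by the GO Consortium.
AUTHORITY = {
    "taxon:10090": "mgi",  # Mus musculus
    "taxon:6239": "wb",    # Caenorhabditis elegans
}

TAXON_COL = 12  # zero-based index of GAF column 13 (Taxon)


def is_allowed(row, source_group, authority=AUTHORITY):
    """Return True if a row may appear in the file supplied by source_group.

    A row is rejected only when its taxon has an authoritative group
    and that group is not the one supplying the file.
    """
    # The taxon column may hold two IDs separated by "|"; the first one
    # identifies the organism of the annotated gene product.
    primary_taxon = row[TAXON_COL].split("|")[0]
    owner = authority.get(primary_taxon)
    return owner is None or owner == source_group


def filter_rows(rows, source_group, authority=AUTHORITY):
    """Keep only the rows that pass the taxon authority check."""
    return [r for r in rows if is_allowed(r, source_group, authority)]
```

Under these assumptions, a mouse annotation arriving in a file from a group other than `mgi` would be dropped, while annotations for taxa with no entry in the mapping pass through unchanged.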


Filtered Annotation File Downloads for 2024-07-19 release

Species                 Database                                      Entity type  Annotations  File
Mus musculus            Mouse Genome Informatics (mgi)                n/a          379350       mgi.gaf (gzip)
Sus scrofa              EBI Gene Ontology Annotation Database (goa)   protein      151376       goa_pig.gaf (gzip)
Escherichia coli        Encyclopedia of E. coli metabolism (ecocyc)   n/a          57516        ecocyc.gaf (gzip)
Pseudomonas aeruginosa  Pseudomonas Genome Project (pseudocap)        n/a          3617         pseudocap.gaf (gzip)
Homo sapiens            EBI Gene Ontology Annotation Database (goa)   protein      769674       goa_human.gaf (gzip)
Caenorhabditis elegans  WormBase database of nematode biology (wb)    n/a          129178       wb.gaf (gzip)
Bos taurus              EBI Gene Ontology Annotation Database (goa)   protein      133337       goa_cow.gaf (gzip)
Gallus gallus           EBI Gene Ontology Annotation Database (goa)   protein      131169       goa_chicken.gaf (gzip)
Canis lupus familiaris  EBI Gene Ontology Annotation Database (goa)   protein      146134       goa_dog.gaf (gzip)

Copyright © 1999-2024 the Gene Ontology (CC-BY 4.0)
Member of the Open Biological and Biomedical Ontologies

The Gene Ontology Consortium is funded by the National Human Genome Research Institute (US National Institutes of Health), grant number HG012212, with co-funding by NIGMS.