Overview

The gene association files ingested from GO Consortium members are shown in the table below. Files are in the GO annotation file format and are compressed using the UNIX gzip utility. Please see the upstream resource information for further details on the annotation set. Any errors or omissions in annotations should be reported by writing to the GO Helpdesk.


Filtered Files

These files are taxon-specific and reflect the work of specific projects, primarily the model organisms database groups, to provide comprehensive, non-redundant annotation files for their organism. All the files in this table have been filtered using the annotation file QC pipeline. A major component to the filtering is the requirement that particular taxon IDs can only be included within the association files provided by specific projects; the current list of authoritative groups and major model organisms can be found below.


Filtered Annotation File Downloads for 2024-12-22 release

Species/Database Entity type Annotations File
Species/Database Entity type Annotations File
Dictyostelium discoideum
dictyBase (dictyBase)
n/a 81783 dictybase.gaf (gzip)
Mus musculus
Mouse Genome Informatics (mgi)
n/a 407233 mgi.gaf (gzip)
Solanaceae
Sol Genomics Network (sgn)
gene 1354 sgn.gaf (gzip)
Sus scrofa
EBI Gene Ontology Annotation Database (goa)
protein 158842 goa_pig.gaf (gzip)
Danio rerio
Zebrafish Information Network (zfin)
n/a 224901 zfin.gaf (gzip)
Escherichia coli
Encyclopedia of E. coli metabolism (ecocyc)
n/a 58742 ecocyc.gaf (gzip)
Rattus norvegicus
Rat Genome Database (rgd)
n/a 482190 rgd.gaf (gzip)
Saccharomyces cerevisiae
Saccharomyces Genome Database (sgd)
n/a 120713 sgd.gaf (gzip)
Schizosaccharomyces pombe
PomBase (pombase)
n/a 51801 pombase.gaf (gzip)
Plasmodium falciparum
GeneDB (genedb)
n/a 10678 genedb_pfalciparum.gaf (gzip)
Pseudomonas aeruginosa
Pseudomonas Genome Project (pseudocap)
n/a 3612 pseudocap.gaf (gzip)
Drosophila melanogaster
FlyBase (fb)
n/a 135612 fb.gaf (gzip)
Homo sapiens
EBI Gene Ontology Annotation Database (goa)
protein 782020 goa_human.gaf (gzip)
Caenorhabditis elegans
WormBase database of nematode biology (wb)
n/a 121444 wb.gaf (gzip)
Bos taurus
EBI Gene Ontology Annotation Database (goa)
protein 163910 goa_cow.gaf (gzip)
Leishmania major
GeneDB (genedb)
n/a 9858 genedb_lmajor.gaf (gzip)
Xenopus
Xenbase (xenbase)
n/a 302633 xenbase.gaf (gzip)
Schizosaccharomyces japonicus
JaponicusDB (japonicusdb)
n/a 34698 japonicusdb.gaf (gzip)
Multi-species
Reactome - a curated knowledgebase of biological pathways (reactome)
n/a 101420 reactome.gaf (gzip)
Multi-species
Candida Genome Database (cgd)
n/a 362699 cgd.gaf (gzip)
Gallus gallus
EBI Gene Ontology Annotation Database (goa)
protein 192626 goa_chicken.gaf (gzip)
Canis lupus familiaris
EBI Gene Ontology Annotation Database (goa)
protein 152544 goa_dog.gaf (gzip)
Trypanosoma brucei
GeneDB (genedb)
n/a 20209 genedb_tbrucei.gaf (gzip)
Arabidopsis thaliana
The Arabidopsis Information Resource (tair)
n/a 235783 tair.gaf (gzip)

Copyright © 1999-2024 the Gene Ontology (CC-BY 4.0)
HelpdeskCitation/attributionTerms of use
Member of the Open Biological and Biomedical Ontologies

The Gene Ontology Consortium is funded by the National Human Genome Research Institute (US National Institutes of Health), grant number HG012212, with co-funding by NIGMS.