How many RefSeq gene models have GO terms?

Downloaded RefSeq gene models from UCSC Genome Browser.

Total of 34,565 unique RefSeq ID

Using this script refseq2go.pl join refseq to gene ontology (GO) terms

17,458 / 34,565 have GO terms, about half.

In total there were 169,087 GO terms (10,316 unique) for the 17,458 RefSeq:

50,868 Component
53,144 Function
65,075 Process

Top 10 Components

5156 nucleus
4699 cytoplasm
4101 integral to membrane
3769 membrane
3113 plasma membrane
1892 intracellular
1886 extracellular region
1369 cytosol
1261 mitochondrion
1013 integral to plasma membrane

Top 10 Functions

6588 protein binding
2796 metal ion binding
1975 zinc ion binding
1960 nucleotide binding
1749 DNA binding
1488 ATP binding
1272 receptor activity
1156 transferase activity
982 sequence-specific DNA binding transcription factor activity
956 hydrolase activity

Top 10 Processes

1103 signal transduction
1082 regulation of transcription, DNA-dependent
914 multicellular organismal development
914 regulation of transcription
596 biological_process
574 response to stimulus
545 oxidation reduction
539 cell adhesion
516 ion transport
502 protein phosphorylation

Print Friendly, PDF & Email



Creative Commons License
This work is licensed under a Creative Commons
Attribution 4.0 International License
.
2 comments Add yours

Leave a Reply

Your email address will not be published.

This site uses Akismet to reduce spam. Learn how your comment data is processed.