Summary plots from GEMINI

I’m a fan of GEMINI and have been using it for a year and a half on various exome projects. I have written two scripts that generate variant summaries from a GEMINI database as bar plots, which I prefer over the pie charts created by VEP.

A summary pie chart created by VEP.

I’ve shared the two scripts on GitHub Gist.

chmod 755 plot_gemini.*

R is used to create the plots and requires three packages: ggplot2, reshape2, and ggthemes. Install the packages if you haven’t already.

my_required <- c('ggplot2', 'reshape2', 'ggthemes')
install.packages(my_required)  # skip this line if the packages are already installed

If you now run plot_gemini.R without an input file, you should get:

Error: Please provide input file
Execution halted

I’ve put the two scripts in my bin directory, which is in my PATH. Next I just need to change into the directory with my GEMINI database and run the script. The script expects exactly one *.db file in that directory.
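The one-database requirement can be checked before any plotting starts. The following is only a minimal sketch of such a check, not the code from the Gist; the function name find_single_db is hypothetical.

```shell
# Hypothetical helper: print the path of the single *.db file in a
# directory (default: current directory), or fail with an error message.
find_single_db() {
    dir="${1:-.}"
    count=0
    found=""
    for f in "$dir"/*.db; do
        [ -e "$f" ] || continue   # skip the unexpanded pattern when nothing matches
        count=$((count + 1))
        found="$f"
    done
    if [ "$count" -ne 1 ]; then
        echo "Error: expected one *.db file in $dir, found $count" >&2
        return 1
    fi
    printf '%s\n' "$found"
}
```

Calling db=$(find_single_db) || exit 1 at the top of a wrapper script would stop early when the directory holds zero or several databases.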

cd /path/to/gemini/database

# generates five PDFs
ls -1 *.pdf

Refer to the GEMINI documentation for more information on the variant categories (which are based on the Sequence Ontology).

Summary of all variants.

Summary of high impact variants.

Summary of low impact variants.

Summary of medium impact variants.

Summary of variant types.

The two scripts are shown below. In short, they save the result of a GEMINI SQL query as a table and use R to plot it. You can easily modify the scripts to create other summary plots.
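As a rough illustration of the query step, here is a sketch of how variants could be counted per impact for one severity level. It is not the code from the Gist. The helper names impact_sql and summarise_impact are hypothetical, and the sketch assumes the variants table has impact and impact_severity columns with severity values HIGH, MED and LOW, as described in the GEMINI documentation.

```shell
# Hypothetical helper: build the SQL that counts variants per impact
# category for one severity level (assumed values: HIGH, MED, LOW).
impact_sql() {
    printf "select impact, count(*) as n from variants where impact_severity = '%s' group by impact" "$1"
}

# Hypothetical wrapper: run the query with gemini and save the
# tab-delimited result (with a header row) for plotting in R.
# Usage: summarise_impact my.db HIGH high_impact.tsv
summarise_impact() {
    gemini query --header -q "$(impact_sql "$2")" "$1" > "$3"
}
```

The resulting file can then be read in R with read.table(..., header = TRUE, sep = "\t") and passed to ggplot2. Changing the SQL in impact_sql is the simplest way to produce a different summary.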

This work is licensed under a Creative Commons Attribution 4.0 International License.

