10x single cell BAM files

The Chromium Single Cell 3′ Solution is a commercial platform developed by 10x Genomics for preparing single cell cDNA libraries for performing single cell RNA-seq. In addition, 10x Genomics have developed an entire software suite called Cell Ranger that can process the raw BCL files produced by an Illumina sequencer and output a final gene-barcode…

Continue Reading

Interactive plots in R

Interactive plots, as the name suggests, are plots that users can interact with. In my last post, I mentioned that for interactive heatmaps I use the d3heatmap package. To get started with this post, I’ll create the same heatmap as my last post but this time using the d3heatmap package.

Continue Reading

Making a heatmap in R with the pheatmap package

For a while, heatmap.2() from the gplots package was my function of choice for creating heatmaps in R. Then I discovered the superheat package, which attracted me because of the side plots. However, shortly afterwards I discovered pheatmap and I have been mainly using it for all my heatmaps (except when I need to interact…

Continue Reading

Compiling R with GNU Readline

Updated 2018 March 23rd for R-3.4.4 I use a lot of shortcuts provided by GNU Readline. I recently compiled R without Readline support and it was almost unusable! This was because I ran into the error: configure: error: –with-readline=yes (default) and headers/libs are not available To circumvent this I compiled R by running: ./configure –with-readline=no…

Continue Reading

Merging two 10x single cell datasets

I was going to write a post on using the Seurat alignment method as a batch correction tool but as it turned out the two datasets that I chose didn’t seem to have strong batch effects! I heard about the alignment method sometime last year but was motivated to try it out after listening to…

Continue Reading

Annotating variants with a custom file

The Variant Effect Predictor (VEP) tool can be used for annotating variants with respect to custom annotation sources. This is useful if gene models of interest are not represented in the Ensembl or RefSeq databases. To get started, first install VEP since it takes some time.

Continue Reading

Getting started with HISAT, StringTie, and Ballgown

A popular toolset used for analysing RNA-seq data is the tuxedo suite, which consists of TopHat and Cufflinks. The suite provided a start to finish pipeline that allowed users to map reads, assemble transcripts, and perform differential expression analyses. A newer “tuxedo suite” has been developed and is made up of three tools: HISAT, StringTie,…

Continue Reading

7th Anniversary

I reached a million views on 2017 September 27th. Near the start of September, I had wondered if I would reach a million before my 7th anniversary, which is today. I used the traffic to this site to predict when I would hit the mark. Not the best fit. Use only 2017 data to predict….

Continue Reading