Getting started with Arabidopsis thaliana genomics

I have started to work on Arabidopsis thaliana, as I mentioned in my last post. As noted in the Encyclopedia of life: Arabidopsis thaliana is the most widely used model organism in plant biology. Its small genome size, fully sequenced in the year 2000, chromosome number, fast growth cycle (from seed germination to set in…

Continue Reading

Read GTF file into R

The Gene Transfer Format (GTF) is a refinement of the General Feature Format (GFF). A GFF file has nine columns: seqname The name of the sequence; must be a chromosome or scaffold. source The program that generated this feature. feature The name of this type of feature, e.g. “CDS”, “start_codon”, “stop_codon”, and “exon” start The…

Continue Reading

Getting started with Seurat

This post is outdated; please refer to the official Seurat vignettes for more information. This post follows the Peripheral Blood Mononuclear Cells (PBMCs) tutorial for 2,700 single cells. It was written while I was going through the tutorial and contains my notes. The dataset for this tutorial can be downloaded from the 10X Genomics dataset…

Continue Reading