The pipeline uses RNA-SeQC to quantify gene expression levels from RNA-seq data. It accepts alignments to the reference genome generated by STAR, with reads sorted by genomic coordinates. The software also computes key quality metrics, such as yield, alignment and duplication rates, transcript coverage, and RNA integrity scores.
rnaseqc collapsed_gene_model.gtf \
deduped.bam \
. \
-s OUT \
--stranded rf
Note: The command above assumes a paired-end, RF/fr-firststrand stranded protocol. The collapsed gene model was generated using GENCODE comprehensive gene annotations. For more detailed information, please refer to the GENCODE documentation under the “Genome Annotations” section.
The pipeline implements RNA-SeQC version 2.4.2. The software call is executed through a Python wrapper.
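As an illustration of how such a wrapper might assemble the call, the sketch below builds the same argument list as the command shown above and passes it to `subprocess`. It is not the pipeline's actual wrapper: the function names `build_rnaseqc_command` and `run_rnaseqc` and their default values are hypothetical, and the code assumes the `rnaseqc` executable is available on the `PATH`.

```python
import shlex
import subprocess
from typing import List

# Strandedness values accepted by the --stranded option.
ALLOWED_STRANDED = {"rf", "fr", "RF", "FR"}


def build_rnaseqc_command(
    gtf: str,
    bam: str,
    out_dir: str = ".",
    sample: str = "OUT",
    stranded: str = "rf",
) -> List[str]:
    """Return the rnaseqc argument list (hypothetical helper, for illustration)."""
    if stranded not in ALLOWED_STRANDED:
        raise ValueError(f"unsupported strandedness: {stranded!r}")
    # Positional arguments: gene model, alignments, output directory.
    return ["rnaseqc", gtf, bam, out_dir, "-s", sample, "--stranded", stranded]


def run_rnaseqc(gtf: str, bam: str, **kwargs: str) -> None:
    """Run rnaseqc and raise CalledProcessError on a non-zero exit status."""
    subprocess.run(build_rnaseqc_command(gtf, bam, **kwargs), check=True)


# Build (but do not execute) the command used in this section.
command = build_rnaseqc_command("collapsed_gene_model.gtf", "deduped.bam")
print(shlex.join(command))
# rnaseqc collapsed_gene_model.gtf deduped.bam . -s OUT --stranded rf
```

Passing the arguments as a list rather than a single shell string avoids quoting problems with file names, and `check=True` makes a failed RNA-SeQC run stop the calling step instead of passing silently.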
All the relevant code can be accessed in the GitHub repository:
1: Original author: Francois Aguet, Modified by: Michele Berselli