pipelines-docs

Gene Quantification

The pipeline uses RNA-SeQC to quantify gene expression levels from RNA-seq data. It accepts alignments to the reference genome generated by STAR, with reads sorted by genomic coordinates. The software also compute key quality metrics, such as yield, alignment and duplication rates, transcript coverage, and RNA integrity scores.

Quantifying Gene Expression

Gene expression quantification
rnaseqc collapsed_gene_model.gtf \
        deduped.bam \
        . \
        -s OUT \
        --stranded rf

Note: The above command assumes a paired-end, RF/fr-firststrand stranded protocol. The collapsed gene model was generate using GENCODE comprehensive gene annotations. For more detailed information please refer to the GENCODE documentation under “Genome Annotations” section.

Implementation with RNA-SeQC

The pipeline implements RNA-SeQC version 2.4.2. The software call is executed through a Python wrapper.

Source Code

All the relevant code can be accessed in the GitHub repository:

1: Original author: Francois Aguet, Modified by: Michele Berselli


Home - Overview - Alignment - Duplicate Reads - Transcripts - Genes