The paired-end short-read analysis pipeline for RNA-seq data is based on the GTEx [1] and TOPMed [2] analysis pipelines. It is designed for per-sample execution and handles one or more sets of paired FASTQ files per sample.
To meet scalability demands, the pipeline uses the more efficient software implementation from Sentieon. The Sentieon toolkit reimplements the original STAR and GATK algorithms with improved computational efficiency.
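The per-sample input model described above (one or more sets of paired FASTQ files per sample) can be sketched as a small grouping step. This is an illustrative sketch only, not part of the pipeline: the file naming convention `<sample>_<set>_R1.fastq.gz` / `<sample>_<set>_R2.fastq.gz`, the pattern `FASTQ_PATTERN`, and the function `group_paired_fastqs` are all assumptions made for the example.

```python
import re
from collections import defaultdict

# Hypothetical naming convention (an assumption, not taken from the pipeline):
#   <sample>_<set>_R1.fastq.gz and <sample>_<set>_R2.fastq.gz
FASTQ_PATTERN = re.compile(
    r"^(?P<sample>.+)_(?P<set>[^_]+)_R(?P<read>[12])\.fastq\.gz$"
)


def group_paired_fastqs(filenames):
    """Return {sample: [(r1, r2), ...]}, one tuple per paired FASTQ set."""
    mates = defaultdict(dict)  # (sample, set) -> {"1": r1_name, "2": r2_name}
    for name in filenames:
        match = FASTQ_PATTERN.match(name)
        if match is None:
            raise ValueError(f"Unrecognized FASTQ name: {name}")
        mates[(match["sample"], match["set"])][match["read"]] = name

    samples = defaultdict(list)
    for sample, set_id in sorted(mates):
        reads = mates[(sample, set_id)]
        # Every set must contain both mates before it can be analyzed.
        if set(reads) != {"1", "2"}:
            raise ValueError(f"Incomplete pair for {sample}, set {set_id}")
        samples[sample].append((reads["1"], reads["2"]))
    return dict(samples)
```

Each key of the returned dictionary corresponds to one per-sample execution, and each tuple in its list is one paired FASTQ set passed to that execution.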
1: Lonsdale, J., Thomas, J., Salvatore, M. et al. The Genotype-Tissue Expression (GTEx) project. Nat Genet 45, 580–585 (2013). doi: 10.1038/ng.2653
2: Taliun, D., Harris, D.N., Kessler, M.D. et al. Sequencing of 53,831 diverse genomes from the NHLBI TOPMed Program. Nature 590, 290–299 (2021). doi: 10.1038/s41586-021-03205-y