For alignment, the pipeline uses minimap2 on each FASTQ file. The reads are then sorted by genomic coordinates, and an integrity check is performed on the resulting BAM file.
sentieon minimap2 -Y -L --eqx --secondary=no -ax map-ont reference.fasta reads.fastq |
samtools sort -o sorted.bam -
Arguments:
CG
.To confirm the integrity of the alignment BAM file, in-house Python code checks for the presence of the 28-byte empty block representing the EOF marker in BAM format.
Sentieon implementation replicates the original minimap2 code. The pipeline is using Sentieon version 202308.01, corresponding to minimap2 version 2.26.
All the relevant code can be accessed in the GitHub repository: