
# Description of the Workflow

After DNA sequencing, data is transferred to NYU's phoenix compute cluster, where automated programs and scripts submit the sequence reads for demultiplexing. The demultiplexed reads are then analyzed by the targeted exome pipeline, which consists of two programs: sns, which implements a standard exome variant-calling pipeline tailored to phoenix's software environment, and snsxt, which adds analysis and reporting steps customized for the NGS580 gene panel.
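The hand-off into the pipeline can be sketched as a short series of commands. This is a minimal illustration, not the production automation: the directory paths are placeholders, and the exact sns subcommands assumed here (`gather-fastqs`, `generate-settings`, `run wes`) follow the sns project's documented usage but should be checked against the installed version.

```shell
# Hypothetical invocation sketch; paths and arguments are placeholders.

# Collect demultiplexed FASTQ files into a samples table
sns/gather-fastqs /data/sequencing/run_id/fastq/

# Configure the pipeline for the hg19 reference genome
sns/generate-settings hg19

# Launch the standard whole-exome (wes) variant-calling route
sns/run wes
```

In the production workflow these steps are triggered automatically after demultiplexing rather than typed interactively, and snsxt runs afterward on the sns output directory.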

## Pipeline Description

After demultiplexing, low-quality bases are trimmed with Trimmomatic. Reads are then aligned to the hg19 reference genome with BWA-MEM and passed through Sambamba for quality filtering and deduplication. Quality metrics are computed with the Rsamtools package and custom R scripts, and depth of coverage at target regions is measured with GATK DepthOfCoverage. The filtered reads are used for copy number variant analysis in a custom pipeline built around CNVkit, and are passed through a standard GATK pipeline for realignment and recalibration before variant calling with GATK HaplotypeCaller (unpaired samples) or MuTect2 (tumor-normal sample pairs), along with LoFreq for high-sensitivity variant calling of unpaired samples.
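The per-sample steps above can be sketched as a sequence of tool invocations. This is an illustrative outline under assumed defaults, not the pipeline's actual commands: file names, thread counts, and most options are placeholders, and a GATK 3.x-style command line is assumed for DepthOfCoverage and the variant callers.

```shell
# Hypothetical per-sample sketch; all file names and most options are placeholders.

# 1. Trim low-quality bases (Trimmomatic, paired-end mode)
java -jar trimmomatic.jar PE sample_R1.fastq.gz sample_R2.fastq.gz \
    trimmed_R1.fastq.gz unpaired_R1.fastq.gz \
    trimmed_R2.fastq.gz unpaired_R2.fastq.gz \
    TRAILING:5 SLIDINGWINDOW:4:15 MINLEN:36

# 2. Align to hg19 with BWA-MEM, converting SAM to BAM on the fly
bwa mem -t 8 hg19.fa trimmed_R1.fastq.gz trimmed_R2.fastq.gz |
    sambamba view -S -f bam /dev/stdin > sample.bam

# 3. Sort, quality-filter, and deduplicate with Sambamba
sambamba sort -t 8 -o sample.sorted.bam sample.bam
sambamba markdup -r -t 8 sample.sorted.bam sample.dd.bam

# 4. Depth of coverage over the panel's target regions
java -jar GenomeAnalysisTK.jar -T DepthOfCoverage \
    -R hg19.fa -I sample.dd.bam -L targets.bed -o sample.coverage

# 5. Variant calling on the realigned/recalibrated BAM
java -jar GenomeAnalysisTK.jar -T HaplotypeCaller \
    -R hg19.fa -I sample.final.bam -L targets.bed \
    -o sample.gatk.vcf                      # unpaired samples

lofreq call -f hg19.fa -l targets.bed \
    -o sample.lofreq.vcf sample.final.bam   # high-sensitivity, unpaired
```

For tumor-normal pairs, MuTect2 would replace HaplotypeCaller in step 5, taking both the tumor and the matched normal BAM as input; the CNVkit copy number analysis runs separately on the deduplicated BAMs.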

Variant calls are annotated with ANNOVAR, and all results are aggregated into a custom report that delivers a summary of the analysis results, metrics, and files via email for clinical review.
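The annotation step might look like the following ANNOVAR sketch. The commands follow ANNOVAR's documented interface, but the specific annotation databases chosen here (`refGene`, `cosmic70`, `exac03`) are examples only, not necessarily those used by this pipeline.

```shell
# Hypothetical ANNOVAR annotation sketch; database choices are examples.

# Convert the VCF into ANNOVAR's input format
convert2annovar.pl -format vcf4 sample.vcf > sample.avinput

# Annotate against gene models and example filter databases
table_annovar.pl sample.avinput humandb/ -buildver hg19 \
    -out sample -remove \
    -protocol refGene,cosmic70,exac03 \
    -operation g,f,f -nastring .
```

The resulting annotated table is what a reporting layer such as snsxt would aggregate into the emailed clinical summary.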