Skip to content

Commit

Permalink
Don't include Wuhan root in Nextstrain profile
Browse files Browse the repository at this point in the history
Drop forced inclusion of Wuhan/1 root in the Nextstrain GISAID profile and swap rooting to use "best", ie temporally optimal rooting. This allows the root to be the common ancestor of the subsampled sequences. This makes it so that with the changes to time-based subsampling in the previous commit, the "6m" analysis includes samples from the previous 30m and the TMRCA is in ~2021.

This set up should be significantly more future proof than needing to continually make new clade-specific (eg /21L/) roots as selective sweeps occur.
  • Loading branch information
trvrb committed Jul 24, 2024
1 parent db6684c commit 4dd3f2a
Show file tree
Hide file tree
Showing 2 changed files with 6 additions and 0 deletions.
6 changes: 6 additions & 0 deletions nextstrain_profiles/nextstrain-gisaid/builds.yaml
Original file line number Diff line number Diff line change
Expand Up @@ -20,7 +20,9 @@ genes: ["ORF1a", "ORF1b", "S", "ORF3a", "E", "M", "ORF6", "ORF7a", "ORF7b", "ORF
use_nextalign: true
include_hcov19_prefix: True

# use custom include file that doesn't specify Wuhan/1
files:
include: "nextstrain_profiles/nextstrain-gisaid/include.txt"
description: "nextstrain_profiles/nextstrain-gisaid/nextstrain_description.md"

inputs:
Expand Down Expand Up @@ -928,6 +930,10 @@ subsampling:
max_sequences: 75
exclude: "--exclude-where 'region!=Oceania'"

# root via temporal fit rather than explicit outgroup
refine:
root: "best"

# if different traits should be reconstructed for some builds, specify here
# otherwise the default trait config in defaults/parameters.yaml will used
traits:
Expand Down
Empty file.

0 comments on commit 4dd3f2a

Please sign in to comment.