nextclade
Bioinformatics tool for virus genome alignment, clade assignment and qc checks. More information: https://docs.nextstrain.org/projects/nextclade/en/stable/user/nextclade-cli/index.html.
- Align sequences to user provided [r]eference, [o]utputting the alignment to a file:
nextclade run path/to/sequences.fa -r path/to/reference.fa -o path/to/alignment.fa
- Create a [t]SV report, auto-downloading the latest [d]ataset:
nextclade run path/to/fasta -d dataset_name -t path/to/report.tsv
- List all available datasets:
nextclade dataset list
- Download the latest SARS-CoV-2 dataset:
nextclade dataset get --name sars-cov-2 --output-dir path/to/directory
- Use a downloaded [D]ataset, producing all [O]utputs:
nextclade run -D path/to/dataset_dir -O path/to/output_dir path/to/sequences.fasta
- Run on multiple files:
nextclade run -d dataset_name -t path/to/output_tsv -- path/to/input_fasta_1 path/to/input_fasta_2 …
- Try reverse complement if sequence does not align:
nextclade run --retry-reverse-complement -d dataset_name -t path/to/output_tsv path/to/input_fasta