identifier | type | PT | TP | TN | PF | FP | FN |
---|---|---|---|---|---|---|---|
2019-nCoV-noblis_1 | Probe | 22 | 3 | 415 | NA | NA | NA |
2019-nCoV-noblis_2 | Probe | 25 | NA | 691 | NA | NA | NA |
2019-nCoV-noblis_3 | Probe | 25 | NA | 656 | NA | NA | NA |
2019-nCoV-noblis_4 | Probe | 25 | NA | 356 | NA | 3 | NA |
ncov_e_gene | Probe | 25 | NA | 75 | 353 | 15 | NA |
ncov_n_gene | Probe | 25 | NA | 73 | NA | 339 | NA |
ncov_rdrp_1 | Probe | NA | 25 | 169 | 433 | 85 | NA |
ncov_rdrp_2 | Probe | NA | 25 | 586 | 1 | 61 | NA |
The confusion matrix indicates the outcome of each assay alignment to each subject sequence. The PT and PF categories represent true positives and false positives with 100% identity. The calculation accounts for intended taxonomic target and evaluates the percentage similarity for each assay component separately. This report is based on a percentage similarity threshold of greater than or equal to 90%.
The heatmap represents the average alignment percentage similarity of each assay component to a subject. The y-axis sorts each subject accession by taxonomic identifier. To avoid clutter, the figure hides the subject accession.
The alignment figure aggregates mismatches of assay amplicons to the top subject alignments comprising the PT, TP, and FN categories. The counts indicate mismatch type: transition (trs), transversion (trv), dissimilar (dis), insertion (ins), and deletion (del). The gray boxes indicate the component regions of the assays.
## ./src/pset.py nncov.tsv nt env_nt gss epi -drivername sqlite -database data/tax/tax.db -conf-blastn task=blastn
## Database: Nucleotide collection (nt)
## 57,030,965 sequences; 257,768,224,489 total bases
##
## Date: Oct 27, 2019 6:48 PM Longest sequence: 99,791,824 bases
##
## BLASTDB Version: 4
##
## Database: environmental samples
## 124,601,035 sequences; 134,952,479,671 total bases
##
## Date: Dec 10, 2019 5:25 AM Longest sequence: 40,002,466 bases
##
## BLASTDB Version: 4
##
## Database: Genome Survey Sequence, includes single-pass genomic data, exon-trapped sequences, and Alu PCR sequences.
## 40,638,741 sequences; 26,313,188,574 total bases
##
## Date: Dec 9, 2019 7:00 PM Longest sequence: 72,873 bases
##
## BLASTDB Version: 4
##
## Database: epi
## 26 sequences; 747,047 total bases
##
## Date: Jan 24, 2020 1:13 PM Longest sequence: 29,903 bases
##
## BLASTDB Version: 4