Thank you for sharing these data and for providing details on the sequencing approach used to generate the Andes virus consensus sequence.
It would be very useful to better understand the sequencing and analysis workflow, particularly to assess which technical challenges may arise when sequencing this virus directly from clinical samples.
Could you provide some additional details on the sample and sequencing output? In particular:
What was the Ct value or estimated viral load of the original blood sample?
Was sequencing performed directly from the clinical sample, or was any enrichment step used before sequencing?
If enrichment was used, what type of strategy was applied, for example amplicon-based enrichment, hybrid capture, target enrichment, rRNA depletion, or another approach?
How many total reads were generated for the sample?
How many reads, or what percentage of reads, were assigned to Andes virus?
Which bioinformatic workflow was used for read classification, mapping/assembly and/or consensus generation?
Which reference sequence or database was used for mapping or taxonomic assignment?
Were any criteria applied beyond the minimum coverage threshold of 5 reads to mask low-confidence positions or validate the consensus?
Thank you again!