nCoV-2019 Spike Protein Receptor Binding Domain Shares High Amino Acid Identity With a Coronavirus Recovered from a Pangolin Viral Metagenomic Dataset

_SARSlike_PlusWuhan_YunnanSPIKEPlusPangolin_CodonAlignedPROT.fasta.gz (9.1 KB)

The sequence names in this file are not very descriptive, but I included accession numbers so you can look up more information on any sequence. The PDF image of the tree shows how the spike genes are related to each other.
BetaCoronaviruses_114_WuhanCladeHandAlignedPlusPangolin2_IQtreePDF.pdf (7.2 KB)
SASR_SARSlikePlusPangolinCodonAligned.FASTA.gz (638.1 KB)

The codon-aligned complete genomes have a few small regions in individual sequences that are not “optimally” aligned. Overall, though, a DNA alignment that translates to amino acids in a single reading frame is useful for studying selection pressure, among other things.
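
For anyone who wants to check the one-frame property themselves, here is a minimal sketch in plain Python of translating a codon-aligned DNA alignment while keeping the gap columns. It assumes the alignment starts in frame at column 1, that gaps are written as `-`, and that the standard genetic code applies; the function names (`read_fasta`, `translate_aligned`, `translate_alignment`) are my own illustrative helpers, not part of any attached file or tool. Codons containing a partial gap or an ambiguous base are reported as `X`, which is one way to spot the small regions that are not optimally aligned.

```python
import gzip
from itertools import product

# Standard genetic code, built in TCAG order (assumption: no alternative table needed).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def read_fasta(handle):
    """Yield (header, sequence) pairs from an iterable of FASTA lines."""
    name, chunks = None, []
    for line in handle:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            if name is not None:
                yield name, "".join(chunks)
            name, chunks = line[1:], []
        else:
            chunks.append(line)
    if name is not None:
        yield name, "".join(chunks)


def translate_aligned(seq):
    """Translate one aligned DNA sequence codon by codon, preserving gaps.

    '---' becomes '-'; any codon with a partial gap or ambiguous base becomes 'X'.
    Trailing bases that do not fill a whole codon are ignored.
    """
    seq = seq.upper().replace("U", "T")
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        if codon == "---":
            protein.append("-")
        else:
            protein.append(CODON_TABLE.get(codon, "X"))
    return "".join(protein)


def translate_alignment(path):
    """Return {header: aligned protein} for a FASTA file, gzipped or not."""
    opener = gzip.open if path.endswith(".gz") else open
    with opener(path, "rt") as handle:
        return {name: translate_aligned(seq) for name, seq in read_fasta(handle)}
```

Calling `translate_alignment` on a downloaded copy of the genome alignment should give protein rows of equal length if the DNA rows are of equal length; a run of `X` in a single sequence suggests a local frame problem in that sequence rather than in the alignment as a whole.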