Continuing the discussion from Mid/Early release - 45 new EBOV genomes from Sierra Leone:
We have dates of reporting for many of these sequences, but some are missing from the WHO line list. We can impute these dates from other samples with adjacent patient ids, so here is some documentation of the logic used for these imputations. The dates here (in dd/mm/yy form) are the dates of reporting of the cases, but these are almost always the same as the date of initial sample collection where this is known.
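As a rough illustration of the bracketing logic, here is a minimal Python sketch. It assumes that patient ids were assigned in roughly chronological order, so that a sample with a missing date falls between the dates of its nearest lower and higher ids. The function name `impute_date_range` and the ids and dates in the usage example are made up for illustration and are not taken from the line list.

```python
from bisect import bisect_left
from datetime import date


def impute_date_range(known, target_id):
    """Bracket the reporting date of target_id using its nearest neighbours.

    known: dict mapping numeric patient id -> reporting date.
    Returns (earlier, later); either side is None when there is no
    neighbour with a known date on that side.
    Assumes ids were assigned in roughly chronological order.
    """
    ids = sorted(known)
    pos = bisect_left(ids, target_id)
    # If the id itself has a known date, there is nothing to impute.
    if pos < len(ids) and ids[pos] == target_id:
        return known[target_id], known[target_id]
    earlier = known[ids[pos - 1]] if pos > 0 else None
    later = known[ids[pos]] if pos < len(ids) else None
    return earlier, later


# Hypothetical ids and dates, for illustration only.
example_known = {
    1001: date(2014, 1, 1),
    1004: date(2014, 1, 3),
    1010: date(2014, 1, 3),
}
print(impute_date_range(example_known, 1002))  # between two different dates
print(impute_date_range(example_known, 1007))  # both neighbours agree
print(impute_date_range(example_known, 1012))  # only a lower neighbour
```

When both neighbours share a date the imputation is a single day, when they differ it is a range, and when there is no later neighbour the result can only be read as "on or after" the earlier date.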
G4955 is likely from 2014-08-13:
G5119 is likely from 2014-08-19 or 2014-08-20:
G5640 is likely from 2014-09-10 to 2014-09-12:
G5982, G5983, G5997, G6012 & G6020 are likely from 2014-09-23 to 2014-09-25
The remaining 3 - G6089, G6091, G6104 - are likely to be on or after 2014-09-25 but probably not by much: