The severe acute respiratory coronavirus 2 (SARS-CoV-2) is the cause of the global outbreak of COVID-19. The epidemic accelerated in Philadelphia, PA, in the spring of 2020, with the city experiencing a first peak of infections on 15 April, followed by a decline through midsummer. Here, we investigate spread of the epidemic in the first wave in Philadelphia using full-genome sequencing of 52 SARSCoV-2 samples obtained from 27 hospitalized patients collected between 30 March and 17 July 2020. Sequences most commonly resembled lineages circulating at earlier times in New York, suggesting transmission primarily from this location, though a minority of Philadelphia genomes matched sequences from other sites, suggesting additional introductions. Multiple genomes showed even closer matches to other Philadelphia isolates, suggestive of ongoing transmission within Philadelphia. We found that all of our isolates contained the D614G substitution in the viral spike and belong to lineages variously designated B.1, Nextstrain clade 20A or 20C, and GISAID clade G or GH. There were no viral sequence polymorphisms detectably associated with disease outcome. For some patients, genome sequences were determined longitudinally or concurrently from multiple body sites. In both cases, some comparisons showed reproducible polymorphisms, suggesting initial seeding with multiple variants and/or accumulation of polymorphisms after infection. These results thus provide data on the sources of SARS-CoV-2 infection in Philadelphia and begin to explore the dynamics within hospitalized patients. IMPORTANCE Understanding how SARS-CoV-2 spreads globally and within infected individuals is critical to the development of mitigation strategies. We found that most lineages in Philadelphia had resembled sequences from New York, suggesting infection primarily but not exclusively from this location. Many genomes had even nearer neighbors within Philadelphia, indicating local spread. Multiple genome sequences were available for some subjects and in a subset of cases could be shown to differ between time points and body sites within an individual, indicating heterogeneous viral populations within individuals and raising questions on the mechanisms responsible. There was no evidence that different lineages were associated with different outcomes in patients, emphasizing the importance of individual-specific vulnerability.
- Genome sequencing
ASJC Scopus subject areas