Genome Structural Diversity among 31 Bordetella pertussis Isolates from Two Recent U.S. Whooping Cough Statewide Epidemics.

Bowden KE; Weigand MR; Peng Y; Cassiday PK; Sammons S; Knipe K; Rowe LA; Loparev V; Sheth M; Weening K; Tondella ML; Williams MM

doi:10.1128/msphere.00036-16

Genome Structural Diversity among 31 Bordetella pertussis Isolates from Two Recent U.S. Whooping Cough Statewide Epidemics.

Affiliations

1. Division of Bacterial Diseases, Centers for Disease Control and Prevention, Atlanta, Georgia, USA.
Authors
Bowden KE¹
Weigand MR¹
Peng Y¹
Cassiday PK¹
Tondella ML¹
Williams MM¹
(6 authors)
2. Biotechnology Core Facility Branch, Division of Scientific Resources, Centers for Disease Control and Prevention, Atlanta, Georgia, USA.
Authors
Sammons S²
Knipe K²
Rowe LA²
Loparev V²
Sheth M²
(5 authors)
3. Vermont Department of Health Laboratory, Burlington, Vermont, USA.
Authors
Weening K³
(1 author)

ORCIDs linked to this article

Msphere, 11 May 2016, 1(3):e00036-16
https://doi.org/10.1128/msphere.00036-16 PMID: 27303739 PMCID: PMC4888882

This article is in the Europe PMC Open access subset. Refer to the copyright information in the article for licensing details.

Free full text in Europe PMC

Abstract

During 2010 and 2012, California and Vermont, respectively, experienced statewide epidemics of pertussis with differences seen in the demographic affected, case clinical presentation, and molecular epidemiology of the circulating strains. To overcome limitations of the current molecular typing methods for pertussis, we utilized whole-genome sequencing to gain a broader understanding of how current circulating strains are causing large epidemics. Through the use of combined next-generation sequencing technologies, this study compared de novo, single-contig genome assemblies from 31 out of 33 Bordetella pertussis isolates collected during two separate pertussis statewide epidemics and 2 resequenced vaccine strains. Final genome architecture assemblies were verified with whole-genome optical mapping. Sixteen distinct genome rearrangement profiles were observed in epidemic isolate genomes, all of which were distinct from the genome structures of the two resequenced vaccine strains. These rearrangements appear to be mediated by repetitive sequence elements, such as high-copy-number mobile genetic elements and rRNA operons. Additionally, novel and previously identified single nucleotide polymorphisms were detected in 10 virulence-related genes in the epidemic isolates. Whole-genome variation analysis identified state-specific variants, and coding regions bearing nonsynonymous mutations were classified into functional annotated orthologous groups. Comprehensive studies on whole genomes are needed to understand the resurgence of pertussis and develop novel tools to better characterize the molecular epidemiology of evolving B. pertussis populations. IMPORTANCE Pertussis, or whooping cough, is the most poorly controlled vaccine-preventable bacterial disease in the United States, which has experienced a resurgence for more than a decade. Once viewed as a monomorphic pathogen, B. pertussis strains circulating during epidemics exhibit diversity visible on a genome structural level, previously undetectable by traditional sequence analysis using short-read technologies. For the first time, we combine short- and long-read sequencing platforms with restriction optical mapping for single-contig, de novo assembly of 31 isolates to investigate two geographically and temporally independent U.S. pertussis epidemics. These complete genomes reshape our understanding of B. pertussis evolution and strengthen molecular epidemiology toward one day understanding the resurgence of pertussis.

Free full text

mSphere. 2016 May-Jun; 1(3): e00036-16.

Published online 2016 May 11. https://doi.org/10.1128/mSphere.00036-16

PMCID: PMC4888882

PMID: 27303739

Genome Structural Diversity among 31 Bordetella pertussis Isolates from Two Recent U.S. Whooping Cough Statewide Epidemics

Katherine E. Bowden,^a Michael R. Weigand,^a Yanhui Peng,^a Pamela K. Cassiday,^a Scott Sammons,^b Kristen Knipe,^b Lori A. Rowe,^b Vladimir Loparev,^b Mili Sheth,^b Keeley Weening,^c M. Lucia Tondella,^a and Margaret M. Williams^a

Melanie Blokesch, Editor

Melanie Blokesch, Swiss Federal Institute of Technology Lausanne;

Author information Article notes Copyright and License information Disclaimer

This article has been cited by other articles in PMC.

Go to:

Associated Data

Supplementary Materials: Data Set S1 .
Assembly results for 2 vaccine strains and 33 isolates collected during the California (CA) 2010 and Vermont (VT) 2012 statewide pertussis epidemics that were selected for whole-genome sequencing. Download Data Set S1, XLSX file, 0.1 MB.
Copyright © 2016 Bowden et al. This content is distributed under the terms of the Creative Commons Attribution 4.0 International license.
Figure S1
Genome structural variation among epidemic isolates from California (blue) and Vermont (red). (A) Maximum-likelihood phylogenetic reconstruction based on the order and orientation of conserved, colinear genome fragments identified by whole-genome alignment. (B) Alignment of 14 select genomes with different structural architectures (highlighted with asterisks in panel A). Connected blocks of the same color represent conserved regions, and flags indicate the presence of an IS481 (black), IS1002 (blue), or rRNA operon (red) at the boundary between colinear regions. Download Figure S1, TIF file, 0.4 MB.
Copyright © 2016 Bowden et al. This content is distributed under the terms of the Creative Commons Attribution 4.0 International license.
Data Set S2 .
Sequence comparison of virulence-related genes in B. pertussis epidemic isolates compared to E476 (Tohama I). Download Data Set S2, XLSX file, 0.1 MB.
Copyright © 2016 Bowden et al. This content is distributed under the terms of the Creative Commons Attribution 4.0 International license.
Data Set S3 .
Seven hundred thirty-one variants in 33 B. pertussis epidemic genomes. Download Data Set S3, XLSX file, 0.1 MB.
Copyright © 2016 Bowden et al. This content is distributed under the terms of the Creative Commons Attribution 4.0 International license.
Data Set S4 .
Thirty-four nonsynonymous variants in at least 26 of the 33 B. pertussis epidemic genomes. Download Data Set S4, XLSX file, 0.1 MB.
Copyright © 2016 Bowden et al. This content is distributed under the terms of the Creative Commons Attribution 4.0 International license.
Data Set S5 .
Five hundred sixty-one variants in California B. pertussis epidemic genomes. Download Data Set S5, XLSX file, 0.1 MB.
Copyright © 2016 Bowden et al. This content is distributed under the terms of the Creative Commons Attribution 4.0 International license.
Data Set S6 .
Four hundred thirteen variants in Vermont B. pertussis epidemic genomes. Download Data Set S6, XLSX file, 0.1 MB.
Copyright © 2016 Bowden et al. This content is distributed under the terms of the Creative Commons Attribution 4.0 International license.
Data Set S7 .
Forty-nine California state-specific B. pertussis epidemic genome variants. Download Data Set S7, XLSX file, 0.1 MB.
Copyright © 2016 Bowden et al. This content is distributed under the terms of the Creative Commons Attribution 4.0 International license.
Data Set S8 .
Thirty-four Vermont state-specific B. pertussis epidemic genome variants. Download Data Set S8, XLSX file, 0.1 MB.
Copyright © 2016 Bowden et al. This content is distributed under the terms of the Creative Commons Attribution 4.0 International license.
Data Set S9 .
State-specific nonsynonymous B. pertussis epidemic genome variants. Download Data Set S9, XLSX file, 0.1 MB.
Copyright © 2016 Bowden et al. This content is distributed under the terms of the Creative Commons Attribution 4.0 International license.

Go to:

ABSTRACT

IMPORTANCE Pertussis, or whooping cough, is the most poorly controlled vaccine-preventable bacterial disease in the United States, which has experienced a resurgence for more than a decade. Once viewed as a monomorphic pathogen, B. pertussis strains circulating during epidemics exhibit diversity visible on a genome structural level, previously undetectable by traditional sequence analysis using short-read technologies. For the first time, we combine short- and long-read sequencing platforms with restriction optical mapping for single-contig, de novo assembly of 31 isolates to investigate two geographically and temporally independent U.S. pertussis epidemics. These complete genomes reshape our understanding of B. pertussis evolution and strengthen molecular epidemiology toward one day understanding the resurgence of pertussis.

KEYWORDS: Bordetella pertussis, genome rearrangements, optical mapping, pertactin, whole-genome sequencing

Go to:

INTRODUCTION

Bordetella pertussis is the causative agent of whooping cough (pertussis), a respiratory disease affecting all age groups, but with the highest disease severity in unvaccinated infants. Whole-cell vaccines against pertussis were introduced in the United States during the 1940s, and greatly reduced disease incidence, but were replaced during the 1990s by acellular vaccines, which produced less-severe side effects (1,–4). In addition to diphtheria and tetanus toxoids, the entire childhood vaccination series of diphtheria–tetanus–acellular-pertussis (DTaP) vaccine contains inactivated pertussis toxin (Ptx) and one or more additional virulence-related bacterial components: filamentous hemagglutinin (Fha), pertactin (Prn), or fimbria (Fim) types 2 and 3. Additionally, a single dose of the adolescent and adult booster Tdap (diphtheria, tetanus, and acellular-pertussis vaccine) was recommended in 2005 to counteract the rise of reported cases in adolescents and adults (3, 5, 6). Despite availability, administration, and high coverage of the acellular vaccine reported in the country as a whole, pertussis cases in the United States over the past decade have increased to record numbers not seen since the 1950s. Many states have experienced epidemic levels of pertussis cases in recent years, specifically California in 2010 and Washington and Vermont in 2012 (7,–11). In 2010, California reported over 9,000 cases (23.4 cases per 100,000 residents), while in 2012 Vermont reported over 600 cases (103 cases per 100,000 residents), over 10 times more than average for that time of year (7, 10). With incidence being highest in unvaccinated infants younger than 6 months and fully vaccinated preadolescents (7 to 10 years) in California, previous studies concluded that unprotected infants were at the highest risk for disease, while waning protection from the childhood acellular vaccine series contributed to disease in preadolescents (7, 12). While there is no single explanation for this resurgence of cases, it has been ascribed to many factors such as pathogen adaptation, improved surveillance and laboratory diagnostics, and waning protection and immune response provided by the acellular vaccine (12,–14).

Since 2010, the U.S. population of circulating B. pertussis strains has increasingly become deficient in Prn, an acellular vaccine component, due to various mutations, primarily the disruption of prn by the mobile genetic element IS481 (11, 14,–18). In 2012, 92% of isolates collected in Vermont did not produce pertactin, suggesting that a selective advantage to prn deficiency may have played a role in that epidemic (14). In separate global locations, two isolates lacking Ptx have been obtained recently, one of which is also Prn deficient (19, 20). The loss of vaccine immunogens identified by current molecular typing methods provides strong evidence that current populations of B. pertussis no longer reflect the genotypic profile of vaccine strains and could be adapting to the immune response elicited by the acellular vaccine (11, 14, 15, 19, 21,–24). This, along with data supporting higher rates of evolution in vaccine antigen genes, has generated concern that other genes encoding vaccine components may harbor mutations not identifiable through current molecular typing techniques (25). Additionally, the recent analysis of a statewide pertussis epidemic in Washington during 2012 revealed that pulsed-field gel electrophoresis (PFGE), a whole-genome restriction digest analysis, was the most powerful indicator of diversity in this population of circulating pertussis strains (11). With evidence suggesting that current strains exhibit genomic diversity that nucleotide variation identified by short-read sequencing fails to explain, it is of great interest to employ long-read technologies to detect mutations and assess the role of genome structure in pertussis resurgence (11, 15, 26).

With hundreds of insertion sequences (ISs) and repeat regions in current circulating strains of B. pertussis, whole-genome sequencing is challenging. There are currently over 400 B. pertussis draft genomes available in public databases, all of which include at least 200 contigs assembled from short-read sequencing. Such fragmented assemblies provide limited information about genome arrangement or gene order and are, therefore, not suitable for evaluating contributions of rearrangement diversity toward pertussis incidence, resurgence, or strain evolution. Complete, de novo genome assembly of B. pertussis is essential in pursuit of this information and now possible with long-read (>1-kb) sequencing and whole-genome mapping (27, 28). The recent release of complete genomes highlights the contrast between modern pertussis strains and current vaccine references, at both the nucleotide and the structural levels (29, 30). In an effort to better understand pertussis resurgence and evolution, here we characterize genome structure and provide direct evidence of large genome rearrangements in annotated, single-contig genome assemblies from 31 isolates collected during two geographically and temporally distinct pertussis epidemics. This depth of resolution facilitates analysis that is not possible through conventional molecular typing or short-read sequencing. The availability of these complete genomes will hopefully aid future transcriptomics and proteomics pursuits toward a deeper understanding of pertussis reemergence and increased incidence.

Go to:

RESULTS

Reference-free assembly of complete genomes.

Through the use of Pacific Biosciences (PacBio) RSII and Illumina MiSeq sequencing technologies, 33 genomes (31 epidemic, 2 vaccine) (Table 1) were de novo assembled into single contigs. All genomes were approximately 4.1 Mb in size with an average G+C content of 67.7%. Detailed assembly metrics for each genome are outlined in Data Set S1 in the supplemental material. The genome assembly of I475 provided no circular overlap to ensure closing of the single contig, while the genome assemblies for I488 and I517 failed one or more validation steps, resulting in multiple contig(s). Annotation information, including the number of genes, coding sequences (CDS), pseudogenes, frameshifted genes, and insertion sequences, is outlined in Data Set S1 in the supplemental material. In addition, all genomes contained 51 tRNAs and 3 rRNA operons. Tohama I (E476) and the Chinese vaccine (CS) strain (C393) resequenced in this study were 16.8 kb and 9.5 kb larger than the NCBI reference genomes Tohama I and CS, respectively, due to the discovery of additional IS481 copies (see Data Set S1).

TABLE 1

Metadata for B. pertussis isolates collected during the California 2010 and Vermont 2012 statewide epidemics and vaccine strains that were selected for whole-genome sequencing^l

Isolate	Location	Yr of isolation	Age at symptom onset	Vaccination status	PFGE type	MLVA^a	prn^b	ptxP^b	ptxA^b	fimH^b,^c	prn genotype	Prn production^d
H374	CA	2010	Infant	UV	CDC002	16	2	3	1	1	WT prn	+
H375^e	CA	2010	Infant	UV	CDC268	186	1	1	2	1	Del nt 26-109^f	−
H378	CA	2010	Infant	UV	CDC253	27	2	3	1	1	IS481::240 Fwd^g	−
H379	CA	2010	Infant	UV	CDC046	27	2	3	1	2	WT prn	+
H380^e	CA	2010	Infant	UV	CDC013	27	2	3	1	2	WT prn	+
H489	CA	2010	Infant	UV	CDC082	27	2	3	1	2	WT prn	+
H542^e	CA	2010	Infant	UV	CDC269	27	2	3	1	1	WT prn	+
H559^e	CA	2010	Infant	UV	CDC253	27	2	3	1	1	IS481::240 Fwd^g	−
H561	CA	2010	Infant	UV	CDC170	16	2	3	1	2	WT prn	+
H563^e	CA	2010	Infant	≥1 dose	CDC271	27	2	3	1	2	WT prn	+
H564	CA	2010	Child	UV	CDC013	27	2	3	1	2	WT prn	+
H622	CA	2010	Infant	UV	CDC217	27	2	3	1	1	WT prn	+
H627	CA	2010	Infant	UV	CDC217	27	2	3	1	1	WT prn	+
H788	VT	2011	Infant	UV	CDC046	128	2	3	1	2	WT prn	+
I669	VT	2011	Adult	UV	CDC013	27	2	3	1	2	WT prn	+
I468	VT	2012	Child	UV	CDC002	27	2	3	1	1	SC @ nt 1273	−
I469	VT	2012	Child	≥1 dose	CDC342	27	2	3	1	2	IS481::1613 Rev^h	−
I472	VT	2012	Adolescent	UV	CDC046	27	2	3	1	2	IS481::1613 Rev^h	−
I475	VT	2012	Adult	UV	CDC237	27	2	3	1	1	IS481::1613 Fwdⁱ	−
I476	VT	2012	Adolescent	UTD	CDC300	27	2	3	1	1	IS481::1613 Fwdⁱ	−
I480	VT	2012	Child	≥1 dose	CDC217	27	2	3	1	1	WT prn	+
I483	VT	2012	Infant	UTD	CDC237	27	2	3	1	1	IS481::1613 Fwdⁱ	−
I488	VT	2012	Child	≥1 dose	CDC343	27	2	3	1	1	IS481::1613 Fwdⁱ	−
I496	VT	2012	Child	UTD	CDC343	27	2	3	1	1	IS481::1613 Fwdⁱ	−
I498	VT	2012	Adult	≥1 dose	CDC253	27	2	3	1	1	IS481::240 Rev^j	−
I517	VT	2012	Child	≥1 dose	CDC344	27	2	3	1	1	IS481::1613 Fwdⁱ	−
I518	VT	2012	Child	UTD	CDC002	27	2	3	1	1	SC @ nt 1273	−
I521	VT	2012	Child	≥1 dose	CDC237	27	2	3	1	1	IS481::1613 Fwdⁱ	−
I538	VT	2012	Child	UNK	CDC002	27	2	3	1	1	prnP (–74 nt)^k	−
I539	VT	2012	Child	UTD	CDC002	27	2	3	1	1	prnP (–74 nt)^k	−
I646	VT	2012	Child	UV	CDC274	27	2	3	1	1	WT prn	+
I656	VT	2012	Child	UTD	CDC010	27	2	3	1	1	WT prn	+
I707	VT	2012	Child	UTD	CDC253	27	2	3	1	1	WT prn	+
C393	China	1951	UNK	NA	CDC052	UNK	1	1	2	1	WT prn	+
E476	Japan	1954	UNK	NA	CDC232	38	1	1	2	1	WT prn	+

^aMLVA type is defined by the repeat counts for VNTR1, VNTR3, VNTR4, VNTR5, and VNTR6 (21).

^bSingle-copy locus (21).

^cFormerly referred to as fim3.

^dBased on enzyme-linked immunosorbent assay.

^eFatal cases.

^fSignal sequence deletion (nt 26 to 109).

^gIS481 forward insertion at nt 240.

^hIS481 reverse insertion at nt 1613.

ⁱIS481 forward insertion at nt 1613.

^jIS481 reverse insertion at nt 240.

^kPromoter disruption (–74 nt), previously identified as promoter inversion (11, 15).

^lAbbreviations: MLVA, multilocus variable-number tandem-repeat analysis; CA, California; VT, Vermont; nt, nucleotide; UTD, up to date; UV, unvaccinated; UNK, unknown; WT, wild-type; SC, stop codon; NA, not applicable.

Data Set S1

Assembly results for 2 vaccine strains and 33 isolates collected during the California (CA) 2010 and Vermont (VT) 2012 statewide pertussis epidemics that were selected for whole-genome sequencing. Download Data Set S1, XLSX file, 0.1 MB.

This content is distributed under the terms of the Creative Commons Attribution 4.0 International license.

Large-scale genome rearrangements.

Multiple insertions, deletions, and inversions were seen in all genomes of epidemic isolates relative to E476 and C393 using both optical mapping and whole-genome sequence alignment (Fig. 1). Such genome structural diversity was also observed within epidemic isolates, which included 16 discrete, large-scale architectural profiles. The 13 California genomes comprised 10 rearrangement profiles, and 11 profiles were seen in the 18 Vermont genomes. Further, only five structural profiles were observed in both states, with five and six additional profiles distinct to California and Vermont, respectively (examples of these structural profiles can be found in Fig. S1B in the supplemental material).

An external file that holds a picture, illustration, etc.
Object name is sph0031620780001.jpg

FIG 1

Large-scale genome rearrangements within a subset of California (H561 and H563, blue) and Vermont (I496 and I707, red) epidemic isolates compared to vaccine strains E476 and C393 as visualized by whole-genome restriction mapping and alignment with MapSolver (A) and genome sequence alignment with progressiveMauve (B). Connecting lines indicate either conserved restriction fragments (A) or homologous sequence blocks (B). Sequence maps begin at the replication origin, and the approximate replication terminus is indicated with an arrow.

Figure S1

Genome structural variation among epidemic isolates from California (blue) and Vermont (red). (A) Maximum-likelihood phylogenetic reconstruction based on the order and orientation of conserved, colinear genome fragments identified by whole-genome alignment. (B) Alignment of 14 select genomes with different structural architectures (highlighted with asterisks in panel A). Connected blocks of the same color represent conserved regions, and flags indicate the presence of an IS481 (black), IS1002 (blue), or rRNA operon (red) at the boundary between colinear regions. Download Figure S1, TIF file, 0.4 MB.

This content is distributed under the terms of the Creative Commons Attribution 4.0 International license.

To visualize the dynamic relationship between the vaccine strains and isolates collected during the California and Vermont epidemics resulting from genome structural changes, a Maximum Likelihood for Gene Order (MLGO) tree was constructed from a multiple sequence alignment (Fig. 2). This clustering revealed eight major clades (A to H) that distinguished the epidemic isolates and vaccine strains based on gene rearrangement and gene order (Fig. 2). Isolates with the same or similar PFGE profiles clustered by genome rearrangement (Fig. 2). The vaccine strains formed a distinct clade (H) which is distant from the U.S. epidemic isolates and B1917 (CP009751) and B1920 (CP009752) (29). B1917, the ptxP3 lineage strain corresponding to current circulating strains, clusters with isolates in clade D, while B1920, the ptxP1 lineage strain predominant from 1960 to the 1990s, is no more closely related to I646 and the only ptxP1 epidemic isolate (H375) than it is to the rest of the epidemic and vaccine strain genomes (Fig. 2) (29). Four clades (B, D, E, and F) show clustering of epidemic isolates by state and prn production status. Further, the two isolates collected in Vermont during 2011 (H788 and I669) are diverse and not closely related to each other by genome structure (Fig. 2).

An external file that holds a picture, illustration, etc.
Object name is sph0031620780002.jpg

FIG 2

Maximum likelihood for gene order (MLGO) clustering of epidemic strains from the 2010 California (blue) and 2012 Vermont (red) statewide pertussis epidemics. Publicly available genomes (Tohama I, CS, B1917, and B1920) and two resequenced vaccine strains (C393 and E476) were also included (black). Pertactin production and deficiency of each strain are indicated by pink and black boxes, respectively. Clades referenced within the text are indicated by letters. The inset displays banding patterns of the PFGE profiles of all epidemic and vaccine strains sequenced in this study. The dendrogram of PFGE profiles was created using the unweighted pair group method using average linkages (UPGMA) with 1% band tolerance and optimization settings. Genomes for the epidemic isolates I488 and I517 were excluded from the analysis (see Materials and Methods and Data Set S1 in the supplemental material).

Rearrangement boundaries.

To better understand what mediates observed rearrangements, the genome content at each predicted rearrangement boundary was investigated further. When comparing California and Vermont isolates alone, 26 conserved homologous blocks (≥1.5 kb) were identified in these 31 genomes, separated by 25 predicted rearrangement boundaries (Table 2; see also Fig. S1 in the supplemental material). Eighty percent (n = 20) of these boundaries were composed of an IS481 element (Table 2; see also Fig. S1B). Additionally, 12% (n = 3) were composed of an rrn operon, while two boundaries were composed of either an IS1002 element or a combination of IS1002 and IS481 (Table 2; see also Fig. S1B). When two vaccine strains (C393 and E476) were included in the alignment, all additional predicted rearrangement boundaries contained IS481 (Table 2). This larger comparison identified 45 conserved homologous blocks (≥1.5 kb), separated by 44 predicted rearrangement boundaries, 89% (n = 39) of which included copies of IS481 (Table 2).

TABLE 2

Characteristics of rearrangement boundaries

Characteristic	Value for strain type:
Characteristic	Epidemic	Epidemic and vaccine
No. of genomes	31^a	33^a
No. of rearrangement boundaries	25	44
No. of copies/genome
IS481 (≥231)	20	39
rRNA (3)	3	3
IS1002 (4)	1	1
IS1002/IS481	1	1

^aI488 and I517 excluded from analysis.

Virulence gene comparisons.

The sequences of 44 virulence-related genes for all epidemic genomes and C393 were compared to E476 (see Data Set S2 in the supplemental material). The resulting alignments revealed that 30 of these virulence-related genes showed no sequence differences relative to E476. Gene sequences extracted from the assemblies for prn, ptxP, ptxA, and fimH (fim3) exhibited the same allelic variations previously identified through multilocus sequence typing (MLST) and prn sequence typing by PCR (Table 1; see also Data Set S2). Unique single nucleotide polymorphisms (SNPs) identified in virulence-related genes of the epidemic isolates are displayed in Table 3 and in Data Set S2 in the supplemental material. Six SNPs in virulence-related genes were found to be unique to a single isolate, while two were identified in all epidemic isolates, including C393 (Table 3). Most of the SNPs identified (7/10) resulted in an amino acid change (Table 3). Interestingly, a majority of SNPs in virulence-related genes were found in GC-rich regions (Table 3). Additionally, no epidemic genomes harbored the 23S A2047G mutation known to produce erythromycin/azithromycin resistance (31,–33).

TABLE 3

Unique SNPs in virulence-related genes^a of B. pertussis epidemic isolates compared to E476 (Tohama I)

Gene	Isolate(s)	SNP location	Region characteristic	Amino acid change	Protein expressed? (49)	Reference(s)
bapC	H374	655 A→G	String of 2 G’s	219 Asp→Gly	Unknown	This study
bfrD	H564	1634 G→A	GC-rich region	545 Ser→Asn	Unknown	This study
bipA	H563	1070 G→T	GC-rich region	357 Arg→Leu	Unknown	This study
brkA	I646	640 C→T	GC-rich region	214 Pro→Ser	Unknown	This study
bscC	H375	1677 G→A	String of 2 A’s	NA^b	Yes	This study; 49
bvgR	C393	36 C→T	String of 3 T’s	NA	Unknown	This study; 23
bvgS	All epidemic isolates, C393	2113 A→G	GC-rich region	705 Lys→Glu	Yes	47, 49, 50, 51, 52
fimD	All epidemic isolates, C393	356 T→C	GC-rich region	119 Phe→Ser	Yes	47, 49, 50, 53, 54
ptxB	All epidemic isolates	133 G→A	GC-rich region	45 Gly→Ser	Yes	47, 49, 50, 55
ptxC	All epidemic isolates except H375	681 C→T	String of 2 T’s	NA	Yes	47, 49, 56

^aDoes not include previously identified SNPs associating with MLST typing loci.

^bNA, not applicable.

Data Set S2

Sequence comparison of virulence-related genes in B. pertussis epidemic isolates compared to E476 (Tohama I). Download Data Set S2, XLSX file, 0.1 MB.

This content is distributed under the terms of the Creative Commons Attribution 4.0 International license.

Variant analysis.

The phylogenetic relationships among epidemic isolates were reconstructed from a concatenated alignment of 408 variable sites (Fig. 3). There was greater diversity among the 13 California isolates (Fig. 3), with H375 being most divergent, and no clustering was observed among isolates recovered from fatal cases (Fig. 3). In contrast, Vermont isolates exhibited more homogeneity (Fig. 3). Additionally, four California genomes (H563, H561, H374, and H564) clustered among the earliest collected Vermont genomes, three of which (H788, I475, and I669) were collected from unvaccinated patients (Fig. 3). Pertactin-deficient and pertactin-producing isolates are dispersed throughout the tree (Fig. 3). The two isolates with the prn promoter disruption (I538 and I539) are exclusive to one clade (Fig. 3; Table 1).

An external file that holds a picture, illustration, etc.
Object name is sph0031620780003.jpg

FIG 3

Maximum likelihood phylogenetic reconstruction from the concatenated alignment of 408 variable sites (SNVs and MNVs). Vaccine strains resequenced in this study are indicated in black, California isolates are indicated in blue, and Vermont isolates are indicated in red. Yellow circles denote pertactin-deficient strains, and black circles denote strains recovered from fatal cases. The scale bar indicates number of substitutions per site and has been corrected for ascertainment bias based on the nucleotide composition of invariant sites in E476. All epidemic genomes were included in this analysis.

To characterize putative phenotypic effects of observed nucleotide variants, all mutations were classified as noncoding, synonymous, or nonsynonymous, as seen in Fig. 4. Within the 33 epidemic genomes analyzed, a total of 731 variants were discovered and 39.4% (n = 288) were identified as nonsynonymous (Fig. 4A; see also Data Set S3 in the supplemental material). Of these, 34 unique protein-encoding genes were mutated in at least 26 of the 33 epidemic genomes and 17% were found to be involved in inorganic ion transport and metabolism (Fig. 4B; see also Data Set S4). Epidemic genomes were divided by state and characterized in the same manner to identify state-specific mutations. California epidemic genomes contained more variants than Vermont (561 and 413, respectively), with a majority of noncoding mutations (82%) specific to California genomes and a majority of nonsynonymous mutations (53%) specific to Vermont epidemic genomes (Fig. 4A; see also Data Sets S5, S6, S7, and S8). The California isolate H375, the only isolate from the ptxP1 lineage, exhibited the largest number of isolate-specific variants of all epidemic genomes (n = 90) and accounted for seven additional noncoding variants upon exclusion from the California-specific variant analysis (Fig. 4A; see also Data Set S3). The genes affected by nonsynonymous mutations specific to each state were further identified (Table 4). In at least nine of the 12 California epidemic genomes, 63% (n = 5) of the nonsynonymous mutations were seen in five IS1663 elements (Table 4; see also Data Set S9). Nine of the 18 nonsynonymous mutations specific to Vermont affected proteins involved in various metabolic pathways (Table 4; see also Data Set S9). Additionally, no variants were found to be exclusive to prn-deficient isolates outside the prn gene or isolates collected from fatal cases.

An external file that holds a picture, illustration, etc.
Object name is sph0031620780004.jpg

FIG 4

Sequence variants within epidemic genomes. (A) Distribution of noncoding (green), synonymous (yellow), and nonsynonymous (blue) variants. (B) Functional classification of predicted protein sequences with observed nonsynonymous mutation in at least 26 epidemic genomes categorized according to EggNOG v.4. Variants were statistically significant by Fisher's exact test.

TABLE 4

State-specific nonsynonymous variants in California and Vermont epidemic B. pertussis genomes^a

Nonsynonymous variant in state	Functional category	Location in E476 (Tohama I)	Variant within gene	Amino acid change	No. of genomes
California
Transposase for IS1663	Replication, recombination, and repair	116940	947 T→A	316 Leu→Gln	13
Transposase for IS1663	Replication, recombination, and repair	527395	805 T→C	269 Tyr→His	12
Transposase for IS1663	Replication, recombination, and repair	1080959	947 T→A	316 Leu→Gln	9
Transposase for IS1663	Replication, recombination, and repair	1090325	947 T→A	316 Leu→Gln	10
Transposase for IS1663	Replication, recombination, and repair	1447860	947 T→A	316 Leu→Gln	12
Transposase	Replication, recombination, and repair	3457234	947 A→T	316 Gln→Leu	12
Regulator (hypothetical protein)	Signal transduction mechanisms	3614341	11 G→A	4 Gly→Glu	12
Vermont
Zinc transporter ZupT	Inorganic ion transport and metabolism	198405	194 T→C	65 Val→Ala	20
FAD/FMN-containing dehydrogenase	Energy production and conversion	216761	19 T→C	7 Ser→Pro	20
RNase E	Translation, ribosomal structure, and biogenesis	489152	2480 C→T	827 Pro→Leu	20
Endoribonuclease l-PSP (RutC family protein)	Function unknown	528567	186 C→G	62 Asp→Glu	20
Diacylglycerol kinase	Lipid transport and metabolism	868389	97 G→A	33 Asp→Asn	19
Glutamate-ammonia-ligase adenylyltransferase	Posttranslational modification, protein turnover, chaperones	1333208	2519 G→A	840 Gly→Asp	20
Aldo-ketoreductase	Function unknown	1388597	625 G→C	209 Ala→Pro	19
Multidrug efflux pump subunit AcrB	Inorganic ion transport and metabolism	2208591	3221 A→G	1074 Asn→Ser	19
Acyltransferase	Function unknown	2271460	24 G→C	8 Gln→His	17
Hypothetical protein	Function unknown	2276856	158 T→C	53 Val→Ala	20
Transposase	Replication, recombination, and repair	2405180	899 C→G	300 Thr→Ser	19
Potassium-transporting ATPase subunit C	Inorganic ion transport and metabolism	2640489	245 T→C	82 Ile→Thr	19
5-Hydroxyvalerate dehydrogenase (HVD)	Energy production and conversion	3011513	1016 C→T	339 Ala→Val	20
NADPH dehydrogenase	Energy production and conversion	3072916	55 A→G	19 Lys→Glu	20
ADP-dependent (S)-NAD(P)H- hydrate dehydratase	Carbohydrate transport and metabolism	3538308	166 G→A	56 Gly→Arg	11
Putative oligopeptide transporter	Function unknown	1189451-1189452	1765_GC_1766	589 Gly fs	19
ATP-cobalamin adenosyltransferase	Coenzyme transport and metabolism	3304784-3304785	545_CCGG_546	183 Ala fs	17
Hypothetical protein	Function unknown	3490857-3490858	1130_CTA_1131	377 Glu delins Asp*	20

^aAbbreviations: fs, frameshift; delins, deletion/insertion; FAD, flavin adenine dinucleotide; FMN, flavin mononucleotide; PSP, perchloric acid-soluble protein; *, produces premature stop codon.

Data Set S3

Seven hundred thirty-one variants in 33 B. pertussis epidemic genomes. Download Data Set S3, XLSX file, 0.1 MB.

This content is distributed under the terms of the Creative Commons Attribution 4.0 International license.

Data Set S4

Thirty-four nonsynonymous variants in at least 26 of the 33 B. pertussis epidemic genomes. Download Data Set S4, XLSX file, 0.1 MB.

This content is distributed under the terms of the Creative Commons Attribution 4.0 International license.

Data Set S5

Five hundred sixty-one variants in California B. pertussis epidemic genomes. Download Data Set S5, XLSX file, 0.1 MB.

This content is distributed under the terms of the Creative Commons Attribution 4.0 International license.

Data Set S6

Four hundred thirteen variants in Vermont B. pertussis epidemic genomes. Download Data Set S6, XLSX file, 0.1 MB.

This content is distributed under the terms of the Creative Commons Attribution 4.0 International license.

Data Set S7

Forty-nine California state-specific B. pertussis epidemic genome variants. Download Data Set S7, XLSX file, 0.1 MB.

This content is distributed under the terms of the Creative Commons Attribution 4.0 International license.

Data Set S8

Thirty-four Vermont state-specific B. pertussis epidemic genome variants. Download Data Set S8, XLSX file, 0.1 MB.

This content is distributed under the terms of the Creative Commons Attribution 4.0 International license.

Data Set S9

State-specific nonsynonymous B. pertussis epidemic genome variants. Download Data Set S9, XLSX file, 0.1 MB.

This content is distributed under the terms of the Creative Commons Attribution 4.0 International license.

To address the issue of vaccine-driven evolution and to determine if certain genes are under selective pressure, the frequencies of nonsynonymous variants and variants within virulence genes across all epidemic genomes were determined. Virulence gene variants had a rate of 0.22 bp/kb compared to 0.24 bp/kb for total epidemic genome variants, with no significant difference between the two. However, nonsynonymous variants had a higher frequency (0.10 bp/kb) than synonymous variants (0.03 bp/kb) (P value = 0.0029).

Go to:

DISCUSSION

This study illustrates the first effort to evaluate relationships among isolates within and between statewide pertussis epidemics through comparison of complete genome sequences. Prior to the availability of single-contig genomes, PFGE has been the only whole-genome indicator used to fully assess molecular epidemiology and patient transmission of circulating strains of B. pertussis (34). To date, analyses of gene order and genome structure heterogeneity have been limited by the resolution of available methods (35, 36). Additionally, few genomes are available with a complete genome assembly that has been confirmed with a second method such as optical mapping. During the process of complete genome assembly, genome optical maps not only provided structural confirmation of assemblies but proved to be useful as a possible genome typing tool with greater detail than that found with PFGE (with approximate location of restriction cut sites mapped). Depending on future questions, it may be beneficial to develop a hierarchical approach, in which optical mapping is first used to compare genome structures as a typing method, followed by genome sequencing if more detail is required.

Here, we generated high-quality genome assemblies and showed their capacity for detecting discrete genome structural variations among two geographically and temporally independent pertussis epidemics at a resolution never seen before. Furthermore, clustering these genomes according to rearrangement patterns revealed correlations with PFGE profiles and associating prn mutations. However, no strong association was observed between global genome structure and geography, patient fatality, vaccination status, or time of collection (Fig. 2; Table 1). The genome structures of two vaccine strains resequenced as part of this study differed greatly from epidemic isolates, supporting a growing body of evidence that suggests that current circulating strains of B. pertussis have dramatically changed since the isolation of vaccine references (Fig. 2).

Recently, Bart et al. (29) sequenced complete genomes of representative isolates from two predominate global lineages of ptxP3 and ptxP1 (B1917 and B1920, respectively) which exhibited structural architectures similar to those of the U.S. epidemic isolates reported here with comparable molecular typing profiles (Table 1; Fig. 2). The finding that B1920 clustered near the only ptxP1 epidemic isolate may indicate a shift in rearrangement profiles that corresponds to the shift from ptxP1 and ptxP3 lineages. Additionally, B1917 and B1920 strains were isolated in 2000 in the Netherlands, temporally and geographically distant from the epidemic isolates sequenced here, suggesting that rearrangement profiles may be stable over time. Although rearrangements have been identified by others, attempts to measure rearrangement rates in B. pertussis have been limited by the low resolution of PFGE, and reports of gene order changes following multiple laboratory passages, which differ from natural infection cycles, are conflicting (36,–40). Therefore, the historical pattern of genome rearrangement remains unclear. Only through methods like optical mapping and complete genome sequencing, which have been applied to B. pertussis U.S. epidemic isolates here for the first time, can these questions be addressed.

Although a diverse number of PFGE profiles among isolates are observed in this and previous studies, the genomic variation underlying this diversity can now be interrogated (11). To do so, we identified genes located at predicted rearrangement boundaries in all fully assembled genomes in this study. The highly conserved rrn operons and repetitive mobile elements IS481 and IS1002 were found at predicted rearrangement breakpoints in both epidemic and vaccine isolate genomes, the majority of which were composed of IS481 elements (Table 2; see also Fig. S1 in the supplemental material). Almost all predicted rearrangements were in the form of inversions flanked by inverted sequence repeats, often “symmetric” around the origin or terminus of replication, maintaining replichore balance important for genome stability (41, 42). These data support previous evidence that rearrangements in the B. pertussis genome are mediated by insertion sequence (IS) elements through homologous recombination between inverted repeats, presumably during replication, a mechanism long thought to maintain genome plasticity (22, 35, 36, 41, 43, 44).

Aside from rearrangements, mobile elements have been shown to play a role in the creation of pseudogenes, genome reduction, and adaptive evolution toward host specificity (22, 36, 43, 45, 46). Accurate identification of all IS481, IS1002, and IS1663 insertions is necessary to evaluate their contribution to genome evolution, and previous studies using short-read sequencing have been unsuccessful (24, 47). Our pipeline provided more robust sequencing data for proper placement of insertion sequences to construct more accurate assemblies, evident when comparing the corrected number of IS481 elements discovered in the resequenced vaccine strains E476 and C393 and assembly structure errors found when comparing the NCBI CS reference genome with the resequenced C393 (see Data Set S1 in the supplemental material). Alignment of complete genome sequences provides further evidence that IS481 transposition remains active, consistent with previous reports, but also suggests that this expansion of repetitive sequence may provide additional opportunities for rearrangement (48). With the exception of H561, most Vermont and California epidemic genomes contained more IS481 copies than both Tohama I and E476. Additionally, further acquisition of insertion sequence copies in currently circulating strains of B. pertussis may indicate intraspecies host adaptation through gene inactivation that allows B. pertussis to evade protective immunity elicited by the acellular vaccine and cause disease in a vaccinated population. This claim is further supported by studies indicating that the majority of prn-deficient U.S. isolates harbor an IS481 within the prn gene locus (11, 15). However, the results of this study suggest that the role of mobile elements in B. pertussis genome evolution may not be limited to reduction, raising questions about the potential fitness implications of rearrangement, as others have also speculated (40).

In an effort to identify potential correlates of protection not included in the U.S. acellular vaccine, nonsynonymous mutations were investigated in 40 additional virulence-related genes and throughout the genomes of all epidemic isolates. Many virulence-related gene mutations were identified in isolates collected from infants and children with various effects on amino acid sequence within either a single isolate or all epidemic isolates (Tables 1 and and3)3) (23, 47, 49,–56). Although statistically insignificant due to the small sample size of this study, rates of SNPs in virulence genes seem to support the idea of genetic drift and are not found to occur more frequently than in other genes. However, the rate of nonsynonymous variants suggests that genetic elements might be under selective pressure. The majority of nonsynonymous mutations (53%) were found to affect proteins either identified as transporters or associated with the transport and metabolism of various cellular molecules (Fig. 4B; see also Data Set S4 in the supplemental material). This overrepresentation of SNPs in transport proteins was also seen in a study of SNP density in the global population of pertussis isolates and may provide insight into the mechanism by which current strains have adapted to the current global vaccination state (24). Although state-specific mutations were discovered in IS elements and metabolic proteins in California and Vermont, respectively, it is difficult to make phenotypic implications based on state specificity, disease presentation, or demographic due to the highly biased selection of the small number of isolates chosen for whole-genome sequencing and the lack of a functional assessment in the study. Transcriptomics, protein expression, and functional assays of proteins affected by nonsynonymous mutations in these genomes are needed to fully understand how these mutations affect the ability of B. pertussis to infect and cause disease, with varying severity, in the host.

It is clear that we do not fully understand the correlates of protection against B. pertussis. However, this study has expanded our understanding of prn deficiency, specifically with genomic data visualizing the true nature of the previously predicted promoter inversion and providing evidence that no conserved single or multiple mutational events may be compensating for prn deficiency in this collection of isolates, calling into question the role that pertactin plays in pertussis disease (11, 15). This analysis in conjunction with proteomics investigation of several of these isolates, which we are actively pursuing, provides a starting point for determining what proteins are required for disease in certain host populations while identifying potential candidates for future vaccine components.

Advances in next-generation sequencing technology, whole-genome mapping, and bioinformatics improve access to complete genomes, which allow comparative analyses at both the nucleotide and the structural levels. In this study, we compared 31 complete genomes from two recent statewide B. pertussis epidemics with vaccine strains and observed the first definitive view of genome structural variation through rearrangement at a nucleotide sequence resolution within this species. These data challenge the previously held view of population clonality, revealing new levels of diversity, even within geographically defined epidemics. As more complete genomic, transcriptomic, and proteomic data are made available for additional strains collected across broader time periods, it will become possible to discern the stability and rate of rearrangements, when certain structural profiles emerged, and whether profiles shift between epidemic and nonepidemic years. Such further analyses are needed to determine how rearrangement events correlate with SNP phylogenetic relationships and aid development of a comprehensive typing methodology that incorporates nucleotide and structural variation to draw meaningful conclusions from associated clinical metadata. It remains unclear if and how rearrangements play a role in adaption and virulence. Genomes of isolates collected from asymptomatic carriers are needed as a baseline, in conjunction with transcriptomic and proteomic analysis, to determine what insertion elements or rearrangements, if any, are specific to increased disease severity in some patients. Although this study cannot link rearrangements to virulence or adaptation, it highlights the need for complete assemblies to fully evaluate the contribution of pathogen evolution, in addition to waning immunity and incomplete protection from current vaccines, toward pertussis reemergence.

Go to:

MATERIALS AND METHODS

Bacterial strains.

Isolates submitted to the California Department of Public Health (CDPH) were received in three manners. First, local health departments passively submitted isolates to CDPH after performing primary isolation or after hospital submission. For B. pertussis deaths, CDPH actively reached out to the local health jurisdiction or hospital to determine if an isolate was available and requested submission to CDPH. Additionally, in 2010 local public health laboratories were sent a request for submission to CDPH of any B. pertussis isolate that had not been previously submitted. The 13 California isolates sequenced in this study (Table 1) were chosen as part of a collaborative project between CDPH and the CDC focused on determining the phylogenetic relationship of B. pertussis isolates from California infants less than 3 months old where B. pertussis infection led to either fatal or nonfatal pertussis disease (49). From November 2011 to December 2012, 4,100 nasopharyngeal specimens were received at the Vermont Department of Health Laboratory (VDHL) from pediatric practices and hospitals and tested by culture and, if ordered by the physician, by PCR. During the outbreak, 72% of all PCR-positive specimens were confirmed by culture, of which the VDHL submitted 432 isolates to the CDC in 2013 for molecular characterization. In general, prn genotype and PFGE profiles were taken into account to select the most diverse pool of 20 Vermont isolates used for whole-genome sequencing. The CS strain (C393) and Tohama I isolate (E476) were also resequenced in this study (Table 1).

Genomic DNA preparation.

Isolates were cultured on Regan-Lowe agar without cephalexin for 72 h at 37°C. Genomic DNA isolation and purification were conducted according to the Qiagen Gentra Puregene yeast/bacterial kit standard protocol with slight modification (Qiagen, Valencia, CA). Briefly, two aliquots of approximately 1 × 10⁹ bacterial cells were harvested and resuspended in 500 µl of 0.85% sterile saline and then pelleted by centrifugation for 1 min at 16,000 × g. Following the protocol to completion, 100 µl of DNA hydration solution was added to dissolve the genomic DNA. Aliquots were quantified and qualified using a NanoDrop 2000 (Thermo Fisher Scientific Inc., Wilmington, DE) spectrophotometer.

Genome sequencing and assembly.

Whole-genome shotgun sequencing of each isolate was performed using a combination of the PacBio RSII (Pacific Biosciences, Menlo Park, CA) and Illumina MiSeq (Illumina, San Diego, CA) platforms. Genomic DNA libraries were prepared for PacBio sequencing runs using the SMRTbell template prep kit 1.0 and polymerase binding kit P4, while MiSeq libraries were prepared using the NEB Ultra Library prep kit (New England BioLabs, Ipswich, MA). PacBio sequencing reads were filtered with the following cutoffs: 500-bp minimum subread length, 0.80 minimum polymerase read quality, and 100-bp minimum polymerase read length. De novo genome assembly of passed reads was performed first using the Hierarchical Genome Assembly Process (HGAP, v3; Pacific Biosciences) with a 6-kb minimum seed read length, 15× target coverage, 0.06 overlap error rate, and 40-bp minimum overlap length (57). These initial assemblies were further improved by unambiguously mapped PacBio reads with a minimum length of 50 bp and quality score of 75 using BLASR (v1) with a maximum divergence of 30% and minimum anchor of 12 bp (28). The resulting consensus sequences were determined with Quiver (v1), manually checked for circularity, and then reordered to match the start of the CS reference sequence (NC_017223).

Assemblies were confirmed by comparison to restriction digest optical maps (as described below) and further “polished” by mapping Illumina MiSeq PE-150 reads using CLC Genomics Workbench (v7.5; CLC bio, Boston, MA). Raw reads were first trimmed at both ends to remove bases with quality scores less than 0.01 and ambiguous nucleotides and then filtered to remove reads less than 45 bp. The resulting trimmed reads were mapped against the HGAP assembly with the following parameters: mismatch cost of 2, insertion/deletion cost of 3, length fraction of 0.95, similarity fraction of 0.95, and local alignment end gap calculation. Potential errors were identified using the Basic Variant Detection module with the following parameters: maximum coverage of 1,000×, minimum coverage of 10×, minimum read count of 5, minimum variant frequency of 51%, neighborhood radius of 5 bp, minimum central quality score of 25, and minimum average neighborhood quality score of 20. Detect errors were then corrected either manually or with a custom Perl script. Assembly statistics are detailed in Data Set S1 in the supplemental material. Final assemblies were annotated using the NCBI automated Prokaryotic Genome Annotation Pipeline (PGAP).

Optical mapping.

Optical maps for each isolate were prepared from cells of single 1-mm colony equivalents following growth on Regan-Lowe agar without cephalexin using the Argus system (OpGen, Gaithersburg, MD) according to special company protocols. Briefly, high-molecular-mass bacterial DNA (205-kbp average size) was isolated with minimal shearing and applied to a chemically modified glass surface with fabricated microfluidic channels. The stretched DNA on the channels was digested in situ with KpnI in a partial digestion mode and stained with a JoeJoe fluorescent dye on an automatic MapCard processor. To confirm revealed unusual insertions and duplications, restriction enzyme BamHI was used. The digested DNA molecules were imaged using an Argus fluorescence microscope and Path-Finder automated image-acquisition and tiling optical map assembly software (OpGen). The resulting single-molecule restriction maps were assembled into consensus whole-genome maps with Gentig software (OpGen) that recurrently aligned overlapping DNA molecules with similar fragments to calculate a concluding map. Final whole-genome maps in this study are composites from at least 32 single fragmented molecules at every point and typically represent an average depth of 50 to 300 molecules. Restriction map alignments between different strains were generated using MapSolver software (v.2.1.1; OpGen, Gaithersburg, MD). Pairwise alignments were performed between all maps using an alignment score of 3. Optical maps for each of the 33 epidemic isolate and vaccine strain single-contig assemblies were compared to the in silico restriction map of the vaccine isolate Tohama I (NC_002929.2) using MapSolver as an independent validation. Optical maps were also compared to in silico digests of HGAP assemblies to confirm genome structure, and when necessary, de novo assemblies were repeated.

Whole-genome alignment.

Whole-genome assemblies of epidemic isolates were aligned using progressiveMauve (58) along with C393, E476, and four publicly available complete genomes: CS (NC_017223), Tohama I (NC_002929), B1917 (CP009751), and B1920 (CP009752). Genomes were clustered based on changes in order and orientation of homologous sequence blocks using the MLGO pipeline (59). Multicontig assemblies for I488 and I517 were excluded from whole-genome alignment and rearrangement analyses.

Virulence gene and variant analysis.

Genome assemblies for all 33 epidemic isolates were compared to E476 (Tohama I) using CLC Genomics Workbench. Coding regions of 44 known virulence genes were found using BLASTn and aligned against E476 using MEGA6 (60) to detect SNPs and predict resulting amino acid changes. All SNPs were confirmed by manual inspection of Illumina read alignments. Additionally, we have proposed the nomenclature of two genes, fimW (fim2) and fimH (fim3), to better conform to bacterial nomenclature standards and avoid confusion in allele typing.

Whole-genome variant analysis was performed by mapping from Illumina reads of each isolate to the E476 genome. The mapping parameters were the same as described for genome “polishing” above with a few exceptions. These exceptions include a length fraction of 0.90 and a similarity fraction of 0.90. Variants were determined using the Basic Variant Detection tool in CLC Genomics Workbench. A phylogenetic tree was constructed from single nucleotide variants (SNVs; n = 758) and multiple nucleotide variants (MNVs) (n = 123) after removal of variants in proximity to IS481 and IS1002 insertions. The resulting 408-bp concatenated alignment was used to calculate a maximum likelihood phylogenetic reconstruction with ascertainment bias correction using RAxML v8 (61). Proteins containing nonsynonymous mutations were predicted based on the E476 genome annotation information and then were further classified into functional categories using HMMER v3.1b2 (http://hmmer.org/) (62) to search a betaproteobacterium-specific subset of EggNOG ver. 4.1 (63).

Nucleotide sequence accession numbers.

The whole-genome shotgun sequences have been deposited at DDBJ/EMBL/GenBank under the accession numbers CP010249 to CP010266, CP010347, CP010838 to CP010847, CP010961 to CP010964, JWLA00000000, and JWLB00000000. The versions described in this paper are the first versions. The project description and related metadata are accessible through BioProject number PRJNA266616.

Go to:

ACKNOWLEDGMENTS

We thank the personnel from the public health departments of California and Vermont for their contributions to isolate and patient data collection, molecular testing, and isolate submission to CDC. Specifically, we acknowledge Jennifer Zipprich, Kathleen Winter, Kathleen Harriman, and John Talarico from the California Department of Public Health Immunization Branch. In addition, we thank Matthew Thomas, Susan Schoenfeld, and Patsy Kelso from the Vermont Department of Health Epidemiology Program and Valarie Devlin, Jessica Chenette, Kathleen Treadway, Christine Matusevich, Alan Finn, Barb Cote, Becky Temple, and Laura Kamhi from the Vermont Department of Health Laboratory. We also thank the additional laboratory personnel, specifically Mike Frace, within the Biotechnology Core Facility Branch at the Centers for Disease Control and Prevention for their efforts in sequencing all genomes for this study.

The findings and conclusions in this report are those of the authors and do not necessarily represent the official position of the Centers for Disease Control and Prevention.

Go to:

Funding Statement

This research received no specific grant from any funding agency in the public, commercial, or not-for-profit sectors.

Go to:

REFERENCES

1. Tanaka M, Vitek CR, Pascual FB, Bisgard KM, Tate JE, Murphy TV. 2003. Trends in pertussis among infants in the United States, 1980–1999. JAMA 290:2968–2975. 10.1001/jama.290.22.2968. [Abstract] [CrossRef] [Google Scholar]

2. Davis SF, Strebel PM, Cochi SL, Zell ER, Hadler SC. 1992. Pertussis surveillance—United States, 1989–1991. MMWR Surveill Summ 41:11–19. [Abstract] [Google Scholar]

3. Güriş D, Strebel PM, Bardenheier B, Brennan M, Tachdjian R, Finch E, Wharton M, Livengood JR. 1999. Changing epidemiology of pertussis in the United States: increasing reported incidence among adolescents and adults, 1990–1996. Clin Infect Dis 28:1230–1237. 10.1086/514776. [Abstract] [CrossRef] [Google Scholar]

4. Zanardi L, Pascual FB, Bisgard K, Murphy T, Wharton M, Maurice E. 2002. Pertussis—United States, 1997–2000. MMWR Morb Mortal Wkly Rep 51:73–76. [Abstract] [Google Scholar]

5. Broder KR, Cortese MM, Iskander JK, Kretsinger K, Slade BA, Brown KH, Mijalski CM, Tiwari T, Weston EJ, Cohn AC, Srivastava PU, Moran JS, Schwartz B, Murphy TV, Advisory Committee on Immunization Practices . 2006. Preventing tetanus, diphtheria, and pertussis among adolescents: use of tetanus toxoid, reduced diphtheria toxoid and acellular pertussis vaccines recommendations of the Advisory Committee on Immunization Practices (ACIP). MMWR Recomm Rep 55(RR-3):1–34. [Abstract] [Google Scholar]

6. Kretsinger K, Broder KR, Cortese MM, Joyce MP, Ortega-Sanchez I, Lee GM, Tiwari T, Cohn AC, Slade BA, Iskander JK, Mijalski CM, Brown KH, Murphy TV, Centers for Disease Control and Prevention, Advisory Committee on Immunization Practices, Healthcare Infection Control Practices Advisory Committee . 2006. Preventing tetanus, diphtheria, and pertussis among adults: use of tetanus toxoid, reduced diphtheria toxoid and acellular pertussis vaccine recommendations of the Advisory Committee on Immunization Practices (ACIP) and recommendation of ACIP, supported by the Healthcare Infection Control Practices Advisory Committee (HICPAC), for use of Tdap among health-care personnel. MMWR Recomm Rep 55(RR-17):1–37. [Abstract] [Google Scholar]

7. Winter K, Harriman K, Zipprich J, Schechter R, Talarico J, Watt J, Chavez G. 2012. California pertussis epidemic, 2010. J Pediatr 161:1091–1096. 10.1016/j.jpeds.2012.05.041. [Abstract] [CrossRef] [Google Scholar]

8. DeBolt C, Tasslimi A, Bardi J, Leader BT, Hiatt B, Qin X, Patel M, Martin S, Tondella ML, Cassiday P, Faulkner A, Messonnier NE, Clark TA, Meyer S. 2012. Pertussis epidemic—Washington, 2012. MMWR Morb Mortal Wkly Rep 61:517–522. [Abstract] [Google Scholar]

9. Centers for Disease Control and Prevention (CDC) 2012. Summary of notifiable diseases–United States, 2010. MMWR Morb Mortal Wkly Rep 59:1–111. [Abstract] [Google Scholar]

10. Adams DA, Jajosky RA, Ajani U, Kriseman J, Sharp P, Onwen DH, Schley AW, Anderson WJ, Grigoryan A, Aranas AE, Wodajo MS, Abellera JP, Centers for Disease Control and Prevention (CDC) 2014. Summary of notifiable diseases—United States, 2012. MMWR Morb Mortal Wkly Rep 61:1–121. [Abstract] [Google Scholar]

11. Bowden KE, Williams MM, Cassiday PK, Milton A, Pawloski L, Harrison M, Martin SW, Meyer S, Qin X, DeBolt C, Tasslimi A, Syed N, Sorrell R, Tran M, Hiatt B, Tondella ML. 2014. Molecular epidemiology of the pertussis epidemic in Washington State in 2012. J Clin Microbiol 52:3549–3557. 10.1128/JCM.01189-14. [Europe PMC free article] [Abstract] [CrossRef] [Google Scholar]

12. Misegades LK, Winter K, Harriman K, Talarico J, Messonnier NE, Clark TA, Martin SW. 2012. Association of childhood pertussis with receipt of 5 doses of pertussis vaccine by time since last vaccine dose, California, 2010. JAMA 308:2126–2132. 10.1001/jama.2012.14939. [Abstract] [CrossRef] [Google Scholar]

13. Klein NP, Bartlett J, Rowhani-Rahbar A, Fireman B, Baxter R. 2012. Waning protection after fifth dose of acellular pertussis vaccine in children. N Engl J Med 367:1012–1019. 10.1056/NEJMoa1200850. [Abstract] [CrossRef] [Google Scholar]

14. Martin SW, Pawloski L, Williams M, Weening K, DeBolt C, Qin X, Reynolds L, Kenyon C, Giambrone G, Kudish K, Miller L, Selvage D, Lee A, Skoff TH, Kamiya H, Cassiday PK, Tondella ML, Clark TA. 2015. Pertactin-negative Bordetella pertussis strains: evidence for a possible selective advantage. Clin Infect Dis 60:223–227. 10.1093/cid/ciu788. [Abstract] [CrossRef] [Google Scholar]

15. Pawloski LC, Queenan AM, Cassiday PK, Lynch AS, Harrison MJ, Shang W, Williams MM, Bowden KE, Burgos-Rivera B, Qin X, Messonnier N, Tondella ML. 2014. Prevalence and molecular characterization of pertactin-deficient Bordetella pertussis in the United States. Clin Vaccine Immunol 21:119–125. 10.1128/CVI.00717-13. [Europe PMC free article] [Abstract] [CrossRef] [Google Scholar]

16. Quinlan T, Musser KA, Currenti SA, Zansky SM, Halse TA. 2014. Pertactin-negative variants of Bordetella pertussis in New York State: a retrospective analysis, 2004–2013. Mol Cell Probes 28:138–140. 10.1016/j.mcp.2013.12.003. [Abstract] [CrossRef] [Google Scholar]

17. Zeddeman A, van Gent M, Heuvelman CJ, van der Heide HG, Bart MJ, Advani A, Hallander HO, von Koenig CH, Riffelman M, Storsaeter J, Vestrheim DF, Dalby T, Krogfelt KA, Fry NK, Barkoff AM, Mertsola J, He Q, Mooi F. 2014. Investigations into the emergence of pertactin-deficient Bordetella pertussis isolates in six European countries, 1996 to 2012. EuroSurveill 19:17–27. 10.2807/1560-7917.ES2014.19.33.20881. [Abstract] [CrossRef] [Google Scholar]

18. Tsang RS, Shuel M, Jamieson FB, Drews S, Hoang L, Horsman G, Lefebvre B, Desai S, St-Laurent M. 2014. Pertactin-negative Bordetella pertussis strains in Canada: characterization of a dozen isolates based on a survey of 224 samples collected in different parts of the country over the last 20 years. Int J Infect Dis 28:65–69. 10.1016/j.ijid.2014.08.002. [Abstract] [CrossRef] [Google Scholar]

19. Bouchez V, Brun D, Cantinelli T, Dore G, Njamkepo E, Guiso N. 2009. First report and detailed characterization of B. pertussis isolates not expressing pertussis toxin or pertactin. Vaccine 27:6034–6041. 10.1016/j.vaccine.2009.07.074. [Abstract] [CrossRef] [Google Scholar]

20. Williams MM, Sen K, Weigand MR, Skoff TH, Cunningham VA, Halse TA, Tondella ML, CDC Pertussis Working Group . 2016. Bordetella pertussis strain lacking pertactin and pertussis toxin. Emerg Infect Dis 22:319–322. 10.3201/eid2202.151332. [Europe PMC free article] [Abstract] [CrossRef] [Google Scholar]

21. Schmidtke AJ, Boney KO, Martin SW, Skoff TH, Tondella ML, Tatti KM. 2012. Population diversity among Bordetella pertussis isolates, United States, 1935–2009. Emerg Infect Dis 18:1248–1255. 10.3201/eid1808.120082. [Europe PMC free article] [Abstract] [CrossRef] [Google Scholar]

22. Parkhill J, Sebaihia M, Preston A, Murphy LD, Thomson N, Harris DE, Holden MT, Churcher CM, Bentley SD, Mungall KL, Cerdeño-Tárraga AM, Temple L, James K, Harris B, Quail MA, Achtman M, Atkin R, Baker S, Basham D, Bason N, Cherevach I, Chillingworth T, Collins M, Cronin A, Davis P, Doggett J, Feltwell T, Goble A, Hamlin N, Hauser H, Holroyd S, Jagels K, Leather S, Moule S, Norberczak H, O’Neil S, Ormond D, Price C, Rabbinowitsch E, Rutter S, Sanders M, Saunders D, Seeger K, Sharp S, Simmonds M, Skelton J, Squares R, Squares S, Stevens K, Unwin L. 2003. Comparative analysis of the genome sequences of Bordetella pertussis, Bordetella parapertussis and Bordetella bronchiseptica. Nat Genet 35:32–40. 10.1038/ng1227. [Abstract] [CrossRef] [Google Scholar]

23. Zhang S, Xu Y, Zhou Z, Wang S, Yang R, Wang J, Wang L. 2011. Complete genome sequence of Bordetella pertussis CS, a Chinese pertussis vaccine strain. J Bacteriol 193:4017–4018. 10.1128/JB.05184-11. [Europe PMC free article] [Abstract] [CrossRef] [Google Scholar]

24. Bart MJ, Harris SR, Advani A, Arakawa Y, Bottero D, Bouchez V, Cassiday PK, Chiang CS, Dalby T, Fry NK, Gaillard ME, van Gent M, Guiso N, Hallander HO, Harvill ET, He Q, van der Heide HG, Heuvelman K, Hozbor DF, Kamachi K, Karataev GI, Lan R, Lutyłska A, Maharjan RP, Mertsola J, Miyamura T, Octavia S, Preston A, Quail MA, Sintchenko V, Stefanelli P, Tondella ML, Tsang RS, Xu Y, Yao SM, Zhang S, Parkhill J, Mooi FR. 2014. Global population structure and evolution of Bordetella pertussis and their relationship with vaccination. mBio 5:e01074-14. 10.1128/mBio.01074-14. [Europe PMC free article] [Abstract] [CrossRef] [Google Scholar]

25. Sealey KL, Harris SR, Fry NK, Hurst LD, Gorringe AR, Parkhill J, Preston A. 2015. Genomic analysis of isolates from the United Kingdom 2012 pertussis outbreak reveals that vaccine antigen genes are unusually fast evolving. J Infect Dis 212:294–301. 10.1093/infdis/jiu665. [Abstract] [CrossRef] [Google Scholar]

26. Harvill ET, Goodfield LL, Ivanov Y, Meyer JA, Newth C, Cassiday P, Tondella ML, Liao P, Zimmerman J, Meert K, Wessel D, Berger J, Dean JM, Holubkov R, Burr J, Liu T, Brinkac L, Kim M, Losada L. 2013. Genome sequences of 28 Bordetella pertussis U.S. outbreak strains dating from 2010 to 2012. Genome Announc 1:e01075-13. 10.1128/genomeA.01075-13. [Europe PMC free article] [Abstract] [CrossRef] [Google Scholar]

27. Xavier BB, Sabirova J, Pieter M, Hernalsteens JP, de Greve H, Goossens H, Malhotra-Kumar S. 2014. Employing whole genome mapping for optimal de novo assembly of bacterial genomes. BMC Res Notes 7:484. 10.1186/1756-0500-7-484. [Europe PMC free article] [Abstract] [CrossRef] [Google Scholar]

28. Chin CS, Alexander DH, Marks P, Klammer AA, Drake J, Heiner C, Clum A, Copeland A, Huddleston J, Eichler EE, Turner SW, Korlach J. 2013. Nonhybrid, finished microbial genome assemblies from long-read SMRT sequencing data. Nat Methods 10:563–569. 10.1038/nmeth.2474. [Abstract] [CrossRef] [Google Scholar]

29. Bart MJ, Zeddeman A, van der Heide HG, Heuvelman K, van Gent M, Mooi FR. 2014. Complete genome sequences of Bordetella pertussis isolates B1917 and B1920, representing two predominant global lineages. Genome Announc 2:e01301-14. 10.1128/genomeA.01301-14. [Europe PMC free article] [Abstract] [CrossRef] [Google Scholar]

30. Bart MJ, van der Heide HG, Zeddeman A, Heuvelman K, van Gent M, Mooi FR. 2015. Complete genome sequences of 11 Bordetella pertussis strains representing the pandemic ptxP3 lineage. Genome Announc 3:e01394-15. 10.1128/genomeA.01394-15. [Europe PMC free article] [Abstract] [CrossRef] [Google Scholar]

31. Bartkus JM, Juni BA, Ehresmann K, Miller CA, Sanden GN, Cassiday PK, Saubolle M, Lee B, Long J, Harrison AR, Besser JM. 2003. Identification of a mutation associated with erythromycin resistance in Bordetella pertussis: implications for surveillance of antimicrobial resistance. J Clin Microbiol 41:1167–1172. 10.1128/JCM.41.3.1167-1172.2003. [Europe PMC free article] [Abstract] [CrossRef] [Google Scholar]

32. Wang Z, Han R, Liu Y, Du Q, Liu J, Ma C, Li H, He Q, Yan Y. 2015. Direct detection of erythromycin-resistant Bordetella pertussis in clinical specimens by PCR. J Clin Microbiol 53:3418–3422. 10.1128/JCM.01499-15. [Europe PMC free article] [Abstract] [CrossRef] [Google Scholar]

33. Yang Y, Yao K, Ma X, Shi W, Yuan L, Yang Y. 2015. Variation in Bordetella pertussis susceptibility to erythromycin and virulence-related genotype changes in China (1970–2014). PLoS One 10:e0138941. 10.1371/journal.pone.0138941. [Europe PMC free article] [Abstract] [CrossRef] [Google Scholar]

34. Bisgard KM, Christie CD, Reising SF, Sanden GN, Cassiday PK, Gomersall C, Wattigney WA, Roberts NE, Strebel PM. 2001. Molecular epidemiology of Bordetella pertussis by pulsed-field gel electrophoresis profile: Cincinnati, 1989–1996. J Infect Dis 183:1360–1367. 10.1086/319858. [Abstract] [CrossRef] [Google Scholar]

35. Stibitz S, Yang MS. 1999. Genomic plasticity in natural populations of Bordetella pertussis. J Bacteriol 181:5512–5515. [Europe PMC free article] [Abstract] [Google Scholar]

36. Brinig MM, Cummings CA, Sanden GN, Stefanelli P, Lawrence A, Relman DA. 2006. Significant gene order and expression differences in Bordetella pertussis despite limited gene content variation. J Bacteriol 188:2375–2382. 10.1128/JB.188.7.2375-2382.2006. [Europe PMC free article] [Abstract] [CrossRef] [Google Scholar]

37. Advani A, Donnelly D, Hallander H. 2004. Reference system for characterization of Bordetella pertussis pulsed-field gel electrophoresis profiles. J Clin Microbiol 42:2890–2897. 10.1128/JCM.42.7.2890-2897.2004. [Europe PMC free article] [Abstract] [CrossRef] [Google Scholar]

38. Khattak MN, Matthews RC. 1993. Genetic relatedness of Bordetella species as determined by macrorestriction digests resolved by pulsed-field gel electrophoresis. Int J Syst Bacteriol 43:659–664. 10.1099/00207713-43-4-659. [Abstract] [CrossRef] [Google Scholar]

39. Beall B, Cassiday PK, Sanden GN. 1995. Analysis of Bordetella pertussis isolates from an epidemic by pulsed-field gel electrophoresis. J Clin Microbiol 33:3083–3086. [Europe PMC free article] [Abstract] [Google Scholar]

40. Belcher T, Preston A. 2015. Bordetella pertussis evolution in the (functional) genomics era. Pathog Dis 73:ftv064. 10.1093/femspd/ftv064. [Europe PMC free article] [Abstract] [CrossRef] [Google Scholar]

41. Darling AE, Miklós I, Ragan MA. 2008. Dynamics of genome rearrangement in bacterial populations. PLoS Genet 4:e1000128. 10.1371/journal.pgen.1000128. [Europe PMC free article] [Abstract] [CrossRef] [Google Scholar]

42. Hughes D. 2000. Evaluating genome dynamics: the constraints on rearrangements within bacterial genomes. Genome Biol 1:reviews0006. 10.1186/gb-2000-1-6-reviews0006. [Europe PMC free article] [Abstract] [CrossRef] [Google Scholar]

43. Bryant J, Chewapreecha C, Bentley SD. 2012. Developing insights into the mechanisms of evolution of bacterial pathogens from whole-genome sequences. Future Microbiol 7:1283–1296. 10.2217/fmb.12.108. [Europe PMC free article] [Abstract] [CrossRef] [Google Scholar]

44. Stibitz S, Yang MS. 1997. Genomic fluidity of Bordetella pertussis assessed by a new method for chromosomal mapping. J Bacteriol 179:5820–5826. [Europe PMC free article] [Abstract] [Google Scholar]

45. Lerat E, Ochman H. 2004. Psi-Phi: exploring the outer limits of bacterial pseudogenes. Genome Res 14:2273–2278. 10.1101/gr.2925604. [Europe PMC free article] [Abstract] [CrossRef] [Google Scholar]

46. McCutcheon JP, Moran NA. 2012. Extreme genome reduction in symbiotic bacteria. Nat Rev Microbiol 10:13–26. 10.1038/nrmicro2670. [Abstract] [CrossRef] [Google Scholar]

47. Bart MJ, van Gent M, van der Heide HG, Boekhorst J, Hermans P, Parkhill J, Mooi FR. 2010. Comparative genomics of prevaccination and modern Bordetella pertussis strains. BMC Genomics 11:627. 10.1186/1471-2164-11-627. [Europe PMC free article] [Abstract] [CrossRef] [Google Scholar]

48. Stibitz S. 1998. IS481 and IS1002 of Bordetella pertussis create a 6-base-pair duplication upon insertion at a consensus target site. J Bacteriol 180:4963–4966. [Europe PMC free article] [Abstract] [Google Scholar]

49. Williamson YM, Moura H, Whitmon J, Woolfitt AR, Schieltz DM, Rees JC, Guo S, Kirkham H, Bouck D, Ades EW, Tondella ML, Carlone GM, Sampson JS, Barr JR. 2015. A proteomic characterization of Bordetella pertussis clinical isolates associated with a California state pertussis outbreak. Int J Proteomics 2015:536537. 10.1155/2015/536537. [Europe PMC free article] [Abstract] [CrossRef] [Google Scholar]

50. Maharjan RP, Gu C, Reeves PR, Sintchenko V, Gilbert GL, Lan R. 2008. Genome-wide analysis of single nucleotide polymorphisms in Bordetella pertussis using comparative genomic sequencing. Res Microbiol 159:602–608. 10.1016/j.resmic.2008.08.004. [Abstract] [CrossRef] [Google Scholar]

51. Herrou J, Debrie AS, Willery E, Renauld-Mongénie G, Locht C, Mooi F, Jacob-Dubuisson F, Antoine R. 2009. Molecular evolution of the two-component system BvgAS involved in virulence regulation in Bordetella. PLoS One 4:e6996. 10.1371/journal.pone.0006996. [Europe PMC free article] [Abstract] [CrossRef] [Google Scholar]

52. Goyard S, Bellalou J, Mireau H, Ullmann A. 1994. Mutations in the Bordetella pertussis bvgS gene that confer altered expression of the fhaB gene in Escherichia coli. J Bacteriol 176:5163–5166. [Europe PMC free article] [Abstract] [Google Scholar]

53. Geuijen CA, Willems RJ, Bongaerts M, Top J, Gielen H, Mooi FR. 1997. Role of the Bordetella pertussis minor fimbrial subunit, FimD, in colonization of the mouse respiratory tract. Infect Immun 65:4222–4228. [Europe PMC free article] [Abstract] [Google Scholar]

54. Hazenbos WL, Geuijen CA, van den Berg BM, Mooi FR, van Furth R. 1995. Bordetella pertussis fimbriae bind to human monocytes via the minor fimbrial subunit FimD. J Infect Dis 171:924–929. 10.1093/infdis/171.4.924. [Abstract] [CrossRef] [Google Scholar]

55. Stein PE, Boodhoo A, Armstrong GD, Cockle SA, Klein MH, Read RJ. 1994. The crystal structure of pertussis toxin. Structure 2:45–57. 10.1016/S0969-2126(00)00007-1. [Abstract] [CrossRef] [Google Scholar]

56. Van Loo IH, Heuvelman KJ, King AJ, Mooi FR. 2002. Multilocus sequence typing of Bordetella pertussis based on surface protein genes. J Clin Microbiol 40:1994–2001. 10.1128/JCM.40.6.1994-2001.2002. [Europe PMC free article] [Abstract] [CrossRef] [Google Scholar]

57. Chaisson MJ, Tesler G. 2012. Mapping single molecule sequencing reads using basic local alignment with successive refinement (BLASR): application and theory. BMC Bioinformatics 13:238. 10.1186/1471-2105-13-238. [Europe PMC free article] [Abstract] [CrossRef] [Google Scholar]

58. Darling AE, Mau B, Perna NT. 2010. progressiveMauve: multiple genome alignment with gene gain, loss and rearrangement. PLoS One 5:e11147. 10.1371/journal.pone.0011147. [Europe PMC free article] [Abstract] [CrossRef] [Google Scholar]

59. Hu F, Lin Y, Tang J. 2014. MLGO: phylogeny reconstruction and ancestral inference from gene-order data. BMC Bioinformatics 15:354. 10.1186/s12859-014-0354-6. [Europe PMC free article] [Abstract] [CrossRef] [Google Scholar]

60. Tamura K, Stecher G, Peterson D, Filipski A, Kumar S. 2013. MEGA6: molecular evolutionary genetics analysis version 6.0. Mol Biol Evol 30:2725–2729. 10.1093/molbev/mst197. [Europe PMC free article] [Abstract] [CrossRef] [Google Scholar]

61. Stamatakis A. 2014. RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics 30:1312–1313. 10.1093/bioinformatics/btu033. [Europe PMC free article] [Abstract] [CrossRef] [Google Scholar]

62. Finn RD, Clements J, Eddy SR. 2011. HMMER web server: interactive sequence similarity searching. Nucleic Acids Res 39:W29–W37. 10.1093/nar/gkr367. [Europe PMC free article] [Abstract] [CrossRef] [Google Scholar]

63. Powell S, Forslund K, Szklarczyk D, Trachana K, Roth A, Huerta-Cepas J, Gabaldón T, Rattei T, Creevey C, Kuhn M, Jensen LJ, von Mering C, Bork P. 2014. EggNOG v4.0: nested orthology inference across 3686 organisms. Nucleic Acids Res 42:D231–D239. 10.1093/nar/gkt1253. [Europe PMC free article] [Abstract] [CrossRef] [Google Scholar]

Articles from mSphere are provided here courtesy of American Society for Microbiology (ASM)

Full text links

Read article at publisher's site: https://doi.org/10.1128/msphere.00036-16

Read article for free, from open access legal sources, via Unpaywall: https://msphere.asm.org/content/msph/1/3/e00036-16.full.pdf

Citations & impact

Impact metrics

Citations

Jump to Citations

Data citations

Jump to Data

Citations of article over time

Alternative metrics

Altmetric item for https://www.altmetric.com/details/7352621

Altmetric
Discover the attention surrounding your research
https://www.altmetric.com/details/7352621

Smart citations by scite.ai
Explore citation contexts and check if this article has been supported or disputed.
https://scite.ai/reports/10.1128/msphere.00036-16

Supporting

Mentioning

Contrasting

Article citations

Strengthening Bordetella pertussis genomic surveillance by direct sequencing of residual positive specimens.
Peng Y, Williams MM, Xiaoli L, Simon A, Fueston H, Tondella ML, Weigand MR
J Clin Microbiol, 62(4):e0165323, 06 Mar 2024
Cited by: 0 articles | PMID: 38445858 | PMCID: PMC11005353
Free full text in Europe PMC
Whole-genome comparison of two same-genotype macrolide-resistant Bordetella pertussis isolates collected in Japan.
Koide K, Uchitani Y, Yamaguchi T, Otsuka N, Goto M, Kenri T, Kamachi K
PLoS One, 19(2):e0298147, 15 Feb 2024
Cited by: 0 articles | PMID: 38359004 | PMCID: PMC10868825
This article is in the Europe PMC Open access subset. Refer to the copyright information in the article for licensing details.
Free full text in Europe PMC
Genomic characterization of Bordetella pertussis in South Africa, 2015-2019.
Moosa F, du Plessis M, Weigand MR, Peng Y, Mogale D, de Gouveia L, Nunes MC, Madhi SA, Zar HJ, Reubenson G, Ismail A, Tondella ML, Cohen C, Walaza S, von Gottberg A, Wolter N
Microb Genom, 9(12), 01 Dec 2023
Cited by: 1 article | PMID: 38117675 | PMCID: PMC10763497
This article is in the Europe PMC Open access subset. Refer to the copyright information in the article for licensing details.
Free full text in Europe PMC
Whole-Genome Analysis of Bordetella pertussis MT27 Isolates from School-Associated Outbreaks: Single-Nucleotide Polymorphism Diversity and Threshold of the Outbreak Strains.
Kamachi K, Koide K, Otsuka N, Goto M, Kenri T
Microbiol Spectr, 11(3):e0406522, 16 May 2023
Cited by: 2 articles | PMID: 37191540 | PMCID: PMC10269452
This article is in the Europe PMC Open access subset. Refer to the copyright information in the article for licensing details.
Free full text in Europe PMC
Genomic structural plasticity of rodent-associated Bartonella in nature.
de Sousa KCM, Gutiérrez R, Yahalomi D, Shalit T, Markus B, Nachum-Biala Y, Hawlena H, Marcos-Hadad E, Hazkani-Covo E, de Rezende Neves HH, Covo S, Harrus S
Mol Ecol, 31(14):3784-3797, 10 Jun 2022
Cited by: 2 articles | PMID: 35620948 | PMCID: PMC9540758
This article is in the Europe PMC Open access subset. Refer to the copyright information in the article for licensing details.
Free full text in Europe PMC

Go to all (34) article citations

Data

Data behind the article

This data has been text mined from the article, or deposited into data resources.

BioStudies: supplemental material and supporting data

http://www.ebi.ac.uk/biostudies/studies/S-EPMC4888882?xr=true

BioProject

(1 citation) BioProject - PRJNA266616

Nucleotide Sequences (Showing 9 of 9)

(1 citation) ENA - CP009752
(1 citation) ENA - CP010961
(1 citation) ENA - CP010266
(1 citation) ENA - CP010347
(1 citation) ENA - CP010964
(1 citation) ENA - CP010249
(1 citation) ENA - CP010838
(1 citation) ENA - CP010847
(1 citation) ENA - CP009751

Show less

RefSeq - NCBI Reference Sequence Database

(1 citation) RefSeq - NC_017223

Data that cites the article

This data has been provided by curated databases and other sources that have cited the article.

Nucleotide Sequences (Showing 5 of 3662)

Bordetella pertussis heat-shock protein(ENA - AMT65406)
Bordetella pertussis preprotein translocase subunit SecD(ENA - AMT65410)
Bordetella pertussis magnesium transporter(ENA - AMT65405)
Bordetella pertussis chemotaxis protein CheZ(ENA - AMT65403)
Bordetella pertussis preprotein translocase subunit SecF(ENA - AMT65409)

Go to all (3662) records in ENA

Search life-sciences literature (45,103,589 articles, preprints and more)

Genome Structural Diversity among 31 Bordetella pertussis Isolates from Two Recent U.S. Whooping Cough Statewide Epidemics.

Author information

Affiliations

Authors

Authors

Authors

ORCIDs linked to this article

Abstract

Free full text

Genome Structural Diversity among 31 Bordetella pertussis Isolates from Two Recent U.S. Whooping Cough Statewide Epidemics

Katherine E. Bowden

Michael R. Weigand

Yanhui Peng

Pamela K. Cassiday

Scott Sammons

Kristen Knipe

Lori A. Rowe

Vladimir Loparev

Mili Sheth

Keeley Weening

M. Lucia Tondella

Margaret M. Williams

Associated Data

ABSTRACT

INTRODUCTION

RESULTS

Reference-free assembly of complete genomes.

TABLE 1

Data Set S1

Large-scale genome rearrangements.

Figure S1

Rearrangement boundaries.

TABLE 2

Virulence gene comparisons.

TABLE 3

Data Set S2

Variant analysis.

TABLE 4

Data Set S3

Data Set S4

Data Set S5

Data Set S6

Data Set S7

Data Set S8

Data Set S9

DISCUSSION

MATERIALS AND METHODS

Bacterial strains.

Genomic DNA preparation.

Genome sequencing and assembly.

Optical mapping.

Whole-genome alignment.

Virulence gene and variant analysis.

Nucleotide sequence accession numbers.

ACKNOWLEDGMENTS

Funding Statement

REFERENCES

Full text links

Citations & impact

Impact metrics

Citations of article over time

Alternative metrics

Article citations

Data

Data behind the article

BioStudies: supplemental material and supporting data

BioProject

Nucleotide Sequences (Showing 9 of 9)

RefSeq - NCBI Reference Sequence Database

Data that cites the article

Nucleotide Sequences (Showing 5 of 3662)

Similar Articles

Partnerships & funding