Population Structure, Antibiotic Resistance, and Uropathogenicity of Klebsiella variicola

Infections caused by antibiotic-resistant bacterial pathogens are a growing public health threat. Understanding of pathogen relatedness and biology is imperative for tracking outbreaks and developing therapeutics. Here, we detail the phylogenetic structure of 145 K. variicola genomes from different continents. Our results have important clinical ramifications as high-risk antibiotic resistance genes are present in K. variicola genomes from a variety of geographic locations and as we demonstrate that K. variicola clinical isolates can establish higher bladder titers than K. pneumoniae. Differential presence of these pilus genes inK. variicola isolates may indicate adaption for specific environmental niches. Therefore, due to the potential of multidrug resistance and pathogenic efficacy, identification of K. variicola and K. pneumoniae to a species level should be performed to optimally improve patient outcomes during infection. This work provides a foundation for our improved understanding of K. variicola biology and pathogenesis.

variicola indicated that two isolates, KvMX2 (FLLH01.1) and YH43 (GCF_001548315.1), are more closely related to one another than to the remainder (143/145) of the K. variicola genomes. Given that K. quasipneumoniae can be differentiated into two subspecies based on ANI with the BLAST method (ANIb), we used the JSpecies ANIb program to specifically compare KvMX2 and YH43 with K. pneumoniae ATCC BAA-1705, K. quasipneumoniae ATCC 7000603, and 3 other K. variicola genomes (10). KvMX2 and Yh43 have 98.02% ANIb with one another but an average of 96.67%, 96.65%, and 96.68% ANIb with WUSM_KV_53, WUSM_KV_15, and K. variicola At-22, respectively (Table S1). Consistent with our pyANI ANIm result, none of the K. variicola strains had Ͼ95% ANIb with K. pneumoniae ATCC BAA-1705 or K. quasipneumoniae ATCC 7000603. These data suggest that MALDI-TOF MS or yggE PCR-RFLP may be effective means to differentiate K. variicola from K. pneumoniae in the absence of WGS.
K. variicola population structure has 2 lineages and 26 clusters in the second lineage. Core-genome alignment of the 1,262 genes at 90% identity shared by strains in all Klebsiella species and a Kluyvera georgiana outgroup shows that the K. variicola isolates are in a cluster with K. pneumoniae, K. quasipneumoniae, and the newly described K. quasivariicola (11) (Table S2; Fig. S1). Core-genome alignment of the 3,430 core genes at 95% nucleotide identity for the entire gene length by all 145 K. variicola genomes indicates that KvMX2 and Yh43 are distantly related to the other 143 genomes ( Fig. 2a; Table S2). These other genomes form a star-like phylogeny showing deep-branching clusters radiating from the center of the tree. FastGear, which uses hierBAPS to identify lineages and then searches for recombination between lineages, supported the differentiation of KvMX2 and Yh43 into a separate lineage from the other genomes and identified 6 instances of recombination between these two lineages (Table S3) (12,13).
Phylogenomic network analysis and quantification of recombination from parSNP showed minimal recombination within the 143 K. variicola lineage 2 genomes, with approximately 1.62% of the K. variicola genome believed to be recombinant ( Fig. S2a; Table S3) (14). The Nearest Neighbor network of the 3,496 genes shared by the lineage 2 genomes and a recombination-free phylogenetic tree of the 143 genomes from  parSNP showed many deep-branching clades with a star-like phylogeny ( Fig. 2b; Table S2). This tree topology was similar with and without recombination, which suggests that K. variicola lineages emerged early from a single common ancestor into equally distant clades across different environments (Fig. S2b). Quantitative clustering of the 143 genomes in the second lineage with ClusterPicker showed that 56.6% (81/143) of genomes fall into 26 clusters, with 57.7% (15/26) of the clusters containing more than 2 genomes (Fig. 2b) (15). Only 46.2% (12/26) of clusters contain isolates from both WUSM_KV and NCBI. The largest clusters, 24 and 21, each contain 7 genomes. Cluster 21 contained WUSM_KV_10 and 6 genomes from an analysis of patient isolates at an intensive care unit in Seattle, WA (USA). Although they were in the same cluster, WUSM_KV_10 differed from these isolates at 1,882 sites across the 4,867 genes shared at 95% identity (Table S2 and Table S4).
To better understand the context of the 4 groups in lineage 2, we aligned the 2,932 genes shared among the 145 K. variicola genomes, Klebsiella (formerly Enterobacter) aerogenes KCTC 2190, K. quasipneumoniae ATCC 700603, and K. pneumoniae ATCC BAA-1705 at Ն90% identity to create a dendrogram ( Fig. 2c; Table S2). This method preserved the conservation of the lineage 2 groups but showed a different order. The only discrepancy observed is that, in the lineage 2 phylogenetic tree, cluster 3 appeared to be in the A group; however, both 521_SSON and 524_SBOY are more similar to C group genomes in the dendrogram. This incongruence is consistent with cluster 3 radiating away from cluster 4 near the center of the phylogenetic tree (Fig. 2b). Addition of metadata onto the dendrogram showed that the K. variicola cohort spans most geographic locations, with the notable exception of Africa and Oceania (Fig. 2c). The K. variicola genomes showed a remarkable level of source diversity, with representative isolates from animals (n ϭ 4), fungi (n ϭ 2), plants (n ϭ 7), water (n ϭ 3), and industrial waste (n ϭ 6). However, as a testament to the pathogenic potential of K. variicola, 79.5% (114/145) of genomes came from sites associated with humans. Of the human-associated sites, 40.4% (46/114) came from urine and 19.2% (22/114) came from blood (Fig. 2c). We did not observe any apparent association with geography, habitat, or infection site for any of the K. variicola clades. Sixty-seven of 145 isolates had a sequence type (ST) identified using the K. pneumoniae multilocus sequence type scheme (Table S5). Consistent with the distance between lineages, 44 different STs were identified. ST1562 and ST641 had the highest number of isolates (n ϭ 4). In summary, these data demonstrate that K. variicola has a diverse population structure and can be found in a variety of environmental and host niches.
We used the K. pneumoniae BIGSdb database (https://bigsdb.pasteur.fr/klebsiella/ klebsiella.html) and BLASTN to identify canonical Klebsiella VGs in the K. variicola strains ( Fig. 3b; Table S5). In contrast to ARGs, previously characterized Klebsiella VGs were found only sporadically in the K. variicola cohort. Interestingly, the all allantoin utilization operon and arc, fdrA, gcl, glxKR, hyi, and ybbWY genes were found in the distantly related YH43 genome as well as the closely related BIDMC90, k385, and WUSM_KV_03 genomes. irp12 and the ybt operon were found together in the three isolates 50878013, MGH 20, and WUSM_KV_10. irp1 was found on 3 additional instances but with no other VGs. Among 8 isolates containing the full all or ybt operon, six had only 3 ARGs; however, 50878013 contains the ybt operon and irp12 and has 5 ARGs, including the bla OXA-48 carbapenemases, while k385 had 17 ARGs but no carbapenemases.
We used the Enterobacteriaceae PlasmidFinder database to identify characterized plasmid replicons in the K. variicola genomes (17). Twenty-nine unique replicons were identified in 11 groups, but 41% (12/29) of replicons were in the IncF group. A single IncF replicon also had the greatest conservation across K. variicola genomes, as 57.9% (84/145) of genomes contained the IncF(K) replicon (GenBank accession no. JN233704). We found a significant association between isolates that harbored greater than the median number of ARGs and greater than the median number of plasmid replicons using the chi-square test (P Ͻ 0.00001) (Table S5).
WUSM K. variicola cohort strains are susceptible to most antibiotics. We constructed a network diagram of ARGs and isolates to identify connectivity within the K. variicola strains from our cohort (Fig. 4a). WUSM_KV_55 had twice as many ARGs (n ϭ 12) as the next closest isolate, WUSM_KV_26 (n ϭ 6). Most notably, WUSM_KV_55 contained the carbapenemase gene bla KPC-2 . In addition to the core ␤-lactamase bla LEN-2 , this isolate also contained a bla CTX-M-14 gene. Redundancy was again observed for the ARGs against aminoglycosides and sulfonamides, as WUSM_KV_55 contained aac(6=)lb-cr, aadA16, sul1, and sul2. Within our cohort, this isolate was the only isolate found to harbor additional quinolone (qnrB6), rifampin (arr-3), and amphenicol (floR) ARGs. Interestingly, it possesses oqxB but not oqxA. Conversely, WUSM_KV_35 harbored the lowest number of acquired ARGs, as it lacked oqxAB but carried bla LEN-24 .
We used Kirby-Bauer disk diffusion to quantify phenotypic resistance of the WUSM K. variicola strains to several clinically relevant antibiotics (Fig. 4b). Klebsiella species are generally considered intrinsically resistant to ampicillin due to a conserved ␤-lactamase gene. In our cohort, 3/55 isolates were unexpectedly susceptible to ampicillin while the rest were resistant. Despite phenotypic sensitivity to ampicillin, the genomes for WUSM_KV_25, WUSM_KV_34, and WUSM_KV_82 encode bla LEN-24 , bla LEN22 , and bla LEN-16 , respectively. These bla LEN alleles were also found in isolates intermediate and resistant to ampicillin. As expected, WUSM_KV_55 was the only isolate resistant to both meropenem and ceftazidime, presumably due to carriage of bla KPC-2 . Additionally, it was the only isolate intermediate to ciprofloxacin. Four isolates were resistant to trimethoprim-sulfamethoxazole, but only WUSM_KV_50 and WUSM_KV_55 had identified ARGs that would explain this phenotype.
Changes in fim operon are associated with uropathogenicity in a murine UTI model. Given that 70% (39/56) of K. variicola strains from our cohort were isolated from the human urinary tract, we wanted to assess uropathogenicity in a diverse subset of these isolates. We transurethrally inoculated C3H/HeN mice with 10 7 CFU/ml of 5 individual K. variicola strains, or the model uropathogenic K. pneumoniae TOP52 strain, for comparison ( Fig. 5a) (3,18,19). Similarly to previously published infections with K. pneumoniae TOP52, the K. variicola strains exhibited large variations in bacterial CFU recovered from the bladder at 24 h postinfection (hpi). Compared to TOP52, WUSM_KV_39 was the only isolate with a significantly increased bladder burden (P ϭ 0.0094). Bacterial loads of WUSM_KV_10 and WUSM_KV_39 were both significantly higher than WUSM_KV_09 and WUSM_KV_14 (Fig. 5a). Despite this variability among  bladder CFU results, the results of kidney titer determinations at 24 hpi were not significantly different among strains by ANOVA (P ϭ 0.1270). As observed in the bladder, however, WUSM_KV_10 and WUSM_KV_39 achieved significantly higher kidney CFU than WUSM_KV_14.
Given the variation in bladder burden, we wanted to assess if differences in uropathogenicity could be related to expression of type 1 pili, a key virulence factor for UTI encoded by the fim operon (19,20). In K. pneumoniae and Escherichia coli, expression of type 1 pili is controlled by a region of invertible DNA (fimS site) (20,21). Orientation of the fimS site in the "ON" position enables production of type1 pili and increased urovirulence. Under identical growth conditions, WUSM_KV_39 had a higher population with the fimS promoter region in the "ON" orientation than the other strains tested (Fig. 5b). Furthermore, consistent with its success in the bladder, WUSM_KV_39 was found to produce the greatest amount of FimA (the main structural component of type 1 pili), as measured by immunoblotting (Fig. 5c). WUSM_KV_03, WUSM_KV_09, and WUSM_KV_39 all produced significantly more FimA than K. pneumoniae TOP52. Interestingly, WUSM_KV_14 did not produce appreciable levels of FimA by this assay (Fig. 5c).
As we discovered significant variability in type 1 piliation, we specifically investigated changes in fim operon sequence between these isolates by viewing the Prokka coding sequence annotation in Easyfig and Jalview (Fig. 5d) (22,23). We found that WUSM_KV_14 had a predicted truncated FimD usher sequence. A guanine-to-adenine single nucleotide polymorphism (SNP) in the fimD gene changed a predicted tryptophan residue into a premature stop codon, likely explaining the observed lack of production of type 1 pili. Additionally, in WUSM_KV_39, Prokka annotated a hypothetical protein in the intergenic region between fimB and fimE and included a gap replacing a thymine and a guanine-to-adenine SNP. The altered fimB/fimE intergenic region in WUSM_KV_39 may play a role in its increased expression of type 1 pili. Together, these data demonstrate that variation exists among K. variicola genomes that may account for differential urinary tract niche proclivity among isolates.
K. variicola contains both conserved and novel usher genes. The fim operon is one of the best-characterized chaperone-usher pathways (CUPs); given the observed importance of the fim operon in K. variicola uropathogenicity, we searched the pangenome of our K. variicola cohort to identify the complete repertoire of CUP operons (24). Seventeen unique usher sequences at 95% identity were identified across the 55 WUSM K. variicola genomes, and an amino acid sequence alignment showed that they were distributed in 5 Nuccio and Baumler (25) clades ( Fig. 6a; Table S6). From this analysis, we discovered 9 new usher genes previously undescribed in Klebsiella, which we name kva through kvi (Table S6). KviA and KveB usher sequences were found to cluster within the pi () clade, making them the first description of a P-pilus apparatus in Klebsiella. The recently named ␥* subclade contained the greatest amount (7/17) of K. variicola usher sequences; 5 of these 7 were previously reported in K. pneumoniae, while KvcC and KvdB are first reported here.
FimD and the usher sequences for KpaC, KvaB, KpeC, and KpjC were present in all 55 WUSM K. variicola isolates (Fig. 6b). KvgC, KvhC, KviA, and KpcC were each found in only one isolate. KpgC, MrkC, KvbC, KpbC, KvcC, KveB, KvfC, and KvdB can be considered accessory usher sequences in this cohort, as they were absent in certain strains. The most notable pattern evident from the hierarchical clustering of the presence/ absence for all usher genes in our K. variicola cohort is that isolates WUSM_KV_10 through WUSM_KV_21 all carry the KvdB sequence but not KpbC.
Eight of the 9 newly described usher sequences had highest BLASTP hits of Ն99% identity across the entire length of the gene against the nonredundant protein sequences database in April 2018, and all of them were previously annotated as being found in Enterobacteriaceae, Klebsiella, or K. variicola (Table S6). All of the usher genes except kvi were in operons that included a chaperone, at least one subunit, and a putative adhesin (Table S6). KvhC, the usher protein with the lowest BLASTP identity value, had 76% identity to several genes from Enterobacter species (Fig. S3a). The contig with the kvh operon also contained several genes that had possible roles in prophage integration and transposase activity (Fig. S3b). Our results indicate that K. variicola strains harbor a diverse set of usher genes, which may augment K. variicola fitness across a variety of environmental niches, and these operons may be acquired from other Enterobacteriaceae.

DISCUSSION
A previous phylogenomic study used split-network analysis to demonstrate that the K. variicola phylogroup (formerly KPIII) is distinct from K. pneumoniae (KPI) and K. quasipneumoniae (KPII) (26). As an orthogonal method, we used ANI software, the gold standard for in silico species delineation, to recreate this differentiation of phylogroups as separate species (8). Historically, differentiation between K. pneumoniae and K. variicola has been difficult, as evidenced by misannotation of K. variicola as K. pneu- moniae in public genome sequence databases (Fig. 1). These misannotated K. variicola strains came from a variety of geographic regions and were not exclusive to any cluster. Within our sequenced cohort, differentiation of K. variicola from K. pneumoniae and K. quasipneumoniae using MALDI-TOF MS and yggE PCR-RFLP was supported by ANI. This indicates that yggE PCR-RFLP (3) would be a feasible alternative for clinical labs across the globe lacking access to MALDI-TOF MS or WGS. Additionally, hierarchical clustering of the ANI values and core-genome phylogeny demonstrated that 2 K. variicola genomes were distinctly separate from the other 143 in our cohort. ANIb values between these genomes and the other K. variicola genomes were ϳ96%, similar to what was observed for K. quasipneumoniae. The differences in ANIb values contributed to the delineation of K. quasipneumoniae into two subspecies, Klebsiella quasipneumoniae subsp. quasipneumoniae and Klebsiella quasipneumoniae subsp. similipneumoniae (27). Further phenotypic comparisons, including the sole carbon source utilization used for differentiation of the K. quasipneumoniae subspecies, between KvMX2/ Yh43 and other K. variicola isolates is required to unequivocally qualify these as separate subspecies (27).
Numerous studies have shown that K. pneumoniae has a deep-branching phylogenetic structure with minimal recombination occurring within K. pneumoniae strains and between K. pneumoniae and K. variicola/K. quasipneumoniae (26,28). Importantly, though, large-scale recombination events may be clinically relevant, as evidenced by research on the origin of the frequently carbapenem-resistant ST258 lineage (29,30). Our results demonstrate that like K. pneumoniae, K. variicola shows minimal recombination within its genome, and its population structure is composed of numerous clades in a star-like phylogeny. A star-like population structure with deep-branching relationships between isolates (n ϭ 29 and n ϭ 28) was also found in two previously published K. variicola phylogenetic trees (2,31).
Similarly to our work, a previous investigation did not identify any geographic distinction when genomes from within the United States were compared to those from outside the United States (2). The 6 genomes in cluster 21 with WUSM_KV_10 were from ICU patient samples in Seattle, WA, which provides the first evidence of clonal groups responsible for K. variicola infections in some settings (32). Although they were closely related compared against all K. variicola genomes, there were still 1,882 SNPs between WUSM_KV_10 and the other 6 genomes. Interestingly, clusters were not restricted to human infections, as cluster 24 contains 3 genomes from bovine mastitis (NL49, NL58, and NL58) and hospital isolates (VRCO0246, VRCO00242, VRCO00244, and VRCO00243) (https://www.ncbi.nlm.nih.gov/bioproject/361595) (33).
As expected for K. variicola, bla LEN ␤-lactamases were the most conserved ARGs. A previous report unexpectedly found a K. variicola isolate that harbored the bla OKP gene commonly found in K. quasipneumoniae; however, we did not identify such instances within our cohort (2). Although chromosomally carried in K. pneumoniae, fosA was identified in only 1/145 of the K. variicola genomes (34,35). Additionally, as previously found in K. pneumoniae clinical isolate cohorts, we found oqxAB efflux pump genes widespread across K. variicola genomes (36)(37)(38). Although these genes may be ubiquitous in K. variicola, 0 of 55 isolates we tested had resistance to ciprofloxacin; the single example with intermediate susceptibility carried a qnrB6 gene. This is not atypical for Enterobacteriaceae possessing oqxAB, as one study found 100% prevalence of oxqAB in K. pneumoniae but no quinolone resistance (37). It is possible that for K. variicola, similarly to K. pneumoniae, high expression of oqxAB is essential for phenotypic resistance to quinolones (36). In K. pneumoniae, expansion of clonal groups is associated with carbapenemase carriage (i.e., ST258 and bla KPC ); however, we did not observe any associations between carbapenemase genes and K. variicola clusters. Indeed, only 1.81% (1/55) of K. variicola strains within our institutional cohort had a carbapenemase gene and the regional resistance rate for meropenem between K. pneumoniae and K. variicola in 2017 was similar. bla NDM -positive K. variicola strains have been identified in clinical and environmental samples, but bla KPC -positive genomes came exclusively from clinical sources. KPN1481 (bla NDM-1 ) was annotated as a urine-derived isolate, but GJ1, GJ2, and GJ3 (all bla NDM-9 ) were found in the Gwangju tributary in South Korea (2,39). In contrast, WUSM_KV_55 (bla KPC-2 ) was isolated from bronchoalveolar lavage fluid, KP007 (bla KPC-2 ) from an intra-abdominal site, and 223/14 (bla KPC-6 ) from a laparotomy wound (40,41). IncF plasmids, the most abundant replicon identified in the K. variicola cohort, are known carriers of antibiotic resistance genes, including bla CTX-M and bla OXA ␤-lactamases (42). Consistent with their widespread identification in K. variicola, IncF plasmids are frequently found in K. pneumoniae and E. coli (43,44).
K. pneumoniae is the second leading cause of urinary tract infections (45). Given previous misclassification of K. variicola as K. pneumoniae and the high frequency at which K. variicola was isolated from the urinary tract, we were interested in comparing the uropathogenicity of our K. variicola isolates to the well-studied model K. pneumoniae TOP52 isolate (3,18,19). We identified strain-dependent virulence capacity, with UTIs from WUSM_KV_39 yielding statistically significant higher bladder CFU than K. pneumoniae TOP52. Quantification of metrics used to study uropathogenicity in E. coli and K. pneumoniae show increased fimS in the "ON" orientation and increased FimA production by WUSM_KV_39; these findings provide a plausible explanation for why WUSM_KV_39 performed better than K. pneumoniae TOP52 and all WUSM_KV isolates excluding WUSM_KV_10 (46). While we do not yet understand the role of the putative protein identified between recombinases fimB and fimE in WUSM_KV_39, one could postulate that this difference may affect fimbrial expression. Additionally, the poorest performer in the urinary tract, WUSM_KV_14, encodes a mutation resulting in a truncated fimD usher sequence which likely explains its lack of FimA production. As with other bacterial pathogens, it is likely that specific virulence factors are required for K. variicola competency in distinct body niches (47,48). Further work is therefore warranted to test if yersiniabactin and allantoin utilization promote lung and liver infections, respectively, in K. variicola as they do in K. pneumoniae (49)(50)(51)(52).
K. variicola carries usher genes previously identified in K. pneumoniae and 9 novel ushers (53). Interestingly, KveB and KviA are the first report of usher proteins in Klebsiella. The best-studied operon, pap in E. coli, is a major contributor to pyelonephritis as the PapG adhesin can bind Gal-␣-(1-4)-Gal exposed on human kidney cells (54). Other usher genes have been shown to be essential for biofilm formation, plant cell adhesion, and murine gut colonization, further demonstrating their role in niche differentiation (53). Clustering of the presence/absence of these ushers showed the absence of KpbC but presence of KvdB in 11 of the WUSM_KV genomes, a phenomenon similar to that observed for UshC and YraJ in E. coli (55). All 4 of these usher types were found in the ␥* clade, suggesting an exclusionary form of functional redundancy between usher genes (55). Usher genes and CUP operons are frequently exchanged horizontally between Enterobacteriaceae genera (55). Indeed, we have found that the KvhC usher protein has only 76% amino acid identity to any existing proteins in the nonredundant protein sequence database and that the kvh operon is situated next to multiple prophage-and transposase-associated genes.
In this investigation, we used phenotypic and genomic analyses to better understand the diversity of K. variicola genomes, both from our institution and across the globe (using publicly available NCBI genomes). Then, we assessed the functional consequences of ARGs and VGs toward antibiotic resistance and uropathogenicity. One limitation of our study is that our mouse infections and phenotypic analyses are performed with nonisogenic strains. If existing genetic modification systems in K. pneumoniae are shown to be useful for gene knockouts in K. variicola, further work can be performed to mechanistically validate our findings. An additional limitation is that ϳ30 genomes of K. variicola have been uploaded to NCBI since we initiated our comparison. These may further elucidate differences in population structure, although even with almost 300 genomes, one study indicates that K. pneumoniae diversity remains undersampled (26).
Our work represents the first large-scale genomic analysis of K. variicola across multiple institutions and the first use of a murine model to study K. variicola pathogenesis. We unequivocally show that whole-genome comparisons can separate K. variicola from K. pneumoniae and offer convenient alternative methods for laboratories without access to WGS to differentiate these species. Importantly, we demonstrate that high-risk ARGs and VGs are present in K. variicola genomes from a variety of geographic locations. This may have clinical ramifications, as we demonstrate that some K. variicola clinical isolates can be superior uropathogens compared to K. pneumoniae. Similarly to E. coli and K. pneumoniae, the diversity of CUP operons in these isolates could complement additional acquired virulence genes and enable infection of specific niches. Therefore, it is imperative that K. variicola and K. pneumoniae continue to be differentiated in the clinical laboratory, so that we may apply data on differential gene repertoire, clinical behavior, and niche specificity to the goal of ultimately improving patient outcomes.
The remaining 58 isolates were randomly selected from a banked collection of K. pneumoniae strains historically recovered from clinical specimens (29 from urine, 25 from blood, and 1 each from abdominal wound, tracheal aspirate, bronchial washing, and bile). Each of these isolates underwent Bruker MALDI-TOF MS and yggE PCR-RFLP to confirm their identification. Five percent (3/58) were confirmed as K. variicola using MALDI-TOF MS and the yggE PCR-RFLP assay.
Illumina whole-genome sequencing and publicly available Klebsiella genomes. Pure frozen stocks of the presumptive 113 Klebsiella isolates were plated on blood agar to isolate single colonies. Approximately 10 colonies were suspended using a sterile cotton swab into water, and total genomic DNA was extracted using the Bacteremia kit (Qiagen). An 0.5-ng amount of DNA was used as input for sequencing libraries using the Nextera kit (Illumina) (56). Libraries were pooled and sequenced on an Illumina NextSeq 2500 high-output system to obtain ϳ2.5 million 2 ϫ 150-bp reads. Demultiplexed reads had Illumina adapters removed with Trimmomatic v.36 and decontaminated with DeconSeq v0.4.3 (57,58). Draft genomes were assembled with SPAdes v3.11.0, and the scaffolds.fasta files were used as input for QUAST v 4.5 to measure the efficacy of assembly (see Table S1 in the supplemental material) (59,60). All contigs of Ն500 bp in length were annotated for open reading frames with Prokka v1.12 (61).
To increase the number of genomes for downstream analysis, 50 K. variicola genomes were obtained from NCBI genomes (https://www.ncbi.nlm.nih.gov/genome/) in September 2017 (Table S1). Additionally, as it is possible that previously sequenced K. variicola may be incorrectly described as K. pneumoniae, we submitted the complete genome of the K. variicola reference strain At-22 to NCBI BLASTN against the nonredundant nucleotide collection and the whole-genome shotgun sequence databases using default settings in September 2017. Using this method, we obtained 41 genomes of K. pneumoniae with the minimum observed query length of 38% at 99% identity (Table S1). Given that the cohort of genomes analyzed in our study includes isolates initially misannotated, we refer to them as either the NCBI genome or assembly (https://www.ncbi.nlm.nih.gov/assembly) accession key. Sequenced and acquired isolates were analyzed using a variety of computational programs (Text S1). In silico sequence typing was performed using mlst v2.11 (https://github.com/tseemann/mlst) and the BIGSdb database (https:// bigsdb.pasteur.fr/klebsiella/klebsiella.html).
Antimicrobial susceptibility testing. K. variicola isolates underwent antimicrobial susceptibility testing per laboratory standard operating procedures using Kirby-Bauer disk diffusion on Mueller-Hinton agar (BD BBL Mueller-Hinton II agar), in accordance with Clinical and Laboratory Standards Institute (CLSI) standards. Disk diffusion results were interpreted using CLSI Enterobacteriaceae disk diffusion breakpoints (62). Briefly, 4 to 5 colonies from pure isolates were used to create a 0.5 McFarland suspension of the organism in sterile saline. A sterile, nontoxic cotton swab was dipped into the bacterial suspension, and a lawn of the organism was plated to Mueller-Hinton agar. Antimicrobial Kirby-Bauer disks were applied, and the plate was incubated at 35°C in room air for 16 to 24 h. The diameters of the zones of growth inhibition surrounding each antimicrobial disk were recorded in millimeters.
Mouse urinary tract infections. Bacterial strains from our K. variicola cohort and K. pneumoniae TOP52 were used to inoculate 7-to 8-week-old female C3H/HeN mice (Envigo) by transurethral catheterization as previously described (18,19,63). The K. variicola strains were selected to encompass a range of genetically distinct isolates. WUSM_KV_03 and WUSM_KV_10 were specifically chosen as they contain the all and ybt operons, respectively. Static 20-ml cultures were started from freezer stocks, grown in Luria-Bertani (LB) broth at 37°C for 16 h, and centrifuged for 5 min at 8,000 ϫ g, and the resultant pellet was resuspended in phosphate-buffered saline (PBS) and diluted to approximately 4 ϫ 10 8 CFU/ml. Fifty milliliters of this suspension was used to infect each mouse with an inoculum of 2 ϫ 10 7 CFU/ml. Inocula were verified by serial dilution and plating. At 24 hpi, bladders and kidneys were aseptically harvested, homogenized in sterile PBS via Bullet Blender (Next Advance) for 5 min, serially Potter et al. diluted, and plated on LB agar. All animal procedures were approved by the Institutional Animal Care and Use Committee at Washington University School of Medicine. Phase assays. To determine the orientation of the fimS phase switch in Klebsiella, a phase assay was adapted as previously described (20). An 817-bp fragment including fimS was PCR amplified using Taq polymerase (Invitrogen) and the primers 5=-GGGACAGATACGCGTTTGAT-3= and 5=-GGCCTAACTGAACGG TTTGA-3= and then digested with HinfI (New England Biolabs). Digestion products were separated by electrophoresis on a 1% agarose gel. A phase-ON switch yields products of 605 and 212 bp, and a phase-OFF switch yields products of 496 and 321 bp.
Statistics. CFU/bladder and CFU/kidney for both experimental replicates were used as input for ordinary one-way ANOVA to judge significance. Pairwise comparisons of CFU/bladder and CFU/kidney values were performed by using the nonparametric Mann-Whitney U test. Similarly, normalized quantifications of relative FimA amounts (FimA/GroEL) and fimS in "ON" position (fimS "ON"/fimS "OFF") were compared using the Mann-Whitney U test. All P values of Ͻ0.05 were considered significant, and all calculations were performed in GraphPad Prism v7.04.
Accession number(s). The genomes have all been deposited in NCBI under BioProject accession no. PRJNA473122.