Skip to main content
  • ASM
    • Antimicrobial Agents and Chemotherapy
    • Applied and Environmental Microbiology
    • Clinical Microbiology Reviews
    • Clinical and Vaccine Immunology
    • EcoSal Plus
    • Eukaryotic Cell
    • Infection and Immunity
    • Journal of Bacteriology
    • Journal of Clinical Microbiology
    • Journal of Microbiology & Biology Education
    • Journal of Virology
    • mBio
    • Microbiology and Molecular Biology Reviews
    • Microbiology Resource Announcements
    • Microbiology Spectrum
    • Molecular and Cellular Biology
    • mSphere
    • mSystems
  • Log in
  • My alerts
  • My Cart

Main menu

  • Home
  • Articles
    • Latest Articles
    • COVID-19 Special Collection
    • Archive
    • Minireviews
  • Topics
    • Applied and Environmental Science
    • Clinical Science and Epidemiology
    • Ecological and Evolutionary Science
    • Host-Microbe Biology
    • Molecular Biology and Physiology
    • Therapeutics and Prevention
  • For Authors
    • Submit a Manuscript
    • Scope
    • Editorial Policy
    • Submission, Review, & Publication Processes
    • Organization and Format
    • Errata, Author Corrections, Retractions
    • Illustrations and Tables
    • Nomenclature
    • Abbreviations and Conventions
    • Publication Fees
    • Ethics Resources and Policies
  • About the Journal
    • About mBio
    • Editor in Chief
    • Board of Editors
    • AAM Fellows
    • For Reviewers
    • For the Media
    • For Librarians
    • For Advertisers
    • Alerts
    • RSS
    • FAQ
  • ASM
    • Antimicrobial Agents and Chemotherapy
    • Applied and Environmental Microbiology
    • Clinical Microbiology Reviews
    • Clinical and Vaccine Immunology
    • EcoSal Plus
    • Eukaryotic Cell
    • Infection and Immunity
    • Journal of Bacteriology
    • Journal of Clinical Microbiology
    • Journal of Microbiology & Biology Education
    • Journal of Virology
    • mBio
    • Microbiology and Molecular Biology Reviews
    • Microbiology Resource Announcements
    • Microbiology Spectrum
    • Molecular and Cellular Biology
    • mSphere
    • mSystems

User menu

  • Log in
  • My alerts
  • My Cart

Search

  • Advanced search
mBio
publisher-logosite-logo

Advanced Search

  • Home
  • Articles
    • Latest Articles
    • COVID-19 Special Collection
    • Archive
    • Minireviews
  • Topics
    • Applied and Environmental Science
    • Clinical Science and Epidemiology
    • Ecological and Evolutionary Science
    • Host-Microbe Biology
    • Molecular Biology and Physiology
    • Therapeutics and Prevention
  • For Authors
    • Submit a Manuscript
    • Scope
    • Editorial Policy
    • Submission, Review, & Publication Processes
    • Organization and Format
    • Errata, Author Corrections, Retractions
    • Illustrations and Tables
    • Nomenclature
    • Abbreviations and Conventions
    • Publication Fees
    • Ethics Resources and Policies
  • About the Journal
    • About mBio
    • Editor in Chief
    • Board of Editors
    • AAM Fellows
    • For Reviewers
    • For the Media
    • For Librarians
    • For Advertisers
    • Alerts
    • RSS
    • FAQ
Research Article

Bioinformatic Genome Comparisons for Taxonomic and Phylogenetic Assignments Using Aeromonas as a Test Case

Sophie M. Colston, Matthew S. Fullmer, Lidia Beka, Brigitte Lamy, J. Peter Gogarten, Joerg Graf
Edward G. Ruby, Editor
Sophie M. Colston
aDepartment of Molecular and Cell Biology, University of Connecticut, Storrs, Connecticut, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Matthew S. Fullmer
aDepartment of Molecular and Cell Biology, University of Connecticut, Storrs, Connecticut, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Lidia Beka
aDepartment of Molecular and Cell Biology, University of Connecticut, Storrs, Connecticut, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Brigitte Lamy
bLaboratoire de Bactériologie-Virologie, UMR 5119, Equipe Pathogènes et Environnements, Université Montpellier, Montpellier, France
cLaboratoire de Bactériologie, Centre Hospitalier Universitaire de Montpellier, Montpellier, France
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
J. Peter Gogarten
aDepartment of Molecular and Cell Biology, University of Connecticut, Storrs, Connecticut, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Joerg Graf
aDepartment of Molecular and Cell Biology, University of Connecticut, Storrs, Connecticut, USA
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
Edward G. Ruby
University of Wisconsin Madison
Roles: Editor
  • Find this author on Google Scholar
  • Find this author on PubMed
  • Search for this author on this site
DOI: 10.1128/mBio.02136-14
  • Article
  • Figures & Data
  • Info & Metrics
  • PDF
Loading

ABSTRACT

Prokaryotic taxonomy is the underpinning of microbiology, as it provides a framework for the proper identification and naming of organisms. The “gold standard” of bacterial species delineation is the overall genome similarity determined by DNA-DNA hybridization (DDH), a technically rigorous yet sometimes variable method that may produce inconsistent results. Improvements in next-generation sequencing have resulted in an upsurge of bacterial genome sequences and bioinformatic tools that compare genomic data, such as average nucleotide identity (ANI), correlation of tetranucleotide frequencies, and the genome-to-genome distance calculator, or in silico DDH (isDDH). Here, we evaluate ANI and isDDH in combination with phylogenetic studies using Aeromonas, a taxonomically challenging genus with many described species and several strains that were reassigned to different species as a test case. We generated improved, high-quality draft genome sequences for 33 Aeromonas strains and combined them with 23 publicly available genomes. ANI and isDDH distances were determined and compared to phylogenies from multilocus sequence analysis of housekeeping genes, ribosomal proteins, and expanded core genes. The expanded core phylogenetic analysis suggested relationships between distant Aeromonas clades that were inconsistent with studies using fewer genes. ANI values of ≥96% and isDDH values of ≥70% consistently grouped genomes originating from strains of the same species together. Our study confirmed known misidentifications, validated the recent revisions in the nomenclature, and revealed that a number of genomes deposited in GenBank are misnamed. In addition, two strains were identified that may represent novel Aeromonas species.

IMPORTANCE Improvements in DNA sequencing technologies have resulted in the ability to generate large numbers of high-quality draft genomes and led to a dramatic increase in the number of publically available genomes. This has allowed researchers to characterize microorganisms using genome data. Advantages of genome sequence-based classification include data and computing programs that can be readily shared, facilitating the standardization of taxonomic methodology and resolving conflicting identifications by providing greater uniformity in an overall analysis. Using Aeromonas as a test case, we compared and validated different approaches. Based on our analyses, we recommend cutoff values for distance measures for identifying species. Accurate species classification is critical not only to obviate the perpetuation of errors in public databases but also to ensure the validity of inferences made on the relationships among species within a genus and proper identification in clinical and veterinary diagnostic laboratories.

INTRODUCTION

Rapid improvements in DNA sequencing technologies are providing new approaches to address prevailing questions in the field of microbiology (1–3). For example, next-generation sequencing greatly enhanced the discovery of virulence factors through comparative genomics (4), enabled epidemiological studies of recent disease outbreaks (5), led to the discovery of the rare biosphere (6), and provided insights into the physiology of uncultured microbes through metatranscriptomics (7). The increasing amounts of data also brought challenges in ensuring the accuracy of annotations in databases (8). Since many analyses are based on comparisons to known sequences, errors in a database can be easily propagated in other databases and affect subsequent studies. Microbial taxonomy is one area in which the advances in next-generation sequencing have yet to be implemented to their full potential, even though several applications have shown great promise (9, 10). Prokaryotic taxonomy has been traditionally regarded as consisting of three interrelated components: classification, nomenclature, and characterization (11). Only nomenclature is strictly regulated in the International Code of Nomenclature of Bacteria (12). It is important to reconcile nomenclature when rigorous classification and characterization methods reveal an inconsistency in the composition of a particular named species.

The organizing principle of microbial taxonomy is to group related organisms together that are distinct from other groups. DNA-DNA hybridization (DDH) is the traditional “gold standard” of circumscribing a bacterial species, as this method provides an assessment of the overall similarity of the heritable material, with phylogenetic data providing information about neighboring organisms. The current DDH standard for strains to be considered belonging to the same species is that ≥70% of the DNA from the two strains reassociates with a ≤5°C difference in melting temperatures (13). However, laboratory-based DDH measurements are not without challenges, given that DDH values can be difficult to reproduce and therefore may vary, depending on the reannealing temperature used or a laboratory’s particular method employed (14). In addition, the data cannot be archived, nor are they portable between laboratories, and as such the data cannot be readily built upon when describing a new species (15).

In contrast to DDH, DNA sequence information can be easily archived and readily transferred between laboratories. Standardized bioinformatic analyses on the same data set can be performed by different laboratories, which facilitates collaborations and, potentially, the resolution of disagreements (16). Examples of such molecular methods include multilocus sequence analysis (MLSA), which provides important information about the evolutionary relationships of bacteria and allows grouping of related strains (14). MLSA has emerged as a powerful tool for classifying bacterial strains, as it relies on the allelic differences among multiple conserved housekeeping genes (17). In MLSA, the sequences are typically concatenated to overcome the lack of resolution seen in the topology of single-gene trees, but this method may mask the different evolutionary processes underlying the individual genes (18, 19). In addition, there is no consensus as to what degree of sequence variation correlates with species boundaries, which is partly due to different genes evolving at different rates and also that a few selected genes represent only a fraction of the vast amount of information contained within an entire genome.

The field of microbiology is undergoing dramatic changes, with more genomes becoming available due to the rapidly improving technology and declining cost of sequencing. In addition to closed or finished genomes, “improved” high-quality draft genomes for which the annotations have been validated have been deemed suitable for comparative genomic studies (20). The relative ease of producing such genomes provides new opportunities for assessing taxonomic relationships, discovering new taxa, and sharing data between researchers. As a result, new tools are being developed to make use of these data, including a bioinformatic approach for calculating the DDH. One of these, the genome-to-genome distance calculator, referred to here as in silico DDH (isDDH), produces values that compare closely with experimentally derived DDH values (9, 21). Another method calculates the average nucleotide identity (ANI) among conserved and shared genes. The use of ANI has been proposed as a new standard for defining microbial species, and it is gaining wide acceptance (16, 22). The most current proposal recommends use of an ANI threshold of 95 to 96% along with support from tetranucleotide frequency correlation coefficient values (23, 24). Recently, a few studies combined either MLSA or the analysis of genes common to all members of a genus (core genome) with the overall similarity of the genome by using ANI for species identification (15, 25). We wanted to compare isDDH and ANI for species identification combined with phylogenetic approaches, using a genus with a complicated but relatively well-described phylogeny.

The genus Aeromonas makes for an ideal test case, because it contains a large number of species, biovars, and subspecies and its taxonomy has been the subject of much debate (26). Collectively, Aeromonas members are found in a number of habitats and in association with various animals, ranging from beneficial symbionts of leeches and zebrafish to pathogens of amphibians, fish, and humans (26, 27). Fourteen species of Aeromonas were recognized in the latest addition of Bergey’s Manual of Systematic Bacteriology in 2005 (28). Since then, over a dozen have been propose, while the statuses of five species and two subspecies have been called into question. An accurate taxonomy for this genus is not only critical as a tool to differentiate benign from potentially virulent species, but it is also essential as the foundation for ecological studies.

A number of taxonomic controversies exist within the Aeromonas genus, namely, the synonymity of the following groups: the proposed novel species A. culicicola and A. ichthiosmia with A. veronii (29–31), A. enteropelogenes with A. trota (31–34), A. allosaccharophila with A. veronii (30), A. hydrophila subsp. anaerogenes with A. caviae (28, 35), and A. hydrophila subsp. dhakensis with A. aquariorum, which ultimately led to a proposal of a new species, A. dhakensis (36–38), All of these controversies are likely due, at least in part, to the limitations of past and current methods to consistently distinguish to the species level. Some of these controversies (e.g., whether the taxon A. allosaccharophila reaches the species level) could not even be unambiguously clarified with the most recent methods, with several MLSA schemes with partial sequences of up to seven housekeeping genes (33, 34, 39–41). A finding of some of these studies and of a study investigating discrepancies in the analysis of 16S rRNA genes (42, 43) was that recombination occur frequently between members of this genus, which renders phylogenies with single or a few genes challenging.

The use of whole genome sequences has been regarded as a promising avenue for the future of Aeromonas taxonomic and phylogenetic studies (41). In the present study, we generated improved, high-quality draft genome sequences from 27 type strains and 6 additional strains. These genomes were supplemented by 23 additional genomes of Aeromonas strains available in public databases. Our approach was to determine the phylogeny in three ways, by using (i) 16 housekeeping genes that were used in four recent MLSA classifications (HK), (ii) ribosomal protein coding gene (RG), and (iii) the expanded core (EC), which are the genes present in at least 90% of the 56 strains. In addition, we performed ANI analysis and isDDH (9, 16, 21, 22) to determine the overall similarity of the genomes. We examined our data with regard to the above-mentioned taxonomic controversies, as these provided the means to validate our approach. We also investigated the relationships of deeper phylogenetic branches in the Aeromonas genus. This approach led to the identification of candidate novel species and is presented as a methodology that may be applied to other genera as well.

RESULTS

Genome sequences.A total of 56 Aeromonas genomes were used in this study, representing type strains of 29 currently recognized or proposed species, of which 27 were sequenced in-house and 2 were available in GenBank. The additional 23 genomes were non-type strains and auxiliary strains of interest. For seven of the Aeromonas species, multiple strains were used in this study, and strain designations were employed to distinguish among them (A. allosaccharophila, A. caviae, A. dhakensis, A. hydrophila, A. media, A. salmonicida, and A. veronii); for the remainder of the species, only the type strain was used, which is indicated by a superscript T. For the 33 genomes obtained for this study, the average genome coverage ranged from 30- to 260-fold and the number of scaffolds ranged from 22 to 332 with an average of 88 (Table 1). The completeness of the genomes was assessed by screening the genomes for 16 housekeeping genes and 47 ribosomal protein-coding genes. All 63 genes were present in the 56 genomes. The genome sizes estimated from the draft genomes generated for this study ranged from 3.90 Mbp (A. fluvialisT) to 5.18 Mbp (A. piscicolaT), with an average of 4.51 Mbp. The average G+C content of the aeromonads ranged from 58.1% (A. australiensisT) to 62.8% (A. taiwanensisT), with a mean of 60.2%. Based on the quality of the genomes and verification of the automated annotation, we consider these genomes to be improved, high-quality draft genomes (20).

View this table:
  • View inline
  • View popup
TABLE 1 

General features of the Aeromonas genomes

Phylogenetic analysis.One goal of our study was to reevaluate the phylogenetic relationships of the Aeromonas species by using three phylogenies, HK, RP, and EC, derived from different sets of genes: 16 housekeeping genes, 47 ribosomal protein-coding genes, and the expanded core, which included 2,710 ortholog groups (OG), respectively. Due to the differences in the number of informative sites, the EC phylogeny had the strongest support values for all of the nodes, although both the HK and EC phylogenies provided new insights into the relationships of distant clades (Fig. 1). The RP phylogeny had the lowest support values, as these genes are more conserved (see Fig. S1 in the supplemental material). In both the HK and EC phylogenies, we found the same eight major monophyletic groups, or clades, which are defined as groups of taxa in a phylogeny that each share an ancestor, to the exclusion of all other taxa included in the analysis (Fig. 1). Interestingly, we found several differences between the HK and EC phylogenies. In the HK phylogeny, clades 6 and 7 represent shallow branches that are nested within larger groups formed by clades 2 to 7, 3 to 7, and 4 to 7; however, in the EC phylogeny, clade 6 is basal to the large clade containing clades 2 to 5, 7, and 8. Moreover, in the EC phylogeny, clades 2 and 7 form one clade, while clades 3 to 5 form another clade, which is also inconsistent with the HK phylogeny where clade 7 forms a clade with 6 that is nested within a large grouping containing clades 3 to 7. As the expanded core did not require each ortholog group (i.e., homologs that appear to have evolved from the same ancestral gene in the organismal most recent ancestor of the group) to be present in every genome, we repeated the analysis using the strict core with only those ortholog groups that were present in all genomes. The strict core phylogeny was consistent with the EC phylogeny (see Fig. S2 in the supplemental material), indicating that the ortholog groups present in all genomes did not represent variations in the topology observed between the strict versus expanded core.

Figure S1

Maximum likelihood reconstruction using 47 ribosomal proteins. Accessory proteins, such as methyltransferases, were excluded. Support values are percent bootstrap values. Download Figure S1, EPS file, 0.6 MB.
Copyright © 2014 Colston et al.

This is an open-access article distributed under the terms of the Creative Commons Attribution-Noncommercial-ShareAlike 3.0 Unported license, which permits unrestricted noncommercial use, distribution, and reproduction in any medium, provided the original author and source are credited.

Figure S2

Strict core genome phylogeny reconstructed derived from 1,850 ortholog groups present in all 56 taxa based on our approximate maximum likelihood reconstruction. Branch supports are aLRT SH-like support values. The topology of the strict core is not different from that of the 90% relaxed core. This suggests that the differences between the houskeeping 16-gene phylogeny and the relaxed core are not the result of gene transfers occurring in only some of the taxa. Download Figure S2, EPS file, 0.6 MB.
Copyright © 2014 Colston et al.

This is an open-access article distributed under the terms of the Creative Commons Attribution-Noncommercial-ShareAlike 3.0 Unported license, which permits unrestricted noncommercial use, distribution, and reproduction in any medium, provided the original author and source are credited.

FIG 1 
  • Open in new tab
  • Download powerpoint
FIG 1 

(A) Maximum likelihood reconstruction of 16 single-copy housekeeping genes. Support values are represented by dots: red (90%+ bootstraps), orange (80%+), yellow (70%+). (B) Approximate maximum likelihood reconstruction of 2,710 orthologous groups found in 90% or more of the taxa. aLRT SH-like support values equal to or greater than 0.97 are represented by red dots. The species A. veronii, A. hydrophila, A. dhakensis, A. salmonicida, and A. caviae are color-coded in both trees. Additionally, two previously misidentified taxa, A. veronii AMC 34 and A. hydrophila AH4, are shown in red and teal, respectively. Eight well-supported clades were shared between the two reconstructions. They are shown by the colored bars and are numbered 1 through 8.

Most of the general relationships observed in our study were consistent with those reported in the published literature. The recently proposed species, A. dhakensis, which was determined to be synonymous with A. aquariorum (44), was originally a subspecies of A. hydrophila. All three phylogenies support that these strains form one well-supported clade that is distinct from A. hydrophila. Interestingly, six A. hydrophila genomes that we obtained from GenBank clearly clustered within A. dhakensis. Our study also grouped the strain SSU with A. dhakensis, which supports its recent reclassification from A. hydrophila to A. dhakensis (45). Misnamed genomes in GenBank should be corrected and resolved with thorough classification data to prevent further misidentifications.

Our comprehensive analysis revealed an important difference compared to the previous MLSA by Murcia-Martinez et al., which was based on partial sequences of seven genes (34). In that study, the A. trota isolates (which included A. enteropelogenesT) grouped with A. hydrophila and A. aquariorum, whereas in the HK and EC phylogenies of our study, A. enteropelogenesT and A. trotaT formed a clade with a group that included the A. veronii group, or AVG (A. veronii bv. sobria, A. veronii bv. veronii, and A. allosaccharophila), and A. jandaeiT. This finding is in agreement with those of the study by Roger et al. (33). Examination of individual gene trees suggests that the varied placement was due to the use of different housekeeping genes in these two studies (see Fig. S3 to S6 in the supplemental material) and underscores the limitations of MLSA approaches that use shorter fragments of fewer genes, compared to studies using the expanded core or a large set of full-length housekeeping genes. Our study also confirmed the synonymity of A. trota and A. enteropelogenes (31, 32).

The AVG itself is a controversial collection of species, which includes A. culicicolaT and A. ichthiosmiaT, both initially described as new species but subsequently reclassified as A. veronii based on DNA relatedness and biochemical characterization (29–31). Our data support the synonymity of A. culicicolaT and A. ichthiosmiaT with A. veronii, as the two strains grouped together with the A. veronii strains in one well-supported clade (Fig. 1A and B). An interesting aspect of this species is that there are two reported A. veronii biovars, which differ phenotypically in that A. veronii bv. veronii is positive (100%) for esculin hydrolysis and ornithine decarboxylation while A. veronii bv. sobria is negative for both reactions (46). In our analysis, the three strains of A. veronii bv. veronii (CECT 4257T, AMC35, AER397) grouped together with A. veronii B565 in a strongly supported clade within the larger A. veronii clade, which supports A. veronii bv. veronii as a bona fide biovar. Comparisons of the A. veronii genomes revealed that members of A. veronii bv. veronii encode a β-glucosidase (EC 3.2.1.21; 793 aa) and an ornithine decarboxylase (EC 4.1.1.17; 745 aa) not found among members of A. veronii bv. sobria, suggesting that these two enzymes may facilitate the reactions involving esculin and ornithine, respectively. Based on this data, A. veronii B565, whose genome contains both genes, is a presumptive member of the A. veronii bv. veronii. The two A. allosaccharophila strains (CECT 4199T and BVH88) also formed a strongly supported clade that was near but distinct from A. veronii, which suggests that A. allosaccharophila is a separate species. In our analysis, we also included the newest proposed Aeromonas species, A. australiensisT, which is monophyletic with A. fluvialisT and A. sobriaT and the AVG.

The other phylogenetic relationships supported the relationships described in previously published reports, such as the well-supported clade formed by A. simiaeT, A. diversaT, and A. schubertiiT that is distinct from all the other Aeromonas species (Fig. 1) and observed in all three phylogenies. The close relatedness between A. piscicola and A. bestiarum (47) was also recovered in our analyses. Our results also support that strain CECT 4221, described as A. hydrophila subsp. anaerogenes, clusters within the A. caviae taxon.

Assessment of genome similarity using isDDH and ANI.The information gained from the phylogenetic analyses provides an important depiction of the evolutionary relationships of different strains but does not translate directly into the overall similarity of the genomes, which was determined through DDH. We used two different in silico or bioinformatics approaches, isDDH and ANI, that have been proposed to overcome the challenges of conventional laboratory-based DDH to evaluate the genomic similarity of bacteria, and we evaluated the congruence of these methods (Fig. 2) (9, 16, 21, 22).

FIG 2 
  • Open in new tab
  • Download powerpoint
FIG 2 

ANI and isDDH values. The lower triangle displays ANI values, and the upper triangle shows the isDDH values. ANI values are colored according to three historical species cutoff values: 94% (yellow), 95% (orange), and 96%+ (red). The isDDH values displayed are the upper limits of the 95% confidence intervals and are colored red if the met the laboratory DDH species cutoff of 70% hybridization. ANI of 96% correlates well with 70% isDDH values, with only the A. allosaccharophila isolates failing to match (68.7%).

Two excellent examples for validating this approach are A. culicicolaT and A. ichthiosmiaT, which were initially proposed as novel species and later reclassified as A. veronii based in part on DDH values that exceeded 70%. The predicted point estimates of the isDDH values we obtained for these two strains were all slightly below 70% (69.1 to 69.6% and 67.4 to 68.2, respectively) compared to all other named A. veronii strains (see Fig. S7 in the supplemental material). However, when taking into consideration the 95% confidence interval (CI) for every comparison of these two strains, all CIs encompassed the 70% threshold (upper CI borders, 70.6 to 71.8%), affirming that they are indeed A. veronii. While these isDDH values were lower than what we observed for other pairwise A. veronii strain comparisons, the median hybridization value for A. culicicolaT and A. ichthiosmiaT to A. veronii was only 2.2% below that of the A. veronii comparisons (71.6% versus 73.8%). Additionally, both strains also had ANI values at or above the 96% level, compared to the other named A. veronii strains, which supports that A. culicicolaT and A. ichthiosmiaT are part of the A. veronii species, albeit near the periphery. The isDDH and ANI values were consistent with previously published results (29, 30).

Figure S7

Point estimates of in silico DNA-DNA hybridization (isDDH) values without 95% confidence intervals. The values displayed are colored (red) when the laboratory’s DDH species cutoff of 70% hybridization was met. Download Figure S7, EPS file, 11.8 MB.
Copyright © 2014 Colston et al.

This is an open-access article distributed under the terms of the Creative Commons Attribution-Noncommercial-ShareAlike 3.0 Unported license, which permits unrestricted noncommercial use, distribution, and reproduction in any medium, provided the original author and source are credited.

The taxonomic status of A. allosaccharophila has been controversial, and it has been suggested that it is a member of A. veronii (30). The upper borders of the 95% CI for the isDDH values for A. allosaccharophila are below 70% compared to the A. veronii strains. Additionally, the ANI values are all ~94%. These data support the status of A. allosaccharophila as a bona fide species that is closely related to A. veronii. Interestingly, while the HK, RP, and EC phylogenies all grouped the two A. allosaccharophila genomes (CECT 4199T and BVH88) together and separate from A. veronii, the ANI and the upper 95% CI isDDH values between the two A. allosaccharophila genomes were both just under the species cutoff boundary, at 95.8% and 68.7%, respectively. These data suggest that BVH88 may not be a member of the A. allosaccharophila species, but a greater number of strains in this clade will need to be evaluated to clarify their relationships. Two other species, A. fluvialis (ANI, ~92%) and A. australiensis (ANI, ~93%), also group near A. veronii. Their isDDH estimates register ~52% compared to A. veronii.

Another group of species that has recently attracted attention is A. aquariorum, A. hydrophila subsp. dhakensi, and A. hydrophila. The partition of the group comprised of A. aquariorum/A. hydrophila subsp. dhakensis strains from the A. hydrophila group, which includes the type strain (CECT 839), was recovered conclusively by every method we used in our study. The branch lengths of the HK phylogeny between A. dhakensis and A. hydrophila (~0.075 substitutions/site) were similar to those separating many named species in the HK reconstruction, such as those between A. eucrenophilaT and A. tectaT (~1.0 substitutions/site), A. schubertiiT and A. diversaT (~0.09 substitutions/site), A. rivuliT and A. molluscorumT (~0.06 substitutions/site), and A. piscicolaT and A. bestiarumT (~0.04 substitutions/site). Similar relationships were observed in the RP and EC phylogenies. Further evidence comes from the ANI data, which showed only 93% similarity between the two different clades. This is well below the 96% species cutoff recommended by Richter (23). This conclusion was further supported by isDDH data, in which A. dhakensis and A. hydrophila strains all scored below 60% between species when using the upper border of the 95% CI, while within each partition all values were well above 70%. These data confirm that these two clades represent two discrete species rather than constituents of one, as was originally proposed (48).

A. piscicolaT and A. bestiarumT grouped together and formed one clade with A. popoffiiT. The ANI between A. piscicolaT and A. bestiarumT was 95.2%, which is near the 96% suggested species cutoff (23). However, while their isDDH values were higher than most between-species comparisons (61.1% point estimate, 64.4% at the upper 95% CI), they still fell short of what one would expect for members of the same species. It will be important to add more strains of these two groups in future analyses to gain better insight into the relationships between these taxa. Based on the current data, a 96% cutoff for the ANI value seems appropriate for Aeromonas species delineations.

Discovery of novel species.We also included two strains in our analysis that seemed unusual based either on previous studies or preliminary data. AMC 34, a clinical isolate described as A. veronii bv. veronii, had a long branch length and clustered away from other A. veronii bv. veronii strains in a previous study (41). Strain AH4 was published as A. hydrophila by investigators that had obtained this isolate from the water of a storage container for medicinal leeches (49). In the HK phylogeny, AMC 34 clustered well outside the A. veronii clade, near A. jandaei Tand A. fluvialisT, with bootstrap support values in excess of 90% (Fig. 1A). Similarly, the EC phylogeny placed AMC 34 outside of A. veronii with high support (Fig. 1B). The ANI between AMC 34 and the AVG was ~94%, while the isDDH was only ~58% compared to the same taxa (Fig. 2). Taken together, the data strongly support AMC 34 as a new species.

The other strain, AH4, was identified by a clinical diagnostic laboratory as A. hydrophila (49). In all of our phylogenetic analyses, AH4 grouped with A. piscicolaT and A. bestiarumT with high support. This placement and its distance from A. hydrophila were strongly supported by the ANI and isDDH data (Fig. 2). AH4 registered only ~89% to both the A. hydrophila and A. dhakensis groups but much higher values to A. bestiarumT (~94%) and A. piscicolaT (~93%). isDDH also supported the conclusion that AH4 is not likely a member of A. bestiarum (~55%) or A. piscicola (~52%) and is distinct from the A. hydrophila (~38%) and A. dhakensis (37%) groups.

All of our bioinformatics analyses indicated that the strains AMC 34 and AH4 represent two new species; however, we were restricted to a single isolate of each, which precluded the assessment of the variabilities of biochemical tests (see Table S1 in the supplemental material). In addition, we were unable to include one recently published type strain, A. cavernicola CCM7641T (50) or one proposed new species, A. lusitana (34), which has not yet been officially described. Using the available MLSA data, we were able to show that AMC 34 and AH4 did not cluster near these two species and are thus not likely members of either A. cavernicola or A. lusitana (see Fig. S8 in the supplemental material). The accessibility of the genomes published for this study will provide other researchers with the opportunity to determine the probable taxonomic position of candidate novel species, an important capability in light of the number of taxonomic problems described for Aeromonas.

Figure S8

Approximate maximum likelihood reconstruction, including Aeromonas cavernicola MDC 2508T and Aeromonas lusitana MDC 2473T. The data set included only the seven genes (atpD, dnaJ, dnaX, gyrA, gyrB, recA, and rpoD) for which A. cavernicola and A. lusitana have partial CDS available in the NCBI database. Values displayed on the branches are aLRT SH-like support values. Download Figure S8, EPS file, 0.7 MB.
Copyright © 2014 Colston et al.

This is an open-access article distributed under the terms of the Creative Commons Attribution-Noncommercial-ShareAlike 3.0 Unported license, which permits unrestricted noncommercial use, distribution, and reproduction in any medium, provided the original author and source are credited.

Table S1

Biochemical test results for novel Aeromonas species AMC 34 and AH4 Table S1, DOCX file, 0.02 MB.
Copyright © 2014 Colston et al.

This is an open-access article distributed under the terms of the Creative Commons Attribution-Noncommercial-ShareAlike 3.0 Unported license, which permits unrestricted noncommercial use, distribution, and reproduction in any medium, provided the original author and source are credited.

Comparison of phylogenetic and genetic distance measures.The delineation of organisms into taxonomic groups is based on their evolutionary histories and genetic distances. In this study, we utilized five different approaches, of which two were phylogeny independent (isDDH and ANI) and three had a phylogenetic component (HK, RP, and EC phylogenies). To guide subsequent studies, we wanted to evaluate whether these approaches were in agreement with one another and whether some were more informative than others. Even though isDDH and ANI use different algorithms for the calculations, e.g., ANI evaluates the similarity of shared elements between two genomes, while isDDH estimates the overall similarity of two genomes, the results were very consistent (Fig. 3). The r2 value was 0.957 for the entire data set, and when restricted to comparisons of more closely related strains (isDDH of ≥55%), the r2 was 0.996. These values demonstrated that at least for this data set, either method can be used for determining overall genome similarities. When isDDH (upper 95% CI) and ANI results were compared to the P-distance of the entire EC data set, the r2 values were low for both approaches, 0.599 and 0.713, respectively. When the data set was restricted to comparisons of genomes that had at least a similarity of ≥50% based on isDDH, the correlation coefficients were 0.943 and 0.965for isDDH and ANI, respectively (see Fig. S9 in the supplemental material). This indicated that either approach works well at separating closely related genomes but not for determining more distant relationships.

Figure S9

P-distance of the expanded core alignment comparison to isDDH (a) and the percent ANI (b). The pairwise percent similarities of 56 genomes were determined by using either isDDH or ANI and plotted against the P-distance of the expanded core. When we compared the isDDH and ANI results to the P-distance of the entire EC data set, the r2 value was low for both approaches, 0.599 and 0.713, respectively, but when the data set was restricted to comparisons of genomes that had a similarity of ≥50% based on isDDH, the correlation coefficients were 0.943 and 0.965, respectively. Download Figure S9, PDF file, 0.3 MB.
Copyright © 2014 Colston et al.

This is an open-access article distributed under the terms of the Creative Commons Attribution-Noncommercial-ShareAlike 3.0 Unported license, which permits unrestricted noncommercial use, distribution, and reproduction in any medium, provided the original author and source are credited.

FIG 3 
  • Open in new tab
  • Download powerpoint
FIG 3 

Comparison of isDDH and ANI results. The pairwise percent similarities of 56 genomes were determined using either isDDH or ANI. The two approaches revealed a significant correlation, with an r2 of 0.957. When testing samples with an isDDH values of ≥50%, the r2 was 0.9996.

Most researchers characterize strains by analyzing the sequences of only one or two genes. We wanted to ascertain whether there are particular genes that are better suited than others for an initial analysis. One important concern is that horizontal gene transfer of gene fragments and not just entire genes can occur among aeromonads and result in conflicting phylogenies (41). Thus, relying on any one gene can produce erroneous results. On the other hand, including a preponderance of genes that represent a highway of gene sharing in a concatenation may result in phylogenies that reflect neither organismal evolution nor any individual gene history (51). The individual gene trees (see Fig. S3 to 6 in the supplemental material) for the 16 housekeeping genes were compared to the phylogeny derived from the consensus tree using the approximately unbiased (AU) test (52). The set of maximum likelihood (ML) trees generated from bootstrap samples of the MLSA data were significantly different from the best gene tree for each gene. When maximum likelihood trees from bootstrap samples of the 16 housekeeping genes were compared to the MLSA tree, 15 of the gene tree sets were significantly different from the MLSA best tree. Only one of the bootstrap samples for recA had a P value of ≥0.05 (P = 0.93). These results reveal that no individual gene tree properly reflects or is even compatible with the phylogeny of the MLSA tree.

Figure S3

Maximum likelihood reconstruction of the (A) atpD gene A, (B) dnaJ gene, (C) dnaK gene, and (D) dnaX gene. Support values are percent bootstrap values. Download Figure S3, EPS file, 2.1 MB.
Copyright © 2014 Colston et al.

This is an open-access article distributed under the terms of the Creative Commons Attribution-Noncommercial-ShareAlike 3.0 Unported license, which permits unrestricted noncommercial use, distribution, and reproduction in any medium, provided the original author and source are credited.

DISCUSSION

Our polyphasic genome comparison utilizing both phylogenetic and genetic distance metrics was by and large consistent with the current understanding of the phylogenetic relationships of the species contained within the genus Aeromonas, which had been hitherto based on laboratory-determined DDH values, biochemical tests, and multilocus sequence typing. Importantly, we were able to gain new insights into the overall relationships of the Aeromonas species with the phylogeny generated from the expanded core and the HK genes. There were eight major clades from the EC that were largely consistent with the HK phylogeny (Fig. 1). One major difference between the two phylogenies was the placement of A. salmonicida (clade 7) and A. hydrophila and A. dhakensis (clade 2). In the EC phylogeny, they form one strongly supported clade, but in the HK phylogeny they are separated by two well-supported nodes (Fig. 1). This suggests that other components of the genome are forcing A. hydrophila and A. salmonicida together in the expanded core phylogeny. Due to the limited resolution, the RP phylogeny did not provide additional support. A strict core phylogeny using only ortholog groups present in all 56 taxa shared the topology of the EC tree, suggesting that the conflict with the HK method was due to genes present in 100% of the genomes (see Fig. S2 in the supplemental material). One should consider, however, that the EC phylogeny may have inherent biases which might lead to an inaccurate depiction of organismal phylogeny. At this point, we cannot establish which topology is correct, since gene transfer between divergent groups has the potential to lead to trees from concatenated data sets that do not reflect the vertical inheritance (19). Gene transfer frequency is usually biased toward close relatives, thus reinforcing the signal due to shared ancestry (53, 54). In contrast, highways of gene sharing between more distant species can obscure the vertical phylogenetic signal due to shared ancestry (51, 55). For phylogenetic relationships within each of the clades 1 through 7, the HK and EC phylogenies appear to approximate organismal phylogeny (Fig. 1). On the other hand, relationships between these clades remain ambiguous. Differences in substitution rates and saturation with substitutions make it difficult to apply ANI and isDDH to higher taxonomic levels. Future work will need to include the evaluation of the 2,710 individual trees from the EC analysis in a combined analysis, such as the one described by Bansal, Alm, and Kellis (56), to determine the major conflicting phylogenetic signals retained in these genomes. Even so, both the HK and EC phylogenies provided more information regarding the relationships of different Aeromonas species than previous MLSA studies.

The psychrophilic aeromonads have been differentiated from the mesophilic strains based on growth physiology, biochemical properties, and virulence characteristics. Although there certainly are important differences among these characteristics, whole-genome information groups them clearly among the mesophilic species, near A. hydrophila and A. dhakensis. One interesting distinction of the A. salmonicida clade is that there is much less genetic diversity, indicated by the isDDH values for strains of the same species. The four A. salmonicida genomes had isDDH values ≥98.5%, in comparison to A. hydrophila (≥75.7%), A. dhakensis (≥78.3%), and A. veronii (≥70.4%). This was consistent with a study that suggested a clonal distribution of A. salmonicida subsp. salmonicida based on identical pulse electrophoresis DNA fingerprints, which showed identical banding patterns from strains isolated from different geographical regions (57). This difference in genetic diversity could reflect different evolutionary driving forces for A. salmonicida strains. One conjecture is that perhaps they are adapted for a virulent lifestyle in fish, where clonal outbreaks are more likely to occur. It is also possible that there is a sampling bias, which future studies employing more strains should help to resolve.

One of our goals was to assess the utility of bioinformatics approaches to replace traditional taxonomic approaches for species identification. Despite the shortcomings and challenges of laboratory DDH, whole-genome content comparisons collectively represent the most valuable criterion for demarcation of bacterial species. As more bacterial genomes are sequenced and the information is made accessible, the use of whole genome sequences in the characterization of bacterial species provides opportunities that should not be ignored. This approach has been used in clarifying the taxonomic positions in some cases, e.g., for Acinetobacter using ANI and core gene phylogeny (15) and for Vibrio using MLSA based on genome information (58). To our knowledge, however, an approach utilizing isDDH and ANI combined with HK, RG, and EC phylogenies has not yet been done for a genus characterized by a complicated taxonomy and using a plurality of its members.

Aeromonas is an interesting test case for a number of reasons. This genus is comprised of a large number of species capable of diverse associations depending on the species. The spectrum encompasses benign and virulent species, a range that can also exist within a single species. A. hydrophila, A. caviae, and A. veronii have long been associated with human disease (26). Recently, A. dhakensis was recognized as a new virulent species (59), a distinction obfuscated in part due to A. dhakensis strains initially regarded as A. hydrophila. Of the numerous Aeromonas species that have been proposed and characterized, many of those species have been redefined and renamed as new information has been presented. This shifting nomenclature is a manifestation of the inefficiencies inherent in current taxonomic methods for Aeromonas. While the number of publically available Aeromonas genomes has increased dramatically in the last few years, most of the type strains are yet to be fully sequenced. We produced improved, high-quality draft genomes for these type strains and for some non-type strains of interest. Our results recapitulated known phylogenetic relationships and provided further insights into several others. This study also identified the breakpoints between species, indicating that this approach can be used to identify new species. For demarcating species boundaries, isDDH and ANI produced similar results, as reflected in the correlation of the values observed when using the upper 95% CI bound to the isDDH estimates (Fig. 3). The current version of isDDH is only available in a Web-based interface that requires manually uploading the sequence information, while ANI can be easily run on local servers. Consequently, we found ANI to be more time-effective when dealing with a large number of strains. For smaller studies, isDDH would be equally fast for computing and would also have the benefit of confidence intervals and probability statistics.

Apart from the fact that our approach could confidently and consistently resolve recent taxonomic controversies, our analysis also revealed that two strains, AMC 34 and AH4, represent new Aeromonas species. This conclusion is based on the distance in the genome content according to ANI and isDDH values, as well as the phylogenetic distances of the strains. These finding highlights two important advantages of bioinformatic assessment of genome similarity: (i) the expensive generation of the raw data does not have to be repeated by other research groups, and (ii) interlaboratory variations in DDH determinations can be overcome by agreeing to a cutoff value with standardized parameters in bioinformatic analyses. To facilitate the progress of other research groups in the Aeromonas field, we have set up a website (http://aeromonasgenomes.uconn.edu) that allows users to query and download all of the available Aeromonas genomes, contains the scripts we used in our analysis, and provides a summary of our current distance measures.

Another important finding from our analysis was that, out of the 23 publically available Aeromonas genomes that we analyzed, 8 (34.8%) are inconsistenly named. In large part this was due to the recent reclassification of A. hydrophila subsp. dhakensis as A. dhakensis and the reclassification of A. aquariorum as A. dhakensis. While the initial misclassifications are understandable, efforts should be taken to correct and update the nomenclature to curtail the promulgation of inaccurate information. NCBI currently allows only the original submitter to request the name change (http://www.ncbi.nlm.nih.gov/books/NBK51157/). One possibility would be to involve the community at large to provide input on such discrepancies.

The ability to generate improved, high-quality draft genome sequences rapidly and inexpensively, and of a sufficient quality for robust phylogenetic analyses (20), is changing the landscape of how one can investigate microbial taxonomy and should lead to a change in the requirements of performing laboratory-based DDH for species descriptions. An additional benefit of genome sequencing is that it offers a comprehensive resource to explore the myriad of potential metabolic capabilities, physiology, virulence factors, and antibiotic resistance profiles for the strains studied. The advantages of in silico DDH or ANI have been elegantly stated before (9, 16, 21, 22), and we have provided strong support for implementing these approaches in today’s microbial taxonomic studies. However, we recognize that the procedure of officially naming and describing new organisms is understandably a conservative and carefully regulated process; the effects on many different constituents have to be considered, since any amendments will result in broad effects for the scientific community at large. In this study, we provided data from a genus with a complex and controversial taxonomy and demonstrated the accuracy of the bioinformatics approach to identify new species and to correct erroneous identifications from previous studies. Utilizing the same software, code, and parameters for the data analysis, one can readily compare findings of other groups, thus supplanting arguments concerning laboratory methodologies with practical discussions on appropriate cutoff levels. For this test case study with Aeromonas, an isDDH of ≥70% at the upper 95% confidence interval or an ANI value of ≥96% was consistent for genomes belonging to the same species. Distance in the EC phylogeny is another metric that can be useful in species identification; in our study, a distance of ≤0.026 indicated that the genomes belong to the same species. It is likely that these types of values will also be applicable to other genera.

MATERIALS AND METHODS

Strains, growth conditions, and biochemical tests.For the genome data set, we included all of the type strains for Aeromonas with the exception A. cavernicola (50), as well as all other Aeromonas genomes deposited into public databases as of 17 July 2013. For the type strains, 2 were publically available and 27 were sequenced in-house. For additional strains, 21 were available publically and 6 were sequenced in-house. The bacteria were grown at the optimal growth temperature for the strain in LB broth or on LB agar (1.5%) plates for 16 to 18 h (60). For biochemical tests, API 20NE strips (bioMérieux, Marcy l’Etoile, France) were used in accordance with the manufacturer’s instructions. Separate tests for ornithine decarboxylase (ODC) activity and esculin hydrolysis were assessed using ODC broth and bile esculin agar (Sigma-Aldrich, St. Louis, MO). Tests were performed in triplicate.

Library preparation and genome sequencing.Genomic DNA was extracted using the MasterPure DNA purification kit (Epicenter, Madison, WI) and quantified using a Qubit 2.0 fluorometer (Life Technologies, Carlsbad, CA). DNA was also checked for quality by using a NanoDrop instrument (NanoDrop Products, Wilmington, DE) as well as on an agarose gel. Libraries were prepared from the genomic DNA using a Nextera or Nextera XT DNA sample preparation kit (Illumina, Inc., San Diego, CA). Library concentrations were determined by using the Qubit fluorometer and bioanalyzer (Agilent Technologies, Santa Clara, CA) prior to sequencing on a MiSeq benchtop sequencer (Illumina, Inc.) at the Microbial Analysis Resources and Services facility at the University of Connecticut (Storrs, CT).

Assembly and annotation.Paired Illumina reads were trimmed and assembled into scaffolded contigs by using the de novo assembler of CLC Genomics Workbench versions 6.0.04 to 7.0.04 (CLC-bio, Aarhus, Denmark). Annotation of the contigs was accomplished using the Rapid Automated Annotation using Subsystem Technology (RAST) server (61). All Aeromonas completed and draft annotated assemblies from the NCBI ftp repository that were used in this study were downloaded, back-engineered into contigs, and submitted to RAST for reannotation to mitigate any biases in the RAST annotation algorithms by applying them equally to each genome. The completeness of the genomes was initially assessed by screening for 17 housekeeping genes and 47 ribosomal proteins. We failed to detected ppsA (phosphoenolpyruvate synthase) in A. fluvialis. A thorough investigation employing mapping of reads to reference sequences and examining the region containing ppsA in the other strains suggested that this gene may not be present in this organism, and thus we excluded ppsA from the analysis.

MLSA reference tree and individual gene tree generation.Sixteen housekeeping genes (atpD, dnaJ, dnaK, dnaX, gltA, groL, gyrA, gyrB, metG, mdh, radA, recA, rpoC, rpoD, tsf, and zipA) were used for MLSA (33, 34, 39). The DNA-directed RNA polymerase subunit beta′ (rpoC) was used in the MLSA dataset. Adding rpoB to the dataset or switching it for rpoC did not change the phylogeny resulting from the MLSA analysis depicted in Fig. 1. These genes were initially chosen in three separate MLSA studies for their conservation among all aeromonads, ease of PCR primer design, broad distribution, and single copy number in the chromosome. The full-length sequence of each gene was initially derived from the previously published genome of A. veronii Hm21 (62), and these sequences served as queries for BLAST searches against the annotated proteins of all 56 genomes. Multiple sequence alignments (MSAs) were generated by translating the genes to protein sequences in SeaView (63), aligning the proteins using MUSCLE (v.3.8.31) (64) and then back-translating to the nucleotide sequences prior to the phylogenetic analysis. Each MSA was manually evaluated, and any sequences showing poor alignment were examined further, including comparison against the nonredundant database using BLAST and excluded if not found to be the correct protein. In-house scripts created a concatenated alignment of all 16 genes. A model of evolution was determined by using the Akaike information criterion with correction for small sample size (AICc), as implemented in jModelTest 2.1.4. An ML phylogeny was generated from the concatenated MSA, and individual gene phylogenies from the individual gene MSAs were determined by using PhyML (v 3.0_360-500M) (65). PhyML parameters consisted of a GTR model, estimated p-invar, 4 substitution rate categories, estimated gamma distribution, and subtree pruning and regrafting enabled with 100 bootstrap replicates. Using the same approach, phylogenies were determined for each of the 16 housekeeping genes.

Ribosomal reference tree generation.Forty-seven ribosomal proteins were obtained from the BioCyc website (66). These served as queries for BLAST searches against the annotated proteins of all 56 genomes. Multiple sequence alignments were generated as described above for the MLSA tree. The AICc reported the best-fitting model to be GTR plus gamma estimation plus invariable site estimation.

Core genome comparison.To define a core genome, the annotated protein open reading frames (ORFs) from each genome were used as BLAST queries against the protein ORFs of each other genome in the study, using in-house Perl scripting. The BLAST outputs were processed into OGs with MCL-edge v14-137 (67, 68) (http://micans.org/mcl/). The inflation value was set to 10 in order to break the OGs down into smaller clusters that more closely resembled individual genes rather than families. A relaxed core was defined by extracting OGs present in at least 90% of the taxa used in this study. Where a taxon had multiple entries in a single OG, the first entry reported by MCL was arbitrarily included and the others were excluded. Each OG was aligned using MUSCLE v 3.8.31 (64). In-house Perl scripting concatenated the OGs into a single alignment. Owing to the scale of the concatenated alignment, FastTreeMP (69) was used to perform the phylogenetic reconstruction. The substitution model used was WAG.

Pairwise sequence distance calculations and identity calculations.Sequence distances were calculated using the SaveDist function in PAUP* v4.0b10 (70). The distance type calculated was the P-distance.

Average nucleotide identity/tetramer analysis.Assembled contigs were reconstituted from the RAST-generated GenBank files for all genomes by using the seqret function of the EMBOSS package (71). All genomes were treated in the same manner to ensure that any biases were consistent across the entire data set. JSpecies1.2.1 (23) was used to analyze these contig sets for the ANI and tetramer usage patterns, using default parameters. We report here the averages of the reciprocal comparisons.

Tree comparisons using the approximately unbiased test.Per site log likelihoods were generated in RAxML v 7.3.5 (72). The AU tests (52) were carried out in the CONSEL v 1.20 package (73). Comparisons were made with HK tree against the 100 bootstrap replicates from each individual gene. Likewise, each best individual gene tree was compared against 100 bootstrap replicates of the HK tree.

In silico DNA-DNA hybridization.Estimates of isDDH were made using the Genome-to-Genome Distance Calculator (GGDC) (9, 21). The contig files were uploaded to the GGDC 2.0 Web server (http://ggdc.dsmz.de/distcalc2.php), where isDDH calculations were performed. Formula 2 alone was used for analysis, since it calculates isDDH estimates independent of genome lengths and is recommended by the authors of GGDC for use with any incomplete genomes (9, 21). The point estimate plus the 95% model-based confidence intervals were used for analysis.

Figure S4

Maximum likelihood reconstruction of the (A) gltA gene, (B) groL gene, (C) gyrA gene, and (D) gyrB gene. Support values are percent bootstraps values. Download Figure S4, EPS file, 2.1 MB.
Copyright © 2014 Colston et al.

This is an open-access article distributed under the terms of the Creative Commons Attribution-Noncommercial-ShareAlike 3.0 Unported license, which permits unrestricted noncommercial use, distribution, and reproduction in any medium, provided the original author and source are credited.

Figure S5

Maximum likelihood reconstruction of the (A) mdh gene, (B) metG gene, (C) radA gene, and (D) recA gene. Support values are percent bootstrap values. Download Figure S5, EPS file, 1.5 MB.
Copyright © 2014 Colston et al.

This is an open-access article distributed under the terms of the Creative Commons Attribution-Noncommercial-ShareAlike 3.0 Unported license, which permits unrestricted noncommercial use, distribution, and reproduction in any medium, provided the original author and source are credited.

Figure S6

Maximum likelihood reconstruction of the (A) rpoB gene, (B) rpoD gene, (C) tsf gene, and (D) zipA gene. Support values are percent bootstrap values. Download Figure S6, PDF file, 2 MB.
Copyright © 2014 Colston et al.

This is an open-access article distributed under the terms of the Creative Commons Attribution-Noncommercial-ShareAlike 3.0 Unported license, which permits unrestricted noncommercial use, distribution, and reproduction in any medium, provided the original author and source are credited.

ACKNOWLEDGMENTS

We thank E. Talagrand for excellent technical assistance, A. Horneman and R. M. Humphries for providing strains, the UConn Bioinformatics Facility for providing computing resources and the Microbial Analysis, Resources and Services Facility for access to an Illumina MiSeq system.

This research was supported through NIH R01 GM095390 (Joerg Graf, Peter Visscher, and Hilary G. Morrison), USDA ARS agreement 58-1930-4-002, and the National Science Foundation (DEB 0830024).

FOOTNOTES

    • Received 13 October 2014
    • Accepted 17 October 2014
    • Published 18 November 2014
  • Copyright © 2014 Colston et al.

This is an open-access article distributed under the terms of the Creative Commons Attribution-Noncommercial-ShareAlike 3.0 Unported license, which permits unrestricted noncommercial use, distribution, and reproduction in any medium, provided the original author and source are credited.

REFERENCES

  1. 1.↵
    1. Didelot X,
    2. Bowden R,
    3. Wilson DJ,
    4. Peto TE,
    5. Crook DW
    . 2012. Transforming clinical microbiology with bacterial genome sequencing. Nat. Rev. Genet. 13:601–612. doi:10.1038/nrm3437.
    OpenUrlCrossRefPubMed
  2. 2.↵
    1. Pallen MJ,
    2. Loman NJ,
    3. Penn CW
    . 2010. High-throughput sequencing and clinical microbiology: progress, opportunities and challenges. Curr. Opin. Microbiol. 13:625–631. doi:10.1016/j.mib.2010.08.003.
    OpenUrlCrossRefPubMed
  3. 3.↵
    1. Ribeca P,
    2. Valiente G
    . 2011. Computational challenges of sequence classification in microbiomic data. Brief. Bioinform. 12:614–625. doi:10.1093/bib/bbr019.
    OpenUrlCrossRefPubMed
  4. 4.↵
    1. Chen L,
    2. Xiong Z,
    3. Sun L,
    4. Yang J,
    5. Jin Q
    . 2012. VFDB 2012 update: toward the genetic diversity and molecular evolution of bacterial virulence factors. Nucleic Acids Res. 40:D641–D645. doi:10.1093/nar/gkr989.
    OpenUrlCrossRefPubMedWeb of Science
  5. 5.↵
    1. Grad YH,
    2. Lipsitch M,
    3. Feldgarden M,
    4. Arachchi HM,
    5. Cerqueira GC,
    6. Fitzgerald M,
    7. Godfrey P,
    8. Haas BJ,
    9. Murphy CI,
    10. Russ C,
    11. Sykes S,
    12. Walker BJ,
    13. Wortman JR,
    14. Young S,
    15. Zeng Q,
    16. Abouelleil A,
    17. Bochicchio J,
    18. Chauvin S,
    19. Desmet T,
    20. Gujja S,
    21. McCowan C,
    22. Montmayeur A,
    23. Steelman S,
    24. Frimodt-Møller J,
    25. Petersen AM,
    26. Struve C,
    27. Krogfelt KA,
    28. Bingen E,
    29. Weill FX,
    30. Lander ES,
    31. Nusbaum C,
    32. Birren BW,
    33. Hung DT,
    34. Hanage WP
    . 2012. Genomic epidemiology of the Escherichia coli O104:H4 outbreaks in Europe, 2011. Proc. Natl. Acad. Sci. U. S. A. 109:3065–3070. doi:10.1073/pnas.1121491109.
    OpenUrlAbstract/FREE Full Text
  6. 6.↵
    1. Sogin ML,
    2. Morrison HG,
    3. Huber JA,
    4. Mark Welch D,
    5. Huse SM,
    6. Neal PR,
    7. Arrieta JM,
    8. Herndl GJ
    . 2006. Microbial diversity in the deep sea and the underexplored “rare biosphere”. Proc. Natl. Acad. Sci. U. S. A. 103:12115–12120. doi:10.1073/pnas.0605127103.
    OpenUrlAbstract/FREE Full Text
  7. 7.↵
    1. Bomar L,
    2. Maltz M,
    3. Colston S,
    4. Graf J
    . 2011. Directed culturing of microorganisms using metatranscriptomics. mBio 2(2):e00012-11. doi:10.1128/mBio.00012-11.
    OpenUrlAbstract/FREE Full Text
  8. 8.↵
    1. Schnoes AM,
    2. Brown SD,
    3. Dodevski I,
    4. Babbitt PC
    . 2009. Annotation error in public databases: misannotation of molecular function in enzyme superfamilies. PLOS Comput. Biol. 5:e1000605. doi:10.1371/journal.pcbi.10000605.
    OpenUrlCrossRefPubMed
  9. 9.↵
    1. Meier-Kolthoff JP,
    2. Auch AF,
    3. Klenk HP,
    4. Göker M
    . 2013. Genome sequence-based species delimitation with confidence intervals and improved distance functions. BMC Bioinformatics 14:60. doi:10.1186/1471-2105-14-60.
    OpenUrlCrossRefPubMed
  10. 10.↵
    1. Chun J,
    2. Rainey FA
    . 2014. Integrating genomics into the taxonomy and systematics of the Bacteria and Archaea. Int. J. Syst. Evol. Microbiol. 64:316–324. doi:10.1099/ijs.0.054171-0.
    OpenUrlCrossRefPubMed
  11. 11.↵
    1. Brenner DJ,
    2. Staley JT,
    3. Krieg NR
    . 2005. Classification of procaryotic organisms and the concept of bacterial speciation, p 27–32. In Brenner DJ, Staley JT, Krieg NR, Garrity GM (ed), Bergey's manual of systematic bacteriology, vol 2. The proteobacteria. Springer Verlag, New York, NY.
  12. 12.↵
    1. Lapage SP,
    2. Sneath PHA,
    3. Lessel EF,
    4. Skerman VBD,
    5. Seeliger HPR,
    6. Clark WA
    . 1992. International code of nomenclature of Bacteria: bacteriological code, 1990 revision. ASM Press, Washington, DC.
  13. 13.↵
    1. Stackebrandt E,
    2. Frederiksen W,
    3. Garrity GM,
    4. Grimont PA,
    5. Kämpfer P,
    6. Maiden MC,
    7. Nesme X,
    8. Rosselló-Mora R,
    9. Swings J,
    10. Trüper HG,
    11. Vauterin L,
    12. Ward AC,
    13. Whitman WB
    . 2002. Report of the ad hoc committee for the re-evaluation of the species definition in bacteriology. Int. J. Syst. Evol. Microbiol. 52:1043–1047. doi:10.1099/ijs.0.02360-0.
    OpenUrlCrossRefPubMedWeb of Science
  14. 14.↵
    1. Gevers D,
    2. Cohan FM,
    3. Lawrence JG,
    4. Spratt BG,
    5. Coenye T,
    6. Feil EJ,
    7. Stackebrandt E,
    8. Van de Peer Y,
    9. Vandamme P,
    10. Thompson FL,
    11. Swings J
    . 2005. Opinion: re-evaluating prokaryotic species. Nat. Rev. Microbiol. 3:733–739. doi:10.1038/nrmicro1236.
    OpenUrlCrossRefPubMedWeb of Science
  15. 15.↵
    1. Chan JZ,
    2. Halachev MR,
    3. Loman NJ,
    4. Constantinidou C,
    5. Pallen MJ
    . 2012. Defining bacterial species in the genomic era: insights from the genus Acinetobacter. BMC Microbiol. 12:302. doi:10.1186/1471-2180-12-302.
    OpenUrlCrossRefPubMed
  16. 16.↵
    1. Konstantinidis KT,
    2. Tiedje JM
    . 2005. Genomic insights that advance the species definition for prokaryotes. Proc. Natl. Acad. Sci. U. S. A. 102:2567–2572. doi:10.1073/pnas.0409727102.
    OpenUrlAbstract/FREE Full Text
  17. 17.↵
    1. Clarke SC,
    2. Diggle MA,
    3. Edwards GF
    . 2002. Multilocus sequence typing and porA gene sequencing differentiates strains of Neisseria meningitidis during case clusters. Br. J. Biomed. Sci. 59:160–162.
    OpenUrlPubMed
  18. 18.↵
    1. Kämpfer P,
    2. Glaeser SP
    . 2012. Prokaryotic taxonomy in the sequencing era—the polyphasic approach revisited. Environ. Microbiol. 14:291–317. doi:10.1111/j.1462-2920.2011.02615.x.
    OpenUrlCrossRefPubMed
  19. 19.↵
    1. Lapierre P,
    2. Lasek-Nesselquist E,
    3. Gogarten JP
    . 2014. The impact of HGT on phylogenomic reconstruction methods. Brief. Bioinform. 15:79–90. doi:10.1093/bib.bbs050.
    OpenUrlCrossRefPubMed
  20. 20.↵
    1. Chain PSG,
    2. Grafham DV,
    3. Fulton RS,
    4. Fitzgerald MG,
    5. Hostetler J,
    6. Muzny D,
    7. Ali J,
    8. Birren B,
    9. Bruce DC,
    10. Buhay C,
    11. Cole JR,
    12. Ding Y,
    13. Dugan S,
    14. Field D,
    15. Garrity GM,
    16. Gibbs R,
    17. Graves T,
    18. Han CS,
    19. Harrison SH,
    20. Highlander S,
    21. Hugenholtz P,
    22. Khouri HM,
    23. Kodira CD,
    24. Kolker E,
    25. Kyrpides NC,
    26. Lang D,
    27. Lapidus A,
    28. Malfatti SA,
    29. Markowitz V,
    30. Metha T,
    31. Nelson KE,
    32. Parkhill J,
    33. Pitluck S,
    34. Qin X,
    35. Read TD,
    36. Schmutz J,
    37. Sozhamannan S,
    38. Sterk P,
    39. Strausberg RL,
    40. Sutton G,
    41. Thomson NR,
    42. Tiedje JM,
    43. Weinstock G,
    44. Wollam A,
    45. Consortium GSCHMPJ,
    46. Detter JC
    . 2009. Genomics. Genome project standards in a new era of sequencing. Science (New York, NY) 326:236–237. doi:10.1126/science.1180614.
    OpenUrlCrossRef
  21. 21.↵
    1. Auch AF,
    2. von Jan M,
    3. Klenk HP,
    4. Göker M
    . 2010. Digital DNA-DNA hybridization for microbial species delineation by means of genome-to-genome sequence comparison. Stand. Genomic Sci. 2:117–134. doi:10.4056/sigs.531120.
    OpenUrlCrossRefPubMedWeb of Science
  22. 22.↵
    1. Konstantinidis KT,
    2. Ramette A,
    3. Tiedje JM
    . 2006. Toward a more robust assessment of intraspecies diversity, using fewer genetic markers. Appl. Environ. Microbiol. 72:7286–7293. doi:10.1128/AEM.01398-06.
    OpenUrlAbstract/FREE Full Text
  23. 23.↵
    1. Richter M,
    2. Rosselló-Móra R
    . 2009. Shifting the genomic gold standard for the prokaryotic species definition. Proc. Natl. Acad. Sci. U. S. A. 106:19126–19131. doi:10.1073/pnas.0906412106.
    OpenUrlAbstract/FREE Full Text
  24. 24.↵
    1. Scortichini M,
    2. Marcelletti S,
    3. Ferrante P,
    4. Firrao G
    . 2013. A genomic redefinition of Pseudomonas avellanae species. PLoS One 8:e75794. doi:10.1371/journal.pone.0075794.
    OpenUrlCrossRefPubMed
  25. 25.↵
    1. Tarazona E,
    2. Lucena T,
    3. Arahal DR,
    4. Macián MC,
    5. Ruvira MA,
    6. Pujalte MJ
    . 2014. Multilocus sequence analysis of putative Vibrio mediterranei strains and description of Vibrio thalassae sp. nov. Syst. Appl. Microbiol. 37:320–328. doi:10.1016/j.syapm.2014.05.005.
    OpenUrlCrossRefPubMed
  26. 26.↵
    1. Janda JM,
    2. Abbott SL
    . 2010. The genus Aeromonas: taxonomy, pathogenicity, and infection. Clin. Microbiol. Rev. 23:35–73. doi:10.1128/CMR.00039-09.
    OpenUrlAbstract/FREE Full Text
  27. 27.↵
    1. Cheesman SE,
    2. Neal JT,
    3. Mittge E,
    4. Seredick BM,
    5. Guillemin K
    . 2011. Epithelial cell proliferation in the developing zebrafish intestine is regulated by the Wnt pathway and microbial signaling via Myd88. Proc. Natl. Acad. Sci. U. S. A. 108:4570–4577. doi:10.1073/pnas.1000072107.
    OpenUrlAbstract/FREE Full Text
  28. 28.↵
    1. Martin-Carnahan A,
    2. Joseph SW
    . 2005. Genus I. Aeromonas Stanier 1943:213AL, p 557–578. In Brenner DJ, Krieg NR, Staley JT, Garrity GM (ed), Bergey's manual of systematic bacteriology, vol 2, 2nd ed. Springer Verlag, New York, NY.
    OpenUrl
  29. 29.↵
    1. Huys G,
    2. Cnockaert M,
    3. Swings J
    . 2005. Aeromonas culicicola Pidiyar et al. 2002 is a later subjective synonym of Aeromonas veronii Hickman-Brenner et al. 1987. Syst. Appl. Microbiol. 28:604–609. doi:10.1016/j.syapm.2005.03.012.
    OpenUrlCrossRefPubMedWeb of Science
  30. 30.↵
    1. Huys G,
    2. Kämpfer P,
    3. Swings J
    . 2001. New DNA-DNA hybridization and phenotypic data on the species Aeromonas ichthiosmia and Aeromonas allosaccharophila: A. ichthiosmia Schubert et al. 1990 is a later synonym of A. veronii Hickman-Brenner et al. 1987. Syst. Appl. Microbiol. 24:177–182. doi:10.1078/0723-2020-00038.
    OpenUrlCrossRefPubMed
  31. 31.↵
    1. Collins MD,
    2. Martinez-Murcia AJ,
    3. Cai J
    . 1993. Aeromonas enteropelogenes and Aeromonas ichthiosmia are identical to Aeromonas trota and Aeromonas veronii, respectively, as revealed by small-subunit rRNA sequence analysis. Int. J. Syst. Bacteriol. 43:855–856. doi:10.1099/00207713-43-4-855.
    OpenUrlCrossRefPubMed
  32. 32.↵
    1. Huys G,
    2. Denys R,
    3. Swings J
    . 2002. DNA-DNA reassociation and phenotypic data indicate synonymy between Aeromonas enteropelogenes Schubert et al. 1990 and Aeromonas trota Carnahan et al. 1991. Int. J. Syst. Evol. Microbiol. 52:1969–1972. doi:10.1099/ijs.0.01996-0.
    OpenUrlCrossRefPubMed
  33. 33.↵
    1. Roger F,
    2. Marchandin H,
    3. Jumas-Bilak E,
    4. Kodjo A,
    5. colBVH study group,
    6. Lamy B
    . 2012. Multilocus genetics to reconstruct aeromonad evolution. BMC Microbiol. 12:62. doi:10.1186/1471-2180-12-62.
    OpenUrlCrossRefPubMed
  34. 34.↵
    1. Martinez-Murcia AJ,
    2. Monera A,
    3. Saavedra MJ,
    4. Oncina R,
    5. Lopez-Alvarez M,
    6. Lara E,
    7. Figueras MJ
    . 2011. Multilocus phylogenetic analysis of the genus Aeromonas. Syst. Appl. Microbiol. 34:189–199. doi:10.1016/j.syapm.2010.11.014.
    OpenUrlCrossRefPubMed
  35. 35.↵
    1. Miñana-Galbis D,
    2. Farfán M,
    3. Albarral V,
    4. Sanglas A,
    5. Lorén JG,
    6. Fusté MC
    . 2013. Reclassification of Aeromonas hydrophila subspecies anaerogenes. Syst. Appl. Microbiol. 36:306–308. doi:10.1016/j.syapm.2013.04.006.
    OpenUrlCrossRefPubMed
  36. 36.↵
    1. Huys G,
    2. Kämpfer P,
    3. Albert MJ,
    4. Kühn I,
    5. Denys R,
    6. Swings J
    . 2002. Aeromonas hydrophila subsp. dhakensis subsp. nov., isolated from children with diarrhoea in Bangladesh, and extended description of Aeromonas hydrophila subsp. hydrophila (Chester 1901) Stanier 1943 (approved lists 1980). Int. J. Syst. Evol. Microbiol. 52:705–712. doi:10.1099/ijs.0.01844-0.
    OpenUrlCrossRefPubMedWeb of Science
  37. 37.↵
    1. Figueras MJ,
    2. Beaz-Hidalgo R,
    3. Senderovich Y,
    4. Laviad S,
    5. Halpern M
    . 2011. Re-identification of Aeromonas isolates from chironomid egg masses as the potential pathogenic bacteria Aeromonas aquariorum. Environ. Microbiol. Rep. 3:239–244. doi:10.1111/j.1758-2229.2010.00216.x.
    OpenUrlCrossRefPubMed
  38. 38.↵
    1. Beaz-Hidalgo R,
    2. Martínez-Murcia A,
    3. Figueras MJ
    . 2014. Corrigendum to “Reclassification of Aeromonas hydrophila subsp. dhakensis Huys et al. 2002 and Aeromonas aquariorum Martínez-Murcia et al. 2008 as Aeromonas dhakensis sp. nov. comb nov. and emendation of the species Aeromonas hydrophila.” [Syst. Appl. Microbiol. 36:171–176, 2013.] Syst. Appl. Microbiol. 37:543. doi:10.1016/j.syapm.2012.12.007.
    OpenUrlCrossRef
  39. 39.↵
    1. Martino ME,
    2. Fasolato L,
    3. Montemurro F,
    4. Rosteghin M,
    5. Manfrin A,
    6. Patarnello T,
    7. Novelli E,
    8. Cardazzo B
    . 2011. Definition of microbial diversity in Aeromonas strains based on multilocus sequence typing, phenotype and presence of putative genes of virulence. Appl. Environ. Microbiol. 77:4986–5000. doi:10.1128/AEM.00708-11.
    OpenUrlAbstract/FREE Full Text
  40. 40.↵
    1. Loren JG,
    2. Farfan M,
    3. Fuste MC
    . 2014. Molecular phylogenetics and temporal diversification in the genus Aeromonas based on the sequences of five housekeeping genes. PLoS One 9:e88805. doi:10.1371/journal.pone.0088805.
    OpenUrlCrossRefPubMed
  41. 41.↵
    1. Silver AC,
    2. Williams D,
    3. Faucher J,
    4. Horneman AJ,
    5. Gogarten JP,
    6. Graf J
    . 2011. Complex evolutionary history of the Aeromonas veronii group revealed by host interaction and DNA sequence data. PLoS One 6:e16751. doi:10.1371/journal.pone.0016751.
    OpenUrlCrossRefPubMed
  42. 42.↵
    1. Morandi A,
    2. Zhaxybayeva O,
    3. Gogarten JP,
    4. Graf J
    . 2005. Evolutionary and diagnostic implications of intragenomic heterogeneity in the 16S rRNA gene in Aeromonas strains. J. Bacteriol. 187:6561–6564. doi:10.1128/JB.187.18.6561-6564.2005.
    OpenUrlAbstract/FREE Full Text
  43. 43.↵
    1. Roger F,
    2. Lamy B,
    3. Jumas-Bilak E,
    4. Kodjo A,
    5. colBVH Study Group,
    6. Marchandin H
    . 2012. Ribosomal multi-operon diversity: an original perspective on the genus Aeromonas. PLoS One 7:e46268. doi:10.1371/journal.pone.0046268.
    OpenUrlCrossRefPubMed
  44. 44.↵
    1. Beaz-Hidalgo R,
    2. Martinez-Murcia A,
    3. Figueras MJ
    . 2013. Reclassification of Aeromonas hydrophila subsp. dhakensis Huys et al. 2002 and Aeromonas aquariorum Martinez-Murcia et al. 2008 as Aeromonas dhakensis sp. nov. comb. nov. and emendation of the species Aeromonas hydrophila. Syst. Appl. Microbiol. 36:171–176. doi:10.1016/j.syapm.2012.12.007.
    OpenUrlCrossRef
  45. 45.↵
    1. Grim CJ,
    2. Kozlova EV,
    3. Ponnusamy D,
    4. Fitts EC,
    5. Sha J,
    6. Kirtley ML,
    7. van Lier CJ,
    8. Tiner BL,
    9. Erova TE,
    10. Joseph SJ,
    11. Read TD,
    12. Shak JR,
    13. Joseph SW,
    14. Singletary E,
    15. Felland T,
    16. Baze WB,
    17. Horneman AJ,
    18. Chopra AK
    . 2014. Functional genomic characterization of virulence factors from necrotizing fasciitis-causing strains of Aeromonas hydrophila. Appl. Environ. Microbiol. 80:4162–4183. doi:10.1128/AEM.00486-14.
    OpenUrlAbstract/FREE Full Text
  46. 46.↵
    1. Carnahan AM,
    2. Behram S,
    3. Joseph SW
    . 1991. Aerokey II: a flexible key for identifying clinical Aeromonas species. J. Clin. Microbiol. 29:2843–2849.
    OpenUrlAbstract/FREE Full Text
  47. 47.↵
    1. Beaz-Hidalgo R,
    2. Alperi A,
    3. Buján N,
    4. Romalde JL,
    5. Figueras MJ
    . 2010. Comparison of phenotypical and genetic identification of Aeromonas strains isolated from diseased fish. Syst. Appl. Microbiol. 33:149–153. doi:10.1016/j.syapm.2010.02.002.
    OpenUrlCrossRefPubMed
  48. 48.↵
    1. Martínez-Murcia A,
    2. Monera A,
    3. Alperi A,
    4. Figueras MJ,
    5. Saavedra MJ
    . 2009. Phylogenetic evidence suggests that strains of Aeromonas hydrophila subsp. dhakensis belong to the species Aeromonas aquariorum sp. nov. Curr. Microbiol. 58:76–80. doi:10.1007/s00284-008-9278-6.
    OpenUrlCrossRefPubMed
  49. 49.↵
    1. Giltner CL,
    2. Bobenchik AM,
    3. Uslan DZ,
    4. Deville JG,
    5. Humphries RM
    . 2013. Ciprofloxacin-resistant Aeromonas hydrophila cellulitis following leech therapy. J. Clin. Microbiol. 51:1324–1326. doi:10.1128/JCM.03217-12.
    OpenUrlAbstract/FREE Full Text
  50. 50.↵
    1. Martínez-Murcia A,
    2. Beaz-Hidalgo R,
    3. Svec P,
    4. Saavedra MJ,
    5. Figueras MJ,
    6. Sedlacek I
    . 2013. Aeromonas cavernicola sp. nov., isolated from fresh water of a brook in a cavern. Curr. Microbiol. 66:197–204. doi:10.1007/s00284-012-0253-x.
    OpenUrlCrossRefPubMed
  51. 51.↵
    1. Beiko RG,
    2. Harlow TJ,
    3. Ragan MA
    . 2005. Highways of gene sharing in prokaryotes. Proc. Natl. Acad. Sci. U. S. A. 102:14332–14337. doi:10.1073/pnas.0504068102.
    OpenUrlAbstract/FREE Full Text
  52. 52.↵
    1. Shimodaira H
    . 2002. An approximately unbiased test of phylogenetic tree selection. Syst. Biol. 51:492–508. doi:10.1080/10635150290069913.
    OpenUrlCrossRefPubMedWeb of Science
  53. 53.↵
    1. Andam CP,
    2. David W,
    3. Gogarten JP
    . 2010. Biased gene transfer mimics patterns created through shared ancestry. Proc. Natl. Acad. Sci. U. S. A. 107:10679–10684. doi:10.1073/pnas.1001418107.
    OpenUrlAbstract/FREE Full Text
  54. 54.↵
    1. Pace NR,
    2. Sapp J,
    3. Goldenfeld N
    . 2012. Phylogeny and beyond: Scientific, historical, and conceptual significance of the first tree of life. Proc. Natl. Acad. Sci. 109:1011–1018. doi:10.1073/pnas.1109716109.
    OpenUrlAbstract/FREE Full Text
  55. 55.↵
    1. Williams D,
    2. Fournier GP,
    3. Lapierre P,
    4. Swithers KS,
    5. Green AG,
    6. Andam CP,
    7. Gogarten JP
    . 2011. A rooted net of life. Biol. Direct 6:45. doi:10.1186/1745-6150-6-45.
    OpenUrlCrossRefPubMed
  56. 56.↵
    1. Bansal MS,
    2. Alm EJ,
    3. Kellis M
    . 2013. Reconciliation revisited: handling multiple optima when reconciling with duplication, transfer, and loss. J. Comput. Biol. 20:738–754.
    OpenUrlCrossRefPubMed
  57. 57.↵
    1. García JA,
    2. Larsen JL,
    3. Dalsgaard I,
    4. Pedersen K
    . 2000. Pulsed-field gel electrophoresis analyis of Aeromonas salmonicida ssp. salmonicida. FEMS Microbiol. Lett. 190:163–166. doi:10.1111/j.1574-6968.2000.tb09280.x.
    OpenUrlCrossRefPubMed
  58. 58.↵
    1. Thompson CC,
    2. Vicente ACP,
    3. Souza RC,
    4. Vasconcelos ATR,
    5. Vesth T,
    6. Alves N Jr.,
    7. Ussery DW,
    8. Iida T,
    9. Thompson FL
    . 2009. Genomic taxonomy of vibrios. BMC Evol. Biol. 9:258. doi:10.1186/1471-2148-9-258.
    OpenUrlCrossRefPubMed
  59. 59.↵
    1. Chen PL,
    2. Wu CJ,
    3. Chen CS,
    4. Tsai PJ,
    5. Tang HJ,
    6. Ko WC
    . 2014. A comparative study of clinical Aeromonas dhakensis and Aeromonas hydrophila isolates in southern Taiwan: A. dhakensis is more predominant and virulent. Clin. Microbiol. Infect. 20:O428–O434. doi:10.1111/1469-0691.12456.
    OpenUrlCrossRef
  60. 60.↵
    1. Sambrook J,
    2. Russell DW
    . 2001. Molecular cloning: a laboratory manual, 3rd ed. Cold Spring Harbor, New York, NY.
  61. 61.↵
    1. Overbeek R,
    2. Olson R,
    3. Pusch GD,
    4. Olsen GJ,
    5. Davis JJ,
    6. Disz T,
    7. Edwards RA,
    8. Gerdes S,
    9. Parrello B,
    10. Shukla M,
    11. Vonstein V,
    12. Wattam AR,
    13. Xia F,
    14. Stevens R
    . 2014. The SEED and the Rapid Annotation of microbial genomes using Subsystems Technology (RAST). Nucleic Acids Res. 42:D206–D214. doi:10.1093/nar/gkt1226.
    OpenUrlCrossRefPubMedWeb of Science
  62. 62.↵
    1. Bomar L,
    2. Stephens WZ,
    3. Nelson MC,
    4. Velle K,
    5. Guillemin K,
    6. Graf J
    . 2013. Draft genome sequence of Aeromonas veronii Hm21, a symbiotic isolate from the medicinal leech digestive tract. Genome Announc. 1(5):e00800-13. doi:10.1128/genomeA.00800-13.
    OpenUrlAbstract/FREE Full Text
  63. 63.↵
    1. Gouy M,
    2. Guindon S,
    3. Gascuel O
    . 2010. SeaView, version 4: a multiplatform graphical user interface for sequence alignment and phylogenetic tree building. Mol. Biol. Evol. 27:221–224. doi:10.1093/molbev/msp259.
    OpenUrlCrossRefPubMedWeb of Science
  64. 64.↵
    1. Edgar RC
    . 2004. MUSCLE: a multiple sequence alignment method with reduced time and space complexity. BMC Bioinformatics 5:113. doi:10.1186/1471-2105-5-113.
    OpenUrlCrossRefPubMed
  65. 65.↵
    1. Guindon S,
    2. Dufayard JF,
    3. Lefort V,
    4. Anisimova M,
    5. Hordijk W,
    6. Gascuel O
    . 2010. New algorithms and methods to estimate maximum-likelihood phylogenies: assessing the performance of PhyML. Syst. Biol. 59:307–321. doi:10.1093/sysbio/syq010.
    OpenUrlCrossRefPubMedWeb of Science
  66. 66.↵
    1. Caspi R,
    2. Altman T,
    3. Dale JM,
    4. Dreher K,
    5. Fulcher CA,
    6. Gilham F,
    7. Kaipa P,
    8. Karthikeyan AS,
    9. Kothari A,
    10. Krummenacker M,
    11. Latendresse M,
    12. Mueller LA,
    13. Paley S,
    14. Popescu L,
    15. Pujar A,
    16. Shearer AG,
    17. Zhang P,
    18. Karp PD
    . 2010. The MetaCyc database of metabolic pathways and enzymes and the BioCyc collection of pathway/genome databases. Nucleic Acids Res. 38:D473–D479. doi:10.1093/nar/gkp875.
    OpenUrlCrossRefPubMedWeb of Science
  67. 67.↵
    1. van Dongen S,
    2. Abreu-Goodger C
    . 2012. Using MCL to extract clusters from networks. Methods Mol. Biol. 804:281–295. doi:10.1007/978-1-61779-361-5_15.
    OpenUrlCrossRefPubMedWeb of Science
  68. 68.↵
    1. Enright AJ,
    2. Van Dongen S,
    3. Ouzounis CA
    . 2002. An efficient algorithm for large-scale detection of protein families. Nucleic Acids Res. 30:1575–1584. doi:10.1093/nar/30.7.1575.
    OpenUrlCrossRefPubMedWeb of Science
  69. 69.↵
    1. Price MN,
    2. Dehal PS,
    3. Arkin AP
    . 2010. FastTree 2—approximately maximum-likelihood trees for large alignments. PLoS One 5:e9490. doi:10.1371/journal.pone.0009490.
    OpenUrlCrossRefPubMed
  70. 70.↵
    1. Swofford DL
    . 2002. PAUP*: phylogenetic analysis using parsimony (and other methods), 4th ed. Sinauer Associates, Sunderland, MA.
  71. 71.↵
    1. Rice P,
    2. Longden I,
    3. Bleasby A
    . 2000. EMBOSS: the European Molecular Biology open software suite. Trends Genet. 16:276–277. doi:10.1016/S0168-9525(00)02024-2.
    OpenUrlCrossRefPubMedWeb of Science
  72. 72.↵
    1. Stamatakis A
    . 2006. RAxML-VI-HPC: maximum likelihood-based phylogenetic analyses with thousands of taxa and mixed models. Bioinformatics 22:2688–2690. doi:10.1093/bioinformatics/btl446.
    OpenUrlCrossRefPubMedWeb of Science
  73. 73.↵
    1. Shimodaira H,
    2. Hasegawa M
    . 2001. CONSEL: for assessing the confidence of phylogenetic tree selection. Bioinformatics 17:1246–1247. doi:10.1093/bioinformatics/17.12.1246.
    OpenUrlCrossRefPubMedWeb of Science
  74. 74.
    1. Seshadri R,
    2. Joseph SW,
    3. Chopra AK,
    4. Sha J,
    5. Shaw J,
    6. Graf J,
    7. Haft D,
    8. Wu M,
    9. Ren Q,
    10. Rosovitz MJ,
    11. Madupu R,
    12. Tallon L,
    13. Kim M,
    14. Jin S,
    15. Vuong H,
    16. Stine OC,
    17. Ali A,
    18. Horneman AJ,
    19. Heidelberg JF
    . 2006. Genome sequence of Aeromonas hydrophila ATCC 7966T: jack of all trades. J. Bacteriol. 188:8272–8282. doi:10.1128/JN.00621-06.
    OpenUrlAbstract/FREE Full Text
  75. 75.
    1. Spataro N,
    2. Farfán M,
    3. Albarral V,
    4. Sanglas A,
    5. Lorén JG,
    6. Fusté MC,
    7. Bosch E
    . 2013. Draft genome sequence of Aeromonas molluscorum strain 848TT, isolated from bivalve molluscs. Genome Announc. 1(3):e00382-13. doi:10.1128/genomeA.00382-13.
    OpenUrlAbstract/FREE Full Text
  76. 76.
    1. Beatson SA,
    2. das Graças de Luna M,
    3. Bachmann NL,
    4. Alikhan NF,
    5. Hanks KR,
    6. Sullivan MJ,
    7. Wee BA,
    8. Freitas-Almeida AC,
    9. Dos Santos PA,
    10. de Melo JT,
    11. Squire DJ,
    12. Cunningham AF,
    13. Fitzgerald JR,
    14. Henderson IR
    . 2011. Genome sequence of the emerging pathogen Aeromonas caviae. J. Bacteriol. 193:1286–1287. doi:10.1128/JB.01337-10.
    OpenUrlAbstract/FREE Full Text
  77. 77.
    1. Wu CJ,
    2. Wang HC,
    3. Chen CS,
    4. Shu HY,
    5. Kao AW,
    6. Chen PL,
    7. Ko WC
    . 2012. Genome sequence of a novel human pathogen, Aeromonas aquariorum. J. Bacteriol. 194:4114–4115. doi:10.1128/JB.00621-12.
    OpenUrlAbstract/FREE Full Text
  78. 78.
    1. Chan KG,
    2. Puthucheary SD,
    3. Chan XY,
    4. Yin WF,
    5. Wong CS,
    6. Too WS,
    7. Chua KH
    . 2011. Quorum sensing in Aeromonas species isolated from patients in Malaysia. Curr. Microbiol. 62:167–172. doi:10.1007/s00284-010-9689-z.
    OpenUrlCrossRefPubMed
  79. 79.
    1. Tekedar HC,
    2. Waldbeiser GC,
    3. Karsi A,
    4. Liles MR,
    5. Griffin MJ,
    6. Vamenta S,
    7. Sonstegard T,
    8. Hossain M,
    9. Schroeder SG,
    10. Khoo L,
    11. Lawrence ML
    . 2013. Complete genome sequence of a channel catfish epidemic isolate, Aeromonas hydrophila strain ML09-119. Genome Announc. 1(5):e00755-13. doi:10.1128/genomeA.00755-13.
    OpenUrlAbstract/FREE Full Text
  80. 80.
    1. Han JE,
    2. Kim JH,
    3. Choresca C,
    4. Shin SP,
    5. Jun JW,
    6. Park SC
    . 2013. Draft genome sequence of a clinical isolate, Aeromonas hydrophila SNUFPC-A8, from a moribund cherry salmon (Oncorhynchus masou masou). Genome Announc. 1(1):e00133-12. doi:10.1128/genomeA.00133-12.
    OpenUrlAbstract/FREE Full Text
  81. 81.
    1. Chai B,
    2. Wang H,
    3. Chen X
    . 2012. Draft genome sequence of high-melanin-yielding Aeromonas media strain WS. J. Bacteriol. 194:6693–6694. doi:10.1128/JB.01807-12.
    OpenUrlAbstract/FREE Full Text
  82. 82.
    1. Han JE,
    2. Kim JH,
    3. Shin SP,
    4. Jun JW,
    5. Chai JY,
    6. Park SC
    . 2013. Draft genome sequence of Aeromonas salmonicida subsp. achromogenes AS03, an atypical strain isolated from crucian carp (Carassius carassius) in the Republic of Korea. Genome Announc. 1:e00791-13. doi:10.1128/genomeA.00791-132229.
    OpenUrlAbstract/FREE Full Text
  83. 83.
    1. Reith ME,
    2. Singh RK,
    3. Curtis B,
    4. Boyd JM,
    5. Bouevitch A,
    6. Kimball J,
    7. Munholland J,
    8. Murphy C,
    9. Sarty D,
    10. Williams J,
    11. Nash JH,
    12. Johnson SC,
    13. Brown LL
    . 2008. The genome of Aeromonas salmonicida subsp. salmonicida A449: insights into the evolution of a fish pathogen. BMC Genomics 9:427. doi:10.1186/1471-2164-9-427.
    OpenUrlCrossRefPubMed
  84. 84.
    1. Charette SJ,
    2. Brochu F,
    3. Boyle B,
    4. Filion G,
    5. Tanaka KH,
    6. Derome N
    . 2012. Draft genome sequence of the virulent strain 01-B526 of the fish pathogen Aeromonas salmonicida. J. Bacteriol. 194:722–723. doi:10.1128/JB.06276-11.
    OpenUrlAbstract/FREE Full Text
  85. 85.
    1. Li Y,
    2. Liu Y,
    3. Zhou Z,
    4. Huang H,
    5. Ren Y,
    6. Zhang Y,
    7. Li G,
    8. Zhou Z,
    9. Wang L
    . 2011. Complete genome sequence of Aeromonas veronii strain B565. J. Bacteriol. 193:3389–3390. doi:10.1128/JB.00347-11.
    OpenUrlAbstract/FREE Full Text
PreviousNext
Back to top
Download PDF
Citation Tools
Bioinformatic Genome Comparisons for Taxonomic and Phylogenetic Assignments Using Aeromonas as a Test Case
Sophie M. Colston, Matthew S. Fullmer, Lidia Beka, Brigitte Lamy, J. Peter Gogarten, Joerg Graf
mBio Nov 2014, 5 (6) e02136-14; DOI: 10.1128/mBio.02136-14

Citation Manager Formats

  • BibTeX
  • Bookends
  • EasyBib
  • EndNote (tagged)
  • EndNote 8 (xml)
  • Medlars
  • Mendeley
  • Papers
  • RefWorks Tagged
  • Ref Manager
  • RIS
  • Zotero
Print

Alerts
Sign In to Email Alerts with your Email Address
Email

Thank you for sharing this mBio article.

NOTE: We request your email address only to inform the recipient that it was you who recommended this article, and that it is not junk mail. We do not retain these email addresses.

Enter multiple addresses on separate lines or separate them with commas.
Bioinformatic Genome Comparisons for Taxonomic and Phylogenetic Assignments Using Aeromonas as a Test Case
(Your Name) has forwarded a page to you from mBio
(Your Name) thought you would be interested in this article in mBio.
CAPTCHA
This question is for testing whether or not you are a human visitor and to prevent automated spam submissions.
Share
Bioinformatic Genome Comparisons for Taxonomic and Phylogenetic Assignments Using Aeromonas as a Test Case
Sophie M. Colston, Matthew S. Fullmer, Lidia Beka, Brigitte Lamy, J. Peter Gogarten, Joerg Graf
mBio Nov 2014, 5 (6) e02136-14; DOI: 10.1128/mBio.02136-14
del.icio.us logo Digg logo Reddit logo Twitter logo CiteULike logo Facebook logo Google logo Mendeley logo
  • Top
  • Article
    • ABSTRACT
    • INTRODUCTION
    • RESULTS
    • DISCUSSION
    • MATERIALS AND METHODS
    • ACKNOWLEDGMENTS
    • FOOTNOTES
    • REFERENCES
  • Figures & Data
  • Info & Metrics
  • PDF

Related Articles

Cited By...

About

  • About mBio
  • Editor in Chief
  • Board of Editors
  • AAM Fellows
  • Policies
  • For Reviewers
  • For the Media
  • For Librarians
  • For Advertisers
  • Alerts
  • RSS
  • FAQ
  • Permissions
  • Journal Announcements

Authors

  • ASM Author Center
  • Submit a Manuscript
  • Author Warranty
  • Article Types
  • Ethics
  • Contact Us

Follow #mBio

@ASMicrobiology

       

ASM Journals

ASM journals are the most prominent publications in the field, delivering up-to-date and authoritative coverage of both basic and clinical microbiology.

About ASM | Contact Us | Press Room

 

ASM is a member of

Scientific Society Publisher Alliance

 

American Society for Microbiology
1752 N St. NW
Washington, DC 20036
Phone: (202) 737-3600

Copyright © 2021 American Society for Microbiology | Privacy Policy | Website feedback

Online ISSN: 2150-7511