The Sequence Listing associated with this application is provided in text format in lieu of a paper copy, and is hereby incorporated by reference into the specification. The name of the text file containing the Sequence Listing is 480361_402WO_SEQUENCE_LISTING.txt. The text file is 10.9 KB, was created on Apr. 27, 2019, and is being submitted electronically via EFS-Web.
Increasingly, the ability to identify genetically related organisms is used to inform decision-making in a variety of contexts, including medicine and agriculture practices. For example, infectious bacteria are commonly tested for genetic relatedness to determine the likelihood that a given isolate will be susceptible or resistant to one or more antibiotics based on its predicted relatedness to bacteria with known susceptibility profiles (i.e., a shared genotype is predictive of a known, shared phenotypic trait). Determining relatedness can help avoiding potential ‘drug-bug’ mismatches, which occur in up to 25% of prescriptions (Tchesnokova et al., J. Clin. Microbiol. 51(9):2991-2999 (2013)), and misuse of up to 50% of antibiotics.
As another example, clonal relatedness and clonal frequencies of T cells (e.g., on the basis of T cell receptor gene recombination) and cancerous cells (e.g., clonal evolution as indicia of pathogenesis, immunologic escape, and resistance to therapy) are important factors in monitoring disease and choosing courses of treatment. See, e.g., Bianci and Munshi, Blood 125(20):3049-3058; see also Cha et al., Sci Transl Med. 6(238):238ra70 (2014). Identifying relatedness of clonal plants is desirable for maintaining preferred levels of homo- or heterogeneity with agricultural species; e.g., stress-resistant or herbicide-resistant strains.
However, current methods of identifying genetic relatedness typically involve costly, time-consuming techniques such as DNA sequencing, pulsed-field gel electrophoresis, microarray, and highly technical subsequent analysis. In addition, for some cell types, days of culture and preparation are also needed before sequencing and analysis can be performed. Thus, potentially crucial decisions may be delayed. Accordingly, there is a need for improved methods and tools for determining genetic relatedness.
Provided herein are methods and compositions for generating a clonotype and for applying clonotype information to useful ends, such as treating disease. In certain aspects, the instant disclosure provides methods and related tools for quickly generating clonotypes that are reliable indicators of genetic relatedness and diversity. Clonotypes are generated from a library of input sequences of interest and comprise one or more genetic features selected for their ability to distinguish among and between clonally related organisms.
In other aspects, methods and compositions are provided for determining the presence or absence of a SNP from a predictive clonotype in Klebsiella. Briefly, using a clonotype-generating method as described herein, a clonotype comprised of 7 single nucleotide polymorphisms (SNPs) was interrogated using PCR and shown to predict clonal relatedness and antibiotic resistance of Klebsiella more accurately, quickly, and cost-effectively than standard multi-locus sequencing. High-fidelity primers that selectively bind and amplify the Klebsiella SNP sequences are also provided herein. Kits comprising the primers and optional additional reagents, including an optional Lookup Table that informs treatment selections based on predicted antibiotic susceptibility and Klebsiella clonotype.
In further aspects, methods are provided for determining antibiotic susceptibility of Klebsiella based on the presence or absence of the 7-SNP clonotype. In another aspect, the present disclosure provides methods for treating a Klebsiella infection in a patient.
Prior to setting forth this disclosure in more detail, it may be helpful to an understanding thereof to provide definitions of certain terms to be used herein. Additional definitions are set forth throughout this disclosure.
In the present description, any concentration range, percentage range, ratio range, or integer range is to be understood to include the value of any integer within the recited range and, when appropriate, fractions thereof (such as one tenth and one hundredth of an integer), unless otherwise indicated. Also, any number range recited herein relating to any physical feature, such as polymer subunits, size or thickness, are to be understood to include any integer within the recited range, unless otherwise indicated. As used herein, the term “about” means±20% of the indicated range, value, or structure, unless otherwise indicated. It should be understood that the terms “a” and “an” as used herein refer to “one or more” of the enumerated components. The use of the alternative (e.g., “or”) should be understood to mean either one, both, or any combination thereof of the alternatives. As used herein, the terms “include,” “have” and “comprise” are used synonymously, which terms and variants thereof are intended to be construed as non-limiting.
In addition, it should be understood that the individual compounds, or groups of compounds, derived from the various combinations of the structures and substituents described herein, are disclosed by the present application to the same extent as if each compound or group of compounds was set forth individually.
The term “consisting essentially of” is not equivalent to “comprising” and refers to the specified materials or steps of a claim, or to those that do not materially affect the basic characteristics of a claimed subject matter.
As used herein, the terms “phylogroup,” “phylogenetic group,” or “clonal group” are used to refer to a group of organisms having a common developmental or evolutionary history. Phylogroups may be determined on the basis of shared genetic markers, such as DNA or RNA sequences, and are sometimes depicted in a phylogenetic tree showing the evolutionary relationships between and among phylogenetic groups (e.g., similarities and differences in physical or genetic characteristics) and in relation to a common ancestor.
As used herein, the term “clonotype” refers to a set of genetic features specific to a genetically related lineage of organisms. In certain embodiments of the present disclosure, a clonotype is characterized by the presence or absence of one or more genetic markers, e.g., SNPs. In specific embodiments, a clonotype is characterized by the presence or absence of SNPs within a set of SNPs, such as a set of 5, 6, 7, 8, 9 or 10 (or more) SNPs that make up a clonotype.
As used herein, a “sequence type” (also referred to herein as a “ST”) refers to a group of organisms that share a particular combination of alleles along particular genetic loci, as determined by sequencing. In bacterial studies, sequence types are typically determined by multilocus sequence typing (MLST) comparison of alleles, wherein a bacterial isolate is characterized by DNA sequences of internal fragments of multiple housekeeping genes. Sequence typing and MLST are discussed in further detail in Larsen et al., J. Clin. Microbiol. 50(4):1355-1361 (2012), the typing techniques of which are incorporated by reference herein in their entirety. In certain embodiments, a clonotype may include organisms of one ST or from a plurality of sequence types. Loci chosen for sequence typing (or generating a clonotype) can include “housekeeping” genes that are constitutively expressed and are required for basic organismal maintenance and function; e.g., metabolism, cell growth, or the like. Chosen loci may also contain or be from coding regions for proteins specific to the type of organism of interest. For example, loci for sequence typing or for generating a clonotype for T cells can be from coding regions for T cell receptor components (e.g., V, D, and J variable region alleles), CD8 or CD4 co-receptor components, CD3 components, and so on. Loci useful for typing bacterial pathogens include coding regions for virulence factors, such as, for example, adhesins, hyaluronidases, proteases, lipases, DNases, hemolysins, endotoxins, exotoxins, iron-binding proteins, capsules, adhesion pili, flagella, lipopolysaccharides, or the like.
As used herein, a “clonal complex” refers to a collection of sequence types sharing common alleles, e.g., a collection of sequence types connected on a spanning tree (see, e.g., Teixeira et al., PLoS One 10(3):e0119315 (2015)) constructed using multi-locus sequence typing. As used herein, a “clonal sub-complex” refers to a collection of sequence types that are connected with a common founder sequence type by shared common alleles.
As used herein, “nucleic acid” or “nucleic acid molecule” or “polynucleotide” refers to any of deoxyribonucleic acid (DNA), ribonucleic acid (RNA), oligonucleotides, fragments generated, for example, by the polymerase chain reaction (PCR) or by in vitro translation, and fragments generated by any of ligation, scission, endonuclease action, or exonuclease action. In certain embodiments, the nucleic acids of the present disclosure are produced by PCR. Nucleic acids may be composed of monomers that are naturally occurring nucleotides (such as deoxyribonucleotides and ribonucleotides), analogs of naturally occurring nucleotides (e.g., α-enantiomeric forms of naturally-occurring nucleotides), or a combination of both. Modified nucleotides can have modifications in or replacement of sugar moieties, or pyrimidine or purine base moieties. Nucleic acid monomers can be linked by phosphodiester bonds or analogs of such linkages. Analogs of phosphodiester linkages include phosphorothioate, phosphorodithioate, phosphoroselenoate, phosphorodiselenoate, phosphoroanilothioate, phosphoranilidate, phosphoramidate, and the like. Nucleic acid molecules can be either single stranded or double stranded. In certain embodiments of the present disclosure, nucleotides or gaps in a nucleotide sequence are named according to standard IUPAC convention, i.e., A, T, C, G, U, R (A or G), Y (C or T), S (G or C), W (A or T), K (G or T), M (A or C), B (C or G or T), D (A or G or T), H (A or C or T), V (A or C or G), N (any base), or −(gap).
The term “isolated” means that the material is removed from its original environment (e.g., the natural environment if it is naturally occurring). For example, a naturally occurring nucleic acid present in a microorganism is not isolated, but the same nucleic acid, separated from some or all of the co-existing materials in the natural system, is isolated. The term “gene” means the segment of DNA involved in producing a polypeptide chain; it includes regions preceding and following the coding region “leader and trailer” as well as intervening sequences (introns) between individual coding segments (exons). A “locus” (plural: loci) is a specific location of a gene or DNA sequence in or on a chromosome. “Alleles” are variants of a DNA sequence located at a given locus.
The term “nucleic acid amplification process” or “nucleic acid amplification reaction” refers to any process or reaction for specifically amplifying (i.e., generating one or more copies of) a target nucleic acid sequence, such as a DNA, a RNA, a or a cDNA, e.g., DNA from a pathogenic bacterium or a cell, such as a human cell. Numerous methods for amplifying nucleic acids are known, including various types of polymerase chain reaction (PCR; e.g., quantitative PCR such as QRT-PCR, ligation-mediated PCR, RT-PCR, amplified fragment length polymorphism, digital PCR, assembly PCR, touchdown PCR, nested PCR, multiplex PCR, and the like, which methods, related reagents, common reaction parameters, and common variations thereon, are known to those of ordinary skill in the art). Illustrative methods include loop-mediated isothermal amplification (LAMP) protocols that include two or three layers (depending on the number of primer pairs used) of specificity control, which can use colorimetry (double-stranded DNA dyes) or simple turbidity (Mg2P2O7 precipitation) for the reaction read-out. Other isothermal amplification methods include recombinase polymerase amplification (RPA) and helicase-dependent amplification (HAD). Both methods utilize colorimetry for detection, using essentially the same instrumentation platforms as LAMP.
“Sequence identity,” as used herein, refers to the percentage of amino acid residues in one sequence that are identical with the amino acid residues in another reference polypeptide sequence after aligning the sequences and introducing gaps, if necessary, to achieve the maximum percent sequence identity, and not considering any conservative substitutions as part of the sequence identity. The percentage sequence identity values can be generated using the NCBI BLAST2.0 software as defined by Altschul et al. (1997) “Gapped BLAST and PSI-BLAST: a new generation of protein database search programs”, Nucleic Acids Res. 25:3389-3402, with the parameters set to default values.
Certain tools of statistical analysis (e.g., two-sided one-sample t-test, two-tailed Fisher's exact test) are referred to herein. In certain embodiments, modified statistical tools are referred to, which are described in detail herein.
In certain aspects, the present disclosure provides methods for generating a clonotype. Clonotypes generated using the methods of this disclosure are useful for quickly and reliably identifying genetically related organisms without the need for potentially costly and time-consuming nucleic acid sequencing. Such generated clonotypes may be used, for example, to develop therapeutic regimens based on susceptibility to particular therapies.
To generate a clonotype, genetic features must be identified that are sufficiently common among individual organisms but that are also representative of the genetic diversity within a larger group of organisms; e.g., a species or a genus. More specifically, genetic features are selected for having refining power to identify closely related groups of organisms with a high degree of precision, while genetic features identified as having low or no refining power are excluded. Briefly, methods comprise: (a) generating a full binary data set from a nucleotide position library; (b) generating a reduced binary data set; (c) generating a Polymorphic Information Content (PIC) of each nucleotide position in the reduced binary data set; (d) identifying all possible pairs of nucleotide positions in the reduced binary data set; (e) generating a PIC differential; and (f) selecting non-discarded nucleotide positions to generate a clonotype. Each of these steps of the clonotype generation method is addressed in turn herein.
In some embodiments, presently disclosed methods for generating a clonotype include providing, obtaining, or constructing a nucleotide position library based on aligned and concatenated sequences from a genetic locus or from genetic loci, such as, for example, two or more alleles corresponding to a locus. As used herein, a “nucleotide position library” refers to a collection of nucleotide (e.g., purine or pyrimidine base) positions of an input nucleotide sequence or sequences. In certain embodiments, a position in a nucleotide position library corresponds to the position of the nucleotide along the input sequence; e.g., a third base in a genetic locus, such as a “G” of an “ATG” start codon of a locus is in a third nucleotide position in a library based on the locus sequence, and the “A” and “T” of the start codon are respectively in first and second positions of the library. A nucleotide position library may comprise or consist of any number of nucleotide positions (e.g., 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10 nucleotide positions, or tens, or hundreds, or thousands, or tens of thousands, or hundreds of thousands, or millions of nucleotide positions, and may, in certain embodiments, comprise nucleotide positions comprising an entire genome), preferably 7 nucleotide positions. By way of illustration, Table 1 below provides an example of an initial nucleotide position library that includes 6 positions in 5 alleles.
The initial nucleotide position library is then converted into a refined nucleotide position library by removing non-informative nucleotide positions. Exemplary non-informative nucleotide positions include positions having gaps or those that are monomorphic (i.e., having the same base in the same position in all of the sequences of a library), which are removed from the initial library. By way of illustration, the nucleotide position library of Table 1 includes non-informative positions at nucleotide positions 2 (monomorphic) and 5 (gap in Allele 2), so these positions are removed to produce the refined nucleotide position library shown in Table 2.
Next, nucleotides at the remaining positions within the library are assigned binary data values according to their frequency of occurrence at the position. More particularly, a most-frequently occurring nucleotide at a position is assigned a first binary value, and all other nucleotide bases occurring at the position are assigned the other, different binary value, to generate a full binary data set; i.e., a binary data set that is obtained by removing gapped positions and monomorphic positions from a nucleotide position library and assigning first and second binary values to all remaining positions in the nucleotide position library as described herein. For example, for Position 1 in Table 2 above, “T” is assigned a “1”, while “A”, “G”, and “C” are each assigned a “0”. For Position 3 in Table 2, “G” is assigned a “1”, while “C”, and “A” are each assigned a “0”. For Position 4 in Table 2, “G” is assigned a “1”, while “T”, “C”, and “A” are each assigned a “0”. For Position 6 in Table 3, “T” is assigned a “1”, while “A”, “G”, and “C” are each assigned a “0”.
Assigning binary values to the refined nucleotide library shown in Table 2 generates a full binary data set, as exemplified in Table 3.
The assigned binary values are arbitrary and may be assigned any binary value of interest, provided that the value usage is applied consistently across positions and alleles in the library. For example, although Table 3 shows the most-frequently occurring nucleotide at each position assigned a “1” and the other nucleotide bases occurring at the position assigned a “0”, the opposite binary values may be assigned; i.e., the most-frequently occurring nucleotide at each position may be assigned a “0” and the other nucleotide bases occurring at the position may be assigned a “1”.
Next, nucleotide positions having identical (e.g., “1-0-1-0-1-0” vs. “1-0-1-0-1-0”) or reverse-identical (e.g., “1-0-1-0-1-0” vs. “0-1-0-1-0-1”) binary distribution patterns relative to other nucleotide positions provide no refining power over one another and, therefore, one nucleotide position of each pair of identical or reverse-identical nucleotide positions is removed from the library to generate a reduced binary data set, as shown in Table 4. For example, Positions 4 and 6 in Table 3 have identical distribution patterns, and, therefore, either Position 4 or Position 6 would be removed from the library to generate a reduced binary data set.
The polymorphic information content (PIC) of each nucleotide position remaining in the reduced binary data set is calculated using a novel, modified version of the PIC calculation provided in Botstein et al., Am. J. Hum. Genet. 32:314-331 (1980), which modified version considers the subtracted square values of the frequency of each nucleotide at the position to provide information on the value of the nucleotide position as a marker of relatedness and diversity. By way of example, two nucleotide positions within a library of ten alleles have nucleotide frequencies as follows:
PIC=1−((0.9)2+(0.1)2)=(1−(0.82))=0.18;
PIC=1−((0.5)2+(0.5)2)=(1−(0.5))=0.5.
Accordingly, a difference at position 2 separates a group of 10 alleles into two groups of 5 alleles each and is useful as a marker of diversity within the population of alleles. In contrast, a difference at position 1 separates a group of 10 alleles into one group of 9 alleles and one group of 1 allele, and is therefore less valuable as an identifier of diversity within the group of alleles. Accordingly, position 2 has higher discriminatory value than position 1.
Next, all possible pairs of nucleotide positions in the reduced binary data set are identified, and a PIC differential is generated by comparing (i) the pairwise sum of binary distribution differences between two nucleotide positions of each possible pair and (ii) the overall mean sum of binary distribution differences of all possible pairs. If the pairwise sum of binary distribution differences of (i) is smaller than the overall mean sum of the binary distribution differences of (ii), the nucleotide position with the lower PIC of the two nucleotide positions in a pair is discarded.
By way of illustration, the following calculations are made using a hypothetical data set of ten alleles (Q-Z) having three nucleotide positions (1-3) therein:
For each of the position pairs: 1 vs. 2 and 1 vs. 3, the pairwise sum of differences (5) is greater than the overall mean sum of differences in the set (4.67). For position pair: 2 vs. 3, however, the pairwise sum (4) is smaller than the overall mean sum of differences in the set (4.67). This means that the distribution of nucleotides in positions 2 and 3 is less diverse than in the data set as a whole. Thus, the nucleotide position (position 3) having the lower PIC of the position pair: 2 vs. 3 is discarded.
Finally, non-discarded nucleotide positions are selected based on PIC values to generate a clonotype. In certain embodiments, nucleotide positions are sorted in decreasing order of PIC values, and positions having higher PIC values are chosen for the clonotype (e.g., all nucleotide positions with PIC values above a certain predetermined threshold, or simply a predetermined number of nucleotides having the highest PIC values, such as, for example, the 5, 6, 7, 8, 9, 10, 100, etc., PIC values).
In certain embodiments, the present disclosure provides a method for generating a clonotype, wherein the method comprises:
(a) generating a full binary data set from a nucleotide position library, wherein the full binary data set comprises, for each nucleotide position in the library,
wherein the nucleotide position library comprises an aligned, concatenated nucleic acid sequence set obtained from one or more loci in a genome, in which (i) nucleotide positions with a gap and (ii) nucleotide positions that are monomorphic in the sequence set are discarded;
(b) generating a reduced binary data set, comprising discarding from the full binary data set one nucleotide position from each pair of nucleotide positions having identical or reverse-identical binary distribution patterns;
(c) generating a Polymorphic Information Content (PIC) of each nucleotide position in the reduced binary data set, wherein
PIC=[1−Σ(frequency of the assigned first binary value at the position)2+(frequency of assigned second binary value at the position)2)];
(d) identifying all possible pairs of nucleotide positions in the reduced binary data set;
(e) generating a PIC differential, wherein the PIC differential comprises:
wherein the nucleotide position with the lower PIC of the two nucleotide positions in a pair is discarded when the pairwise sum of binary distribution differences of (i) is smaller than the overall mean sum of the binary distribution differences of (ii); and
(f) selecting non-discarded nucleotide positions to generate a clonotype.
In further embodiments, a method for generating a clonotype further comprises, following (e)(ii) and prior to (f), ordering the non-discarded nucleotide positions according to PIC value; e.g., PIC (position 1)>PIC (position 2)>PIC (position 4)>PIC (position 4)>PIC (position 5). Without wishing to be bound by theory, nucleotide positions having a higher PIC value have higher diversity than nucleotide positions having a lower PIC value, and may provide for a more informative clonotype.
In any of the aforementioned embodiments, a nucleotide position library can comprise nucleic acid sequences from one or more one allele of the one or more locus. Allelic sequences for various loci of various organisms are available online, including at, for example, the National Center for Biotechnology Information (NCBI) online (ncbi.hlm.nih.gov).
In any of the aforementioned embodiments, a nucleotide position library of the present disclosure can comprise nucleic acid sequences from a bacterium; a human cell, which in some embodiments comprises a T cell (see, e.g., Thor Straten et al., J. Transl. Med. 2(1):11 (2004)); a tumor; a non-human animal; or a plant.
In certain embodiments, a nucleotide position library comprises sequences a bacterium, such as, for example, an infectious bacterium. In particular embodiments, a nucleotide position library comprises sequences from: Acinetobacter baumannii; Actinomyces israelii; Actinomyces gerencseriae; Anaplasma species; Ancylostoma braziliense; Angiostrongylus; Anisakis; Arcanobacterium haemolyticum; Junin virus; Ascaris lumbricoides; Aspergillus species; an Astroviridae family member; Anaplasma phagocytophilum; Actinomycetoma sp.; Babesia sp.; Bacillus anthracis, Bacillus cereus; Bacillus sp.; Bacteroides sp.; Balantidium coli; Bartonella; Batrachochytrium dendrabatidis; Baylisascaris species; Blastocystis sp.; Blastomyces dermatitidis; Bartonella bacilliformis; Bartonella henselae; Borrelia burgdorferi; Borrelia hermsii; Borrelia recurrentis; Borrelia garinii; Borrelia afzeliil; Bordetella pertussis; Brucella sp.; Brevibacterium sp.; Burkholderia mallei; Burkholderia pseudomallei; Burkholderia cepacia; Campylobacter sp.; Candida sp.; Capillaria philippinensis; Capillaria hepatica; Capillaria aerophile, Chlamydia trachomatis; Chlamydophila pneumoniae; Chlamydophila psittaci; Citrobacter freundii; Citrobacter koserii; Citrobacter sedlakii; Citrobacter sp.; Clonorchis sinensis; Corynebacterium diphtheria; Corynebacterium sp; Clostridium botulinum; Clostridium difficile; Clostridium tetani; Clostridium perfringens; Clostridium sp; Coxiella burnetii; Cryptococcus neoformans; Cryptosporidium sp.; Cyclospora cayetanensis; Escherichia coli; Escherichia coli O157:H7, Escherichia coli O111; Escherichia coli O104.H4; Ehrlichia ewingii; Ehrlichia chaffeensis; Ehrlichia sp.; Echinococcus sp.; Enterococcus faecalis; Enterococcus faecium; Enterococcus sp.; Entamoeba histolytica; Enterobacter aerogenes; Enterobacter cloacae; Fusobacterium sp.; Fonsecaea pedrosoi; Francisella tularensis; Geotrichum candidum; Haemophilus ducreyi; Haemophilus influenza; Helicobacter pylori; Klebsiella pneumoniae; Klebsiella oxytoca; Klebsiella granulomatis; Klebsiella variicola; Klebsiella sp.; Kingella kingae; Kluyvera ascorbate, Legionella pneumophila; Leptospira sp.; Listeriamonocytogenes; Mycobacterium tuberculosis; Mycobacterium ulcerans; Mycobacterium leprae; Mycobacterium lepromatosis; Mycoplasma pneumoniae; Moraxella sp.; Morganella morganii; Neisseria gonorrhoeae; Neisseria meningitides; Nocardia asteroids; Piedraia hortae; Pantoea agglomerans; Pseudomonas aeruginosa; Pseudomonas sp.; Proteus mirabilis; Proteus sp.; Pasteurella sp.; Prevotella sp.; Propionibacterium propionicus; Rickettsia rickettsia; Rickettsia prowazekii; Rickettsia typhi; Rickettsia akari; Raoultella ornithinolytica; Raoultella planticola; Raoultella sp.; Streptococcus pneumoniae; Streptococcus pyogenes; Streptococcus agalactiae; Streptococcus sp.; Salmonella enterica subsp. Enterica; serovar typhi; Salmonella sp.; Shigella sp.; Staphylococcus aureus; Staphylococcus saprophyticus; Staphylococcus epidermidis; Staphylococcus haemolyticus Staphylococcus sp.; Serratia marcensens; Serratia liquefaciens; Serratia grimesii; Serratia maltophilia; Trypanosoma brucei; Trichosporon beigelii; Ureaplasma urealyticum; Vibrio cholera; Vibrio vulnificus; Vibrio parahaemolyticus; Yersinia pestis; Yersinia enterocolitica, or Yersinia pseudotuberculosis.
Nucleotide sequences of various organisms, such as infectious bacteria, and including sequences from bacterial strains and alleles of bacterial genes, can be readily found using, for example, the ENTREZ genome browsing tool (ncbi.nlm.nih.gov) or the PATRIC genome browser (e.g., v3.5.11; patricbrc.org/view/DataType/Genomes).
In some embodiments, a nucleotide position library comprises sequences from one or more loci that is associated with a predetermined sequence type (ST) or multi-locus sequence typing (MLST) scheme (see, e.g., Larsen et al., J. Clin. Microbiol. 50(4):1355-1361 (2012), the typing techniques of which are incorporated by reference herein in their entirety; see also the online MLST databases at pubmlst.org/databases.shtml, which organisms, databases, sequence types, isolates, and published references are incorporated herein by reference. However, a predetermined sequence type or MLST scheme is not required, and a nucleotide position library of the present disclosure may be constructed using whole genome sequence or sequences from any part thereof, and in the absence of a known ST or MLST scheme.
In certain embodiments, generating a clonotype comprises selecting 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 40, 50, 60, 70, 80, 90, 100, or more non-discarded nucleotide positions, each having a PIC value above a pre-determined threshold PIC value or otherwise having a desired PIC value. In particular embodiments, selecting the one or more nucleotide positions comprises selecting 7 nucleotide positions (e.g., for testing via PCR using an 8-well PCR tube strip or plate, with a control reaction in the 8th well).
In further embodiments, a clonotype-generating method of the instant disclosure further comprises testing the generated clonotype on a sample comprising nucleic acids from the organism or cell type of interest (i.e., the organism or cell type from which the nucleic acid sequences comprising the nucleotide position library were obtained), wherein the organism or cell type of interest is of one or more predetermined sequence type, wherein the testing comprises:
(a) performing an amplification reaction on the sample using forward and reverse primers for the clonotype nucleotide positions; and
(b) comparing results from the amplification reaction with the one or more predetermined sequence type.
In some embodiments, the nucleic acid amplification reaction comprises a polymerase chain reaction (PCR), such as, for example, a quantitative polymerase chain reaction (qPCR).
In certain embodiments, primers are provided for use in nucleic acid amplification processes of this disclosure; e.g., to detect the presence or absence of SNPs in Klebsiella, or to test a generated clonotype on a sample. As referred to herein, primers (also referred to herein as forward polynucleotides, forward oligonucleotides, reverse polynucleotides, or reverse oligonucleotides) are typically short oligonucleotides (e.g., ranging from about 10 to about 35 bases) that serve as the starting material for a nucleic acid amplification process. Primers typically include at least one region of sequence that is complementary to a target sequence to be amplified, and in some cases are perfectly complementary to a target sequence over their full length. In certain embodiments, a primer contains one or more introduced SNP relative to the complementary sequence of the target sequence, such that the primer sequence is not perfectly complementary to the nucleic acid sequence to be amplified in at least one nucleotide position. Without wishing to be bound by theory, certain such introduced SNPs allow for improved fidelity in target-specific nucleic acid amplification reactions relative to reference a primer sequence that does not contain the introduced one or more SNPs, and can thereby reduce or eliminate false positive results. In some embodiments, a primer comprises a deletion (i.e., one or more missing nucleotide, which may comprise contiguous missing nucleotides) relative to the template sequence to be amplified.
In some embodiments, primers of the present disclosure are designed to hybridize with sequences that contain genetic features (e.g., SNPs) identified according to the presently disclosed clonotyping methods (e.g., Klebsiella SNPs as disclosed herein) and are suitable for use in any nucleic acid amplification process. Although the results of a nucleic acid amplification process of the present disclosure can be evaluated by turbidity or using UV-light (e.g., SYBR-Green dye), other known methods and instruments may be used to visualize a result of a nucleic acid amplification process, such as, for example, the ESE-Quant Tube Scanner (Qiagen, Inc), the Genie IITM (Pro-Lab Diagnostics, Inc.), the Rotor-Gene Q instrument for RT-PCR, or the like.
In certain embodiments, a primer comprises a forward primer sequence or a reverse primer sequence according to any one of SEQ ID NOs: 1-48, 60, or 61. In further embodiments, a primer pair is provided, wherein the primers are capable of selectively amplifying a target sequence comprising a phoE54 SNP, a rpoB130 SNP, a infB279 SNP, a mdh315 SNP, a phoE336 SNP, a phoE354 SNP, or a mdh429 SNP. In some embodiments, primer pairs are provided that comprise one or more of the following primer pairs:
(i) a phoE54 forward primer comprising or consisting of the nucleic acid sequence of SEQ ID NO:4 and a phoE54 reverse primer comprising or consisting of the nucleic acid sequence of SEQ ID NO:8 or SEQ ID NO:11,
(ii) a rpoB130 forward primer comprising or consisting of the nucleic acid sequence of SEQ ID NO:20 or SEQ ID NO:19 and a rpoB130 reverse primer comprising or consisting of the nucleic acid sequence of SEQ ID NO:25 or SEQ ID NO:60,
(iii) a infB279 forward primer comprising or consisting of the nucleic acid sequence of SEQ ID NO:29 and a infB279 reverse primer comprising or consisting of the nucleic acid sequence of SEQ ID NO:33 or SEQ ID NO:35,
(iv) a mdh315 forward primer comprising or consisting of the nucleic acid sequence of SEQ ID NO:41 or SEQ ID NO:49 and a mdh315 reverse primer comprising or consisting of the nucleic acid sequence of SEQ ID NO:42,
(v) a phoE336 forward primer comprising or consisting of the nucleic acid sequence of SEQ ID NO:10 and a phoE336 reverse primer comprising or consisting of the nucleic acid sequence of SEQ ID NO:11,
(vi) a phoE354 forward primer comprising the nucleic acid sequence of SEQ ID NO:16 and a phoE354 reverse primer comprising the nucleic acid sequence of SEQ ID NO:11, and
(vii) a mdh429 forward primer comprising or consisting of the nucleic acid sequence of any one of SEQ ID NOs:45, 41, or 49, and a mdh429 reverse primer comprising or consisting of the nucleic acid sequence of SEQ ID NO:48.
As used herein, “phoE54” refers to a SNP that is an “A” nucleotide at position 54 of the gene encoding outer membrane pore protein E (phoE) in Klebsiella. UniProt entries exist for phoE of a number of Klebsiella species, strains, and subspecies, including K. pnemoniae, K. oxytoca, K. michiganensis, K. quasipneumoniae, K. aerogenes, K. LTGPAF-6F, K. OBRC7, K. RIT-PI-d, K. variicola, and related strains and subspecies.
As used herein, “rpoB130” refers to a SNP that is a “G” nucleotide at position 130 of the gene encoding DNA-directed RNA polymerase subunit beta in Klebsiella. UniProt entries exist for rpoB of a number of Klebsiella species, strains, and subspecies, including K. pnemoniae, K. oxytoca, K. michiganensis, K. quasipneumoniae, K. aerogenes, K. LTGPAF-6F, K. OBRC7, K. RIT-PI-d, K. variicola, and related strains and subspecies.
As used herein, “infB279” refers to a SNP that is a “T” nucleotide at position 279 of the gene encoding translation initiation factor IF-2 (infB) in Klebsiella. UniProt entries exist for infB of a number of Klebsiella species, strains, and subspecies, including K. pnemoniae, K. oxytoca, K. michiganensis, K. quasipneumoniae, K. aerogenes, K. LTGPAF-6F, K. OBRC7, K. RIT-PI-d, K. variicola, and related strains and subspecies.
As used herein, “mdh315” refers to a SNP that is a “C” nucleotide at position 279 of the gene encoding methanol dehydrogenase (mdh) in Klebsiella. UniProt entries exist for mdh of a number of Klebsiella species, strains, and subspecies, including K. pnemoniae, K. oxytoca, K. michiganensis, K. quasipneumoniae, K. aerogenes, K. LTGPAF-6F, K. OBRC7, K. RIT-PI-d, K. variicola, and related strains and subspecies.
As used herein, “phoE336” refers to a SNP that is a “G” nucleotide at position 336 of the Klebsiella phoE gene. As used herein, “phoE354” refers to a SNP that is a “C” nucleotide at position 354 of the Klebsiella phoE gene.
As used herein, “mdh429” refers to a SNP that is a “C” nucleotide at position 429 of the Klebsiella mdh gene.
In certain embodiments, the primer pairs comprise:
(i) a phoE54 forward primer comprising or consisting of the nucleic acid sequence of SEQ ID NO:4 and a phoE54 reverse primer comprising or consisting of the nucleic acid sequence of SEQ ID NO:8 or SEQ ID NO:11,
(ii) a rpoB130 forward primer comprising or consisting of the nucleic acid sequence of SEQ ID NO:20 or SEQ ID NO:19 and a rpoB130 reverse primer comprising or consisting of the nucleic acid sequence of SEQ ID NO:25 or SEQ ID NO:60,
(iii) a infB279 forward primer comprising or consisting of the nucleic acid sequence of SEQ ID NO:29 and a infB279 reverse primer comprising or consisting of the nucleic acid sequence of SEQ ID NO:33 or SEQ ID NO:35,
(iv) a mdh315 forward primer comprising or consisting of the nucleic acid sequence of SEQ ID NO:41 or SEQ ID NO:49 and a mdh315 reverse primer comprising or consisting of the nucleic acid sequence of SEQ ID NO:42,
(v) a phoE336 forward primer comprising or consisting of the nucleic acid sequence of SEQ ID NO:10 and a phoE336 reverse primer comprising or consisting of the nucleic acid sequence of SEQ ID NO:11,
(vi) a phoE354 forward primer comprising the nucleic acid sequence of SEQ ID NO:16 and a phoE354 reverse primer comprising the nucleic acid sequence of SEQ ID NO:11, and
(vii) a mdh429 forward primer comprising or consisting of the nucleic acid sequence of any one of SEQ ID NOs:45, 41, or 49, and a mdh429 reverse primer comprising or consisting of the nucleic acid sequence of SEQ ID NO:48.
In certain embodiments, a Klebsiella SNP is located within a sequence that is hybridized by, or is amplified by an amplification reaction containing, a forward primer or a reverse primer of the present disclosure, as shown in Table 11.
In certain embodiments, a primer of the instant disclosure can comprise a naturally occurring nucleotide, a modified nucleotide, or both. “Naturally occurring nucleotides” includes deoxyribonucleotides and ribonucleotides. The term “modified nucleotides” includes nucleotides with modified or substituted sugar groups or the like (e.g., modified with bromouridine, arabinoside, or 2′3′-dideoxyribose). The term “oligonucleotide linkages” includes oligonucleotide linkages such as phosphorothioate, phosphorodithioate, phosphoroselenoate, phosphorodiselenoate, phosphoroanilothioate, phoshoraniladate, phosphoroamidate, or the like. See, e.g., LaPlanche et al., 1986, Nucl. Acids Res., 14:9081; Stec et al., 1984, J. Am. Chem. Soc., 106:6077; Stein et al., 1988, Nucl. Acids Res., 16:3209; Zon et al., 1991, Anti-Cancer Drug Design, 6:539; Zon et al., 1991, OLIGONUCLEOTIDES AND ANALOGUES: A PRACTICAL APPROACH, pp. 87-108 (F. Eckstein, Ed.), Oxford University Press, Oxford England; Stec et al., U.S. Pat. No. 5,151,510; Uhlmann and Peyman, 1990, Chemical Reviews, 90:543, the disclosures of which are hereby incorporated by reference for any purpose. A primer or a nucleotide of this disclosure can, in some embodiments, include a detectable label to enable detection of the primer or hybridization thereof.
In some embodiments, primers are provided that are capable of hybridizing under moderate to high stringency conditions to a target sequence as provided herein, or a fragment thereof, or a complementary sequence thereof. By way of illustration, suitable moderately stringent conditions for testing the hybridization of a polynucleotide as provided herein with other polynucleotides include prewashing in a solution of 5×SSC, 0.5% SDS, 1.0 mM EDTA (pH 8.0); hybridizing at 50° C.-60° C., 5×SSC, overnight; followed by washing twice at 65° C. for 20 minutes with each of 2×, 0.5× and 0.2×SSC containing 0.1% SDS. The stringency of hybridization can be readily manipulated, such as by altering the salt content of the hybridization solution and/or the temperature at which the hybridization is performed. For example, suitable highly stringent hybridization conditions include those described above, with the exception that the temperature of hybridization is increased, e.g., to 60° C.-65° C. or 65° C. 70° C. As described herein, primers that are “specific for” a particular target or template sequence can hybridize with the target or template sequence at a temperature of at least about 57° C. or above.
Also provided herein are kits that comprise one or more of the herein disclosed primers or primer pairs and optional additional components. In certain embodiments, a kit is provided that comprises:
(a) forward primer and reverse primer pairs for at least seven Klebsiella single nucleotide polymorphisms (SNPs), wherein the SNPs comprise phoE54, rpoB130, infB279, mdh315, phoE336, phoE354, and mdh429, and wherein the primer pairs comprise one or more of the following primer pairs:
(i) a phoE54 forward primer comprising or consisting of the nucleic acid sequence of SEQ ID NO:4 and a phoE54 reverse primer comprising or consisting of the nucleic acid sequence of SEQ ID NO:8 or SEQ ID NO:11,
(ii) a rpoB130 forward primer comprising or consisting of the nucleic acid sequence of SEQ ID NO:20 or SEQ ID NO:19 and a rpoB130 reverse primer comprising or consisting of the nucleic acid sequence of SEQ ID NO:25 or SEQ ID NO:60,
(iii) a infB279 forward primer comprising or consisting of the nucleic acid sequence of SEQ ID NO:29 and a infB279 reverse primer comprising or consisting of the nucleic acid sequence of SEQ ID NO:33 or SEQ ID NO:35,
(iv) a mdh315 forward primer comprising or consisting of the nucleic acid sequence of SEQ ID NO:41 or SEQ ID NO:49 and a mdh315 reverse primer comprising or consisting of the nucleic acid sequence of SEQ ID NO:42,
(v) a phoE336 forward primer comprising or consisting of the nucleic acid sequence of SEQ ID NO:10 and a phoE336 reverse primer comprising or consisting of the nucleic acid sequence of SEQ ID NO:11,
(vi) a phoE354 forward primer comprising the nucleic acid sequence of SEQ ID NO:16 and a phoE354 reverse primer comprising the nucleic acid sequence of SEQ ID NO:11, and
(vii) a mdh429 forward primer comprising or consisting of the nucleic acid sequence of any one of SEQ ID NOs:45, 41, or 49, and a mdh429 reverse primer comprising or consisting of the nucleic acid sequence of SEQ ID NO:48;
(b) optional additional reagents for performing a nucleic acid amplification reaction (e.g., one or more of a polymerase, such as a Taq polymerase, a buffer for the polymerase, a polymerase reaction cofactor such as MgCl2, Mg2+, K+, a nucleotide mix, a nucleic acid stain such as SYBR Green 1, dimethylsulfoxide (DMSO), sterile water, formamide, bovine serum albumin (BSA), and Betaine);
(c) an optional Lookup Table; and
(d) an optional instruction for identifying a Klebsiella clonotype and determining the Klebsiella susceptibility to one or more antibiotics.
In further embodiments, the Lookup Table is Lookup Table 1. The information contained in Lookup Table 1 below was obtained from urine specimens from patients with Klebsiella infections from several clinics within different regions of the United States. CT=clonotypes determined using a 7-SNP combination: phoE54; rpoB130; infB279; mdh315; phoE336; phoE354; and mdh429. Other abbreviations: ampicillin (AMP), amoxicillin/clavulanate (AMC), CS1 (first generation cephalosporins), CS3 (third generation cephalosporins), trimethoprim/sulfamethoxazole (TS), ciprofloxacin (CIP), nitrofurantoin (NIT), imipenem (IMI), fosfomycin (FOS), tetracycline (TET), and ceftazidime vs ceftazidime/clavulanate to determine production of extended-spectrum beta-lactamases (ESBLs). “Y” and “N” indications of allowance or non-recommendation, respectively, are based on a 20% resistance threshold to the indicated antibiotic; Y indicates that more than 80% of the tested isolates within the indicated clonotype are susceptible to the indicated antibiotic, and N indicates that less than 80% of the isolates are susceptible to the antibiotic.
Klebsiella spp.
In further embodiments, a Lookup Table is constructed as described in U.S. Patent Publication No. US 2016/0251702; e.g., assigning the probability that an isolate belonging to a particular clonotype will be sensitive or resistant to different antibiotics on a scale from 0 to 100, with 0 being completely resistant and 100 being completely sensitive. If 90-100% bacteria that belong to the particular clonotype are sensitive to particular antibiotic, the respective cell in the Lookup Table is colored green, and this antibiotic is recommended to be used for treatment; pale green indicates 80-90% sensitivity level, and treatment is allowed too. Yellow (75-80%) and orange (70-75%) indicate that treatment is still allowed but with caution, and switching to a different antibiotic is recommended. Red indicates that more than 30% of bacteria are resistant to this antibiotic, and the latter should be rejected as a choice for treatment. Of course, it will be understood that a Lookup Table can be designed or adjusted to reflect differences in relevant characteristics of the clonotyped organism (e.g., resistance profiles of Klebsiella isolates (for example, such differences as may be seen from Klebsiella isolates from other collection points and/or other collection periods), antigen specificity of T cells, tumor susceptibility to chemotherapies and immunotherapies, etc.), and may include information about other or additional antibiotics or other reagents, and may provide still other information.
In particular embodiments, the primer pairs of a kit comprise:
(i) a phoE54 forward primer comprising or consisting of the nucleic acid sequence of SEQ ID NO:4 and a phoE54 reverse primer comprising or consisting of the nucleic acid sequence of SEQ ID NO:8 or SEQ ID NO:11,
(ii) a rpoB130 forward primer comprising or consisting of the nucleic acid sequence of SEQ ID NO:20 or SEQ ID NO:19 and a rpoB130 reverse primer comprising or consisting of the nucleic acid sequence of SEQ ID NO:25 or SEQ ID NO:60,
(iii) a infB279 forward primer comprising or consisting of the nucleic acid sequence of SEQ ID NO:29 and a infB279 reverse primer comprising or consisting of the nucleic acid sequence of SEQ ID NO:33 or SEQ ID NO:35,
(iv) a mdh315 forward primer comprising or consisting of the nucleic acid sequence of SEQ ID NO:41 or SEQ ID NO:49 and a mdh315 reverse primer comprising or consisting of the nucleic acid sequence of SEQ ID NO:42,
(v) a phoE336 forward primer comprising or consisting of the nucleic acid sequence of SEQ ID NO:10 and a phoE336 reverse primer comprising or consisting of the nucleic acid sequence of SEQ ID NO:11,
(vi) a phoE354 forward primer comprising the nucleic acid sequence of SEQ ID NO:16 and a phoE354 reverse primer comprising the nucleic acid sequence of SEQ ID NO:11, and
(vii) a mdh429 forward primer comprising or consisting of the nucleic acid sequence of any one of SEQ ID NOs:45 or 41 or 49, and a mdh429 reverse primer comprising or consisting of the nucleic acid sequence of SEQ ID NO:48.
In still further embodiments, at least two of the primer pairs selected from (a)(i)-(a)(vii) are mixed in a single container.
In another aspect, the present disclosure provides methods for determining the presence or absence of a single nucleotide polymorphism (SNP) in Klebsiella, wherein a method comprises performing a nucleic acid amplification process on DNA isolated from Klebsiella obtained from a patient sample, wherein the nucleic acid amplification process comprises use of forward and reverse primer pairs specific for at least seven different Klebsiella single nucleotide polymorphisms (SNPs), wherein the at least seven different SNPs comprise phoE54, rpoB130, infB279, mdh315, phoE336, phoE354, and mdh429, and determining the presence or absence of one or more of the phoE54, rpoB130, infB279, mdh315, phoE336, phoE354, and mdh429 SNPs.
Briefly, the presently disclosed SNPs were identified using a herein disclosed clonotype generating method as useful indicators of Klebsiella clonality and antibiotic resistance. In certain embodiments, the SNP-specific primer pairs comprise a forward and a reverse primer selected from SEQ ID NOs:1-48 and 60-61 that, in an amplification reaction, can specifically amplify a SNP-containing region of interest.
In certain embodiments, the primer pairs comprise one or more of the following primer pairs:
(i) a phoE54 forward primer comprising or consisting of the nucleic acid sequence of SEQ ID NO:4 and a phoE54 reverse primer comprising or consisting of the nucleic acid sequence of SEQ ID NO:8 or SEQ ID NO:11,
(ii) a rpoB130 forward primer comprising or consisting of the nucleic acid sequence of SEQ ID NO:20 or SEQ ID NO:19 and a rpoB130 reverse primer comprising or consisting of the nucleic acid sequence of SEQ ID NO:25 or SEQ ID NO:60,
(iii) a infB279 forward primer comprising or consisting of the nucleic acid sequence of SEQ ID NO:29 and a infB279 reverse primer comprising or consisting of the nucleic acid sequence of SEQ ID NO:33 or SEQ ID NO:35,
(iv) a mdh315 forward primer comprising or consisting of the nucleic acid sequence of SEQ ID NO:41 or SEQ ID NO:49 and a mdh315 reverse primer comprising or consisting of the nucleic acid sequence of SEQ ID NO:42,
(v) a phoE336 forward primer comprising or consisting of the nucleic acid sequence of SEQ ID NO:10 and a phoE336 reverse primer comprising or consisting of the nucleic acid sequence of SEQ ID NO:11,
(vi) a phoE354 forward primer comprising the nucleic acid sequence of SEQ ID NO:16 and a phoE354 reverse primer comprising the nucleic acid sequence of SEQ ID NO:11, and
(vii) a mdh429 forward primer comprising or consisting of the nucleic acid sequence of any one of SEQ ID NOs:45 or 41 or 49, and a mdh429 reverse primer comprising or consisting of the nucleic acid sequence of SEQ ID NO:48.
In certain embodiments, the primer pairs comprise:
(i) a phoE54 forward primer comprising or consisting of the nucleic acid sequence of SEQ ID NO:4 and a phoE54 reverse primer comprising or consisting of the nucleic acid sequence of SEQ ID NO:8 or SEQ ID NO:11,
(ii) a rpoB130 forward primer comprising or consisting of the nucleic acid sequence of SEQ ID NO:20 or SEQ ID NO:19 and a rpoB130 reverse primer comprising or consisting of the nucleic acid sequence of SEQ ID NO:25 or SEQ ID NO:60,
(iii) a infB279 forward primer comprising or consisting of the nucleic acid sequence of SEQ ID NO:29 and a infB279 reverse primer comprising or consisting of the nucleic acid sequence of SEQ ID NO:33 or SEQ ID NO:35,
(iv) a mdh315 forward primer comprising or consisting of the nucleic acid sequence of SEQ ID NO:41 or SEQ ID NO:49 and a mdh315 reverse primer comprising or consisting of the nucleic acid sequence of SEQ ID NO:42,
(v) a phoE336 forward primer comprising or consisting of the nucleic acid sequence of SEQ ID NO:10 and a phoE336 reverse primer comprising or consisting of the nucleic acid sequence of SEQ ID NO:11,
(vi) a phoE354 forward primer comprising the nucleic acid sequence of SEQ ID NO:16 and a phoE354 reverse primer comprising the nucleic acid sequence of SEQ ID NO:11, and
(vii) a mdh429 forward primer comprising or consisting of the nucleic acid sequence of any one of SEQ ID NOs:45 or 41 or 49, and a mdh429 reverse primer comprising or consisting of the nucleic acid sequence of SEQ ID NO:48.
In yet another aspect, the present disclosure provides methods for determining antibiotic susceptibility of a pathogenic organism, such as, e.g., an infectious bacteria, wherein the methods comprise (a) amplifying fragments from a genome (e.g., polynucleotide fragments from a genome) of the pathogenic organism using primer pairs specific for one or more SNPs, wherein the one or more SNPs constitute a clonotype, (b) detecting the presence or absence of the one or more SNPs to identify the clonotype, and (c) comparing the clonotype to a Lookup Table that correlates one or more clonotype of the pathogenic organism with an antibiotic susceptibility profile (e.g., whether known clonotypes of the pathogenic organism are known to be susceptible to one or more antibiotic or antibiotic class as described herein. In certain embodiments, the one or more SNPs that constitute a clonotype are identified using a clonotype-generating method of the present disclosure. In some embodiments, a method comprises (a) amplifying fragments comprising one or more SNPs of the present disclosure; e.g., phoE54, rpoB130, infB279, mdh315, phoE336, phoE354, and mdh429, in a pathogenic organism, (b) detecting the presence or absence of the one or more SNPs to identify the clonotype, and (c) comparing the clonotype to a Lookup Table.
For example, in certain embodiments, a method for determining antibiotic susceptibility of Klebsiella comprises:
(a) amplifying polynucleotide fragments from a Klebsiella genome using forward and reverse primer pairs specific for at least seven different Klebsiella single nucleotide polymorphisms (SNPs), wherein the at least seven different SNPs comprise phoE54, rpoB130, infB279, mdh315, phoE336, phoE354, and mdh429, and wherein the primer pairs comprise a phoE54-specific primer pair, a rpoB130-specific primer pair, a infB279-specific primer pair, a mdh315-specific primer pair, a phoE336-specific primer pair, a phoE354-specific primer pair, and a mdh429-specific primer pair, according to the SNP-specific forward and reverse primers of SEQ ID NOs: 1-48;
(b) detecting the presence or absence of one or more of the at least seven SNPs in the Klebsiella genome to identify the Klebsiella clonotype; and
(c) comparing the Klebsiella clonotype to a Lookup Table to determine the Klebsiella susceptibility to one or more antibiotics.
In certain embodiments, the primer pairs comprise one or more of the following primer pairs:
(i) a phoE54 forward primer comprising or consisting of the nucleic acid sequence of SEQ ID NO:4 and a phoE54 reverse primer comprising or consisting of the nucleic acid sequence of SEQ ID NO:8 or SEQ ID NO:11,
(ii) a rpoB130 forward primer comprising or consisting of the nucleic acid sequence of SEQ ID NO:20 or SEQ ID NO:19 and a rpoB130 reverse primer comprising or consisting of the nucleic acid sequence of SEQ ID NO:25 or SEQ ID NO:60,
(iii) a infB279 forward primer comprising or consisting of the nucleic acid sequence of SEQ ID NO:29 and a infB279 reverse primer comprising or consisting of the nucleic acid sequence of SEQ ID NO:33 or SEQ ID NO:35,
(iv) a mdh315 forward primer comprising or consisting of the nucleic acid sequence of SEQ ID NO:41 or SEQ ID NO:49 and a mdh315 reverse primer comprising or consisting of the nucleic acid sequence of SEQ ID NO:42,
(v) a phoE336 forward primer comprising or consisting of the nucleic acid sequence of SEQ ID NO:10 and a phoE336 reverse primer comprising or consisting of the nucleic acid sequence of SEQ ID NO:11,
(vi) a phoE354 forward primer comprising the nucleic acid sequence of SEQ ID NO:16 and a phoE354 reverse primer comprising the nucleic acid sequence of SEQ ID NO:11, and
(vii) a mdh429 forward primer comprising or consisting of the nucleic acid sequence of any one of SEQ ID NOs:45 or 41 or 49, and a mdh429 reverse primer comprising or consisting of the nucleic acid sequence of SEQ ID NO:48.
In certain embodiments, the primer pairs comprise:
(i) a phoE54 forward primer comprising or consisting of the nucleic acid sequence of SEQ ID NO:4 and a phoE54 reverse primer comprising or consisting of the nucleic acid sequence of SEQ ID NO:8 or SEQ ID NO:11,
(ii) a rpoB130 forward primer comprising or consisting of the nucleic acid sequence of SEQ ID NO:20 or SEQ ID NO:19 and a rpoB130 reverse primer comprising or consisting of the nucleic acid sequence of SEQ ID NO:25 or SEQ ID NO:60,
(iii) a infB279 forward primer comprising or consisting of the nucleic acid sequence of SEQ ID NO:29 and a infB279 reverse primer comprising or consisting of the nucleic acid sequence of SEQ ID NO:33 or SEQ ID NO:35,
(iv) a mdh315 forward primer comprising or consisting of the nucleic acid sequence of SEQ ID NO:41 or SEQ ID NO:45 and a mdh315 reverse primer comprising or consisting of the nucleic acid sequence of SEQ ID NO:42,
(v) a phoE336 forward primer comprising or consisting of the nucleic acid sequence of SEQ ID NO:10 and a phoE336 reverse primer comprising or consisting of the nucleic acid sequence of SEQ ID NO:11,
(vi) a phoE354 forward primer comprising the nucleic acid sequence of SEQ ID NO:16 and a phoE354 reverse primer comprising the nucleic acid sequence of SEQ ID NO:11, and
(vii) a mdh429 forward primer comprising or consisting of the nucleic acid sequence of any one of SEQ ID NOs:45 or 41 or 49, and a mdh429 reverse primer comprising or consisting of the nucleic acid sequence of SEQ ID NO:48.
In any of the herein disclosed kits or methods, a reference primer pair specific for mdh and comprising the nucleotide sequences according to SEQ ID NO:49 (forward primer) and 55 (reverse primer) may be used. Other reference primers specific for mdh that may be used in any of the herein disclosed kits or methods comprise or consist of a nucleotide sequence according to SEQ ID NOS:50-54 or 56-59.
In any of the herein disclosed methods of determining an antibiotic susceptibility, an antibiotic comprises a penicillin (e.g., ampicillin (AMP), mezlocillin, piperacillin, ticarcillin, methicillin, oxacillin, and the like), amoxicillin/clavulanate (A/C), a first-generation cephalosporin, a second-generation cephalosporin (e.g., cefaclor, cefamandole, cefonicid, ceforanide, cefuroxime, and the like), a third generation cephalosporin, a fourth generation cephalosporin (e.g., ceflidine, cefepime, cefluprenam, cefozopan, cefpirome, cefquinome, and the like), trimethoprim/sulfamethoxazole (T/S), a fluorquinolone, nitrofurantoin (NIT), a tetracycline (TET)(e.g., doxycycline, minocycline, oxytetracycline, tetracycline, and the like), a macrolide (e.g., azithromycin, clarithromycin, dirithromycin, erythromycin, roxithromycin, troleandomycin, and the like), an aminoglycoside (e.g., amikacin, gentamicin, kanamycin, neomycin, streptomycin, tobramycin, and the like), imipenem (IMI), ceftazidime/clavulanate, or any combination thereof.
In some embodiments, a first-generation cephalosporin comprises cefadoxil, cephradine, cefazolin, cephalexin, cefacetrile, cefadroxyl, cephaloglycin, cephalonium, cephaloridine, cephalothin, cephapirin, cephatrizine, cefazaflur, cefazedone, cefradine, cefroxadine, ceftezole, or any combination thereof. In particular embodiments, a first-generation cephalosporin comprises cefazolin.
In certain embodiments, a third-generation cephalosporin comprises cefcapene, cefdaloxime, cefdinir, cefditoren, cefetamet, cefixime, cefmenoxime, cefodizime, cefotaxime, cefovecin, cefpimizole, cefpodoxime, cefteram, ceftamere, ceftibuten, ceftiofur, ceftiolene, ceftizoxime, ceftriaxone, cefopperazone, ceftazidime, oxacephem, latomoxef, or any combination thereof. In particular embodiments, a third-generation cephalosporin comprises ceftriaxone (CTR).
In some embodiments, a fluorquinolone comprises flumequine, oxolinic acid, rosoxacin, ciprofloxacin, fleroxacin, lomefloxacin, nadifloxacin, norflocaxin, ofloxacin, pefloxacin, rufloxacin, balofloxacin, grepafloxacin, levofloxacin, pazufloxacin, sparfloxacin, temafloxacin, cinafloxacin, gatifloxacin, moxiflocaxin, sitafloxacin, prulifloxacin, trovafloxacin, clinafloxacin, or any combination thereof. In particular embodiments, a fluorquinolone comprises ciproflaxin (CIP).
Also provided herein are methods for treating an infection in a patient, wherein a method comprises administering to a patient in need thereof an effective amount of one or more antibiotic, wherein a pathogenic organism infecting the patient is known to be susceptible to the one or more administered antibiotics as determined using a method or a kit according to the present disclosure.
In certain embodiments, a pathogenic organism infecting a patient comprises Klebsiella. Briefly, Klebsiella bacteria are typically found in human intestines and feces, where they do not cause disease. Infections most commonly occur in hospital or healthcare settings and typically enter via the respiratory tract (e.g., via ventilators), urinary tract, bloodstream (e.g., via a contaminated intravenous catheter), surgical sites, or wounds, and can cause pneumonia, meningitis, sepsis, fever, chills, rash, light-headedness, and other conditions and symptoms. Standard treatment includes antibiotics and combinations thereof, though some Klebsiella are resistant to most antibiotics, including, in some instances, carbapenems.
As understood by a person skilled in the medical art, the terms, “treat” and “treatment,” refer to medical management of a disease, disorder, or condition of a subject (i.e., patient, host, who may be a human or non-human animal) (see, e.g., Stedman's Medical Dictionary). In general, an appropriate dose and treatment regimen provide one or more antibiotic in an amount sufficient to provide therapeutic or prophylactic benefit. Therapeutic or prophylactic benefit resulting from therapeutic treatment or prophylactic or preventative methods include, for example an improved clinical outcome, wherein the object is to prevent or retard or otherwise reduce (e.g., decrease in a statistically significant manner relative to an untreated control) an undesired physiological change or disorder, or to prevent, retard or otherwise reduce the expansion or severity of such a disease or disorder. Beneficial or desired clinical results from treating a subject include abatement, lessening, or alleviation of symptoms that result from or are associated the disease or disorder to be treated; decreased occurrence of symptoms; improved quality of life; longer disease-free status (i.e., decreasing the likelihood or the propensity that a subject will present symptoms on the basis of which a diagnosis of a disease is made); diminishment of extent of disease; stabilized (i.e., not worsening) state of disease; delay or slowing of disease progression; amelioration or palliation of the disease state; and remission (whether partial or total), whether detectable or undetectable; or overall survival.
“Treatment” can also mean prolonging survival when compared to expected survival if a subject were not receiving treatment. Subjects in need of the methods and compositions described herein include those who already have the disease or disorder, as well as subjects prone to have or at risk of developing the disease or disorder. Subjects in need of prophylactic treatment include subjects in whom the disease, condition, or disorder is to be prevented (i.e., decreasing the likelihood of occurrence or recurrence of the disease or disorder). The clinical benefit provided by the compositions (and preparations comprising the compositions) and methods described herein can be evaluated by design and execution of in vitro assays, preclinical studies, and clinical studies in subjects to whom administration of the compositions is intended to benefit (e.g., by slowed or reversed rates of infection spread, by lower bacterial counts in patient samples, reduction in severity of symptoms, and so on).
A “therapeutically effective amount” or “effective amount” of a composition (e.g., antibiotic or combination of antibiotics) of this disclosure refers to that amount of compound sufficient to result in amelioration of one or more symptoms of the disease being treated in a statistically significant manner. When referring to an individual active ingredient, administered alone, a therapeutically effective dose refers to the effects of that ingredient alone. When referring to a combination, an effective dose refers to the combined amounts of active ingredients or combined adjunctive active ingredient with a composition (such as, for example, an antibiotic) that results in a therapeutic effect, whether administered serially or simultaneously. An appropriate dose, suitable duration, and frequency of administration of the compositions will be determined by such factors as the age, size, gender, and condition of the patient; the type and severity of the disease, condition, or disorder; the particular form of the active ingredient; and the method of administration.
In certain embodiments, a method of treating an infection (e.g., a Klebsiella infection) in a patient comprises administering one or more antibiotics selected from a penicillin (e.g., ampicillin (AMP), mezlocillin, piperacillin, ticarcillin, methicillin, oxacillin, and the like), amoxicillin/clavulanate (A/C), a first-generation cephalosporin, a second-generation cephalosporin (e.g., cefaclor, cefamandole, cefonicid, ceforanide, cefuroxime, and the like), a third generation cephalosporin, a fourth generation cephalosporin (e.g., ceflidine, cefepime, cefluprenam, cefozopan, cefpirome, cefquinome, and the like), trimethoprim/sulfamethoxazole (T/S), a fluorquinolone, nitrofurantoin (NIT), a tetracycline (TET)(e.g., doxycycline, minocycline, oxytetracycline, tetracycline, and the like), a macrolide (e.g., azithromycin, clarithromycin, dirithromycin, erythromycin, roxithromycin, troleandomycin, and the like), an aminoglycoside (e.g., amikacin, gentamicin, kanamycin, neomycin, streptomycin, tobramycin, and the like), imipenem (IMI), ceftazidime/clavulanate, or any combination thereof.
In some embodiments, a first-generation cephalosporin comprises cefadoxil, cephradine, cefazolin, cephalexin, cefacetrile, cefadroxyl, cephaloglycin, cephalonium, cephaloridine, cephalothin, cephapirin, cephatrizine, cefazaflur, cefazedone, cefradine, cefroxadine, ceftezole, or any combination thereof. In particular embodiments, a first-generation cephalosporin comprises cefazolin.
In certain embodiments, a third-generation cephalosporin comprises cefcapene, cefdaloxime, cefdinir, cefditoren, cefetamet, cefixime, cefmenoxime, cefodizime, cefotaxime, cefovecin, cefpimizole, cefpodoxime, cefteram, ceftamere, ceftibuten, ceftiofur, ceftiolene, ceftizoxime, ceftriaxone, cefopperazone, ceftazidime, oxacephem, latomoxef, or any combination thereof. In particular embodiments, a third-generation cephalosporin comprises ceftriaxone (CTR).
In some embodiments, a fluorquinolone comprises flumequine, oxolinic acid, rosoxacin, ciprofloxacin, fleroxacin, lomefloxacin, nadifloxacin, norflocaxin, ofloxacin, pefloxacin, rufloxacin, balofloxacin, grepafloxacin, levofloxacin, pazufloxacin, sparfloxacin, temafloxacin, cinafloxacin, gatifloxacin, moxiflocaxin, sitafloxacin, prulifloxacin, trovafloxacin, clinafloxacin, or any combination thereof. In particular embodiments, a fluorquinolone comprises ciproflaxin (CIP).
In any of the presently disclosed methods of detecting the presence or absence of a SNP, or of determining antibiotic susceptibility of a pathogenic organism (e.g., Klebsiella), or of treating an infection in a patient, the pathogenic organism may be from a patient sample (e.g., a wound swab, an abscess aspirate, a fecal swab, urine, blood, saliva, sputum, a nasal swab, a tracheal swab, or a skin swipe), an invasive medical instrument (e.g., a bronchoscope, a breathing tube, a catheter, a surgical instrument, or the like), or a patient-accessible surface in a healthcare or elder care setting (e.g., a wall, a floor, a curtain, a bedsheet, a pillow, a doorknob, etc., in a hospital, an urgent care clinic, an ambulance, a physician's office, a nursing home, or the like).
In some embodiments, the pathogenic organism is from a patient sample selected from the group consisting of a wound swab, an abscess aspirate, a fecal swab, urine, blood, saliva, sputum, a nasal swab, a tracheal swab, and a skin swipe.
In further embodiments, the patient sample is or was previously fractionated to separate bacterial components from non-bacterial nucleic acids, ureas, and solids, prior to performing the nucleic acid amplification process. In still further embodiments, fractionated bacteria from a patient sample are lysed or were lysed prior to performing the amplification process or step.
A clonotype-generating process was designed to identify features of genetic relatedness based on input sequences. A flow chart showing the steps of an exemplary clonotype-generating process is provided in
To directly compare the resolution provided by a generated-clonotype with the resolution provided by standard multi-locus sequence typing (MLST) approach, known sequence types, or “STs” (combinations of alleles of five housekeeping Klebsiella loci for both K. pneumoniae and oxytoca species; bigsdb.pasteur.fr/klebsiella/klebsiella.html) were obtained without a priori knowledge as to whether the STs could be used to determine genetic relatedness of the Klebsiella clinical isolates. The alleles were from the gapA, infB, mdh, phoE and rpoB loci. The allele sequences were then concatenated and aligned to create a nucleotide position library. Nucleotide positions with a gap in one or more of the input allele sequences or that were monomorphic across the input allele sequences were discarded from the library, such that the resulting library consisted of non-gapped, polymorphic nucleotide positions.
Next, a full binary data set was created by assigning binary values to each position remaining within the library; specifically, a first binary value was assigned to the nucleotide base that appeared most frequently at the position and a second, different, binary value was assigned to all other nucleotide bases that appeared at the position within the library.
A reduced binary data set was then generated by discarding from the full binary data set one nucleotide position from each pair of nucleotide positions that had identical or reverse-identical binary distribution patterns; in other words, if a first nucleotide position shared an identical (e.g., “1-0-1-0-1-0” vs. “1-0-1-0-1-0”) or reverse-identical (e.g., “1-0-1-0-1-0” vs. “0-1-0-1-0-1”) binary distribution with a second nucleotide position in the library, either the first or the second nucleotide position was discarded from the library for failing to add further discriminatory power over the other nucleotide position.
Next, from the reduced binary data set, the polymorphic information content (PIC) of each nucleotide position in the library was calculated based on a modified version of the equation disclosed in Botstein et al., Am. J. Hum. Genet. 32:314-331 (1980). Specifically, for each position in the library, PIC was calculated as [1−Σ(frequency of the assigned first binary value at the position)2+(frequency of assigned second binary value at the position)2)].
All possible pairs of nucleotide positions within the reduced binary data set were then compared, and two calculations were performed: (i) the pairwise sum of binary distribution differences between each nucleotide position in each possible pair; and (ii) the overall mean sum of binary differences in the reduced binary data set based on all of the possible pairs. Where the pairwise sum of (i) was smaller than the overall mean sum of (ii), the nucleotide position with the lower PIC of the two nucleotide positions in the given pair was discarded. Finally, the remaining nucleotide positions were sorted in decreasing order of PIC values, and 10 nucleotide positions within the 5 selected Klebsiella loci were selected for further testing.
The ability to quickly and efficiently type bacteria by clonotyping is particularly valuable in clinical settings (e.g., identifying antibiotic-resistant bacteria in a hospital setting). A useful clonotype should be at least as accurate for predicting relatedness as a standard sequence-based typing scheme, which takes much longer to perform. Prior to testing the clonotype generated in Example 1 on a discovery set of Klebsiella isolates, a reference was established by typing the isolates according to a standard MLST scheme and testing antibiotic resistance.
First, a discovery set of 387 clinical extraintestinal Klebsiella spp. isolates was obtained. The five MLST loci described in Example 1 were sequenced for all 387 isolates. A phylogenetic tree was constructed using concatenated sequences of alleles for five MLST loci, gapA (450 bp), infB (318 bp), mdh (477 bp), phoE (420 bp), rpoB (501 bp) (MEGA 7.0 software). In particular, STs that were not found in the klebsiella MLST database (bigsdb.pasteur.fr/klebsiella/klebsiella.html; searched in January 2017) and that differed from a known ST by only one single nucleotide polymorphism (out of 2,166 nucleotides from the combined five MLST loci) were assigned the same number as the known ST. STs that were not found in the MLST database and that differed from a known ST by more than one SNP were termed “new” STs and were each assigned a sequential number using an in-house scheme. STs that belonged to one distinct branch of the phylogenetic tree were also assigned to respective phylogroups (
To evaluate the clonal relationship between different STs, a spanning tree of the 387-isolate discovery set was built using eBURST v 3.0 software (eburst.mlst.net/). All neighboring STs that differed in one out of five alleles were assigned to the same clonal complex (CC), named after the founder ST as predicted using eBURST. In CCs with multi-step branching (i.e., where a first ST differs from a second ST at more than one allele and a third, intermediate ST differs from either the first or second ST by one allele), sub-complexes (SC) were assigned wherever there was a founding ST with at least two terminally branched single locus variant STs. The resultant spanning tree is shown in
Next, antibiotic resistance was determined for all 387 Klebsiella isolates in the discovery set. Antibiotic testing was carried out according to the standard disk diffusion method as described in the Clinical and Laboratory Standards Institute (M100 Performance Standards for Antimicrobial Susceptibility Testing, 27th Edition, 2017). The antibiotics tested were: ampicillin (AMP), amoxicillin/clavulanate (A/C), cefazolin (CZ, as representative for first generation cephalosporins), ceftriaxone (CTR, as representative for third generation cephalosporins), trimethoprim/sulfamethoxazole (T/S), ciprofloxacin (CIP, as representative of fluorquinolones), nitrofurantoin (NIT), tetracycline (TET), imipenem (IMI) and ceftazidime vs ceftazidime/clavulanate to determine production of extended-spectrum beta-lactamases (ESBLs). Since almost all Klebsiella isolates are known to be resistant to AMP and sensitive to IMI, TET is not used for treatment of urinary tract infections, and ESBL production is partially reflected in resistance to CTR, these antibiotics were not included for further analysis.
For further analysis, each clonal group—among phylogroups, clonal complexes, clonal sub-complexes and individual STs—was evaluated for the prevalence of resistant isolates to AMC, CZ, CTR, T/S, CIP and NIT antibiotics (
First, any clonal group that exhibited resistance rates significantly below or above 20% (as estimated by two-sided one-sample t-test, alpha=0.1) was assigned a +0.5 or −0.5 weight, respectively. The 20% resistance threshold was chosen based on conventional guidelines for antimicrobial treatment adopted by Infectious Diseases Society of America (IDSA, idsociety.org/Guidelines/Patient_Care/IDSA_Practice_Guidelines/Infections_by_Organ_System/Genitourinary/Uncomplicated_Cystitis_and_Pyelonephriti (UTI)/). For the analysis of clone-specific patterns of resistance, intermediate resistance was counted as non-susceptibility.
Second, any clonal group with resistance rates that differed significantly from a reference group within the same clonal grouping scheme (as estimated by multiple logistic regression) was assigned a +0.3 or −0.3 weight for higher and lower resistance, respectively. The reference group was either chosen arbitrarily based on the magnitude of resistance observed in the group or selected as the largest clonal group whose resistance profile was closest to that of overall resistance profile of the discovery set.
Third, any clonal group that exhibited resistance rates above the 20% threshold was assigned a weight of 0.2.
Fourth, each clonal group was ranked for resistance to each antibiotic based on the absolute value of sum of weights, with resistant clonal groups having higher sums due to their relative clinical importance. In this method, only highly resistant clonal groups can be ranked the maximum value of 1.0 per each antibiotic. For each clonal complex, sub-complex, or ST, a clonal group-specific sum-rank (SR) was calculated as an average weight indicating resistance against all six antibiotics of interest (see
Having established a sequence-typing-based index of relatedness and antibiotic resistance profiles of the discovery set isolates, the ability of the generated clonotype features to predict diversity and resistance was next examined. The 10 SNPs identified using the method described in Example 1 were combined into 36 sets of 7 SNPs each (7 test wells for an 8-well PCR strip tube, plus a well for a control reaction), or “septatypes” or “7ts.” For each of the 36 Its, the following statistics were calculated (see
DI=1−D=1−Σ(n7t/N)2,
ACCI=Σ(P[ST|7t]*SRST)+Σ(P[SC|7t]*SRSC)+Σ(P[CC|7t]*SRCC),
Out of 36 possible 7-SNP combinations (or “sets”), 5 were chosen based on having the highest DI and ACCI indexes and further examined manually to confirm that higher resistance or susceptibility of bacteria having the individual 7-SNP combinations to antibiotics is indeed due to the prevalence of specific clonal groups among these types, and not due to random chance.
To select a final 7-SNP set for use in a PCR-based typing assay, the following criteria were used to compare potential primer-binding regions for the SNPs. Several hundred alleles of the sequenced loci (gapA, infB, mdh, phoE and rpoB) were downloaded from multiple available sequence databases. Each SNP's surrounding sequence was analyzed for polymorphisms in the potential primer-binding area(s). SNPs with lowest number of polymorphic sites that could potentially interfere with correct interpretation of PCR reaction were given preference. Additionally, the melting temperature of potential primer binding regions was analyzed to give preference to SNPs that could have primers designed with in-range melting temperature for more robust reactions to allow detection of all 7 SNPs simultaneously.
The 7-SNP combination chosen for further analysis was set no. 12, which included (1) phoE-54, (2) rpoB-130, (3) infB-279, (4) mdh-315, (5) phoE-336, (6) phoE-354 and (7) mdh-429.
First, several sets of primers were tested on a training subset of 51 isolates which included 5 E. coli isolates, 1 C. freundii isolate, 1 S. aureus isolates, 2 K. oxytoca isolates, and 42 K. pneumoniae isolates, belonging to different clonal complexes. The final set of primers and reaction conditions were chosen based on their performance on the training set. Additionally, the universal primers for mdh loci were designed to be used as positive control for both K. pneumoniae and K. oxytoca. The names, sequences, and directionality of the final primers are provided below in Table 6. Bold, italicized, underlined bases represent recognized SNPs. Bold, underlined bases represent introduced polymorphisms to make the primer less prone to false positives. Underlined bases (regular font) represent alternative (e.g., A or T or C or G, or a combination thereof) base positions according to the IUPAC nucleotide code (bioinformatics.org/sms/iupac.html). Mdh reference primers SEQ ID NOS:49 and 55 were also generated in-house. (“R/C”=Reverse Complement)
The names, sequences, and directionality of alternate primers are provided below in Table 7.
(R/C)
qPCR reaction volumes were 10 uL, and 10 uM primer stocks were used. SensiMix SYBR No-Rox Kit (Bioline) was used. The reaction mixes for amplifying the indicated SNPs using the primers in Table 1 or Table 2 are provided below in Table 8.
QPCR reactions were performed using a Rotogene Q instrument (72 tubes). Reaction conditions were as shown in Table 9.
Threshold reaction settings for amplifying the indicated SNPs are provided below in Table 10.
Next, a test set of 302 clinical isolates with SNPs identified via sequencing was used to evaluate the performance of the chosen 7t primers and conditions (
Finally, a validation set of 150 clinical isolates was first subjected to 7t testing in single replicates, with subsequent sequencing of the loci to determine the correlation between 7t-based and sequence-based results (
Thus, the 7t test performed with 95% accuracy for typing isolates in single replicates.
Next, the clonotypes and antibiotic resistance profiles of a set of 1,452 Klebsiella clinical isolates were determined. The set was divided into two subsets based on consecutive isolation dates—training (n=724) and validation (n=728). There was no statistically significant difference in clonotype distribution between the sets. The prevalence of resistant isolates was similar in both sets across all antibiotics tested, and the prevalence of clonotype-specific resistance did not differ significantly across both sets for the majority of clonotypes (
Thus, 7t-based clonotyping successfully predicts the prevalence of resistant isolates among Klebsiella isolates.
Once identified, an infection characterized by the presence of one or more identified clonotype can be treated with appropriate antibiotics or combinations of antibiotics. To aid in clinical decision-making and to translate clonotype information to treatment selection, Lookup Table 1 was generated. Lookup Table 1 (see Table 1 herein) recommends use or avoidance of antibiotics for treating Klebsiella infections based on the resistance profiles of the collected and tested Its described in the preceding Examples. The information contained in Lookup Table 1 was obtained from urine specimens from patients with Klebsiella infections from several clinics within different regions of the United States. Indications of allowance or non-recommendation were determined based on a 20% resistance threshold to the indicated antibiotic; i.e., the antibiotic is recommended when more than 80% of the tested isolates within the indicated clonotype were susceptible to the indicated antibiotic, and was cautioned against when less than 80% of the isolates were susceptible to the antibiotic.
The various embodiments described above can be combined to provide further embodiments. All of the U.S. patents, U.S. patent application publications, U.S. patent applications, foreign patents, foreign patent applications and non-patent publications referred to in this specification and/or listed in the Application Data Sheet, including U.S. Provisional Patent Application No. 62/668,042, filed May 7, 2018, are incorporated herein by reference, in their entirety. Aspects of the embodiments can be modified, if necessary to employ concepts of the various patents, applications and publications to provide yet further embodiments.
These and other changes can be made to the embodiments in light of the above-detailed description. In general, in the following claims, the terms used should not be construed to limit the claims to the specific embodiments disclosed in the specification and the claims, but should be construed to include all possible embodiments along with the full scope of equivalents to which such claims are entitled. Accordingly, the claims are not limited by the disclosure.
This invention was made with government support under R42AI116114-02 awarded by the National Institutes of Health. The government has certain rights in the invention.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/US19/30948 | 5/6/2019 | WO | 00 |
Number | Date | Country | |
---|---|---|---|
62668042 | May 2018 | US |