A Sequence Listing is provided herewith as an xml file, “2346033.xml” created on Jun. 23, 2023 and having a size of 16,437 bytes. The content of the xml file is incorporated by reference herein in its entirety.
The separation of nucleic acids (NAs), such as DNA and RNA, from biological samples, known as NA extraction or isolation, is a first step in many analytical, diagnostic, molecular biological, and forensic procedures. The NA isolation process typically involves several steps, including inactivation of resident nucleases to preserve NA integrity, cellular disruption, separation of the NA from cellular contaminants, and concentration of the extracted NA for further analysis. Currently used processes can be categorized into two general types of extraction methods. The first is liquid-liquid extraction, for example guanidinium thiocyanate-phenol-chloroform extraction. The second is solid-phase extraction, which includes the use of silica-based microchromatographic columns (e.g., “spin columns”) or charged magnetic beads.
There is no universally established standardized technique for NA extraction to use in multiple application contexts. Available techniques require varying degrees of processing time, instrumentation, use of hazardous reagents, trained personnel, and well-maintained laboratory spaces, each providing potential impediments to implementation in low-resource settings and miniaturized point-of-care devices.
As described herein, NA extraction from biologically relevant solutions can be performed using triggered liquid-liquid phase separation of NA-binding intrinsically disordered proteins (IDPs). Two types of NA-binding IDPs are provided and are based on genetically engineered elastin-like polypeptides (ELPs). ELPs are model IDPs that exhibit a lower critical solution temperature in water and can be designed to exhibit liquid-liquid phase separation (LLPS) at desired temperatures in a variety of biological solutions. ELP fusion proteins with NA-binding domains can be used to extract DNA and RNA from biological solutions. LLPS of pH responsive ELPs that incorporate histidine in their amino acid sequences can be used for binding, extraction, and release of NAs from biological solutions such as for detection of SARS-CoV-2 RNA in samples from COVID+ patients.
Aspects of the present disclosure can be better understood with reference to the following drawings.
Cellular membraneless organelles (MLOs) are distinct phase separated compartments that lack a lipid membrane but nevertheless function akin to their membrane delineated counterparts via the spatial and temporal organization of molecules. Several MLOs comprise RNA binding intrinsically disordered proteins (IDPs) that undergo reversible liquid-liquid phase separation (LLPS) to assemble and disassemble condensed phase assemblies for a host of regulatory activities. For example, phase separated IDPs bind and sequester cytoplasmic mRNA in MLOs known as stress granules to regulate its activity in response to environmental stresses, sometimes acting with other MLOs, such as P-bodies, to regulate mRNA outcome. Examples of environmental stimuli that can lead to rapid assembly and disassembly of IDP coacervate MLOs include temperature, pH, and osmotic stress. Cellular MLOs regulate downstream function using coupled environmental sensing and molecular phase behavior, thus helping to minimize complex, multilevel signaling cascades. Hence, condensed phase cellular MLOs provide a practical blueprint to potentially engineer programmable analogs in synthetic systems. Indeed, the simplicity of this biopolymer solution phase behavior is reflected in the growing popularity of IDPs and MLOs in origin-of-life discussions.
Further inspiring the use of IDPs in engineered systems are investigations that shed light on the mechanism of protein-NA binding and the role of IDPs in driving cellular MLO assemblies. For example, synthetic nucleoprotein MLOs were assembled in protocells using IDP fusions comprising an elastin-like protein (ELP) block concatenated with a soluble arginine-rich domain (RGG). ELPs are pentameric repeat polymers (sequence VPGXG, X=guest residue) while RGG domains are present in a host of cellular IDPs, including LAF-1, FUS, and MRE11. The relatively hydrophobic ELP block conferred phase separation behavior to the fusions, while the RGG domain enabled electrostatic binding of the fusions to RNA. In biological systems, RGG domains of cellular IDPs interact with RNA while simultaneously undergoing LLPS and therefore have dual roles as mediators of both RNA-binding and phase separation behaviors.
The mechanism of dual-role IDPs is further characterized by investigation of synthetic NA-binding IDP surrogates with well-defined stimulus-induced phase behavior that is not driven solely by complexation of IDP and NA polyelectrolytes of opposite charge. In this regard, ELPs are intriguing as candidate surrogates because of their ability to maintain hydrophobic lower critical solution temperature (LCST) phase behavior even while carrying a relatively large mean net charge. Furthermore, the molecular parameters of diverse sets of ELPs (e.g., guest residue, chain length) and their aqueous solubility as a function of temperature, concentration, and presence of cosolutes are correlated.
As described herein, polymers suitable for use in methods for isolating nucleic acids from complex samples can comprise proteins or synthetic polymers that exhibit temperature triggered phase separation and that incorporate pH switchable ionizable groups. For example, the synthetic polymer can be a poly(N-isopropyl acrylamide) (PNIPAAm). An example of a pH switchable ionizable group that could be incorporated synthetically into PNIPAAm is the imidazole group, such as that of the side chain of histidine.
In another example, the polymer can be an intrinsically disordered protein (IDP) comprising an amino acid composition that exhibits temperature triggered phase separation. Exemplary IDPs include polycationic ELPs, collagen, elastins, resilins, RRM-RGG and HCV Core proteins, and polypeptides comprising amino acid repeats rich in proline and glycine. The polypeptides can be modified or “tuned” to exhibit soluble to insoluble phase transitions that are of interest, including a lower critical solution temperature (LCST) transition that occurs upon heating above a critical solution temperature or an upper critical solution temperature (UCST) transition that occurs upon cooling below a critical temperature. See Quiroz, F., Chilkoti, A., Sequence heuristics to encode phase behavior in intrinsically disordered protein polymers, Nat. Mater. (2015); 14(11): 1164, which is incorporated herein by reference.
The phase behavior and NA binding affinity of a model polycationic ELP (called E3) are described herein. To produce E3, an otherwise uncharged ELP is engineered to contain equally spaced, interspersed cations that can promote electrostatic binding to nucleic acids after undergoing phase change. E3, with peptide sequence [(VPGXG)10-GKG]8, comprises 8 subunits of 10 concatenated neutral pentamers (VPGXG, X=8:2 ratio of Val/Ala), each subunit flanked by cationic Lys residues. E3 undergoes simple coacervation, driven by a thermodynamic preference for homotypic self-interactions over heterotypic ones, in contrast to charge-mediated complex coacervation, in which oppositely charged polyions associate to form coacervates in solution. Above a concentration dependent transition temperature (TT), the model cationic E3 protein undergoes LLPS in the presence or absence of DNA. Furthermore, the condensates formed by simple coacervation can be thermodynamically tuned with NaCl to preferentially interact with single stranded DNA to form synthetic deoxyribonucleoprotein (DNP) coacervates.
The DNA binding affinity of E3 and the amount of DNA captured and sequestered within E3 coacervates of distinct size and composition are measured systematically at different operating points by varying the initial E3 concentration and the addition of charge shielding NaCl salt. An adapted mean field Flory-Huggins (FH) theory is used in which the strength of the E3-DNA interaction is modulated by ionic strength, through linearization of the Debye-Hückel free energy in the evaluation of the component FH interaction parameters. The FH interaction parameters are fit to fluorescence spectroscopy and microscopy data collected from bulk and from microdroplet samples to create ternary phase diagrams that interpret the experimental observations. Results showing the dependence of the FH interaction parameters on ionic strength are corroborated by the Debye-Hückel linearization. This combined approach results in the creation of phase diagrams that quantify DNA component partitioning within discrete protein-rich and solvent-rich phases of known volume fraction across a range of salt and E3 compositions. The application of the method described herein provides a simple two-step DNA solution purification assay, with implications for applications such as viral RNA extraction, RNA/DNA capture from biological fluids, and gene regulation in synthetic cells.
A schematic of E3 coacervate formation and capture of DNA is illustrated in
The E3 polypeptide of SEQ ID NO. 1 can also be represented without the N-terminal MG or the C-terminal Y as SEQ ID NO. 2: [(VPGXG)10-GKG]8.
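As an illustrative sketch only, the bracket notation [(VPGXG)10-GKG]8 can be expanded programmatically to check the repeat architecture. The variable names below are hypothetical, the guest residue is left as the placeholder X rather than assigned to Val or Ala, and the N-terminal MG and C-terminal Y of SEQ ID NO. 1 are omitted.

```python
# Expand the repeat notation [(VPGXG)10-GKG]8 with X kept as a placeholder.
PENTAMER = "VPGXG"  # X = guest residue (Val or Ala at an 8:2 ratio in E3)
LINKER = "GKG"      # tripeptide carrying the cationic lysine

subunit = PENTAMER * 10 + LINKER  # (VPGXG)10-GKG
e3_core = subunit * 8             # [(VPGXG)10-GKG]8

n_residues = len(e3_core)             # 8 * (10 * 5 + 3) residues
n_pentamers = e3_core.count(PENTAMER) # neutral pentameric repeats
n_lysines = e3_core.count("K")        # one lysine per subunit

print(n_residues, n_pentamers, n_lysines)
```

Under these assumptions the expansion contains 80 pentameric repeats and 8 lysine residues.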
ssDNA Influences the Phase Behavior of Aqueous E3
To quantify the effect of ssDNA (0.5 μM, 28 nt) on the thermally dependent phase behavior of E3 (sequence: [(VPGXG)10-GKG]8 (SEQ ID NO: 2), X=8:2 ratio of Val:Ala), temperature-controlled spectrophotometry was used to measure cloud-point transition temperature (Tt) as a function of volume fraction (ϕ), the fraction of solution volume occupied by E3 chains, for E3 in the presence or absence of ssDNA species (+ssDNA or −ssDNA, respectively). E3 Tt(ϕ) was measured for volume fractions ranging from ϕ=0.00034 to ϕ=0.068 in pH-stable buffer. This range of ϕ values corresponds to E3 concentrations of 0.01 mM to 2 mM. The E3 polymer maintains canonical ELP LCST dilute phase behavior (a decrease in Tt with increasing E3 volume fraction ϕ) for both +ssDNA and −ssDNA solutions. For all replicate samples, it was found that the presence of ssDNA prompts a shift to lower Tt(ϕ) across all experimental ϕ values when compared to Tt(ϕ) for the −ssDNA E3 replicate samples (
NaCl Mediates Recruitment of ssDNA into E3 Condensates
The temperature and salt mediated partitioning of ssDNA with phase transitioned E3 condensates was investigated. Microfluidically generated water-in-oil emulsions provide a useful view into the LLPS microenvironment as it relates to E3-ssDNA binding behavior. ELP phase transition was triggered within microdroplets by increasing the temperature (T=55° C.) above the Tt of 1 mM E3 (ϕ=0.034, Tt+ssDNA=35° C.) to generate coarsening spherical E3 condensates surrounded by solvent-rich regions. Observing the evolution of the thermally phase transitioned system from 0 to 15 minutes, the coarsening of unlabeled E3 and the concurrent partitioning of fluorophore labeled ssDNA-Cy3 in each phase were tracked to completion by brightfield and fluorescence microscopy, respectively. The interaction between droplet encapsulated components of 1 mM E3 (brightfield) and 0.5 μM ssDNA-Cy3 (fluorescence), and the influence of 0 or 100 mM NaCl salt (−NaCl and +NaCl, respectively) in aqueous droplets, is detailed. For both −NaCl and +NaCl samples, representative droplet microscopy images are shown at: (1) t=0 minutes at room temperature, T<Tt, where E3 is in a soluble state; (2) t=3 minutes of incubation at T>Tt during the early stages of phase separation and E3 condensate coarsening; and (3) t=15 minutes at T>Tt when full coarsening and complete phase separation are achieved.
Across all t=0 samples at room temperature (T<Tt) no E3 condensates are observed (
Using FH formalisms (see equations 1-4), the phase diagrams of solution E3 and DNA were approximated at two salt concentrations of 0 mM NaCl and 100 mM NaCl. To quantify the partitioning of DNA into E3 coacervates at 0 mM and 100 mM NaCl, fluorimetry was used to determine DNA concentration in the dilute solvent-rich phase (
χ={χE3,Buffer,χDNA,Buffer,χE3,DNA}
are the concentration variant supernatant DNA concentration ratio from initial
and the total volume fraction of the E3 coacervate
In FH theory, component i volume fraction is related to concentration by the molar volume of the buffer (v) by
Without independent phase data of DNA in buffer, the DNA-buffer interaction parameter χDNA,Buffer is unknown. Keeping the difference χDNA,Buffer−χE3,DNA constant while varying χE3,DNA appears to generate nearly equivalent dilute phase DNA volume fractions. Therefore, keeping χDNA,Buffer=0 while finding the best fitting χE3,DNA will generate the best set of χ, knowing that only χDNA,Buffer−χE3,DNA is expected to be unique. The solution E3 interaction parameter at T=55° C. was reported in our previous work as χE3,Buffer=0.862 and is assumed to be constant for both salt conditions. For approximation, the degree of polymerization of E3 was assumed to be equal to the number of pentameric repeats of the protein (i.e., VPGXG), in this case NE3=80. For the DNA degree of polymerization, the molar volume ratio is assumed to be equal to the molecular weight ratio between molecules, giving
Fitting mean field Flory-Huggins (FH) theory equations (Equations 1-4 of Example 1E below) to the dilute phase DNA concentration ratio data simultaneously with the coacervate volume fraction data enables determination of the cross-polymer interaction parameter χE3,DNA and the buffer molar volume v at 0 mM NaCl and 100 mM NaCl conditions. Specifically, phase diagram tie line interpolation was used to find the best fitting buffer molar volume over a range of χE3,DNA values for both measurements. Then, the intersection of the best fitting lines from each measurement determined the overall best fit. It was determined that 0 mM NaCl results in an interaction parameter of χE3,DNA=−1.0 and that 100 mM NaCl results in a larger (less negative) χE3,DNA=−0.54. The interpolation of the tie lines from the resulting phase diagrams of the experimental concentrations used are presented in
The best fitting buffer molar volume v was found to be 0.54 M⁻¹ and 0.58 M⁻¹ for the −NaCl and +NaCl conditions, respectively, and the mean value of 0.56 M⁻¹ was used for analysis. With respect to the experimental variance of the microdroplet measurements of
An appendage to FH could be used to describe the change in interaction parameters leading to partitioning of DNA with E3 coacervates at various NaCl concentrations. It is assumed that the ionic effects are captured on a mean field level by introducing an enthalpic free energy provided by the Debye-Hückel (DH) theory. Considering the solvation criteria for DH, it should be admissible to assume the ionic effects contribute by appending the standard FH interaction parameters with a linear perturbation for charge effects. Each effective interaction parameter χi,j will be, by a first approximation, the hypothetical interaction parameter of the same system without Coulombic interactions χi,j0 appended with the respective term from a linearization of the DH free energy.
Here, A is the Debye-Hückel free-energy pre-factor, which is related to ionic radius. Is is the ionic strength of the salt solution multiplied by the buffer molar volume v. The ionic strength of our 100 mM di-basic sodium phosphate buffer at 0 and 100 mM added NaCl is 0.3 M and 0.4 M, respectively. The component of ionic strength for each polyion is given as
where zi is the total number of charges of molecule i.
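The ionic strength values quoted above for the buffer (0.3 M and 0.4 M) are consistent with the standard definition I = ½ Σ c_i z_i². The following minimal sketch reproduces them under two assumptions that are not stated in the source: the di-basic sodium phosphate and the NaCl are treated as fully dissociated into Na+, HPO4(2−), and Cl−, and the contributions of the E3 and DNA polyions themselves are neglected. The function names are hypothetical.

```python
def ionic_strength(ions):
    """Return I = 0.5 * sum(c_i * z_i**2) in mol/L for (conc_M, charge) pairs."""
    return 0.5 * sum(c * z ** 2 for c, z in ions)


def buffer_ions(nacl_mM):
    """Small ions in 100 mM Na2HPO4 plus added NaCl (assumes full dissociation)."""
    nacl = nacl_mM / 1000.0
    return [
        (0.200 + nacl, +1),  # Na+ from Na2HPO4 (2 x 100 mM) and from NaCl
        (0.100, -2),         # HPO4(2-)
        (nacl, -1),          # Cl-
    ]


I_0 = ionic_strength(buffer_ions(0))      # no added NaCl
I_100 = ionic_strength(buffer_ions(100))  # 100 mM added NaCl
print(f"{I_0:.2f} M, {I_100:.2f} M")
```

Phosphate speciation at the working pH is ignored here, so the sketch is a consistency check on the quoted values rather than a rigorous buffer calculation.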
Fully determining the FH parameters N and χ allows for the determination of the full three component phase diagram, including the binodal, tie lines and critical point, and these plots are given in
Sequential LLPS Allows Recovery of DNA from E3/DNA Solutions
The fact that E3 efficiently captures DNA upon coacervation in the absence of added salt suggests a potential simple method for separation of nucleic acids from solutions by LLPS. As depicted in
In sum, the phase behavior and molecular partitioning of a ternary system of ELP, DNA, and aqueous buffer were characterized. A model ELP called E3 was engineered that comprises ELP blocks flanked by 8 evenly spaced, cationic lysine residues (E3 sequence: [(VPGXG)10-GKG]8 (SEQ ID NO: 2), X=8:2 ratio of Val:Ala). The concentration dependent lower critical solution temperature (LCST) transition temperature of E3 is reduced by a few degrees Celsius in the presence of DNA. The NaCl-mediated capture of Cy3-labeled DNA by E3 condensates is observed in microfluidically generated droplets and characterized by fluorescence spectroscopy. The results above show that E3 efficiently captures DNA upon coacervation only in the absence of added NaCl; at 100 mM added NaCl, DNA shows no preference for the coacervate or the solvent-rich phase. Mean field Flory-Huggins (FH) theory describes the drastic reduction in DNA partitioning into E3 coacervates upon addition of 100 mM NaCl.
Multivariate fitting of FH interaction parameters was applied to experimental data for the concentration-dependent ratio of supernatant DNA concentration to initial DNA concentration and for the total volume fraction of E3 coacervates. A linearized Debye-Hückel term was introduced into the FH interaction parameters to account for the change in DNA capture by E3 condensates with ionic strength. The results above show changes in the DH-modified FH interaction parameters similar to those estimated from fitting to experimental data. Ternary phase diagrams complete with tie lines and binodal curves were generated that quantify DNA and E3 component partitioning within protein-rich and solvent-rich phases at 0 mM and 100 mM added NaCl buffer conditions. Finally, the utility of our system was demonstrated by prototyping a new DNA purification assay that uses thermal LLPS of E3 and addition of NaCl salt to control the DNA binding and release behavior of E3 condensates.
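For orientation, the textbook form of the ternary Flory-Huggins mixing free energy can be sketched using the parameter values reported above (NE3=80, χE3,Buffer=0.862, χDNA,Buffer=0, and χE3,DNA=−1.0 or −0.54). This is a generic sketch and not necessarily the exact form of Equations 1-4 of this disclosure, which are not reproduced here. The function name is hypothetical, and the DNA degree of polymerization passed in the usage lines is a placeholder, not a value from the source.

```python
import math

N_E3 = 80                # pentameric repeats per E3 chain (from the text)
CHI_E3_BUFFER = 0.862    # at T = 55 C (from the text)
CHI_DNA_BUFFER = 0.0     # held at zero during fitting (from the text)
CHI_E3_DNA = {0: -1.0, 100: -0.54}  # fitted values keyed by added NaCl in mM


def fh_free_energy(phi_e3, phi_dna, n_dna, nacl_mM):
    """Dimensionless FH mixing free energy per lattice site (buffer has N = 1)."""
    phi_buffer = 1.0 - phi_e3 - phi_dna
    if min(phi_e3, phi_dna, phi_buffer) <= 0.0:
        raise ValueError("all volume fractions must be positive")
    # Combinatorial (entropic) terms: (phi_i / N_i) * ln(phi_i)
    entropic = (
        (phi_e3 / N_E3) * math.log(phi_e3)
        + (phi_dna / n_dna) * math.log(phi_dna)
        + phi_buffer * math.log(phi_buffer)
    )
    # Pairwise interaction terms: chi_ij * phi_i * phi_j
    interaction = (
        CHI_E3_BUFFER * phi_e3 * phi_buffer
        + CHI_DNA_BUFFER * phi_dna * phi_buffer
        + CHI_E3_DNA[nacl_mM] * phi_e3 * phi_dna
    )
    return entropic + interaction


N_DNA_PLACEHOLDER = 10  # hypothetical value, for illustration only
f_0 = fh_free_energy(0.30, 0.01, N_DNA_PLACEHOLDER, nacl_mM=0)
f_100 = fh_free_energy(0.30, 0.01, N_DNA_PLACEHOLDER, nacl_mM=100)
print(f_0 < f_100)
```

At a fixed composition the two salt conditions differ only through the χE3,DNA term, so the more negative value at 0 mM NaCl lowers the free energy of an E3-rich phase that contains DNA, in line with the observed capture of DNA in the absence of added salt.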
As described herein, the isolation of nucleic acids using IDP coacervates may be applied to a variety of nucleic acid-containing samples. For example, the sample can be a physiologically relevant sample such as body tissue or body fluids. The body fluids can include saliva, sputum, mucus, nasopharyngeal discharge (e.g., nasal discharge collected from a patient by nasopharyngeal swab), blood, serum, plasma, urine, aspirate, stool or a combination thereof. The sample can also be from environmental sampling such as municipal wastewater, swabs from contaminated surfaces, and air samples (e.g., SKC polytetrafluoroethylene (PTFE) filter cassette samples).
The nucleic acids isolated in the coacervate produced by the IDPs described herein can be subjected to a variety of nucleic acid-based diagnostic assays. Such diagnostic assays can be implemented to identify disease agents, such as pathogenic bacteria or viruses, in the coacervate. For example, many nucleic acid-based diagnostics rely on the quantitative polymerase chain reaction (qPCR) or real-time quantitative reverse transcription PCR, which have been widely adopted and are frequently used in clinical laboratories. The versatility, robustness and sensitivity of PCR have made this technology commonly used for the detection of DNA and RNA biomarkers.
In non-PCR based methods, nucleic acid-based diagnostics can include a variety of methods for amplifying or detecting nucleic acids, including isothermal amplification, nicking endonuclease amplification reaction (NEAR), transcription mediated amplification (TMA), loop-mediated isothermal amplification (LAMP), helicase-dependent amplification (HDA), clustered regularly interspaced short palindromic repeats (CRISPR), and strand displacement amplification (SDA) based diagnostics. See, for example, Kaminski, M. et al., CRISPR-based diagnostics, Nature Biomedical Engineering (2021) 5: 643-656.
The present technology further pertains to a packaged pharmaceutical composition such as a kit or other container for detecting, controlling, preventing, or treating a disease. The kits of the technology can be designed for detecting, controlling, preventing, or treating diseases such as those described herein (e.g., a viral infection). In one embodiment, the kit or container can hold the intrinsically disordered protein (IDP), such as the polycationic elastin-like polypeptide, as well as instructions for preparing a composition that includes the polycationic elastin-like polypeptide.
The kits of the technology can also comprise containers with tools useful for administering the compositions of the technology. Such tools can include syringes, swabs, catheters, antiseptic solutions, and the like. Some kits can include all of the desired tools, solutions, and compounds, including mixing vessels, utensils, and injection devices, to diagnose or treat a patient according to any of the methods described herein. In one embodiment, a kit includes the IDP of the various embodiments described herein. The IDP can be sterile-packaged as a dry powder in a suitable container (e.g., a substantially water-impermeable container) such as a syringe, vial (e.g., the vial can include a septum and/or a crimp seal; and the vial can optionally comprise an inert atmosphere, such as a nitrogen atmosphere or dry air) or pouch (e.g., a pouch comprising a moisture barrier; and the pouch can optionally comprise an inert atmosphere, such as a nitrogen atmosphere, or dry air). The kit can also include a desiccant. The desiccant can be included in the pouch or integrated into the layers of the pouch material. In some embodiments, the IDP can be sterile-packaged in a frozen vehicle. As mentioned previously, the vehicle can be any suitable vehicle, including flowable vehicles (e.g., a liquid vehicle) such as a flowable, bioresorbable polymer, saline, sterile water, Ringer's solutions, and isotonic sodium chloride solutions. Examples of vehicles include, but are not limited to, Sodium Chloride Injection USP (0.9%), Ringer's Injection USP, Lactated Ringer's Injection USP, Sodium Lactate Injection USP, Dextrose Injection USP (5% or 10%), Bacteriostatic Water for Injection USP and Sterile Water for Injection USP. In some examples, the IDP can be suspended in a buffer; pre-filled into a container, such as a syringe; and frozen.
Values expressed in a range format should be interpreted in a flexible manner to include not only the numerical values explicitly recited as the limits of the range, but also to include all the individual numerical values or sub-ranges encompassed within that range as if each numerical value and sub-range were explicitly recited. For example, a range of “about 0.1% to about 5%” or “about 0.1% to 5%” should be interpreted to include not just about 0.1% to about 5%, but also the individual values (e.g., 1%, 2%, 3%, and 4%) and the sub-ranges (e.g., 0.1% to 0.5%, 1.1% to 2.2%, 3.3% to 4.4%) within the indicated range. The statement “about X to Y” has the same meaning as “about X to about Y,” unless indicated otherwise. Likewise, the statement “about X, Y, or about Z” has the same meaning as “about X, about Y, or about Z,” unless indicated otherwise.
In this document, the terms “a,” “an,” or “the” are used to include one or more than one unless the context clearly dictates otherwise. The term “or” is used to refer to a nonexclusive “or” unless otherwise indicated. In addition, it is to be understood that the phraseology or terminology employed herein, and not otherwise defined, is for the purpose of description only and not of limitation. Any use of section headings is intended to aid reading of the document and is not to be interpreted as limiting. Further, information that is relevant to a section heading can occur within or outside of that particular section. Furthermore, publications, patents, and patent documents referred to in this document are incorporated by reference herein in their entirety, as though individually incorporated by reference. In the event of inconsistent usages between this document and those documents so incorporated by reference, the usage in the incorporated reference should be considered supplementary to that of this document; for irreconcilable inconsistencies, the usage in this document controls.
In the methods described herein, the steps can be carried out in any order without departing from the principles of the disclosure, except when a temporal or operational sequence is explicitly recited. Furthermore, specified steps can be carried out concurrently unless explicit claim language recites that they be carried out separately. For example, a claimed step of doing X and a claimed step of doing Y can be conducted simultaneously within a single operation, and the resulting process will fall within the literal scope of the claimed process.
The term “about” as used herein can allow for a degree of variability in a value or range, for example, within 10%, within 5%, or within 1% of a stated value or of a stated limit of a range.
Each embodiment described above is envisaged to be applicable in each combination with other embodiments described herein. For example, embodiments corresponding to formula (I) are equally envisaged as being applicable to formula (1b).
The technology will be further described by the following non-limiting examples.
A. Expression and Purification of E3.
The expression vector pET24 was purchased from Novagen, Inc. (Milwaukee, WI). One-Shot BL21 Star (DE3) Escherichia coli cells were from ThermoFisher Scientific (Waltham, MA). Restriction enzymes were from New England Biolabs (Beverly, MA). DNA purification kits were purchased from QIAGEN, Inc. (Valencia, CA). DNA sequences (gene fragments and ssDNA) were purchased from Integrated DNA Technologies (Coralville, IA). tRNA was purchased from Millipore Sigma (St. Louis, MO). Luria broth (LB) agar plates were purchased from Bacto Agar, Becton Dickinson (Franklin Lakes, NJ), and Millipore Sigma (St. Louis, MO). Kanamycin was from Ultrapure, VWR (Radnor, PA). LB Broth and Terrific Broth (TB) were from IBI Scientific (Dubuque, IA). The viral RNA isolation kit was from Zymo Research (Irvine, CA). Reagents for RT-qPCR were obtained from ThermoFisher Scientific (Waltham, MA).
The gene encoding the E3.10 protein was constructed using plasmid pET24-E3 as a starting point. The RRM and RGG domains of the FUS protein were engineered into the E3 protein using the Golden Gate assembly method as described. Engler, C.; Kandzia, R.; Marillonnet, S. A One Pot, One Step, Precision Cloning Method with High Throughput Capability. PLoS ONE 2008, 3 (11), e3647. doi.org/10.1371/journal.pone.0003647. Briefly, the E3 plasmid and the FUS protein plasmid were digested with BsaI and subsequently ligated together to generate pET24-E3.10. pET24-E1.40COR30 was constructed by ligating a synthetic COR30 sequence (Integrated DNA Technologies, Coralville, IA) to the 3′ end of the E1.40 sequence in pET24-E1.40 using a single step recursive ligation method. McDaniel, J. R.; MacKay, J. A.; Quiroz, F. G.; Chilkoti, A. Recursive Directional Ligation by Plasmid Reconstruction Allows Rapid and Seamless Cloning of Oligomeric Genes. Biomacromolecules 2010, 11 (4), 944-952. doi.org/10.1021/bm901387t. Plasmids expressing H-20 and H-24 were constructed following previously described methods. MacKay, J. A.; Callahan, D. J.; FitzGerald, K. N.; Chilkoti, A. Quantitative Model of the Phase Behavior of Recombinant pH-Responsive Elastin-Like Polypeptides. Biomacromolecules 2010, 11 (11), 2873-2879. doi.org/10.1021/bm100571j.
Escherichia coli (BL21) cells harboring plasmids encoding the protein of interest were inoculated onto an LB agar plate containing 45 μg/mL kanamycin sulfate and incubated overnight at 37° C. Starter cultures grown from individual colonies were used to inoculate 3 mL of LB broth with 45 μg/mL kanamycin sulfate. This culture was incubated overnight at 220 rpm and 37° C. The culture was then transferred into 1 L of TB supplemented with 45 μg/mL kanamycin sulfate. Cultures were incubated at 37° C. with agitation for 6 hrs before induction with isopropyl β-D-1-thiogalactopyranoside (IPTG). The culture was induced at 37° C. for 18 hrs prior to harvest by centrifugation at 4° C. and 3000 rpm for 30 min. The resulting pellets were resuspended in a lysis buffer (phosphate buffered saline (1×PBS), 1 Pierce™ protease inhibitor tablet from ThermoFisher (Waltham, MA), and 0.05 mM ethylenediaminetetraacetic acid (EDTA) at pH 8.0) and lysed by sonication to release all intracellular content.
Expressed proteins were purified by inverse transition cycling, exploiting the reversible thermally responsive protein phase separation of the ELP constructs. This approach comprises cyclic centrifugation steps that alternate between cold (4° C.) and hot (40° C.) centrifugation in PBS until all contaminants are removed, usually within 2-5 cycles. For E3.10, hot centrifugation was replaced by room temperature centrifugation. LLPS was triggered by the addition of 1 M ammonium sulfate to lower the transition temperature of the solution from 37° C. to 25° C., both to induce ELP phase separation and to avoid possible denaturation of folded domains in the protein.
B. Measurement of E3 Cloud Point TT in the Presence of DNA by UV-vis Spectroscopy.
Compositions of the DNA species used in the experiments described are provided in Table 2 below.
Samples containing a range of E3 concentrations, 500 nM of single stranded 28 nucleotide (nt) DNA (Integrated DNA Technologies, Coralville, IA), and buffer were prepared for temperature dependent absorbance experiments according to Table 3, in which the E3 content is given by both volume fraction and concentration. All samples were prepared in 100 mM NaH2PO4.
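As a minimal sketch, the E3 volume fraction and molar concentration can be interconverted using the proportionality implied by the values stated in this disclosure (0.01 mM, 1 mM, and 2 mM E3 correspond to ϕ=0.00034, 0.034, and 0.068). The strictly linear relation and the helper names below are assumptions made for illustration.

```python
# Proportionality constant taken from the stated pair 1 mM E3 <-> phi = 0.034.
PHI_PER_MM = 0.034


def e3_volume_fraction(conc_mM):
    """E3 volume fraction for a molar concentration in mM (assumes linearity)."""
    return PHI_PER_MM * conc_mM


def e3_concentration_mM(phi):
    """E3 concentration in mM for a given volume fraction (assumes linearity)."""
    return phi / PHI_PER_MM


for conc in (0.01, 1.0, 2.0):
    print(f"{conc:g} mM -> phi = {e3_volume_fraction(conc):.5f}")
```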
The transition temperature was quantified by measuring the absorbance of the samples at 380 nm, without and with 500 nM ssDNA, as a function of temperature with a temperature controlled (Peltier temperature controller, Agilent, Santa Clara, CA) UV-vis spectrophotometer (Cary 300 UV-vis, Agilent). The data are then plotted to display the change in absorbance of the solution over a temperature range of 30-60° C. and the TT is obtained by taking the maximum in the first derivative of the absorbance as a function of temperature.
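The derivative-based determination of the transition temperature described above can be sketched as follows. The function name is hypothetical, the central-difference estimate of the derivative is one of several reasonable choices, and the sigmoidal absorbance curve is synthetic data generated only to exercise the function, not measured data.

```python
import math


def transition_temperature(temps_C, absorbance):
    """Return the temperature at which dA/dT (central differences) is largest."""
    best_temp, best_slope = None, -math.inf
    for i in range(1, len(temps_C) - 1):
        slope = (absorbance[i + 1] - absorbance[i - 1]) / (
            temps_C[i + 1] - temps_C[i - 1]
        )
        if slope > best_slope:
            best_temp, best_slope = temps_C[i], slope
    return best_temp


# Synthetic turbidity curve over 30-60 C with its steepest rise placed at 35 C.
temps = [30.0 + 0.1 * i for i in range(301)]
absorbance = [1.5 / (1.0 + math.exp(-(t - 35.0) / 0.5)) for t in temps]

tt = transition_temperature(temps, absorbance)
print(f"Estimated transition temperature: {tt:.1f} C")
```

With real data, smoothing the absorbance trace before differentiation may be needed, since finite differences amplify measurement noise.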
C. Characterization of DNA Concentration by Fluorescence Spectroscopy.
A temperature and time dependent fluorescence spectroscopy assay was used to characterize the binding interactions between E3 and DNA-Cy3 (28 nt single stranded oligonucleotide with cyanine-3 fluorophore attached to the 5′ end, shown in Table 2) in the presence or absence of 100 mM NaCl (VWR). Sample solutions of 1 mL total volume were prepared in triplicate at either 0 or 100 mM NaCl. The experimental samples contain E3 at varying concentrations, 500 nM DNA-Cy3, 100 mM sodium phosphate buffer (sodium phosphate powder, Sigma Aldrich, St Louis, Missouri), and molecular biology-grade water (Corning) at pH 7.0 to maintain E3 phase transition behavior, the charge of the Lys residues distributed within the E3 polymers in solution, and a stable pH. All samples were prepared using dark, LightSafe 1.5 mL polypropylene microcentrifuge tubes (Sigma-Aldrich). Control samples were prepared and treated in the same way as the experimental samples to monitor the stability of the Cy3 fluorescence intensity and were used to normalize the measured fluorescence intensity values. The solutions were vortexed and centrifuged for 5 seconds to combine, then pipetted at room temperature (23° C.) into 100 μL precision volume quartz cuvettes (Ultra-Micro Cell 105.250-QS LP 10 mm×2 mm, CH 8.5 mm, Hellma Analytics, Plainview, NY). The fluorescence intensity of the samples was measured using a fluorimeter (PTI QuantaMaster QM-400 Horiba, Irvine, CA) with 520 nm excitation wavelength and 540-650 nm emission scan settings. Next, the sample volumes were transferred from the cuvettes back into the dark microcentrifuge tubes and incubated at 55° C. in a heating block (Isoblock Dry Bath Heat Block, Benchmark, Edison, NJ) for 2 h to induce phase separation of E3, which results in the formation of a clear protein-poor phase and a protein-rich phase settled at the bottom of the tube.
By deliberate pipetting, the supernatant-only (protein-poor phase) was transferred to the quartz cuvettes, leaving the coacervate phase undisturbed at the bottom of the tube. The fluorescence intensity of the supernatant was measured by fluorimetry with the same aforementioned settings. This enables the determination of the amount of DNA-Cy3 present in the supernatant versus the amount of DNA-Cy3 associated with E3 in the coacervate phase as plotted in
A two-step/two-color isolation assay was designed and validated to quantify the concentration of DNA isolated from a starting mixed sample of E3 and DNA using fluorimetry. Briefly, the fluorescence of E3 doped with E3-Alexa488 was measured, with a 450 nm excitation wavelength and 470-540 nm emission scan settings, along with 500 nM DNA-Cy3 at room temperature (
D. Fluorescence Microscopy Imaging.
To image the process of temperature-mediated E3 coacervation and interaction with DNA-Cy3, an Olympus IX83 fluorescence microscope (Olympus Life Science Technology Division, Center Valley, PA) was equipped with a Physitemp cooling and heating stage (TS4-MP/ER/PTU, Clifton, NJ) fitted with a temperature controller. Microfluidic droplets were generated using a previously described droplet generator and methods. These droplets contained different E3 concentrations, 500 nM DNA-Cy3, sodium phosphate buffer at pH 7.0, and either 0 or 100 mM added NaCl, and were pipetted onto a glass slide (18 mm×18 mm Square Micro Cover Glass, VWR). The droplet population was allowed to settle for 5 min until a single layer of droplets formed on the surface. The glass slide was mounted onto the temperature-controlled stage and equilibrated to 20° C. Images were acquired by a high dynamic range camera (ORCA-Flash4.0 V3 Digital CMOS camera C13440-20CU, Hamamatsu, Bridgewater, NJ) using both brightfield and fluorescence (520 nm LED excitation source/550 nm emission filter) acquisition modes, across a temperature range including values below (25° C.) and above (55° C.) the TT (
E. Ternary Component Flory-Huggins Phase Diagrams.
Using mean field Flory-Huggins (FH) theory, ternary phase diagrams can be created to quantify a DNA component partitioning within discrete protein and solvent rich phases across a range of salt and E3 compositions. The standard FH equation providing the Helmholtz free energy density f for an incompressible two-polymer aqueous system of polymer volume fraction components ϕ1 and ϕ2 is
between all phases for each component. With the volume fraction conservation constraint, the buffer chemical potential is no longer independent of the other components. In its place analytically is the constraint of equivalent excess grand free-energy between phases, which is otherwise known as a Weierstrass-Erdmann condition. These criteria are summarized as
μ1(ϕ1′,ϕ2′)=μ1(ϕ1″,ϕ2″)=μ1* (2)
μ2(ϕ1′,ϕ2′)=μ2(ϕ1″,ϕ2″)=μ2* (3)
f(ϕ1′,ϕ2′)−f(ϕ1″,ϕ2″)=(ϕ1′−ϕ1″)μ1*+(ϕ2′−ϕ2″)μ2* (4)
Here ϕi′ and ϕi″ denote the volume fractions of component i in the dilute and dense phases, respectively. There exists one set of binodal chemical potentials {μ1*, μ2*} for each tie line within the phase envelope, and each tie line is given explicitly by the set {ϕ1′, ϕ1″, ϕ2′, ϕ2″}. Therefore, a unique set of FH parameters can be obtained from the coupled set of equations by experimentally determining {ϕ1′, ϕ1″, ϕ2′, ϕ2″}.
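As a hedged illustration of how the coexistence criteria (2)-(4) can be evaluated numerically, the sketch below assumes a commonly used ternary Flory-Huggins form, f/kT = (ϕ1/N1)ln ϕ1 + (ϕ2/N2)ln ϕ2 + ϕs ln ϕs + χ12ϕ1ϕ2 + χ1sϕ1ϕs + χ2sϕ2ϕs with ϕs = 1−ϕ1−ϕ2. This form, the names `FHParams`, `free_energy`, `mu1`, `mu2`, and `coexistence_residuals`, and all numerical values are assumptions for illustration and may differ from the expression and parameters used in this work.

```python
import math
from dataclasses import dataclass


@dataclass(frozen=True)
class FHParams:
    n1: float      # chain length of polymer 1 (hypothetical value below)
    n2: float      # chain length of polymer 2 (hypothetical value below)
    chi12: float   # polymer 1 / polymer 2 interaction parameter
    chi1s: float   # polymer 1 / solvent interaction parameter
    chi2s: float   # polymer 2 / solvent interaction parameter


def free_energy(phi1, phi2, p):
    """Assumed FH free energy density f/kT for two polymers plus solvent."""
    phis = 1.0 - phi1 - phi2  # incompressibility fixes the solvent fraction
    return (phi1 / p.n1 * math.log(phi1)
            + phi2 / p.n2 * math.log(phi2)
            + phis * math.log(phis)
            + p.chi12 * phi1 * phi2
            + p.chi1s * phi1 * phis
            + p.chi2s * phi2 * phis)


def mu1(phi1, phi2, p):
    """Exchange chemical potential df/dphi1 at fixed phi2."""
    phis = 1.0 - phi1 - phi2
    return ((math.log(phi1) + 1.0) / p.n1 - (math.log(phis) + 1.0)
            + p.chi12 * phi2 + p.chi1s * (phis - phi1) - p.chi2s * phi2)


def mu2(phi1, phi2, p):
    """Exchange chemical potential df/dphi2 at fixed phi1."""
    phis = 1.0 - phi1 - phi2
    return ((math.log(phi2) + 1.0) / p.n2 - (math.log(phis) + 1.0)
            + p.chi12 * phi1 - p.chi1s * phi1 + p.chi2s * (phis - phi2))


def coexistence_residuals(dilute, dense, p):
    """Residuals of criteria (2), (3), and (4); all vanish on a tie line."""
    a1, a2 = dilute
    b1, b2 = dense
    mu1_star, mu2_star = mu1(a1, a2, p), mu2(a1, a2, p)
    r2 = mu1_star - mu1(b1, b2, p)
    r3 = mu2_star - mu2(b1, b2, p)
    r4 = (free_energy(a1, a2, p) - free_energy(b1, b2, p)
          - (a1 - b1) * mu1_star - (a2 - b2) * mu2_star)
    return r2, r3, r4


# Hypothetical parameters and a trial (not solved) pair of phase compositions.
params = FHParams(n1=100.0, n2=30.0, chi12=-0.5, chi1s=0.9, chi2s=0.4)
print(coexistence_residuals((0.01, 0.002), (0.30, 0.02), params))
```

Driving the three residuals to zero with a root finder, for measured or trial compositions, is one way to locate a tie line or to fit the interaction parameters; the solver itself is left out of this sketch.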
Intrinsically disordered proteins (IDPs) used to target viral RNA can circumvent inherent drawbacks of existing methodologies and provide highly efficient and rapid isolation of viral RNA from complex samples. Engineered IDPs can isolate viral RNA from complex samples by phase separation for detection and diagnosis, as illustrated in
In some cases, the IDP can be an ELP having an amino acid sequence of SEQ ID NO.: 3: (Val-Pro-Gly-X-Gly)n, wherein X is any amino acid residue except proline. The IDP can have a thermally reversible lower critical solution temperature (LCST) phase transition at a transition temperature (Tt). The thermally responsive properties of the ELP are influenced by the number of ELP pentapeptides and the identity of the amino acid X in SEQ ID NO. 2.
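The repeat architecture (Val-Pro-Gly-X-Gly)n can be made concrete with a short sketch that assembles a one-letter sequence from a guest-residue pattern. The helper name `build_elp` and the ordering of guest residues within a mixed pattern are hypothetical; only the VPGXG motif, the exclusion of proline at X, and the example (VPGVG)40 come from this description.

```python
def build_elp(guest_residues, repeats):
    """Concatenate `repeats` VPGXG pentapeptides, cycling X through guest_residues."""
    if not guest_residues:
        raise ValueError("at least one guest residue is required")
    if "P" in guest_residues:
        raise ValueError("guest residue X may not be proline")
    return "".join(
        f"VPG{guest_residues[i % len(guest_residues)]}G" for i in range(repeats)
    )


# (VPGVG)40: 40 pentapeptides with valine at every X position.
elp_v40 = build_elp("V", 40)
print(len(elp_v40))  # 40 pentapeptides x 5 residues

# A mixed guest pattern with V:H:G:A at 1:2:1:1; the order "VHGHA" is illustrative only.
mixed = build_elp("VHGHA", 20)
print(mixed.count("H"))
```

Changing the guest-residue pattern or the repeat count is how the composition and chain length that influence the Tt would be varied in such a sketch.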
In some cases, the IDP can be an ELP that includes the amino acid sequence of SEQ ID NO. 1: [(VPGXG)10-GKG]8 or SEQ ID NO. 2, as shown in Table 1 above. FIG. 7 illustrates an IDP comprising SEQ ID NO. 6 below (shown as “E3”) followed by an 87 amino acid RNA recognition motif (“RRM”) from the Fused in Sarcoma (FUS) protein, which comprises an RNA-binding folded domain. The RNA recognition motif can be followed by an RNA-binding disordered region comprising a 51 amino acid arginine/glycine-rich domain from the FUS protein (“RGG”).
In some cases, the IDP can be a 124 amino acid full-length hepatitis C virus core protein (COR124) (nucleocapsid) of SEQ. ID. NO. 11:
This sequence comprises a partially disordered, highly charged, and robust nucleic acid-binding protein. The RNA binding profile of COR124 as a function of concentration is shown in
In some cases, the IDP can be an ELP having an amino acid sequence of SEQ ID NO. 4: (VPGVG)40. The ELP of SEQ ID NO. 4 can be followed by a 30 amino acid sequence from HCV Core protein (COR30) that has no cysteine residues and binds RNA. For example, an ELP followed by COR30 with a length of 231 amino acids and a molecular weight of 19.9 kDa has the amino acid sequence of SEQ ID NO. 5:
In some cases, the IDP can be an ELP having an amino acid sequence of SEQ ID NO. 1, followed by the RRM and the RGG. The resulting ELP has SEQ ID NO. 6:
ELPs having an amino acid sequence of SEQ ID NOS. 5 and 6 can be used to extract nucleic acids (NA) into a protein-poor phase (PPP) and a protein-rich phase (PRP), as shown in
Solutions of ELPs and NAs in each physiologically relevant fluid were prepared, and the workflow of the experiment is described in
Inspection of the agarose gels shown in
The binding and recruitment of viral RNA into ELP fusion condensates was evaluated to assess the use of NA-binding ELPs as reagents for RNA isolation from clinical samples for subsequent amplification and detection techniques such as PCR. Purified clinical SARS-CoV-2 RNA (1×10⁷ copies) was used as the viral RNA for capture into ELP coacervates upon LLPS (
In some cases, IDPs can have an amino acid sequence of SEQ. ID. NO.: 7:
wherein X is V, H, G, or A in a ratio of 1:2:1:1 (V:H:G:A). The conjugate acid (protonated form) of the imidazole side chain in histidine has a pKa of approximately 6.0. Thus, below a pH of 6, the imidazole ring is mostly protonated. The histidine content of SEQ. ID. NO.: 7 results in a polycationic ELP; the histidine contents for the various lengths (L) are as follows:
The phase of the polycationic ELP can be switchable according to pH. In some cases, a method for nucleic acid extraction with the ELP of SEQ. ID. NO.: 7 can include the following steps:
In two specific examples of the histidine ELPs of SEQ. ID. NO.: 7, two peptides were made. The amino acid sequence of SEQ. ID. NO.: 8 is also referred to herein as “H-20.” The amino acid sequence of SEQ. ID. NO.: 9 is also referred to herein as “H-24.”
The ELP can have a histidine in the “X” position of the pentapeptide of SEQ. ID. NO.: 2, at a position external to the VPGXG pentapeptide, or a combination thereof. For example, the ELP can have an amino acid sequence of SEQ. ID. NO. 10: [(VPGXG)10-GHG]8.
The electrostatic nucleic acid binding is expected to be modulated upon pH change, as shown in
An ELP was created with an amino acid sequence of SEQ. ID. NO. 9, shown below:
(shown as “H-24”). H-24 was applied in a two-step extraction process to isolate viral RNA from anonymized nasal swab samples from human patients that had previously been classified as either COVID positive (“(+) SARS-CoV-2”) or negative (“(−) SARS-CoV-2”) by a CDC-certified diagnostic method. The efficacy of an unoptimized His-ELP-enabled extraction process was compared to that of a widely used commercial RNA extraction method (Quick-RNA™ Viral Kit, Zymo Research) in the detection of SARS-CoV-2 RNA by RT-qPCR.
The nasopharyngeal swab samples (suspended in VTM and DNA/RNA Shield™) were subjected to cell lysis by heat shock, and the lysate was then mixed with ELP H-24 in a pH 6 solution. The samples were incubated above the Tt of H-24 to induce LLPS with the objective of recruiting RNA into the protein coacervate phase to isolate it from the other lysate components. After phase separation, the protein-poor supernatant was pipetted out and the coacervate was resuspended in a pH 8.5 solution to disrupt electrostatic interactions between RNA and H-24. The solution was incubated above the Tt of the neutralized H-24 to induce LLPS with the objective of separating the H-24 protein from the RNA. The supernatant was pipetted out for PCR detection. Finally, the supernatants were diluted in nuclease-free water, as final extraction products are usually eluted in water.
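The rationale for binding at pH 6 and release at pH 8.5 can be sketched with the Henderson-Hasselbalch relation, using the approximate imidazole pKa of 6.0 noted in this description. The function names and the histidine count passed to `mean_histidine_charge` are hypothetical, and the sketch treats every histidine as independent with the same pKa, which is a simplification for a real polypeptide.

```python
def fraction_protonated(ph, pka=6.0):
    """Henderson-Hasselbalch fraction of imidazole side chains carrying +1 charge."""
    return 1.0 / (1.0 + 10.0 ** (ph - pka))


def mean_histidine_charge(n_his, ph, pka=6.0):
    """Average positive charge from n_his independent histidines (simplified model)."""
    return n_his * fraction_protonated(ph, pka)


for ph in (5.0, 6.0, 7.0, 8.5):
    print(f"pH {ph}: fraction protonated = {fraction_protonated(ph):.4f}")

# n_his=10 is an arbitrary illustrative count, not the histidine content of H-24.
print(mean_histidine_charge(10, 6.0), mean_histidine_charge(10, 8.5))
```

Under these assumptions, about half of the histidines are protonated at pH 6 and well under 1% at pH 8.5, which is consistent with electrostatic RNA binding in the first step and release in the second.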
The efficacy of the His-ELP-enabled extraction process was compared to that of a commercially available spin column methodology (Quick-RNA™ Viral Kit, Zymo Research) in the detection of SARS-CoV-2 RNA by RT-qPCR. The N1 primer/probe set used in the RT-qPCR experiments specifically amplifies a portion of the SARS-CoV-2 genome. Results of extraction of nasopharyngeal swab samples from a de-identified human COVID-19-positive patient were compared with those from a healthy human volunteer. Replicate measurements of each sample were made on different days, and representative data are provided below in Table 5. The primer/probe for PCR amplification of SARS-CoV-2 virus can be designed for any suitable unique region of the viral genome. Such primers and probes have been the subject of previous studies, including Anantharajah, A. et al., How to choose the right real-time PCR primer sets for the SARS CoV-2 genome detection?, J. Virol. Methods, 295 (2021) 114197.
Using the commercially available RNA extraction kit, SARS-CoV-2 RNA was detected in the samples from the COVID-19+ patient by RT-qPCR, with a cycle threshold (CT) value of 25. The CT value refers to the number of cycles necessary for the RT-qPCR process to detect a specific RNA sequence. A smaller value is correlated with higher RNA concentration in a sample. No SARS-CoV-2 RNA was detected in the sample from the healthy volunteer using the commercial RNA extraction kit. By comparison, after the two-step LLPS/pH switch process described above, RT-qPCR did not detect SARS-CoV-2 RNA directly in the supernatant after the LLPS at pH 8.5, but it did detect it (CT=37) after the supernatant was diluted 1:50 with nuclease-free water. Interestingly, the target RNA was not detected after similar 1:10, 1:20, or 1:100 dilutions, suggesting an optimal dilution, which may represent a balance of dilution of PCR inhibitors and sufficient SARS-CoV-2 RNA concentration for detection. The higher CT value obtained for the LLPS-based extraction suggests lower efficiency than the conventional extraction.
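To give a rough sense of scale for the CT values reported above (25 for the commercial kit, 37 for the LLPS product diluted 1:50), the sketch below converts a CT difference into an approximate fold difference in template. It assumes ideal amplification (template doubling every cycle), equal reaction input volumes, no residual PCR inhibition, that "1:50" denotes a 50-fold dilution, and that the kit eluate was not diluted; none of these is established by the data, so the numbers are order-of-magnitude illustrations only. The function name `fold_less_template` is hypothetical.

```python
def fold_less_template(ct_sample, ct_reference, efficiency=1.0):
    """Approximate factor by which the sample reaction held less template.

    efficiency=1.0 corresponds to perfect doubling per cycle (assumed).
    """
    return (1.0 + efficiency) ** (ct_sample - ct_reference)


ct_kit = 25          # commercial kit, from the text
ct_llps = 37         # LLPS product after dilution, from the text
dilution_factor = 50  # assumes "1:50" means a 50-fold dilution

in_reaction = fold_less_template(ct_llps, ct_kit)
undiluted_equivalent = in_reaction / dilution_factor

print(f"in-reaction template deficit: ~{in_reaction:.0f}-fold")
print(f"after correcting for dilution: ~{undiluted_equivalent:.0f}-fold")
```

Because CT values near 37 are close to the detection limit and inhibitors appear to affect the less dilute samples, a standard curve would be needed for any quantitative comparison.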
To examine whether the initial LLPS step at pH 6 resulted in incomplete capture of SARS-CoV-2 RNA into the coacervate phase, RT-qPCR was conducted on the supernatant obtained from that initial LLPS step. While detection of the target RNA was not possible directly in the supernatant, RNA was detected when this supernatant was diluted with nuclease-free water (1:10 CT=39; 1:20 CT=37; 1:50 CT=36; 1:100 not detected), with optimal detection (lowest CT) at 1:50 dilution. No SARS-CoV-2 RNA was detected in the sample from the healthy volunteer using the LLPS-based extraction under all dilution conditions studied.
These results demonstrate that the two-step H-24 LLPS process with pH shift is a simple process that is capable of extracting SARS-CoV-2 RNA from patient samples in a form that is detectable by RT-qPCR, albeit after significant dilution in nuclease-free water and at a higher CT than the standard commercial method. The amount of H-24 used in the extraction process, the time to achieve LLPS, and the solution conditions for LLPS in general can be optimized. Also, alternative methods of lysis such as enzymatic lysis (e.g., lysozyme), bead beating, or chemical lysis can be used instead of, or in addition to, the simple lysis procedure used here (heat shock).
According to the prediction of His-ELP charge as a function of pH (
To demonstrate the modulation of electrostatic binding of His-ELPs and NAs, gel retardation assays were performed with mixtures of His-ELPs and NAs at pH 9, where the proteins are predicted to be slightly negatively charged (
In a two-step nucleic acid separation method, pH can be adjusted to provide the ELP and nucleic acids in a sample with binding conditions in a first step, followed by separation conditions to release the nucleic acids from the ELP in a second step. For example,
The specific compositions and methods described herein are representative, exemplary and not intended as limitations on the scope of the technology. Other objects, aspects, and embodiments will occur to those skilled in the art upon consideration of this specification and are encompassed within the spirit of the technology as defined by the scope of the claims. It will be readily apparent to one skilled in the art that varying substitutions and modifications may be made to the technology disclosed herein without departing from the scope and spirit of the technology. The terms and expressions that have been employed are used as terms of description and not of limitation, and there is no intent in the use of such terms and expressions to exclude any equivalent of the features shown and described or portions thereof, but it is recognized that various modifications are possible within the scope of the technology as claimed. Thus, it will be understood that although the present technology has been specifically disclosed by embodiments and optional features, modification and variation of the concepts herein disclosed may be resorted to by those skilled in the art, and that such modifications and variations are considered to be within the scope of this technology as defined by the appended claims and statements of the technology.
The technology illustratively described herein may be practiced in the absence of any element or elements, or limitation or limitations, which is not specifically disclosed herein as essential. The methods and processes illustratively described herein may be practiced in differing orders of steps, and the methods and processes are not necessarily restricted to the orders of steps indicated herein or in the claims.
Under no circumstances may the patent be interpreted to be limited to the specific examples or embodiments or methods specifically disclosed herein. Under no circumstances may the patent be interpreted to be limited by any statement made by any Examiner or any other official or employee of the Patent and Trademark Office unless such statement is specifically and without qualification or reservation expressly adopted in a responsive writing by Applicants.
The technology has been described broadly and generically herein. Each of the narrower species and subgeneric groupings falling within the generic disclosure also form part of the technology. This includes the generic description of the technology with a proviso or negative limitation removing any subject matter from the genus, regardless of whether or not the excised material is specifically recited herein. In addition, where features or aspects of the technology are described in terms of Markush groups, those skilled in the art will recognize that the technology is also thereby described in terms of any individual member or subgroup of members of the Markush group.
The Abstract is provided to comply with 37 C.F.R. § 1.72(b) to allow the reader to quickly ascertain the nature and gist of the technical disclosure. The Abstract is submitted with the understanding that it will not be used to interpret or limit the scope or meaning of the claims.
This application claims the priority of U.S. provisional application Ser. No. 63/337,874, filed May 3, 2022, the disclosure of which is incorporated herein by reference in its entirety as if fully set forth herein.
This technology was made with government support under CBET-2031774, CBET-2048051, and MCB-2123465 from the National Science Foundation. The government has certain rights in the technology.
Number | Date | Country
---|---|---
63337874 | May 2022 | US