The present disclosure relates to gene drive system and method of use thereof. Traditional gene drives approaches employ a Cas9 that introduces DNA double-strand breaks to promote homology-directed repair (HDR) and consequent gene drive propagation. While the scientific community suggested the use of a nickase Cas9 for population engineering, no experimental data is available about the efficiency of a nickase Cas9 to edit an organism germline in a gene drive context. While paired nicks demonstrated their ability to disrupt genes and promote HDR in vitro there exists no system using a nickase Cas9 to introduce paired-nicks and promote HDR in the germline of a living organism.
The Culex pipiens species complex includes several species and their hybrid forms, including Cx. pipiens pipiens, Cx. pipiens pallens, Cx. pipiens molestus, Cx. quinquefasciatus, Cx. australicus, and Cx. globocoxitus. While some of these species have limited distribution, Cx. pipiens pipiens, Cx. quinquefasciatus, and their hybrids are widespread globally and represent the primary urban and suburban disease vector. The ability of these species to produce fertile hybrids has been a source of an ongoing discussion around the taxonomy of these species and has been proposed as a mechanism explaining the surprising adaptability of these mosquitoes and may account for the rapid evolution of insecticide resistance across the Culex pipiens complex. While CRISPR has allowed the development of engineered gene drives (GDs) based on homing endonucleases and other technologies that use genetically-engineered vectors to modify or suppress wild populations, Culex has been somewhat neglected regarding the development of genetic suppression strategies. No extensive CRISPR technology application has been achieved, mostly due to the fewer tools available for this species.
More background information related to gene drive system, and method of use thereof, fir genetic suppression, is described in EXAMPLES 1 and 2, respectively, of the present disclosure.
The present disclosure provides a novel gene drive methodology that can be used to spread engineered traits into a targeted population. The method of the present disclosure uses a transgenic line containing a modified Cas9 (i.e., nickase Cas9) that introduces single-strand breaks (nicks, instead of double-stranded DNA cuts) which promote super-Mendelian inheritance of an engineered gene drive allele. Further, the present disclosure provides a full sequence of a gene drive system, based on the use of nickase which is built for applications aimed at the suppression of Culex quinquefasciatus mosquitoes.
In certain embodiments, the present disclosure provides a system that is the first example of a gene drive that is able to bias Mendelian inheritance using a nickase-Cas9. In certain embodiments, a (modified) nickase Cas9 is utilized that introduces paired-nicks to produce a DNA break an allow the propagation of an engineered genetic element. The use of nicks instead of double-stranded breaks could lead to increased flexibility in gene drive design. Benefits include: 1) more stereotyped patterns of non-conversion alleles (insertion-deletions caused by the NHEJ DNA-repair pathway) which would allow to target larger beneficial portions of the gene target to ensure the negative selection of non-conversion alleles; and 2) the use of nickases should reduce the generation of mutation at off-target sites.
Several potential advantages using a nickase-based gene-drive system also include, first, DNA nicks are involved in important biological processes such as DNA replication and are typically repaired efficiently. For this reason, the novel arrangement of the method of the present disclosure could help limit the formation of resistant alleles; if paired-nicks do not occur simultaneously, single nicks should restore the original wildtype sequence to allow further targeting for gene drive conversion. Second, a nickase-based gene-drive approach could increase specificity and reduce off-target effects, which represent nonspecific genome editing events, as two independent cleavage events need to happen coordinately to obtain the desired modification. For example, gene drives also demonstrated its ability to bias Mendelian inheritance using mice, and a nickase-based gene-drive system should be translatable to these animals to improve specificity and reduce observed undesired error effects when editing mammalian embryos. Third, evidence is provided that the nH840A transgene combined with paired-gRNAs in a PAM-in configuration frequently generated large deletions between the spaced nicks; a scenario that is not detected when using the regular Cas9 inducing double-strand breaks. This property could be harnessed to boost efficiency when targeting vital genes or essential protein domains. Such large deletions should produce non-functional alleles resulting in non-viable animals in case of HDR failure, therefore ensuring the removal of individual escapees carrying small indels that allow for animal viability within a population.
Furthermore, only two out of four experimental conditions lead to successful gene drive, suggesting that the application of a nickase-Cas9 to gene drive was not obvious a priori. Additionally, only one of the two working conditions gives maximal efficiency, further supporting the uniqueness of the disclosed method.
The present disclosure further provides that the method and the nickase-based gene drive system of the present disclosure have several applications. For instance, adaptation of the system to generate nickase gene drive mosquitoes that could be used for population engineering to reduce the impact of mosquito-borne diseases. Using a nickase gene drive system in mice to trigger HDR while reducing off-target effects for field application to control pests or invasive species. A similar approach could also be applied in mammalian cells to boost HDR rates, while lowering off-targets.
The present disclosure also provides a full sequence of a gene drive system, based on the use of nickase which is built for applications aimed at the suppression of Culex quinquefasciatus mosquitoes. This is the first example of full-gene drive tailored for the suppression of this mosquito species. In certain embodiments, the present disclosure provides development strategies and approaches to generate the building blocks for the expression of CRISPR components, develop the know-how for site-specific transgenesis, and build a gene drive system for the suppression of Cx. quin. The present disclosure provides improvements that would obviate drawbacks of previous technology which aimed at the disruption of doublesex in females instead of the conversion to functional males. While the outlined strategy is tailored to this species (Culex quinquefasciatus), the same approach could be used for the suppression of other mosquitoes or insects (examples are Anopheles mosquitoes, Aedes mosquitoes, Drosophila suzukii fruit fly pest, etc.). The present disclosure provides both a graphical representation of the gene drive construct and the full sequence of the material.
Other systems, methods, features, and advantages of the present disclosure will be or become apparent to one with skill in the art upon examination of the following drawings and detailed description. It is intended that all such additional systems, methods, features, and advantages be included within this description, be within the scope of the present disclosure, and be protected by the accompanying claims. In addition, all optional and preferred features and modifications of the described embodiments are usable in all aspects of the disclosure taught herein. Furthermore, the individual features of the dependent claims, as well as all optional and preferred features and modifications of the described embodiments are combinable and interchangeable with one another.
Additional aspects and advantages of the present disclosure will become readily apparent to those skilled in this art from the following detailed description, wherein only illustrative embodiments of the present disclosure are shown and described. As will be realized, the present disclosure is capable of other and different embodiments, and its several details are capable of modifications in various obvious respects, all without departing from the disclosure. Accordingly, the drawings and description are to be regarded as illustrative in nature, and not as restrictive.
All publications, patents, and patent applications mentioned in this specification are herein incorporated by reference to the same extent as if each individual publication, patent, or patent application was specifically and individually indicated to be incorporated by reference. To the extent publications and patents or patent applications incorporated by reference contradict the disclosure contained in the specification, the specification is intended to supersede and/or take precedence over any such contradictory material.
Many aspects of the present disclosure can be better understood with reference to the drawings. The components in the drawings are not necessarily to scale, emphasis instead being placed upon clearly illustrating the principles of the present disclosure. Moreover, in the drawings, like reference numerals designate corresponding parts throughout the several views. A better understanding of the features and advantages of the invention will be obtained by reference to the following detailed description that sets forth illustrative embodiments, in which the principles of the invention are utilized, and the accompanying drawings (also “Figure” and “FIG.” herein), of which:
Additional advantages of the invention will be set forth in part in the description which follows, and in part will be obvious from the description, or can be learned by practice of the invention. The advantages of the invention will be realized and attained by means of the elements and combinations particularly pointed out in the appended claims. It is to be understood that both the foregoing general description and the following detailed description are exemplary and explanatory only and are not restrictive of the invention, as claimed.
The present disclosure provides a novel gene drive method and Cas9 nickase-based gene drive system that can be used to spread engineered traits into a targeted population. In certain embodiments, the method of the present disclosure uses a transgenic line containing a modified Cas9 (e.g., nickase Cas9) that introduces single-strand breaks (nicks, instead of double-stranded DNA cuts) which promote super-Mendelian inheritance of an engineered gene drive allele. A full sequence of a gene drive system, based on the use of nickase which is built for applications aimed at the suppression of Culex quinquefasciatus mosquitoes is also provided. The present disclosure further provides building blocks for the expression of CRISPR components, develop the know-how for site-specific transgenesis, and build a gene drive system for the suppression of Cx. quin.
Many modifications and other embodiments disclosed herein will come to mind to one skilled in the art to which the disclosed compositions and methods pertain having the benefit of the teachings presented in the foregoing descriptions and the associated drawings. Therefore, it is to be understood that the disclosures are not to be limited to the specific embodiments disclosed and that modifications and other embodiments are intended to be included within the scope of the appended claims. The skilled artisan will recognize many variants and adaptations of the aspects described herein. These variants and adaptations are intended to be included in the teachings of this disclosure and to be encompassed by the claims herein.
Although specific terms are employed herein, they are used in a generic and descriptive sense only and not for purposes of limitation.
As will be apparent to those of skill in the art upon reading this disclosure, each of the individual embodiments described and illustrated herein has discrete components and features which may be readily separated from or combined with the features of any of the other several embodiments without departing from the scope or spirit of the present disclosure.
Any recited method can be carried out in the order of events recited or in any other order that is logically possible. That is, unless otherwise expressly stated, it is in no way intended that any method or aspect set forth herein be construed as requiring that its steps be performed in a specific order. Accordingly, where a method claim does not specifically state in the claims or descriptions that the steps are to be limited to a specific order, it is no way intended that an order be inferred, in any respect. This holds for any possible non-express basis for interpretation, including matters of logic with respect to arrangement of steps or operational flow, plain meaning derived from grammatical organization or punctuation, or the number or type of aspects described in the specification.
All publications mentioned herein are incorporated herein by reference to disclose and describe the methods and/or materials in connection with which the publications are cited. The publications discussed herein are provided solely for their disclosure prior to the filing date of the present application. Nothing herein is to be construed as an admission that the present invention is not entitled to antedate such publication by virtue of prior invention. Further, the dates of publication provided herein can be different from the actual publication dates, which can require independent confirmation.
While aspects of the present disclosure can be described and claimed in a particular statutory class, such as the system statutory class, this is for convenience only and one of skill in the art will understand that each aspect of the present disclosure can be described and claimed in any statutory class.
It is also to be understood that the terminology used herein is for the purpose of describing particular aspects only and is not intended to be limiting. Unless defined otherwise, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which the disclosed compositions and methods belong. It will be further understood that terms, such as those defined in commonly used dictionaries, should be interpreted as having a meaning that is consistent with their meaning in the context of the specification and relevant art and should not be interpreted in an idealized or overly formal sense unless expressly defined herein.
Prior to describing the various aspects of the present disclosure, the following definitions are provided and should be used unless otherwise indicated. Additional terms may be defined elsewhere in the present disclosure.
As used herein, “comprising” is to be interpreted as specifying the presence of the stated features, integers, steps, or components as referred to, but does not preclude the presence or addition of one or more features, integers, steps, or components, or groups thereof. Moreover, each of the terms “by”, “comprising,” “comprises”, “comprised of,” “including,” “includes,” “included,” “involving,” “involves,” “involved,” and “such as” are used in their open, non-limiting sense and may be used interchangeably. Further, the term “comprising” is intended to include examples and aspects encompassed by the terms “consisting essentially of” and “consisting of.” Similarly, the term “consisting essentially of” is intended to include examples encompassed by the term “consisting of”.
As used in the specification and the appended claims, the singular forms “a,” “an” and “the” include plural referents unless the context clearly dictates otherwise. Thus, for example, reference to “a catalyst,” “a metal,” or “a substrate,” includes, but are not limited to, mixtures or combinations of two or more such catalysts, metals, or substrates, and the like.
It should be noted that ratios, concentrations, amounts, and other numerical data can be expressed herein in a range format. It will be further understood that the endpoints of each of the ranges are significant both in relation to the other endpoint, and independently of the other endpoint. It is also understood that there are a number of values disclosed herein, and that each value is also herein disclosed as “about” that particular value in addition to the value itself. For example, if the value “10” is disclosed, then “about 10” is also disclosed. Ranges can be expressed herein as from “about” one particular value, and/or to “about” another particular value. Similarly, when values are expressed as approximations, by use of the antecedent “about,” it will be understood that the particular value forms a further aspect. For example, if the value “about 10” is disclosed, then “10” is also disclosed.
When a range is expressed, a further aspect includes from the one particular value and/or to the other particular value. For example, where the stated range includes one or both of the limits, ranges excluding either or both of those included limits are also included in the disclosure, e.g. the phrase “x to y” includes the range from ‘x’ to ‘y’ as well as the range greater than ‘x’ and less than ‘y’. The range can also be expressed as an upper limit, e.g. ‘about x, y, z, or less’ and should be interpreted to include the specific ranges of ‘about x’, ‘about y’, and ‘about z’ as well as the ranges of ‘less than x’, less than y’, and ‘less than z’. Likewise, the phrase ‘about x, y, z, or greater’ should be interpreted to include the specific ranges of ‘about x’, ‘about y’, and ‘about z’ as well as the ranges of ‘greater than x’, greater than y’, and ‘greater than z’. In addition, the phrase “about ‘x’ to ‘y’”, where ‘x’ and ‘y’ are numerical values, includes “about ‘x’ to about ‘y’”.
It is to be understood that such a range format is used for convenience and brevity, and thus, should be interpreted in a flexible manner to include not only the numerical values explicitly recited as the limits of the range, but also to include all the individual numerical values or sub-ranges encompassed within that range as if each numerical value and sub-range is explicitly recited. To illustrate, a numerical range of “about 0.1% to 5%” should be interpreted to include not only the explicitly recited values of about 0.1% to about 5%, but also include individual values (e.g., about 1%, about 2%, about 3%, and about 4%) and the sub-ranges (e.g., about 0.5% to about 1.1%; about 5% to about 2.4%; about 0.5% to about 3.2%, and about 0.5% to about 4.4%, and other possible sub-ranges) within the indicated range.
As used herein, the terms “about,” “approximate,” “at or about,” and “substantially” mean that the amount or value in question can be the exact value or a value that provides equivalent results or effects as recited in the claims or taught herein. That is, it is understood that amounts, sizes, formulations, parameters, and other quantities and characteristics are not and need not be exact but may be approximate and/or larger or smaller, as desired, reflecting tolerances, conversion factors, rounding off, measurement error and the like, and other factors known to those of skill in the art such that equivalent results or effects are obtained. In some circumstances, the value that provides equivalent results or effects cannot be reasonably determined. In such cases, it is generally understood, as used herein, that “about” and “at or about” mean the nominal value indicated ±10% variation unless otherwise indicated or inferred. In general, an amount, size, formulation, parameter or other quantity or characteristic is “about,” “approximate,” or “at or about” whether or not expressly stated to be such. It is understood that where “about,” “approximate,” or “at or about” is used before a quantitative value, the parameter also includes the specific quantitative value itself, unless specifically stated otherwise.
As used herein, the terms “optional” or “optionally” means that the subsequently described event or circumstance can or cannot occur, and that the description includes instances where said event or circumstance occurs and instances where it does not.
Unless otherwise expressly stated, it is in no way intended that any method set forth herein be construed as requiring that its steps be performed in a specific order. Accordingly, where a method claim does not actually recite an order to be followed by its steps or it is not otherwise specifically stated in the claims or descriptions that the steps are to be limited to a specific order, it is no way intended that an order be inferred, in any respect. This holds for any possible non-express basis for interpretation, including: matters of logic with respect to arrangement of steps or operational flow; plain meaning derived from grammatical organization or punctuation; and the number or type of embodiments described in the specification.
Unless otherwise specified, temperatures referred to herein are based on atmospheric pressure (i.e. one atmosphere).
Now having described the aspects of the present disclosure, in general, the following Examples describe some additional aspects of the present disclosure. While aspects of the present disclosure are described in connection with the following examples and the corresponding text and figures, there is no intent to limit aspects of the present disclosure to this description. On the contrary, the intent is to cover all alternatives, modifications, and equivalents included within the spirit and scope of the present disclosure.
The following examples are put forth so as to provide those of ordinary skill in the art with a complete disclosure and description of how the compounds, compositions, articles, devices and/or methods claimed herein are made and evaluated and are intended to be purely exemplary of the disclosure and are not intended to limit the scope of what the inventors regard as their disclosure. Efforts have been made to ensure accuracy with respect to numbers (e.g., amounts, temperature, etc.), but some errors and deviations should be accounted for. Unless indicated otherwise, parts are parts by weight, temperature is in ° C. or is at ambient temperature, and pressure is at or near atmospheric.
CRISPR gene-drive systems have emerged as a promising tool for disseminating engineered traits into wild populations to control disease transmission. This rapid dissemination is possible due to the ability of these systems to surpass Mendel's First Law of gene segregation, which dictates that an allele has a 50% probability of being passed to the next generation; in fact, gene drives can reach up to 100% inheritance of a desired allele. A proof-of-concept system was first implemented in flies (Gantz and Bier, 2015) and was applied to different mosquitoes such as Anopheles or Aedes under laboratory conditions to fight vector-borne diseases (Adolfi et al., 2020; Gantz et al., 2015; Hammond et al., 2016; Kyrou et al., 2018; Li et al., 2020; Simoni et al., 2020).
CRISPR-based gene drives consist of a three-component transgene: (i) Cas9, a DNA nuclease that produces DNA double-strand breaks; (ii) a guide RNA (gRNA) that directs Cas9 to cleave the DNA at a predetermined site; and (iii) two homology arms flanking the Cas9/gRNA components that perfectly match both sides of the cut site to promote homology-directed repair (HDR). When a gene-drive individual mates with a wildtype, the encoded Cas9 from the engineered gene drive cuts the wildtype allele in the germline, which is replaced by HDR using the intact gene-drive chromosome as a repair template. With the gene drive present on both alleles (i.e., homozygous), this process produces a super-Mendelian inheritance (>50%) of the engineered cassette to spread new traits through a population.
Current gene-drive methods employ a Cas9 that introduces DNA double-strand breaks (Adolfi et al., 2020; Bier, 2021; Gantz et al., 2015; Hammond et al., 2016; Kyrou et al., 2018; Li et al., 2020; Simoni et al., 2020). However, lack of alternative Cas9-based strategies could enlarge the available toolkit for gene-drive designs while potentially bringing advantages to improve the current ones. In fact, mutant versions of Cas9 that only generate nicks should also be amenable. Wildtype Cas9 contains two endonuclease domains (HNH and RuvC-like domains) that can introduce DNA double-strand breaks, where each cleaves one strand of the DNA double-helix (Gasiunas et al., 2012; Jinek et al., 2012). By mutating critical residues in the nuclease domains, two nickase versions of Cas9 can be generated: i) nCas9-D10A (nD10A) contains an inactivated RuvC domain and only cuts the target strand where the gRNA is bound, and ii) nCas9-H840A (nH840A) contains an inactivated HNH and so only cuts the non-target strand (Gasiunas et al., 2012; Jinek et al., 2012).
Importantly, the nickase versions of Cas9 already demonstrated their efficiency for genome editing. nD10A has been used to generate paired DNA nicks and efficiently disrupts genes in Drosophila and cell culture (Gopalappa et al., 2018; Port et al., 2014). Nicks induced by nD10A promoted higher HDR rates than nH840A, where HDR was almost undetectable (Bothmer et al., 2017; Hyodo et al., 2020; Mali et al., 2013; Wang et al., 2021, 2018). Furthermore, nD10A can boost specificity while reducing off-target effects since it requires two gRNAs that target complementary DNA strands to either disrupt a gene function or trigger HDR. When the same pair of gRNAs were combined with wildtype Cas9, undesired off-target effects at an unspecific genomic region were generated, which does not occur with nD10A (Ran et al., 2013).
Nickase Cas9 (nCas9) versions have been extensively utilized in vitro to evaluate HDR efficiencies. However, only a few studies attempted to employ paired-gRNAs' nickase-based approaches using mice and Drosophila, where nD10A promoted modest HDR rates (Lee and Lloyd, 2014; Ren et al., 2014). Therefore, the design of novel strategies using both nD10A and nH840A to induce meaningful HDR rates in the germline of a living organism is needed to better understand the fine-workings of nickase-based HDR in vivo. A nCas9-based gene drive might be applicable for population engineering to bring potential advantages. For example, DNA nicks are involved in important biological processes such as DNA replication and are typically repaired efficiently (Caldecott, 2008; Chafin et al., 2000; Reyes et al., 2021; Wang and Hays, 2007). Therefore, if paired nicks do not occur simultaneously, single nicks could restore the original wildtype sequence, reducing the formation of mutations or resistant alleles at the target site for further gene-drive conversion. Additionally, DNA nicks follow distinct DNA repair pathways compared to DNA double-strand breaks that are introduced by traditional gene drives (Vriend and Krawczyk, 2017), and the intrinsically offset distance between paired nicks in a gene-drive setting could potentially favor the formation of specific mutations to improve gene-drive propagation in certain applications.
Thus, simultaneous paired nicks targeting two adjacent DNA regions should generate a staggered double-strand break followed by DNA repair by HDR to promote super-Mendelian inheritance of an engineered gene-drive construct. Herein, a nCas9-based gene-drive system was developed which promoted super-Mendelian inheritance in Drosophila melanogaster as a proof-of-concept and showed that nD10A and nH840A can promote efficient HDR in the germline. Interestingly, it showed that super-Mendelian inheritance rates can be achieved only when the gene-drive design generated 5′ overhangs. It also showed that nH840A produces larger deletions compared to nD10A when the allelic conversion fails. This feature can potentially be employed to reduce the presence of viable animals carrying small mutations when targeting essential genes, as strategies to generate large deletions removing critical protein domains to ensure insects' lethality could be designed. Overall, this work expands the technology and applicability of CRISPR gene-drive systems for genetic engineering of wild populations.
All flies were kept on standard food with a 12/12 hours day/night cycle. Fly stocks are kept at 18° C., and all experimental crosses were performed at 25° C. All flies were anesthetized during the experiments using CO2. F0 crosses from gene-drive experiments were made in pools of 3-6 virgin females crossed to 3-6 males. F1 experiments were always made in single pairs to track editing events happening singularly in the germline. The F2 progeny was scored as male or female and sorted for a fluorescent marker (DsRed and/or GFP) using a Leica M165 F2 Stereomicroscope with fluorescence as an indicator of transgene inheritance rates. All experiments were performed in a high-security ACL2 (Arthropod Containment Level 2) facility built for gene drive purposes in the Division of Biological Sciences at the University of California, San Diego. Crosses were made in polypropylene vials (Genesee Scientific Cat. #32-120), and all flies were frozen for 48 hours before being removed from the facility, autoclaved, and discarded as biohazardous waste.
DNA constructs were built using NEBuilder HiFi DNA Assembly Master Mix (New England BioLabs Cat. #E2621) and transformed into NEB 10-beta electrocompetent E. coli (New England BioLabs Cat. #3020). DNA was extracted using a Qiagen Plasmid Midi Kit (Qiagen Cat. #12143) and sequenced by Sanger sequencing at Genewiz. Primers used for cloning can be found in the Key Resources Table below.
Embryo injections were outsourced to Rainbow Transgenic Flies, Inc. All DNA constructs were injected into the lab's isogenized Oregon-R (Or-R) strain to ensure consistent genetic background throughout experiments. Plasmid templates were co-injected with a Cas9-expressing plasmid (pBSHsp70-Cas9 was a gift from Melissa Harrison & Kate O'Connor-Giles & Jill Wildonger [Addgene plasmid #46294; RRID: Addgene_46294]). The injected generation 0 (G0) animals were received and then intercrossed the hatched adults in small pools (3-5 males×3-5 females), and the G1 flies were screened for a fluorescent marker (DsRed for Cas9 versions and GFP for gene-drive elements, both fluorescences in the eye), which was indicative of transgene insertion. Homozygous lines were established from single transformants by crossing to Or-R. As the Cas9 transgene is inserted into the yellow gene disrupting it, homozygous flies for the Cas9 versions can be identified once flies display a yellow body color. Similarly for the Copycat constructs that are integrated into the white gene, homozygous flies for the CopyCat elements display a white eye phenotype. Stocks were sequenced by PCR and Sanger sequencing to confirm proper transgene insertion.
DNA Extraction from Single Flies
To sequence resistant alleles, genomic DNA was extracted from individual males following the method described by Gloor GB and colleagues (Gloor et al., 1993). In brief, 50 ul of the extraction buffer was used to squish single flies in a PCR tube which was placed into the PCR machine (Proflex PCR system, Applied Biosystems) for 1 hour at 37° C. followed by 5 minutes at 95° C. to inactivate the proteinase K. Then, 200 μL of water was added to each DNA sample to obtain a total of 250 uL per sample. 1-5 uL was used in a 25 uL PCR reaction covering the gRNA cut sites in the white gene for Sanger sequencing analysis.
A DNA region covering the gRNA cut sites was amplified using the v1564 and v1565 oligos (see oligos list below). The obtained amplicon was then sequenced by Sanger sequencing to determine the quality of the resistant alleles using the v478 oligo. When lower-quality traces were obtained, a second Sanger sequencing reaction was performed from the other side of the amplicon to confirm the quality of the mutation with either the v659 or v1571 primers. Primers used for resistance allele sequencing are listed in the following Table.
Adult flies were anesthetized using 002 to select individuals for crossing experiments. Their phenotypes were analyzed using a Leica M165 FC Stereo microscope to properly prepare the experimental crosses. Inheritance analysis of the transgenes marked with fluorescence was evaluated using the same microscope. DsRed marker implies presence of the Cas9 cassettes while GFP fluorescence indicates presence of the CopyCat transgenes.
GraphPad Prism 9 and Adobe Illustrator were used to generate all the graphs. Statistical analyses were performed using GraphPad Prism 9. In
To design a nickase-based gene drive, a gRNA-only split-drive system (i.e., Copycat) was used that consists of two separate components: (i) a transgenic fly carrying a static Cas9 transgene, which is inherited in a Mendelian fashion, and (ii) an engineered animal carrying a CopyCat cassette formed by a gRNA gene that is flanked by two homology arms (Gantz and Bier, 2016). Once the two components are genetically crossed, traditional CopyCat gene drives rely on DNA double-strand breaks produced by a single gRNA, which targets the same sequence on the wildtype allele where it is inserted, to propagate the synthetic cassette by HDR (Champer et al., 2019; Gantz and Bier, 2016; López Del Amo et al., 2020a; Xu et al., 2017) (
To build a gene drive system based on a nCas9, two transgenic lines containing a DsRed marker and carrying either the nD10A or nH840A versions were generated, which cut the target strand (bound to the gRNA) or the non-target strand, respectively. Additionally, a validated wildtype Cas9 line was employed, which introduces DNA double-strand breaks (López Del Amo et al., 2020b) as a positive control (
The nickases were combined with the two CopyCat transgenic lines to create four schemes to test the nickase gene-drive system: i) nD10A with the CC(w2,w9) (PAM-in) to generate 3′ overhangs; ii) nD10A with the CC(w2,w8) (PAM-out) to generate 5′ overhangs; iii) nH840A with CC(w2,w9) (PAM-in) to generate 5′ overhangs and iv) nH840A with the CC(w2,w8) (PAM-out) transgenic line to generate 3′ overhangs. Importantly, combining either of the CopyCat lines with wildtype Cas9 produces similar blunt ends in both situations (
To evaluate the efficiency of the CopyCat elements, the same genetic cross scheme was used in all cases. First, male flies containing the Cas9 source (wildtype Cas9, nD10A, or nH840A) were combined with the CopyCat lines to obtain F1 trans-heterozygous animals carrying both transgenes. Then, these F1 females were crossed to a white mutant line to evaluate biased inheritance in their F2 progeny. If the Copycat is inactive, 50% inheritance of the GFP marker should be observed, which is integrated within the engineered cassette as mentioned. If the CopyCat construct can promote HDR in the germline, it should display super-Mendelian inheritance (>50%) of the GFP-marked transgene. All Cas9 sources, which carry the DsRed marker, should have ˜50% inheritance (
When combining the wildtype Cas9 with both CopyCat elements, two proximal DNA double-strand breaks were introduced by a multiplexing approach (
After confirming the activity of the two elements built to test the nickase gene drive, the ability of nCas9 to promote super-Mendelian inheritance in either of the four defined scenarios was evaluated (
The nH840A line following the same cross scheme was also tested (
Overall, nD10A performed significantly better than nH840A for inheritance bias when generating 5′ overhangs (p<0.0001), which could be due to different cleavage rates between the nickases. Since the white gene targeted for conversion in this system is tightly linked with the yellow gene where all the Cas9 sources are inserted, this experimental design allowed us to evaluate conversion (HDR), mutations (resistant alleles) and wildtype/uncut allele rates arising from all conditions tested via the F2 progeny (
Altogether, it was demonstrated the example of a gene-drive system driven by a nCas9, which simultaneously nicks both complementary strands to induce efficient allelic conversion that is mediated by HDR in vivo. Furthermore, these data indicate that super-Mendelian inheritance through nickase gene drives can only be achieved by generating 5′ overhangs.
DNA double-strand breaks introduced by gene drives can produce resistant alleles at the target site when the HDR-mediated allelic conversion process is inaccurate. Resistant alleles generated by wildtype Cas9 have been characterized (Champer et al., 2018; Gantz and Bier, 2015; Hammond et al., 2017; Lopez Del Amo et al., 2020b). However, the types of resistant alleles generated by paired nicks are uncharacterized in a gene-drive context. Here, the resistant alleles generated by the wildtype Cas9 and nickase gene-drive systems were identified and evaluated, providing promoted super-Mendelian inheritance (
The DNA was extracted from these animals and characterized by Sanger sequencing. Five different classes of resistant alleles were observed: i) large deletions spanning both cut sites (>50 base pairs [bp]), ii) deletions (<50 bp) occurring at one cut site with the other cut site intact, iii) simultaneous deletions (<50 bp) at both target sites with partial sequence retention between cut sites, iv) large insertions (>200 bp) and v) small insertions combined with small deletions in the same individuals at the same cut site (i.e., insertion+deletion) (
When F2 flies arising from the nH840A and PAM-in gRNAs combination that carried resistant alleles were sequenced, it was detected that 42% of these individuals contained large deletions spanning 50 to 90 nucleotides (
Large insertions (>200 bp) were also observed when nD10A was combined with paired gRNAs in a PAM-out orientation. It was found that 25% of the sequenced flies within this condition had large insertions, which was significantly higher than nH840A combined with PAM-in gRNAs, where no large insertion was detect (
No significant differences between the nickase genotypes was observed in the other categories described above, including single cuts at only one target site, simultaneous mutations at both cut sites or insertion+deletion events happening in the same individual (
Overall, it is confirmed that the gRNAs in this work are active, as mutations were found at all target sites within the genotypes that produced super-Mendelian inheritance. Additionally, different repair outcomes were also shown when using distinct Cas9 sources, which can inform future gene-drive strategies.
In this work, it reports a gene-drive system based on nickase versions of Cas9 that promotes super-Mendelian inheritance in Drosophila and increases the number of feasible design options for gene drives aimed at population engineering. It showed that both nD10A and nH840A produced efficient HDR in the germline, but only when the two nicks in the DNA generated 5′ overhangs. Characterization of events that failed to convert the wildtype allele to the gene drive by Sanger sequencing indicated that nH840A combined with PAM-in gRNAs produced higher rates of large deletions compared to the nD10A and PAM-out arrangement. While the PAM-out condition triggered large insertions, it was not observed such alterations when resistant alleles produced by the nH840A were analyzed, suggesting that the modes of DNA repair that are triggered are nickase dependent.
In this Example reported herein, nD10A produced higher super-Mendelian rates than nH840A, which may attribute to differences in cleavage activities between the nickases. In fact, nH840A has been shown to present less cleavage activity in vitro as it produced lower indel rates when disrupting the EMX-S1 gene (Gopalappa et al., 2018). Additionally, the distinct time-windows of cleavage between the pairs of gRNAs, which need to cut simultaneously, may impact HDR efficiencies as both nickases showed super-Mendelian inheritance with different paired gRNAs. While previous studies did not report meaningful HDR rates with nH840A (Bothmer et al., 2017; Mali et al., 2013; Ran et al., 2013), here, it showed efficient HDR rates achieved by gRNAs in a PAM-in configuration when generating 5′ overhangs using nH840A for the first time. Thus, nH840A could be a viable option for future nickase-based designs to promote HDR.
While the focus here was for a Cas9-nickase gene drive, slightly higher super-Mendelian rates was observed in the wildtype Cas9 over the nickases. This is because a nickase requires the coordinated action of both gRNAs cutting simultaneously to induce efficient HDR. Conversely, when the wildtype Cas9 is used, a single cut from either of the paired gRNAs can induce HDR to produce double-stranded DNA breaks. Furthermore, while failure of the first gRNA would result in a small indel, the second gRNA can still cut and trigger a second round of potential conversion, and which could explain the higher inheritance rates in the wildtype Cas9 scenario. Thus, thoroughly testing the paired gRNAs to maximize coordinated action in nickase gene-drive systems could be important.
With regard to resistant allele formation, it was shown that the nH840A transgene combined with paired gRNAs in a PAM-in configuration frequently generated large deletions between the spaced nicks, which was not detected when using gRNAs in a PAM-out configuration with nD10A. This effect can be harnessed to boost gene-drive propagation when specifically targeting essential genes. For example, the gene-drive element can carry a DNA rescue sequence to replace a wildtype allele while restoring functionality of vital genes to ensure animal viability and gene-drive spread. If resistant alleles occur from unsuccessful allelic conversion, these mutations should produce non-viable animals that evade propagation, though small mutations in essential genes can still produce some viable escapees (Terradas et al., 2021). Therefore, large deletions induced by a gene-drive system using nH840A in a PAM-in configuration should help remove surviving escapees in a population carrying small indels.
This report also reports whether a nickase gene-drive system could reduce the formation of resistant alleles, as DNA nicks are usually repaired efficiently. Indeed, wildtype/uncut alleles were observed within the nickase experiment conditions promoting biased inheritance, and this could help reduce resistant alleles formation while facilitating further gene drive conversion. However, resistant alleles caused by single DNA nicks were also detected, which could be from non-repaired single nicks that were converted to double-strand breaks that were subsequently fixed by non-homologous end joining (Kuzminov, 2001). Importantly, mutations produced at a single target site by DNA nicks could limit further gene-drive propagation, as single nicks are poor HDR inducers (Vriend et al., 2016) and a single mutation at one target site would be enough to avoid gene-drive spread. However, the nickase system presented herein is amenable to further optimization, especially as one major contributing factor to this might be due to fixing the induced DNA paired nicks to ˜50 nucleotides apart, though in fact, efficient HDR has been observed with offset distances ranging from 20 to 100 base pairs (Vriend et al., 2016). Thus, further adapting the strategy to generate distinct offset distances to improve the HDR efficiency of the nickase-based system is recommended.
Future nickase-gene-drive approaches could explore additional optimizations to increase specificity and reduce off-target effects, which can accumulate in a population and must be considered. As two independent cleavage events need to be coordinated for the desired modification, paired nicks were shown in previous work to improve specificity while reducing off-target effects when disrupting the EMX1 gene in human cell culture(Ran et al., 2013). Recently, the off-target effects of gene drives were predicted using validated algorithms and posterior in vivo-targeted deep sequencing with Anopheles mosquitoes in laboratory cage studies (Carballar-Lejarazú et al., 2020; Garrood et al., 2021). The off-targets effects were almost undetectable if using promoters that restricted Cas9 expression to the germline. Indeed, a nickase gene-drive system could be tested in Anopheles and species with much larger genomes, such as Aedes or Culex mosquitoes(Main et al., 2021; Severson et al., 2004), to study the pervasiveness of off-target effects across genome sizes and in the wild. Furthermore, gene drives can bias Mendelian inheritance in mice (Grunwald et al., 2019; Weitzel et al.). Further, a nickase-based gene-drive system could also be applied to mice or to reduce off-target effects when editing mammalian embryos (Aryal et al., 2018).
Altogether, this Example provides the development of nickase-based gene drives that advance their applications. The use of nickase-based gene-drive systems disclosed herein is for improved population control while encouraging its implementation in a broader range of organisms.
CRISPR-based gene-drives have been proposed for managing insect populations, including disease-transmitting mosquitoes, due to their ability to bias their inheritance towards super-Mendelian rates (>50%). Current technologies employ a Cas9 that introduces DNA double-strand breaks into the opposing wildtype allele to replace it with a copy of the gene-drive allele via DNA homology-directed repair. Yet, the use of different Cas9 versions is unexplored, and alternative approaches could increase the available toolkit for gene-drive designs. Here, this Example reports a gene drive that relies on Cas9 nickases that generate staggered paired nicks in DNA to propagate the engineered gene-drive cassette. It shows that generating 5′ overhangs in the system yields efficient allelic conversion. The nickase gene-drive arrangement produces large, stereotyped deletions that are advantageous to eliminate viable animals carrying small mutations when targeting essential genes. This nickase gene-drive approach should expand the repertoire for gene-drive arrangements aimed at applications in mosquitoes and beyond.
The Culex pipiens species complex includes several species and their hybrid forms, including Cx. pipiens pipiens, Cx. pipiens pallens, Cx. pipiens molestus, Cx. quinquefasciatus, Cx. australicus, and Cx. globocoxitus. While some of these species have limited distribution, Cx. pipiens pipiens, Cx. quinquefasciatus, and their hybrids are widespread globally and represent the primary urban and suburban disease vector [1]. The ability of these species to produce fertile hybrids has been a source of an ongoing discussion around the taxonomy of these species [2] and has been proposed as a mechanism explaining the surprising adaptability of these mosquitoes such as the “molestus” form of Cx. pipiens adapted to survive in the underground network of London [3] and may account for the rapid evolution of insecticide resistance across the Culex pipiens complex [4,5].
These mosquitoes are vectors for several medically-relevant RNA viruses such as West Nile virus, Japanese Encephalitis virus, Rift Valley fever virus, Western Equine Encephalitis virus, Eastern Equine Encephalitis virus, Sindbis virus, and St. Louis Encephalitis virus, as well as nematodes causing lymphatic filariasis [1]. Although fatalities in the United States due to these diseases are usually low in number, Culex mosquitoes are a permanent disease vector that represents a fertile ground for the establishment of new pathogens. For example, it has been estimated that since its introduction in the United States in 1999, West Nile Virus hospitalizations alone amounted on average to ˜$56M per year [6]. The mixed feeding behavior of these mosquitos on birds and mammals creates seasonal cycles of pathogen amplification in wild animals, which can lead to a periodic increase in the risk of pathogen transmission to people and to unpredictable outbreaks [7]. Lastly, the zoonotic nature of these and other viruses allows for unchecked reservoirs of viruses, mostly in wild birds, that could act as evolutionary melting pots for the development of new viral forms: the Western Equine Encephalitis is considered to be the outcome of recombination between EEEV and a Sindbis-like virus [8]. Because within this species complex, Culex quinquefasciatus (Cx. quin.) is considered the disease vector with the greatest human health impact worldwide [9], and its genetic adaptability has made it a challenging vector to keep at bay, therefore, this species were chosen for studies presented in this Example.
Genetic technologies such as engineered gene drives (GDs) based on homing endonucleases were proposed more than a decade ago [10]. Recently, CRISPR has allowed the development of these and other technologies that use genetically-engineered vectors to modify or suppress wild populations [11,12]. While different strategies have been proposed for their application, GDs were successfully applied to various organisms including yeast [13], fruit flies [14,15], Anopheles [16-18] and Aedes [19] mosquitoes, and the mouse [20].
Recently, tools for the expression of CRISPR components have been described in Aedes [19]. Evaluation of CRISPR activity in this species has been reported by few groups 15 [27-30], but no extensive CRISPR technology application has been achieved, mostly due to the fewer tools available for this species. The advent of CRISPR has caused the development of a flurry of new scientific opportunities and technologies. However, transgenesis and genome editing in mosquitoes remain cumbersome and labor-intensive. Compared to Anopheles and Aedes, Culex has been somewhat neglected regarding the development of genetic suppression strategies. For Anopheles mosquitoes, these techniques have proceeded swiftly due to the vast body of knowledge available due to the longstanding interest in this vector and their Plasmodium parasite causing malaria. Aedes and Culex mosquitoes have been somewhat lagging on CRISPR technology development [31]. Only few studies reported transgenesis in Culex [32,33], with limited follow-up use in the field.
The studies presented in this Example close this gap by jumpstarting CRISPR-based technologies in Cx. quin. and/or provide tools and knowledge for the genetic engineering of this species and develop a GD system for the population suppression of this vector. This Example provides data, molecular tools, and genome-editing know-how that would constitute the bedrock to develop field-ready products. More specifically, this Example provides the building blocks for the expression of CRISPR components, the development of the know-how for site-specific transgenesis, and a GD built for the suppression of Cx. quin. populations. The implementation of suppression GD provided herein would constitute an additional weapon in the arsenal to suppress vector populations and disease outbreaks.
GD systems promote the propagation of the drive elements along with engineered traits to fight pathogens or, alternatively, deleterious ones to suppress the population itself. The drive process involves cutting the wild-type chromosome and repairing it using the GD allele as a template (
Two development stages are described in this Example, each of which is tackling each a crucial aspect of technology development. The first development is to build and validate constructs with promoter sequences suitable to drive the expression of CRISPR components in Cx. quin. which is a necessary step for the development of any genetic technology. The second development is to develop targeted transgenesis of a ubiquitous Cas9 line to facilitate future engineering of this species and a GD construct for population suppression.
A. Develop Elements for the Expression of Cas9 and gRNA in Culex quinquefasciatus
Two laboratories have recently reported the use of CRISPR components (Cas9 protein and gRNA) injected in Cx. quin. eggs to generate heritable mutations [27,28]. To support the feasibility these methods were conducted in the laboratory Cx. quin. colony (originally isolated in the 1950s from California), and for which a high-quality genome assembly was generated. The findings from the exploratory reports were replicated and expanded by targeting several recessive pigmentation genes: white (CPIJ005542), kynurenine hydroxylase (CPIJ017147), cardinal (CPIJ005949), yellow (CPIJ018481), and ebony (CPIJ003423). This work, now available as a preprint on BiorXiv [30], demonstrates the ability to work with and perform research on Culex mosquitoes, provides with an efficient pipeline to select and test gRNAs, and further supports the feasibility of the work described in this Example.
To develop GD technologies in this species, first, promoters' sequences are identified and tested that can be used to efficiently drive the expression of CRISPR components. Four Cas9 plasmids and four gRNA ones were built and tested, and their functions in vivo were evaluated.
More specifically, two ubiquitous promoters were chosen to drive the expression of Cas9: the first is a strong viral promoter commonly used for expression in insect systems, IE1 [35], and the second is an endogenous ubiquitously-expressed gene Actin5C (CPIJ009808), which was successfully used in Drosophila [36]. IE1 was chosen because it was tested in Cx. quin. and its ability to drive GFP was validated (
For the expression of gRNAs, the use of Pol-III promoters driving U6-small-nuclear-RNAs is focused on, which have been shown to be a reliable method to drive the expression of gRNAs in multiple systems ranging from human [38], to plants [39], to insects [16,40]. Seven U6 genes were identified in the Cx. quin. genome U6-1—CPIJ039653, U6-2—CPIJ039728, U6-3—CPIJ039543, U6-4—CPIJ039801, U6-5—CPIJ040819, U6-6—CPIJ039596, U6-7—CPIJ040693), some of which were previously tested in Cx. quin. cells for activity [41]. Four of them (U6-1, -2, -3, -4) were chosen for testing because they displayed the highest transcript sequence conservation to other Diptera, making them less likely to be pseudogenes or tissue-specific splicing factors. For each promoter, a gRNA sequence targeting the white gene of Cx. quin was cloned. The the w4-gRNA was chosen because it was previously tested and displayed high activity, and for which the white-mosaic or mutant phenotypes are easily scored and do not display any viability issues [30].
For each of the eight promoters chosen to drive either Cas9 and gRNAs, their sequences are cloned in front of a Cas9 gene optimized for Anopheles [16] or the w4-gRNA which displayed the highest activity in the previous analysis targeting the white gene [30], respectively. For the Cas9 constructs are selected about ˜4 kb upstream of the gene's start codon. a fragment of the gene downstream of the stop codon, including the 3′UTR sequence and an additional ˜500 bp is also cloned, which cover all the sequences necessary to maintain endogenous regulation of the mRNA termination. For the Cas9 construct using the IE1 promoter, the SV40 poly-A sequence is used. For each of the constructs for the expression of the w4-gRNA, the gRNA sequence in between ˜700 bp of the promoter and ˜200 bp of terminator sequences are cloned; while these constructs could be smaller, additional sequences on either side to maintain some of the genomic sequences surrounding the genes may also be chosen. These constructs, outlined in
More specifically, in the first step, four injections are performed, each of them having one of the four Cas9 constructs, together with a mix of the four gRNA constructs. 500 ug/uL of a 4:1:1:1:1 ratio of each Cas9:U6-1:U6-2:U6-3:U6-4 constructs are injected into ˜600 Cx. quin. embryos, and the DNA concentration is lower if toxic. ˜60-90 surviving adults are expected for each injection based on the survival rate (10-15%) obtained in the previous analysis [30]. The surviving animals are then inspected for the presence of the expected white-eye phenotype (either fully-penetrant or in mosaic form), which indicate both Cas9 and gRNA activity. While phenotypes for the ubiquitous Cas9 promoters we are expected, it is not clear whether the germline ones would lead to visible genome edits in the soma of in the injected G0 animals. Therefore, with the surviving animals of each injection, the following analyses are performed:
The germline constructs might be expressed at very low levels, especially when present as plasmids, and could lead to failure in 1a), 1b) or 1c). For negative results in 1a) a nested PCR is performed to increase detection. For 1b) additional injection (˜200 eggs) with higher plasmid concentrations is performed. If 1c) fails, the results in 1a) and 1b) are relied on to bring the analyses forward to the second step.
Thus, altogether, the above analyses identify which of the Cas9 constructs is expressed, and their expression levels relative to each other (although ubiquitous and germline drivers are hard to compare). In addition, cutting activity detected in either 1 b) or 1c) indicate that at least one of the U6 promoters fused to the w4-gRNA can drive gRNA expression. The G0 injected adults from 1c) after mating are frozen and stored for additional replicates of the analysis in 1a) or 1 b), if needed.
The second step focuses on identifying U6 promoters with high activity. Here, a similar injection is performed, as described above, to test in each of four injections a single gRNA construct in combination with the ubiquitous-Cas9 element that provides the highest editing rate in the first step analyses discussed above. ˜400 Cx. quin. embryos are injected with 500 ug/uL (or lower, depending on the previous results) of a 1:1 ratio of the chosen Cas9:U6 construct. The surviving animals are first inspected for the presence of the expected white-eye phenotype. Even if not present in the previous analyses, a higher functional gRNA concentration injected here, may lead to at least a mosaic phenotype. With the surviving animals (˜40-60) the following analyses are performed:
Thus, this second step identifies one or more U6-Promoter constructs that can sustain efficient expression of the gRNA in Cx. quin. As for the analyses in first step, the G0 injected adults from 2b) after mating are stored to generate additional replicates of the analysis in 2a), if needed.
In summary, through the analyses of the two steps discussed above for the first development stage, expression information and functional validation of the built constructs are obtained, and three (3) candidates are identified: one ubiquitous-Cas9, one germline-Cas9, and one U6-gRNA construct. In certain cases, three additional U6 genes are kept as a backup in case the first four fail. Alternatively, other Pol-III promoters such as H1 or 7SK [41,42] are also tested.
B. Develop HDR-Based Transgenesis and GD in Culex quinquefasciatus
To develop CRISPR GD strategies, engineered transgenes need to be delivered precisely at predetermined locations in the genome, and therefore transposon-based integration would not be a viable option. Therefore, it is essential to develop targeted HDR-based delivery of transgenic elements in Culex mosquitoes. This development stage focused on testing and building the know-how for site-specific transgenesis. While doing so, animals carrying two transgenic constructs are generated and used for the work including but not limited to, 1) a ubiquitous-Cas9, and 2) a GD for the suppression of Cx. quin. populations.
The choice of the first transgene stands in the value of a ubiquitous-Cas9-expressing Cx. quin. line that could be used as a platform for targeted-mutagenesis which is higher in animals expressing Cas9 endogenously. Additionally, it could function as a tool for high-efficiency genome editing of endogenous candidates relevant to the pathogenesis of disease agents by injecting gRNAs. The transgene producing the highest editing in the analyses of the first development stage is used, and to mitigate risk, its insertion is targeted into the white locus, which is healthy as a homozygous line. The w4-gRNA is used. which provides consistent high-editing rates [30].
The second construct is a GD element for population suppression. This construct is much bigger in size, as it includes a Cas9, a gRNA, and a marker gene, potentially rendering it harder to insert in the genome. Its insertion is targeted in the doublesex (dsx) locus, which has been successfully used in Anopheles in a GD strategy [17]. Briefly, dsx is a gene with two splicing isoforms dsxM(male) and dsxF (female) that are expressed in the two sexes to determine sexual dimorphism in dipterans [43]. Previous work has used a GD to disrupt the dsxF splicing acceptor leading to the splicing forced to the dsxM acceptor and therefore generating only male progeny [17]. This strategy could be prone to the generation of small indels, at the targeted site, that are resistant to further attack by Cas9 and yet maintain the correct splicing of the dsxF isoform. Indels were not observed in this work [17], but could be problematic if released in the wild.
Differently from the previous work, one of the dsx exons common to both sexes' isoforms is targeted, and a re-coded version of the dsxM transcript is provided. Additionally, two closely-located (˜25 bp) highly-efficient gRNAs used in a Protospacer Adjacent Motif (PAM) are used—in arrangement that the previous work [21] predicts to be the best situation to lead to efficient HDR-conversion when using multiplexing (
The design of this GD improves differs from the previous work as it uses multiplexing with two gRNAs in an arrangement that would allow each one of them to fail and still lead to conversion. Additionally, resistant alleles maintaining the gene function are even less likely since the coding sequence is now targeted by two gRNAs. Even in such an unlikely situation, the GD always forces the sex determination towards the male sex. Altogether the features of this design should lead to an efficient suppression gene drive.
Generation of a Cx. quin. Cas9 line: the construct is generated as outlined in
Next, batches of ˜800 eggs with a mix containing the generated HDR template plasmid, Cas9 protein, and synthesized w4-gRNA are injected, following the protocol similar to what have been optimized in the previous work [30]. The resulting G0 injected adults are crossed in small batches of ˜5 G0 males to 20 wild-type females or ˜10 G0 females to 15 wild-type males. Following blood feeding, the G1 offspring is screened for the presence of the fluorescent marker. The blood-feeding is repeated to induce egg-production 2 or 3 consecutive times for the same G0 batch before proceeding to subsequent rounds of injections. Based on transgenesis in other mosquito species, with a rate of ˜1-2% [16,45] and the survival rate to adulthood of ˜10% [30], transformants are expected to obtain by performing ˜3 rounds of injections.
Positive transformants are single-crossed to wild-type animals of the opposite sex, and after obtainment of the following generation, and verified that the fluorescent transgene is observed in such progeny, each G1 transgenic parent is killed for molecular characterization of the insertion. PCR amplification of the junction points is first performed, employing a primer located on the genome and outside of each homology arm, and one within the transgenic sequence. Positive confirmation of the transgene junction points are followed by PCR amplification of all or sections of the transgene, to verify correct insertion of all its sequence.
Generation of a Cx. quin. gene-drive line: The first step to generate the GD construct is to validate the gRNAs to be used, and given that this strategy uses two gRNAs, two couples of gRNAs (PAM-in, and with cuts at ˜25 bp distance) are selected. The efficiency of these gRNAs are then tested by injecting each one of the selected four, in combination with Cas9 protein into ˜200 eggs. The injected eggs are let to develop for 24 h and then used to prepare genomic DNA. The DNA is then subject to deep-sequencing of a PCR amplicon covering the targeted sequence (similar to 1b) in the first development stage). Analysis of the ˜50.000 sequences obtained yields confirmation of activity and an estimate of the efficiency of the selected gRNAs, and which selected couple are used to generate the GD construct.
Two homology arms are then cloned by PCR-amplifying the sequences abutting to the cut site of either the selected gRNAs (
From this development stage, two Cx. quin. Lines are obtained: 1) a Cas9-expressing line that boosts genome editing and transgenesis, and 2) a gene-drive line that is the basis for a larger project focusing on assessing its ability to suppress laboratory populations.
Transgenesis in mosquitos has well-known low efficiencies, and the previous experience with inserting a GD construct in Anopheles stephensi yielded 2 transformants out of ˜25,000 F1 screened individuals [16]. This could result in the failure to isolate transformants in the second development stage. In the Cas9-line injection, if no transformants are observed after the first round of injection, a similar injection is first performed and the G0 injected animals are crossed to the white-line instead, to evaluate if cutting is happening at all, or at lower levels; and then the protocol is adjusted accordingly by increasing or changing concentrations of the injected mix. An alternative solution to this issue is to generate Cas9-expressing mosquito lines using traditional transgenesis which was shown to work in Cx. quin. using a Hermes transposon [33]. While HDR-based transgenesis is focused here, this could be a backup strategy for the generation of the ubiquitous-Cas9 line, as the same construct that is built could be readily cloned into a Hermes backbone. Such a line could increase the efficiency of Cas9 edits as observed and reported [46] that could be instrumental in achieving HDR-based site-directed transgenesis.
If major issues with the generation of transformants are encountered, the generation of the Cas9 line is then focused on, as this could be used to later obtain the GD insertion at higher frequencies. For the GD construct, two potential issues are anticipated: 1) altered sex-related morphologies and potential sterility, which could be an issue in recovering transformants as injected animals might be infertile or unable to mate. In such a case, the concentration of the injected mix is first lowered to lower the effect. If this does not work, only the plasmid is injected with, as the Cas9 gene on the GD construct should be expressed only in the germline and resulting in editing only in that tissue, which should avoid any pleiotropic effects that injecting with Cas9 protein might have. 2) the second point regards the GD transformants, which are males if the GD works as predicted. To keep this line the GD males need to be crossed to wild-type females at each generation. This process is also a preliminary test of the GD as the progeny of such crosses should have a skew towards males.
The functionality of the generated Cas9 line is validated. To do so, this GD line is injected with a plasmid expressing a gRNA targeting another previously described pigmentation marker, yellow (CPIJ018481), and the Cas9 activity of the generated line is evaluated by analyzing the yellow phenotype in the G0 injected animals and their G1 offspring. Such a validated reagent is a valuable tool for genome engineering of this species using CRISPR. Further, the ability of this GD line to copy on the companion chromosome and resulting in GD offspring to be >50% males is tested. And a follow up step is to analyze how this GD is able to spread into and suppress the laboratory caged populations of the wild-type strain, as well as other populations with different genetic makeup.
It should be emphasized that the above-described embodiments of the present disclosure are merely possible examples of implementations set forth for a clear understanding of the principles of the disclosure. Many variations and modifications may be made to the above-described embodiment(s) without departing substantially from the spirit and principles of the disclosure. All such modifications and variations are intended to be included herein within the scope of this disclosure and protected by the following claims.
This application claims the benefit of U.S. Provisional Application No. 63/254,753, filed Oct. 12, 2021, the entire content of which is incorporated herein by reference.
This invention was made with government support under grant number OD023098 and AI162911 awarded by the National Institutes of Health. The government has certain rights in the invention.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/US2022/077736 | 10/7/2022 | WO |