The invention relates to a method of generating a library of polynucleotide molecules encoding guide RNAs (gRNAs) from target polynucleotide(s). The invention also relates to a library of polynucleotide molecules encoding gRNAs obtainable by the aforementioned method, and a gRNA library generation kit thereof.
Genome-wide clustered regularly interspaced short palindrome repeats (CRISPR)-Cas knockout screens are used to study the relationship between genotype and phenotype by knocking-out gene expression on a genome-wide scale and studying the resulting phenotypic changes. The approach uses the CRISPR-Cas gene editing system, coupled with libraries of oligonucleotides called guide RNAs (gRNAs). The Cas nuclease is directed by the gRNA that is complementary with the target DNA strand. In the presence of a short sequence called a protospacer-adjacent motif (PAM) on the opposite DNA strand, the gRNA binds to the target strand by complementarity and guides Cas to generate site-specific double-strand breaks (DSBs) on the target DNA sequence. The resultant DSBs are then processed by DNA repair mechanisms such as non-homologous end joining (NHEJ) or homology directed repair (HDR). HDR can result in precise gene replacement whereas NHEJ is error prone in that it may induce frameshift indel mutations that can abolish target gene function.
Genome-wide CRISPR screens have revolutionized prioritization of novel drug targets in both academia and industry. However, oligonucleotide design and synthesis for whole genome CRISPR screens is resource intensive and costly. Consequently, most human genome-wide CRISPR screen studies use universal libraries generated by a handful of labs around the world and are restricted to protein coding regions of the reference genome. The rapidly growing field of genome-wide association studies (GWAS) suggests that the majority of single nucleotide polymorphisms (SNPs) linked to disease fall within the non-coding genome; however, tools to systematically screen the non-coding genome for functionality are limited.
There is therefore a need for methods which can generate patient and cell type specific gRNA libraries targeting functional coding and non-coding genome in a high-throughput and cost-effective manner.
According to a first aspect of the invention, there is provided a method of generating a library of polynucleotide molecules encoding guide RNAs (gRNAs) from target polynucleotide(s) comprising incubation of the target polynucleotide(s) with insertional enzyme complexes, wherein each of said insertional enzyme complexes comprises (i) an insertional enzyme and (ii) one or more tagmentation adapters to generate a plurality of tagged cleavage fragments.
According to a further aspect of the invention there is provided a library of polynucleotide molecules encoding guide RNAs (gRNAs) obtainable by the methods as disclosed herein.
According to a further aspect of the invention there is provided a guide RNA (gRNA) library generation kit which comprises:
Disclosed herein is customized ATAC based CRISPR (cus-ATAC-CRISPR), which is a method of using insertional enzymes to generate patient and cell-type specific gRNA libraries in a high-throughput and cost-effective manner. Both promoters and enhancers of transcriptionally active genes display increased sensitivity to nuclease digestion and are located to “open” chromatin domains. Assay for Transposase Accessible Chromatin using sequencing (ATAC-seq) is a method of probing open chromatin based on the ability of transposases such as Tn5 transposase and MuA transposase to fragment DNA (described in US20160060691A1). Therefore, by using insertional enzymes such as Tn5 transposase to generate a gRNA library from genomic DNA, the gRNAs should target all functionally “active” regions of the genome, including non-coding regions, and will carry the unique genetic variants (such as single nucleotide polymorphisms (SNPs), short insertions/deletions, and structural variants) from the individual cells and patients. In addition, the reactions can be carried out using any transposase accessible DNA as a substrate at a comparatively lower cost. Importantly, cus-ATAC-CRISPR does not require prior knowledge of sequence or ATAC profiles, could easily be applied to less well sequenced species or novel contexts, and will target all regions of open chromatin, including the non-coding genome.
Thus, according to a first aspect of the invention, there is provided a method of generating a library of polynucleotide molecules encoding guide RNAs (gRNAs) from target polynucleotide(s) comprising incubation of the target polynucleotide(s) with insertional enzyme complexes, wherein each of said insertional enzyme complexes comprises (i) an insertional enzyme and (ii) one or more tagmentation adapters to generate a plurality of tagged cleavage fragments.
Reference herein to “guide RNAs” or “gRNAs” is intended to refer to any RNA molecule that recognises a target polynucleotide region of interest and directs a polynucleotide-targeting enzyme to that region. In some embodiments, the polynucleotide-targeting enzyme is a DNA-targeting enzyme. In some embodiments the DNA-targeting enzyme is a Cas endonuclease. In some embodiments the Cas endonuclease is Cas9 endonuclease. In another embodiment, the Cas endonuclease is Cas12a or a variant thereof, such as AsCas12a (from Acidaminococcus sp. BV3L6), LbCas12a (from Lachnospiraceae bacterium ND 2006), CeCas12a (from Coprococcus eutactus) or FnCas12a. In a still further embodiment, the DNA-targeting enzyme is a Cas variant, such as xCas9, SpCas9-NG, SpG or SpRY. In an alternative embodiment, the DNA-targeting enzyme is RNA guided DNA endonuclease. In some embodiments, the gRNA is a single guide RNA (sgRNA). The sgRNA is an RNA molecule consisting of two parts, a ‘constant region’ and a ‘variable region’. The constant region is the tracer RNA (tracrRNA), also referred to as the scaffold sequence, and is the sequence that binds to the Cas endonuclease. The variable region of the sgRNA is the CRISPR RNA (crRNA) which contains a sequence specific to the region of interest. In further embodiments, the sgRNA is a Cas9 sgRNA. For other Cas enzymes such as Cas12a the gRNA includes just the crRNA and not the tracrRNA.
The crRNA sequence is typically about 20 nucleotides in length. In some embodiments, the crRNA sequence is 17 nucleotides in length. In some embodiments, the crRNA sequence is 18 nucleotides in length. In some embodiments, the crRNA sequence is 19 nucleotides in length. In some embodiments, the crRNA sequence is 20 nucleotides in length. In some embodiments, the crRNA sequence is 21 nucleotides in length. In some embodiments the crRNA sequence is 22 nucleotides in length. In some embodiments, the crRNA sequence is 23 nucleotides in length. In some embodiments, the crRNA sequence is 24 nucleotides in length. In some embodiments, the crRNA sequence is 25 nucleotides in length. Thus, in some embodiments the sgRNA sequence is 17 nucleotides in length. In some embodiments, the sgRNA sequence is 18 nucleotides in length. In some embodiments, the sgRNA sequence is 19 nucleotides in length. In some embodiments, the sgRNA sequence is 20 nucleotides in length. In some embodiments, the sgRNA sequence is 21 nucleotides in length. In some embodiments the sgRNA sequence is 22 nucleotides in length. In some embodiments, the sgRNA sequence is 23 nucleotides in length. In some embodiments, the sgRNA sequence is 24 nucleotides in length. In some embodiments, the sgRNA sequence is 25 nucleotides in length.
In some embodiments, the crRNA sequence is at least 17 nucleotides in length. In some embodiments, the crRNA sequence is at least 18 nucleotides in length. In some embodiments, the crRNA sequence is at least 19 nucleotides in length. In some embodiments, the crRNA sequence is at least 20 nucleotides in length. In some embodiments, the crRNA sequence is at least 21 nucleotides in length. In some embodiments the crRNA sequence is at least 22 nucleotides in length. In some embodiments, the crRNA sequence is at least 23 nucleotides in length. In some embodiments, the crRNA sequence is at least 24 nucleotides in length. In some embodiments, the crRNA sequence is at least 25 nucleotides in length. Thus, in some embodiments the sgRNA sequence is at least 17 nucleotides in length. In some embodiments, the sgRNA sequence is at least 18 nucleotides in length. In some embodiments, the sgRNA sequence is at least 19 nucleotides in length. In some embodiments, the sgRNA sequence is at least 20 nucleotides in length. In some embodiments, the sgRNA sequence is at least 21 nucleotides in length. In some embodiments the sgRNA sequence is at least 22 nucleotides in length. In some embodiments, the sgRNA sequence is at least 23 nucleotides in length. In some embodiments, the sgRNA sequence is at least 24 nucleotides in length. In some embodiments, the sgRNA sequence is at least 25 nucleotides in length.
In some embodiments, Cas9 comprises an inactive or catalytically dead Cas9 variant (also known as dCas9). This variant is known to inhibit transcription by blocking either initiation or elongation by the RNA polymerase complex. Furthermore, dCas9 activator can be used to increase transcription by recruiting transcription factor or modify chromatin modification complex. Reference herein to “target polynucleotide(s)” is intended to refer to any polynucleotide or collection of polynucleotides (e.g., a chromosome, a collection of chromosomes, reverse transcribed RNA or RNA-DNA duplex, PCR-amplified DNA, targeted capturing sequences etc.) from which gRNAs are to be generated.
Examples of the types of suitable target polynucleotides include but are not limited to: chromosomal DNA (e.g., a chromosome, a genome, a collection of chromosomes), viral DNA, unknown DNA collected from any source (e.g., collected from an environmental source), DNA from an organelle (mitochondrial DNA, nuclear DNA, chloroplast DNA, and the like). Examples of suitable cellular sources for the target polynucleotide(s) include, but are not limited to: a eukaryotic cell; a prokaryotic cell, (e.g., a bacterial cell or an archaeal cell), a cell of a single-cell eukaryotic organism; a plant cell (e.g., rice, soy, maize, corn, wheat, tomato, tobacco, fruit tree, etc.); an algal cell (e.g., Botryococcus braunii, Chlamydomonas reinhardtii, Nannochloropsis gaditana, Chlorella pyrenoidosa, Sargassum patens C. Agardh, and the like); a fungal cell (e.g., a yeast cell); an animal cell; a cell from an invertebrate animal (e.g. fruit fly, cnidarian, echinoderm, nematode, planarian, etc.); a cell from a vertebrate animal (e.g., fish, e.g., zebrafish, amphibian, e.g, frog, reptile, bird, e.g., chicken, mammal, and the like); a cell from a mammal (e.g., zoo animal, pet, canine, equine, porcine, rodent, primate, human, etc.); and the like. The cellular source may be a single cellular source or a pooled sample of multiple cellular sources. The target polynucleotide may also be prepared from RNA. For example, complementary DNA (cDNA) may be synthesized from a single-stranded RNA template in a reaction catalysed by the reverse transcriptase enzyme. The reverse transcription of RNA into cDNA may also be combined with the amplification of specific DNA targets using reverse transcription polymerase chain reaction (RT-PCR) and insertional enzymatic reaction using dsDNA or DNA-RNA duplex. In order to generate a plurality of tagged cleavage fragments, the target polynucleotides must be accessible to the insertional enzyme complexes. For example, wherein the target polynucleotide is chromosomal DNA, target polynucleotides are accessible when chromatin is open such as in regions of the genome which are undergoing active transcription. Accessibility is similarly required wherein the target polynucleotide is viral DNA, organelle-derived DNA or DNA from another source which may be packaged.
Reference herein to a “PAM sequence” or “protospacer adjacent motif sequence” is intended to refer to a 2-8 base pair DNA sequence which is recognized by a genome editing enzyme such as an RNA-guided DNA endonuclease (e.g., a Cas9, Mad7, or Cpf1 endonuclease) to promote cleavage of the target site by the endonuclease. For example, the Cas9 endonuclease from Streptococcus pyogenes recognizes the PAM sequence 5′-NGG-3′ (where “N” can be any nucleotide base). In other examples, SpG has been shown to recognize the PAM sequence 5′-NGN-3′, and SpRY can target almost all PAMs in the genome (with 5′-NRN-3′ being preferable to 5′-NYN-3′, where R is A or G and Y is C or T; Liang et al. (2022) Nat Comm., 13 (3421), doi: https://doi.org/10.1038/s41467-022-31034-8). xCas9 displays one of the broadest PAM compatibility profiles, recognizing NG, NNG, GAT, CAA NG (A/G/T), (G/C) AG and (G/C/T) GCC PAMs (Hu et al. (2018) Nature, doi: 10.1038/nature26155), while the Streptococcus pyogenes derived SpCas9-NG is more limited and recognises 5′-NGN-3′ (e.g., N (A/G), NTG, GT (A/C/T), (A/G/C) CG (A/G/T) and TCG (A/G/T) G; Fujii et al. (2019) Sci. Rep., 9 (12878), doi: https://doi.org/10.1038/s41598-019-49394-5 and Kim et al. (2020) Nat. Bio. Engineering, 4:111-124, doi: https://doi.org/10.1038/s41551-019-0505-1). Cas12a and variants thereof recognize thymine-rich PAM sequences at the 5′ end of the protospacer with TTTV (where V is A, G or C) being the optimal PAM. AsCas 12a and LbCas 12a also recognize C-containing PAMs, such as CTTV, TCTV and TTCV, while CeCas 12a is more specific for the TTTV PAM (Chen et al. (2020), Genome Biol., 21,78, doi: https://doi.org/10.1186/s13059-020-01989-2). Thus, in certain embodiments the PAM sequence herein or any part thereof is any PAM sequence suitable for the RNA-guided DNA endonuclease to be used together with the gRNA libraries generated by the present method.
Reference herein to an “insertional enzyme complex” is intended to refer to a synaptic complex of an insertional enzyme and one or more tagmentation adapter polynucleotides. The insertional enzyme mediates the fragmentation of the target polynucleotide(s) to generate a plurality of cleavage fragments, after which the insertional enzyme ligates the tagmentation adapter polynucleotides at both ends of the cleavage fragment to generate a tagged cleavage fragment. The term “tagged cleavage fragment” as used herein, refers to adapter-attached polynucleotides wherein each polynucleotide is flanked by one or more tagmentation adapters. Such a system, commonly referred to as ‘tagmentation’, is described in various publications such as Adey et al (2010) Genome Biology 11: R119; Goryshin and Reznikoff (1998) The Journal of Biological Chemistry 273:7367-7374; Picelli et al (2014) Genome Research 24:2033-2040; and Caruccio (2011) Methods in Molecular Biology 733:241-255.
Reference herein to an “insertional enzyme” is intended to refer to any enzyme capable of inserting a nucleic acid sequence into a polynucleotide. In some cases, the insertional enzyme can insert the nucleic acid sequence into the polynucleotide in a sequence-independent manner. The insertional enzyme can be prokaryotic or eukaryotic. Examples of insertional enzymes include, but are not limited to, transposases, HERMES, and HIV integrase.
In some embodiments, the insertional enzyme is a transposase. Reference herein to a “transposase” is intended to refer to any enzyme with transposase activity in vitro and/or in vivo. In some embodiments, the transposase is Tn5 transposase. Other examples of appropriate transposases include, but are not limited to, Mos1, Sleeping Beauty, piggyBac, Hsmar1 and ISY100 transposases.
In further embodiments, the Tn5 transposase may be a variant Tn5 transposase which comprises one or more sequence variations compared to wild-type Tn5 transposase. In further embodiments, the variant Tn5 transposase is a hyperactive Tn5 transposase. Hyperactive Tn5 transposases comprise one or more mutations compared to wild-type Tn5 transposase which results in enzyme hyperactivity where the activity of a modified enzyme variant is considerably higher than the activity of the wild-type enzyme. Examples of mutations resulting in a hyperactive Tn5 transposase include, but are not limited to, L372P (i.e., the replacement of leucine at amino acid position 372 with proline), E54K, E110K, E345K, P242A and P242G (summarised in Reznikoff (2003) Molecular Microbiology, 47 (5): 1199-1206).
In some embodiments, the transposase may be a transposase fusion protein. Reference herein to a “fusion protein” is intended to refer to a protein consisting of at least two domains that are encoded by separate genes that have been joined so that they are transcribed and translated as a single unit, producing a single polypeptide. For example, the Cleavage Under Targets and Tagmentation (CUT&Tag) method described by Hatice et al (2019) Nature Communications 10 (1930) utilises a hyperactive Tn5 transposase—Protein A (pA-Tn5) fusion protein. In CUT&Tag, a chromatin protein is bound in situ by a target-specific antibody, which then tethers a protein A-Tn5 transposase fusion protein to ensure that the transposase only cuts the DNA at close proximity to the target chromatin protein. Other examples of appropriate transposase fusion proteins include the fusion of transposases to transcription activator-like effectors (TALE) proteins, Gal4, ZFP, Cas9 or catalytically inactive Cas9 (dCas9) (summarised in Bhatt and Chalmers (2019) Nucleic Acids Research 47 (15): 8126-8135).
Reference herein to a “tagmentation adapter” is intended to refer to any DNA oligonucleotide that comprises the nucleotide sequences required to form a functional insertional enzyme complex. For example, efficient transposition with Tn5 transposase requires that each adapter polynucleotide has a specific 19-bp transposase recognition sequence (Mosaic End or ME sequence) at each of its ends. The tagmentation adapter polynucleotide can further comprise additional sequences (e.g., restriction sites or primer sequences) as needed or desired.
In some embodiments, the tagmentation adapter additionally comprises polymerase chain reaction (PCR) handles. PCR handles are nucleotide sequences to which PCR primers bind during PCR amplification. Examples of suitable PCR handles include, but are not limited to, Nextera i7/i5 Adapters. In further embodiments, the methods disclosed herein additionally comprise PCR amplification of the tagged cleavage fragments. For any downstream ligation reactions to occur, at least one of the DNA ends to be ligated must contain a 5′ phosphate. Therefore, in some embodiments, the tagmentation adapter is phosphorylated. In some embodiments, the tagmentation adapter may comprise amino group modifications, for example, amino modifier addition, to prevent self-ligation of adapters.
In one embodiment, the method of the invention comprises the following steps:
References herein to the term “ligation”, “ligate”, or “ligating” refers to any linkage of two nucleic acid sequences, usually comprising a phosphodiester bond. The linkage is normally facilitated by the presence of a catalytic enzyme (i.e., for example, a ligase such as T4 DNA ligase) in the presence of co-factor reagents and an energy source (i.e., for example, adenosine triphosphate (ATP)). Ligase enzymes are also available from commercial sources, such as Instant Sticky-end Ligase Master Mix and Blunt TA/Ligase Master Mix (both available from New England Biolabs).
Reference herein to a “ligation adapter” is intended to refer to any oligonucleotide suitable for ligation to another polynucleotide sequence. The ligation adapter can further comprise additional sequences (e.g., restriction sites or primer sequences) as needed or desired.
In some embodiments, the ligation adapter additionally comprises polymerase chain reaction (PCR) handles. PCR handles are nucleotide sequences to which PCR primers bind during PCR amplification. Examples of suitable PCR handles include, but are not limited to, TruSeq or Nextera i7/i5 Adapters. For any downstream ligation reactions to occur, at least one of the DNA ends to be ligated must contain a 5′ phosphate. Therefore, in some embodiments, the ligation adapter is phosphorylated. In some embodiments, the ligation adapter may comprise amino group modifications, for example, amino modifier addition, to prevent self-ligation of adapters.
Step (b) comprises amplification of the product of step (a). Reference herein to ‘amplification’ is intended to refer to any method of increasing the number of copies of an oligonucleotide sequence. Amplification is a commonly performed procedure and the skilled person would recognise that a range of techniques would be suitable to perform this step. Examples of suitable techniques include, but are not limited to, polymerase chain reaction (PCR), loop mediated isothermal amplification (LAMP), nucleic acid sequence based amplification (NASBA), self-sustained sequence replication (3SR), and rolling circle amplification (RCA).
In some embodiments, step (d) additionally comprises amplification of the adapter-attached polynucleotide fragments. In some embodiments, step (f) additionally comprises amplification of the adapter-attached polynucleotide fragments. In some embodiments, step (i) additionally comprises amplification of the adapter-attached gRNAs.
The first ligation adaptor comprises a label. The term “label” refers to a specific moiety having a unique affinity for a ligand (i.e. an affinity tag). Such a label may include, but is not limited to, a biotin label, a histidine label (i.e. 6His), or a FLAG label. In one embodiment, the label is biotin.
In some embodiments, step (a), (b), (c), (d), (e), (f), (g), (h) and/or step (i) additionally comprise a purification step. The purification step is useful to remove unwanted sequences after amplification, restriction digestion and to remove any unligated adapter sequences remaining after ligation. DNA purification is a commonly performed procedure and the skilled person would recognise that a range of techniques would be suitable to perform this step. Examples of suitable techniques include, but are not limited to, the use of magnetic beads or absorption onto a solid matrix (e.g., commercially available DNA purification columns).
In some embodiments the first restriction site of step (a) is an Mmel restriction site. In further embodiments, the restriction enzyme of step (c) which recognises the first restriction site and cleaves the adapter-attached polynucleotide fragment at a site downstream so as to remove the tagmentation adapters from the polynucleotide fragment is Mmel. Mmel is a Type IIS restriction enzyme which recognises the sequence TCCRAC, wherein R is A or G, and makes a 2 bp staggered cut 20 bases downstream. When the Mmel recognition site is placed at the correct distance (i.e., around 20 base pairs) from the junction between the tagmentation adaptor and the polynucleotide fragment, this results in the Mmel restriction enzyme cleaving the adapter-attached polynucleotide fragment at the junction between the tagmentation adaptor and the polynucleotide fragment, therefore removing the tagmentation adapters from the polynucleotide fragment. The skilled person would recognise that other restriction enzymes may be used to achieve the same result, as long as the first restriction site is placed within the tagmentation adaptor at the appropriate distance away from the junction between the tagmentation adaptor and the polynucleotide fragment. For example, NmeAlll recognises the sequence GCCGAG and makes a 2 bp staggered cut 21 bases downstream, therefore the NmeAlll restriction site would be placed around 21 base pairs away from the junction between the tagmentation adaptor and the polynucleotide fragment. Examples of other appropriate restriction enzymes include, but are not limited to, the Mmel family of restriction enzymes (e.g., ApyPI, Aqull, Aqulll, AquIV, Bsbl, Cdpl, CstMI, DraRI, DrdIV, Maql, Mmel, NhaXI, NIaCI, NmeAlll, PlaDI, PspOMII, PspPRI, Reel, RpaB5I, SdeAI, and SpoDI), EcoPI5I and NmeAll.
In some embodiments, the second restriction site of step (d) is an EcoP15I restriction site. In further embodiments, the restriction enzyme of step (e) which recognises the second restriction site and cleaves the adapter-attached polynucleotide fragments within the polynucleotide region to generate a plurality of adapter-attached polynucleotide fragments comprising a first ligation adapter, a crRNA sequence and either part of or complete PAM sequence is EcoP15I. EcoP15I recognises the sequence CAGCAG and makes a 2 bp staggered cut 25 bases downstream. Therefore, when the EcoP15I restriction site is placed at the appropriate position in the first ligation adapter (i.e., about 4 base pairs from the junction between the ligation adaptor and the polynucleotide fragment), this results in the EcoP15I restriction enzyme cleaving the adapter-attached polynucleotide fragment within the polynucleotide region to generate a plurality of adapter-attached polynucleotide fragments comprising a first ligation adapter, crRNA sequence and part of PAM sequence which is about 21 nucleotides long. The skilled person would recognise that other restriction enzymes may be used to achieve the same result, as long as the second restriction site is located within the first ligation adaptor at the appropriate distance away from the junction between the first ligation adaptor and the polynucleotide fragment. Examples of other appropriate restriction enzymes include, but are not limited to, the Mmel family of restriction enzymes (e.g., ApyPI, Aqull, Aqulll, AquIV, Bsbl, Cdpl, CstMI, DraRI, DrdIV, Maql, Mmel, NhaXI, NIaCI, NmeAlll, PlaDI, PspOMII, PspPRI, Reel, RpaB5I, SdeAI, and SpoDI), and NmeAll.
In some embodiments, the third restriction site of step (d) is an Acul restriction site. In further embodiments, the restriction enzyme of step (h) which recognises the third restriction site and cleaves the adapter-attached polynucleotide fragment at a site downstream so as to remove the first ligation adaptor is Acul. Acul recognises the sequence CTGAAG and makes a 2 bp staggered cut 16 bases downstream. Therefore, when the Acul restriction site is placed at the appropriate position in the first ligation adapter (i.e., about 15 base pairs from the junction between the ligation adaptor and the polynucleotide fragment), this results in the Acul restriction enzyme cleaving the adapter-attached polynucleotide fragment at the junction between the first ligation adaptor and the polynucleotide fragment, therefore removing the first ligation adapter from the polynucleotide fragment. The skilled person would recognise that other restriction enzymes may be used to achieve the same result, as long as the third restriction site is located within the first ligation adaptor at the appropriate distance away from the junction between the first ligation adaptor and the polynucleotide fragment.
In some embodiments, the fourth restriction site of step (f) is an Ecil restriction site. In further embodiments, the restriction enzyme of step (g) which recognises the fourth restriction site and cleaves the adapter-attached polynucleotide fragment at a site downstream so as to remove the second ligation adaptor and the PAM sequence is Ecil. Ecil recognises the sequence GGCGGA and makes a 2 bp staggered cut 11 bases downstream. Therefore, when the Ecil restriction site is placed at the appropriate position in the second ligation adapter (i.e., about 6 base pairs from the junction between the second ligation adaptor and the polynucleotide fragment), this results in the Ecil restriction enzyme cleaving the adapter-attached polynucleotide fragment at the junction between the second ligation adaptor and the polynucleotide fragment, therefore removing the second ligation adapter and the remaining PAM sequence from the polynucleotide fragment. The skilled person would recognise that other restriction enzymes may be used to achieve the same result, as long as the fourth restriction site is located within the second ligation adaptor at the appropriate distance away from the junction between the second ligation adaptor and the polynucleotide fragment.
Step (i) comprises ligation of the digested product of step (h) to a plurality of third and fourth ligation adapters which comprise vector cloning sequences, to generate a plurality of adapter-attached gRNAs. Reference herein to ‘vector cloning sequences’ is intended to refer to any sequence which allows for the cloning of an oligonucleotide into a vector. There are various techniques available for cloning into vectors, which include, but are not limited to, restriction enzyme cloning, Gateway® recombination cloning, TOPO® cloning, Gibson Assembly (Isothermal Assembly Reaction), Type IIS Assembly (e.g., Golden Gate & MoClo), and ligation independent cloning (LIC). The skilled person would understand which vector cloning sequences are appropriate depending on which technique is to be used. The third and fourth ligation adapters can further comprise additional sequences as needed or desired. For example, the adapters may include additional restriction enzyme sites, all or part of a tracrRNA sequence, all or part of a promoter sequence for expression of gRNAs from the vector, and/or sequences intended to improve the expression of gRNAs from the vector.
A representative, and non-limiting, method of generating a library of polynucleotide molecules encoding guide RNAs (gRNAs) is illustrated by the flowchart in
According to a further aspect of the invention there is provided, a library of polynucleotide molecules encoding guide RNAs (gRNAs) obtainable by the methods as disclosed herein.
According to a further aspect of the invention there is provided a guide RNA (gRNA) library generation kit which comprises:
In one embodiment, the first restriction site is an Mmel restriction site.
In one embodiment, the restriction enzyme which recognises the first restriction site is Mmel.
In one embodiment, the second restriction site is an EcoP15I restriction site.
In one embodiment, the restriction enzyme which recognises the second restriction site is EcoP15I.
In one embodiment, the third restriction site is an Acul restriction site.
In one embodiment, the restriction enzyme which recognises the third restriction site is Acul.
In one embodiment, the fourth restriction site is an Ecil restriction site.
In one embodiment, the restriction enzyme which recognises the fourth restriction site is Ecil.
In one embodiment, the PAM sequence is 5′-NGG-3′, where N can be any nucleotide base. In another embodiment, the PAM sequence is 5′-NGN-3′. In a further embodiment, the PAM sequence is 5′-NRN-3′ or 5′-NYN-3′, where R is A or G and Y is C or T. In a yet further embodiment, the PAM sequence is NG, NNG, GAT or CAA. In a still further embodiment, the PAM sequence is 5′-NGN-3′. As hereinbefore, according to these embodiments N can be any nucleotide base.
In one embodiment, the transposase is Tn5 transposase.
The following studies and protocols illustrate embodiments of the methods described herein and their suitability for use:
C3013 cells (NEB) were transformed with pTBX1-Tn5 plasmid (Addgene #60240; Picelli et al. (2014) Genome Research, 24 (12), 2033-2040) and plated in LB-Agar plates supplemented with ampicillin. A single colony was picked and cultured in 5 mL LB media containing ampicillin at 37° C. until the OD600 reached 0.7-0.9. Cells were chilled at 10° C. and 250 μl of 1 M IPTG was added. Cells were grown for 4 h at 23° C. Cells were pelleted at 6500 rpm (JA25.50 rotor) for 20 min. The supernatant was removed and the cells were frozen at −70° C.
All primers were ordered from IDT or Sigma.
The oligos TAG-Adapt1-i7-FW, TAG-Adapt1-15-FW and MEDS-REV (Table 1) were dissolved in TE buffer at a concentration of 10 nmoles/μl.
250 nmoles of TAG-Adapt1-i7-FW and MEDS-REV were mixed in a tube and heated to 95° C. for 10 min. The tube was then allowed to cool down to room temp for 1 h.
Similarly, 250 nmoles of TAG-Adapt1-i5-FW and MEDS-REV were mixed and heated to 95° C. for 10 min and subsequently allowed to cool for 1 h at room temperature.
Frozen C3013 pellet expressing Tn5 (as per Example 1) was thawed and resuspended in 7 mL of cold HEGX buffer (20 mM HEPES-KOH (PH 7.2), 0.8 M NaCl, 1 mM EDTA, 10% glycerol and 0.2% Triton X-100) containing complete protease inhibitor cocktail (PIC). The cells were sonicated on 80% power with Bioruptor sonicator for eight times with 30 s on and 30 s off intervals on ice. The lysate was centrifuged at 11000 rpm for 30 min at 4° C. The supernatant was then transferred to a new beaker and 2.1 mL of 10% neutralised polyethyleneimine (PEI) was added in a dropwise manner with regular stirring to precipitate DNA. The solution was then centrifuged at 11000 rpm for 20 min at 4° C. The supernatant was transferred to a new tube and 100 μl of this sample was set aside as a cell lysate control (
In order to prepare the chitin magnetic resin (NEB), 2 mL of the chitin resin was washed with 10 mL HEGX buffer twice. The clarified supernatant (7 mL) from before was then added to the chitin resin. The tube was rotated for 30 min at 4° C. and subsequently placed on a magnetic column to remove the supernatant. 100 μl of the supernatant was kept as a control (
The chitin beads were then loaded with 250 nmoles of TAG-Adapt1-i7-FW/MEDS-REV and TAG-Adapt1-i5-FW/MEDS-REV oligos (as per Example 3) in 3 mL HEGX buffer containing PIC. The beads were rotated overnight at room temperature. The next day, chitin beads were washed 4 times with 10 mL HEGX buffer. The chitin beads were then resuspended in 5 mL HEGX buffer containing 50 mM 1,4-Dithiothreitol (DTT). The beads were incubated at 4° C. for 48 h. Subsequently, the beads were placed in a magnetic column and the supernatant containing purified Tn5 was collected. The beads were then washed three times in 1 mL HEGX buffer containing 50 mM DTT and the remaining Tn5 in the eluate was collected. All the Tn5 samples were run on 10% SDS-PAGE gel and Coomassie staining was used to detect the presence of 55 kD Tn5 band (
All Tn5 transposase samples were pooled and concentrated to 0.5 mL using 10 kD MW Vivaspin concentrator column by spinning at 4500×g. The concentrated Tn5 sample was dialysed overnight at 4° C. in 2 L of 2× Tn5 dialysis buffer (100 mM HEPES-KOH (pH 7.2), 0.2 M NaCl, 0.2 mM EDTA, 0.2% Triton X-100, 20% glycerol and 2 mM DTT) using Spectra/por 6-8 kD dialysis membrane. The dialysed sample was collected and the concentration was measured using Bradford assay. The sample was diluted to 24 μM using 2× Tn5 dialysis buffer. An equal amount of 100% glycerol was added to the Tn5 and 25μl aliquots were stored at −70° C.
The oligos MEDS-A, MEDS-B, MEDS-REV, TAG-Adapt1-i5-FW and TAG-Adapt1-i7-FW (Table 1) were dissolved in T4 DNA ligase buffer at 100 μM. 3 μL each of the following combinations of MEDS-A/MEDS-REV, MEDS-B/MEDS-REV, TAG-Adapt1-i7-FW/MEDS-REV and TAG-Adapt1-i5-FW/MEDS-REV were placed in PCR tube. The samples were heated to 95° C. for 10 min and then placed on a bench for 1 h to cool down to room temperature.
Commercially available Tn5 (7 μL) was loaded with either 1 μL of MEDS-A/MEDS-REV and MEDS-B/MEDS-REV (referred hereafter as standard Nextera adapters) or TAG-Adapt1-i7-FW/MEDS-REV and TAG-Adapt1-i5-FW/MEDS-REV (referred hereafter as custom adapters) for 1 h at room temperature to have the final transposome.
Assembled transposomes either in solution (as per Example 5) or on magnetic beads (as per Example 4) were used for tagmenting genomic DNA (400 ng) in 5× TAPS-DMF buffer (50 mM TAPS-NaoH, 25 mM MgCl2 and 50% DMF) in 20 μL reaction. The tagmentation reaction was carried out at 55° C. for 30 min. After the tagmentation, proteinase K (0.5 μL) was added to the reaction for 7 min at 55° C. The tagmentation reaction (5 μL) was checked on 2% agarose gel (
The purified products were then PCR amplified using the conditions in Table 2. The PCR primers used were either Nextera i501/i701, Nextera i502/i702 or Nextera i503/i703 (Table 1). KAPA HiFi HotStart ReadyMix was used for the PCR. The PCR products were purified using ChargeSwitch PCR clean-up beads.
The forward and reverse oligos were dissolved in TE buffer at final concentration of 100 μM. 10 μL of both oligos were mixed in a PCR tube and 2.2 μL of T4 DNA ligase buffer (NEB) was added. The oligos were heated to 95° C. for 10 min in PCR machine and then allowed to cool down to room temperature for 1 h. Subsequently, the oligos were diluted to 10 μM by addition of TE buffer. The adapter names and the oligos used to prepare them are listed Table 3, oligo and adapter sequences can be found in Table 1.
A defined template of known sequence was used to test enzymatic reactions. The sequence of the template is as follows:
Two PCR primers TAG-NextA-PCR-FP and TAG-NextB-PCR-RP (Table 1) were used to add a Mmel restriction enzyme site and Nextera A/Nextera B PCR amplification sites. KAPA HiFi HotStart ReadyMix was used for PCR using the conditions in Table 4.
The PCR products were purified using ChargeSwitch PCR clean-up beads.
2.5 μg of the defined template containing Mmel restriction sites (as per Example 8) or PCR amplified tagmented gDNA (from Example 6) were digested with Mmel in a 50 μL reaction at 37° C. for 1 h. The restriction digestion reaction was then purified using ZYMO DNA Clean & Concentrator kit. Mmel digested product (240 ng) was ligated with 3 μl each of biotinylated Adapter1-1 and Adapter1-2 (see Example 7) using Blunt TA/ligase mastermix (NEB) for 20 min at 25° C. The ligation products were purified using ZYMO DNA Clean & Concentrator kit.
The Adapter 1 ligation products (from Example 9) were then PCR amplified using Chk_Bio_A1_Rn1_FP and Chk_Bio_A1_Rn1_RP primers. The PCR conditions in Table 5 and Table 6 were used with KAPA HiFi HotStart polymerase.
ChargeSwitch PCR clean-up beads were used for purification of PCR products.
The ligation products from Example 9 or PCR amplified ligation products from Example 10 were then amplified using A1-Rn1-PCR1-i5 and A1-Rn1-PCR1-i7 primers to add PCR handles for Illumina sequencing. The PCR1 conditions in Table 7 were used with KAPA HiFi HotStart polymerase.
These PCR1 products were purified using ChargeSwitch PCR clean-up beads.
The purified PCR1 products (30 ng) were used for different PCR cycles (5, 8, 10 and 12 cycles) to add P5/P7 Illumina sequencing sites and multiplexing indexes using KAPA HiFi HotStart polymerase. NEBNext i501 primer and NEBNext i701 primer were used for amplification of gDNA. NEBNext i504 Primer and NEBNext i701 Primer were used for amplification of defined template. The PCR2 conditions used were as shown in Table 8.
The PCR2 products were checked on 2% agarose gel for presence of the correct band (
Binding and Washing (B&W) buffer—2× (10 mM Tris-HCl (pH 7.5), 1 mM EDTA and 2 M NaCl) was diluted to 1× with nuclease free water. Dynabeads M-270 Streptavidin beads (75 μL/sample) were washed with 1× B&W buffer (1mL) three times. Dynabeads were then dissolved in 100 μL 2× B&W buffer. Biotinylated Adapter 1 ligation products (450 ng) from Example 9 or Example 10 for gDNA and defined template were added to the dynabeads for a total volume of 200 μL. The samples were then incubated for 20 min at room temperature with shaking. The samples were then placed in DynaMag-PCR Magnet and washed thrice with 1× B&W buffer. After the final wash, the beads were resuspended in 36 μL water.
EcoP15I digestion was carried out directly on beads in Cutsmart buffer (NEB) supplemented with 100 μM sinefungin in a 50 μL reaction at 37° C. for 1 h. After the digestion, the beads were washed twice with 1× B&W buffer (containing 0.01% Tween-20) and three times with 1× B&W buffer. After washings, the beads were dissolved in 30 μL water.
The beads were then used for ligation with 6 μl of Adapter2 (10 μM) that contained TruSeqA/B PCR handles using Blunt TA/Ligase Master Mix. The ligation was carried out at 25° C. for 20 min. After the ligation, the beads were washed three times with 1× B&W buffer & subsequently dissolved in 40 μL water.
The EcoP15I digestion products ligated with Adapter 2 (5 μL beads) were then used for PCR amplification with A1-Rn1-PCR1_i5, A1-Rn1-Fw2_PCR1_i5 and A2-Rn1-Rw1_PCR1_i7 primers for 12 cycles with KAPA HiFi HotStart polymerase. The PCR1 parameters are shown in Table 9.
The PCR1 products were then purified using ChargeSwitch PCR clean-up beads. These purified PCR 1 products (30 ng) were used for the addition of multiplexing indexes and different PCR cycles were tested (5, 8, 10 or 12 cycles) with KAPA HiFi HotStart polymerase. For gDNA, NEBNext i501 primer and NEBNext i703 primer were used while for defined template NEBNext i504 primer and NEBNext i703 primer were used. The PCR2 parameters are mentioned in Table 10. ChargeSwitch PCR clean-up beads were used for PCR2 purification.
The PCR2 products were run on 2% agarose gel. The expected band of 204 bp was observed for Adapter 2 ligated products for both gDNA and defined template (FIG. 3B).
The magnetic beads with adapter 2 ligated products from Example 13 (both gDNA and defined template) were then digested with Ecil for 1 h at 37° C. in Cutsmart buffer (NEB). After Ecil digestion, the beads were washed thrice with 1× B&W buffer and dissolved in 15 μL water. Ligation of 6 μL of Adapter 3 (10 μM) was carried out with Blunt TA/Ligase Master Mix (NEB) for 20 min at 25° C. After the ligation, the beads were washed thrice with 1× B&W buffer and then dissolved in 40 μL water.
Adapter 3 ligated beads (5 μL) were used for PCR amplification with A1-Rn1-PCR1_i5, A1-Rn1-Fw2_PCR1_i5 and Amplify_A3-Rn1-i7 primers for addition of TruSeqA/B PCR handles.
KAPA HiFi Taq polymerase was used for PCR1 with the conditions in Table 11:
The PCR1 products were purified with ChargeSwitch PCR clean-up beads. These PCR1 products (30 ng) were used for addition of multiplexing indexes and either 5, 8, 10 or 12 PCR cycles were tested. For defined template sample, NEBNext i504 Primer and NEBNext i705 Primer were used whereas for gDNA sample, NEBNext i501 Primer and NEBNext i705 Primer were used. The PCR2 conditions are mentioned in Table 12. PCR2 purification was carried out using ChargeSwitch PCR clean-up beads.
The PCR2 products were run 2% agarose gel and the expected band of 240 bp could be observed (
The beads containing Adapter 3 ligated products (from Example 15) were digested with Acul (NEB) in Cutsmart buffer (NEB) for 1 h at 37° C. in a 30 μL reaction. After the digestion, the enzymatic reaction containing the beads was placed on DynaMag-PCR magnet and the supernatant was transferred to a new tube. The supernatant was then supplemented with 2× Blunt TA/Ligase Master Mix and 6 μL of Adapter 4 (10 μM). The ligation was carried out at 25° C. for 20 min. ZYMO DNA Clean & Concentrator kit was used for purification of the ligation reaction.
The purified ligation products (from Example 17) were then PCR amplified using Amplify_A4-Rn1-i5 and Amplify_A3-Rn1-i7 primers using the PCR1 conditions in Table 13.
PCR1 products were purified with ChargeSwitch PCR clean-up beads and 30 ng of them were used for addition of multiplexing indexes using the PCR2 conditions in Table 14. For gDNA sample, NEBNext i501 Primer and NEBNext i707 Primer were used whereas for defined template sample, NEBNext i504 Primer and NEBNext i707 Primer were used for PCR amplification (5, 8, 10 or 12 cycles). The PCR2 products were purified with ChargeSwitch PCR clean-up beads.
These PCR2 products were then run on 2% agarose gel and expected band of 270 bp was observed for both defined template and gDNA (
Samples in each step were individually barcoded and pooled together for next generation sequencing analysis. Miseq v2 150 bp paired kit was used to sequence the NGS library. Samples were quantified and loaded on MiSeq machine. FASTQ files were analyzed with customized scripts. Reads with expected library structure were isolated and mapped to human hg38 genome.
Leukaemia cell line K562 were cultured in RPMI 1640 medium and 10% fetal bovine serum (FBS) at 37° C. and 5% CO2. Cell number and viability of K562 cells were determined using Countess II automated cell counter by mixing 1:1 solution of K562 cells with 0.4% Trypan blue stain. If the samples had 5-15% dead cells, the suspension of K562 cells was treated with DNase (Worthington) at a final concentration of 200 U/ml in cell culture medium for 30 min at 37° C. and 5% CO2. The cells were then washed with Phosphate buffered saline (PBS) and resuspended in PBS. K562 cells number and viability was determined again using Countess II.
The following buffers were prepared for ATAC reaction: ATAC-resuspension buffer (ATAC-RSB) contained 10 mM Tris-HCl (pH 7.4), 10 mM NaCl and 3 mM MgCl2. Tagmentation buffer (2×) contained 20 mM Tris-HCl (pH 7.6), 10 mM MgCl2 and 20% dimethyl formamide. Transposition mix contained 25 μL tagmentation buffer (2×), 16.5 μL PBS, 6-8 μL transposase (12 μM stock), 0.5 μL 1% digitonin and 0.5 μL 10% Tween-20.
Either 50,000 or 100,000 K562 cells were pelleted at 500 RCF for 5 min at 4° C. in a 1.5 mL tube. The supernatant was discarded and the cell pellet was resuspended in 50 μL cold ATAC-RSB supplemented with 0.1% Tween-20, 0.1% NP40 and 0.01% Digitonin. The cells were incubated on ice for 3 minutes. 1 mL of cold ATAC-RSB containing 0.1% Tween-20 was added to the cells and the tube was inverted 3 times for mixing the contents. The nuclei were pelleted in a fixed angle centrifuge at 500 RCF for 10 min (4° C.). The supernatant was discarded and the pellet is resuspended in 50 μL of transposition mix at 37° C. for 30 min in a thermomixer at 300 RPM.
The transposition reaction was then cleaned with Zymo DNA clean and concentrator-5 columns and eluted in 21 μL elution buffer. All of the eluate was used for amplification with the PCR conditions mentioned in Table 15 using NEBNext 2×MasterMix. The primers used for amplification were i7-NexteraA (GTCTCGTGGGCTCGGTC; SEQ ID NO: 51) and i5-NexteraB (TCGTCGGCAGCGTCTC; SEQ ID NO: 52).
The PCR products were purified using Zymo DNA clean and concentrator-5 columns. The purified PCR products were analysed on Bioanalyzer using Agilent high sensitivity DNA kit (
Assembled transposomes on magnetic beads (as per Example 4) were used for tagmenting genomic DNA (400 ng) from K562 cells in 5× TAPS-DMF buffer (50 mM TAPS-NaoH, 25 mM MgCl2 and 50% DMF) in 20 μL reaction. The tagmentation reaction was carried out at 55° C. for 30 min. After the tagmentation, proteinase K (0.5 μL) was added to the reaction for 7 min at 55° C. The tagmentation reaction was then purified using Zymo DNA clean and concentrator-5 kit.
The purified products were then PCR amplified using the conditions in Table 16. The PCR primers used were i7-NexteraA and and i5-NexteraB. KAPA HiFi HotStart ReadyMix was used for the PCR. The PCR products were purified using ChargeSwitch PCR clean-up beads.
Two defined templates of known sequence were used to test enzymatic reactions.
The sequence of the template 1 is as follows:
The sequence of template 2 is as follows:
Two PCR primers TAG-NextA-PCR-FP and TAG-NextB-PCR-RP (Table 1) were used to add a Mmel restriction enzyme site and Nextera A/Nextera B PCR amplification sites.
KAPA HiFi HotStart ReadyMix was used for PCR using the conditions in Table 17.
The PCR products were purified using ChargeSwitch PCR clean-up beads.
Example 24-Mmel restriction digestion & ligation of Adapter 1 2.5 μg of the defined templates containing Mmel restriction sites (as per Example 23) or PCR amplified tagmented gDNA (from Example 22) or PCR amplified ATAC sample (from Example 21) were digested with Mmel in a 50 μL reaction at 37° C. for 1 h. The restriction digestion reaction was then purified using ZYMO DNA Clean & Concentrator kit. Mmel digested product (240 ng) was ligated with 3 μL each of biotinylated Adapter1-1 and Adapter1-2 (see Example 7) using Blunt TA/ligase mastermix (NEB) for 20 min at 25° C. The ligation products were purified using ZYMO DNA Clean & Concentrator kit.
The Adapter 1 ligation products (from Example 24) were then PCR amplified using Chk_Bio_A1_Rn1_FP and Chk_Bio_A1_Rn1_RP primers. The PCR conditions in Table 18 and Table 19 were used with KAPA HiFi HotStart polymerase.
ChargeSwitch PCR clean-up beads were used for purification of PCR products.
The ligation products from Example 24 or PCR amplified ligation products from Example 25 were then amplified using A1-Rn1-PCR1-i5 and A1-Rn1-PCR1-i7 primers to add PCR handles for Illumina sequencing. The PCR1 conditions in Table 20 were used with KAPA HiFi HotStart polymerase.
These PCR1 products were purified using ChargeSwitch PCR clean-up beads.
The purified PCR1 products (50-100 ng) were used for 5 PCR cycles to add P5/P7 Illumina sequencing sites and multiplexing indexes using KAPA HiFi HotStart polymerase. Different combinations of NEBNext i501-1506 primers and NEBNext i701-i708 primers were used for amplification of gDNA, ATAC and defined template 1 and defined template 2. The PCR2 conditions used were as shown in Table 21.
The PCR2 products were checked on 2% agarose gel for presence of the correct band (
Binding and Washing (B&W) buffer-2× (10 mM Tris-HCl (pH 7.5), 1 mM EDTA and 2 M NaCl) was diluted to 1× with nuclease free water. Dynabeads M-270 Streptavidin beads (75 μL/sample) were washed with 1× B&W buffer (1 mL) three times. Dynabeads were then dissolved in 100 μL 2× B&W buffer. Biotinylated Adapter 1 products (450 ng) from Example 25 for gDNA, ATAC, defined template 1 and defined template 2 were added to the dynabeads for a total volume of 200 μL. The samples were then incubated for 20 min at room temperature with shaking. The samples were then placed in DynaMag-PCR Magnet and washed thrice with 1× B&W buffer. After the final wash, the beads were resuspended in 36 μL water.
EcoP15I digestion was carried out directly on beads in Cutsmart buffer (NEB) supplemented with 100 μM sinefungin in a 50 μL reaction at 37° C. for 1 h. After the digestion, the beads were washed twice with 1× B&W buffer (containing 0.01% Tween-20) and three times with 1× B&W buffer. After washings, the beads were dissolved in 30 μL water.
The beads were then used for ligation with 6 μl of Adapter2 (10 μM) using Blunt TA/Ligase Master Mix. The ligation was carried out at 25° C. for 20 min. After the ligation, the beads were washed three times with 1× B&W buffer & subsequently dissolved in 40 μL water.
The EcoP15I digestion products ligated with Adapter 2 (5 μL beads) were then used for PCR amplification with A1-Rn1-PCR1_i5, A1-Rn1-Fw2_PCR1_i5 and A2-Rn1-Rw1_PCR1_i7 primers for 12 cycles with KAPA HiFi HotStart polymerase. The PCR1 parameters are shown in Table 22.
The PCR1 products were then purified using ChargeSwitch PCR clean-up beads. These purified PCR1 products (50-100 ng) were used for the addition of multiplexing indexes for 5 PCR cycles with KAPA HiFi HotStart polymerase. For gDNA, defined template 1, defined template 2 and ATAC, different combinations of NEBNext i501-i506 primer and NEBNext i701-i708 primers were used. The PCR2 parameters are mentioned in Table 23. ChargeSwitch PCR clean-up beads were used for PCR2 purification.
The PCR2 products were run on 2% agarose gel. The expected band of 204 bp was observed for Adapter 2 ligated products for gDNA, defined templates (1 and 2) and ATAC (
The magnetic beads with adapter 2 ligated products from Example 28 (gDNA, ATAC and defined templates 1 and 2) were then digested with Ecil for 1 h at 37° C. in Cutsmart buffer (NEB). After Ecil digestion, the beads were washed thrice with 1× B&W buffer and dissolved in 15 μL water. Ligation of 2-6 μL of Adapter 3 (10 μM) was carried out with Blunt TA/Ligase Master Mix (NEB) for 20 min at 25° C. After the ligation, the beads were washed thrice with 1× B&W buffer and then dissolved in 40 μL water.
Adapter 3 ligated beads (5 μL) from Example 30 were used for PCR amplification with A1-Rn1-PCR1_15, A1-Rn1-Fw2_PCR1_i5 and Amplify_A3-Rn1-i7 primers for addition of TruSeqA/B PCR handles.
KAPA HiFi Taq polymerase was used for PCR1 with the conditions in Table 24.
The PCR1 products were purified with ChargeSwitch PCR clean-up beads. These PCR1 products (50-100 ng) were used for addition of multiplexing indexes and amplified for 5 PCR cycles. Combinations of NEBNext i501-1506 primers and NEBNext i701-i708 primers were used. The PCR2 conditions are mentioned in Table 25. PCR2 purification was carried out using ChargeSwitch PCR clean-up beads.
The PCR2 products were run 2% agarose gel and the expected band of 240 bp could be observed (
The beads containing Adapter 3 ligated products (from Example 30) were digested with Acul (NEB) in Cutsmart buffer (NEB) for 1 h at 37° C. in a 30 μL reaction. After the digestion, the enzymatic reaction containing the beads was placed on DynaMag-PCR magnet and the supernatant was transferred to a new tube. The supernatant was then supplemented with 2× Blunt TA/Ligase Master Mix and 1-6 μL of Adapter 4 (10 μM). The ligation was carried out at 25° C. for 20 min. ZYMO DNA Clean & Concentrator kit was used for purification of the ligation reaction.
The purified ligation products (from Example 32) were then PCR amplified using Amplify_A4-Rn1-i5 and Amplify_A3-Rn1-i7 primers using the PCR1 conditions in Table 26.
PCR1 products were purified with ChargeSwitch PCR clean-up beads and 50-100 ng of them were used for addition of multiplexing indexes using the PCR2 conditions in Table 27. NEBNext i501-i506 primers and NEBNext i701-i708 primers were used for PCR amplification. R2 products were purified with ChargeSwitch PCR clean-up beads.
These PCR2 products were then run on 2% agarose gel and expected band of 270 bp was observed (
The sequencing libraries generated from Example 26, Example 29, Example 31 and Example 33 were pooled and sent for sequencing using Miseq v2 300 bp paired kit.
The sequencing reads for adapter 4 ligated products (Example 33) were trimmed for the adapter sequences using cutadapt 4.1 and the reads shorter than 17 bp and longer than 25 bp were discarded. The filtered reads between 17-25 bp were aligned to the human genome hg38 using Bowtie2. Samtools was used to convert SAM to BAM files. Bedtools was used for intersecting DNAse hypersensitivity regions with ATAC and gDNA reads. UCSC Genome browser was used to visualize alignment of reads to the genome.
The sequencing of ligation products of Adapter 2 (FIG. 4), Adapter 3 (
The percentage of total sequencing reads for adapter 4 ligated products (Example 33) after adapter trimming that were of the correct length (17-25 bp) is shown in
The sequencing reads for gDNA and ATAC samples for adapter 4 ligated products (from Example 33) were intersected with DNAse hypersensitivity regions in the genome using bedtools. Sequencing reads from the ATAC sample are present 12-fold higher than gDNA in DNAse hypersensitivity regions (
Number | Date | Country | Kind |
---|---|---|---|
2114206.2 | Oct 2021 | GB | national |
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/GB2022/052509 | 10/4/2022 | WO |