The instant application contains a Sequence Listing which has been submitted in.xml format and is hereby incorporated by reference in its entirety. Said.xml file is named “018617_01390_ST26.xml”, was created on Dec. 12, 2022, and is 18,002 bytes in size.
The present disclosure is related to compositions and methods for use in selecting genetic targets for use in specific killing of cancer cells. The selection is based upon genotypic differences between cancer and non-cancer cells.
Cancer remains a leading cause of death among population. Most cancers are difficult to treat because no two cancers are identical. Cancers typically result from the long-term exposure to mutagenic agents, such as tobacco smoking, UV/radiation exposure, repeated tissue damage, chronic viral infection, and the like. Existing cancer therapeutic approaches primarily target the protein product of the mutated genes that either harbor cancer-driver mutations or genes that render conditional lethality in cancer cells when inhibited. Because such protein targets are rare among patients, and because rapidly dividing cancer cells tend to acquire resistant mutations quickly, cancer patients typically succumb to the disease due to the exhaustion of all treatment options.
Cancer cells typically have accumulated thousands of passenger mutations in the non-coding regions of genes before acquiring key driver mutations and undergoing clonal expansion. These truncal mutations are not under selection during tumorigenesis, and they frequently become a bi-allelic from loss-of-heterozygosity (LOH) events. They therefore provide an abundant supply of anti-cancer targets for use with personalized medicine for cancer patients. With the emergence of powerful genome editing tools, targeting genotypic biomarkers in cancer becomes conceptionally feasible. However, the available CRISPR tools such as Cas9 and Cas12 only introduce localized genome damage. The resulting indel is only impactful when the coding sequence or splicing pattern is altered. But a majority of the cancer-specific mutations are noncoding passenger mutations, hence small indels have not been impactful enough to kill a cell. Thus, there is an ongoing and unmet need for improved compositions and methods for targeting cancer cells. The disclosure is related to this need.
The present disclosure provides methods for treating cancer. In embodiments, a provide method comprises i) obtaining one or more biological samples from an individual; ii) determining different nucleotide sequences in cancer and non-cancer cells from the biological sample to identify a candidate target sequence that is present in the cancer cells and not present in the non-cancer cells, and further characterizing the candidate target sequence to determine it is suitable for use as a target sequence. Characterizing the candidate target sequence comprises determining in the cancer cells but not the non-cancer cells that the candidate target sequence comprises: a) a homozygous mutation in the cancer cells that is a segment of a chromosome that is within 3 Kb-10 kb of an exon of an essential gene, wherein optionally the exon is not an alternatively spliced exon; b) identifying a protospacer adjacent motif (PAM) in the segment of a); and c) identifying a sequence in the chromosome that is preferably separated from the PAM by 13 to 17 nucleotides, and more preferably separated from the PAM by 1 to 11 nucleotides, wherein the identified sequence is approximately 32 nucleotides in length, to thereby identify a suitable target sequence. Based on the different nucleotide sequences in the cancer cells relative to the non-cancer cells, the method includes introducing into cells of the individual a CRISPR Cas3 system comprising a guide RNA targeted to a segment of the chromosome that is linked to the target sequence such that the chromosome comprising the target sequence is degraded, thereby treating the cancer. In embodiments, the characterization of the candidate target sequence also comprises determining that the different nucleotide sequences in the cancer cells comprises a mutation that is at least one of a breakpoint sequence in a chromosomal translocation sequence, a mutation that comprises an insertion or a deletion, a di-nucleotide mutation, or a single nucleotide mutation. The target characterization may include steps carried out by a computer implemented process. The method can include repeated administrations of the CRISPR Cas3 system and the guide RNA to cancer cells to facilitate cancer cell killing.
Unless defined otherwise herein, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this disclosure pertains.
Unless specified to the contrary, it is intended that every maximum numerical limitation given throughout this description includes every lower numerical limitation, as if such lower numerical limitations were expressly written herein. Every minimum numerical limitation given throughout this specification will include every higher numerical limitation, as if such higher numerical limitations were expressly written herein. Every numerical range given throughout this specification will include every narrower numerical range that falls within such broader numerical range, as if such narrower numerical ranges were all expressly written herein.
All nucleotide sequences described herein include the RNA and DNA equivalents of such sequences, i.e., an RNA sequence includes its cDNA. All nucleotide sequences include their complementary sequences.
All temperatures and ranges of temperatures, all buffers, and other reagents, and all combinations thereof, are included in this disclosure.
All nucleotide and amino acid sequences identified by reference to a database, such as a GenBank database reference number, are incorporated herein by reference as the sequence exists on the filing date of this application or patent.
The disclosure includes all embodiments illustrated in the Figures provided with this
The disclosure of each cited reference is incorporated herein by reference.
Any component of the editing systems described herein can be provided on the same or different polynucleotides, such as plasmids, or a polynucleotide integrated into a chromosome. In embodiments, at least one component of the system is heterologous to the cells. In eukaryotic cells, all components of the system can be heterologous.
In embodiments, any enzyme or other protein as described herein is introduced into the cell as a recombinant or purified protein, or as an RNA encoding the protein that is expressed once introduced into the cell, or as an expression vector, which is expressed once in the cell. Any suitable expression system can be used and many are commercially available for use with the instant invention, given the benefit of the present description. The disclosure also includes use of ribonucleoproteins (RNPs) to introduce the described systems into cells. Any protein described herein can be provided with a nuclear localization signal.
This disclosure relates to use of CRISPR-Cas3 systems to target 1) cancer-specific passenger mutations near oncogenes and 2) homozygous cancer-specific passenger mutations near essential human genes. Because CRISPR-Cas3 allows repeated targeting and causes kilobases of DNA deletion near the target site, when programmed to target a cancer-specific passenger mutation near an oncogene or essential human gene, long-range damage to the genome is caused, which inactivates the nearby gene and kills the cell. The distinct deletion-editing profile of CRISPR-Cas3 is one aspect of the disclosure that, with the described methods of identifying targets, is believed to render the described anti-cancer method distinct from other genome editing based anti-cancer inventions.
In general, the present disclosure provides compositions, methods, and systems that relate to a process for selecting targets in cancer cells, and killing the cancer cells. The disclosure thus independently and collectively includes methods of selecting the cancer cells, and killing selected cancer cells using selected target sites. The combined process of selecting a target site and killing cancer cells is also encompassed by this disclosure. The selected targets are unique to cancer cells within a particular individual. As such, non-cancer cells are not killed using the described methods. In certain aspects the method is accordingly suitable for personalized medicine approaches.
The method can comprise identifying candidate target sites, and further characterizing target sites to establish they are likely to be sufficient to be used in the described methods of cancer cells. Once a target site is selected as described further herein, a Cas3 and a suitable guide RNA that is designed for specificity for the selected target is introduced into the cancer cells (and may be introduced into non-cancer cells). The operation of at least in part the Cas3 and the guide RNA degrades a segment of a chromosome only in the cancer cells that comprise the selected target site. The process may be used for a single or multiple different selected targets sites in cancer cells. Multiple targets may be targeted concurrently or sequentially. The process may be used a single time to kill cancer cells, or multiple times to increase the efficiency and/or number of targets and thereby increase the efficiency and/or the number of cancer cells that are killed.
In more detail, the described process of selecting a suitable targeting site reveals genetic differences that may be homozygous (such as being due to loss-of-heterozygosity) and near (i.e. <3 kb) essential genetic sequences. This targeting requirement greatly increases targetable sites from possibly none, to hundreds and potentially thousands of sites within a particular type of cancer cells. Further, in an additional embodiments, the selected target can be heterozygous if the product of the targeted genetic sequence is at least in part driving cancer formation, one representative example of which comprises a gene encoding a KRAS mutation which results in a fusion oncogene from chromosomal translocation. In a non-limiting embodiment, the KRAS mutation is any of KRAS G12C, G12D or G12R.
In one embodiment, the disclosure provides a method for killing cancer cells based on genetic information of cancer cells obtained from a biological sample of an individual. Either the whole genome of the cancer cells or genomic regions of the cancer cell genomes are sequenced. From the sequencing results, one or a plurality of targets is determined based on criteria further described herein. In embodiments, a target that is operably linked to an essential gene on the same chromosome is selected. Based on the determination of each target site, a guide RNA is prepared such that it targets a location on the same chromosome that is suitable for Cas3-mediated degradation of the chromosome that comprises the essential gene. Thus, the location of, for example, a mutation unique to the cancer cell that is in complete linkage disequilibrium with an essential gene can be targeted. As such, the disclosure does not require and can exclude directly targeting a site within the essential gene. For example, the target may be located in a non-coding segment of a chromosome, including but not necessarily limited to an intron that is flanked by one or two exons. However, the disclosure does not exclude targeting a coding sequence that is linked to the mutation. In certain embodiments, the identified target site may be considered a “therapeutic sequence.”
Essential human genes are defined as genes indispensable for the survival of any human cells. Inactivation of all alleles of such a gene, whether in wild type cell or cancer cell, is expected to kill the cell. Such genes are typically defined experimentally. A representative and non-limiting list of essential genes that are suitable for use with the described methods identified from HAP1 and KBM7 cells is provided in Table A. In embodiments, certain human genes may be Haplo-insufficient. In such cases, inactivation of only one allele of such genes may cause cell death.
In general, a selected a target sequence may comprise a homozygous cancer-specific mutation. In normal diploid human cells, a spontaneous mutation is typically only present in one allele of the chromosome, the other allele in the homologous chromosome remains wild type. Due to the increased genome instability, cancer cells have high incidence of somatic structural variation (SV) in their genome. Such SV typically render cancer-specific mutations homozygous (loss-of-heterozygosity) through one of the following routes: 1) a gene conversion event that render all alleles of the gene in the cancer genome harboring the same mutation; 2) a deletion event that reduces the copy number of a particular gene to one, which happens to harbor the cancer-specific mutation; and 3) a reciprocal or non-reciprocal chromosomal translocation event that renders the cancer cell to contain only one copy of the functional gene, harboring a hallmark breakpoint sequence unique to cancer cells. In embodiments, the described bioinformatics pipeline identifies such mutations within 3 Kb-10 kb distance to an exon of an essential gene through the comparative analysis of the whole genome sequences of the cancer and normal cells from the same patient. The protospacer adjacent motif (PAM) sequence (an example of which is 5′-AAG) of the target site is oriented towards to the to-be-deleted exon, such that the programmed CRISPR-Cas3 tool will delete towards and into this exon.
With respect to identification of target sites, the disclosure provides databases comprising indexed genes and/or mutations that may be found in human cancer cells and targetable using the described approaches. In embodiments, the disclosure provides a system, the system comprising: at least one computer hardware processor; one or more databases that store genomic information, which may include coding and non-coding sequences, entire gene sequences or segments of them, mutations, and may comprise information regarding a cancerous phenotype. A system in communication with the database may include at least one non-transitory computer-readable storage medium storing processor-executable instructions that, when executed by a computer hardware processor, cause the computer hardware processor to perform one or more steps and/or algorithms that are described herein, to generate results that include identification target sites. In embodiments, the disclosure provides a system that includes one or more devices, said devices comprising a DNA sequencing device and/or a computer. In embodiments, one or more components of the device can be connected to or in communication with a described digital processor and/or the computer running software to interpret a DNA sequencing signal. In embodiments, a system described herein may operate in a networked environment using logical connections to one or more remote computers. In embodiments, a result obtained using a device/system/method of this disclosure is fixed in a tangible medium of expression. The result may be communicated to, for example, a health care provider who will perform the chromosome modifying method, and/or an entity which may design, and optionally produce the described guide RNAs based on a determination of target sequences, or an expression vector encoding the guide RNAs for an end user.
In embodiments, the disclosure provides for selecting a target site as described above and in depicted
In embodiments, the disclosure provides for increased DNA deletion-editing in cancer cells, relative to a control value. In embodiments, the control value is obtained from using a CRISPR-Cas3 system programmed with the same RNA guide but harboring a sequence mismatch to the same cell line.
In embodiments, the disclosure utilizes a Type I systems protospacer adjacent motifs (PAM) that comprises di- or tri-nucleotide conserved motifs downstream of protospacers opposite of the crRNA 5′-handle. Those skilled in the art will understand that other PAM sequences may be recognized by Cas enzymes from different bacterial types.
In embodiments, Cascade and Cas3 used according to this disclosure generates one or more genome lesions, considered to be long-range deletions, wherein from the lesion(s) are initiated, or are located, from a few nucleotides from a suitable PAM sequence, and to up to 10 kb upstream of the PAM sequence.
In embodiments one or more proteins used in this disclosure has/have between 50-100% identity to a wild type amino acid sequence. In embodiments, any type I CRISPR system proteins are used. In embodiments, the protein comprises a truncation and/or deletion such that only a segment of the protein that is required to achieve a desired effect (i.e., an improvement in DNA editing/deletion relative to a reference) is achieved. In embodiments, a protein used herein comprises an amino acid sequence that includes additional amino acids at the N- or C-terminus, relative to a wild type sequence. Thus, in proteins used herein have an amino acid sequence described herein, and/or are encoded by any of the nucleotide sequences described herein, or any sequence having at least from 50%-100%, inclusive, and including all integers and ranges of integers there between, identity with the foregoing nucleotide and/or amino acid sequences. In embodiments, proteins have 90, 91, 92, 93, 94, 95, 96, 97, 98, or 99% identity across the entire length or a functional segment thereof of the sequences described herein. Thus, variants of the proteins and their nucleotide sequences are included. The term “variant” and its various grammatical forms as used herein refers to a nucleotide sequence or an amino acid sequence with substantial identity to a reference nucleotide sequence or reference amino acid sequence, respectively. The differences in the sequences may be the result of changes, either naturally or by design, in sequence or structure. Designed changes may be specifically designed and introduced into the sequence for specific purposes. Such specific changes may be made in vitro using a variety of mutagenesis techniques. Such sequence variants generated specifically may be referred to as “mutants” or “derivatives” of the original sequence.
In embodiments, the disclosure includes a crRNA, which may be considered a “targeting RNA” that is developed based upon selection of a target sequence as described above. A crRNA, when transcribed from the portion of the CRISPR system encoding it, comprises at least a segment of RNA sequence that is identical to (with the exception of replacing T for U in the case of RNA) or complementary to (and thus “targets”) the selected target sequence in a cell into which the system is introduced.
In embodiments, the disclosure comprises deleting a segment of a chromosome that is in complete linkage disequilibrium with, or contains an essential gene or a segment thereof. The deletion may be single or double stranded. In embodiments, the deletions comprise from 500 nucleotides, to 10K nucleotides, inclusive, and including all ranges of numbers there between, and including base pair deletions.
In embodiments, two or more effective guides can be combined to formulate a cocktail anti-cancer therapeutic. The advantage of this combinatorial strategy is the reduced likelihood of selecting drug-resistant cancer clones. In embodiments, repeated delivery of a described system is used.
In embodiments, kits for making genetic modifications as described herein are provided. A kit comprises one or more suitable vectors that encode Type I Cascade proteins. The kits can also include other components that are suitable for using the expression vectors to edit DNA in any cell type.
The guide RNA-containing Cascade Complexes can be either produced in a cell using DNA or RNA encoding for the protein and/or RNA components or delivered in the form of one or more vectors for expression or delivered in the form of RNA encoding for the proteins and/or RNA components or delivered in the form of fully-formed protein-RNA complexes through mechanisms including but not limited to electroporation, injection, or transfection. The guide RNA-containing Cascade complexes described herein, can be recombinantly expressed and purified through known purification technologies and methods either as whole Cascade complexes or as individual proteins. These proteins can be used in various delivery mechanisms including but not limited to electroporation, injection, or transfection for whole-protein delivery to eukaryotic organisms or can be used for in-vitro applications for sequence targeting of nucleic acid substrates or modification of substrates. Cascade complexes containing guides which target a selected DNA target will hybridize to the target sequence and will, if complementarity is sufficient, open a full R-loop along the length of the target site. This Cascade-marked R-loop region adopts a conformation which allows Cas3 to bind to a site which is PAM-proximal, orienting the nuclease domain to initially attack the non-targeted DNA strand approximately 9-12 nucleotides inside the R-looped region. The helicase domain is loaded with the non-target strand, and the Cas3 then processively unwinds the substrate DNA in an ATP-dependent fashion from 3′ to 5′. In conjunction with this helicase activity, nuclease activity cleaves the non-target strand in a processive fashion.
In embodiments, the disclosure uses wild-type Cas3 proteins, or modifications or derivatives thereof. For example, in a case where either wild-type Cas3 or an otherwise engineered Cas3 is capable of cleaving both strands of DNA during a processive mode, once recruited to a validated target sequence by Cascade, Cas3 inherently produces a 3′ overhang on the target strand. This is because Cascade is protecting the target strand from just after the PAM site to the end of the R-loop. Thus, once Cas3 is loaded on the non-target strand and begins its processive cleavage, the earliest nucleotide on the target strand that is available for cleavage is at the PAM site. In comparison, degradation of the non-target strand occurs 9-12 nucleotides inside the R-loop region.
In a case where either wild-type Cas3 or an otherwise engineered Cas3 is capable of confining its cleavage activity to one or the other strand of substrate DNA, two or more Cascade targeting complexes can be used, such that the PAM sites are facing towards one another, to recruit Cas3 to each target site and degrade the intervening section of DNA on both strands. This will produce 3′ overhangs on both strands of DNA and a degraded segment of DNA between.
As discussed above, variations on Cascade are encompassed in this disclosure. In embodiments, a Cas3/Cascade system is any Cas3 and Cascade described in PCT publication no. WO 2019/246555, from which the entire disclosure is incorporated herein by reference.
As discussed above, the disclosure also includes using Cas3 variants and derivatives. For example, mutations can be made that affect protein stability, R-loop recruitment efficiency, initial nicking efficiency, helicase activity, processive nuclease activity, expression or purification, off-target effects, or other protein functions or properties.
The following examples are presented to illustrate the present disclosure. They are not intended to be limiting in any manner.
This Example provides a demonstration of using non-coding regions of POLR2A, an essential gene to evaluate 1) whether CRISPR-Cas3 targeting can efficiently affect cell viability, and 2) whether it is safe to rely on a single nucleotide difference to distinguish wild-type cells from diseased cells. The experiments were performed in the haploid cell line eHAP1. The results and related information are depicted in
This Example provides demonstrates selected target cancer-specific killing of homozygous mutations in noncoding regions of essential genes. The mutations were identified and ranked using the described methods of target selection by comparing the whole genome sequences of COLO829 (skin cancer) and COLO829-BL (normal cell from the same patient). A list of ˜200 targets were identified this way. Five mutations were then selected for experimental testing. Due to technical difficulties in electroporating COLO829 cells, we targeted the same sites (without the mutation) in two different cancer cell lines (haploid eHAP1 cell line described above, and diploid HEK293 cell line used in this Example). The described anti-cancer method is expected to be broadly applicable to all cancer types. The efficacy is expected to only correlate with the targeting efficiency and the functional consequence of the deletion-targeting event. Therefore, validating the targeting efficacy in different cell lines supports a broadly applicable approach. In this regard, the efficacy pattern is consistent in eHAP1 and HEK293 cells. For example, the guide RNA targeted to RPL39 was identified as the most effective guide against both cell lines.
While the disclosure has been particularly shown and described with reference to specific embodiments (some of which are preferred embodiments), it should be understood by those having skill in the art that various changes in form and detail may be made therein without departing from the spirit and scope of the present disclosure as disclosed herein.
This application claim priority to U.S. provisional application No. 63/290,489, filed Dec. 16, 2021, the entire disclosure of which is incorporated herein by reference.
| Filing Document | Filing Date | Country | Kind |
|---|---|---|---|
| PCT/US2022/081852 | 12/16/2022 | WO |
| Number | Date | Country | |
|---|---|---|---|
| 63290489 | Dec 2021 | US |