Mouse or other animal models are indispensable for studying the molecular mechanisms of human diseases and for pharmaceutical drug testing, as most of these studies cannot be done practically or ethically with humans. Most human diseases are complex diseases involving many genes and environmental inputs. The contribution of each gene to these diseases is usually small and is dependent on genetic background. To be reproducible, current mouse gene studies are dependent on establishing a congenic strain in which all mice have identical genetic material. The genetic variations can be eliminated. However, genetically, a congenic strain is analogous to a single human in the sense that one mouse strain/human cannot represent the entire population. To recapitulate the complex human disease, an existing line has to be backcrossed with many inbred strains, many times (10 generations or 2 years), to establish many congenic strains to mimic human genetic variations in the population. Practically, this is not possible due to cost and time, as it is estimated that at least 1001 mouse strains are required to represent the genetic variation in human population. Moreover, mice from a congenic strain still have genetic variations, and other variations (epigenetic and environmental) and experimental variations remain. Therefore, complex human diseases cannot be accurately modeled in mouse or other animals using the current techniques.
Common variable immunodeficiency (CVID) is the most common human primary immunodeficiency, which affects 1-2% of the population. The highly variable symptoms that are present in different CVID patients, including those with the same genetic causes, indicate complex interactions in multiple genes and environmental inputs, making it difficult to study this disease. Consequently, the etiology of over 80% of CVID cases is unknown. A small group of genes including the lipopolysaccharide-responsive vesicle trafficking, beach and anchor containing (LRBA) gene are found to cause CVID (Lopez-Herrera et al., 2012; Alangari et al., 2012; Burns et al., 2012). Unlike the other CVID genes, LRBA regulates the vesicle trafficking required for the regulation of one-third of human proteins (Wang et al., 2001). For example, LRBA regulates or may regulate many crucial proteins, e.g., NFκB, MAPKs, AKT, TNFα, IL10 and CVID receptors, and many components of the EGFR, NOTCH and RAS/MAPK pathways (Shamloula et al., 2002; Wang et al., 2004; Yatim et al., 2012). It is also involved in cell proliferation, apoptosis and autophagy (Lopez-Herrera et al., 2012; Wang et al., 2004). Furthermore, LRBA is a potential oncogene and is overexpressed in multiple cancers (Wang et al., 2004). For example, its overexpression is a molecular signature for breast cancer mortality and recurrence (Andres et al., 2013). These data demonstrate that LRBA has the features of central immune regulators like NFκB. Without being limited by theory, the inventors propose that LRBA controls multiple critical immune regulators and its deregulation causes immunodeficiency.
The inventors wished to study how LRBA regulates the critical immune regulators in vivo to better understand and treat immune disorders associated with deregulated LRBA. However, the lack of a good mouse model poses a critical barrier to test the inventors' hypothesis. Current mouse gene studies are dependent on establishing a congenic strain, which is analogous to a single human in the sense that one cannot represent the whole population. To recapitulate the complex human disease, an existing line has to be backcrossed with many strains, multiple times, to establish many congenic strains to increase genetic diversity to mimic human genetic variations in the population. This may not be feasible as it would cost too much money and time. Moreover, mice from such a congenic strain are still not genetically identical and it is demonstrated that the phenotypes originally attributed to the targeted gene may actually attribute to other genes (Eisener-Dorman et al., 2009; Ridgway, 2014). Therefore, it is not possible to accurately model complex human diseases in mice using the current techniques.
To alleviate this problem, the inventors have produced an all-in-one animal model (e.g., a mouse model) so that expression of a gene, such as Lrba, can be turned on/off and wild type (wt), heterozygote (het) and knockout (ko) cell populations can be produced and studied in a single animal and in a single tube (vessel) without the interference of variations. Phenotypes thus can be specifically attributed to the gene (e.g., Lrba) if they change in response to gene expression being switched on/off (
The patent or application file contains at least one drawing executed in color. Copies of this patent or patent application publication with color drawing(s) will be provided by the Patent and Trademark Office upon request and payment of the necessary fee.
SEQ ID NO:1 is the nucleic acid sequence of the human wild-type Lrba gene (Homo sapiens LPS-responsive vesicle trafficking, beach and anchor containing (LRBA), transcript variant 1, mRNA. Accession number: NM_001199282:
SEQ ID NO:2 is the nucleic acid sequence of the mouse wild-type Lrba gene (Mus musculus LPS-responsive beige-like anchor (Lrba), transcript variant 1, mRNA. Accession number: NM_030695):
SEQ ID NO:3: Lrba/SD/loxP portion of Lrba BAC subclone plasmid (
SEQ ID NO:4: STOP/loxP portion of Lrba BAC subclone plasmid (
SEQ ID NO:5: P2A/Lrba portion of Lrba BAC subclone plasmid (
SEQ ID NO:6: transcription control cassette (
The invention concerns a non-human animal model comprising three genotypes for a single gene, resulting in three different cell populations for that cell type. Thus, each cell population has a different genotype for the gene. The invention concerns a non-human animal model comprising three populations of a cell type, wherein each cell population has a different genotype for a gene, wherein the three cell populations comprise:
(a) a first population of cells having a wild-type gene;
(b) a second population of cells heterozygous for the gene; and
(c) a third population of cells having an inactivated version of the gene;
wherein the three populations of cells include a transcription control cassette operably linked at the genomic locus of the gene.
Any gene of interest may be utilized to produce the non-human model. Lrba is used herein merely as an example. Various methods may be used for gene activation, such as insertion, deletion, substitution, and/or recombination.
The 2A peptide gene DNA sequence was used to link a flouresecnt gene and the two proteins to be co-expressed by the endogenous gene (Kim, Lee et al. 2011). Due to its small size (57 bp), high self-cleavability, and ability to produce an equal molar ratio of the two proteins, 2A is superior to IRES, which is large and causes differential expression of the two genes that it links (Kim, Lee et al. 2011). By placing the transcription control FP cassettes (TCFP) before the translation start codon (ATG), allows not only labeling of the endogenous protein but also tracking of the promoter activity of the endogenous gene. The STOP sequence in the casette is used to stop downstream gene expression. The Cre partial cleavage of the two loxP sites (triangle) will result in nine genotypes and cells with specific FP labeling.
The three populations of cells may comprise a first detectable label, a second detectable label, and a third detectable label, respectively, wherein each detectable label is distinguishable from the other detectable labels, wherein each detectable label is encoded by a nucleic acid sequence linked to the respective gene, and wherein the expression of the nucleic acid sequence is dependent upon expression of the gene (co-expressed) in the respective cell population. In some embodiments, the detectable label is a fluorescent label or luminescent label. In some embodiments, the detectable label is a fluorescent label selected from the group consisting of green fluorescent protein (GFP), red fluorescent protein (RFP), and cyano fluorescent protein (CFP).
The selected cell type will be that in which the three genotypes for that gene will exist. The cell type may be one in which the selected gene occurs naturally and is expressed, or is not expressed, or a cell type in which the gene is not normally present. Thus, the gene may be heterologous to the cell, and may be from the same species or a different species. In some embodiments, the three populations of cells are B cells.
Any gene may be utilized. In some embodiments, the three populations of cells are B cells, and the gene is lipopolysaccharide (LPS)-responsive beige-like anchor (Lrba) gene. Thus, the animal model may comprise three populations of B cells, wherein the three populations of B cells comprise:
(a) a first population of B cells having a wild-type lipopolysaccharide (LPS)-responsive beige-like anchor (Lrba) gene;
(b) a second population of B cells heterozygous for the Lrba gene; and
(c) a third population of B cells having an inactivated Lrba gene;
wherein the three populations of B cells include a transcription control cassette operably linked at the Lrba genomic locus.
The three populations of B cells may further comprise a first detectable label, a second detectable label, and a third detectable label, respectively, wherein each detectable label is distinguishable from the other detectable labels, wherein each detectable label is encoded by a nucleic acid sequence, and wherein the expression of the nucleic acid sequence is dependent upon expression of the Lrba gene in the respective B cell population. In some embodiments, the detectable label is a fluorescent label or luminescent label. In some embodiments, the detectable label is a fluorescent label selected from the group consisting of green fluorescent protein (GFP), red fluorescent protein (RFP), and cyano fluorescent protein (CFP).
In some embodiments, the animal model is a rodent, such as a mouse or rat, or a non-human primate such as a monkey or ape.
Another aspect of the invention concerns a composition comprising two or more of the populations of cells from the non-human animal model. Thus, the composition comprises a plurality of populations of cells from a single non-human animal, wherein said plurality of populations of cells comprise at least two of the following populations of cells:
(a) a first population of cells having a wild-type gene;
(b) a second population of cells heterozygous for the gene; and
(c) a third population of cells having an inactivated version of the gene;
wherein the three populations of cells include a transcription control cassette operably linked at the genomic locus.
Two or all three of the cell populations may be collected in the composition. In some embodiments, the populations of cells comprise a first detectable label, a second detectable label, and a third detectable label, respectively, wherein each detectable label is distinguishable from the other detectable labels, wherein each detectable label is encoded by a nucleic acid sequence, and wherein the expression of the nucleic acid sequence is dependent upon expression of the gene in the respective cell population. The populations of cells in the composition may be in isolated form, or may reside in a tissue sample collected from an animal model of the invention. The composition may be blood collected from the animal, such as peripheral blood.
In some embodiments, the cell populations are B cell populations.
In some embodiments, the cell populations are B cells and the gene is Lrba. In these embodiments, the composition comprises a plurality of populations of B cells from an animal model of the invention, wherein the plurality of populations of B cells comprise at least two of the following populations of B cells:
(a) a first population of B cells having a wild-type lipopolysaccharide (LPS)-responsive beige-like anchor (Lrba) gene;
(b) a second population of B cells heterozygous for the Lrba gene; and
(c) a third population of B cells having an inactivated Lrba gene;
wherein the three populations of B cells include a transcription control cassette operably linked at the Lrba genomic locus.
The animal models and compositions of the invention can be used for studying phenotypes. Thus, another aspect of the invention concerns a method for studying phenotypes (using the animal models or compositions of the invention), comprising:
providing a non-human animal model or composition of the invention; and
analyzing one or more of the phenotypes of the non-human animal model or composition in the presence and/or absence of an exogenous agent.
Phenotype analysis may involve analyzing the characteristics and/or behavior of one or more of the cell populations of the animal model or composition, which can be done, for example, by microscopy, flow cytometry, or other procedures known in the art.
Phenotype analysis may include measuring the detectable label of one or more of the cell populations of the animals or compositions and, optionally, comparing the measured detectable label to that of one or both of the other detectable labels.
The method may further comprise activing or deactivating the transcription control cassette to induce or inhibit expression of the gene, wherein a change in phenotype is indicative of a gene-dependent response (for example, in the case of Lrba, an Lrba-dependent response).
The exogenous agent may be a small molecule or biologic molecule that is administered to the animal model, or some other treatment administered to the animal.
The non-human animal model is preferably a mammal. For example, the animal model may be a rodent or non-human primate. The single animal technique is especially useful to generate larger animal models that cannot be bred in large number (such as mice) which is normally required for statistical power using traditional methods.
In some embodiments, the animal is selected from the group consisting of a mouse, rat, guinea pig, hamster, gerbil, pig, cow, dog, wolf, coyote, jackal, and cat. In some embodiments, the animal model is a monkey or ape. In some embodiments, the animal model is a primate selected from the group consisting of a macaque, marmoset, tamarin, spider monkey, vervet monkey, squirrel monkey, and baboon. In some embodiments, the animal model is an ape selected from the group consisting of a gorilla, chimpanzee, orangutan, and gibbon. The animal model may be a hybrid of two non-human animals (e.g., dog-wolf).
Because the wild-type, heterozygous, and knockout B cells are present within the same animal, it is possible to accurately attribute changes in phenotype to Lrba as Lrba expression is switched on and off, without the need for control animals.
The animal model may have any desired genetic background. The animals may be crossed with many strains, and the gene (e.g., Lrba) may be studied with a wide genetic background, which is desired to recapitulate the complexity of human disease. As controls are not required, establishing congenic strains are not necessary. The all-in-one model of the invention can be crossed with many strains simultaneously and the function of a gene can be studied in a wide genetic background, which is desirable for recapitulating human complex diseases and can save time and resources.
The animal may be further modified at the genetic or epigenetic level so as to be useful in modeling a particular disease, such as cancer, cardiovascular disease, a metabolic disease such as diabetes, or a monogenic disease. For example, the animal model may be further modified to model Down Syndrome, cystic fibrosis, cancer, glaucoma, type-I diabetes, type-II diabetes, epilepsy, heart disease, muscular dystrophy, or gynecological tumors.
Down Syndrome—One of the most common genetic birth defects in humans, occurring once in every 800 to 1,000 live births, Down syndrome results from an extra copy of chromosome 21, an abnormality known as trisomy. The Ts65Dn mouse, developed at The Jackson Laboratory, mimics trisomy 21 and exhibits many of the behavioral, learning, and physiological defects associated with the syndrome in humans, including mental deficits, small size, obesity, hydrocephalus and thymic defects. This model represents the latest and best improvement of Down syndrome models to facilitate research into the human condition.
Cystic Fibrosis (CF)—The Cftr knockout mouse has helped advance research into cystic fibrosis, the most common fatal genetic disease in the United States today, occurring in approximately one of every 3,300 live births. Scientists now know that CF is caused by a small defect in the gene that manufactures CFTR, a protein that regulates the passage of salts and water in and out of cells. Studies with the Cftr knockout have shown that the disease results from a failure to clear certain bacteria from the lung, which leads to mucus retention and subsequent lung disease. These mice have become models for developing new approaches to correct the CF defect and cure the disease.
Cancer—The p53 knockout mouse has a disabled Trp53 tumor suppressor gene that makes it highly susceptible to various cancers, including lymphomas and osteosarcomas. The mouse has emerged as an important model for human Li-Fraumeni syndrome, a form of familial breast cancer.
Glaucoma—The DBA/2J mouse exhibits many of the symptoms that are often associated with human glaucoma, including elevated intraocular pressure. Glaucoma is a debilitating eye disease that is the second leading cause of blindness in the United States.
Type 1 Diabetes—This autoimmune disease, also known as Juvenile Diabetes, or Insulin Dependent Diabetes Mellitus (IDDM), accounts for up to 10 percent of diabetes cases. Non-obese Diabetic (NOD) mice are enabling researchers to identify IDDM susceptibility genes and disease mechanisms.
Type 2 Diabetes—A metabolic disorder also called Non-Insulin Dependent Diabetes Mellitus (NIDDM), this is the most common form of diabetes and occurs primarily after age 40. The leading mouse models for NIDDM and obesity research were all developed at The Jackson Laboratory: Cpefat, Lepob, Leprdb and tub.
Epilepsy—The “slow-wave epilepsy,” or swe, mouse is the only model to exhibit both of the two major forms of epilepsy: petit mal (absence) and grand mal (convulsive). It shows particular promise for research into absence seizures, which occur most often in children.
Heart Disease—Elevated blood cholesterol levels and plaque buildup in arteries within three months of birth (even on a low-fat diet) are characteristics of several experimental models for human atherosclerosis: the Apoe knockout mouse and C57BL/6J.
Muscular Dystrophy—The Dmd mdx mouse is a model for Duchenne Muscular Dystrophy, a rare neuromuscular disorder in young males that is inherited as an X-linked recessive trait and results in progressive muscle degeneration.
Ovarian Tumors—The SWR and SWXJ mouse models provide excellent research platforms for studying the genetic basis of ovarian granulosa cell tumors, a common and very serious form of malignant ovarian tumor in young girls and post-menopausal women.
Cell populations of the animal are preferably labeled with a detectable label (also referred to herein as a reporter) in order to detect and track gene promoter activity and gene expression, e.g., by flow cytometry. Thus, the reporter gene and the gene of interest are operably linked such that they are co-expressed. If a phenotype changes in response to gene expression being switched on or off, the phenotype can be specifically attributed to that gene.
Such detectable labels are known in the art, and include, for example, fluorescent reporter proteins encoded by fluorescent reporter genes. Preferably, each reporter can be detected in a living animal. Thus, the amount, distribution, proliferation, movement, properties, and behavior of the labeled cells can thus be assessed and, optionally, monitored.
Exemplary reporters include light-emitting reporters, such as fluorescent and luminescent reporters. Polypeptides that result in the generation of light in a living organism (bioluminescence) include, but are not limited to, various luciferases, green fluorescent protein (GFP), yellow fluorescent protein (YFP) and aequorin (Wilson and Hastings, Annu. Rev. Cell Dev. Biol., 1998, 14:197-230). Fluorescence reporters have many diverse uses, the most common of which are for fluorescence microscopy and also for flow cytometry. In both cases internal expression of the fluorescence reporter, using a reporter plasmid system, allows simple assessment of cell properties and/or behavior.
Luciferase is a luminescent molecule, and thus does not require excitation in order to generate light. It does typically require a substrate (e.g., luciferin, an aldehyde or coelenterazine), an energy source (e.g., ATP) and oxygen. In the case of bacterial luciferases, the genes encoding the substrate can be supplied the same vector as the gene(s) encoding the luciferase enzyme, thus eliminating the need for exogenously-supplied substrate (see, e.g., U.S. Pat. No. 5,650,135).
In some embodiments, the reporter for each B cell population is selected from the group consisting of red fluorescent protein (RFP), green fluorescence protein (GFP), yellow fluorescence protein (YFP), and cyano fluorescence protein (CFP). Polynucleotide cassettes encoding such polypeptides may be transfected into the target site as extra-chromosomal genetic elements (e.g., plasmids) or are stably incorporated into the genome (e.g., “hopped” in using, for example, a transposon).
If the reporter is a light-emitting reporter, method of measurement incluude using a photon detection device, such as an intensified CCD camera, a cooled CCD camera, or any other photon detection device with a high sensitivity. However, other methods may be used. For example, a light-emitting reporter may also be detected using a sensitive luminometer; a radioactive reporter may be detected by counts, X-ray imaging or scintillation.
The term “operatively linked” or “operably linked” refers to the connection of elements being a part of a functional unit such as a gene or an open reading frame (e.g., encoding LRBA). Accordingly, by operatively linking a promoter to a nucleic acid sequence encoding a gene product such as a polypeptide the two elements becomes part of the functional unit—a gene. The linking of the expression control sequence (promoter) to the nucleic acid sequence enables the transcription of the nucleic acid sequence directed by the promoter. By operatively linking two heterologous nucleic acid sequences encoding a polypeptide the sequences becomes part of the functional unit—an open reading frame encoding a protein or proteins comprising the amino acid sequences encoded by the heterologous nucleic acid sequences. By operatively linking two coding sequences, the sequences can be co-expressed.
LRBA mutation causes CVID, which is highly heterogeneous, genetically, immunologically and clinically. It is associated with many diseases and conditions including infections, chronic lung disease, autoimmunity, hepatitis, granulomatous disease and cancers (Lopez-Herrera et al., 2012; Alangari et al., 2012; Burns et al., 2012; Gathmann et al., 2012; Eibel et al., 2010; Park et al., 2008). The association with autoimmunity is paradoxical but intriguing as it provides a unique opportunity to study the etiology of autoimmunity associated with many chronic human diseases (Podjasek and Abraham, 2012). The CVID symptoms are highly variable among patients including those with the same genetic causes. For example, the same LRBA mutation presented in two nuclear families of cousins causes hypogammaglobulinemia in one family but not in the other (Alangari et al., 2012), indicating complex interactions in multiple genes and environmental inputs. This makes it difficult to study this disease. Unlike other CVID genes, which include cell membrane receptors [CD19, CD20, CD21, CD81, inducible costimulator (ICOS), B cell-activating factor receptor (BAFFR), transmembrane activator and calcium modulator and cyclophilin ligand interactor (TACI)], LRBA causes more severe and potentially life-threatening CVID and a higher incidence of autoimmune problems (Lopez-Herrera et al., 2012; Alangari et al., 2012; Burns et al., 2012; Wang and Lockey, 2014). It regulates vesicle trafficking (Wang et al., 2001; Cullinane et al., 2013), which is essential for the proper function of about one-third of human proteins (Kholodenko, 2003; Wang et al., 2010; Wiley, 2003). Thus, LRBA potentially regulates many proteins, e.g., the LRBA orthologue, rugose, interacts with the multiple components of the EGFR, NOTCH and RAS/MAPK pathways (Shamloula et al., 2002; Wang et al., 2004; Yatim et al., 2012), and the preliminary data herein show that LRBA regulates NFκB, MAPKs, AKT, TNFα, IL10 and CVID receptors. It is also an oncogene candidate as it is overexpressed in breast, lung, colon and kidney cancers, and its promoter activity is significantly increased by oncogene E2F1 and inhibited by tumor suppressor p53 (Wang et al., 2004). For example, LRBA knockdown inhibits cancer cell growth (Wang et al., 2004), while LRBA overexpression is a molecular signature for breast cancer mortality and recurrence (Andres et al., 2013). Moreover, it is involved in cell proliferation, apoptosis and autophagy (Lopez-Herrera et al., 2012; Wang et al., 2004). These data demonstrate that LRBA has the features of central immune regulators such as NFκB. The inventors propose that LRBA regulates critical immune regulators and its deregulation (null, down-regulation and over-expression) causes immunodeficiency.
Current mouse models cannot recapitulate most human diseases including LRBA-caused CVID as these diseases have highly variable phenotypes. Traditional mouse studies are dependent on establishing a congenic strain, which is analogous to a single human in the sense that one cannot represent the entire population (Vogel, 2003). To recapitulate such a complex human disease, an existing line has to be backcrossed with many other strains about 10 times (2 years) to establish many congenic strains to increase genetic diversity to mimic human genetic variations in the population. This is not possible due to cost and time. Moreover, mice from a congenic strain still have genetic variations due to the tightly-linked genes flanking the targeted gene and single nucleotide polymorphisms (Eisener-Dorman et al., 2009; Ridgway, 2014). Therefore, it is not implausible that the phenotypes originally attributed to the targeted gene actually be attributable to other genes (Eisener-Dorman et al., 2009; Ridgway, 2014). To alleviate this problem, the inventors propose the production of an all-in-one animal model (e.g., an all-in-one mouse model) in which Lrba expression can be turned on/off in a spatiotemporal and trackable manner, so that, for example, wt, het and ko B cells with respectively distinct reporter signals (e.g., different FP colors) can be produced in a single animal. Thus, phenotypes can be studied by flow cytometry in a single animal and in a single tube with high resolution (sensitivity) as there are no interferences from the genetic, epigenetic, environmental and experimental variations, and can be specifically attributed to Lrba if they change in response to Lrba expression being switched on/off. This animal model will allow the inventors to test this hypothesis, confirming the in vitro data that LRBA regulates many critical immune regulators, NFκB, MAPKs, AKT, TNFα, IL10 and CVID receptors to demonstrate that like NFκB, LRBA is one of most important immune regulators.
The concept of a single animal and a single tube can provide more accurate pre-clinical trial data. The pre-clinical results of this study will directly benefit LRBA-deficient patients who suffer severe CVID symptoms, even death and cannot be cured (Lopez-Herrera et al., 2012; Alangari et al., 2012; Burns et al., 2012). Thanks to the CRISPR technique, primate ko models can be generated (Niu et al., 2014). The all-in-one concept can be used to create primate models for pre-clinical trials and will dramatically reduce cost and animal number as only one animal is needed for each treatment group and can be used for its lifetime.
The animal model of the invention is highly innovative due to several features. The concept of phenotyping in a single animal and single tube will eliminate genetic, environmental and experimental variations that can contribute to or even mask the phenotype (Eisener-Dorman et al., 2009; Ridgway, 2014). In addition, flow cytometry can be used to analyze millions of cells with multiplex ability. Consequently, without the interference of variations, small differences in phenotype can be detected, allowing high resolution of the phenotyping required for those caused by less penetrance (more dependent on genetic background) of the targeted gene. As controls are not required, establishing a congenic strain is not necessary, the all-in-one model can be crossed with many strains and study Lrba in a wide genetic background required for mimicking genetic diversity in complex human CVID. F2 mice can be used directly for experiments. This can save tremendous time and money. The single animal concept is also very useful to study gene-gene interactions. Because if a significant difference is detected in B cell phenotypes from two animals either from the same or different inbred or outbred strain, it may indicate the presence of a modifier gene(s), which can be determined by the next generation sequencing of the whole exome from the two mice. Therefore, this model can be used to quickly discover gene-gene interactions, which are critical for studying complex human diseases. Knockout, overexpression and reporter mouse models are usually generated separately and only one model is studied in most labs due to time and financial limitations. With the all-in-one model of the invention, LRBA expression can be manipulated in multiple ways and the three genotypes can be generated in a single animal so the data therefore are more comparable.
The all-in-one animal model addresses a critical problem that current animal models cannot solve in modeling complex human immunodeficiency diseases that require great genetic variations present in the population. The Lrba gene has previously been cloned and a conditional knockout mouse model has been successfully produced (Wang et al., 2001; Wang et al., 2002). The all-in-one animal model provides the unique opportunity to facilitate high resolution study of the LRBA gene, a novel, unique and important immune regulator to understand a critical aspect of immunodeficiency. However, as indicated above, the all-in-one animal model may also be utilized for high resolution study of any gene.
There are alternative ways in which the all-in-one animal model can be produced, including alternative ways to switch the gene of interest on and off. An alternative to the CRISPR method, the traditional knock-in method based on mouse embryonic stem cells can be used to obtain this model. Similar to the Cre-loxP technology, Flp-FRT or PhiC31 Integrase-mediated recombination can also be used in the place of Cre-loxP recombination to turn on gene expression. In order to produce other species of animal models, the species-specific sequences will be used to replace mouse sequences. Other changes include, fertilized eggs and foster mothers from that species.
As an alternative to the fluorescent proteins (FP), Fluorogen activating peptide—FAP-tags®, a new class of small genetically encoded reporters that exhibit fluorescence only in the presence of micromolar concentrations of particular nontoxic soluble fluorogens, can be used in the place of FPs (http://spectragenetics.com/).
A vast array of experiments can be carried out using the all-in-one animal model, in a general step-by-step fashion, including experiments that would be conducted on the animal itself and on the cells (e.g., B cells) obtained from the animal.
1. Recapitulation of LRBA deficiency on the development of lymphocytes. In addition, LRBA is expressed ubiquitously, especially in hematopoietic cells and stem cells [13,15]. The inventors will use the multiparametric flow cytometry developed by BD Biosciences to study the development of B and T lymphocytes in this model. (a) Analysis of B-cell developmental stages in mouse bone marrow [77]. Seven different developmental phases can be discriminated in bone marrow by a panel of seven B cell surface markers. Pre-pro-B, Pro-B, and Pre-B cells can be distinguished within the low positive CD45R population based on their differential expression of BP1 and CD24. Immature, transitional, and early and late mature B cells could be segregated based on differential expression of IgM and IgD [77]. To study the effect of Lrba ko in the periphery, mature B cell presence in mouse spleens will also be determined. (b) Analysis of T-cell developmental stages in thymus [78]. The six developmental stages, four double-negative (DN1, DN2, DN3, DN4), double-positive (DP) and single-positive (SP), can be discriminated in thymus by an 8-color panel of cell surface markers. B220 negative events will be gated first, and then CD4 vs CD8 will identify the DN, DP, and SP cell populations. CD44 vs CD25 will identify the four substages (DN1 to DN4) in DN cell population. The low, intermediate, and high expression levels of TCR β corresponds to DN, DP and SP cells. CD69 and CD5 will be included in the panel because they are indicators for positive selection and the intensity of TCRs and self MHC-peptides interactions. Mature T cell presence in mouse spleens and lymph nodes will also be determined to study the effect of Lrba ko in the periphery. The activation state of B cells or T cells will also be determined in the periphery using MHC class II, CD40 and CD86 for B cells and, CD44, CD62L and CD25 for T cells, respectively. *Flow cytometry methodologies: LSR-II flow cytometer, which has an analysis rate of up to 40,000 cells per second, and the capacity to measure 15 cell markers. 40 samples can be analyzed within a couple of hours. 1) To exclude cell aggregates, two sequential gates of scatter width vs height signals will be applied. The singlet population will then be gated by forward scatter vs side scatter to exclude dead cells and debris. “Live cells” will be gated using a contour plot and then switch to a dot plot for easy monitoring of acquisition. 2) A FMO-control (Fluorescence Minus One) is a control sample composed of all antibody labels except one, and will be used as a negative control in place of an isotype control for that antibody staining. 3) All antibodies will be titrated. 4) The fluorophores used by BD that overlap with iRFP of CFP will be replaced with other fluorophores. (c) Others: 1) The One Step Staining Mouse Treg Flow™ Kit (BioLegend) will be used to detect Treg cells in mouse spleens. 2) Plasma antibody isotyping (IgG1, IgG2a, IgG2b, IgG3, IgA and IgM) will be performed using the Pierce Rapid ELISA Mouse mAb Isotyping Kit. 3) Lymph nodes, lung and intestine sections will be analyzed by fluorescent microscopy [72] to examine lymphoproliferation and lymphocyte infiltration. 4) Comprehensive standardized gross and histopathologic analyses will be performed, including the analyses of organ weights, serum chemistries and hematology. The inventors will use 12 age-matched mice of both sexes, 6 mutant and 6 wild-type controls for the analyses.
2. Recapitulation of LRBA deficiency on different genetic backgrounds: The phenotype is different in the two strains (
The impact of LRBA deficiency on the development of B or T cells at each stage has not been investigated yet. The all-in-one animal model will allow highly sensitive detection of any abnormalities in B and T cell development caused by Lrba deficiency, which may not be detectable with the current techniques. 1) Based on human data, it is expected, as shown in
3. Linage tracing of hematopoietic cells. The three genotypes can be induced at early development of hematopoietic cells, e.g. hematopoietic stem cells, then trace the development of each genotype using flow cytometry and fluorescent microscopy, and answer the questions whether different genotypes have different development consequences in terms of cell numbers, cell types (B, T lymphocytes, monocytes, dendritic cells) and cell subtypes (CD4, CD8 T cells, B1 and B2 B cells, etc.).
4. Study lymphocyte and stromal cell interaction. Lymphocytes undergo massive cell death at multiple developmental stages in order to eliminate non- or self-reactive lymphocytes through positive and negative selection, in which stromal cells play an important role. Since both lymphocytes and stromal cells are labeled with fluorescent colors specific to the genotypes, the influence of the different genotype of the stromal cells on the selection of lymphocytes can be studied.
5. Study the cell-cell interactions of hematopoietic cells. Similarly, the cell-cell interaction play an important role in the activation or inhibition of hematopoietic cells, such as the activation of B cells by T cells, the inhibition of T cell proliferation by regulatory T cells.
Example 1 describes how to generate animal model of the invention (an all-in-one mouse model) in which Lrba expression can be turned on/off in a spatiotemporal and trackable manner. A transcription control cassette (TCC) composed of a CAG promoter, STOP, tet-inducible system, two FP genes and three loxP sites will be inserted into the Lrba genomic locus by the CRISPR technique (Zhou et al., 2014; Wang et al., 2013; Yang et al., 2013; Fujii et al., 2013).
Example 2 describes determination of whether Lrba regulates the cell membrane levels of the CVID-associated receptors, in order to test if the all-in-one animal model is working and the novel idea to study simultaneously multiple phenotypes in a single mouse and single tube (
Following are examples that illustrate procedures for practicing the invention. These examples should not be construed as limiting. All percentages are by weight and all solvent mixture proportions are by volume unless otherwise noted.
CRISPR can induce up to 78% targeted mutation in mice with higher germline transmission and lower off-target mutation rate, greatly reducing time from years to months (Zhou et al., 2014; Wang et al., 2013; Yang et al., 2013; Fujii et al., 2013; Yang et al., 2013; Jinek et al., 2012). It causes DNA double strand break (DSB), which is usually repaired by the non-homologous end-joining (NHEJ) repair pathway, which results in deletion or insertion mutations. It also can be repaired by homology-directed-repair (HDR). A DNA fragment flanking by homologous sequences can be inserted into the cleaved site based on homologous recombination, the efficiency of which can be 5000 times higher than traditional homologous recombination (Donoho et al., 1998). Due to these advantages, the inventors plan to use the CRISPR/HDR technique to knock-in the transcription control cassette (TCC) into the Lrba genomic locus in order to generate the all-in-one mouse model and generate four mouse lines (
1. Generation of Tri-lox (α) Mice by CRISPR.
Generation of Tri-lox (α) mice by CRISPR will be done as previously described (Yang et al., 2013). Briefly, T7 promoter will be added to the Cas9 coding sequence (CDS) and Lrba sgRNA by PCR using the primer pairs (Yang et al., 2013). The RNAs will be synthesized by T7 RNA polymerase and purified. Cas9 mRNA (100 ng/ml), sgRNA (50 ng/ml) and 200 ng/ml target plasmid DNA will be injected into the fertilized B57BL/6 eggs. The genomic DNA from targeted and control mice, age 8 to 12 days, will be extracted from clipped toes and used for PCR screening: The correct 5′ and 3′ end targeting will be confirmed by long range PCR protocol (Wang et al., 2002) using the primers from the vector and the genomic DNA sequence outside of the short or the long arms (
2. Generation of Multiple Lines by Cre Recombination.
The correctly targeted mice will be mated with C57BL/6 mice to obtain F1 germline transmission mice (
It is expected that the all-in-one mouse model and its three derivatives will be successfully generated and can be used to study Lrba in a spatiotemporal and trackable manner in a single mouse and single tube with high resolution, so that the genetic diversity in the human population can be mimicked to facilitate gene-gene interactions to better understand and treat complex human diseases associated with deregulated LRBA. This model can be made available to scientists who are interested in LRBA and LRBA regulated genes, such as NFκB, MAPKs, AKT, TNFα, IL10 and CVID receptors, and EGFR, NOTCH and RAS/MAPK pathways critical in CVID, autoimmunity and basic immunology. One potential concern is that the CRISPR targeting may induce off-target mutations (Yang et al., 2013; Fu et al., 2013). To detect any such potential mutations, the top 20 genome-wide off-target sites identified by the CRISPR Design tool will be amplified by PCR and sequenced. If mutations are detected, then the founders will be backcrossed with C57BL/6 mice multiple times until no mutations can be detected. Another potential concern is that since the kn fragment is large, 5.9 kb, the CRISPR/HDR one-step targeting rate may be low. The targeting rate is usually one in 20 for the CRISPR/HDR, which has been successfully used to knock-in a 3 kb transgene cassette (Yang et al., 2013). If this targeting fails, as thousands of ES clones can be screened by PCR, and double strand break (DSB) can increase targeting efficiency by 5000 fold (Donoho et al., 1998), the inventors will use the traditional ES cell gene targeting procedure combined with the CRISPR technique to obtain the targeted ES cells to obtain kn mice. An alternative to ES cells is to test three different sgRNA to obtain the highest DSB rate in ES cells.
Vesicle trafficking is required for homeostasis of membrane receptors through cell membrane deposition, oligomerization, phosphorylation, internalization, recycling and degradation (Wiley, 2003). As a vesicle trafficking regulator, LRBA may regulate CVID cell membrane receptors. LRBA regulates two cell membrane receptors: epidermal growth factor receptor (EGFR) and NOTCH (Wang et al., 2004; Yatim et al., 2012; Volders et al., 2012). Three CVID receptors CD19, CD20 and BAFFR are downregulated in LRBA deficient patients (Lopez-Herrera et al., 2012). The preliminary data show that LRBA knockdown down-regulates CVID receptors in vitro. The inventors' working hypothesis is that LRBA regulates the cell surface levels of CVID receptors on B cells in vivo. The inventors will use the β kn mice labeled as Lrba-β/β which express rtTA (
1. Generation of Lrba-β/β creERT+/− mice: As shown in
2. Dox-induced Lrba gene expression. The Lrba-β/β creERT+/− mice and wt C57BL/6 mice (male, 6 wks of age) will be treated with or without Dox. rtTA will activate Lrba expression in Lrba-β/β creERT+/− mice in the presence of Dox. This kn switch will allow for endogenous expression of Lrba which can be turned on (+Dox) or off (−Dox).
3. Tam-induced Cre mosaic recombination. Three types of B cells (
4. Flow cytometry: As mouse peripheral blood is limited, the inventors will use a no-lyse, no-wash staining flow cytometry technique using 20 μL of whole blood for each analysis (Weaver et al., 2002; Weaver et al., 2010). The B cells in the peripheral blood will be subjected to multiparametric flow cytometry of the CVID receptors using mouse-specific antibodies against BAFFR, TACI, CD19, CD20, CD21 and CD81. Two panels will be used and live/dead discrimination will be determined using DAPI or 7AAD. Data acquisition will be performed. At least 10,000 cells will be collected using an LSRII flow cytometer then analyzed with FACSDiva software with the gating strategy: the first gating will exclude cell debris based on a forward scatter/side scatter plot, the second gating will exclude non-B220 (pan B-cell marker) cells, and the third gating will be for RFP and CFP.
It is expected that the mouse model will work as shown in
Prelimary Results: LRBA Knockdown (KD) Upregulates NFκB, AKT, P38, JNK, TNFα and IL-10 but Downregulates ERK and CVID Receptors.
LRBA is LPS-responsive and an anchoring protein for protein kinase A (PKA) (Wang et al., 2001; Kerr et al., 1996). LPS and PKA can activate NFκB, which is implicated in the pathogenesis of human immunodeficiency diseases (Kurylowicz and Nauman, 2008). This NFκB signaling may be affected by LRBA deficiency. The results show LRBA KD increases NFκB activity in a dose-dependent manner (
Target Vector and Single Guide RNA (sgRNA)/Cas9 Vector Construction.
The target vector was constructed from BAC a clone using homologous recombineering (HR). The subsequent engineering of the vector included multiple rounds of HR (Quick & Easy Conditional Knock Out Kit), Seamless Ligation Cloning Extract (SLICE) and traditional ligation. The correct cloning of the target vector was characterized by restriction digestion (
LRBA gene deficiency causes severe primary immunodeficiency disease, which is manifested as multiple diseases with highly variable symptoms, such as common variable immunodeficiency (CVID), FOXP3 deficiency-like syndrome, autoimmune lymphoproliferative syndrome, inflammatory bowel diseases etc. LRBA overexpression is found in multiple cancers, and it is a molecular signature for breast cancer mortality and recurrence. In addition, LRBA mutations are present in more than 15% of CVID patients. This indicates that there are many unknown genes interacting with the LRBA gene to cause so many different symptoms and diseases.
The severity and prevalence of LRBA mutations make LRBA one of the most important genes related to human health. The two basic functions of LRBA, regulation of cell survival and regulation of vesicle trafficking, are central to the development of lymphocytes, the deregulation of which is the major cause of autoimmunity and immunodeficiency. Complementally, LRBA regulates multiple critical immune regulators. Moreover, LRBA deficient patients have highly variable symptoms, indicating that there are many unidentified genes (modifiers) which interact with LRBA. However, none of these genes' roles in the pathology of LRBA deficiency is yet known. The long-term goal is to identify these critical immune regulators and to study how LRBA interacts with them in vivo to cause specific phenotype in order to better understand this disease. However, there are two critical barriers to progression in this field. 1) The genetic, epigenetic and environmental variations between mice often mask the phenotype of a gene, resulting in ambiguous results. That no mouse model of LRBA deficiency has been described likely reflects the difficulty of modeling this disease with the current techniques. 2) LRBA human “knockouts” show that the phenotype of LRBA is highly dependent on genetic background. To replicate all phenotypes seen in human “knockouts” in mice, an existing target mouse line should be backcrossed with many mouse strains to establish many congenic strains. This is impossible due to cost and time as it takes ˜3 years to change a genetic background, and there are still variations that may mask phenotypes.
The all-in-one animal model of the invention will overcome two critical barriers to modeling this complex human disease (as well as others) by using a variation-free phenotyping technique to quickly identify these LRBA-interacting genes, so that a clear correlation between the phenotype and genotypic can be determined in order to better understand the disease.
Generation of an all-in-one animal model in which wild type, heterozygote, and knockout lymphocytes can be produced in a single animal, and the three genotypes are labeled with three different fluorescent colors will overcome these problems. Phenotypes can thus be studied by flow cytometry in a single tube with high sensitivity due to the elimination of variations. Furthermore, since each genotype will have the other two genotypes as controls in a single animal, the F1 mice can be used directly to identify genetic modifiers. This can significantly save in time and cost. Thus, it is possible now that Lrba (or other genes) can be studied on various genetic backgrounds, which is required for recapitulating this complex human disease. Recapitulating the key features of LRBA deficient disease is important. The impaired lymphocyte development is one of the most important features. The all-in-one animal model will allow highly sensitive detection of any abnormalities in lymphocyte development.
All patents, patent applications, provisional applications, and publications referred to or cited herein, supra or infra, are incorporated by reference in their entirety, including all figures and tables, to the extent they are not inconsistent with the explicit teachings of this specification.
It should be understood that the examples and embodiments described herein are for illustrative purposes only and that various modifications or changes in light thereof will be suggested to persons skilled in the art and are to be included within the spirit and purview of this application and the scope of the appended claims. In addition, any elements or limitations of any invention or embodiment thereof disclosed herein can be combined with any and/or all other elements or limitations (individually or in any combination) or any other invention or embodiment thereof disclosed herein, and all such combinations are contemplated with the scope of the invention without limitation thereto.
The history of mouse genetics demonstrates that “clear and consistent phenotypes are the exception rather than the rule” [1]. Similarly, in human genetics, genetic risks have not been determined for the majority of common human diseases [2]. Both reflect the difficulty of defining a clear correlation between phenotype and genotype. 1) This is due to the fact that genetic and other experimental variations mask the phenotype of a gene [3,4]. The inventors thus propose a variation-free phenotyping method to increase phenotyping sensitivity so that “clear and consistent phenotypes are the rule” (
In contrast, animal and methods of the invention allow F1 mice to be used for phenotyping (
Genetic mosaics with different genotypes residing in the same organisms have been widely used to study biological processes [7]. Bearing some similarity, a loxP-STOP-TRE-loxP cassette has been used to control gene expression [8]. The inventors will experimentally test the utility of this model. Human LRBA and murine Lrba proteins have ˜94% homology. The phenotypes of a LRBA orthologue mutant in a fly can be rescued by a mouse LRBA-isoform transgene, demonstrating that LRBA function is highly conserved [9]. A goal is to overcome two critical barriers to generating a model that is truly representative of human diseases.
The deficiency of the LRBA gene causes primary immunodeficiency (PID), which is manifested as common variable immunodeficiency (CVID), FOXP3 deficiency-like syndrome [11] autoimmune lymphoproliferative syndrome (ALPS) [12], inflammatory bowel disease (IBD) etc. with highly variable symptoms in different patients. This indicates that genetic background plays an important role in determining phenotype (symptoms). The data also show that Lrba deficiency causes opposite phenotypes in different mouse strains. On the other hand, LRBA overexpression is found in multiple cancers [13] and is a molecular signature for breast cancer mortality and recurrence [14]. Moreover, LRBA mutations are present in more than 15% of CVID patients [15]. The severity and prevalence of LRBA mutations make LRBA one of the most important genes in human health. The inventors discovered two basic functions of LRBA, cell survival regulation [13] and vesicle trafficking regulation [5,68], which is required for the regulation of one-third of human proteins [16]. These functions are central to the development of lymphocytes, the deregulation of which is the major cause of autoimmunity and immunodeficiency. Supplemental to its functions, LRBA regulates multiple critical immune regulators [13] (
However, the inventors were faced with two critical barriers to modeling this complex human disease. 1. The history of mouse genetics shows that “clear and consistent phenotypes are the exception rather than the rule” [1]. In the inventors' opinion, this is due to the fact that genetic, epigenetic, and environmental variations between intra-strain mice mask the phenotype of a gene, resulting in low sensitivity of phenotyping and thus ambiguous data. 2. It takes ˜3 years to change a genetic background, limiting the function study of a gene to one strain and making it impossible to use multiple strains. Each mouse strain is analogous to only a single human and thus lacks the genetic diversity required to identify genetic modifiers specific to different symptoms or diseases. To recapitulate human genetic variations in the population to find these modifiers, Lrba should be studied on a wide genetic background by backcrossing an existing target line with many mouse strains. That, to date, no mouse model of LRBA deficiency has been described by others likely reflects the difficulty of modeling this disease with the current techniques. To overcome these problems, the inventors propose to generate an all-in-one mouse model so that wild type (wt), heterozygote (het), and knockout (ko) cells can be produced and studied in a single mouse and tube without the interference of variations. This will allow for the identification of genetic modifiers through sensitive phenotyping and rapid genetic background change (
The inventors propose generating an all-in-one mouse model in which Lrba expression can be turned on/off in a spatiotemporal and trackable manner. A transcription control cassette (TCC) will be inserted into the Lrba genomic locus by the Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR) technique. To test the functionality of the all-in-one mouse model in recapitulating the complex LRBA deficiency disease. The impact of Lrba deficiency on the development of lymphocytes will be determined in this model to provide a proof of principle for modeling complex immunodeficiency diseases in a single mouse.
The invention has the potential to solve the two critical barriers to modeling complex diseases, a task which requires sensitive phenotyping and great genetic diversity. The human LRBA and mouse Lrba genes have been cloned, a conditional knockout mouse model has been generated [10,16], and a novel CRISPR method for constructing a target vector has been developed [19]. These accomplishments ensure the successful generation of the all-in-one animal model for us as well as others with this unique opportunity to facilitate a highly sensitive study of the LRBA gene, a novel, unique immune regulator important in understanding a critical aspect of immunodeficiency, as well as other genes.
LRBA gene: 1) LRBA deficiency causes multiple diseases with highly variable symptoms in different patients [11,12,20-23]. The immune dysregulation, polyendocrinopathy, enteropathy, X-linked (IPEX)-like syndrome [11], autoimmune lymphoproliferative syndrome (ALPS) [12], common variable immunodeficiency (CVID) [20,21], inflammatory bowel diseases (IBD) [23], etc. were described in LRBA deficient patients. Some patients have early onset primary immunodeficiency (PID), while others initially present with autoimmunity but normal antibody levels, or IBD without other autoimmunity [23]. The same mutation causes hypogammaglobulinemia in one family but not in the other [12,21]. The highly variable symptoms in different patients indicate that genetic background plays an important role in determining phenotype (symptoms). The data also show that Lrba deficiency causes opposite phenotypes in different mouse strains (
The conventional mouse model technique has two major barriers to modeling complex human diseases. 1) The history of mouse genetics demonstrates that “clear and consistent phenotypes are the exception rather than the rule”, i.e. phenotypes cannot be detected in many knockouts [1]. The inventors propose this is due to the fact that genetic [25,26], epigenetic, and environmental variations between the intra-strain mice mask the phenotype of a gene, resulting in low sensitivity of phenotyping and thus ambiguous data [3,25-28]. These variations may result from the genes tightly linked to the targeted gene [26,27], single housing [29], or infections [30-37]. The phenotypes originally attributed to the targeted gene are actually caused by other genes [26,27]. 2) LRBA human “knockouts” show that the phenotype of LRBA is highly dependent on genetic background, which indicates that there are many unidentified genes (modifiers) interacting with LRBA to cause different diseases. To detect all phenotypes seen in human “knockouts”, an existing target mouse line should be backcrossed with as many strains as possible to establish multiple congenic strains so that these unknown modifiers contributing to different symptoms can be identified in order to understand LRBA deficient diseases [38-40]. This is currently impossible due to cost and time as it takes ˜3 years to change a genetic background. Moreover, mice from a congenic strain still have genetic variations in addition to other variations. No mouse model of LRBA deficiency has yet been described. This is unsurprising given that the symptoms of LRBA deficiency are highly variable, suggesting that some phenotypes may not present in one strain but may present in others and vice versa, and that the phenotype is masked by variations. This reflects the difficulty of modeling LRBA deficiency with the current techniques.
To overcome these problems, the inventors propose a variation-free phenotyping technique by generating an all-in-one mouse model, in which Lrba expression can be turned on/off in a spatiotemporal and trackable manner so that wt, het, and ko B cells with different FP colors can be produced in a single mouse. This will allow: 1) Highly sensitive phenotyping. Phenotypes can be studied by flow cytometry in a single mouse and in a single tube with high sensitivity as there will be no interference from genetic, epigenetic, environmental and procedure variations (
The proposed work is highly innovative due to the following key features: 1) The concept of phenotyping in a single mouse and single tube will eliminate genetic, environmental, and procedure variations that can contribute to or even mask the phenotype [26,27]. In addition, flow cytometry can be used to analyze millions of cells with multiplex ability. Consequently, without the interference of variations, small differences in phenotype can be detected, allowing for high resolution of the phenotyping required for those phenotypes that are caused by less penetrance (more dependence on genetic background) of the targeted gene. 2) Since controls are not a requisite, establishing a congenic strain is not necessary. The all-in-one model can be crossed with many strains to allow the study of Lrba on a wide genetic background, which is required for mimicking genetic diversity in the human population. F1 mice can be used directly for experiments. This can significantly reduce both time and cost. 3) The single mouse concept is also very useful in the study of gene-gene interactions. If a significant difference is detected in phenotypes from two mice, it may indicate the presence of a modifier gene(s) [41], which can be determined by the deep sequencing of the whole exome from the two mice [6]. Therefore, this model can be used to quickly discover gene-gene interactions, which are critical in the study of complex human diseases. 4) Knockout, overexpression, and reporter mouse models are usually generated separately, and only one model is studied in most labs due to time and cost. With the all-in-one model of the invention, LRBA expression can be manipulated in multiple ways and the three genotypes can be generated in a single mouse, making the data more comparable. 5) GWAS has failed to establish common variant risks for the majority of common diseases. The inventors' strategy, featuring sensitive phenotyping and rapid genetic background change, offers a quick and elegant tool for identifying the genes underlying common diseases in humans.
The preliminary data demonstrate that the phenotype of Lrba deficiency is highly dependent on the genetic background in mice. This supports the concept that LRBA regulates multiple critical immune regulators and further strengthens the necessity of generating an all-in-one mouse model. The data also demonstrates that this project is technically feasible.
1. Phenotype of Lrba KO depends on genetic background and LRBA regulates multiple critical immune genes. Lrba deficiency causes different phenotypes in different strains. As the amount of C57BL/6 background increases, so does the percentage of pups found positive for the Lrba knockout allele (
2. Target vector and single guide RNA (gRNA)/Cas9 vector construction: The target vector was constructed from a BAC clone using homologous recombineering (HR). The subsequent engineering of the vector included multiple rounds of HR, and the CRISPR cloning method [19]. The correct cloning of the target vector was characterized by restriction digestion and confirmed by sequencing the TCC and the junctions between the 3′ arm/5′ arm/TCC/backbone regions. Portions of sequence traces around loxP sites show accurate cloning (
Since the phenotypes caused by the LRBA gene are highly dependent on genetic background and environmental factors, highly sensitive phenotyping and rapid genetic background change are essential in order to recapitulate LRBA deficient human disease in a mouse model. A unique mouse model that has these two prerequisites will be generated.
Generation an all-in-one mouse model in which Lrba expression can be turned on/off in a spatiotemporal and trackable manner.
The goal of this research was to generate the all-in-one Lrba mouse model by knocking in a transcription control cassette (TCC) into the Lrba locus in order to control Lrba expression in a spatiotemporal and trackable manner (
Target Vector Design.
The target vector has been constructed (
Generation of the All-in-One (a) Mice by CRISPR.
This will be done in collaboration as described [47]. In brief, T7 promoter will be added to the Cas9 coding sequence (CDS) and to Lrba gRNA by PCR [47]. The RNAs will be synthesized by T7 RNA polymerase and purified. Cas9 mRNA (100 ng/ml), gRNA (50 ng/ml), and a target plasmid DNA (200 ng/ml) will be injected into the fertilized B57BL/6 eggs. The genomic DNA from the targeted and the control mice, ranging from 8 to 12 days, will be extracted from clipped toes and used for PCR screening: The correct 5′ and 3′ end targeting will be confirmed by PCR protocol [10] by using the primers from the vector and the genomic DNA sequence outside of the short or the long arms (
It is expected that the all-in-one mouse model will be successfully generated within one year. This model will be made available to scientists who are interested in LRBA. Potential pitfalls include: 1) The off-target mutations induced by CRISPR targeting is high in cultured cells [47,65], but is low or undetectable in mice [53,54]. The inventors will use the whole exon sequencing (WES) in order to detect any off-target mutations. If mutations are detected, the founders will be backcrossed with C57BL/6 mice multiple times until no mutations can be detected. 2) Although the size of the TCC (5.9 kb) was of concern in the previous application, the inventors have since successfully inserted the 7.4 kb (with neo) TCC into the Lrba locus in ES cells at ˜70% efficiency (
Testing the functionality of the all-in-one mouse model in recapitulating the key features of LRBA deficient disease.
Accumulated evidence suggests that LRBA deficient disease results from the impaired regulation of cell death during lymphocyte development, as fewer mature lymphocytes but more immature lymphocytes were observed [11,12,17,18]. These lymphocytes, e.g. Treg cells [11] and class switched B cells [20,21], are significantly reduced (cytopenia), which may result in autoimmunity [11] and antibody deficiency, respectively [20,21]. Follicular lymphoid hyperplasia (ALPS) [12,20-22] and tissue infiltration of lymphocytes were also observed [21]. Both B and T lymphocytes undergo massive cell death at multiple developmental stages in order to eliminate non- or self-reactive lymphocytes. LRBA has two basic functions that can be involved in lymphocyte development. 1) Cell survival regulation. [13] LRBA knockdown increases cell survival in B cells (
The phenotype of αCreERT2 mice generated can be studied before adding Dox (Lrba negative), after adding Dox (Lrba positive), or after Dox withdrawal (Lrba negative) in the same mouse. Furthermore, mosaic lymphocytes with the three genotypes (wt, het, and ko) can be generated by incomplete Cre recombination in the same mouse (
Tesing whether Lrba expression can be turned on/off in a spatiotemporal and trackable manner.
This test is informative as it will allow phenotype to be attributed specifically to Lrba if they change in response to Lrba expression being switched on/off. As mouse peripheral blood is limited, the inventors will use a no-lyse, no-wash staining flow cytometry technique that uses 20 μL of whole blood for each analysis [73,74]. The expression of FPs, two CVID receptors:TACI and BAFFR, critical regulators of cell survival [75,76] in the peripheral white blood cells will be examined by flow cytometry.
Recapitulation of LRBA deficiency on the development of lymphocytes.
In addition, LRBA is expressed ubiquitously, especially in hematopoietic cells and stem cells [13,15]. The inventors will use the multiparametric flow cytometry developed by BD Biosciences to study the development of B and T lymphocytes in this model. (a) Analysis of B-cell developmental stages in mouse bone marrow. [77] Seven different developmental phases can be discriminated in bone marrow by a panel of seven B cell surface markers. Pre-pro-B, Pro-B, and Pre-B cells can be distinguished within the low positive CD45R population based on their differential expression of BP1 and CD24. Immature, transitional, and early and late mature B cells could be segregated based on differential expression of IgM and IgD [77]. To study the effect of Lrba ko in the periphery, mature B cell presence in mouse spleens will also be determined. (b) Analysis of T-cell developmental stages in thymus. [78] The six developmental stages, four double-negative (DN1, DN2, DN3, DN4), double-positive (DP) and single-positive (SP), can be discriminated in thymus by an 8-color panel of cell surface markers. B220 negative events will be gated first, and then CD4 vs CD8 will identify the DN, DP, and SP cell populations. CD44 vs CD25 will identify the four substages (DN1 to DN4) in DN cell population. The low, intermediate, and high expression levels of TCR β corresponds to DN, DP and SP cells. CD69 and CD5 will be included in the panel because they are indicators for positive selection and the intensity of TCRs and self MHC-peptides interactions. Mature T cell presence in mouse spleens and lymph nodes will also be determined to study the effect of Lrba ko in the periphery. The activation state of B cells or T cells will also be determined in the periphery using MHC class II, CD40 and CD86 for B cells and, CD44, CD62L and CD25 for T cells, respectively. *Flow cytometry methodologies: LSR-II flow cytometer, which has an analysis rate of up to 40,000 cells per second, and the capacity to measure 15 cell markers. 40 samples can be analyzed within a couple of hours. 1) To exclude cell aggregates, two sequential gates of scatter width vs height signals will be applied. The singlet population will then be gated by forward scatter vs side scatter to exclude dead cells and debris. “Live cells” will be gated using a contour plot and then switch to a dot plot for easy monitoring of acquisition. 2) A FMO-control (Fluorescence Minus One) is a control sample composed of all antibody labels except one, and will be used as a negative control in place of an isotype control for that antibody staining. 3) All antibodies will be titrated. 4) The fluorophores used by BD that overlap with iRFP of CFP will be replaced with other fluorophores. (c) Others: 1) The One Step Staining Mouse Treg Flow™ Kit (BioLegend) will be used to detect Treg cells in mouse spleens. 2) Plasma antibody isotyping (IgG1, IgG2a, IgG2b, IgG3, IgA and IgM) will be performed using the Pierce Rapid ELISA Mouse mAb Isotyping Kit. 3) Lymph nodes, lung and intestine sections will be analyzed by fluorescent microscopy [72] to examine lymphoproliferation and lymphocyte infiltration. 4) Comprehensive standardized gross and histopathologic analyses will be performed, including the analyses of organ weights, serum chemistries and hematology. The inventors will use 12 age-matched mice of both sexes, 6 mutant and 6 wild-type controls for the analyses.
Recapitulation of LRBA deficiency on different genetic backgrounds: The phenotype is different in the two strains (
The impact of LRBA deficiency on the development of B or T cells at each stage has not been investigated yet. The all-in-one mouse model will allow highly sensitive detection of any abnormalities in B and T cell development caused by Lrba deficiency, which may not be detectable with the current techniques. 1) Based on human data, it is expected, as shown in
The animals and cells obtained therefrom are an elegant tool to better recapitulate complex human diseases. In one embodiment, the invention provides a conditional gene knockout technique to study genes in a single animal at high resolution. Currently, clear and consistent phenotypes are the exception rather than the rule. This is because the variations between individuals can mask a gene's contribution to a phenotype. The invention can eliminate these variations by allowing the study of three genotypes of a gene in a single animal in a single flow cytometry tube. Phenotyping sensitivity thus can be greatly increased, e.g., by up to one million times theoretically, and the technique can detect the phenotypes undetectable currently.
Optionally, the targeting strategy can be modified to reduce the transgene size and simplify this animal model while retaining the advantages of the conditional knockout technique. The present invention provides a conditional gene knockout technique to study the genotype-phenotype relationship (GPr) at high resolution (
Targeting Vector Construction.
(1) Redesigned targeting vector: Fluorescent proteins (FP) [92] and site-specific recombinases, e.g., Cre, are indispensable and widely-used tools in the analysis of gene functions in a visible and spatiotemporal manner, respectively [93]. There are two potential disadvantages associated with the inventors' former model. One is that it may be difficult to control the Cre-recombination to obtain equal numbers of wt and ko alleles. Another is that the default inactivation of the targeted gene by the knockin can be lethal if this gene is essential for survival. To overcome these potential issues, the inventors redesigned kn transgene (GSLC, ˜2 kb.
(2) Innovation: While this GSL model can be used as a regular conditional knockout model, it will allow investigators to generate and respectively label the three genotypes of the targeted gene to study the phenotypes of a gene in a single (e.g., a single mouse) with high resolution (
(3) Cloning strategy of the redesigned targeting vector: The GSLC that has polycloning sites at the both ends has been cloned (
Obtaining Knock-In ES Cell Clones.
The Cas9/gRNA system causes DNA double strand break, which can be repaired by the homology-directed-repair (HDR) pathway.26 Using this method, the inventors successfully inserted a 7.4 kb fragment into ES cells at high efficiency [89]. The above resultant targeting vector will be electroporated into ES cells. The NeonGreen (like GFP)-positive clones will be picked up and subjected to 3′ and 5′ end PCR screening of homologous recombination [89]. The correctly targeted clones will be confirmed by Southern blot [87,89].
Testing the Functionality of the GSLC.
GSLC is used to switch and label the genotypes, and will be tested in cell culture prior to generating a mouse model, which is costly and time-consuming.
(1) The Cre recombinasemediated inversion will be tested in bacteria. Cre recombinase-mediated inversion will be examined by transforming the 706-Cre plasmid (Gene Bridges GmbH) into the bacteria containing the targeting vector, and then by restriction enzyme digestion of the isolated DNA (
(2) The functionality of the GSLC will be tested in ES cells. The GSLC will be knocked into the Lrba genomic locus in ES cells as previously [89]. Green FP positive clones will be picked up and screened by PCR, and confirmed by Southern blot following the inventors' methods [89]. The correctly targeted ES clones will be infected with the recombinant retroviruses (MSCV.CreERT2.puro. Addgene) and selected by puromycin for stable clones, which will be used for Tamoxifen (TAM)-induced Cre-cleavage assay. The exclusive expression of Neongreen and Orange2 will be directly detected by flow cytometry. The association of the genotypes with the FP colors will also be confirmed by sorting the three genotypes of cells and conducting PCR and Western blot, and further confirmed by Southern blot [89]. It is expected that the three genotypes of Lrba will be generated and specifically labeled in ES cells with the ratios: wt:het:ko=1:2:1. The knockin vector may need to be modified to obtain optimal genotype-switching and labeling results. This can be easily done by using the CRISPR cloning technique [88]. In summary, this technique can be used to obtain high resolution data while retaining all the function of the current conditional knockout technique. This is a significant improvement.
Examples of embodiments of the invention include, but are not limited to:
Embodiment 1: A non-human animal model comprising three populations of a cell type, wherein each cell population has a different genotype for a gene, wherein the three cell populations comprise:
(a) a first population of cells having a wild-type gene;
(b) a second population of cells heterozygous for the gene; and
(c) a third population of cells having an inactivated version of the gene;
wherein the three populations of cells include a transcription control cassette operably linked at the genomic locus of the gene.
Embodiment 2: The non-human animal model of embodiment 1, wherein the three populations of cells comprise a first detectable label, a second detectable label, and a third detectable label, respectively, wherein each detectable label is distinguishable from the other detectable labels, wherein each detectable label is encoded by a nucleic acid sequence linked to the respective gene, and wherein the expression of the nucleic acid sequence is dependent upon expression of the gene (co-expressed) in the respective cell population.
Embodiment 3: The non-human animal model of embodiment 2, wherein the detectable label is a fluorescent label or luminescent label.
Embodiment 4: The non-human animal model of embodiment 2, wherein each detectable label is a fluorescent label selected from the group consisting of green fluorescent protein (GFP), red fluorescent protein (RFP), and cyano fluorescent protein (CFP).
Embodiment 5: The non-human animal of any one of embodiments 1 to 4, wherein the animal model is a rodent.
Embodiment 6: The non-human animal of any one of embodiments 1 to 4, wherein the animal model is a mouse.
Embodiment 7: The non-human animal of any one of embodiments 1 to 4, wherein the animal model is a primate.
Embodiment 8: The non-human animal model of any one of embodiments 1 to 7, wherein the gene is expressed in B cells in humans.
Embodiment 9: The non-human animal model of any one of embodiments 1 to 8, wherein the gene is Lrba.
Embodiment 10: The non-human animal model of any preceding embodiment, wherein the non-human animal model comprises three populations of B cells, and wherein the three populations of B cells comprise:
(a) a first population of B cells having a wild-type lipopolysaccharide (LPS)-responsive beige-like anchor (Lrba) gene;
(b) a second population of B cells heterozygous for the Lrba gene; and
(c) a third population of B cells having an inactivated Lrba gene;
wherein the three populations of B cells include a transcription control cassette operably linked at the Lrba genomic locus.
Embodiment 11: A method for studying phenotypes, comprising:
providing a non-human animal model of any one of embodiments 1 to 10;
analyzing one or more of the phenotypes of the non-human animal model in the presence and/or absence of an exogenous agent.
Embodiment 12: The method of embodiment 11, wherein said analyzing comprises analyzing the characteristics and/or behavior of one or more of the cell populations of the animal model.
Embodiment 13. The method of claim 11 or 12, wherein said analyzing comprises subjecting cells of the animal model to flow cytometry.
Embodiment 14: The method of any preceding embodiment, wherein said analyzing comprises measuring the detectable label of one or more of the cell populations of the animals and, optionally, comparing the measured detectable label to that of one or both of the other detectable labels.
Embodiment 15: The method of any preceding embodiment, further comprising activing or deactivating the transcription control cassette to induce or inhibit expression of the gene.
Embodiment 16: The method of any preceding embodiment, wherein the exogenous agent is a small molecule or biologic molecule that is administered to the animal model.
Embodiment 17: A composition comprising a plurality of populations of cells from a single non-human animal, wherein said plurality of populations of cells comprise at least two of the following populations of cells:
(a) a first population of cells having a wild-type gene;
(b) a second population of cells heterozygous for the gene; and
(c) a third population of cells having an inactivated version of the gene;
wherein the three populations of cells include a transcription control cassette operably linked at the genomic locus.
Embodiment 18: The composition of embodiment 17, wherein the cell populations are B cell populations.
Embodiment 19: The composition of embodiment 18, wherein said plurality of populations of cells comprise at least two of the following populations of cells:
(a) a first population of B cells having a wild-type lipopolysaccharide (LPS)-responsive beige-like anchor (Lrba) gene;
(b) a second population of B cells heterozygous for the Lrba gene; and
(c) a third population of B cells having an inactivated Lrba gene;
wherein the three populations of B cells include a transcription control cassette operably linked at the Lrba genomic locus.
Embodiment 20: The composition of embodiment 18, wherein the three populations of cells comprise a first detectable label, a second detectable label, and a third detectable label, respectively, wherein each detectable label is distinguishable from the other detectable labels, wherein each detectable label is encoded by a nucleic acid sequence linked to the respective gene, and wherein the expression of the nucleic acid sequence is dependent upon expression of the gene (co-expressed) in the respective cell population.
Embodiment 21: The composition of any preceding embodiment, wherein the non-human animal is a rodent or primate.
Embodiment 22: The composition of any preceding embodiment, wherein the cells are B cells.
Embodiment 23: The composition of any preceding embodiment, wherein the composition comprises isolated B cells or a tissue comprising the B cells.
Embodiment 24: The composition of any one of embodiments 18 to 23, wherein the composition is peripheral blood from the non-human animal.
Embodiment 25: A method for studying phenotypes, comprising:
Embodiment 26: The method of embodiment 25, wherein said analyzing comprises analyzing the characteristics and/or behavior of one or more of the cell populations.
Embodiment 27: The method of embodiment 25 or 26, wherein said analyzing comprises subjecting cells of the animal model to flow cytometry.
Embodiment 28: The method of any one of embodiments 25 to 27, wherein said analyzing comprises measuring the detectable label of one or more of the cell populations of the animals and, optionally, comparing the measured detectable label to that of one or both of the other detectable labels.
Embodiment 29: The method of any one of embodiments 25 to 28, further comprising activing or deactivating the transcription control cassette to induce or inhibit expression of the gene.
Embodiment 30. The method of any one of embodiments 25 to 29, wherein the exogenous agent is a small molecule or biologic molecule that is administered to the animal model.
The present application claims the benefit of U.S. Provisional Application Ser. No. 62/149,424, filed Apr. 17, 2015, which is hereby incorporated by reference herein in its entirety, including any figures, tables, nucleic acid sequences, amino acid sequences, or drawings.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/US2016/028148 | 4/18/2016 | WO | 00 |
Number | Date | Country | |
---|---|---|---|
62149424 | Apr 2015 | US |