The delivery of molecules (e.g., biological molecules, such as proteins and nucleic acids (e.g., DNA, RNA, including DNA plasmids shRNA, and mRNA) and small molecules) to the cells of the central nervous system (“CNS”) and peripheral nervous system (PNS) is limited by a number of factors, including poor target specificity as well as difficulties in targeting post-mitotic cells that are no longer dividing. Therefore, there is a need to develop methods, compositions, and systems for effectively delivering therapeutic proteins, nucleic acids (such as RNA), and small molecules to the nervous system.
This invention relates to the discovery that molecules, such as proteins and nucleic acids, including ribonucleic acids (RNAs), as well as small molecules, can be loaded into microvesicles, specifically ARRDC1-mediated microvesicles (ARMMs), for delivery to the nervous system, specifically cells of the CNS and PNS. The ARMMs can incorporate viral envelope proteins to allow for delivery of molecules to the nervous system. For example, vesicular stomatitis virus G protein (VSV-G) or rabies virus glycoprotein (RVG) can be co-expressed in, and appear on, the surface of ARMMS to target cells of the CNS and PNS. These proteins normally function to aid viral attachment and entry of viruses into cells. For example, VSV-G mediates viral attachment to LDL receptors (LDLR) or LDLR family member, and RVG is known to use the nicotinic acetylcholine receptor and the low affinity nerve growth factor receptor for viral entry. It has been found that these proteins can also aid ARMMs to attach to cells, including cells of the nervous system.
In addition, the ARMM delivery system, described herein, addresses many limitations of current delivery systems that prevent the safe and efficient delivery of proteins and nucleic acids (e.g., RNAs (including both RNA coding for proteins and non-coding RNA) to CNS and PNS cells. As ARMMS are derived from an endogenous budding pathway, they are unlikely to elicit a strong immune response, unlike viral delivery systems, which are known to trigger an inflammatory response (Sen et al., “Cellular unfolded protein response against viruses used in gene therapy.” Front Microbiology. 2014; 5:250, 1-16). Additionally, ARMMs allow for the specific packaging of any molecules (e.g., biological molecules, such as a protein or nucleic acid (e.g., a DNA plasmid, a mRNA, a miRNA, or a shRNA), or a small molecule). These molecules can then be delivered by fusion with, or uptake by, specific recipient cells/tissues by incorporating antibodies or other types of molecules in the ARMMs that recognize tissue-specific markers.
ARMMs are microvesicles that are distinct from exosomes and which, like budding viruses, are produced by direct plasma membrane budding (DPMB). DPMB is driven by a specific interaction of TSG101 with a tetrapeptide PSAP (SEQ ID NO: 1) motif of the arrestin-domain-containing protein ARRDC1 accessory protein, which is localized to the plasma membrane through its arrestin domain. ARMMs have been described in detail, for example, in PCT application number PCT/US2013/024839, filed Feb. 6, 2013 (published as WO 2013/119602 A1 on Aug. 15, 2013) by Lu et al., and entitled “Arrdcl-Mediated Microvesicles (ARMMs) and Uses Thereof,” as well as in U.S. Pat. Nos. 9,737,480; 9,816,080; 10,260,055; and PCT Publication WO2018/067546; the entire contents of which are hereby incorporated by reference in their entirety. The ARRDC1/TSG101 interaction results in relocation of TSG101 from endosomes to the plasma membrane and mediates the release of microvesicles that contain TSG101, ARRDC1, and other cellular components as well as the molecule of interest.
The molecules of interest, whether naturally and non-naturally occurring such as proteins, nucleic acids, and small molecules, can associate with one or more ARMM proteins (e.g., ARRDC1), or can be modified to associate with TSG101 or ARRDC1. This association facilitates their incorporation into ARMMs, which in turn can be used to deliver the desired payload into a targeted cell. By way of example, but not meant to be limiting, a payload RNA can be fused to a trans-activation response (TAR) element, thereby allowing it to associate with an ARRDC1 protein that is fused to an RNA binding protein, such as a Tat protein. Alternatively, a payload protein can be fused to one or more WW domains, which associate with the PPXY (SEQ ID NO: 2) motif of ARRDC1. This association of the molecule of interest to an ARMM protein (e.g., ARRDC1), facilitates loading of the molecule into the ARRDC1-containing ARMM. Alternatively, the molecule can be fused to an ARMM protein (e.g., TSG101 or ARRDC1) to load the payload into the ARMM. The molecule can be fused to the ARMM protein (e.g., TSG101 or ARRDC1) via a linker that may be cleaved upon delivery to a target cell.
Similarly, synthetic or natural small molecules can be modified to associate with an ARMM protein (e.g., TSG101 or ARRDC1). This association can facilitate their incorporation into ARMMs, which in turn can be used to deliver the molecule to a target cell. Incorporation of a cleavable linker may be used to allow such a molecule to be released upon delivery into a target cell. As a non-limiting example, a small molecule can be linked to biotin, thereby allowing it to associate with an ARRDC1 protein which is fused to an streptavidin. As another non-limiting example, a small molecule can be linked to synthetic high affinity ligand that specifically binds to a mutant form of FKBP12 such as FKBP12(F36V) (Yang W, Rozamus L W, Narula S, Rollins C T, Yuan R, Andrade L J, Ram M K, Phillips T B, van Schravendijk M R, Dalgarno D, Clackson T, Holt D A. Investigating protein-ligand interactions with a mutant FKBP possessing a designed specificity pocket. J Med Chem. 2000 Mar. 23; 43(6):1135-42), which will associate with an ARRDC1 protein which is fused to FKBP12(F36V). The association of the small molecule to an ARMM protein (e.g., TSG101 or ARRDC1), facilitates loading of the small molecule into the ARRDC1-containing ARMM.
As an example, the delivery platform of ARMMs will enable multiple cis-acting structural elements of mRNAs to perform in the context of intracellular and secreted therapeutics for nervous system cells, where these structural elements include but are not limited to: (i) 5′ cap structure; (ii) 5′ untranslated region (UTR); (iii) the codon optimized coding sequence; (iv) 3′ UTR; (v) a 3′ poly-A tail consisting of a stretch of repeated adenine nucleotides; and (vi) inclusion of cis-acting zipcode elements within RNA transcripts that are recognized by specific RNA binding proteins to cause specific cellular localization (e.g., to synapse of neurons) (see, e.g. Chin A, Lécuyer E. RNA localization: Making its way to the center stage. Biochim Biophys Acta Gen Subj. 2017 November; 1861(11 Pt B):2956-297).
As another example, the delivery platform for ARMMs will enable multiple classes of protein and mRNA-based therapeutics to be targeted to nervous system cells, the therapeutics including but not limited to: transmembrane proteins; cytoplasmic proteins; nuclear proteins; mitochondrial proteins; endoplasmic reticulum proteins; Golgi proteins; peroxisome proteins; lysosome proteins; and secreted proteins.
Also contemplated herein in the context of therapeutics for nervous system disorders is the targeted expression of single-chain variable fragment (scFv) antibodies composed of a fusion protein of the variable regions of the heavy (VH) and light chains (VL) of immunoglobulin connected with a short linker peptide. These scFv antibodies can bind selectively to a specific antigen or they can be engineered to be multifunctional by appending to the fusion other protein- or nucleic acid-biding domains, such as for example the case of bispecific scFvs. Alternatively, mRNA encoding both VH and VL chains may be used. Additionally, a single-domain antibody (sdAb), consisting of a single monomeric variable domain can be delivered to the nervous system cells as mRNAs.
Also contemplated in the context of therapeutics for nervous system disorders is the targeted expression of antigenic peptides, or neoantigens, which can occur through the use of ARMM-mediated delivery of a mRNA. The delivered mRNA is translated by the ribosome to produce a neoantigen protein chain which can be processed by the proteasome to produce a neoantigen. This neoantigen can associate with other membrane-bound proteins to display itself, thereby allowing it to be recognized by T-cell receptors on T-cells or other cells of the immune system.
In some aspects of this invention, arrestin domain-containing protein 1 (ARRDC1)-mediated microvesicles (ARMMs) containing a lipid bilayer and an ARRDC1 protein, a molecule, and a viral envelope protein are provided. In other aspects, the viral envelope protein is vesicular stomatitis virus G (VSV-G) or rabies virus glycoprotein (RVG).
In some aspects of the invention, microvesicle-producing cells containing a recombinant expression construct encoding an ARRDC1 protein or a variant thereof under the control of a heterologous promoter, and a viral envelope protein are provided. In other aspects, the viral envelope protein is vesicular stomatitis virus G (VSV-G) or rabies virus glycoprotein (RVG).
In some aspects of the invention, methods of delivering a molecule to a target cell by contacting the target cell with a microvesicle as described herein are provided. In other aspects, the cells are of the nervous system (NS), including the central nervous system (CNS) and the peripheral nervous system (PNS). In yet other aspects, the target cell is a neuron, astrocyte, an oligodendrocyte, or a microglial cell.
In some aspects of the invention, methods of treating a disorder in a patient by administering to the patient a microvesicle or a microvesicle-producing cell as described herein are provided. In other aspects of the invention, the disorder is a disorder of the CNS system. In yet other aspects, the disorder impacts the function of neurons, the function of astrocyte cells, the function of oligodendrocytes, or the function of microglial cells. In other aspects of the invention, the disorder is either a gain-of-function disorder, a loss-of-function disorder, or a repeat expansion.
Other advantages, features, and uses of the invention will be apparent from the detailed description of certain exemplary, non-limiting embodiments; the drawings; the non-limiting working examples; and the claims.
The term “ARMM,” as used herein, refers to a microvesicle comprising an ARRDC1 protein or variant thereof, and/or TSG101 protein, or variant thereof. In some embodiments, the ARMM is shed from a cell, and comprises a payload for example, a nucleic acid, protein, or small molecule, present in the cytoplasm or associated with the membrane of the cell. In some embodiments, the ARMM is shed from a transgenic cell comprising a recombinant expression construct that includes a transgene, and the ARMM comprises a gene product, for example, an RNA transcript and/or a protein (e.g., an ARRDC1-Tat fusion protein and a TAR-payload RNA) encoded by the expression construct. In some embodiments, the ARMM is produced synthetically, for example, by contacting a lipid bilayer with an ARRDC1 protein, or variant thereof, in a cell-free system in the presence of TSG101, or a variant thereof. In other embodiments, the ARMM is synthetically produced by contacting a lipid bilayer with HECT domain ligase, and VPS4a. In some embodiments, an ARMM lacks a late endosomal marker. Some of the ARMMs provided herein do not include, or are negative for, one or more exosomal biomarker. Exosomal biomarkers are known to those of skill in the art and include, but are not limited to, CD63, Lamp-1, Lamp-2, CD9, HSPA8, GAPDH, CD81, SDCBP, PDCD6IP, ENO1, ANXA2, ACTB, YWHAZ, HSP90AA1, ANXA5, EEF1A1, YWHAE, PPIA, MSN, CFL1, ALDOA, PGK1, EEF2, ANXA1, PKM2, HLA-DRA, and YWHAB. Certain ARMMs provided herein may include an exosomal biomarker. Accordingly, some ARMMs may be negative for one or more other exosomal biomarkers, but positive for one or more different exosomal biomarkers. For example, such an ARMM may be negative for CD63 and Lamp-1, but may include PGK1 or GAPDH; or may be negative for CD63, Lamp-1, CD9, and CD81, but may be positive for HLA-DRA. In some embodiments, ARMMs include an exosomal biomarker, but at a lower level than the level found in exosomes. For example, some ARMMs include one or more exosomal biomarkers at a level of less than about 1%, less than about 5%, less than about 10%, less than about 20%, less than about 30%, less than about 40%, or less than about 50% of the level of that biomarker found in exosomes. To give a non-limiting example, in some embodiments, an ARMM may be negative for CD63 and Lamp-1, include CD9 at a level of less than about 5% of the level of CD9 typically found in exosomes, and be positive for ACTB. Exosomal biomarkers in addition to those listed above are known to those of skill in the art, and the invention is not limited in this regard.
The term “binding RNA,” as used herein, refers to a ribonucleic acid (RNA) that binds to an RNA binding protein, for example, any of the RNA binding proteins known in the art and/or described herein. In some embodiments, a binding RNA is an RNA that specifically binds to an RNA binding protein. A binding RNA that “specifically binds” to an RNA binding protein, binds to the RNA binding protein with greater affinity, avidity, more readily, and/or with greater duration than it binds to another protein, such as a protein that does not bind the RNA or a protein that weakly binds to the binding RNA. In some embodiments, the binding RNA is a naturally-occurring RNA, or non-naturally-occurring variant thereof, that binds to a specific RNA binding protein. For example, the binding RNA may be a TAR element, a Rev response element, an MS2 RNA, or any variant thereof that specifically binds an RNA binding protein. In some embodiments, the binding RNA may be a trans-activating response element (TAR element), or variant thereof, which is an RNA stem-loop structure that is found at the 5′-ends of nascent HIV-1 transcripts and specifically binds to the trans-activator of transcription (Tat) protein. In some embodiments, the binding RNA is a Rev response element (RRE), or variant thereof, that specifically binds to the accessory protein Rev (e.g., Rev from HIV-1). In some embodiments, the binding RNA is an MS2 RNA that specifically binds to a MS2 phage coat protein. The binding RNAs of the present disclosure may be designed to specifically bind a protein (e.g., an RNA binding protein fused to ARRDC1) in order to facilitate loading of the binding RNA (e.g., a binding RNA fused to a payload RNA) into an ARMM.
The term “aptamer,” as used herein, refers to nucleic acids (e.g., RNA, DNA) that bind to a specific target molecule, e.g., an RNA binding protein. In some embodiments, nucleic acid (e.g., DNA or RNA) aptamers are engineered through repeated rounds of in vitro selection or alternatively, SELEX (systematic evolution of ligands by exponential enrichment) methodology, to bind to various molecular targets, for example, proteins, small molecules, macromolecules, metabolites, carbohydrates, metals, nucleic acids, cells, tissues, and organisms. Methods for engineering aptamers to bind to various molecular targets, such as proteins, are known in the art and include those described in U.S. Pat. Nos. 6,376,19; and 9,061,043; Shui B., et al., “RNA aptamers that functionally interact with green fluorescent protein and its derivatives.” Nucleic Acids Res., March; 40(5): e39 (2012); Trujillo U. H., et al., “DNA and RNA aptamers: from tools for basic research towards therapeutic applications.” Comb Chem High Throughput Screen 9 (8): 619-32 (2006); Srisawat C., et al., “Streptavidin aptamers: Affinity tags for the study of RNAs and ribonucleoproteins.” RNA, 7:632-641 (2001); and Tuerk and Gold, “Systematic evolution of ligands by exponential enrichment: RNA ligands to bacteriophage T4 DNA polymerase.” Science. 1990; the entire contents of each of which are hereby incorporated by reference in their entirety.
The term “RNA binding protein,” as used herein refers to a polypeptide molecule that binds to a binding RNA, for example, any of the binding RNAs known in the art and/or described herein. In some embodiments, an RNA binding protein is a protein that specifically binds to a binding RNA. An RNA binding protein that “specifically binds” to a binding RNA, binds to the binding RNA with greater affinity, avidity, more readily, and/or with greater duration than it binds to another RNA, such as a control RNA (e.g., an RNA having a random nucleic acid sequence) or an RNA that weakly binds to the RNA binding protein. In some embodiments, the RNA binding protein is a naturally-occurring protein, or non-naturally-occurring variant thereof, that binds to a specific RNA. For example, in some embodiments, the RNA binding protein may be a trans-activator of transcription (Tat) protein that specifically binds a trans-activating response element (TAR element). In some embodiments, the RNA binding protein is a regulator of virion expression (Rev) protein (e.g., Rev from HIV-1) or variant thereof, that specifically binds to a Rev response element (RRE). In some embodiments, the RNA binding protein is a coat protein of an MS2 bacteriophage that specifically binds to an MS2 RNA. The RNA binding proteins useful in the present disclosure (e.g., a binding protein fused to ARRDC1) may be designed to specifically bind a binding RNA (e.g., a binding RNA fused to a payload RNA) in order to facilitate loading of the binding RNA into an ARMM.
The term “payload,” “payload protein,” “payload nucleic acid,” “payload DNA,” “payload RNA,” or “payload small molecule,” as used herein, refers to a protein, nucleic acid, including DNA or RNA, or a small molecule, respectively, that may be incorporated into an ARMM, for example, into the liquid phase of the ARMM or into the lipid bilayer of an ARMM. Types of payload protein, payload nucleic acid, payload DNA, payload RNA, and payload small molecule are known in the art and include those described in U.S. Pat. Nos. 9,737,480; 9,816,080; 10,260,055; and PCT Publication WO2018/067546; the entire contents of each of which are hereby incorporated by reference in their entirety.
The payload can be delivered via its association with or inclusion in an ARMM to a subject, organ, tissue, or cell. In some embodiments, the payload is to be delivered to a targeted cell in vitro, in vivo, or ex vivo. In some embodiments, the payload to be delivered is a biologically active agent, i.e., it has activity in a cell, organ, tissue, and/or subject. For instance, a protein, nucleic acid (e.g., DNA or RNA), or small molecule that, when administered to a subject, has a biological effect on that subject or is considered to be biologically active. In some embodiments, a payload to be delivered is a therapeutic agent.
As used herein, the term “therapeutic agent” refers to any agent that, when administered to a subject, has a beneficial effect. In some embodiments, the payload is a small molecule, or the payload protein, or nucleic acid, such as DNA or RNA, is associated with a small molecule. In some embodiments, the payload to be delivered is a diagnostic agent. In some embodiments, the payload to be delivered is a prophylactic agent. In some embodiments, the payload to be delivered is useful as an imaging agent. In some of these embodiments, the diagnostic or imaging agent is, and in others it is not, biologically active.
The term “central nervous system” or “CNS,” as used herein, is the portion of the nervous system comprised of the cells and tissues of the brain and spinal cord. Two of the major types of cells that make up the nervous system are neurons and glial cells. In some embodiments, neurons are excitatory, and in some embodiments they are inhibitory. In some embodiments, the cells of the CNS include, but are not necessarily limited to, neuronal cells, glial cells, oligodendrocytes, astrocytes, microglia, cerebrospinal fluid (CSF), interstitial spaces, bone, cartilage and the like.
The terms “peripheral nervous system” or “PNS,” as may be used herein, refer to all cells and tissue of the nervous system outside of the cells and tissues of the brain and spinal cord (i.e., outside of the CNS). The PNS consists of the nerves and ganglia outside of the CNS comprised primarily two types of cells in the peripheral nervous system. Sensory nervous cells carry information to the CNS and motor nervous cells carry information from the CNS. The PNS also includes Schwann cells, which myelinate the cells of the nervous system. Cells of the sensory nervous system send information to the CNS from internal organs or from external stimuli. Motor nervous system cells carry information from the CNS to organs, muscles, and glands.
The terms “nervous system” or “NS,” as used herein, refers collectively to the CNS and the PNS. Cells of the nervous system include neurons, astrocytes, oligodendrocytes, and microglia with further interaction with endothelial cells in blood vessels and cells of the immune system including T-cells. While neurons, astrocytes, and oligodendrocytes are terminally differentiated cells, in certain niches of the CNS neural stem and neural progenitor cells exist that retain the capacity to replicate through both symmetric and symmetric cell division to produce additional stem cells, progenitor cells, and cells that will terminally differentiated into neurons and astrocytes. Microglia, the resident immune system cells of the nervous system, are also able to proliferate.
The term “viral envelope proteins” refers to proteins that normally function to aid viral attachment and entry into cells. In some embodiments, viral envelope proteins can be incorporated into ARMMS to allow for the targeting of cells of the CNS. Non-limiting examples of viral envelope proteins include vesicular stomatitis virus G protein (VSV-G) or rabies virus glycoprotein (RVG). VSV-G mediates viral attachment to LDL receptors (LDLR) or LDLR family member, and RVG is known to use the nicotinic acetylcholine receptor and the low affinity nerve growth factor receptor for viral entry.
The term “linker,” as used herein, refers to a chemical moiety linking two molecules or moieties, e.g., an ARRDC1 protein and a Tat protein, a WW domain and a Tat protein, or an ARRDC1 protein and a Cas9 nuclease. Typically, the linker is positioned between, or flanked by, two groups, molecules, or other moieties and connected to each one via a covalent bond, thus connecting the two. In some embodiments, the linker comprises an amino acid or a plurality of amino acids (e.g., a peptide or protein). In some embodiments, the linker comprises a nucleotide (e.g., DNA or RNA) or a plurality of nucleotides (e.g., a nucleic acid). In some embodiments, the linker is an organic molecule, functional group, polymer, or other chemical moiety. In some embodiments, the linker is a cleavable linker, e.g., the linker comprises a bond that can be cleaved upon exposure to, for example, UV light or a hydrolytic enzyme, such as a protease or esterase. In some embodiments, the linker is any stretch of amino acids having at least 1, at least 2, at least 3, at least 4, at least 5, at least 6, at least 7, at least 8, at least 9, at least 10, at least 15, at least 20, at least 25, at least 30, at least 40, at least 50, or more amino acids (e.g., 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, or 50 amino acids). In other embodiments, the linker is a chemical bond (e.g., a covalent bond, amide bond, disulfide bond, ester bond, carbon-carbon bond, carbon heteroatom bond).
As used herein, the term “animal” refers to any member of the animal kingdom. In some embodiments, the term “animal” refers to a human of either sex at any stage of development. In some embodiments, the term “animal” refers to a non-human animal at any stage of development. In certain embodiments, the non-human animal is a mammal (e.g., a rodent, a mouse, a rat, a rabbit, a monkey, a dog, a cat, a sheep, cattle, a primate, or a pig). Animals include, but are not limited to, mammals, birds, reptiles, amphibians, fish, and worms. In some embodiments, the animal is a transgenic animal, genetically-engineered animal, or a clone. In some embodiments, the animal is a transgenic non-human animal, genetically-engineered non-human animal, or a non-human clone.
As used herein, the term “approximately” or “about,” as applied to one or more values of interest, refers to a value that is similar to a stated reference value. In certain embodiments, the term “approximately” or “about” refers to a range of values that fall within 10%, 9%, 8%, 7%, 6%, 5%, 4%, 3%, 2%, 1%, or less in either direction (greater than or less than) of the stated reference value unless otherwise stated or otherwise evident from the context (for example, when such number would exceed 100% of a possible value).
As used herein, the term “associated with,” when used with respect to two or more entities, for example, with chemical moieties, molecules, and/or ARMMs, means that the entities are physically associated or connected with one another, either directly or via one or more additional moieties that serves as a linker, to form a structure that is sufficiently stable so that the entities remain physically associated under the conditions in which the structure is used, e.g., under physiological conditions. An ARMM is typically associated with an agent, for example, a nucleic acid, protein, or small molecule, by a mechanism that involves a covalent (e.g., via an amide bond) or non-covalent association (e.g., between ARRDC1 and a WW domain, or between a Tat protein and a TAR element). In certain embodiments, the agent to be delivered (e.g., a payload protein, payload nucleic acid, or payload small molecule) is covalently bound to a molecule that associates non-covalently with a part of the ARMM that is fused to an ARRCD1 protein, or variant thereof. In some embodiments, the association is via a linker, for example, a cleavable linker. In some embodiments, an entity (e.g., a payload protein, payload nucleic acid, or payload small molecule) is associated with an ARMM by inclusion in the ARMM, for example, by encapsulation of the molecule within the ARMM. For example, in some embodiments, a molecule (e.g., a payload protein, payload nucleic acid, or payload small molecule) present in the cytoplasm of an ARMM-producing cell is associated with an ARMM by encapsulation of the cytoplasm with the agent in the ARMM upon ARMM budding. Similarly, a membrane protein or other molecule associated with the cell membrane of an ARMM producing cell may be associated with an ARMM produced by the cell by inclusion into the ARMM's membrane upon budding.
As used herein, the phrase “biologically active” refers to a characteristic of any substance that has activity in a cell, organ, tissue, and/or subject. For instance, a substance that, when administered to an organism, has a biological effect on that organism, is considered to be biologically active. As one example, a payload RNA may be considered biologically active if it increases or decreases the expression of a gene product when administered to a subject or cell. As another example, a nuclease payload protein may be considered biologically active if it increases or decreases the expression of a gene product when administered to a subject.
As used herein, the term “conserved” refers to nucleotides or amino acid residues of a polynucleotide sequence or amino acid sequence, respectively, that are those that occur unaltered in the same position of two or more related sequences being compared. Nucleotides or amino acids that are relatively conserved are those that are conserved amongst more related sequences than nucleotides or amino acids appearing elsewhere in the sequences. In some embodiments, two or more sequences are said to be “completely conserved” if they are 100% identical to one another. In some embodiments, two or more sequences are said to be “highly conserved” if they are at least 70% identical, at least 80% identical, at least 90% identical, or at least 95% identical to one another. In some embodiments, two or more sequences are said to be “highly conserved” if they are about 70% identical, about 80% identical, about 90% identical, about 95% identical, about 98% identical, or about 99% identical to one another. In some embodiments, two or more sequences are said to be “conserved” if they are at least 30% identical, at least 40% identical, at least 50% identical, at least 60% identical, at least 70% identical, at least 80% identical, at least 90% identical, or at least 95% identical to one another. In some embodiments, two or more sequences are said to be “conserved” if they are about 30% identical, about 40% identical, about 50% identical, about 60% identical, about 70% identical, about 80% identical, about 90% identical, about 95% identical, about 98% identical, or about 99% identical to one another.
The term “engineered,” as used herein, refers to a protein, nucleic acid, complex, substance, or entity that has been designed, produced, prepared, synthesized, and/or manufactured by a human. Accordingly, an engineered product is a product that does not occur in nature. In some embodiments, an engineered protein or nucleic acid is a protein or nucleic acid that has been designed to meet particular requirements or to have particular design features. For example, a payload RNA may be engineered to associate with the ARRDC1 by fusing one or more WW domains to a Tat protein and fusing the payload RNA to a TAR element to facilitate loading of the payload RNA into an ARMM. As another example, a payload RNA may be engineered to associate with the ARRDC1 by fusing a Tat protein to the ARRDC1 and by fusing the payload RNA to a TAR element to facilitate loading of the payload RNA into an ARMM. As another example, a payload protein may be engineered to associate with the ARRDC1 by fusing one or more WW domains to the payload protein to facilitate loading of the payload protein into an ARMM.
As used herein, “expression” of a nucleic acid sequence refers to one or more of the following events: (1) production of an RNA transcript from a DNA sequence (e.g., by transcription); (2) processing of an RNA transcript (e.g., by splicing, editing, 5′ cap formation, and/or 3′ end processing); (3) translation of an RNA transcript into a polypeptide or protein; and (4) post-translational modification of a polypeptide or protein.
As used herein, a “fusion protein” includes a first protein moiety, e.g., an ARRCD1 protein or variant thereof, or a TSG101 protein or variant thereof, associated with a second protein moiety, for example, a protein to be delivered to a target cell through a peptide linkage. In certain embodiments, the fusion protein is encoded by a single fusion gene.
As used herein, the term “gene” has its meaning as understood in the art. It will be appreciated by those of ordinary skill in the art that the term “gene” may include gene regulatory sequences (e.g., promoters, enhancers, etc.) and/or intron sequences. It will further be appreciated that the definition of gene includes references to nucleic acids that do not encode proteins but rather encode functional RNA molecules, such as gRNAs, RNAi agents, ribozymes, tRNAs, etc. For the purpose of clarity it should be noted that, as used in the present application, the term “gene” generally refers to a portion of a nucleic acid that encodes a protein; the term may optionally encompass regulatory sequences, as will be clear from context to those of ordinary skill in the art. This definition is not intended to exclude application of the term “gene” to non-protein—coding expression units but rather to clarify that, in most cases, the term as used herein refers to a protein-coding nucleic acid.
As used herein, the term “gene product” or “expression product” generally refers to an RNA transcribed from the gene (pre- and/or post-processing) or a polypeptide (pre- and/or post-modification) encoded by an RNA transcribed from the gene.
As used herein, the term “green fluorescent protein” (GFP) refers to a protein originally isolated from the jellyfish Aequorea victoria that fluoresces green when exposed to blue light or a derivative of such a protein (e.g., an enhanced or wavelength-shifted version of the protein). The amino acid sequence of wild type GFP is as follows:
Proteins that are at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, or at least 99% homologous to SEQ ID NO: 3 are also considered to be green fluorescent proteins.
As used herein, the term “homology” refers to the overall relatedness between nucleic acids (e.g., DNA molecules and/or RNA molecules) or polypeptides. In some embodiments, nucleic acids or proteins are considered to be “homologous” to one another if their sequences are at least 50%, at least 55%, at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, or at least 99% identical. In some embodiments, nucleic acids or proteins are considered to be “homologous” to one another if their sequences are at least 50%, at least 55%, at least 60%, at least 65%, at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, or at least 99% identical. The term “homologous” necessarily refers to a comparison between at least two sequences (nucleotide sequences or amino acid sequences). In accordance with the invention, two nucleotide sequences are considered to be homologous if the polypeptides they encode are at least about 50% identical, at least about 60% identical, at least about 70% identical, at least about 80% identical, or at least about 90% identical for at least one stretch of at least about 20 amino acids. In some embodiments, homologous nucleotide sequences are characterized by the ability to encode a stretch of at least 4-5 uniquely specified amino acids. Both the identity and the approximate spacing of these amino acids relative to one another must be considered for sequences to be considered homologous. For nucleotide sequences less than 60 nucleotides in length, homology is determined by the ability to encode a stretch of at least 4-5 uniquely specified amino acids. In accordance with the invention, two protein sequences are considered to be homologous if the proteins are at least about 50% identical, at least about 60% identical, at least about 70% identical, at least about 80% identical, or at least about 90% identical for at least one stretch of at least about 20 amino acids.
As used herein, the term “identity” refers to the overall relatedness between nucleic acids or proteins (e.g., DNA molecules, RNA molecules, and/or polypeptides). Calculation of the percent identity of two nucleic acid sequences, for example, can be performed by aligning the two sequences for optimal comparison purposes (e.g., gaps can be introduced in one or both of a first and second nucleic acid sequence for optimal alignment and non-identical sequences can be disregarded for comparison purposes). In certain embodiments, the length of a sequence aligned for comparison purposes is at least 30%, at least 40%, at least 50%, at least 60%, at least 70%, at least 80%, at least 90%, at least 95%, or 100% of the length of the reference sequence. The nucleotides at corresponding nucleotide positions are then compared. When a position in the first sequence is occupied by the same nucleotide as the corresponding position in the second sequence, then the molecules are identical at that position. The percent identity between the two sequences is a function of the number of identical positions shared by the sequences, taking into account the number of gaps, and the length of each gap, which needs to be introduced for optimal alignment of the two sequences. The comparison of sequences and determination of percent identity between two sequences can be accomplished using a mathematical algorithm. For example, the percent identity between two nucleotide sequences can be determined using methods such as those described in Computational Molecular Biology, Lesk, A. M., ed., Oxford University Press, New York, 1988; Biocomputing: Informatics and Genome Projects, Smith, D. W., ed., Academic Press, New York, 1993; Sequence Analysis in Molecular Biology, von Heinje, G., Academic Press, 1987; Computer Analysis of Sequence Data, Part I, Griffin, A. M., and Griffin, H. G., eds., Humana Press, New Jersey, 1994; and Sequence Analysis Primer, Gribskov, M. and Devereux, J., eds., M Stockton Press, New York, 1991; each of which is incorporated herein by reference. For example, the percent identity between two nucleotide sequences can be determined using the algorithm of Meyers and Miller (CABIOS, 1989, 4:11-17), which has been incorporated into the ALIGN program (version 2.0) using a PAM120 weight residue table, a gap length penalty of 12 and a gap penalty of 4. The percent identity between two nucleotide sequences can, alternatively, be determined using the GAP program in the GCG software package using an NWSgapdna.CMP matrix. Methods commonly employed to determine percent identity between sequences include, but are not limited to those disclosed in Carillo, H., and Lipman, D., SIAM J Applied Math., 48:1073 (1988); incorporated herein by reference. Techniques for determining identity are codified in publicly available computer programs. Exemplary computer software to determine homology between two sequences include, but are not limited to, GCG program package, Devereux, J., et al., Nucleic Acids Research, 12(1), 387 (1984)), BLASTP, BLASTN, and FASTA Atschul, S. F. et al., J. Molec. Biol., 215, 403 (1990)).
As used herein, the term “in vitro” refers to events that occur in an artificial environment, e.g., in a test tube or reaction vessel, in cell culture, in a Petri dish, etc., rather than within an organism (e.g., animal, plant, or microbe).
As used herein, the term “in vivo” refers to events that occur within an organism (e.g., animal, plant, or microbe).
As used herein, the term “isolated” refers to a substance or entity that has been: (1) separated from at least some of the components with which it was associated when initially produced (whether in nature or in an experimental setting); and/or (2) produced, prepared, and/or manufactured by the hand of man. Isolated substances and/or entities may be separated from at least about 10%, about 20%, about 30%, about 40%, about 50%, about 60%, about 70%, about 80%, about 90%, or more of the other components with which they were initially associated. In some embodiments, isolated substances are more than about 80%, about 85%, about 90%, about 91%, about 92%, about 93%, about 94%, about 95%, about 96%, about 97%, about 98%, about 99%, or more than about 99% pure. As used herein, a substance is “pure” if it is substantially free of other components.
As used herein, the term “nucleic acid,” in its broadest sense, refers to a compound and/or substance that is or can be incorporated into an oligonucleotide chain via a phosphodiester linkage. In some embodiments, “nucleic acid” refers to individual nucleic acid residues (e.g., nucleotides and/or nucleosides). In some embodiments, “nucleic acid” refers to an oligonucleotide chain comprising individual nucleotides. As used herein, the terms “oligonucleotide” and “polynucleotide” can be used interchangeably to refer to a polymer of nucleotides (e.g., a string of at least two nucleotides). In some embodiments, “nucleic acid” encompasses RNA as well as single and/or double-stranded DNA and/or complementary DNA (cDNA). Furthermore, the terms “nucleic acid,” “DNA,” “RNA,” and/or similar terms include nucleic acid analogs, i.e., analogs having other than a phosphodiester backbone. For example, the so-called “peptide nucleic acids,” which are known in the art and have peptide bonds instead of phosphodiester bonds in the backbone, are considered within the scope of the present invention. The term “nucleotide sequence encoding an amino acid sequence” includes all nucleotide sequences that are degenerate versions of each other and/or encode the same amino acid sequence. Nucleotide sequences that encode proteins and/or RNA may include introns. Nucleic acids can be purified from natural sources, produced using recombinant expression systems and optionally purified, chemically synthesized, etc. Where appropriate, e.g., in the case of chemically synthesized molecules, nucleic acids can comprise nucleoside analogs such as analogs having chemically modified bases or sugars, backbone modifications, etc. A nucleic acid sequence is presented in the 5′ to 3′ direction unless otherwise indicated. The term “nucleic acid segment” is used herein to refer to a nucleic acid sequence that is a portion of a longer nucleic acid sequence. In many embodiments, a nucleic acid segment comprises at least 3, at least 4, at least 5, at least 6, at least 7, at least 8, at least 9, at least 10, or more residues. In some embodiments, a nucleic acid is or comprises natural nucleosides (e.g., adenosine, thymidine, guanosine, cytidine, uridine, deoxyadenosine, deoxythymidine, deoxyguanosine, and deoxycytidine); nucleoside analogs (e.g., 2-aminoadenosine, 2-thiothymidine, inosine, pyrrolo-pyrimidine, 3-methyl adenosine, 5-methylcytidine, 2-aminoadenosine, C5-bromouridine, C5-fluorouridine, C5-iodouridine, C5-propynyl-uridine, C5-propynyl-cytidine, C5-methylcytidine, 2-aminoadenosine, 7-deazaadenosine, 7-deazaguano sine, 8-oxoadenosine, 8-oxoguanosine, 0(6)-methylguanine, and 2-thiocytidine); chemically modified bases; biologically modified bases (e.g., methylated bases); intercalated bases; modified sugars (e.g., 2′-fluororibose, ribose, 2′-deoxyribose, arabinose, and hexose); and/or modified phosphate groups (e.g., phosphorothioates and 5′-N-phosphoramidite linkages). In some embodiments, the present invention is specifically directed to “unmodified nucleic acids,” meaning nucleic acids (e.g., polynucleotides and residues, including nucleotides and/or nucleosides) that have not been chemically modified in order to facilitate or achieve delivery.
As used herein, the term “protein” refers to a string of at least two amino acids linked to one another by one or more peptide bonds. Proteins may include moieties other than amino acids (e.g., may be glycoproteins) and/or may be otherwise processed or modified. Those of ordinary skill in the art will appreciate that a “protein” can be a complete protein chain as produced by a cell (with or without a signal sequence), or can be a functional portion thereof. Those of ordinary skill will further appreciate that a protein can sometimes include more than one protein chain, for example linked by one or more disulfide bonds or associated by other means. Proteins may contain L-amino acids, D-amino acids, or both and may contain any of a variety of amino acid modifications or analogs known in the art. Useful modifications include, e.g., addition of a chemical entity such as a carbohydrate group, a phosphate group, a farnesyl group, an isofarnesyl group, a fatty acid group, an amide group, a terminal acetyl group, a linker for conjugation, functionalization, or other modification (e.g., alpha amidation), etc. In certain embodiments, the modifications of the protein lead to a more stable protein (e.g., greater half-life in vivo). These modifications may include cyclization of the protein, the incorporation of D-amino acids, etc. None of the modifications should substantially interfere with the desired biological activity of the protein. In certain embodiments, the modifications of the protein lead to a more biologically active protein. In some embodiments, proteins may comprise natural amino acids, non-natural amino acids, synthetic amino acids, amino acid analogs, and combinations thereof.
As used herein, the term “subject” or “patient” refers to any organism to which a composition in accordance with the invention may be administered, e.g., for experimental, diagnostic, prophylactic, and/or therapeutic purposes. Typical subjects include animals (e.g., mammals, such as mice, rats, rabbits, non-human primates, and humans) and/or plants. In some embodiments, the subject is a patient having or suspected of having a disease or disorder. In other embodiments, the subject is a healthy volunteer.
As used herein, the term “therapeutically effective amount” means an amount of an agent to be delivered (e.g., nucleic acid, protein, drug, therapeutic agent, diagnostic agent, prophylactic agent, ARMM, or ARMM comprising a payload protein or payload RNA) that is sufficient, when administered to a subject suffering from or susceptible to a disease, disorder, and/or condition, to treat, improve symptoms of, diagnose, prevent, and/or delay the onset of the disease, disorder, and/or condition.
As used herein, the term “treating” refers to partially or completely preventing, and/or reducing the incidence of one or more symptoms or features of a particular disease or condition. For example, “treating” cancer may refer to inhibiting survival, growth, and/or spread of the cancer. Treatment may be administered to a subject who does not exhibit signs or symptoms of a disease, disorder, and/or condition and/or to a subject who exhibits only early signs or symptoms of a disease, or condition for the purpose of decreasing the risk of developing more severe effects associated with the disease, disorder, or condition.
As used herein, a “vector” means any nucleic acid or nucleic acid-bearing particle, cell, or organism capable of being used to transfer a nucleic acid into a host cell. The term “vector” includes both viral and nonviral products and means for introducing the nucleic acid into a cell. A “vector” can be used in vitro, ex vivo, or in vivo. Vectors capable of directing the expression of operatively linked genes are referred to herein as “expression vectors.” Non-viral vectors include plasmids, cosmids, artificial chromosomes (e.g., bacterial artificial chromosomes or yeast artificial chromosomes) and can comprise liposomes, electrically charged lipids (cytofectins), DNA-protein complexes, and biopolymers, for example. Viral vectors include retroviruses, lentiviruses, adeno-associated virus, pox viruses, baculovirus, reoviruses, vaccinia viruses, herpes simplex viruses, Epstein-Barr viruses, and adenovirus vectors, for example. Vectors can also comprise the entire genome sequence or recombinant genome sequence of a virus. A vector can also comprise a portion of the genome that comprises the functional sequences for production of a virus capable of infecting, entering, or being introduced to a cell to deliver nucleic acid therein.
The term “WW domain” as used herein, refers to a protein domain having two basic residues at the C-terminus that mediates protein-protein interactions with short proline-rich or proline-containing motifs. It should be appreciated that the two basic residues (e.g., any two of: H, R, and K) of the WW domain are not required to be at the absolute C-terminal end of the WW protein domain. Rather, the two basic residues may be at a C-terminal portion of the WW protein domain (e.g., the C-terminal half of the WW protein domain). In some embodiments, the WW domain contains at least 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10 tryptophan (W) residues. In some embodiments, the WW domain contains at least two W residues. In some embodiments, the at least two W residues are spaced apart by from 15-25 amino acids. In some embodiments, the at least two W residues are spaced apart by from 19-23 amino acids. In some embodiments, the at least two W residues are spaced apart by from 20-22 amino acids. The WW domain possessing the two basic C-terminal amino acid residues may have the ability to associate with short proline-rich or proline-containing motifs (e.g., a PPXY (SEQ ID NO: 2) motif). WW domains bind a variety of distinct peptide ligands including motifs with core proline-rich sequences, such as PPXY (SEQ ID NO: 2), which is found in AARDC1. A WW domain may be a 30-40 amino acid protein interaction domain with two signature tryptophan residues spaced by 20-22 amino acids. The three-dimensional structure of WW domains shows that they generally fold into a three-stranded, antiparallel β sheet with two ligand-binding grooves.
WW domains are found in many eukaryotes and are present in approximately 50 human proteins (Bork, P. & Sudol, M. The WW domain: a signaling site in dystrophin? Trends Biochem Sci 19, 531-533 (1994)). WW domains may be present together with several other interaction domains, including membrane targeting domains, such as C2 in the NEDD4 family proteins, the phosphotyrosine-binding (PTB) domain in FE65 protein, FF domains in CA150 and FBPII, and pleckstrin homology (PH) domains in PLEKHA5. WW domains are also linked to a variety of catalytic domains, including HECT E3 protein-ubiquitin ligase domains in NEDD4 family proteins, rotomerase or peptidyl prolyisomerase domains in Pinl, and Rho GAP domains in ArhGAP9 and ArhGAP12.
In the instant disclosure, the WW domain may be a WW domain that naturally possesses two basic amino acids at the C-terminus. In some embodiments, a WW domain or WW domain variant may be from the human ubiquitin ligase WWP1, WWP2, Nedd4-1, Nedd4-2, Smurf1, Smurf2, ITCH, NEDL1, or NEDL2. Exemplary amino acid sequences of WW domain containing proteins (WW domains underlined) are listed below. It should be appreciated that any of the WW domains or WW domain variants of the exemplary proteins may be used in the invention, described herein, and are not meant to be limiting.
Human WWP1 amino acid sequence (uniprot.org/uniprot/Q9H0M0). The four underlined WW domains correspond to amino acids 349-382 (WW1), 381-414 (WW2), 456-489 (WW3), and 496-529 (WW4).
LPSGWEQRKD PHGRTYYVDH NTRTTTWERP QPLPPGWERR VDDRRRVYYV
DHNTRTTTWQ RPTMESVRNF EQWQSQRNQL QGAMQQFNQR YLYSASMLAA
GWEIRYTREG VRYFVDHNTR TTTFKDPRNG KSSVTKGGPQ IAYERGFRWK
Human WWP2 amino acid sequence (uniprot.org/uniprot/O00308). The four underlined WW domains correspond to amino acids 300-333 (WW1), 330-363 (WW2), 405-437 (WW3), and 444-547 (WW4).
ALPAGWEQRE LPNGRVYYVD HNTKTTTWER PLPPGWEKRT DPRGRFYYVD
HNTRTTTWQR PTAEYVRNYE QWQSQRNQLQ GAMQHFSQRF LYQSSSASTD
EMKYTSEGVR YFVDHNTRTT TFKDPRPGFE SGTKQGSPGA YDRSFRWKYH
Human Nedd4-1 amino acid sequence (uniprot.org/uniprot/P46934). The four underlined WW domains correspond to amino acids 610-643 (WW1), 767-800 (WW2), 840-873 (WW3), and 892-925 (WW4).
APNGRPFFID HNTKTTTWED PRLKIPAHLR GKTSLDTSND LGPLPPGWEE
RTHTDGRIFY INHNIKRTQW EDPRLENVAI TGPAVPYSRD YKRKYEFFRR
Human Nedd4-2 amino acid sequence (>gi|21361472|refINP_056092.2|E3 ubiquitin-protein ligase NEDD4-like isoform 3 [Homo sapiens]). The four underlined WW domains correspond to amino acids 198-224 (WW1), 368-396 (WW2), 480-510 (WW3), and 531-561 (WW4).
EKVDNLGRTYYVNHNNRTTQWHRPSLMDVSSESDNNIRQINQEAAHRRFR
KTTTWEDPRLKFPVHMRSKTSLNPNDLGPLPPGWEERIHLDGRTFYIDHN
SKITQWEDPRLQNPAITGPAVPYSREFKQKYDYFRKKLKKPADIPNRFEM
Human Smurf1 amino acid sequence (uniprot.org/uniprot/Q9HCE7). The two underlined WW domains correspond to amino acids 234-267 (WW1) and 306-339 (WW2).
YFLHTQTGVS TWHDPRIPSP SGTIPGGDAA FLYEFLLQGH TSEPRDLNSV
Human Smurf2 amino acid sequence (uniprot.org/uniprot/Q9HAU4). The three underlined WW domains correspond to amino acids 157-190 (WW1), 251-284 (WW2), and 297-330 (WW3).
PDLPEGYEQR TTQQGQVYFL HTQTGVSTWH DPRVPRDLSN INCEELGPLP
PGWEIRNTAT GRVYFVDHNN RTTQFTDPRL SANLHLVLNR QNQLKDQQQQ
Human ITCH amino acid sequence (uniprot.org/uniprot/Q96J02). The four underlined WW domains correspond to amino acids 326-359 (WW1), 358-391 (WW2), 438-471 (WW3), and 478-511 (WW4).
RTTWDRPEPL PPGWERRVDN MGRIYYVDHF TRTTTWQRPT LESVRNYEQW
NGRVYFVNHN TRITQWEDPR SQGQLNEKPL PEGWEMRFTV DGIPYFVDHN
RRTTTYIDPR TGKSALDNGP QIAYVRDFKA KVQYFRFWCQ QLAMPQHIKI
Human NEDL1 amino acid sequence (uniprot.org/uniprot/Q76N89). The two underlined WW domains correspond to amino acids 829-862 (WW1), and 1018-1051 (WW2).
VNRTTTWQRP TAAATPDGMR RSGSIQQMEQ LNRRYQNIQR TIATERSEED
Human NEDL2 amino acid sequence (uniprot.org/uniprot/Q9P2P5). The two underlined WW domains correspond to amino acids 807-840 (WW1) and 985-1018 (WW2).
AFFVDHNSRT TTFIDPRLPL QSSRPTSALV HRQHLTRQRS HSAGEVGEDS
In some embodiments, the WW domain consists essentially of a WW domain or WW domain variant. Consists essentially of means that a domain, peptide, or polypeptide consists essentially of an amino acid sequence when such an amino acid sequence is present with only a few additional amino acid residues, for example, from about 1 to about 10 or so additional residues, typically from 1 to about 5 additional residues in the domain, peptide, or polypeptide.
Alternatively, the WW domain may be a WW domain that has been modified to include two basic amino acids at the C-terminus of the domain. Techniques are known in the art and are described in the art, for example, in Sambrook et al., ((2001) Molecular Cloning: a Laboratory Manual, 3rd ed., Cold Spring Harbour Laboratory Press). Thus, a skilled person could readily modify an existing WW domain that does not normally have two C-terminal basic residues so as to include two basic residues at the C-terminus.
Basic amino acids are amino acids that possess a side-chain functional group that has a pKa of greater than 7 and includes lysine, arginine, and histidine, as well as basic amino acids that are not included in the twenty α-amino acids commonly included in proteins. The two basic amino acids at the C-terminus of the WW domain may be the same basic amino acid or may be different basic amino acids. In one embodiment, the two basic amino acids are two arginines.
The term WW domain also includes variants of a WW domain provided that any such variant possesses two basic amino acids at its C-terminus and maintains the ability of the WW domain to associate with the PPXY (SEQ ID NO: 2) motif. A variant of such a WW domain refers to a WW domain which retains the ability of the variant to associate with the PPXY (SEQ ID NO: 2) motif (i.e., the PPXY (SEQ ID NO:2) motif of ARRDC1 and that has been mutated at one or more amino acids, including point, insertion, and/or deletion mutations, but still retains the ability to associate with the PPXY (SEQ ID NO: 2) motif. A variant or derivative therefore includes deletions, including truncations and fragments; insertions and additions, for example conservative substitutions, site-directed mutants and allelic variants; and modifications, including one or more non-amino acyl groups (e.g., sugar, lipid, etc.) covalently linked to the peptide and post-translational modifications. In making such changes, substitutions of like amino acid residues can be made on the basis of relative similarity of side-chain substituents, for example, their size, charge, hydrophobicity, hydrophilicity, and the like, and such substitutions may be assayed for their effect on the function of the peptide by routine testing.
The WW domain may be part of a longer protein. Thus, the protein, in various different embodiments, comprises the WW domain, consists of the WW domain or consists essentially of the WW domain, as defined herein. The polypeptide may be a protein that includes a WW domain as a functional domain within the protein sequence.
The instant disclosure relates, at least in part, to the discovery that a GFP-encoding payload RNA that is associated with an ARRDC1 protein can be loaded into an ARMM and delivered to cells of the nervous system, including the neurons of the CNS. The uptake of these ARMMs and payload is enhanced by the presence of viral envelope proteins, including but not limited to VSV-G. Different types of payload, such as payload proteins and payload nucleic acids including payload RNA, can be loaded in such ARMMs for delivery to cells of the CNS. Various types of payload proteins, payload nucleic acids, payload RNAs, payload protein, payload nucleic acid, and payload RNA are known in the art and include those described in U.S. Pat. Nos. 9,737,480; 9,816,080; 10,260,055; and PCT Application Publication WO2018/067546; the entire contents of each of which are hereby incorporated by reference in their entirety.
Arrestin domain containing protein 1 mediated microvesicles (ARMMs) are extracellular vesicles (EVs) that are distinct from exosomes. The budding of ARMMs requires Arrestin domain containing protein 1 (ARRDC1), which is localized to the cytosolic side of the plasma membrane and, through a tetrapeptide motif, recruits the ESCRT-I complex protein TSG101 to the cell surface to initiate the outward membrane budding. Thus, in contrast to exosomes, the biogenesis of ARMMs occurs at the plasma membrane. ARMMs exhibit several additional features that make them potentially ideal vehicles for therapeutic delivery. ARRDC1 is not only necessary, but also sufficient to drive ARMMs budding. Indeed, simple overexpression of the ARRDC1 protein increases the production of ARMMs in cells. This allows controlled production of ARMMs using modern biological manufacturing methods. Moreover, endogenous proteins such as cell surface receptors are actively recruited into ARMMs and can be delivered into recipient cells to initiate intercellular communication, suggesting that the exogenous payload molecules may be similarly packaged and delivered via ARMMs.
ARRDC1 is a protein that comprises a PSAP (SEQ ID NO: 1) motif and a PPXY (SEQ ID NO: 2) motif in its C-terminus, and interacts with TSG101 as shown herein. It should be appreciated that the PSAP (SEQ ID NO: 1) motif and the PPXY (SEQ ID NO: 2) motif are not required to be at the absolute C-terminal end of the ARRDC1. Rather, they may be at a C-terminal portion of the ARRDC1 protein (e.g., the C-terminal half of the ARRDC1). The disclosure also contemplates variants of ARRDC1, such as fragments of ARRDC1 and/or ARRDC1 proteins that have a degree of identity (e.g., 60%, 70%, 80%, 85%, 90%, 95%, 98%, or 99% identity) to an ARRDC1 protein and are capable if interacting with TSG101. Accordingly, an ARRDC1 protein may be a protein that comprises a PSAP (SEQ ID NO: 1) motif and a PPXY (SEQ ID NO: 2) motif, and interacts with TSG101. In some embodiments, the ARRDC1 protein is at least 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 98%, or 99% identical to the amino acid sequence of any one of SEQ ID NOs: 42-44, comprises a PSAP (SEQ ID NO: 1) motif and a PPXY (SEQ ID NO: 2) motif, and interacts with TSG101. In some embodiments, the ARRDC1 protein has at least 10, at least 20, at least 30, at least 40, at least 50, at least 60, at least 70, at least 80, at least 90, at least 100, at least 110, at least 120, at least 130, at least 140, at least 150, at least 160, at least 170, at least 180, at least 190, at least 200, at least 210, at least 220, at least 230, at least 240, at least 250, at least 260, at least 270, at least 280, at least 290, at least 300, at least 310, at least 320, at least 330, at least 340, at least 350, at least 360, at least 370, at least 380, at least 390, at least 400, at least 410, at least 420, or at least 430 identical contiguous amino acids of any one of SEQ ID NOs: 42-44, comprises a PSAP (SEQ ID NO: 1) motif and a PPXY (SEQ ID NO: 2) motif, and interacts with TSG101. In some embodiments, the ARRDC1 protein has 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 21, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50 or more mutations compared to any one of the amino acid sequences set forth in SEQ ID NOs: 42-44 comprises a PSAP (SEQ ID NO: 1) motif and a PPXY (SEQ ID NO: 2) motif, and interacts with TSG101. In some embodiments, the ARRDC1 protein comprises any one of the amino acid sequences set forth in SEQ ID NOs: 42-44. Exemplary, non-limiting ARRDC1 protein sequences are provided herein, and additional, suitable ARRDC1 protein variants according to aspects of this invention are known in the art. It will be appreciated by those of skill in the art that this invention is not limited in this respect. Exemplary ARRDC1 sequences include the following (PSAP (SEQ ID NO: 1) and PPXY (SEQ ID NO: 2) motifs are marked):
In certain embodiments, the inventive microvesicles further comprise TSG101 (tumor susceptibility gene 101) TSG101 belongs to a group of apparently inactive homologs of ubiquitin-conjugating enzymes. The protein contains a coiled-coil domain that interacts with stathmin, a cytosolic phosphoprotein implicated in tumorigenesis. TSG101 is a protein that comprises a UEV domain, and interacts with ARRDC1. As referred to herein, UEV refers to the Ubiquitin E2 variant domain of approximately 145 amino acids. The structure of the domain contains a α/β fold similar to the canonical E2 enzyme but has an additional N-terminal helix and further lacks the two C-terminal helices. Often found in TSG101/Vps23 proteins, the UEV interacts with a ubiquitin molecule and is essential for the trafficking of a number of ubiquitylated payloads to multivesicular bodies (MVBs). Furthermore, the UEV domain can bind to Pro-Thr/Ser-Ala-Pro peptide ligands, a fact exploited by viruses such as HIV. Thus, the TSG101 UEV domain binds to the PTAP tetrapeptide motif in the viral Gag protein that is involved in viral budding. The disclosure also contemplates variants of TSG101, such as fragments of TSG101 and/or TSG101 proteins that have a degree of identity (e.g., 60%, 70%, 80%, 85%, 90%, 95%, 98%, or 99% identity) to a TSG101 protein and are capable if interacting with ARRDC1. Accordingly, an TSG101 protein may be a protein that comprises a UEV domain, and interacts with ARRDC1. In some embodiments, the TSG101 protein is at least 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 98%, or 99% identical to the amino acid sequence of any one of SEQ ID NOs: 45-47, comprises a UEV domain, and interacts with ARRDC1. In some embodiments, the TSG101 protein has at least 10, at least 20, at least 30, at least 40, at least 50, at least 60, at least 70, at least 80, at least 90, at least 100, at least 110, at least 120, at least 130, at least 140, at least 150, at least 160, at least 170, at least 180, at least 190, at least 200, at least 210, at least 220, at least 230, at least 240, at least 250, at least 260, at least 270, at least 280, at least 290, at least 300, at least 310, at least 320, at least 330, at least 340, at least 350, at least 360, at least 370, at least 380, or at least 390, identical contiguous amino acids of any one of SEQ ID NOs: 45-47, comprises a UEV domain, and interacts with ARRDC1. In some embodiments, the TSG101 protein has 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 21, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50 or more mutations compared to any one of the amino acid sequences set forth in SEQ ID NOs: 45-47 and comprises a UEV domain. In some embodiments, the ARRDC1 protein comprises any one of the amino acid sequences set forth in SEQ ID NOs: 45-47. Exemplary, non-limiting TSG101 protein sequences are provided herein, and additional, suitable TSG101 protein sequences, isoforms, and variants are known in the art. It will be appreciated by those of skill in the art that this invention is not limited in this respect. Exemplary TSG101 sequences include the following sequences (the UEV domain in these sequences includes amino acids 1-145 and is underlined in the sequences below):
MAVSESQLKKMVSKYKYRDLTVRETVNVITLYKDLKPVLDSYVFNDGSSRELMNLTGTIPVP
YRGNTYNIPICLWLLDTYPYNPPICFVKPTSSMTIKTGKHVDANGKIYLPYLHEWKHPQSDL
LGLIQVMIVVFGDEPPVFSRPISASYPPYQATGPPNTSYMPGMPGGISPYPSGYPPNPSGYP
MAVSESQLKKMMSKYKYRDLTVRQTVNVIAMYKDLKPVLDSYVFNDGSSRELVNLTGTIPVR
YRGNIYNIPICLWLLDTYPYNPPICFVKPTSSMTIKTGKHVDANGKIYLPYLHDWKHPRSEL
LELIQIMIVIFGEEPPVFSRPTVSASYPPYTATGPPNTSYMPGMPSGISAYPSGYPPNPSGY
MAVSESQLKKMMSKYKYRDLTVRQTVNVIAMYKDLKPVLDSYVFNDGSSRELVNLTGTIPVR
YRGNIYNIPICLWLLDTYPYNPPICFVKPTSSMTIKTGKHVDANGKIYLPYLHDWKHPRSEL
LELIQIMIVIFGEEPPVFSRPTVSASYPPYTAAGPPNTSYLPSMPSGISAYPSGYPPNPSGY
The structure of UEV domains is known to those of skill in the art (see, e.g., Owen Pornillos et al., Structure and functional interactions of the Tsg101 UEV domain, EMBO J. 2002 May 15; 21(10): 2397-2406, the entire contents of which are incorporated herein by reference).
Some aspects of this invention provide expression constructs for encoding a gene product or gene products that induce or facilitate the generation of ARMMs in cells harboring such a construct. In some embodiments, the expression constructs described herein encode a fusion proteins as described herein, such as ARRDC1 fusion proteins and TSG101 fusion proteins. In some embodiments, the expression constructs encode an ARRDC1 protein, or variant thereof, and/or a TSG101 protein, or variant thereof. In some embodiments, overexpression of either or both of these gene products in a cell increase the production of ARMMs in the cell, thus turning the cell into a microvesicle producing cell. In some embodiments, such an expression construct comprises at least one restriction or recombination site that allows in-frame cloning of a protein sequence to be fused, either at the C-terminus, or at the N-terminus of the encoded ARRDC1, or variant thereof. As another example an expression construct comprises at least one restriction or recombination site that allows in-frame cloning of a protein sequence to be fused either at the C-terminus, or at the N-terminus of one or more encoded WW domains.
In some embodiments, the expression construct comprises (a) a nucleotide sequence encoding an ARRDC1 protein, or variant thereof, operably linked to a heterologous promoter, and (b) a restriction site or a recombination site positioned adjacent to the ARRDC1-encoding nucleotide sequence allowing for the insertion of a nucleotide sequencing encoding a payload protein, or an RNA binding protein or RNA binding protein variant sequence, in frame with the ARRDC1-encoding nucleotide sequence. In some embodiments, the heterologous promoter may be constitutive promoter, in some embodiments, the heterologous promoter may be an inducible promoter. Some aspects of this invention provide an expression construct comprising (a) a nucleotide sequence encoding a TSG101 protein, or variant thereof, operably linked to a heterologous promoter, and (b) a restriction site or a recombination site positioned adjacent to the TSG101-encoding nucleotide sequence allowing for the insertion of a nucleotide sequencing encoding a payload protein, or an RNA binding protein, DNA binding protein, or variant sequence thereof, in frame with the TSG101-encoding nucleotide sequence. In some embodiments, the heterologous promoter may be constitutive promoter, in some embodiments, the heterologous promoter may be an inducible promoter.
Some aspects of this invention provide an expression construct comprising (a) a nucleotide sequence encoding a WW domain, or variant thereof, operably linked to a heterologous promoter, and (b) a restriction site or a recombination site positioned adjacent to the WW domain-encoding nucleotide sequence allowing for the insertion of a payload protein or RNA binding protein, or a protein variant sequence thereof in frame with the WW domain-encoding nucleotide sequence. In some embodiments, the heterologous promoter may be constitutive promoter, in some embodiments, the heterologous promoter may be an inducible promoter. The expression constructs may encode a payload protein or an RNA binding protein fused to at least one WW domain. In some embodiments, the expression constructs encode a payload protein or an RNA binding protein, or variant thereof, fused to at least one WW domain, or variant thereof. Any of the expression constructs, described herein, may encode any WW domain or variant thereof. In some embodiments, the heterologous promoter may be constitutive promoter, in some embodiments, the heterologous promoter may be an inducible promoter.
The expression constructs, described herein, may comprise any nucleic acid sequence capable of encoding a WW domain or variant thereof. For example a nucleic acid sequence encoding a WW domain or WW domain variant may be from the human ubiquitin ligase WWP1, WWP2, Nedd4-1, Nedd4-2, Smurf1, Smurf2, ITCH, NEDL1, or NEDL2. Exemplary nucleic acid sequences of WW domain containing proteins are listed below. It should be appreciated that any of the nucleic acids encoding WW domains or WW domain variants of the exemplary proteins may be used in the invention, described herein, and are not meant to be limiting.
Human WWP1 nucleic acid sequence (uniprot.org/uniprot/Q9H0M0).
Human WWP2 nucleic acid sequence (uniprot.org/uniprot/O00308).
Human Nedd4-1 nucleic acid sequence (uniprot.org/uniprot/P46934).
Human Nedd4-2 nucleic acid sequence (>gi|345478679|refINM_015277.5|Homo sapiens neural precursor cell expressed, developmentally down-regulated 4-like, E3 ubiquitin protein ligase (NEDD4L), transcript variant d, mRNA).
Human Smurf1 nucleic acid sequence (uniprot.org/uniprot/Q9HCE7).
Human Smurf2 nucleic acid sequence (uniprot.org/uniprot/Q9HAU4).
Human ITCH nucleic acid sequence (uniprot.org/uniprot/Q96J02).
Human NEDL1 nucleic acid sequence (uniprot.org/uniprot/Q76N89).
Human NEDL2 nucleic acid sequence (uniprot.org/uniprot/Q9P2P5).
Some aspects of this invention provide expression constructs that encode any of the proteins, nucleic acids, such as RNAs, or fusions thereof described herein.
Nucleic acids encoding any of the proteins and/or nucleic acid (including RNA) described herein, may be in any number of nucleic acid “vectors” known in the art. As used herein, a “vector” means any nucleic acid or nucleic acid-bearing particle, cell, or organism capable of being used to transfer a nucleic acid into a host cell. The term “vector” includes both viral and nonviral products and means for introducing the nucleic acid into a cell. A “vector” can be used in vitro, ex vivo, or in vivo. Non-viral vectors include plasmids, cosmids, artificial chromosomes (e.g., bacterial artificial chromosomes or yeast artificial chromosomes) and can comprise liposomes, electrically charged lipids (cytofectins), DNA-protein complexes, and biopolymers, for example. Viral vectors include retroviruses, lentiviruses, adeno-associated virus, pox viruses, baculovirus, reoviruses, vaccinia viruses, herpes simplex viruses, Epstein-Barr viruses, and adenovirus vectors, for example. Vectors can also comprise the entire genome sequence or recombinant genome sequence of a virus. A vector can also comprise a portion of the genome that comprises the functional sequences for production of a virus capable of infecting, entering, or being introduced to a cell to deliver nucleic acid therein.
Expression of any of the proteins and/or nucleic acid (including RNA) described herein, may be controlled by any regulatory sequence (e.g., a promoter sequence) known in the art. Regulatory sequences, as described herein, are nucleic acid sequences that regulate the expression of a nucleic acid sequence. A regulatory or control sequence may include sequences that are responsible for expressing a particular nucleic acid or may include other sequences, such as heterologous, synthetic, or partially synthetic sequences. The sequences can be of eukaryotic, prokaryotic, or viral origin that stimulate or repress transcription of a gene in a specific or non-specific manner and in an inducible or non-inducible manner. Regulatory or control regions may include origins of replication, RNA splice sites, introns, chimeric or hybrid introns, promoters, enhancers, transcriptional termination sequences, poly A sites, locus control regions, signal sequences that direct the polypeptide into the secretory pathways of the target cell. A heterologous regulatory region is a regulatory region not naturally associated with the expressed nucleic acid it is linked to. Included among the heterologous regulatory regions are regulatory regions from a different species, regulatory regions from a different gene, hybrid regulatory sequences, and regulatory sequences that do not occur in nature, but which are designed by one of ordinary skill in the art.
The term operably linked refers to an arrangement of sequences or regions wherein the components are configured so as to perform their usual or intended function. Thus, a regulatory or control sequence operably linked to a coding sequence is capable of affecting the expression of the coding sequence. The regulatory or control sequences need not be contiguous with the coding sequence, so long as they function to direct the proper expression or polypeptide production. Thus, for example, intervening untranslated but transcribed sequences can be present between a promoter sequence and the coding sequence and the promoter sequence can still be considered operably linked to the coding sequence. A promoter sequence, as described herein, is a DNA regulatory region a short distance from the 5′ end of a gene that acts as the binding site for RNA polymerase. The promoter sequence may bind RNA polymerase in a cell and/or initiate transcription of a downstream (3′ direction) coding sequence. The promoter sequence may be a promoter capable of initiating transcription in prokaryotes or eukaryotes. Some non-limiting examples of eukaryotic promoters include the cytomegalovirus (CMV) promoter, the chicken β-actin (CBA) promoter, and a hybrid form of the CBA promoter (CBh).
A microvesicle-producing cell of the present invention may be a cell containing any of the expression constructs, any of the fusion proteins, or any of the payloads of molecules (e.g., biological molecules, small molecules, proteins, and nucleic acids (e.g., DNA, RNA), DNA plasmids siRNA, mRNA) described herein. For example, an inventive microvesicle-producing cell may contain one or more recombinant expression constructs encoding (1) an ARRDC1 protein, or PSAP (SEQ ID NO: 1) motif-containing variant thereof and (2) an RNA binding protein (e.g., a Tat protein), that is associated with the ARRDC1 protein, or PSAP (SEQ ID NO: 1) motif-containing variant thereof. In some embodiments, a microvesicle-producing cell may contain one or more recombinant expression constructs encoding (1) an ARRDC1 protein, or PSAP (SEQ ID NO: 1) motif-containing variant thereof, and (2) a payload protein, such as a RNA binding protein fused to at least one WW domain, or variant thereof, under the control of a heterologous promoter. In certain embodiments, the expression construct in the microvesicle producing cell encodes a payload protein with one or more WW domains or variants thereof. In certain embodiments, an expression construct in the microvesicle producing cell encodes a RNA that associates with (e.g., binds specifically) an RNA binding protein, for example a therapeutic RNA.
Any of the expression constructs, described herein, may be stably inserted into the genome of the cell. In some embodiments, the expression construct is maintained in the cell, but not inserted into the genome of the cell. In some embodiments, the expression construct is in a vector, for example, a plasmid vector, a cosmid vector, a viral vector, or an artificial chromosome. In some embodiments, the expression construct further comprises additional sequences or elements that facilitate the maintenance and/or the replication of the expression construct in the microvesicle-producing cell, or that improve the expression of the fusion protein in the cell. Such additional sequences or elements may include, for example, an origin of replication, an antibiotic resistance cassette, a polyA sequence, and/or a transcriptional isolator. Some expression constructs suitable for the generation of microvesicle producing cells according to aspects of this invention are described elsewhere herein. Methods and reagents for the generation of additional expression constructs suitable for the generation of microvesicle producing cells according to aspects of this invention will be apparent to those of skill in the art based on the present disclosure. In some embodiments, the microvesicle producing cell is a mammalian cell, for example, a mouse cell, a rat cell, a hamster cell, a rodent cell, or a nonhuman primate cell. In some embodiments, the microvesicle producing cell is a human cell.
One skilled in the art may employ conventional techniques, such as molecular or cell biology, virology, microbiology, and recombinant DNA techniques. Exemplary techniques are explained fully in the literature. For example, one may rely on the following general texts to make and use the invention: Sambrook et al., Molecular Cloning: A Laboratory Manual, Second Edition (1989) Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y., and Sambrook et al., Third Edition (2001); DNA Cloning: A Practical Approach, Volumes I and II (D. N. Glover ed. 1985); Oligonucleotide Synthesis (M. J. Gaited. 1984); Nucleic Acid Hybridization (B. D. Hames & S. J. Higgins eds. (1985)); Transcription And Translation Hames & Higgins, eds. (1984); Animal Cell Culture (R I. Freshney, ed. (1986)); Immobilized Cells And Enzymes (IRL Press, (1986)); Gennaro et al., (eds.) Remington's Pharmaceutical Sciences, 18th edition; B. Perbal, A Practical Guide To Molecular Cloning (1984); F. M. Ausubel et al., (eds.), Current Protocols in Molecular Biology, John Wiley & Sons, Inc. (updates through 2001), Coligan et al., (eds.), Current Protocols in Immunology, John Wiley & Sons, Inc. (updates through 2001); W. Paul et al., (eds.) Fundamental Immunology, Raven Press; E. J. Murray et al., (ed.) Methods in Molecular Biology: Gene Transfer and Expression Protocols, The Humana Press Inc. (1991)(especially vol. 7); and J. E. Celis et al., Cell Biology: A Laboratory Handbook, Academic Press (1994).
The inventive microvesicles (e.g., ARMMs containing any of the expression constructs and/or any of the payload of molecules (e.g., biological molecules, small molecules, proteins, and nucleic acids (e.g., DNA, RNA), DNA plasmids siRNA, mRNA)), may further have a targeting moiety. The targeting moiety may be used to target the delivery of ARMMs to specific cell types, resulting in the release of the contents of the ARMM into the cytoplasm of the specific targeted cell type. A targeting moiety may be a viral envelope protein that normally function to aid viral attachment and entry into cells. The viral envelope protein may allow for the targeting of cells of the CNS. Viral envelope proteins include, but are not limited to, vesicular stomatitis virus G protein (VSV-G; Genbank Accession and Version Number: AJ318514.1) or rabies virus glycoprotein (RVG; Genbank Accession and Version Number: M38452.1). VSV-G protein, facilitates viral entry by mediating viral attachment to an LDL receptor (LDLR) or an LDLR family member present on a target cell. Subsequent to binding, the VSV-G-LDLR complex is rapidly endocytosed and proceeds to mediate fusion of the viral envelope with the endosomal membrane. VSV-G enters the cell through partially clathrin-coated vesicles; virus-containing vesicles contain more clathrin and clathrin adaptor than conventional vesicles. VSV-G is a common coat protein for vector expression systems used to introduce genetic material into in vitro systems or animal models, mainly because of its extremely broad tropism. RVG is a trimeric and surface-exposed viral coat protein known to use the nicotinic acetylcholine receptor and the low affinity nerve growth factor receptor for viral entry. In some embodiments, the viral envelope proteins (e.g., VSV-G, RVG) facilitate binding (e.g., targeting) of cells of the nervous system, such as CNS and PNS.
A targeting moiety may selectively bind an antigen of the target nervous system cell. For example, the targeting moiety may be a membrane-bound immunoglobulin, an integrin, a receptor, a receptor ligand, an aptamer, a small molecule, or a variant thereof. Any number of cell surface proteins may also be included in an ARMM to facilitate the binding of an ARMM to a target cell and/or to facilitate the uptake of an ARMM into a target nervous system cell. Integrins, receptor tyrosine kinases, G-protein coupled receptors, and membrane-bound immunoglobulins suitable for use with embodiments of this invention will be apparent to those of skill in the art and the invention is not limited in this respect. For example, in some embodiments, the integrin is an α1β1, α2β1, α4β1, α5β1, α6β1, αLβ2, αMβ2, αIIbβ3, αVβ3, αVβ5, αVβ6, or a α6β4 integrin. In some embodiments, the receptor tyrosine kinase is a an EGF receptor (ErbB family), insulin receptor, PDGF receptor, FGF receptor, VEGF receptor, HGF receptor, Trk receptor, Eph receptor, AXL receptor, LTK receptor, TIE receptor, ROR receptor, DDR receptor, RET receptor, KLG receptor, RYK receptor, or MuSK receptor. In some embodiments, the G-protein coupled receptor is a rhodopsin-like receptor, the secretin receptor, metabotropic glutamate/pheromone receptor, cyclic AMP receptor, frizzled/smoothened receptor, CXCR4, CCR5, or beta-adrenergic receptor.
Additional molecules, such as synthetic small molecules or natural products, can be modified to associate with an ARMM protein (e.g., TSG101 or ARRDC1) for the purpose of targeting. This association can facilitate their incorporation into ARMMs, which in turn can be used to deliver the molecule to a target cell. Incorporation of a cleavable linker may be used to allow the small molecule to be released upon delivery in a target cell. As a non-limiting example, a small molecule can be linked to biotin, thereby allowing it to associate with an ARRDC1 protein which is fused to a streptavidin. As another non-limiting example, a small molecule can be linked to synthetic high affinity ligand that specifically binds to a mutant form of FKBP12 such as FKBP12(F36V) (Yang W, Rozamus L W, Narula S, Rollins C T, Yuan R, Andrade L J, Ram M K, Phillips T B, van Schravendijk M R, Dalgarno D, Clackson T, Holt D A. Investigating protein-ligand interactions with a mutant FKBP possessing a designed specificity pocket. J Med Chem. 2000 Mar. 23; 43(6):1135-42), which will associate with an ARRDC1 protein which is fused to FKBP12(F36V). The association of the small molecule to an ARMM protein (e.g., TSG101 or ARRDC1), facilitates loading of the small molecule into the ARRDC1-containing ARMM.
Some aspects of this invention relate to the recognition that ARMMs are taken up by target cells (e.g., cells of the CNS), and ARMM uptake results in the release of the contents of the ARMM into the cytoplasm of the target cells (e.g., cells of the CNS). In some embodiments, the payload is an agent that affects a desired change in the target cell, for example, a change in cell survival, proliferation rate, a change in differentiation stage, a change in a cell identity, a change in chromatin state, a change in the transcription rate of one or more genes, a change in the transcriptional profile, or a post-transcriptional change in gene compression of the target cell. It will be understood by those of skill in the art, that the agent to be delivered will be chosen according to the desired effect in the target cell.
In some embodiments, cells from a subject are obtained and a payload is delivered to the cells by a system or method provided herein ex vivo. In some embodiments, the treated cells are selected for those cells in which a desired gene is expressed or repressed. In some embodiments, treated cells carrying a desired payload protein or payload RNA are returned to the subject they were obtained from.
In some embodiments, the ARMMs comprising any of the fusion proteins, any of the binding RNAs, any of the payload RNAs, and/or any of the binding RNAs fused to any of the payload RNAs, described herein, further include a detectable label. Such ARMMs allow for the labeling of a target cell without genetic manipulation. Detectable labels suitable for direct delivery to target cells are known in the art, and include, but are not limited to, fluorescent proteins, fluorescent dyes, membrane-bound dyes, and enzymes, for example, membrane-bound or cytosolic enzymes, catalyzing the reaction resulting in a detectable reaction product. Detectable labels suitable according to some aspects of this invention further include membrane-bound antigens, for example, membrane-bound ligands that can be detected with commonly available antibodies or antigen binding agents.
In some embodiments, ARMMs are provided that comprise a payload RNA that encodes a transcription factor, a transcriptional repressor, a fluorescent protein, a kinase, a phosphatase, a protease, a ligase, a chromatin modulator, or a recombinase. In some embodiments, ARMMs are provided that comprise a payload RNA (e.g., an siRNA) that inhibits expression of a transcription factor, a transcriptional repressor, a fluorescent protein, a kinase, a phosphatase, a protease, a ligase, a chromatin modulator, or a recombinase. In some embodiments, the payload RNA is a therapeutic RNA. In some embodiments the payload RNA is an RNA that affects a change in the state or identity of a target cell. For example, in some embodiments, the payload RNA encodes a reprogramming factor. Suitable transcription factors, transcriptional repressors, fluorescent proteins, kinases, phosphatases, proteases, ligases, chromatin modulators, recombinases, and reprogramming factors may be encoded by a payload RNA that is associated with a binding RNA to facilitate their incorporation into ARMMs and their function may be tested by any methods that are known to those skilled in the art, and the invention is not limited in this respect.
Methods for isolating the ARMMs described herein are also provided. One exemplary method includes collecting the culture medium, or supernatant, of a cell culture comprising microvesicle-producing cells. In some embodiments, the cell culture comprises cells obtained from a subject, for example, cells suspected to exhibit a pathological phenotype, for example, a hyperproliferative phenotype. In some embodiments, the cell culture comprises genetically engineered cells producing ARMMs, for example, cells expressing a recombinant ARMM protein, for example, a recombinant ARRDC1 or TSG101 protein, such as an ARRDC1 or TSG101 protein fused to an RNA binding protein (e.g., a Tat protein) or variant thereof. In some embodiments, the supernatant is pre-cleared of cellular debris by centrifugation, for example, by two consecutive centrifugations of increasing G value (e.g., 500G and 2000G). In some embodiments, the method comprises passing the supernatant through a 0.2 μm filter, eliminating all large pieces of cell debris and whole cells. In some embodiments, the supernatant is subjected to ultracentrifugation, for example, at 120,000G for 2 hours, depending on the volume of centrifugate. The pellet obtained comprises microvesicles. In some embodiments, exosomes are depleted from the microvesicle pellet by staining and/or sorting (e.g., by FACS or MACS) using an exosome marker as described herein. Isolated or enriched ARMMs can be suspended in culture media or a suitable buffer, as described herein.
Some aspects of this invention provide a method of delivering an agent to a target cell of the nervous system. The target cell can be contacted with an ARMM in different ways. For example, a target cell may be contacted directly with an ARMM as described herein, or with an isolated ARMM from a microvesicle producing cell. The contacting can be done in vitro by administering the ARMM to the target cell in a culture dish, or in vivo by administering the ARMM to a subject (e.g., parenterally or non-parenterally). In some embodiments, an ARMM is produced from a cell obtained from a subject. In some embodiments, the ARMM that was produced from a cell that was obtained from the subject is administered to the subject from which the ARMM producing cell was obtained. In some embodiments, the ARMM that was produced from a cell that was obtained from the subject is administered to a subject different from the subject from which the ARMM producing cell was obtained. As one example, a cell may be obtained from a subject and engineered to express one or more of the constructs provided herein (e.g., engineered to express a payload RNA associated with a binding RNA, an ARRDC1 protein, an ARRDC1 protein fused to an RNA binding protein, and/or an RNA binding protein fused to a WW domain). The cell obtained from the subject and engineered to express one or more of the constructs provided herein may be administered to the same subject, or a different subject, from which the cell was obtained. Alternatively, the cell obtained from the subject and engineered to express one or more of the constructs provided herein produces ARMMs, which may be isolated and administered to the same subject form which the cell was obtained or administered to a different subject from which the cell was obtained.
Alternatively, a target cell of the nervous system can be contacted with a microvesicle producing cell as described herein, for example, in vitro by co-culturing the target cell and the microvesicle producing cell, or in vivo by administering a microvesicle producing cell to a subject harboring the target cell. Accordingly, the method may include contacting the target cell with a microvesicle, for example, an ARMM containing any of the payload to be delivered, as described herein. The target cell may be contacted with a microvesicle-producing cell, as described herein, or with an isolated microvesicle that has a lipid bilayer, an ARRDC1 protein or variant thereof, a payload and an viral envelope protein.
It should be appreciated that the target cell of the nervous system may be of any origin, for example, from an organism. In some embodiments, the target cell is a mammalian cell. Some non-limiting examples of a mammalian cell include, without limitation, a mouse cell, a rat cell, hamster cell, a rodent cell, and a nonhuman primate cell. In some embodiments, the target cell is a human cell. It should also be appreciated that the target cell may be of any cell type of the nervous system. In other cases, the target cell may be any differentiated cell type found in a subject. In some embodiments, the target cell is a cell in vitro, and the method includes administering the microvesicle to the cell in vitro, or co-culturing the target cell with the microvesicle-producing cell in vitro. In some embodiments, the target cell is a cell in a subject, and the method comprises administering the microvesicle or the microvesicle-producing cell to the subject. In some embodiments, the subject is a mammalian subject, for example, a rodent, a mouse, a rat, a hamster, or a non-human primate. In some embodiments, the subject is a human subject.
In some embodiments, the target cell is a pathological cell. In some embodiments, the target cell is a cancer cell. In some embodiments, the microvesicle is associated with a binding agent that selectively binds an antigen on the surface of the target cell. In some embodiments, the antigen of the target cell is a cell surface antigen. In some embodiments, the binding agent is a membrane-bound immunoglobulin, an integrin, a receptor, or a receptor ligand. Suitable surface antigens of target cells (e.g., cells of the nervous system), for example of specific target cell types, e.g., cancer cells, are known to those of skill in the art, as are suitable binding agents that specifically bind such antigens. Methods for producing membrane-bound binding agents, for example, membrane-bound immunoglobulins, membrane-bound antibodies or antibody fragments that specifically bind a surface antigen expressed on the surface of cancer cells, are also known to those of skill in the art. The choice of the binding agent will depend, of course, on the identity or the type of target cell. Cell surface antigens specifically expressed on various types of nervous system cells that can be targeted by ARMMs comprising membrane-bound binding agents will be apparent to those of skill in the art. It will be appreciated that the present invention is not limited in this respect.
In some embodiments, the target cells of the nervous system includes disease targets. For example, a non-limiting example of genetic targets for ARMM therapeutics for Alzheimer's disease (familial forms and late onset) and related indications include: APP, PSEN1, PSEN2, APOE (e2), APOE (e3), APOE (e4), ADAMTS4, HESX1, HS3ST1, HLA-DQB1, NYAP1, CNTNAP2, ECHDC3, ADAM10, APH1B, KAT8, ABI3, SCIMP, ACE, ALPK2, BHMG1, ADAMTS1, IQCK1, CLU, SORL1, ABCA7, TREM2, CD33, MS4A6A, CR1, EPHA1, HLA-DRB1, HLA-DRB5, IL1RAP, INPP5D, PLCG2, CD2AP, BIN1, RIN3, SLC24A4, PICALM, PTK2B, CASS4, ABI3, FERMT2, SPI1, MEF2C, ZCWPW1, NME8, CR1, PICALM. A non-limiting example of genetic targets for ARMM therapeutics for frontotemporal dementia/amyotrophic lateral sclerosis spectrum disorders and related indications include: MAPT, GRN, C9ORF72, SOD1, FUS, UBQLN2, CHCHD10, SQSTM1, VCP, CHMP2B, TBK1, CTSD, CTSF, TRKA, ERBB4, EWSR1, TAF15, HNRNPA1, HNRNPA2B1, ATXN2, OPTN, ANG, SETX, DAO, PFN1, ALS2, VAPB, SIGMAR1, MATR3, NEK1, PFN1, TIA1 TUBA4A. A non-limiting example of genetic targets for ARMM therapeutics for Parkinson's disease and related indications include: SNCA, LRRK2, PARK7, PINK1, PRKN, DJ-1, VPS35, UCHL1, ATP13A2, GBA1. A non-limiting example of genetic targets for ARMM therapeutics for other neurological diseases genetic targets include: SMN1, SMN2, HTT, DMPK, FMR1, MECP2, CIC, TCF4, CNTNAP2, STXBP1, SHANK2, TSC1, TSC2, SPG11, SEPT9, PANK2, PLA2G6, C19orf12, FTL, MR1, SLC2A1, DRD2, GCH1, GCDH, PRKRA, SGCE, THAP1, TOR1A, TAF1, TIMM8A, ACTB, SLC6A3, DYNC1H1, YARS, MPZ, NEFL, PMP22, ARHGEF10, LITAF, EGR2, MFN2, RAB7A, LMNA, GARS, HSPB1, GDAP1, HSPB8, DNM2, SH3TC2, MTMR2, SBF2, NDRG1, PRX, FGD4, FIG4, GJB1, PRPS1, CTDP1, GAN, BSCL2, WNK1, IKBKAP, NTRK1, NGF, SPTLC1, TH. Non-limiting examples of genetic targets for ARMM therapeutics for pain disorders and related indications include: SCN9A, FAAH.
Other aspects of the present disclosure relate to pharmaceutical compositions comprising any of the ARMMs or microvesicle (e.g., ARMM) producing cells provided herein. The term “pharmaceutical composition”, as used herein, refers to a composition formulated for pharmaceutical use. In some embodiments, the pharmaceutical composition further comprises a pharmaceutically acceptable carrier. In some embodiments, the pharmaceutical composition comprises additional agents (e.g., for specific delivery, increasing half-life, or other therapeutic compounds).
As used here, the term “pharmaceutically-acceptable carrier” means a pharmaceutically-acceptable material, composition or vehicle, such as a liquid or solid filler, diluent, excipient, manufacturing aid (e.g., lubricant, talc magnesium, calcium or zinc stearate, or steric acid), or solvent encapsulating material, involved in carrying or transporting the compound from one site (e.g., the delivery site) of the body, to another site (e.g., organ, tissue or portion of the body). A pharmaceutically acceptable carrier is “acceptable” in the sense of being compatible with the other ingredients of the formulation and not injurious to the tissue of the subject (e.g., physiologically compatible, sterile, physiologic pH, etc.). Some examples of materials which can serve as pharmaceutically-acceptable carriers include: (1) sugars, such as lactose, glucose and sucrose; (2) starches, such as corn starch and potato starch; (3) cellulose, and its derivatives, such as sodium carboxymethyl cellulose, methylcellulose, ethyl cellulose, microcrystalline cellulose and cellulose acetate; (4) powdered tragacanth; (5) malt; (6) gelatin; (7) lubricating agents, such as magnesium stearate, sodium lauryl sulfate and talc; (8) excipients, such as cocoa butter and suppository waxes; (9) oils, such as peanut oil, cottonseed oil, safflower oil, sesame oil, olive oil, corn oil and soybean oil; (10) glycols, such as propylene glycol; (11) polyols, such as glycerin, sorbitol, mannitol and polyethylene glycol (PEG); (12) esters, such as ethyl oleate and ethyl laurate; (13) agar; (14) buffering agents, such as magnesium hydroxide and aluminum hydroxide; (15) alginic acid; (16) pyrogen-free water; (17) isotonic saline; (18) Ringer's solution; (19) ethyl alcohol; (20) pH buffered solutions; (21) polyesters, polycarbonates and/or polyanhydrides; (22) bulking agents, such as polypeptides and amino acids (23) serum component, such as serum albumin, HDL and LDL; (22) C2-C12 alcohols, such as ethanol; and (24) other non-toxic compatible substances employed in pharmaceutical formulations. Wetting agents, coloring agents, release agents, coating agents, sweetening agents, flavoring agents, perfuming agents, preservative and antioxidants can also be present in the formulation. The terms such as “excipient”, “carrier”, “pharmaceutically acceptable carrier” or the like are used interchangeably herein.
In some embodiments, the pharmaceutical composition is formulated for delivery to a subject, e.g., for delivering a payload protein or payload RNA (e.g., a payload RNA that expresses a tumor suppressor) to a cell. Suitable routes of administrating the pharmaceutical composition described herein include, without limitation: topical, subcutaneous, transdermal, intradermal, intralesional, intraarticular, intraperitoneal, intravesical, transmucosal, gingival, intradental, intracochlear, transtympanic, intraorgan, epidural, intrathecal, intramuscular, intravenous, intravascular, intraosseus, periocular, intratumoral, intracerebral, and intracerebroventricular administration.
In some embodiments, the pharmaceutical composition described herein is administered locally to a diseased site (e.g., cell of the nervous system). In some embodiments, the pharmaceutical composition described herein is administered to a subject by injection, by means of a catheter, by means of a suppository, or by means of an implant, the implant being of a porous, non-porous, or gelatinous material, including a membrane, such as a sialastic membrane, or a fiber.
In some embodiments, the pharmaceutical composition is formulated in accordance with routine procedures as a composition adapted for intravenous or subcutaneous administration to a subject, e.g., a human. In some embodiments, pharmaceutical composition for administration by injection are solutions in sterile isotonic aqueous buffer. Where necessary, the pharmaceutical can also include a solubilizing agent and a local anesthetic such as lidocaine to ease pain at the site of the injection. Where the pharmaceutical is to be administered by infusion, it can be dispensed with an infusion bottle containing sterile pharmaceutical grade water or saline. Where the pharmaceutical composition is administered by injection, an ampoule of sterile water for injection or saline can be provided so that the ingredients can be mixed prior to administration.
A pharmaceutical composition for systemic administration may be a liquid, e.g., sterile saline, lactated Ringer's solution or Hank's solution. In addition, the pharmaceutical composition can be in solid forms and re-dissolved or suspended immediately prior to use. Lyophilized forms are also contemplated.
The pharmaceutical composition described herein may be administered or packaged as a unit dose, for example. The term “unit dose” when used in reference to a pharmaceutical composition of the present disclosure refers to physically discrete units suitable as unitary dosage for the subject, each unit containing a predetermined quantity of active material calculated to produce the desired therapeutic effect in association with the required diluent, i.e., carrier, or vehicle.
Further, the pharmaceutical composition can be provided as a pharmaceutical kit comprising (a) a container containing an ARMM or microvesicle producing cell of the invention and (b) a second container containing a pharmaceutically acceptable diluent (e.g., sterile water) for injection. The pharmaceutically acceptable diluent can be used e.g., for reconstitution or dilution of the ARMM or microvesicle producing cell of the invention. Optionally associated with such container(s) can be a notice in the form prescribed by a governmental agency regulating the manufacture, use or sale of pharmaceuticals or biological products, which notice reflects approval by the agency of manufacture, use or sale for human administration.
In another aspect, an article of manufacture containing materials useful for the treatment of the diseases described above is included. In some embodiments, the article of manufacture comprises a container and a label. Suitable containers include, for example, bottles, vials, syringes, and test tubes. The containers may be formed from a variety of materials such as glass or plastic. In some embodiments, the container holds a composition that is effective for treating a disease described herein and may have a sterile access port. For example, the container may be an intravenous solution bag or a vial having a stopper pierceable by a hypodermic injection needle. The active agent in the composition is a compound of the invention. In some embodiments, the label on or associated with the container indicates that the composition is used for treating the disease of choice. The article of manufacture may further comprise a second container comprising a pharmaceutically-acceptable buffer, such as phosphate-buffered saline, Ringer's solution, or dextrose solution. It may further include other materials desirable from a commercial and user standpoint, including other buffers, diluents, filters, needles, syringes, and package inserts with instructions for use.
Some aspects of this disclosure provide kits comprising a nucleic acid construct comprising a nucleotide sequence encoding one or more of any of the proteins (e.g., ARRDC1, and TSG101), fusion proteins and/or nucleic acids provided herein. In some embodiments, the nucleotide sequence encodes any of the proteins, fusion proteins, and/or RNAs provided herein. In some embodiments, the nucleotide sequence comprises a heterologous promoter that drives expression of any of the proteins, fusion proteins, and/or RNAs provided herein.
Some aspects of this disclosure provide microvesicle (e.g., ARMM) producing cells comprising any of the proteins, fusion proteins, and/or RNAs provided herein. In some embodiments, the cells comprise a nucleotide that encodes any of the proteins, fusion proteins, and/or RNAs provided herein. In some embodiments, the cells comprise any of the nucleotides or vectors provided herein. In some embodiments, the vector comprise viral targeting proteins.
It should be appreciated however, that additional proteins, fusion proteins, and RNAs would be apparent to the skilled artisan based on the present disclosure and knowledge in the art.
The function and advantage of these and other embodiments of the present invention will be more fully understood from the Examples below. The following Examples are intended to illustrate the benefits of the present invention and to describe particular embodiments but are not intended to exemplify the full scope of the invention. Accordingly, it will be understood that the Examples are not meant to limit the scope of the invention.
A methodology has been created wherein fibroblasts are isolated from a subject, by means of a skin biopsy (although other isolation techniques may be used), cells are reprogrammed into induced pluripotent stem cells (iPSCs) and directed to differentiate into neural progenitor cells (
An imaging system has also been developed for analyzing the results of an ARMM-mediated payload screen. Automated confocal microscopy was used in an assay to analyze high-content single-cell level imaging (
A platform has been developed for using ARRDC1-mediated microvesicles (ARMMs) for delivery of payload of molecules (e.g., biological molecules, such as proteins, nucleic acids (e.g., DNA, RNA, DNA plasmids, siRNA, mRNA), editing complexes, and small molecules to various nervous system cell types, such as cells of the CNS. The ARMMs are loaded with one of these molecules as the payload and used to deliver the payload to the cells of the nervous system, such as cells of the CNS (
The viral envelope protein VSV-G was analyzed to determine if it could be used to enhance uptake of ARMMs containing molecular payloads. ARMMs were added in various concentrations (as shown across the top of the plate) as four experimental sets, blank, ARRDC1, ARRDC1-GFP, and ARRDC1-GFP-VSV-G, for delivery to neural progenitor cells and incubated for 24 hours (24 h) (
Payload delivery and success of protein translation were further evaluated (
The ARMMs-mediated delivery technology can be adapted to different targets for both gain-of-function disorders (for example, but not limited to, due to mutations or dysfunction of MAPT, SNCA, HTT, ATXN2) and loss-of-function disorders (for example, but not limited to, due to mutations or dysfunction of GRN, GBA1, FMR1, MECP2, TCF4), and repeat expansion (for example, but not limited to, due to mutations in C9orf72) (
An example of the use of CRISPR editors that would be compatible with the ARMM-delivery technology are shown in
It is also possible that FMRP can be delivered to rescue Fragile X syndrome patient neurons using CNS-optimized ARMMs based upon our development of patient-derived iPSC models (
Using proteomics studies of ARMMs from multiple human cells, multiple proteins that are enriched in the ARMM vesicles were identified. Among the proteins were tetraspanins such as TSPAN14 and TSPAN6. These proteins have multiple extracellular loop regions that allow the insertion of “homing peptide” or other targeting moieties, thus enabling the potential targeting of ARMMs to specific cells/tissues. A fusion construct was developed, in which the rabies viral glycoprotein (RVG) peptide along with a HA tag was inserted into the second extracellular loop of TSPAN6 (
HEK293T cells (2×106 cells/plate) were transfected with ARRDC1 (A1), or ARRDC1 along with TAR-GFP, in the presence or absence of VSV-G, in accordance with Table 1.
Extracellular vesicles were isolated using ultracentrifugation (
Starting with human induced pluripotent stem cells (iPSCs), 3-dimensional (3D), cerebral tissue-like containing neurons from both deep and superficial cortical layers along with astrocytes were differentiated following published methodology (Paşca A M, Sloan S A, Clarke L E, Tian Y, Makinson C D, Huber N, Kim C H, Park J Y, O'Rourke N A, Nguyen K D, Smith S J, Huguenard J R, Geschwind D H, Barres B A, Paca S P. Functional cortical neurons and astrocytes from human pluripotent stem cells in 3D culture. Nature Methods. 2015 July; 12(7):671-8). After maturation en masse for 9 months in cerebral organoid maturation media (Neurobasal A, GlutaMax, B-27 supplement without vitamin A, penicillin/streptomycin), single organoids were isolated then exposed to ARRDC1-GFP-VSVG extracellular vesicles (14.3 mL) for either 24 hrs or 48 hours in a 96-well plate format. Single organoids were then dissociated with Accutase and the resulting cells harvested and allowed to attach to laminin/poly-L-ornithine-coated 96-well plates for 24 hrs. Cells were then imaged under transmitted light or to detect GFP fluorescence using an automated confocal microscope (IN Cell Analyzer 6000).
As shown in
All publications, patents and sequence database entries mentioned herein, including those items listed above, are hereby incorporated by reference in their entirety as if each individual publication or patent was specifically and individually indicated to be incorporated by reference. In case of conflict, the present application, including any definitions herein, will control.
Those skilled in the art will recognize or be able to ascertain using no more than routine experimentation, many equivalents to the specific embodiments of the invention described herein. The scope of the present invention is not intended to be limited to the above description, but rather is as set forth in the appended claims.
In the claims, articles such as “a,” “an,” and “the” may mean one or more than one unless indicated to the contrary or otherwise evident from the context. Claims or descriptions that include “or” between one or more members of a group are considered satisfied if one, more than one, or all of the group members are present in, employed in, or otherwise relevant to a given product or process unless indicated to the contrary or otherwise evident from the context. The invention includes embodiments in which exactly one member of the group is present in, employed in, or otherwise relevant to a given product or process. The invention also includes embodiments in which more than one, or all of the group members are present in, employed in, or otherwise relevant to a given product or process.
Furthermore, it is to be understood that the invention encompasses all variations, combinations, and permutations in which one or more limitations, elements, clauses, descriptive terms, etc., from one or more of the claims or from relevant portions of the description is introduced into another claim. For example, any claim that is dependent on another claim can be modified to include one or more limitations found in any other claim that is dependent on the same base claim. Furthermore, where the claims recite a composition, it is to be understood that methods of using the composition for any of the purposes disclosed herein are included, and methods of making the composition according to any of the methods of making disclosed herein or other methods known in the art are included, unless otherwise indicated or unless it would be evident to one of ordinary skill in the art that a contradiction or inconsistency would arise.
Where elements are presented as lists, e.g., in Markush group format, it is to be understood that each subgroup of the elements is also disclosed, and any element(s) can be removed from the group. It is also noted that the term “comprising” is intended to be open and permits the inclusion of additional elements or steps. It should be understood that, in general, where the invention, or aspects of the invention, is/are referred to as comprising particular elements, features, steps, etc., certain embodiments of the invention or aspects of the invention consist, or consist essentially of, such elements, features, steps, etc. For purposes of simplicity those embodiments have not been specifically set forth in haec verba herein. Thus, for each embodiment of the invention that comprises one or more elements, features, steps, etc., the invention also provides embodiments that consist or consist essentially of those elements, features, steps, etc.
Where ranges are given, endpoints are included. Furthermore, it is to be understood that unless otherwise indicated or otherwise evident from the context and/or the understanding of one of ordinary skill in the art, values that are expressed as ranges can assume any specific value within the stated ranges in different embodiments of the invention, to the tenth of the unit of the lower limit of the range, unless the context clearly dictates otherwise. It is also to be understood that unless otherwise indicated or otherwise evident from the context and/or the understanding of one of ordinary skill in the art, values expressed as ranges can assume any subrange within the given range, wherein the endpoints of the subrange are expressed to the same degree of accuracy as the tenth of the unit of the lower limit of the range.
In addition, it is to be understood that any particular embodiment of the present invention may be explicitly excluded from any one or more of the claims. Where ranges are given, any value within the range may explicitly be excluded from any one or more of the claims. Any embodiment, element, feature, application, or aspect of the compositions and/or methods of the invention, can be excluded from any one or more claims. For purposes of brevity, all of the embodiments in which one or more elements, features, purposes, or aspects is excluded are not set forth explicitly herein.
The present application claims priority under 35 U.S.C. § 119(e) to U.S. provisional application, U.S. Ser. No. 63/038,461, filed Jun. 12, 2020, which is incorporated herein by reference.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/US2021/037053 | 6/11/2021 | WO |
Number | Date | Country | |
---|---|---|---|
63038461 | Jun 2020 | US |