METHODS AND COMPOSITIONS FOR TREATING BAG-3 RELATED CARDIOMYOPATHY WITH A VIRAL VECTOR

Abstract
In several embodiments, the present disclosure relates to nucleic acids, compositions, and methods for the delivery of a therapeutic gene to a subject. In several embodiments, the therapeutic gene is through the use of a viral vector. In several embodiments, the viral vector is an adeno-associated virus. In several embodiments, the therapeutic gene is delivered to treat a cardiac disease, injury or other disorder.
Description
INCORPORATION BY REFERENCE OF MATERIAL IN SEQUENCE LISTING

This application incorporates the material provided in the accompanying XML file entitled U120270086WO00-SEQ-PRW.xml, created Dec. 8, 2022, which is 149,589 bytes in size.


BACKGROUND

Cardiomyopathy represents a collection of diverse conditions of the heart muscle and is the second most common cause of heart disease in subjects and medical management of the secondary signs is the only therapeutic option. These diseases have many causes, symptoms, and treatments, and can affect people of all ages and races. When cardiomyopathy occurs, the normal muscle in the heart can thicken, stiffen, thin out, or fill with substances the body produces that do not belong in the heart muscle. As a result, the heart muscle's ability to pump blood is reduced, which can lead to irregular heartbeats, the backup of blood into the lungs or rest of the body, and heart failure. Cardiomyopathy can be acquired or inherited. The cause isn't always known but there is an increasing understanding of the genetic underpinnings of inherited forms of disease. Gene transfer strategies have been shown to ameliorate heart diseases.


SUMMARY

Cardiomyopathy is a class of disease of heart muscle that adversely impacts the hearts ability to circulate blood through the cardiovascular system. Various types of cardiomyopathies exist, including dilated cardiomyopathy, hypertrophic cardiomyopathy, and restrictive cardiomyopathy. Cardiomyopathy in human populations is a major medical burden and treatment needs are currently unmet, despite cardiomyopathies in human populations being particularly desirable to treat.


Dilated cardiomyopathy (DCM) is one of the most common types of human cardiomyopathy, occurring mostly in adults 20 to 60. DCM affects the heart's ventricles and atria, the lower and upper chambers of the heart, respectively. Most forms of DCM are acquired forms from a number of causes that include coronary heart disease, heart attack, high blood pressure, diabetes, thyroid disease, viral hepatitis and viral infections that inflame the heart muscle. Alcohol abuse and certain drugs, such as cocaine and amphetamines, as well as at least two drugs used to treat cancer (doxorubicin and daunorubicin), can also lead to DCM. In addition, there are a number of genetic forms of DCM, including, but not limited to the DCM associated with Duchenne and Becker muscular dystrophies. In the case of certain forms of Becker muscular dystrophy, as well as in most cases of Duchenne muscular dystrophy, the cardiomyopathy can ultimately limit the patient's survival.


Hypertrophic cardiomyopathy (HCM) occurs when the walls of the heart muscle become abnormally thick. The increase in wall thickness may increase cardiac complications, as well as block or obstruct blood flowing in the heart.


Restrictive cardiomyopathy (RCM) is a condition leading to a stiffening of the chambers of the heart over time. While the heart's ability to contract remains largely unaffected, the cardiac muscle does not fully relax between beats of the heart. This restricts the ability of the ventricles to fill with blood and causes blood to back up in the circulatory system.


Heart function is critically dependent upon calcium-dependent signaling. During heart disease, malfunctioning of calcium channels within cardiac cells promotes calcium cycling abnormalities, further inhibiting heart function. Gene transfer strategies to reduce calcium cycling abnormalities are reported to ameliorate heart disease in small and large animal models, as well as in human clinical trials.


Disclosed herein are gene delivery approaches for treatment of human subjects with one or more types of cardiomyopathy or symptoms thereof.


Accordingly, some aspects of the present disclosure provide recombinant adeno-associated virus (rAAV) vectors for delivering transgenes into the heart of a subject. Such rAAV vectors may include, from 5′ to 3′, in order, a first adeno-associated virus (AAV) inverted terminal repeat (ITR) sequence, a promoter operably linked to the one or more transgene, and a second AAV inverted terminal repeat (ITR) sequence. In some embodiments, the rAAV vector includes, in addition to a promoter, a regulatory element which modifies expression, e.g., in a manner that provides physiologically relevant expression levels and/or restricts expression to a particular cell type or tissue. In some embodiments, the regulatory element comprises one or more of an enhancer, a 5′ untranslated region (UTR), and a 3′ UTR. In some embodiments, the rAAV vector also includes at least one polyadenylation signal (e.g., positioned 3′ of the transgene). In some embodiments, two transgenes are operably linked to the same single promoter. In some embodiments, each transgene is operably linked to a separate promoter. In some embodiments in which multiple transgenes are provided, the rAAV vector also includes at least one polyadenylation signal (e.g., positioned 3′ of two transgenes expressed from a single promoter or 3′ of one or both transgenes expressed from different promoters). Aspects of the disclosure provide recombinant adeno-associated virus (rAAV) nucleic acid vector for delivering two or more transgenes into the heart of a subject, wherein said vector comprises, from 5′ to 3′, a first adeno-associated virus (AAV) inverted terminal repeat (ITR) sequence, two or more transgenes and a promoter operably linked to the two or more transgenes, a polyadenylation signal, and a second AAV inverted terminal repeat (ITR) sequence


In some embodiments, the therapeutic transgene is encoded by a polynucleotide having at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98% or at least 99% sequence identity to the nucleotide sequence set forth as SEQ ID NO: 21, 101, or 83-85 in order. In some embodiments, one or more of the transgenes of the present disclosure are naturally-occurring sequences. In some embodiments, one or more transgenes are engineered to be species-specific. In some embodiments, one or more transgenes are codon-optimized for expression in a species of interest, e.g. human. For example, in several embodiments, the therapeutic transgene (e.g. the BAG3 transgene) are codon-optimized.


Further provided herein are rAAV particles containing the rAAV vectors disclosed herein, encapsidated in AAV capsids. Other aspects of the present disclosure include compositions containing the rAAV particles described herein. In several embodiments, such compositions may be administered to a subject for gene therapy for cardiomyopathy. In additional embodiments, such compositions may be administered to a subject for gene therapy for heart disease. In some embodiments, the heart disease causes heart failure in the subject.


The compositions of the present disclosure may be administered to the subject via different routes. In some embodiments, the composition is administered via intravenous injection into the subject. In some embodiments, the administration of the composition results in expression of the transgene (or if multiple transgenes are used, expression of, two or more transgenes) in the subject's heart. In various embodiments, the step of administering the composition results in improved cardiac function in the subject, such as improved cardiac function in the subject for more than 10 months. In some embodiments, administration results in improved cardiac function for more than 12 months, more than 14 months, more than 16 months, more than 17 months, more than 20 months, more than 22 months, or more than 24 months. In several embodiments, improved cardiac function is represented by an increase in left ventricular ejection fraction (LVEF). In several embodiments, the LVEF (as compared to a pre-therapy measurement) increases by at least about 1%, about 2%, about 3%, about 4%, about 5% or more (including any amount between those listed). In several embodiments, LVEF is measured by echocardiography. In some embodiments, administration results in improved cardiac physiology (e.g., structural features) for more than 12 months, more than 14 months, more than 16 months, more than 17 months, more than 20 months, more than 22 months, or more than 24 months. In several embodiments, the improved cardiac physiology is represented by a decrease in left ventricular wall thickness. In several embodiments, left ventricular wall thickness is reduced by at least about 1%, about 2%, about 3%, about 4%, about 5% or more (including any amount between those listed). In several embodiments, the left ventricular wall thickness is measured by cardiac magnetic resonance imaging (MRI) or transthoracic echocardiography (TTE).


In some embodiments, described herein are compositions comprising AAV vectors, virions, viral particles, and pharmaceutical formulations thereof, useful in methods for delivering genetic material encoding one or more beneficial or therapeutic product(s) to mammalian cells and tissues. The rAAV vectors, rAAV particles, or the composition comprising the rAAV particles of the present disclosure, may be used for gene therapy for heart diseases in a subject in need thereof, such as one or more types of cardiomyopathy.


Additionally, provided herein are compositions, as well as therapeutic and/or diagnostic kits that include one or more of the disclosed AAV compositions, formulated with one or more additional ingredients, or prepared with one or more instructions for their use.


In some embodiments, described herein is a nucleic acid comprising an expression construct comprising a human BAG3 coding sequence and an enhancer element, such as a CMV enhancer, operably linked to a promoter, wherein the expression construct is flanked on each side by an inverted terminal repeat sequence. In some embodiments, described herein is a nucleic acid comprising an expression construct comprising a human BAG3 coding sequence, an enhancer element operably linked to a promoter, and a Kozak sequence, wherein the Kozak sequence enhances transgene expression in the heart, wherein the expression construct is flanked on each side by an inverted terminal repeat sequence, wherein the Kozak sequence is non-native to the human MYBPC3 coding sequence and/or non-native to the promoter. In several embodiments, the Kozak sequence is a synthetic sequence. In some embodiments, the human BAG3 coding sequence is codon-optimized for expression in human cells. In some embodiments, the promoter comprises a cardiac specific promotor. In some embodiments, the promoter is CBA (Chicken β-Actin). In some embodiments, the promoter is CMV or mini CMV. In some embodiments, the promoter is CK8. In some embodiments, the promoter is MHCK7. In some embodiments, the promoter is CMV, mini-CMV, HSV. TK. RSV, SV40, MMTV, or Ad E1A. In some embodiments, the nucleic acid is a recombinant adeno-associated virus (rAAV) vector. In some embodiments, the nucleic acid is a single-stranded or self-complementary rAAV nucleic acid vector. In some embodiments, the rAAV particle is an AAV9 particle. In some embodiments, the rAAV particle is an rh74 particle. In some embodiments, the rAAV particle is an rh10 particle. In some embodiments, a composition comprising a plurality of rAAV particles is provided. In some embodiments, the plurality of rAAV particles may further comprise a pharmaceutically acceptable carrier. In some embodiments, the rh74 particle comprises at least one capsid protein encoded by a polynucleotide having at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98% or at least 99% sequence identity to the nucleotide sequence set forth as SEQ ID NO: 29, or a portion of SEQ ID NO. 29 (for example, SEQ ID NO: 29 encodes the rh74 VP1, VP2, and VP3 proteins—thus, in several embodiments, an rh74 particle according to embodiments disclosed herein comprises at least one capsid protein encoded by a polynucleotide having at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98% or at least 99% sequence identity to a subpart of the nucleotide sequence of SEQ ID NO: 29). In some embodiments, the rh74 particle comprises an amino acid sequence least about 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98% or at least 99% sequence identity to the amino acid sequence set forth as SEQ ID NO: 27, or a portion of SEQ ID NO. 27 (for example, SEQ ID NO: 27 is the amino acid sequence of rh74 VP1, VP2, and VP3 proteins—thus, in several embodiments, an rh74 particle according to embodiments disclosed herein comprises at least one capsid protein having at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98% or at least 99% sequence identity to a subpart of the amino acid sequence of SEQ ID NO: 27). In some embodiments, the AAV9 particle comprises an amino acid sequence least about 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98% or at least 99% sequence identity to the amino acid sequence set forth as SEQ ID NO: 28.


In some embodiments, a method of treating dilated cardiomyopathy is described, the method comprising administering a therapeutically effective amount of rAAV comprising a nucleic acid expression construct comprising a human BAG3 coding sequence and an enhancer element operably linked to a promoter, wherein the expression construct is flanked on each side by an inverted terminal repeat sequence, and wherein said administration results in expression of a therapeutically effective amount of human BAG3, thereby treating the dilated cardiomyopathy. In some embodiments, the rAAV is administered via intravenous injection. In some embodiments, between about 0.5 and about 5 rAAV vector genomes per cell are administered. In some embodiments, between about 0.5 and about 2 rAAV vector genomes per cell are administered. In some embodiments, between about 1×1013 and about 3×1014 vector genomes per kilo (vgs/kg) are administered.


Also described herein is a method of inducing increased expression of human BAG3 in a target cell, comprising contacting a target cell with a plurality of rAAV particles comprising a nucleic acid expression construct comprising a human BAG3 coding sequence and an enhancer element operably linked to a promoter, wherein the expression construct is flanked on each side by an inverted terminal repeat sequence, and wherein said contacting results in the target cell increasing expression of human BAG3 as compared to prior to the contacting, thereby increasing the expression of human BAG3. In some embodiments, the contacting is in vivo. In some embodiments, the method is used for the treatment of dilated cardiomyopathy. In some embodiments, the nucleic acids, the rAAV particles, the compositions, or the methods of manufacture described herein can be used for the treatment of dilated cardiomyopathy.





BRIEF DESCRIPTION OF THE DRAWINGS


FIG. 1 shows a non-limiting example of gene construct maps for a construct including a sequence for BAG3.



FIG. 2 shows a non-limiting example of gene construct maps for a construct including a sequence for BAG3.



FIG. 3 shows a non-limiting example of gene construct maps for a construct including a sequence for BAG3.



FIG. 4 shows a western blot showing BAG3 protein expression in transfected cells.



FIGS. 5A and 5B show sex differentiated treated mice weights over a study period.



FIG. 6 shows hBAG3 expression in liver.



FIG. 7 shows hBAG3 expression in heart.



FIG. 8 shows hBAG3 expression in gastrocnemius muscle.



FIGS. 9A and 9B show ddPCR results from tissue specific sources for BAG3.





DETAILED DESCRIPTION

Reference is made to particular features and/or non-limiting embodiments of the invention. It is to be understood that the disclosure of the invention in this specification includes all possible combinations of such particular features. For example, where a particular feature is disclosed in the context of a particular aspect or embodiment of the invention, or a particular claim, that feature can also be used, to the extent possible, in combination with and/or in the context of other particular aspects and embodiments of the invention, and in the invention generally.


Definitions

Unless defined otherwise, all technical and scientific terms used herein have the same meaning as is commonly understood by one of ordinary skill in the art. All patents, applications, published applications and other publications referenced herein are incorporated by reference in their entirety unless stated otherwise. In the event that there are a plurality of definitions for a term herein, those in this section prevail unless stated otherwise.


A “subject” refers to mammal that is the object of treatment using a method or composition as provided for herein . . . “Mammal” includes, without limitation, mice, rats, rabbits, guinea pigs, dogs, cats, sheep, goats, cows, horses, primates, such as monkeys, chimpanzees, and apes, and humans. In some embodiments, the subject is human.


The terms “treating.” “treatment,” “therapeutic,” or “therapy” do not necessarily mean total cure or abolition of the disease or condition. Any alleviation of any undesired signs or symptoms of a disease or condition, to any extent can be considered treatment and/or therapy. To “treat” a disease as the term is used herein, means to reduce the frequency or severity of at least one sign or symptom of a disease or disorder experienced by a subject.


The term “effective amount,” as used herein, refers to an amount that is capable of treating or ameliorating a disease or condition or otherwise capable of producing an intended therapeutic effect, such as reducing the frequency or severity of at least one sign or symptom of a disease or disorder experienced by a subject.


A “nucleic acid” sequence refers to a deoxyribonucleic acid (DNA) or or ribonucleic acid (RNA) sequence. The term captures sequences that include any of the known base analogues of DNA and RNA such as, but not limited to 4-acetylcytosine, 8-hydroxy-N6-methyladenosine, aziridinylcytosine, pseudoisocytosine, 5-(carboxyhydroxyl-methyl) uracil, 5-fluorouracil, 5-bromouracil, 5-carboxymethylaminomethyl-2-thiouracil, 5-carboxymethylaminomethyluracil, dihydrouracil, inosine, N6-isopentenyladenine, 1-methyladenine, 1-methylpseudouracil, 1-methylguanine, 1-methylinosine, 2,2-dimethylguanine, 2-methyladenine, 2 methylguanine, 3-methylcytosine. 5-methylcytosine, N6-methyladenine, 7-methylguanine, 5-methylaminomethyluracil, 5-methoxy-aminomethyl-2-thiouracil, beta-D-mannosylqueosine, 5′-methoxycarbonylmethyluracil, 5-methoxyuracil, 2-methylthio-N6-isopentenyladenine, uracil-5-oxyacetic acid methylester, uracil-5-oxyacetic acid, oxybutoxosine, pseudouracil, queosine, 2-thiocytosine, 5-methyl-2-thiouracil, 2-thiouracil, 4-thiouracil, 5-methyluracil, N-uracil-5-oxyacetic acid methylester, uracil-5-oxyacetic acid, pseudouracil, queosine, 2-thiocytosine, and 2,6-diaminopurine.


The term “polynucleotide,” refers to a polymeric form of nucleotides of any length, including DNA, RNA, or analogs thereof. A polynucleotide may comprise modified nucleotides, such as methylated nucleotides and nucleotide analogs, and may be interrupted by non-nucleotide components. If present, modifications to the nucleotide structure may be imparted before or after assembly of the polymer. The term polynucleotide, as used herein, refers interchangeably to double- and single-stranded molecules. Unless otherwise specified or required, any embodiment of the invention described herein that is a polynucleotide encompasses both the double-stranded form and each of two complementary single-stranded forms known or predicted to make up the double-stranded form.


The term “isolated” when referring to a nucleotide sequence, means that the indicated molecule is present in the substantial absence of other biological macromolecules of the same type. Thus, an “isolated nucleic acid molecule which encodes a particular polypeptide” refers to a nucleic acid molecule which is substantially free of other nucleic acid molecules that do not encode the subject polypeptide; however, the molecule may include some additional bases or moieties which do not materially affect the basic characteristics of the composition.


The term “identity” refers to an exact nucleotide-to-nucleotide or amino acid-to-amino acid correspondence of two polynucleotides or polypeptide sequences, respectively. Two or more sequences (polynucleotide or amino acid) can be compared by determining their “percent identity.” The percent identity of two sequences, whether nucleic acid or amino acid sequences, is the number of exact matches between two aligned sequences divided by the length of the shorter sequences and multiplied by 100.


For the purpose of describing the relative position of nucleotide sequences in a particular nucleic acid molecule throughout the instant application, such as when a particular nucleotide sequence is described as being situated “upstream.” “downstream,” “3′,” or “5′” relative to another sequence, it is to be understood that it is the position of the sequences in the “sense” or “coding” strand of a DNA molecule that is being referred to as is conventional in the art.


Sequence identity can be determined by aligning sequences using algorithms, such as BESTFIT, FASTA, and TFASTA in the Wisconsin Genetics Software Package Release 7.0, Genetics Computer Group, 575 Science Dr., Madison, Wis.), using default gap parameters, or by inspection, and the best alignment (i.e., resulting in the highest percentage of sequence similarity over a comparison window). Percentage of sequence identity is calculated by comparing two optimally aligned sequences over a window of comparison, determining the number of positions at which the identical residues occurs in both sequences to yield the number of matched positions, dividing the number of matched positions by the total number of matched and mismatched positions not counting gaps in the window of comparison (i.e., the window size), and multiplying the result by 100 to yield the percentage of sequence identity. Unless otherwise indicated the window of comparison between two sequences is defined by the entire length of the shorter of the two sequences.


The term “recombinant,” as applied to a polynucleotide means that the polynucleotide is the product of various combinations of cloning, restriction or ligation steps, and other procedures that result in a construct that is distinct from a polynucleotide found in nature and/or a combination of polynucleotides and viral proteins that is not found in nature. A recombinant virus is a viral particle comprising a recombinant polynucleotide. The terms respectively include replicates of the original polynucleotide construct and progeny of the original virus construct.


The term “gene,” refers to a polynucleotide containing at least one open reading frame that is capable of encoding a particular gene product. Any of the polynucleotide sequences described herein may be used to identify larger fragments or full-length coding sequences of the genes with which they are associated. Methods of isolating larger fragment sequences are known to those of skill in the art.


The term “transgene,” as used herein, refers to a nucleic acid sequence to be positioned within a viral vector and encoding a polypeptide, protein or other product of interest. In some embodiments, one rAAV vector may comprise a sequence encoding one or more transgenes (which can optionally be the same gene, or different genes). For example, one rAAV vector may comprise the coding sequence for 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10 transgenes. The transgenes of the present disclosure relate to the improvement of one or more heart conditions, such as cardiomyopathies as provided for herein.


The terms “gene transfer” or “gene delivery” refer to methods or systems for inserting DNA, such as a transgene, into host cells, such as those of a subject afflicted with a cardiomyopathy. In several embodiments, gene transfer yields transient expression of non-integrated transferred DNA, extrachromosomal replication and expression of transferred replicons (e.g., episomes). In additional embodiments, gene transfer results in integration of transferred genetic material into the genomic DNA of host cells.


The terms “regulatory element” or “regulatory sequence”, or variations thereof, refer to a nucleotide sequence that participates in functional regulation of a polynucleotide, including replication, duplication, transcription, splicing, translation, or degradation of the polynucleotide. Regulatory elements can be enhancing or inhibitory in nature, depending on the embodiment. Non-limiting examples of regulatory elements include transcriptional regulatory sequences such as promoter sequences, polyadenylation signals, transcription termination sequences, upstream regulatory domains, origins of replication, internal ribosome entry sites (“IRES”), enhancers, and the like. These elements collectively provide for the replication, transcription and translation of a coding sequence in a recipient cell, though not all of these sequences need always be present. It shall be appreciated that the structural components of a rAAV vector as provided for herein may be listed in individual paragraphs solely for clarity and may be used together in combination. For example, any regulatory element or other component can be used in combination with any transgene (or transgenes) provided for herein.


A “promoter” is a polynucleotide that interacts with an RNA polymerase and initiates transcription of a coding region (e.g., a transgene) usually located downstream (in the 3′ direction) from the promoter.


The term “operably linked” refers to an arrangement of elements wherein the components are configured to perform a function. For example, regulatory sequences operably linked to a coding sequence result in the expression of the coding sequence. Depending on the embodiment, a regulatory sequence need not be contiguous with the coding sequence. Thus, for example, one or more untranslated, yet transcribed, sequences can be present between a promoter sequence and a coding sequence, with those two sequence still being considered “operably linked”.


The term “vector” means any molecular vehicle, such as a plasmid, phage, transposon, cosmid, chromosome, virus, viral particle, virion, etc. which can transfer gene sequences (e.g., a transgene) to or between cells of interest.


An “expression vector” is a vector comprising a region of nucleic acid (e.g., a transgene) which encodes a gene product (e.g., a polypeptide or protein) of interest. As disclosed herein, vectors are used for achieving expression, e.g., stable expression, of a protein in an intended target cell. An expression vector may also comprise control elements operatively linked to the transgene to facilitate expression of the encoded protein in the target cell. A combination of one or more regulatory elements and a gene or genes to which they are operably linked for expression may be referred to herein as an “expression cassette.”


The term “AAV” is an abbreviation for adeno-associated virus, and may be used to refer to the virus itself or derivatives thereof. The term covers all subtypes and both naturally occurring and recombinant forms, unless otherwise indicated. The abbreviation “rAAV” refers to recombinant adeno-associated virus, also referred to as a recombinant AAV vector (or “rAAV vector”), which refers to AAV comprising a polynucleotide sequence not of AAV origin (e.g. a transgene). The term “AAV” includes AAV serotype 1 (AAV-1), AAV serotype 2 (AAV-2), AAV serotype 3 (AAV-3), AAV serotype 4 (AAV-4), AAV serotype 5 (AAV-5), AAV serotype 6 (AAV-6), AAV serotype 7 (AAV-7), AAV serotype 8 (AAV-8), AAV serotype 9 (AAV-9), serotype rh10 AAV, serotype rh74 AAV, or a pseudotyped rAAV (e.g., AAV2/9, referring an AAV vector with the genome of AAV2 (e.g., the ITRs of AAV2) and the capsid of AAV9). In several embodiments, the preferred serotype for delivery to human patients affected by a cardiomyopathy is one of AAV-9, serotype rh74, serotype rh10, or AAV-8. In several embodiments, an rh74 AAV is mutated to advantageously enhance delivery to cardiac tissue, for example by a tryptophan to arginine mutation at amino acid 505 of VP1 capsid, or other mutations, as described in PCT Publication WO 2019/178412, which is incorporated in its entirety by reference herein.


The term “AAV virus” or “AAV viral particle” or “rAAV vector particle” refers to a viral particle composed of at least AAV capsid protein and an encapsidated polynucleotide.


The term “heterologous” refers to genotypically distinct origins. For example, a heterologous polynucleotide is one derived from a different species as compared to a reference species (for example a human gene inserted into a viral plasmid is a heterologous gene). A promoter removed from its native coding sequence and operatively linked to a coding sequence with which it is not naturally found linked is a heterologous promoter.


As used herein, the term “kit” may be used to describe variations of the portable, self-contained enclosure that includes at least one set of components to conduct one or more of the diagnostic or therapeutic methods of the present disclosure.


The term “carrier” refers to a diluent, adjuvant, excipient, or vehicle with which the rAAV particle or preparation, and/or rAAV vectors is administered. Such pharmaceutical carriers can be sterile liquids, such as water and oils, including those of petroleum oil such as mineral oil, vegetable oil such as peanut oil, soybean oil, and sesame oil, animal oil, or oil of synthetic origin. Saline solutions and aqueous dextrose and glycerol solutions may also be employed as liquid carriers.


In some embodiments, sequences recited herein are CpG depleted, and cDNA codon optimized. In some embodiments, the sequences encoding BAG3 are optionally CpG depleted.


Terms and phrases used in this application, and variations thereof, especially in the appended claims, unless otherwise expressly stated, should be construed as open ended as opposed to limiting. As examples of the foregoing, the term ‘including’ should be read to mean ‘including, without limitation,’ ‘including but not limited to,’ or the like; the term ‘comprising’ as used herein is synonymous with ‘including,’ ‘containing,’ or ‘characterized by,’ and is inclusive or open-ended and does not exclude additional, unrecited elements or method steps; the term ‘having’ should be interpreted as ‘having at least;’ the term ‘includes’ should be interpreted as ‘includes but is not limited to;’ the term ‘example’ is used to provide exemplary instances of the item in discussion, not an exhaustive or limiting list thereof; and use of terms like ‘preferably,’ ‘preferred.’ ‘desired,’ or ‘desirable,’ and words of similar meaning should not be understood as implying that certain features are critical, essential, or even important to the structure or function, but instead as merely intended to highlight alternative or additional features that may or may not be utilized in a particular embodiment. In addition, the term “comprising” is to be interpreted synonymously with the phrases “having at least” or “including at least”. When used in the context of a process, the term “comprising” means that the process includes at least the recited steps, but may include additional steps. When used in the context of a compound, composition or device, the term “comprising” means that the compound, composition, or device includes at least the recited features or components, but may also include additional features or components. Likewise, a group of items linked with the conjunction ‘and’ should not be read as requiring that each and every one of those items be present in the grouping, but rather should be read as ‘and/or’ unless expressly stated otherwise. Similarly, a group of items linked with the conjunction ‘or’ should not be read as requiring mutual exclusivity among that group, but rather should be read as ‘and/or’ unless expressly stated otherwise.


With respect to the use of substantially any plural and/or singular terms herein, those having skill in the art can translate from the plural to the singular and/or from the singular to the plural as is appropriate to the context and/or application. The various singular/plural permutations may be expressly set forth herein for sake of clarity. The indefinite article “a” or “an” does not exclude a plurality. A single processor or other unit may fulfill the functions of several items recited in the claims. The mere fact that certain measures are recited in mutually different dependent claims does not indicate that a combination of these measures cannot be used to advantage. Any reference signs in the claims should not be construed as limiting the scope.


The ranges disclosed herein also encompass any and all overlap, sub-ranges, and combinations thereof. Language such as “up to,” “at least,” “greater than,” “less than,” “between,” and the like includes the number recited. Numbers preceded by a term such as “about” or “approximately” include the recited numbers. For example, “about 90%” includes “90%.” In some embodiments, at least 95% homologous or identical includes 96%, 97%, 98%, 99%, and 100% homologous or identical to the reference sequence. In addition, when a sequence is disclosed as “comprising” a nucleotide or amino acid sequence, such a reference shall also include, unless otherwise indicated, that the sequence “comprises”, “consists of” or “consists essentially of” the recited sequence.


Sequence Listing—Construct 1 (pTR-CBA-Bag3) and Additional Sequences














SEQ ID:
Elements (5′→3′)
NT sequence







 1
ITR-L
TTGGCCACTCCCTCTCTGCGCGCTCGCTCG




CTCACTGAGGCCGGGCGACCAAAGGTCGC




CCGACGCCCGGGCTTTGCCCGGGCGGCCTC




AGTGAGCGAGCGAGCGCGCAGAGAGGGA




GTGGCCAACTCCATCACTAGGGGTTCCT


 2
Spacer
CTAGAGGTAC





 3
CMV enhancer
CCTAGTTATTAATAGTAATCAATTACGGGG







TCATTAGTTCATAGCCCATATATGGAGTTC




CGCGTTACATAACTTACGGTAAATGGCCCG




CCTGGCTGACCGCCCAACGACCCCCGCCC




ATTGACGTCAATAATGACGTATGTTCCCAT




AGTAACGCCAATAGGGACTTTCCATTGAC




GTCAATGGGTGGACTATTTACGGTAAACTG




CCCACTTGGCAGTACATCAAGTGTATCATA




TGCCAAGTACGCCCCCTATTGACGTCAATG




ACGGTAAATGGCCCGCCTGGCATTATGCCC




AGTACATGACCTTATGGGACTTTCCTACTT




GGCAGTACATCTACGTATTAGTCATCGCTA




TTACCATG





 4
Spacer
G





 5
CBA promoter
TCGAGGTGAGCCCCACGTTCTGCTTCACTC




TCCCCATCTCCCCCCCCTCCCCACCCCCAA




TTTTGTATTTATTTATTTTTTAATTATTTTGT




GCAGCGATGGGGGCGGGGGGGGGGGGGG




GGCGCGCGCCAGGCGGGGGGGGGCGGGGC




GAGGGGCGGGGCGGGGCGAGGCGGAGAG




GTGCGGCGGCAGCCAATCAGAGCGGCGCG




CTCCGAAAGTTTCCTTTTATGGCGAGGCGG




CGGCGGCGGCGGCCCTATAAAAAGCGAAG




CGCGCGGCGGGCG





 6
“chimeric intron”
GGAGTCGCTGCGACGCTGCCTTCGCCCCGT




GCCCCGCTCCGCCGCCGCCTCGCGCCGCCC




GCCCCGGCTCTGACTGACCGCGTTACTCCC




ACAGGTGAGCGGGCGGGACGGCCCTTCTC




CTCCGGGCTGTAATTAGCGCTTGGTTTAAT




GACGGCTTGTTTCTTTTCTGTGGCTGCGTG




AAAGCCTTGAGGGGCTCCGGGAGGGCCCT




TTGTGCGGGGGGGAGCGGCTCGGGGGGTG




CGTGCGTGTGTGTGTGCGTGGGGAGCGCC




GCGTGCGGCCCGCGCTGCCCGGCGGCTGT




GAGCGCTGCGGGCGCGGCGCGGGGCTTTG




TGCGCTCCGCAGTGTGCGCGAGGGGAGCG




CGGCCGGGGGCGGTGCCCCGCGGTGCGGG




GGGGGCTGCGAGGGGAACAAAGGCTGCGT




GCGGGGTGTGTGCGTGGGGGGGTGAGCAG




GGGGTGTGGGCGCGGCGGTCGGGCTGTAA




CCCCCCCCTGCACCCCCCTCCCCGAGTTGC




TGAGCACGGCCCGGCTTCGGGTGCGGGGC




TCCGTACGGGGCGTGGCGCGGGGCTCGCC




GTGCCGGGCGGGGGGTGGCGGCAGGTGGG




GGTGCCGGGCGGGGCGGGGCCGCCTCGGG




CCGGGGAGGGCTCGGGGGAGGGGCGCGGC




GGCCCCCGGAGCGCCGGCGGCTGTCGAGG




CGCGGCGAGCCGCAGCCATTGCCTTTTATG




GTAATCGTGCGAGAGGGCGCAGGGACTTC




CTTTGTCCCAAATCTGTGCGGAGCCGAAAT




CTGGGAGGCGCCGCCGCACCCCCTCTAGC




GGGCGCGGGGCGAAGCGGTGCGGCGCCGG




CAGGAAGGAAATGGGCGGGGAGGGCCTTC




GTGCGTCGCCGCGCCGCCGTCCCCTTCTCC




CTCTCCAGCCTCGGGGCTGTCCGCGGGGG




GACGGCTGCCTTCGGGGGGGACGGGGCAG




GGCGGGGTTCGGCTTCTGGCGTGTGACCG




GCGGCTCTAGAGCCTCTGCTAACCATGTTC




ATGCCTTCTTCTTTTTCCTACAG





 7
Spacer
CTCCTGGGCAACGTGCTGGTTATTGTGCTG




TCTCATCATTTTGGCAAAGAATTCCTCGAA




GATCCGAAGGGGTTCAAGCTT





 8
“exon 1 non-
GCATCCAACCCCGGGCCGCGGCCAACTTCT



coding”
CTGGACTGGACCAGAAGTTTCTAGCCGGC




CAGTTGCTACCTCCCTTTATCTCCTCCTTCC




CCTCTGGCAGCGAGGAGGCTATTTCCAGA




CACTTCCACCCCTCTCTGGCCACGTCACCC




CCGCCTTTAATTCATAAAGGTGCCCGGCGC




CGGCTTCCCGGACACGTCGGCGGCGGAGA




GGGGCCCACGGCGGCGGCCCGGCCAGAGA




CTCGGCGCCCGGAGCCAGCGCCCCGCACC




CGCGCCCCAGCGGGCAGACCCCAACCCAG




CG





 9
Spacer
CCACC





10
Exon 1
ATGTCTGCTGCCACACACAGCCCTATGATG




CAGGTCGCCTCTGGCAACGGCGACAGAGA




TCCTTTGCCTCCTGGCTGGGAGATCAAGAT




CGATCCTCAGACCGGCTGGCCCTTCTTCGT




GGACCACAATAGCAGAACCACCACCTGGA




ACGACCCCAGAGTGCCTTCTGAGGGCCCC




AAA





11
Exon 2
GAGACACCCAGCTCTGCCAATGGACCCAG




CAGAGAGGGAAGCAGACTGCCACCAGCTA




GAGAAGGACACCCCGTGTATCCACAGCTG




AGGCCTGGCTACATCCCCATTCCAGTGCTG




CATGAGGGCGCCGAAAACAGACAGGTGCA




CCCCTTTCACGTGTACCCTCAGCCTGGCAT




GCAGCGGTTTAGAACAGAAGCCGCTGCTG




CCGCTCCTCAGAGATCTCAGTCTCCTCTGA




GAGGCATGCCCGAGACAACCCAGCCTGAT




AAGCAGTGTGGACAGGTGGCAGCTGCAGC




AGCAGCTCAACCTCCTGCTTCTCACGGCCC




CGAA





12
Exon 3
AGAAGCCAATCTCCTGCCGCCTCTGATTGC




AGCTCCAGCTCTAGCTCTGCCTCTCTGCCT




AGCAGCGGCAGATCTAGCCTGGGCTCTCA




TCAACTGCCCAGAGGCTACATCAGCATCCC




TGTGATCCACGAGCAGAACGTGACCAGAC




CTGCTGCTCAGCCCAGCTTCCATCAGGCCC




AGAAAACACACTACCCCGCTCAGCAGGGC




GAGTACCAGACACACCAGCCTGTGTACCA




CAAGATCCAGGGCGACGACTGGGAGCCCA




GACCTCTTAGAGCCGCTAGTCCCTTCAGAT




CCTCTGTGCAGGGCGCCAGTTCTAGAGAG




GGCTCTCCTGCCAGAAGCAGCACACCTCTG




CACAGCCCATCTCCAATCAGAGTGCACAC




CGTGGTGGACAGACCCCAG





13
Exon 4
CAGCCTATGACACACAGAGAGACAGCCCC




TGTCAGCCAGCCTGAGAACAAGCCTGAGT




CCAAGCCAGGACCTGTGGGACCTGAACTG




CCTCCAGGACACATCCCTATCCAAGTGATC




CGGAAAGAGGTGGACAGCAAGCCCGTGTC




TCAGAAGCCTCCTCCACCTAGCGAGAAAG




TGGAAGTGAAAGTGCCTCCTGCTCCTGTGC




CTTGTCCTCCTCCATCTCCTGGACCATCTG




CCGTGCCTAGCTCTCCTAAAAGCGTGGCCA




CCGAGGAAAGAGCCGCTCCTTCTACAGCT




CCTGCCGAGGCCACACCTCCTAAACCTGGC




GAAGCTGAAGCCCCTCCAAAACACCCTGG




CGTGCTGAAGGTGGAAGCCATCCTGGAAA




AGGTGCAGGGACTCGAGCAGGCCGTGGAC




AACTTCGAGGGCAAGAAAACCGACAAGAA




ATACCTGATGATCGAGGAATACCTGACCA




AAGAGCTGCTGGCCCTGGACAGCGTTGAC




CCTGAAGGCAGAGCAGATGTGCGGCAGGC




TAGAAGAGATGGCGTGCGGAAAGTGCAGA




CCATCCTCGAGAAGCTGGAACAGAAAGCC




ATCGACGTGCCAGGCCAGGTGCAGGTTTA




CGAGCTGCAGCCCTCTAACCTGGAAGCCG




ATCAGCCTCTGCAGGCCATCATGGAAATG




GGAGCCGTGGCCGCCGACAAGGGAAAGAA




GAATGCTGGCAACGCCGAGGATCCCCACA




CCGAAACACAGCAGCCTGAAGCTACAGCC




GCCGCTACCAGCAATCCCAGCAGCATGAC




AGACACCCCTGGCAATCCAGCCGCTCCA


14
STOP
TAATGATAG





15
Exon 4 UTR
CCTCTGCCCTGTAAAAATCAGACTCGGAAC




CGATGTGTGCTTTAGGGAATTTTAAGTTGC




ATGCATTTCAGAGACTTTAAGTCAGTTGGT




TTTTATTAGCTGCTTGGTATGCAGTAACTT




GGGTGGAGGCAAAACACTAATAAAAGGGC




TAAAAAGGAAAATGATGCTTTTCTTCTATA




TTCTTACTCTGTACCAATAAAGAAGTTGCT




TGTTGTTTGAGAAGTTTAACCCCGTTGCTT




GTTGTTCTGCAGCCCTGTCTACTTGGGCAC




CCCCACCACCTGTTAGCTGTGGTTGTGCAC




TGTCTTTTGTAGCTCTGGACTGGAGGGGTA




GATGGGGAGTCAATTACCCATCACATAAA




TATGAAACATTTATCAGAAATGTTGCCATT




TTAATGAGATGATTTTCTTCATCTCATAAT




TAAAATACCTGACTTTAGAGAGAGTAAAA




TGTGCCAGGAGCCATAGGAATATCTGTAT




GTTGGATGACTTTAATGCTACATTTTAAAA




AAAGAAAATAAAGTAATAATATAACTCAA




AA





16
Spacer
GCGGCCGCTCGAGTCTAGAGGGCCCGTTT




AAACCCGCTGATCAGCCT





17
bGH polyA
CGACTGTGCCTTCTAGTTGCCAGCCATCTG




TTGTTTGCCCCTCCCCCGTGCCTTCCTTGAC




CCTGGAAGGTGCCACTCCCACTGTCCTTTC




CTAATAAAATGAGGAAATTGCATCGCATT




GTCTGAGTAGGTGTCATTCTATTCTGGGGG




GTGGGGTGGGGCAGGACAGCAAGGGGGA




GGATTGGGAAGACAATAGCAGG





19
Spacer
GACTCTAG





20
ITR-R
AGGAACCCCTAGTGATGGAGTTGGCCACT




CCCTCTCTGCGCGCTCGCTCGCTCACTGAG




GCCGGGCGACCAAAGGTCGCCCGACGCCC




GGGCTTTGCCCGGGCGGCCTCAGTGAGCG




AGCGAGCGCGCAGAGAGGGAGTGGCCAA





21
BAG3
ATGTCTGCTGCCACACACAGCCCTATGATG




CAGGTCGCCTCTGGCAACGGCGACAGAGA




TCCTTTGCCTCCTGGCTGGGAGATCAAGAT




CGATCCTCAGACCGGCTGGCCCTTCTTCGT




GGACCACAATAGCAGAACCACCACCTGGA




ACGACCCCAGAGTGCCTTCTGAGGGCCCC




AAAGAGACACCCAGCTCTGCCAATGGACC




CAGCAGAGAGGGAAGCAGACTGCCACCAG




CTAGAGAAGGACACCCCGTGTATCCACAG




CTGAGGCCTGGCTACATCCCCATTCCAGTG




CTGCATGAGGGCGCCGAAAACAGACAGGT




GCACCCCTTTCACGTGTACCCTCAGCCTGG




CATGCAGCGGTTTAGAACAGAAGCCGCTG




CTGCCGCTCCTCAGAGATCTCAGTCTCCTC




TGAGAGGCATGCCCGAGACAACCCAGCCT




GATAAGCAGTGTGGACAGGTGGCAGCTGC




AGCAGCAGCTCAACCTCCTGCTTCTCACGG




CCCCGAAAGAAGCCAATCTCCTGCCGCCTC




TGATTGCAGCTCCAGCTCTAGCTCTGCCTC




TCTGCCTAGCAGCGGCAGATCTAGCCTGG




GCTCTCATCAACTGCCCAGAGGCTACATCA




GCATCCCTGTGATCCACGAGCAGAACGTG




ACCAGACCTGCTGCTCAGCCCAGCTTCCAT




CAGGCCCAGAAAACACACTACCCCGCTCA




GCAGGGCGAGTACCAGACACACCAGCCTG




TGTACCACAAGATCCAGGGCGACGACTGG




GAGCCCAGACCTCTTAGAGCCGCTAGTCCC




TTCAGATCCTCTGTGCAGGGCGCCAGTTCT




AGAGAGGGCTCTCCTGCCAGAAGCAGCAC




ACCTCTGCACAGCCCATCTCCAATCAGAGT




GCACACCGTGGTGGACAGACCCCAGCAGC




CTATGACACACAGAGAGACAGCCCCTGTC




AGCCAGCCTGAGAACAAGCCTGAGTCCAA




GCCAGGACCTGTGGGACCTGAACTGCCTC




CAGGACACATCCCTATCCAAGTGATCCGG




AAAGAGGTGGACAGCAAGCCCGTGTCTCA




GAAGCCTCCTCCACCTAGCGAGAAAGTGG




AAGTGAAAGTGCCTCCTGCTCCTGTGCCTT




GTCCTCCTCCATCTCCTGGACCATCTGCCG




TGCCTAGCTCTCCTAAAAGCGTGGCCACCG




AGGAAAGAGCCGCTCCTTCTACAGCTCCTG




CCGAGGCCACACCTCCTAAACCTGGCGAA




GCTGAAGCCCCTCCAAAACACCCTGGCGT




GCTGAAGGTGGAAGCCATCCTGGAAAAGG




TGCAGGGACTCGAGCAGGCCGTGGACAAC




TTCGAGGGCAAGAAAACCGACAAGAAATA




CCTGATGATCGAGGAATACCTGACCAAAG




AGCTGCTGGCCCTGGACAGCGTTGACCCTG




AAGGCAGAGCAGATGTGCGGCAGGCTAGA




AGAGATGGCGTGCGGAAAGTGCAGACCAT




CCTCGAGAAGCTGGAACAGAAAGCCATCG




ACGTGCCAGGCCAGGTGCAGGTTTACGAG




CTGCAGCCCTCTAACCTGGAAGCCGATCA




GCCTCTGCAGGCCATCATGGAAATGGGAG




CCGTGGCCGCCGACAAGGGAAAGAAGAAT




GCTGGCAACGCCGAGGATCCCCACACCGA




AACACAGCAGCCTGAAGCTACAGCCGCCG




CTACCAGCAATCCCAGCAGCATGACAGAC




ACCCCTGGCAATCCAGCCGCTCCATAATGA




TAG





22
Construct with
TTGGCCACTCCCTCTCTGCGCGCTCGCTCG



BAG3
CTCACTGAGGCCGGGCGACCAAAGGTCGC




CCGACGCCCGGGCTTTGCCCGGGCGGCCTC




AGTGAGCGAGCGAGCGCGCAGAGAGGGA




GTGGCCAACTCCATCACTAGGGGTTCCTCT




AGAGGTACCCTAGTTATTAATAGTAATCAA




TTACGGGGTCATTAGTTCATAGCCCATATA




TGGAGTTCCGCGTTACATAACTTACGGTAA




ATGGCCCGCCTGGCTGACCGCCCAACGAC




CCCCGCCCATTGACGTCAATAATGACGTAT




GTTCCCATAGTAACGCCAATAGGGACTTTC




CATTGACGTCAATGGGTGGACTATTTACGG




TAAACTGCCCACTTGGCAGTACATCAAGTG




TATCATATGCCAAGTACGCCCCCTATTGAC




GTCAATGACGGTAAATGGCCCGCCTGGCA




TTATGCCCAGTACATGACCTTATGGGACTT




TCCTACTTGGCAGTACATCTACGTATTAGT




CATCGCTATTACCATGGTCGAGGTGAGCCC




CACGTTCTGCTTCACTCTCCCCATCTCCCCC




CCCTCCCCACCCCCAATTTTGTATTTATTTA




TTTTTTAATTATTTTGTGCAGCGATGGGGG




CGGGGGGGGGGGGGGGGCGCGCGCCAGG




CGGGGCGGGGGGGGCGAGGGGCGGGGC




GGGGCGAGGCGGAGAGGTGCGGCGGCAG




CCAATCAGAGCGGCGCGCTCCGAAAGTTT




CCTTTTATGGCGAGGCGGCGGCGGCGGCG




GCCCTATAAAAAGCGAAGCGCGCGGCGGG




CGGGAGTCGCTGCGACGCTGCCTTCGCCCC




GTGCCCCGCTCCGCCGCCGCCTCGCGCCGC




CCGCCCCGGCTCTGACTGACCGCGTTACTC




CCACAGGTGAGCGGGCGGGACGGCCCTTC




TCCTCCGGGCTGTAATTAGCGCTTGGTTTA




ATGACGGCTTGTTTCTTTTCTGTGGCTGCG




TGAAAGCCTTGAGGGGCTCCGGGAGGGCC




CTTTGTGCGGGGGGGAGCGGCTCGGGGGG




TGCGTGCGTGTGTGTGTGCGTGGGGAGCG




CCGCGTGCGGCCCGCGCTGCCCGGCGGCT




GTGAGCGCTGCGGGCGCGGCGCGGGGCTT




TGTGCGCTCCGCAGTGTGCGCGAGGGGAG




CGCGGCCGGGGGCGGTGCCCCGCGGTGCG




GGGGGGGCTGCGAGGGGAACAAAGGCTGC




GTGCGGGGTGTGTGCGTGGGGGGGTGAGC




AGGGGGTGTGGGCGCGGCGGTCGGGCTGT




AACCCCCCCCTGCACCCCCCTCCCCGAGTT




GCTGAGCACGGCCCGGCTTCGGGTGCGGG




GCTCCGTACGGGGCGTGGCGCGGGGCTCG




CCGTGCCGGGGGGGGGTGGCGGCAGGTG




GGGGTGCCGGGCGGGGGGGGGCCGCCTCG




GGCCGGGGAGGGCTCGGGGGAGGGGCGC




GGCGGCCCCCGGAGCGCCGGCGGCTGTCG




AGGCGCGGCGAGCCGCAGCCATTGCCTTTT




ATGGTAATCGTGCGAGAGGGCGCAGGGAC




TTCCTTTGTCCCAAATCTGTGCGGAGCCGA




AATCTGGGAGGCGCCGCCGCACCCCCTCT




AGCGGGCGCGGGGCGAAGCGGTGCGGCGC




CGGCAGGAAGGAAATGGGCGGGGAGGGC




CTTCGTGCGTCGCCGCGCCGCCGTCCCCTT




CTCCCTCTCCAGCCTCGGGGCTGTCCGCGG




GGGGACGGCTGCCTTCGGGGGGGACGGGG




CAGGGCGGGGTTCGGCTTCTGGCGTGTGA




CCGGCGGCTCTAGAGCCTCTGCTAACCATG




TTCATGCCTTCTTCTTTTTCCTACAG




CTCCTGGGCAACGTGCTGGTTATTGTGCTG




TCTCATCATTTTGGCAAAGAATTCCTCGAA




GATCCGAAGGGGTTCAAGCTTGCATCCAA




CCCCGGGCCGCGGCCAACTTCTCTGGACTG




GACCAGAAGTTTCTAGCCGGCCAGTTGCTA




CCTCCCTTTATCTCCTCCTTCCCCTCTGGCA




GCGAGGAGGCTATTTCCAGACACTTCCACC




CCTCTCTGGCCACGTCACCCCCGCCTTTAA




TTCATAAAGGTGCCCGGCGCCGGCTTCCCG




GACACGTCGGCGGCGGAGAGGGGCCCACG




GCGGCGGCCCGGCCAGAGACTCGGCGCCC




GGAGCCAGCGCCCCGCACCCGCGCCCCAG




CGGGCAGACCCCAACCCAGCGCCACC




ATGTCTGCTGCCACACACAGCCCTATGATG




CAGGTCGCCTCTGGCAACGGCGACAGAGA




TCCTTTGCCTCCTGGCTGGGAGATCAAGAT




CGATCCTCAGACCGGCTGGCCCTTCTTCGT




GGACCACAATAGCAGAACCACCACCTGGA




ACGACCCCAGAGTGCCTTCTGAGGGCCCC




AAAGAGACACCCAGCTCTGCCAATGGACC




CAGCAGAGAGGGAAGCAGACTGCCACCAG




CTAGAGAAGGACACCCCGTGTATCCACAG




CTGAGGCCTGGCTACATCCCCATTCCAGTG




CTGCATGAGGGCGCCGAAAACAGACAGGT




GCACCCCTTTCACGTGTACCCTCAGCCTGG




CATGCAGCGGTTTAGAACAGAAGCCGCTG




CTGCCGCTCCTCAGAGATCTCAGTCTCCTC




TGAGAGGCATGCCCGAGACAACCCAGCCT




GATAAGCAGTGTGGACAGGTGGCAGCTGC




AGCAGCAGCTCAACCTCCTGCTTCTCACGG




CCCCGAAAGAAGCCAATCTCCTGCCGCCTC




TGATTGCAGCTCCAGCTCTAGCTCTGCCTC




TCTGCCTAGCAGCGGCAGATCTAGCCTGG




GCTCTCATCAACTGCCCAGAGGCTACATCA




GCATCCCTGTGATCCACGAGCAGAACGTG




ACCAGACCTGCTGCTCAGCCCAGCTTCCAT




CAGGCCCAGAAAACACACTACCCCGCTCA




GCAGGGCGAGTACCAGACACACCAGCCTG




TGTACCACAAGATCCAGGGCGACGACTGG




GAGCCCAGACCTCTTAGAGCCGCTAGTCCC




TTCAGATCCTCTGTGCAGGGCGCCAGTTCT




AGAGAGGGCTCTCCTGCCAGAAGCAGCAC




ACCTCTGCACAGCCCATCTCCAATCAGAGT




GCACACCGTGGTGGACAGACCCCAGCAGC




CTATGACACACAGAGAGACAGCCCCTGTC




AGCCAGCCTGAGAACAAGCCTGAGTCCAA




GCCAGGACCTGTGGGACCTGAACTGCCTC




CAGGACACATCCCTATCCAAGTGATCCGG




AAAGAGGTGGACAGCAAGCCCGTGTCTCA




GAAGCCTCCTCCACCTAGCGAGAAAGTGG




AAGTGAAAGTGCCTCCTGCTCCTGTGCCTT




GTCCTCCTCCATCTCCTGGACCATCTGCCG




TGCCTAGCTCTCCTAAAAGCGTGGCCACCG




AGGAAAGAGCCGCTCCTTCTACAGCTCCTG




CCGAGGCCACACCTCCTAAACCTGGCGAA




GCTGAAGCCCCTCCAAAACACCCTGGCGT




GCTGAAGGTGGAAGCCATCCTGGAAAAGG




TGCAGGGACTCGAGCAGGCCGTGGACAAC




TTCGAGGGCAAGAAAACCGACAAGAAATA




CCTGATGATCGAGGAATACCTGACCAAAG




AGCTGCTGGCCCTGGACAGCGTTGACCCTG




AAGGCAGAGCAGATGTGCGGCAGGCTAGA




AGAGATGGCGTGCGGAAAGTGCAGACCAT




CCTCGAGAAGCTGGAACAGAAAGCCATCG




ACGTGCCAGGCCAGGTGCAGGTTTACGAG




CTGCAGCCCTCTAACCTGGAAGCCGATCA




GCCTCTGCAGGCCATCATGGAAATGGGAG




CCGTGGCCGCCGACAAGGGAAAGAAGAAT




GCTGGCAACGCCGAGGATCCCCACACCGA




AACACAGCAGCCTGAAGCTACAGCCGCCG




CTACCAGCAATCCCAGCAGCATGACAGAC




ACCCCTGGCAATCCAGCCGCTCCATAATGA




TAGCCTCTGCCCTGTAAAAATCAGACTCGG




AACCGATGTGTGCTTTAGGGAATTTTAAGT




TGCATGCATTTCAGAGACTTTAAGTCAGTT




GGTTTTTATTAGCTGCTTGGTATGCAGTAA




CTTGGGTGGAGGCAAAACACTAATAAAAG




GGCTAAAAAGGAAAATGATGCTTTTCTTCT




ATATTCTTACTCTGTACCAATAAAGAAGTT




GCTTGTTGTTTGAGAAGTTTAACCCCGTTG




CTTGTTGTTCTGCAGCCCTGTCTACTTGGG




CACCCCCACCACCTGTTAGCTGTGGTTGTG




CACTGTCTTTTGTAGCTCTGGACTGGAGGG




GTAGATGGGGAGTCAATTACCCATCACAT




AAATATGAAACATTTATCAGAAATGTTGCC




ATTTTAATGAGATGATTTTCTTCATCTCAT




AATTAAAATACCTGACTTTAGAGAGAGTA




AAATGTGCCAGGAGCCATAGGAATATCTG




TATGTTGGATGACTTTAATGCTACATTTTA




AAAAAAGAAAATAAAGTAATAATATAACT




CAAAAGCGGCCGCTCGAGTCTAGAGGGCC




CGTTTAAACCCGCTGATCAGCCTCGACTGT




GCCTTCTAGTTGCCAGCCATCTGTTGTTTG




CCCCTCCCCCGTGCCTTCCTTGACCCTGGA




AGGTGCCACTCCCACTGTCCTTTCCTAATA




AAATGAGGAAATTGCATCGCATTGTCTGA




GTAGGTGTCATTCTATTCTGGGGGGTGGGG




TGGGGCAGGACAGCAAGGGGGAGGATTGG




GAAGACAATAGCAGGGACTCTAG




AGGAACCCCTAGTGATGGAGTTGGCCACT




CCCTCTCTGCGCGCTCGCTCGCTCACTGAG




GCCGGGCGACCAAAGGTCGCCCGACGCCC




GGGCTTTGCCCGGGCGGCCTCAGTGAGCG




AGCGAGCGCGCAGAGAGGGAGTGGCCAA




ATGTCTGCTGCCACACACAGCCCTATGATG




CAGGTCGCCTCTGGCAACGGCGACAGAGA




TCCTTTGCCTCCTGGCTGGGAGATCAAGAT




CGATCCTCAGACCGGCTGGCCCTTCTTCGT




GGACCACAATAGCAGAACCACCACCTGGA




ACGACCCCAGAGTGCCTTCTGAGGGCCCC




AAAGAGACACCCAGCTCTGCCAATGGACC




CAGCAGAGAGGGAAGCAGACTGCCACCAG




CTAGAGAAGGACACCCCGTGTATCCACAG




CTGAGGCCTGGCTACATCCCCATTCCAGTG




CTGCATGAGGGCGCCGAAAACAGACAGGT




GCACCCCTTTCACGTGTACCCTCAGCCTGG




CATGCAGCGGTTTAGAACAGAAGCCGCTG




CTGCCGCTCCTCAGAGATCTCAGTCTCCTC




TGAGAGGCATGCCCGAGACAACCCAGCCT




GATAAGCAGTGTGGACAGGTGGCAGCTGC




AGCAGCAGCTCAACCTCCTGCTTCTCACGG




CCCCGAAAGAAGCCAATCTCCTGCCGCCTC




TGATTGCAGCTCCAGCTCTAGCTCTGCCTC




TCTGCCTAGCAGCGGCAGATCTAGCCTGG




GCTCTCATCAACTGCCCAGAGGCTACATCA




GCATCCCTGTGATCCACGAGCAGAACGTG




ACCAGACCTGCTGCTCAGCCCAGCTTCCAT




CAGGCCCAGAAAACACACTACCCCGCTCA




GCAGGGCGAGTACCAGACACACCAGCCTG




TGTACCACAAGATCCAGGGCGACGACTGG




GAGCCCAGACCTCTTAGAGCCGCTAGTCCC




TTCAGATCCTCTGTGCAGGGCGCCAGTTCT




AGAGAGGGCTCTCCTGCCAGAAGCAGCAC




ACCTCTGCACAGCCCATCTCCAATCAGAGT




GCACACCGTGGTGGACAGACCCCAGCAGC




CTATGACACACAGAGAGACAGCCCCTGTC




AGCCAGCCTGAGAACAAGCCTGAGTCCAA




GCCAGGACCTGTGGGACCTGAACTGCCTC




CAGGACACATCCCTATCCAAGTGATCCGG




AAAGAGGTGGACAGCAAGCCCGTGTCTCA




GAAGCCTCCTCCACCTAGCGAGAAAGTGG




AAGTGAAAGTGCCTCCTGCTCCTGTGCCTT




GTCCTCCTCCATCTCCTGGACCATCTGCCG




TGCCTAGCTCTCCTAAAAGCGTGGCCACCG




AGGAAAGAGCCGCTCCTTCTACAGCTCCTG




CCGAGGCCACACCTCCTAAACCTGGCGAA




GCTGAAGCCCCTCCAAAACACCCTGGCGT




GCTGAAGGTGGAAGCCATCCTGGAAAAGG




TGCAGGGACTCGAGCAGGCCGTGGACAAC




TTCGAGGGCAAGAAAACCGACAAGAAATA




CCTGATGATCGAGGAATACCTGACCAAAG




AGCTGCTGGCCCTGGACAGCGTTGACCCTG




AAGGCAGAGCAGATGTGCGGCAGGCTAGA




AGAGATGGCGTGCGGAAAGTGCAGACCAT




CCTCGAGAAGCTGGAACAGAAAGCCATCG




ACGTGCCAGGCCAGGTGCAGGTTTACGAG




CTGCAGCCCTCTAACCTGGAAGCCGATCA




GCCTCTGCAGGCCATCATGGAAATGGGAG




CCGTGGCCGCCGACAAGGGAAAGAAGAAT




GCTGGCAACGCCGAGGATCCCCACACCGA




AACACAGCAGCCTGAAGCTACAGCCGCCG




CTACCAGCAATCCCAGCAGCATGACAGAC




ACCCCTGGCAATCCAGCCGCTCCA





29
RH74 VP1, VP2,
ATGGCTGCCGATGGTTATCTTCCAGATTGG



VP3
CTCGAGGACAACCTCTCTGAGGGCATTCGC




GAGTGGTGGGACCTGAAACCTGGAGCCCC




GAAACCCAAAGCCAACCAGCAAAAGCAGG




ACAACGGCCGGGGTCTGGTGCTTCCTGGCT




ACAAGTACCTCGGACCCTTCAACGGACTC




GACAAGGGGGAGCCCGTCAACGCGGCGGA




CGCAGCGGCCCTCGAGCACGACAAGGCCT




ACGAC




CAGCAGCTCCAAGCGGGTGACAATCCGTA




CCTGCGGTATAATCACGCCGACGCCGAGTT




TCAGGAGCGTCTGCAAGAAGATACGTCTTT




TGGGGGCAACCTCGGGCGCGCAGTCTTCC




AGGCCAAAAAGCGGGTTCTCGAACCTCTG




GGCCTGGTTGAATCGCCGGTTAAGACGGC




TCCTGGAAAGAAGAGACCGGTAGAGCCAT




CACCCCAGCGCTCTCCAGACTCCTCTACGG




GCATC




GGCAAGAAAGGCCAGCAGCCCGCAAAAA




AGAGACTCAATTTTGGGCAGACTGGCGAC




TCAGAGTCAGTCCCCGACCCTCAACCAATC




GGAGAACCACCAGCAGGCCCCTCTGGTCT




GGGATCTGGTACAATGGCTGCAGGCGGTG




GCGCTCCAATGGCAGACAATAACGAAGGC




GCCGACGGAGTGGGTAGTTCCTCAGGAAA




TTGGCATTGCGATTCCACATGGCTGGGCGA




CAGAGTCATCACCACCAGCACCCGCACCT




GGGCCCTGCCCACCTACAACAACCACCTCT




ACAAGCAAATCTCCAACGGGACCTCGGGA




GGAAGCACCAACGACAACACCTACTTCGG




CTACAGCACCCCCTGGGGGTATTTTGACTT




CAACAGATTCCACTGCCACTTTTCACCACG




TGACTGGCAGCGACTCATCAACAACAACT




GGGGATTCCGGCCCAAGAGGCTCAACTTC




AAGCTCTTCAAC




ATCCAAGTCAAGGAGGTCACGCAGAATGA




AGGCACCAAGACCATCGCCAATAACCTTA




CCAGCACGATTCAGGTCTTTACGGACTCGG




AATACCAGCTCCCGTACGTGCTCGGCTCGG




CGCACCAGGGCTGCCTGCCTCCGTTCCCGG




CGGACGTCTTCATGATTCCTCAGTACGGGT




ACCTGACTCTGAACAATGGCAGTCAGGCT




GTGGGCCGGTCGTCCTTCTACTGCCTGGAG




TAC




TTTCCTTCTCAAATGCTGAGAACGGGCAAC




AACTTTGAATTCAGCTACAACTTCGAGGAC




GTGCCCTTCCACAGCAGCTACGCGCACAG




CCAGAGCCTGGACCGGCTGATGAACCCTC




TCATCGACCAGTACTTGTACTACCTGTCCC




GGACTCAAAGCACGGGCGGTACTGCAGGA




ACTCAGCAGTTGCTATTTTCTCAGGCCGGG




CCTAACAACATGTCGGCTCAGGCCAAGAA




CTGG




CTACCCGGTCCCTGCTACCGGCAGCAACGC




GTCTCCACGACACTGTCGCAGAACAACAA




CAGCAACTTTGCCTGGACGGGTGCCACCA




AGTATCATCTGAATGGCAGAGACTCTCTGG




TGAATCCTGGCGTTGCCATGGCTACCCACA




AGGACGACGAAGAGCGATTTTTTCCATCC




AGCGGAGTCTTAATGTTTGGGAAACAGGG




AGCTGGAAAAGACAACGTGGACTATAGCA




GCGTGATGCTAACCAGCGAGGAAGAAATA




AAGACCACCAACCCAGTGGCCACAGAACA




GTACGGCGTGGTGGCCGATAACCTGCAAC




AGCAAAACGCCGCTCCTATTGTAGGGGCC




GTCAATAGTCAAGGAGCCTTACCTGGCAT




GGTGTGGCAGAACCGGGACGTGTACCTGC




AGGGTCCCATCTGGGCCAAGATTCCTCATA




CGGACGGCAACTTTCATCCCTCGCCGCTGA




TGGGAGGCTTTGGACTGAAGCATCCGCCTC




CTCAGATCCTGATTAAAAACACACCTGTTC




CCGCGGATCCTCCGACCACCTTCAATCAGG




CCAAGCTGGCTTCTTTCATCACGCAGTACA




GTACCGGCCAGGTCAGCGTGGAGATCGAG




TGGGAGCTGCAGAAGGAGAACAGCAAACG




CTGGAACCCAGAGATTCAGTACACTTCCA




ACTACTACAAATCTACAAATGTGGACTTTG




CTGTCAATACTGAGGGTACTTATTCCGAGC




CTCGCCCCATTGGCACCCGTTACCTCACCC




GTAATCTGTAA





55
aMHC
CCTTCAGATTAAAAATAACTAAGGTAAGG




GCCATGTGGGTAGGGGAGGTGGTGTGAGA




CGGTCCTGTCTCTCCTCTATCTGCCCATCG




GCCCTTTGGGGAGGAGGAATGTGCCCAAG




GACTAAAAAAAGGCCCTGGAGCCAGAGGG




GCGAGGGCAGCAGACCTTTCATGGGCAAA




CCTCAGGGCTGCTGTC









Kozak Sequences













SEQ ID:
Description
NT Sequence







56
Kozak Consensus

AGCCCCAAC




Sequence






57
scAAV with chick
tccctctctgcgcgctcgctcgctcactgaggccgggcgaccaaagg



beta actin (CBA)
tcgcccgacgcccgggctttgcccgggcggcctcagtgagcgagcg



promoter and
agcgcgcagctgctgCTAGAGGTACTCGAGGTGAG



AGCGCCACC
CCCCACGTTCTGCTTCACTCTCCCCATCTCC



Kozak sequence
CCCCCCTCCCCACCCCCAATTTTGTATTTAT




TTATTTTTTAATTATTTTGTGCAGCGATGGG




GGCGGGGGGGGGGGGGGGGCGCGCGCCA




GGCGGGGCGGGGGGGGCGAGGGGGGGG




GCGGGGCGAGGCGGAGAGGTGCGGCGGC




AGCCAATCAGAGCGGCGCGCTCCGAAAGT




TTCCTTTTATGGCGAGGCGGCGGCGGCGGC




GGCCCTATAAAAAGCGAAGCGCGCGGCGG




GCGAGCGCCACCATGTCTGCTGCCACACA




CAGCCCTATGATGCAGGTCGCCTCTGGCAA




CGGCGACAGAGATCCTTTGCCTCCTGGCTG




GGAGATCAAGATCGATCCTCAGACCGGCT




GGCCCTTCTTCGTGGACCACAATAGCAGA




ACCACCACCTGGAACGACCCCAGAGTGCC




TTCTGAGGGCCCCAAAGAGACACCCAGCT




CTGCCAATGGACCCAGCAGAGAGGGAAGC




AGACTGCCACCAGCTAGAGAAGGACACCC




CGTGTATCCACAGCTGAGGCCTGGCTACAT




CCCCATTCCAGTGCTGCATGAGGGCGCCG




AAAACAGACAGGTGCACCCCTTTCACGTG




TACCCTCAGCCTGGCATGCAGCGGTTTAGA




ACAGAAGCCGCTGCTGCCGCTCCTCAGAG




ATCTCAGTCTCCTCTGAGAGGCATGCCCGA




GACAACCCAGCCTGATAAGCAGTGTGGAC




AGGTGGCAGCTGCAGCAGCAGCTCAACCT




CCTGCTTCTCACGGCCCCGAAAGAAGCCA




ATCTCCTGCCGCCTCTGATTGCAGCTCCAG




CTCTAGCTCTGCCTCTCTGCCTAGCAGCGG




CAGATCTAGCCTGGGCTCTCATCAACTGCC




CAGAGGCTACATCAGCATCCCTGTGATCCA




CGAGCAGAACGTGACCAGACCTGCTGCTC




AGCCCAGCTTCCATCAGGCCCAGAAAACA




CACTACCCCGCTCAGCAGGGCGAGTACCA




GACACACCAGCCTGTGTACCACAAGATCC




AGGGCGACGACTGGGAGCCCAGACCTCTT




AGAGCCGCTAGTCCCTTCAGATCCTCTGTG




CAGGGCGCCAGTTCTAGAGAGGGCTCTCC




TGCCAGAAGCAGCACACCTCTGCACAGCC




CATCTCCAATCAGAGTGCACACCGTGGTG




GACAGACCCCAGCAGCCTATGACACACAG




AGAGACAGCCCCTGTCAGCCAGCCTGAGA




ACAAGCCTGAGTCCAAGCCAGGACCTGTG




GGACCTGAACTGCCTCCAGGACACATCCCT




ATCCAAGTGATCCGGAAAGAGGTGGACAG




CAAGCCCGTGTCTCAGAAGCCTCCTCCACC




TAGCGAGAAAGTGGAAGTGAAAGTGCCTC




CTGCTCCTGTGCCTTGTCCTCCTCCATCTCC




TGGACCATCTGCCGTGCCTAGCTCTCCTAA




AAGCGTGGCCACCGAGGAAAGAGCCGCTC




CTTCTACAGCTCCTGCCGAGGCCACACCTC




CTAAACCTGGCGAAGCTGAAGCCCCTCCA




AAACACCCTGGCGTGCTGAAGGTGGAAGC




CATCCTGGAAAAGGTGCAGGGACTCGAGC




AGGCCGTGGACAACTTCGAGGGCAAGAAA




ACCGACAAGAAATACCTGATGATCGAGGA




ATACCTGACCAAAGAGCTGCTGGCCCTGG




ACAGCGTTGACCCTGAAGGCAGAGCAGAT




GTGCGGCAGGCTAGAAGAGATGGCGTGCG




GAAAGTGCAGACCATCCTCGAGAAGCTGG




AACAGAAAGCCATCGACGTGCCAGGCCAG




GTGCAGGTTTACGAGCTGCAGCCCTCTAAC




CTGGAAGCCGATCAGCCTCTGCAGGCCAT




CATGGAAATGGGAGCCGTGGCCGCCGACA




AGGGAAAGAAGAATGCTGGCAACGCCGAG




GATCCCCACACCGAAACACAGCAGCCTGA




AGCTACAGCCGCCGCTACCAGCAATCCCA




GCAGCATGACAGACACCCCTGGCAATCCA




GCCGCTCCATAATGAGCGGCCGCCGGCCG





AATAAAAGATCCTTATTTTCATTGGATCT






GTGTGTTGGTTTTTTGTGTG

GTCGACTCT








AG
aggaacccctagtgatggagttggccactccctctctgcgcgctc





gctcgctcactgaggccgggcgaccaaaggtcgcccgacgcccgg




gctttgcccgggcggcctcagtgagcgagcgagcgcgcagagagg




gagtggccaa





58
scAAV with chick
tccctctctgcgcgctcgctcgctcactgaggccgggcgaccaaagg



beta actin (CBA)
tcgcccgacgcccgggctttgcccgggcggcctcagtgagcgagcg



promoter and in
agcgcgcagctgctgCTAGAGGTACTCGAGGTGAG



silico derived
CCCCACGTTCTGCTTCACTCTCCCCATCTCC



Kozak sequence
CCCCCCTCCCCACCCCCAATTTTGTATTTAT




TTATTTTTTAATTATTTTGTGCAGCGATGGG




GGCGGGGGGGGGGGGGGGGCGCGCGCCA




GGCGGGGCGGGGCGGGGCGAGGGGGGGG




GCGGGGCGAGGCGGAGAGGTGCGGCGGC




AGCCAATCAGAGCGGCGCGCTCCGAAAGT




TTCCTTTTATGGCGAGGCGGCGGCGGCGGC




GGCCCTATAAAAAGCGAAGCGCGCGGCGG




GCGAGCCCCAACATGTCTGCTGCCACACA




CAGCCCTATGATGCAGGTCGCCTCTGGCAA




CGGCGACAGAGATCCTTTGCCTCCTGGCTG




GGAGATCAAGATCGATCCTCAGACCGGCT




GGCCCTTCTTCGTGGACCACAATAGCAGA




ACCACCACCTGGAACGACCCCAGAGTGCC




TTCTGAGGGCCCCAAAGAGACACCCAGCT




CTGCCAATGGACCCAGCAGAGAGGGAAGC




AGACTGCCACCAGCTAGAGAAGGACACCC




CGTGTATCCACAGCTGAGGCCTGGCTACAT




CCCCATTCCAGTGCTGCATGAGGGCGCCG




AAAACAGACAGGTGCACCCCTTTCACGTG




TACCCTCAGCCTGGCATGCAGCGGTTTAGA




ACAGAAGCCGCTGCTGCCGCTCCTCAGAG




ATCTCAGTCTCCTCTGAGAGGCATGCCCGA




GACAACCCAGCCTGATAAGCAGTGTGGAC




AGGTGGCAGCTGCAGCAGCAGCTCAACCT




CCTGCTTCTCACGGCCCCGAAAGAAGCCA




ATCTCCTGCCGCCTCTGATTGCAGCTCCAG




CTCTAGCTCTGCCTCTCTGCCTAGCAGCGG




CAGATCTAGCCTGGGCTCTCATCAACTGCC




CAGAGGCTACATCAGCATCCCTGTGATCCA




CGAGCAGAACGTGACCAGACCTGCTGCTC




AGCCCAGCTTCCATCAGGCCCAGAAAACA




CACTACCCCGCTCAGCAGGGCGAGTACCA




GACACACCAGCCTGTGTACCACAAGATCC




AGGGCGACGACTGGGAGCCCAGACCTCTT




AGAGCCGCTAGTCCCTTCAGATCCTCTGTG




CAGGGCGCCAGTTCTAGAGAGGGCTCTCC




TGCCAGAAGCAGCACACCTCTGCACAGCC




CATCTCCAATCAGAGTGCACACCGTGGTG




GACAGACCCCAGCAGCCTATGACACACAG




AGAGACAGCCCCTGTCAGCCAGCCTGAGA




ACAAGCCTGAGTCCAAGCCAGGACCTGTG




GGACCTGAACTGCCTCCAGGACACATCCCT




ATCCAAGTGATCCGGAAAGAGGTGGACAG




CAAGCCCGTGTCTCAGAAGCCTCCTCCACC




TAGCGAGAAAGTGGAAGTGAAAGTGCCTC




CTGCTCCTGTGCCTTGTCCTCCTCCATCTCC




TGGACCATCTGCCGTGCCTAGCTCTCCTAA




AAGCGTGGCCACCGAGGAAAGAGCCGCTC




CTTCTACAGCTCCTGCCGAGGCCACACCTC




CTAAACCTGGCGAAGCTGAAGCCCCTCCA




AAACACCCTGGCGTGCTGAAGGTGGAAGC




CATCCTGGAAAAGGTGCAGGGACTCGAGC




AGGCCGTGGACAACTTCGAGGGCAAGAAA




ACCGACAAGAAATACCTGATGATCGAGGA




ATACCTGACCAAAGAGCTGCTGGCCCTGG




ACAGCGTTGACCCTGAAGGCAGAGCAGAT




GTGCGGCAGGCTAGAAGAGATGGCGTGCG




GAAAGTGCAGACCATCCTCGAGAAGCTGG




AACAGAAAGCCATCGACGTGCCAGGCCAG




GTGCAGGTTTACGAGCTGCAGCCCTCTAAC




CTGGAAGCCGATCAGCCTCTGCAGGCCAT




CATGGAAATGGGAGCCGTGGCCGCCGACA




AGGGAAAGAAGAATGCTGGCAACGCCGAG




GATCCCCACACCGAAACACAGCAGCCTGA




AGCTACAGCCGCCGCTACCAGCAATCCCA




GCAGCATGACAGACACCCCTGGCAATCCA




GCCGCTCCATAATGAGCGGCCGCCGGCCG





AATAAAAGATCCTTATTTTCATTGGATCT






GTGTGTTGGTTTTTTGTGTG

GTCGACTCT








AG
aggaacccctagtgatggagttggccactccctctctgegcgctc





gctcgctcactgaggccgggcgaccaaaggtcgcccgacgcccgg




gctttgcccgggcggcctcagtgagcgagcgagcgcgcagagagg




gagtggccaa





59
scAAV with chick
tccctctctgcgcgctcgctcgctcactgaggccgggcgaccaaagg



beta actin (CBA)
tcgcccgacgcccgggctttgcccgggcggcctcagtgagcgagcg



promoter and
agcgcgcagctgctgCTAGAGGTACTCGAGGTGAG



CAACCCAGC
CCCCACGTTCTGCTTCACTCTCCCCATCTCC



Kozak sequence
CCCCCCTCCCCACCCCCAATTTTGTATTTAT




TTATTTTTTAATTATTTTGTGCAGCGATGGG




GGCGGGGGGGGGGGGGGGGCGCGCGCCA




GGCGGGGCGGGGCGGGGCGAGGGGCGGG




GCGGGGCGAGGCGGAGAGGTGCGGCGGC




AGCCAATCAGAGCGGCGCGCTCCGAAAGT




TTCCTTTTATGGCGAGGCGGCGGCGGCGGC




GGCCCTATAAAAAGCGAAGCGCGCGGCGG




GCGCAACCCAGCATGTCTGCTGCCACACA




CAGCCCTATGATGCAGGTCGCCTCTGGCAA




CGGCGACAGAGATCCTTTGCCTCCTGGCTG




GGAGATCAAGATCGATCCTCAGACCGGCT




GGCCCTTCTTCGTGGACCACAATAGCAGA




ACCACCACCTGGAACGACCCCAGAGTGCC




TTCTGAGGGCCCCAAAGAGACACCCAGCT




CTGCCAATGGACCCAGCAGAGAGGGAAGC




AGACTGCCACCAGCTAGAGAAGGACACCC




CGTGTATCCACAGCTGAGGCCTGGCTACAT




CCCCATTCCAGTGCTGCATGAGGGCGCCG




AAAACAGACAGGTGCACCCCTTTCACGTG




TACCCTCAGCCTGGCATGCAGCGGTTTAGA




ACAGAAGCCGCTGCTGCCGCTCCTCAGAG




ATCTCAGTCTCCTCTGAGAGGCATGCCCGA




GACAACCCAGCCTGATAAGCAGTGTGGAC




AGGTGGCAGCTGCAGCAGCAGCTCAACCT




CCTGCTTCTCACGGCCCCGAAAGAAGCCA




ATCTCCTGCCGCCTCTGATTGCAGCTCCAG




CTCTAGCTCTGCCTCTCTGCCTAGCAGCGG




CAGATCTAGCCTGGGCTCTCATCAACTGCC




CAGAGGCTACATCAGCATCCCTGTGATCCA




CGAGCAGAACGTGACCAGACCTGCTGCTC




AGCCCAGCTTCCATCAGGCCCAGAAAACA




CACTACCCCGCTCAGCAGGGCGAGTACCA




GACACACCAGCCTGTGTACCACAAGATCC




AGGGCGACGACTGGGAGCCCAGACCTCTT




AGAGCCGCTAGTCCCTTCAGATCCTCTGTG




CAGGGCGCCAGTTCTAGAGAGGGCTCTCC




TGCCAGAAGCAGCACACCTCTGCACAGCC




CATCTCCAATCAGAGTGCACACCGTGGTG




GACAGACCCCAGCAGCCTATGACACACAG




AGAGACAGCCCCTGTCAGCCAGCCTGAGA




ACAAGCCTGAGTCCAAGCCAGGACCTGTG




GGACCTGAACTGCCTCCAGGACACATCCCT




ATCCAAGTGATCCGGAAAGAGGTGGACAG




CAAGCCCGTGTCTCAGAAGCCTCCTCCACC




TAGCGAGAAAGTGGAAGTGAAAGTGCCTC




CTGCTCCTGTGCCTTGTCCTCCTCCATCTCC




TGGACCATCTGCCGTGCCTAGCTCTCCTAA




AAGCGTGGCCACCGAGGAAAGAGCCGCTC




CTTCTACAGCTCCTGCCGAGGCCACACCTC




CTAAACCTGGCGAAGCTGAAGCCCCTCCA




AAACACCCTGGCGTGCTGAAGGTGGAAGC




CATCCTGGAAAAGGTGCAGGGACTCGAGC




AGGCCGTGGACAACTTCGAGGGCAAGAAA




ACCGACAAGAAATACCTGATGATCGAGGA




ATACCTGACCAAAGAGCTGCTGGCCCTGG




ACAGCGTTGACCCTGAAGGCAGAGCAGAT




GTGCGGCAGGCTAGAAGAGATGGCGTGCG




GAAAGTGCAGACCATCCTCGAGAAGCTGG




AACAGAAAGCCATCGACGTGCCAGGCCAG




GTGCAGGTTTACGAGCTGCAGCCCTCTAAC




CTGGAAGCCGATCAGCCTCTGCAGGCCAT




CATGGAAATGGGAGCCGTGGCCGCCGACA




AGGGAAAGAAGAATGCTGGCAACGCCGAG




GATCCCCACACCGAAACACAGCAGCCTGA




AGCTACAGCCGCCGCTACCAGCAATCCCA




GCAGCATGACAGACACCCCTGGCAATCCA





GCCGCTCCATAATGA

GCGGCCGCCGGCCG







AATAAAAGATCCTTATTTTCATTGGATCT






GTGTGTTGGTTTTTTGTGTG

GTCGACTCT








AG
aggaacccctagtgatggagttggccactccctctctgcgcgctc





gctcgctcactgaggccgggcgaccaaaggtcgcccgacgcccgg




gctttgcccgggcggcctcagtgagcgagcgagcgcgcagagagg




gagtggccaa





60
scAAV with muscle
tccctctctgcgcgctcgctcgctcactgaggccgggcgaccaaagg



creatine kinase
tcgcccgacgcccgggctttgcccgggggcctcagtgagcgagcg



(MCK) promoter
agcgcgcagctgctgCTAGAGGTACCAAGGCTGTG



and AGCGCCACC
GGGGACTGAGGGCAGGCTGTAACAGGCTT



Kozak sequence
GGGGGCCAGGGCTTATACGTGCCTGGGAC




TCCCAAAGTATTACTGTTCCATGTTCCCGG




CGAAGGGCCAGCTGTCCCCCGCCAGCTAG




ACTCAGCACTTAGTTTAGGAACCAGTGAG




CAAGTCAGCCCTTGGGGCAGCCCATACAA




GGCCATGGGGCTGGGCAAGCTGCACGCCT




GGGTCCGGGGTGGGCACGGTGCCCGGGCA




ACGAGCTGAAAGCTCATCTGCTCTCAGGG




GCCCCTCCCTGGGGACAGCCCCTCCTGGCT




AGTCACACCCTGTAGGCTCCTCTATATAAC




CCAGGGGCACAGGGGCTGCCCTCAGCGCC





ACCATGTCTGCTGCCACACACAGCCCTATG





ATGCAGGTCGCCTCTGGCAACGGCGACAG




AGATCCTTTGCCTCCTGGCTGGGAGATCAA




GATCGATCCTCAGACCGGCTGGCCCTTCTT




CGTGGACCACAATAGCAGAACCACCACCT




GGAACGACCCCAGAGTGCCTTCTGAGGGC




CCCAAAGAGACACCCAGCTCTGCCAATGG




ACCCAGCAGAGAGGGAAGCAGACTGCCAC




CAGCTAGAGAAGGACACCCCGTGTATCCA




CAGCTGAGGCCTGGCTACATCCCCATTCCA




GTGCTGCATGAGGGCGCCGAAAACAGACA




GGTGCACCCCTTTCACGTGTACCCTCAGCC




TGGCATGCAGCGGTTTAGAACAGAAGCCG




CTGCTGCCGCTCCTCAGAGATCTCAGTCTC




CTCTGAGAGGCATGCCCGAGACAACCCAG




CCTGATAAGCAGTGTGGACAGGTGGCAGC




TGCAGCAGCAGCTCAACCTCCTGCTTCTCA




CGGCCCCGAAAGAAGCCAATCTCCTGCCG




CCTCTGATTGCAGCTCCAGCTCTAGCTCTG




CCTCTCTGCCTAGCAGCGGCAGATCTAGCC




TGGGCTCTCATCAACTGCCCAGAGGCTACA




TCAGCATCCCTGTGATCCACGAGCAGAAC




GTGACCAGACCTGCTGCTCAGCCCAGCTTC




CATCAGGCCCAGAAAACACACTACCCCGC




TCAGCAGGGCGAGTACCAGACACACCAGC




CTGTGTACCACAAGATCCAGGGCGACGAC




TGGGAGCCCAGACCTCTTAGAGCCGCTAG




TCCCTTCAGATCCTCTGTGCAGGGCGCCAG




TTCTAGAGAGGGCTCTCCTGCCAGAAGCA




GCACACCTCTGCACAGCCCATCTCCAATCA




GAGTGCACACCGTGGTGGACAGACCCCAG




CAGCCTATGACACACAGAGAGACAGCCCC




TGTCAGCCAGCCTGAGAACAAGCCTGAGT




CCAAGCCAGGACCTGTGGGACCTGAACTG




CCTCCAGGACACATCCCTATCCAAGTGATC




CGGAAAGAGGTGGACAGCAAGCCCGTGTC




TCAGAAGCCTCCTCCACCTAGCGAGAAAG




TGGAAGTGAAAGTGCCTCCTGCTCCTGTGC




CTTGTCCTCCTCCATCTCCTGGACCATCTG




CCGTGCCTAGCTCTCCTAAAAGCGTGGCCA




CCGAGGAAAGAGCCGCTCCTTCTACAGCT




CCTGCCGAGGCCACACCTCCTAAACCTGGC




GAAGCTGAAGCCCCTCCAAAACACCCTGG




CGTGCTGAAGGTGGAAGCCATCCTGGAAA




AGGTGCAGGGACTCGAGCAGGCCGTGGAC




AACTTCGAGGGCAAGAAAACCGACAAGAA




ATACCTGATGATCGAGGAATACCTGACCA




AAGAGCTGCTGGCCCTGGACAGCGTTGAC




CCTGAAGGCAGAGCAGATGTGCGGCAGGC




TAGAAGAGATGGCGTGCGGAAAGTGCAGA




CCATCCTCGAGAAGCTGGAACAGAAAGCC




ATCGACGTGCCAGGCCAGGTGCAGGTTTA




CGAGCTGCAGCCCTCTAACCTGGAAGCCG




ATCAGCCTCTGCAGGCCATCATGGAAATG




GGAGCCGTGGCCGCCGACAAGGGAAAGAA




GAATGCTGGCAACGCCGAGGATCCCCACA




CCGAAACACAGCAGCCTGAAGCTACAGCC




GCCGCTACCAGCAATCCCAGCAGCATGAC




AGACACCCCTGGCAATCCAGCCGCTCCAT




AATGAGCGGCCGCCGGCCGAATAAAAGAT





CCTTATTTTCATTGGATCTGTGTGTTGG






TTTTTTGTGTG

GTCGACTCTAG
aggaacccctag





tgatggagttggccactccctctctgcgcgctcgctcgctcactgagg




ccgggcgaccaaaggtcgcccgacgcccgggctttgcccgggcgg




cctcagtgagcgagcgagcgcgcagagagggagtggccaa





61
scAAV with muscle
tccctctctgcgcgctcgctcgctcactgaggccgggcgaccaaagg



creatine kinase
tcgcccgacgcccgggctttgcccgggcggcctcagtgagcgagcg



(MCK) promoter
agcgcgcagctgctgCTAGAGGTACCAAGGCTGTG



and in silico derived
GGGGACTGAGGGCAGGCTGTAACAGGCTT



Kozak sequence
GGGGGCCAGGGCTTATACGTGCCTGGGAC




TCCCAAAGTATTACTGTTCCATGTTCCCGG




CGAAGGGCCAGCTGTCCCCCGCCAGCTAG




ACTCAGCACTTAGTTTAGGAACCAGTGAG




CAAGTCAGCCCTTGGGGCAGCCCATACAA




GGCCATGGGGCTGGGCAAGCTGCACGCCT




GGGTCCGGGGTGGGCACGGTGCCCGGGCA




ACGAGCTGAAAGCTCATCTGCTCTCAGGG




GCCCCTCCCTGGGGACAGCCCCTCCTGGCT




AGTCACACCCTGTAGGCTCCTCTATATAAC




CCAGGGGCACAGGGGCTGCCCTCAGCCCC





AACATGTCTGCTGCCACACACAGCCCTATG





ATGCAGGTCGCCTCTGGCAACGGCGACAG




AGATCCTTTGCCTCCTGGCTGGGAGATCAA




GATCGATCCTCAGACCGGCTGGCCCTTCTT




CGTGGACCACAATAGCAGAACCACCACCT




GGAACGACCCCAGAGTGCCTTCTGAGGGC




CCCAAAGAGACACCCAGCTCTGCCAATGG




ACCCAGCAGAGAGGGAAGCAGACTGCCAC




CAGCTAGAGAAGGACACCCCGTGTATCCA




CAGCTGAGGCCTGGCTACATCCCCATTCCA




GTGCTGCATGAGGGCGCCGAAAACAGACA




GGTGCACCCCTTTCACGTGTACCCTCAGCC




TGGCATGCAGCGGTTTAGAACAGAAGCCG




CTGCTGCCGCTCCTCAGAGATCTCAGTCTC




CTCTGAGAGGCATGCCCGAGACAACCCAG




CCTGATAAGCAGTGTGGACAGGTGGCAGC




TGCAGCAGCAGCTCAACCTCCTGCTTCTCA




CGGCCCCGAAAGAAGCCAATCTCCTGCCG




CCTCTGATTGCAGCTCCAGCTCTAGCTCTG




CCTCTCTGCCTAGCAGCGGCAGATCTAGCC




TGGGCTCTCATCAACTGCCCAGAGGCTACA




TCAGCATCCCTGTGATCCACGAGCAGAAC




GTGACCAGACCTGCTGCTCAGCCCAGCTTC




CATCAGGCCCAGAAAACACACTACCCCGC




TCAGCAGGGCGAGTACCAGACACACCAGC




CTGTGTACCACAAGATCCAGGGCGACGAC




TGGGAGCCCAGACCTCTTAGAGCCGCTAG




TCCCTTCAGATCCTCTGTGCAGGGCGCCAG




TTCTAGAGAGGGCTCTCCTGCCAGAAGCA




GCACACCTCTGCACAGCCCATCTCCAATCA




GAGTGCACACCGTGGTGGACAGACCCCAG




CAGCCTATGACACACAGAGAGACAGCCCC




TGTCAGCCAGCCTGAGAACAAGCCTGAGT




CCAAGCCAGGACCTGTGGGACCTGAACTG




CCTCCAGGACACATCCCTATCCAAGTGATC




CGGAAAGAGGTGGACAGCAAGCCCGTGTC




TCAGAAGCCTCCTCCACCTAGCGAGAAAG




TGGAAGTGAAAGTGCCTCCTGCTCCTGTGC




CTTGTCCTCCTCCATCTCCTGGACCATCTG




CCGTGCCTAGCTCTCCTAAAAGCGTGGCCA




CCGAGGAAAGAGCCGCTCCTTCTACAGCT




CCTGCCGAGGCCACACCTCCTAAACCTGGC




GAAGCTGAAGCCCCTCCAAAACACCCTGG




CGTGCTGAAGGTGGAAGCCATCCTGGAAA




AGGTGCAGGGACTCGAGCAGGCCGTGGAC




AACTTCGAGGGCAAGAAAACCGACAAGAA




ATACCTGATGATCGAGGAATACCTGACCA




AAGAGCTGCTGGCCCTGGACAGCGTTGAC




CCTGAAGGCAGAGCAGATGTGCGGCAGGC




TAGAAGAGATGGCGTGCGGAAAGTGCAGA




CCATCCTCGAGAAGCTGGAACAGAAAGCC




ATCGACGTGCCAGGCCAGGTGCAGGTTTA




CGAGCTGCAGCCCTCTAACCTGGAAGCCG




ATCAGCCTCTGCAGGCCATCATGGAAATG




GGAGCCGTGGCCGCCGACAAGGGAAAGAA




GAATGCTGGCAACGCCGAGGATCCCCACA




CCGAAACACAGCAGCCTGAAGCTACAGCC




GCCGCTACCAGCAATCCCAGCAGCATGAC




AGACACCCCTGGCAATCCAGCCGCTCCAT




AATGAGCGGCCGCCGGCCGAATAAAAGAT





CCTTATTTTCATTGGATCTGTGTGTTGG






TTTTTTGTGTG

GTCGACTCTAG
aggaacccctag





tgatggagttggccactccctctctgcgcgctcgctcgctcactgagg




ccgggcgaccaaaggtcgcccgacgcccgggctttgcccgggcgg




cctcagtgagcgagcgagcgcgcagagagggagtggccaa





62
scAAV with muscle
tccctctctgcgcgctcgctcgctcactgaggccgggcgaccaaagg



creatine kinase
tcgcccgacgcccgggctttgcccgggcggcctcagtgagcgagcg



(MCK) promoter
agcgcgcagctgctgCTAGAGGTACCAAGGCTGTG



and CAACCCAGC
GGGGACTGAGGGCAGGCTGTAACAGGCTT



Kozak sequence
GGGGGCCAGGGCTTATACGTGCCTGGGAC




TCCCAAAGTATTACTGTTCCATGTTCCCGG




CGAAGGGCCAGCTGTCCCCCGCCAGCTAG




ACTCAGCACTTAGTTTAGGAACCAGTGAG




CAAGTCAGCCCTTGGGGCAGCCCATACAA




GGCCATGGGGCTGGGCAAGCTGCACGCCT




GGGTCCGGGGTGGGCACGGTGCCCGGGCA




ACGAGCTGAAAGCTCATCTGCTCTCAGGG




GCCCCTCCCTGGGGACAGCCCCTCCTGGCT




AGTCACACCCTGTAGGCTCCTCTATATAAC




CCAGGGGCACAGGGGCTGCCCTCCAACCC





AGCATGTCTGCTGCCACACACAGCCCTAT





GATGCAGGTCGCCTCTGGCAACGGCGACA




GAGATCCTTTGCCTCCTGGCTGGGAGATCA




AGATCGATCCTCAGACCGGCTGGCCCTTCT




TCGTGGACCACAATAGCAGAACCACCACC




TGGAACGACCCCAGAGTGCCTTCTGAGGG




CCCCAAAGAGACACCCAGCTCTGCCAATG




GACCCAGCAGAGAGGGAAGCAGACTGCCA




CCAGCTAGAGAAGGACACCCCGTGTATCC




ACAGCTGAGGCCTGGCTACATCCCCATTCC




AGTGCTGCATGAGGGCGCCGAAAACAGAC




AGGTGCACCCCTTTCACGTGTACCCTCAGC




CTGGCATGCAGCGGTTTAGAACAGAAGCC




GCTGCTGCCGCTCCTCAGAGATCTCAGTCT




CCTCTGAGAGGCATGCCCGAGACAACCCA




GCCTGATAAGCAGTGTGGACAGGTGGCAG




CTGCAGCAGCAGCTCAACCTCCTGCTTCTC




ACGGCCCCGAAAGAAGCCAATCTCCTGCC




GCCTCTGATTGCAGCTCCAGCTCTAGCTCT




GCCTCTCTGCCTAGCAGCGGCAGATCTAGC




CTGGGCTCTCATCAACTGCCCAGAGGCTAC




ATCAGCATCCCTGTGATCCACGAGCAGAA




CGTGACCAGACCTGCTGCTCAGCCCAGCTT




CCATCAGGCCCAGAAAACACACTACCCCG




CTCAGCAGGGCGAGTACCAGACACACCAG




CCTGTGTACCACAAGATCCAGGGCGACGA




CTGGGAGCCCAGACCTCTTAGAGCCGCTA




GTCCCTTCAGATCCTCTGTGCAGGGCGCCA




GTTCTAGAGAGGGCTCTCCTGCCAGAAGC




AGCACACCTCTGCACAGCCCATCTCCAATC




AGAGTGCACACCGTGGTGGACAGACCCCA




GCAGCCTATGACACACAGAGAGACAGCCC




CTGTCAGCCAGCCTGAGAACAAGCCTGAG




TCCAAGCCAGGACCTGTGGGACCTGAACT




GCCTCCAGGACACATCCCTATCCAAGTGAT




CCGGAAAGAGGTGGACAGCAAGCCCGTGT




CTCAGAAGCCTCCTCCACCTAGCGAGAAA




GTGGAAGTGAAAGTGCCTCCTGCTCCTGTG




CCTTGTCCTCCTCCATCTCCTGGACCATCT




GCCGTGCCTAGCTCTCCTAAAAGCGTGGCC




ACCGAGGAAAGAGCCGCTCCTTCTACAGC




TCCTGCCGAGGCCACACCTCCTAAACCTGG




CGAAGCTGAAGCCCCTCCAAAACACCCTG




GCGTGCTGAAGGTGGAAGCCATCCTGGAA




AAGGTGCAGGGACTCGAGCAGGCCGTGGA




CAACTTCGAGGGCAAGAAAACCGACAAGA




AATACCTGATGATCGAGGAATACCTGACC




AAAGAGCTGCTGGCCCTGGACAGCGTTGA




CCCTGAAGGCAGAGCAGATGTGCGGCAGG




CTAGAAGAGATGGCGTGCGGAAAGTGCAG




ACCATCCTCGAGAAGCTGGAACAGAAAGC




CATCGACGTGCCAGGCCAGGTGCAGGTTT




ACGAGCTGCAGCCCTCTAACCTGGAAGCC




GATCAGCCTCTGCAGGCCATCATGGAAAT




GGGAGCCGTGGCCGCCGACAAGGGAAAGA




AGAATGCTGGCAACGCCGAGGATCCCCAC




ACCGAAACACAGCAGCCTGAAGCTACAGC




CGCCGCTACCAGCAATCCCAGCAGCATGA




CAGACACCCCTGGCAATCCAGCCGCTCCAT




AATGAGCGGCCGCCGGCCGAATAAAAGAT





CCTTATTTTCATTGGATCTGTGTGTTGG






TTTTTTGTGTG

GTCGACTCTAG
aggaacccctag





tgatggagttggccactccctctctgcgcgctcgctcgctcactgagg




ccgggcgaccaaaggtcgcccgacgcccgggctttgcccgggcgg




cctcagtgagcgagcgagcgcgcagagagggagtggccaa





















SEQ ID:
Description
Protein Sequence







23
Exon 1
MSAATHSPMMQVASGNGDRDPLPPG




WEIKIDPQTGWPFFVDHNSRTTTWNDP




RVPSEGPK





24
Exon 2
ETPSSANGPSREGSRLPPAREGHPVYPQ




LRPGYIPIPVLHEGAENRQVHPFHVYP




QPGMQRFRTEAAAAAPQRSQSPLRGM




PETTQPDKQCGQVAAAAAAQPPASHG




PE





25
Exon 3
RSQSPAASDCSSSSSSASLPSSGRSSLGS




HQLPRGYISIPVIHEQNVTRPAAQPSFH




QAQKTHYPAQQGEYQTHQPVYHKIQG




DDWEPRPLRAASPFRSSVQGASSREGS




PARSSTPLHSPSPIRVHTVVDRPQ





26
Exon 4
QPMTHRETAPVSQPENKPESKPGPVGP




ELPPGHIPIQVIRKEVDSKPVSQKPPPPS




EKVEVKVPPAPVPCPPPSPGPSAVPSSP




KSVATEERAAPSTAPAEATPPKPGEAE




APPKHPGVLKVEAILEKVQGLEQAVD




NFEGKKTDKKYLMIEEYLTKELLALDS




VDPEGRADVRQARRDGVRKVQTILEK




LEQKAIDVPGQVQVYELQPSNLEADQP




LQAIMEMGAVAADKGKKNAGNAEDP




HTETQQPEATAAATSNPSSMTDTPGNP




AAP





27
Rh74 VP1 (VP2,
MAADGYLPDWLEDNLSEGIREWWDL



VP3)
KPGAPKPKANQQKQDNGRGLVLPGYK




YLGPFNGLDKGEPVNAADAAALEHDK




AYDQQLQAGDNPYLRYNHADAEFQE




RLQEDTSFGGNLGRAVFQAKKRVLEP




LGLVESPVKTAPGKKRPVEPSPQRSPD




SSTGIGKKGQQPAKKRLNFGQTGDSES




VPDPQPIGEPPAGPSGLGSGTMAAGGG




APMADNNEGADGVGSSSGNWHCDST




WLGDRVITTSTRTWALPTYNNHLYKQI




SNGTSGGSTNDNTYFGYSTPWGYFDF




NRFHCHFSPRDWQRLINNNWGFRPKR




LNFKLFNIQVKEVTQNEGTKTIANNLT




STIQVFTDSEYQLPYVLGSAHQGCLPPF




PADVFMIPQYGYLTLNNGSQAVGRSSF




YCLEYFPSQMLRTGNNFEFSYNFEDVP




FHSSYAHSQSLDRLMNPLIDQYLYYLS




RTQSTGGTAGTQQLLFSQAGPNNMSA




QAKNWLPGPCYRQQRVSTTLSQNNNS




NFAWTGATKYHLNGRDSLVNPGVAM




ATHKDDEERFFPSSGVLMFGKQGAGK




DNVDYSSVMLTSEEEIKTTNPVATEQY




GVVADNLQQQNAAPIVGAVNSQGALP




GMVWQNRDVYLQGPIWAKIPHTDGNF




HPSPLMGGFGLKHPPPQILIKNTPVPAD




PPTTFNQAKLASFITQYSTGQVSVEIEW




ELQKENSKRWNPEIQYTSNYYKSTNV




DFAVNTEGTYSEPRPIGTRYLTRNL





28
AAV 9 VP1
MAADGYLPDWLEDNLSEGIREWWAL




KPGAPQPKANQQHQDNARGLVLPGYK




YLGPGNGLDKGEPVNAADAAALEHD




KAYDQQLKAGDNPYLKYNHADAEFQ




ERLKEDTSFGGNLGRAVFQAKKRLLEP




LGLVEEAAKTAPGKKRPVEQSPQEPDS




SAGIGKSGAQPAKKRLNFGQTGDTESV




PDPQPIGEPPAAPSGVGSLTMASGGGA




PVADNNEGADGVGSSSGNWHCDSQW




LGDRVITTSTRTWALPTYNNHLYKQIS




NSTSGGSSNDNAYFGYSTPWGYFDFN




RFHCHFSPRDWQRLINNNWGFRPKRL




NFKLFNIQVKEVTDNNGVKTIANNLTS




TVQVFTDSDYQLPYVLGSAHEGCLPPF




PADVFMIPQYGYLTLNDGSQAVGRSSF




YCLEYFPSQMLRTGNNFQFSYEFENVP




FHSSYAHSQSLDRLMNPLIDQYLYYLS




KTINGSGQNQQTLKFSVAGPSNMAVQ




GRNYIPGPSYRQQRVSTTVTQNNNSEF




AWPGASSWALNGRNSLMNPGPAMAS




HKEGEDRFFPLSGSLIFGKQGTGRDNV




DADKVMITNEEEIKTTNPVATESYGQV




ATNHQSAQAQAQTGWVQNQGILPGM




VWQDRDVYLQGPIWAIPHTDGNFHPS




PLMGGFGMKHPPPQILIKNTPVPADPPT




AFNKDKLNSFITQYSTGQVSVEIEWEL




QKENSKRWNPEIQYTSNYYKSNNVEF




AVNTEGVYSEPRPIGTRYLTRNL





54
BAG3 protein
MSAATHSPMMQVASGNGDRDPLPPG



sequence
WEIKIDPQTGWPFFVDHNSRTTTWNDP




RVPSEGPKETPSSANGPSREGSRLPPAR




EGHPVYPQLRPGYIPIPVLHEGAENRQ




VHPFHVYPQPGMQRFRTEAAAAAPQR




SQSPLRGMPETTQPDKQCGQVAAAAA




AQPPASHGPERSQSPAASDCSSSSSSAS




LPSSGRSSLGSHQLPRGYISIPVIHEQNV




TRPAAQPSFHQAQKTHYPAQQGEYQT




HQPVYHKIQGDDWEPRPLRAASPFRSS




VQGASSREGSPARSSTPLHSPSPIRVHT




VVDRPQQPMTHRETAPVSQPENKPESK




PGPVGPELPPGHIPIQVIRKEVDSKPVS




QKPPPPSEKVEVKVPPAPVPCPPPSPGP




SAVPSSPKSVATEERAAPSTAPAEATPP




KPGEAEAPPKHPGVLKVEAILEKVQGL




EQAVDNFEGKKTDKKYLMIEEYLTKE




LLALDSVDPEGRADVRQARRDGVRKV




QTILEKLEQKAIDVPGQVQVYELQPSN




LEADQPLQAIMEMGAVAADKGKKNA




GNAEDPHTETQQPEATAAATSNPSSMT




DTPGNPAAP










Construct 2 (pTR-CK8-Ex1_Intron1-Ex2-Bag3)














SEQ ID
Elements (5′→3′)
Nt sequence







30
ITR-L
TTGGCCACTCCCTCTCTGCGCGCTCG




CTCGCTCACTGAGGCCGGGCGACCA




AAGGTCGCCCGACGCCCGGGCTTTG




CCCGGGCGGCCTCAGTGAGCGAGCG




AGCGCGCAGAGAGGGAGTGGCCAA




CTCCATCACTAGGGGTTCCT





31
CK8
AGACTAGCATGCTGCCCATGTAAGG




AGGCAAGGCCTGGGGACACCCGAG




ATGCCTGGTTATAATTAACCCAGAC




ATGTGGCTGCCCCCCCCCCCCCAAC




ACCTGCTGCCTCTAAAAATAACCCT




GCATGCCATGTTCCCGGCGAAGGGC




CAGCTGTCCCCCGCCAGCTAGACTC




AGCACTTAGTTTAGGAACCAGTGAG




CAAGTCAGCCCTTGGGGCAGCCCAT




ACAAGGCCATGGGGCTGGGCAAGCT




GCACGCCTGGGTCCGGGGTGGGCAC




GGTGCCCGGGCAACGAGCTGAAAGC




TCATCTGCTCTCAGGGGCCCCTCCCT




GGGGACAGCCCCTCCTGGCTAGTCA




CACCCTGTAGGCTCCTCTATATAACC




CAGGGGCACAGGGGCTGCCCTCATT




CTACCACCACCTCCACAGCACAGAC




AGACACTCAGGAGCCAGCCAAA





32
“Exon1 Non-coding”
GAATTCGATATCAAGCTTGCATCCA




ACCCCGGGCCGCGGCCAACTTCTCT




GGACTGGACCAGAAGTTTCTAGCCG




GCCAGTTGCTACCTCCCTTTATCTCC




TCCTTCCCCTCTGGCAGCGAGGAGG




CTATTTCCAGACACTTCCACCCCTCT




CTGGCCACGTCACCCCCGCCTTTAAT




TCATAAAGGTGCCCGGCGCCGGCTT




CCCGGACACGTCGGCGGCGGAGAG




GGGCCCACGGCGGCGGCCCGGCCAG




AGACTCGGCGCCCGGAGCCAGCGCC




CCGCACCCGCGCCCCAGCGGGCAGA




CCCCAACCCAGCGCCACC





33
Exon 1
ATGTCTGCTGCCACACACTCTCCAAT




GATGCAGGTTGCCTCTGGCAATGGG




GACAGAGATCCTCTGCCTCCTGGCT




GGGAGATCAAGATTGATCCTCAGAC




AGGCTGGCCCTTCTTTGTGGACCAC




AACAGCAGAACCACCACCTGGAATG




ACCCCAGAGTGCCCTCTGAGGGCCC




TAAG





34
Intron 1
GTTTCAAGAGCTAGAGGCCCTCCTT




GGAGTGTGGCTCCTCCTAGAAGGCA




AGCTGCTGGCTCTGGACTTGGAAGA




GGGGATGCTAGAAGAAGAGGCCCT




GGAGTTGGAGAGGGCCCCAGTAGA




GCTGATACAGGATCTGCCCCTAGAC




ACACCCTGCCTCTGAGGCCTGATGA




GAGCCCACTGAGAACCAGACCTTGA




CAGGCGTCGGGGCGAAAGGAGGCC




CCGGGATTCGGTGGCCCGGGAAGCG




ACCCCGCAGTGGCTCCGGTGCCGTC




CACGGCTCGACTCCAGGGCGGAAGG




CCGGGTGTCCAGCGCTGGCCTCGCG




CTCTAGGGCTGGGAGAGGGGCGGCC




GGCCTGGTCAGCTCCGGAGGCCCCG




GCCCACCGTGGCCCCTGCTGCCCGC




TCGTGCTGTAATGTAGAGGTTGGAG




CTGACCCCTGCTCCTGGAGCTCATCT




TCTGATCCGGGTCTCTGAAAATGCG




GGCATGGGCAGCTCTCCTACACTCA




CCGCTTCCCTCAGTCACCCAGAAAA




GAAACCTGTCTTACCAGTTTGGAGA




ATTGGGACCTTTCCTTTGCTTAACAG




ATACTTTTGGCTTTCTCCTGATGCCC




CTAATTCCTAAACTGTTGGCCAAAT




AGCAACCTCTATGGGGTGGGGGGTT




TGGAGGGTACAGGGGCTGGGAGCTG




GCTGACGCTTTGAGGCCCAAGTCAC




TCGGGAAGATCACAATGCCAAGCGC




CACAGTGTTTCCTCTGCCAGGAGGG




TTCACTTCCCAGTTTCTAACCAGCCT




GTGTTTCTCCACTTTTTATTTCAG





35
Exon 2
GAGACACCCAGCTCTGCCAATGGAC




CCAGCAGAGAGGGAAGCAGACTGC




CACCAGCTAGAGAAGGACACCCCGT




GTATCCACAGCTGAGGCCTGGCTAC




ATCCCCATTCCAGTGCTGCATGAGG




GCGCCGAAAACAGACAGGTGCACCC




CTTTCACGTGTACCCTCAGCCTGGCA




TGCAGAGATTCAGAACAGAGGCTGC




TGCTGCAGCCCCTCAGAGATCTCAA




TCTCCTCTGAGAGGCATGCCAGAGA




CAACACAGCCTGACAAGCAGTGTGG




ACAGGTGGCAGCAGCAGCTGCAGCT




CAACCTCCTGCTTCTCATGGCCCTGA




G





36
Exon 3
AGAAGCCAGTCTCCTGCTGCTTCTG




ATTGCAGCAGCTCCAGTAGCTCTGC




CTCTCTGCCTAGCTCTGGCAGATCTT




CTCTGGGCAGCCATCAGCTGCCTAG




AGGCTACATCAGCATCCCTGTGATC




CATGAGCAGAATGTGACCAGACCAG




CTGCTCAGCCTAGCTTCCACCAGGC




TCAGAAAACACACTACCCTGCTCAG




CAAGGGGAGTACCAGACACACCAG




CCAGTGTACCACAAGATCCAAGGGG




ATGACTGGGAGCCCAGACCACTGAG




AGCTGCTAGCCCCTTTAGAAGCTCT




GTGCAAGGGGCCAGCTCTAGAGAGG




GCTCTCCTGCCAGAAGCAGCACACC




TCTGCACAGCCCATCTCCAATCAGA




GTGCACACCGTGGTGGACAGACCCC




AG





37
Exon 4
CAGCCTATGACACACAGAGAGACAG




CCCCTGTCAGCCAGCCTGAGAACAA




GCCTGAGTCCAAGCCAGGACCTGTG




GGACCTGAACTGCCTCCAGGACACA




TCCCTATCCAAGTGATCCGGAAAGA




GGTGGACAGCAAGCCCGTGTCTCAG




AAGCCTCCTCCACCTAGCGAGAAAG




TGGAAGTGAAAGTGCCTCCTGCTCC




TGTGCCTTGTCCTCCTCCATCTCCTG




GACCATCTGCCGTGCCTAGCTCTCCT




AAAAGCGTGGCCACCGAGGAAAGA




GCCGCTCCTTCTACAGCTCCTGCCGA




GGCCACACCTCCTAAACCTGGCGAA




GCTGAAGCCCCTCCAAAACACCCTG




GCGTGCTGAAGGTGGAAGCCATCCT




GGAAAAGGTGCAGGGACTCGAGCA




GGCCGTGGACAACTTCGAGGGCAAG




AAAACCGACAAGAAATACCTGATGA




TCGAGGAATACCTGACCAAAGAGCT




GCTGGCCCTGGACAGCGTTGACCCT




GAAGGCAGAGCAGATGTGCGGCAG




GCTAGAAGAGATGGCGTGCGGAAA




GTGCAGACCATCCTCGAGAAGCTGG




AACAGAAAGCCATCGACGTGCCAGG




CCAGGTGCAGGTTTACGAGCTGCAG




CCCTCTAACCTGGAAGCCGATCAGC




CTCTGCAGGCCATCATGGAAATGGG




AGCCGTGGCCGCCGACAAGGGAAA




GAAGAATGCTGGCAACGCCGAGGAT




CCCCACACCGAAACACAGCAGCCTG




AAGCTACAGCCGCCGCTACCAGCAA




TCCCAGCAGCATGACAGACACCCCT




GGCAATCCAGCCGCTCCA





38
Stop
TAATGATAG





39
“Exon 4 UTR”
CCTCTGCCCTGTAAAAATCAGACTC




GGAACCGATGTGTGCTTTAGGGAAT




TTTAAGTTGCATGCATTTCAGAGACT




TTAAGTCAGTTGGTTTTTATTAGCTG




CTTGGTATGCAGTAACTTGGGTGGA




GGCAAAACACTAATAAAAGGGCTA




AAAAGGAAAATGATGCTTTTCTTCT




ATATTCTTACTCTGTACCAATAAAG




AAGTTGCTTGTTGTTTGAGAAGTTTA




ACCCCGTTGCTTGTTGTTCTGCAGCC




CTGTCTACTTGGGCACCCCCACCAC




CTGTTAGCTGTGGTTGTGCACTGTCT




TTTGTAGCTCTGGACTGGAGGGGTA




GATGGGGAGTCAATTACCCATCACA




TAAATATGAAACATTTATCAGAAAT




GTTGCCATTTTAATGAGATGATTTTC




TTCATCTCATAATTAAAATACCTGAC




TTTAGAGAGAGTAAAATGTGCCAGG




AGCCATAGGAATATCTGTATGTTGG




ATGACTTTAATGCTACATTTTAAAA




AAAGAAAATAAAGTAATAATATAAC




TCAAAA





40
Spacer
GCGGCCGCTCGAGTCTAGA





41
ITR-R
AGGAACCCCTAGTGATGGAGTTGGC




CACTCCCTCTCTGCGCGCTCGCTCGC




TCACTGAGGCCGGGCGACCAAAGGT




CGCCCGACGCCCGGGCTTTGCCCGG




GCGGCCTCAGTGAGCGAGCGAGCGC




GCAGAGAGGGAGTGGCCAA










Construct 3 (pTR2-mDes-Ex1-Intron1-Ex2-Bag3)














SEQ ID
Elements (5′→3′)
Nt sequence







42
ITR-L
TTGGCCACTCCCTCTCTGCGCGCTCG




CTCGCTCACTGAGGCCGGGCGACCA




AAGGTCGCCCGACGCCCGGGCTTTGC




CCGGGCGGCCTCAGTGAGCGAGCGA




GCGCGCAGAGAGGGAGTGGCCAACT




CCATCACTAGGGGTTCCT





43
mDES
GATCTTACCCCCTGCCCCCCACAGCT




CCTCTCCTGTGCCTTGTTTCCCAGCCA




TGCGTTCTCCTCTATAAATACCCGCT




CTGGTATTTGGGGTTGGCAGCTGTTG




CTGCCAGGGAGATGGTTGGGTTGAC




ATGCGGCTCCTGACAAAACACAAAC




CCCTGGTGTGTGTGGGCGTGGGTGGT




GTGAGTAGGGGGATGAATCAGGGAG




GGGGCGGGGGACCCAGGGGGCAGGA




GCCACACAAAGTCTGTGCGGGGGTG




GGAGCGCACATAGCAATTGGAAACT




GAAAGGTTATCAGACCCTTTCTGGAA




ATCAGCCCACTGTTTATAAACTTGAG




GCCCCACCCTCGAGATAACCAGGGCT




GAAAGAGGCCCGCCTGGGGGCTGGA




GACATGCTTGCTGCCTGCCCTGGCGA




AGGATTGGCAGGCTTGCCCGTCACAG




GACCCCCGCTGGCTGACTCAGGGGC




GCAGGCCTCTTGCGGGGGAGCTGGC




CTCCCCGCCCCCACGGCCACGGGCCG




CCCTTTCCTGGCAGGACAGCGGGATC




TTGCAGCTGTCAGGGGAGGGGAGGC




GGGGGCTGATGTCAGGAGGGATACA




AATAGTGCCGACGGCTGGGGGCCCT




GTCTCCCCTCGCCGCATCCACTCTCC




GGCCGGCCGCCTGTCCGCCGCCTCCT




CCGTGCGCCCGCCAGCCTCGCCCGCG




CCGTCACCGTGAGGCACTGGGCAGG




TAAGTATCAAAGTATCAAGGTTACAA




GACAGGTTTAAGGAGACCAATAGAA




ACTGGGCTTGTCGAGACAGAGAAGA




CTCTTGCGTTTCTGATAGGCACCTAT




TGGTCTTACTGACATCCACTTTGCCTT




TCTCTCCACAGGCTAG





44
“Exon1 Non-
GAATTCGATATCAAGCTTGCATCCAA



coding”
CCCCGGGCCGCGGCCAACTTCTCTGG




ACTGGACCAGAAGTTTCTAGCCGGCC




AGTTGCTACCTCCCTTTATCTCCTCCT




TCCCCTCTGGCAGCGAGGAGGCTATT




TCCAGACACTTCCACCCCTCTCTGGC




CACGTCACCCCCGCCTTTAATTCATA




AAGGTGCCCGGCGCCGGCTTCCCGG




ACACGTCGGCGGCGGAGAGGGGCCC




ACGGCGGCGGCCCGGCCAGAGACTC




GGCGCCCGGAGCCAGCGCCCCGCAC




CCGCGCCCCAGCGGGCAGACCCCAA




CCCAGCGCCACC





45
Exon 1
ATGTCTGCTGCCACACACTCTCCAAT




GATGCAGGTTGCCTCTGGCAATGGGG




ACAGAGATCCTCTGCCTCCTGGCTGG




GAGATCAAGATTGATCCTCAGACAG




GCTGGCCCTTCTTTGTGGACCACAAC




AGCAGAACCACCACCTGGAATGACC




CCAGAGTGCCCTCTGAGGGCCCTAAG





46
Intron 1
GTTTCAAGAGCTAGAGGCCCTCCTTG




GAGTGTGGCTCCTCCTAGAAGGCAA




GCTGCTGGCTCTGGACTTGGAAGAGG




GGATGCTAGAAGAAGAGGCCCTGGA




GTTGGAGAGGGCCCCAGTAGAGCTG




ATACAGGATCTGCCCCTAGACACACC




CTGCCTCTGAGGCCTGATGAGAGCCC




ACTGAGAACCAGACCTTGACAGGCG




TCGGGGCGAAAGGAGGCCCCGGGAT




TCGGTGGCCCGGGAAGCGACCCCGC




AGTGGCTCCGGTGCCGTCCACGGCTC




GACTCCAGGGCGGAAGGCCGGGTGT




CCAGCGCTGGCCTCGCGCTCTAGGGC




TGGGAGAGGGGCGGCCGGCCTGGTC




AGCTCCGGAGGCCCCGGCCCACCGT




GGCCCCTGCTGCCCGCTCGTGCTGTA




ATGTAGAGGTTGGAGCTGACCCCTGC




TCCTGGAGCTCATCTTCTGATCCGGG




TCTCTGAAAATGCGGGCATGGGCAG




CTCTCCTACACTCACCGCTTCCCTCA




GTCACCCAGAAAAGAAACCTGTCTTA




CCAGTTTGGAGAATTGGGACCTTTCC




TTTGCTTAACAGATACTTTTGGCTTTC




TCCTGATGCCCCTAATTCCTAAACTG




TTGGCCAAATAGCAACCTCTATGGGG




TGGGGGGTTTGGAGGGTACAGGGGC




TGGGAGCTGGCTGACGCTTTGAGGCC




CAAGTCACTCGGGAAGATCACAATG




CCAAGCGCCACAGTGTTTCCTCTGCC




AGGAGGGTTCACTTCCCAGTTTCTAA




CCAGCCTGTGTTTCTCCACTTTTTATT




TCAG





47
Exon 2
GAGACACCCAGCTCTGCCAATGGAC




CCAGCAGAGAGGGAAGCAGACTGCC




ACCAGCTAGAGAAGGACACCCCGTG




TATCCACAGCTGAGGCCTGGCTACAT




CCCCATTCCAGTGCTGCATGAGGGCG




CCGAAAACAGACAGGTGCACCCCTTT




CACGTGTACCCTCAGCCTGGCATGCA




GAGATTCAGAACAGAGGCTGCTGCT




GCAGCCCCTCAGAGATCTCAATCTCC




TCTGAGAGGCATGCCAGAGACAACA




CAGCCTGACAAGCAGTGTGGACAGG




TGGCAGCAGCAGCTGCAGCTCAACCT




CCTGCTTCTCATGGCCCTGAG





48
Exon 3
AGAAGCCAGTCTCCTGCTGCTTCTGA




TTGCAGCAGCTCCAGTAGCTCTGCCT




CTCTGCCTAGCTCTGGCAGATCTTCT




CTGGGCAGCCATCAGCTGCCTAGAG




GCTACATCAGCATCCCTGTGATCCAT




GAGCAGAATGTGACCAGACCAGCTG




CTCAGCCTAGCTTCCACCAGGCTCAG




AAAACACACTACCCTGCTCAGCAAG




GGGAGTACCAGACACACCAGCCAGT




GTACCACAAGATCCAAGGGGATGAC




TGGGAGCCCAGACCACTGAGAGCTG




CTAGCCCCTTTAGAAGCTCTGTGCAA




GGGGCCAGCTCTAGAGAGGGCTCTC




CTGCCAGAAGCAGCACACCTCTGCAC




AGCCCATCTCCAATCAGAGTGCACAC




CGTGGTGGACAGACCCCAG





49
Exon 4
CAGCCTATGACACACAGAGAGACAG




CCCCTGTCAGCCAGCCTGAGAACAA




GCCTGAGTCCAAGCCAGGACCTGTG




GGACCTGAACTGCCTCCAGGACACAT




CCCTATCCAAGTGATCCGGAAAGAG




GTGGACAGCAAGCCCGTGTCTCAGA




AGCCTCCTCCACCTAGCGAGAAAGTG




GAAGTGAAAGTGCCTCCTGCTCCTGT




GCCTTGTCCTCCTCCATCTCCTGGAC




CATCTGCCGTGCCTAGCTCTCCTAAA




AGCGTGGCCACCGAGGAAAGAGCCG




CTCCTTCTACAGCTCCTGCCGAGGCC




ACACCTCCTAAACCTGGCGAAGCTGA




AGCCCCTCCAAAACACCCTGGCGTGC




TGAAGGTGGAAGCCATCCTGGAAAA




GGTGCAGGGACTCGAGCAGGCCGTG




GACAACTTCGAGGGCAAGAAAACCG




ACAAGAAATACCTGATGATCGAGGA




ATACCTGACCAAAGAGCTGCTGGCCC




TGGACAGCGTTGACCCTGAAGGCAG




AGCAGATGTGCGGCAGGCTAGAAGA




GATGGCGTGCGGAAAGTGCAGACCA




TCCTCGAGAAGCTGGAACAGAAAGC




CATCGACGTGCCAGGCCAGGTGCAG




GTTTACGAGCTGCAGCCCTCTAACCT




GGAAGCCGATCAGCCTCTGCAGGCC




ATCATGGAAATGGGAGCCGTGGCCG




CCGACAAGGGAAAGAAGAATGCTGG




CAACGCCGAGGATCCCCACACCGAA




ACACAGCAGCCTGAAGCTACAGCCG




CCGCTACCAGCAATCCCAGCAGCATG




ACAGACACCCCTGGCAATCCAGCCG




CTCCA





50
Stop
TAATGATAG





51
“Exon 4 UTR”
CCTCTGCCCTGTAAAAATCAGACTCG




GAACCGATGTGTGCTTTAGGGAATTT




TAAGTTGCATGCATTTCAGAGACTTT




AAGTCAGTTGGTTTTTATTAGCTGCT




TGGTATGCAGTAACTTGGGTGGAGGC




AAAACACTAATAAAAGGGCTAAAAA




GGAAAATGATGCTTTTCTTCTATATT




CTTACTCTGTACCAATAAAGAAGTTG




CTTGTTGTTTGAGAAGTTTAACCCCG




TTGCTTGTTGTTCTGCAGCCCTGTCTA




CTTGGGCACCCCCACCACCTGTTAGC




TGTGGTTGTGCACTGTCTTTTGTAGCT




CTGGACTGGAGGGGTAGATGGGGAG




TCAATTACCCATCACATAAATATGAA




ACATTTATCAGAAATGTTGCCATTTT




AATGAGATGATTTTCTTCATCTCATA




ATTAAAATACCTGACTTTAGAGAGAG




TAAAATGTGCCAGGAGCCATAGGAA




TATCTGTATGTTGGATGACTTTAATG




CTACATTTTAAAAAAAGAAAATAAA




GTAATAATATAACTCAAAA





52
Spacer
GCGGCCGCTCGAGTCTAGA





53
ITR-R
AGGAACCCCTAGTGATGGAGTTGGC




CACTCCCTCTCTGCGCGCTCGCTCGC




TCACTGAGGCCGGGCGACCAAAGGT




CGCCCGACGCCCGGGCTTTGCCCGGG




CGGCCTCAGTGAGCGAGCGAGCGCG




CAGAGAGGGAGTGGCCAA










Construct 4 (pTR2-MHCK7-BAG3int1-dual)















Elements



SEQ ID
(5′ -> 3′)
Nt sequence







79
Left ITR
ttggccactccctctctgegcgctcgctcgctcactgaggcc




gggcgaccaaaggtcgcccgacgcccgggctttgcccgggcg




gcctcagtgagcgagcgagcgcgcagagagggagtggccaac




tccatcactaggggttcct





80
MHCK7
aagcttgcatgtctaagctagacccttcagattaaaaataac



promoter
tgaggtaagggcctgggtaggggaggtggtgtgagacgctcc




tgtctctcctctatctgcccatcggccctttggggaggagga




atgtgcccaaggactaaaaaaaggccatggagccagaggggc




gagggcaacagacctttcatgggcaaaccttggggccctgct




gtctagcatgccccactacgggtctaggctgcccatgtaagg




aggcaaggcctggggacacccgagatgcctggttataattaa




cccagacatgtggctgcccccccccccccaacacctgctgcc




tctaaaaataaccctgtccctggtggatcgcctgcatgcgaa




ggatcctcgaacaaggctgtgggggactgagggcaggctgta




acaggcttgggggccagggcttatacgtgcctgggactccca




aagtattactgttccatgttcccggcgaagggccagctgtcc




cccgccagctagactcagcacttagtttaggaaccagtgagc




aagtcagcccttggggcagcccatacaaggccatggggctgg




gcaagctgcacgcctgggtccggggtgggcacggtgcccggg




caacgagctgaaagctcatctgctctcaggggcccctccctg




gggacagcccctcctggctagtcacaccctgtaggctcctct




atataacccaggggcacaggggctgccctcattctaccacca




cctccacagcacagacagacactcaggagcagccag





81
Human Bag3
gcatccaaccccgggccgcggccaacttctctggactggacc



exon 1
agaagtttctagccggccagttgctacctccctttatctcct



non-coding
ccttcccctctggcagcgaggaggctatttccagacacttcc




acccctctctggccacgtcacccccgcctttaattcataaag




gtgcccggcgccggcttcccggacacgtcggcggcggagagg




ggcccacggcggcggcccggccagagactcggcgcccggagc




cagcgccccgcacccgcgccccagcgggcagaccccaaccca




gc





82
Kozak
gccacc



sequence






83
Human Bag3
atgtcagccgcaactcactcccctatgatgcaggtggcctct



Exon 1
ggaaatggcgaccgagaccctctgccccccggatgggaaatc




aagatcgacccccagaccggctggcctttctttgtggaccat




aatagccgaacaaccacctggaacgacccaagagtgccatca




gaaggacctaag





84
Human Bag3
gtgagccgggcccgcggcccgccctggtcggtggcgccacct



Intron 1
cgacggcaggcggcggggagtgggctgggccggggggacgcg




aggcggcggggcccgggggtcggcgaaggcccctcggggcgg




acaccggctccgcgccccgccacacactcccgctgegcccgg




acgagtccccgctccggaccegcccttgacaggcgtcggggc




gaaaggaggccccgggattcggtggcccgggaagcgaccccg




cagtggctccggtgccgtccacggctcgactccagggcggaa




ggccgggtgtccagcgctggcctcgcgctctagggctgggag




aggggcggccggCCTGGTCAGCTCCGGAGGCCCCGGCCCACC




GTGGCCCCTGCTGCCCGCTCGTGCTGTAATGTAGAGGTTGGA




GCTGACCCCTGCTCCTGGAGCTCATCTTCTGATCCGGGTCTC




TGAAAATGCGGGCATGGGCAGCTCTCCTACACTCACCGCTTC




CCTCAGTCACCCAGAAAAGAAACCTGTCTTACCAGTTTGGAG




AATTGGGACCTTTCCTTTGCTTAACAGATACTTTTGGCTTTC




TCCTGATGCCCCTAATTCCTAAACTGTTGGCCAAATAGCAAC




CTCTATGGGGTGGGGGGTTTGGAGGGTACAGGGGCTGGGAGC




TGGCTGACGCTTTGAGGCCCAAGTCACTCGGGAAGATCACAA




TGCCAAGCGCCACAGTGTTTCCTCTGCCAGGAGGGTTCACTT




CCCAGTTTCTAACCAGCCTGTGTTTCTCCACTTTTTATTTCA




G





85
Human Bag3
GAAACACCATCCAGTGCCAACGGACCATCCAGAGAAGGAAGT



exon 2 -
CGCCTGCCCCCTGCCCGAGAAGGACACCCAGTCTATCCACAG



exon 4
CTGCGGCCTGGCTATATTCCTATTCCCGTGCTGCACGAGGGA




GCAGAGAACAGACAGGTGCACCCATTCCACGTGTACCCTCAG




CCAGGCATGCAGAGGTTTCGCACAGAGGCTGCCGCCGCCGCC




CCACAGCGGAGCCAGTCCCCACTGAGAGGAATGCCTGAGACA




ACACAGCCAGACAAGCAGTGTGGACAGGTGGCTGCCGCCGCC




GCCGCCCAGCCCCCTGCCAGCCACGGCCCCGAGCGGAGCCAG




AGCCCTGCCGCCTCTGATTGTAGCTCCTCTAGCTCCTCTGCC




AGCCTGCCCAGCTCCGGCAGGTCTAGCCTGGGCAGCCACCAG




CTGCCTCGCGGCTATATCTCCATCCCAGTGATCCACGAGCAG




AACGTGACAAGACCAGCAGCACAGCCCTCCTTCCACCAGGCA




CAGAAGACCCACTACCCAGCCCAGCAGGGCGAGTATCAGACA




CACCAGCCCGTGTACCACAAAATTCAGGGCGACGATTGGGAG




CCCCGGCCTCTGAGAGCAGCCAGCCCTTTTCGGTCCTCTGTG




CAGGGAGCCAGCTCCAGGGAGGGCTCCCCAGCCCGCTCTAGC




ACCCCTCTGCACTCCCCATCTCCAATCAGAGTGCACACAGTG




GTGGACAGACCTCAGCAGCCAATGACACACAGGGAGACAGCA




CCCGTGAGCCAGCCAGAGAATAAGCCCGAGTCTAAGCCTGGC




CCAGTGGGACCCGAGCTGCCACCTGGACACATCCCAATCCAG




GTCATCCGCAAGGAGGTGGATAGCAAGCCCGTGTCCCAGAAG




CCTCCACCCCCTAGCGAGAAGGTGGAGGTGAAGGTGCCACCT




GCACCCGTGCCTTGTCCACCACCATCTCCAGGCCCCAGCGCC




GTGCCCTCCTCTCCTAAATCTGTGGCAACAGAGGAGAGAGCA




GCACCTTCTACCGCACCAGCAGAGGCAACACCACCTAAGCCT




GGAGAGGCAGAGGCACCACCTAAGCACCCTGGCGTGCTGAAG




GTGGAGGCCATCCTGGAGAAGGTGCAGGGCCTGGAGCAGGCC




GTGGACAACTTCGAGGGCAAGAAGACCGATAAGAAGTACCTG




ATGATCGAGGAGTATCTGACAAAGGAGCTGCTGGCCCTGGAC




TCTGTGGACCCCGAGGGAAGGGCAGACGTGCGCCAGGCCCGG




AGAGATGGCGTGCGCAAGGTGCAGACCATCCTGGAGAAGCTG




GAGCAGAAGGCCATCGACGTGCCTGGCCAGGTGCAGGTGTAC




GAGCTGCAGCCTTCCAATCTGGAGGCCGATCAGCCACTGCAG




GCAATCATGGAGATGGGAGCAGTGGCAGCAGACAAGGGCAAG




AAGAACGCAGGAAATGCAGAGGACCCCCACACCGAGACACAG




CAGCCTGAGGCCACAGCCGCAGCAACCAGTAATCCATCCAGT




ATGACAGACACCCCAGGAAACCCCGCAGCCCCA


86
Stop codons
TAATAATAG





87
Human Bag 3
CCTCTGCCCTGTAAAAATCAGACTCGGAACCGATGTGTGCTT



3′UTR
TAGGGAATTTTAAGTTGCATGCATTTCAGAGACTTTAAGTCA




GTTGGTTTTTATTAGCTGCTTGGTATGCAGTAACTTGGGTGG




AGGCAAAACACTAATAAAAGGGCTAAAAAGGAAAATGATGCT




TTTCTTCTATATTCTTACTCTGTACCAATAAAGAAGTTGCTT




GTTGTTTGAGAAGTTTAACCCCGTTGCTTGTTGTTCTGCAGC




CCTGTCTACTTGGGCACCCCCACCACCTGTTAGCTGTGGTTG




TGCACTGTCTTTTGTAGCTCTGGACTGGAGGGGTAGATGGGG




AGTCAATTACCCATCACATAAATATGAAACATTTATCAGAAA




TGTTGCCATTTTAATGAGATGATTTTCTTCATCTCATAATTA




AAATACCTGACTTTAGAGAGAGTAAAATGTGCCAGGAGCCAT




AGGAATATCTGTATGTTGGATGACTTTAATGCTACATTTTAA




AAAAAGAAAATAAAGTAATAATATAACTCAAAA





88
bGH poly A
ctgtgccttctagttgccagccatctgttgtttgcccctccc



signal
ccgtgccttccttgaccctggaaggtgccactcccactgtcc




tttcctaataaaatgaggaaattgcatcgcattgtctgagta




ggtgtcattctattctggggggggggtggggcaggacagcaa




gggggaggattgggaagacaatagcaggcatgctgggga





89
Right ITR
aggaacccctagtgatggagttggccactccctctctgcgcg




ctcgctcgctcactgaggccgggcgaccaaaggtcgcccgac




gcccgggctttgcccgggcggcctcagtgagcgagcgagcgc




gcagagagggagtggccaa





90
pTR2-MHCK7-
ttggccactccctctctgcgcgctcgctcgctcactgaggcc



BAG3int1-
gggcgaccaaaggtcgcccgacgcccgggctttgcccgggcg



dual
gcctcagtgagcgagcgagcgcgcagagagggagtggccaac



ITR-to-ITR
tccatcactaggggttccttctagaggcgcgccaagcttgca



sequence
tgtctaagctagacccttcagattaaaaataactgaggtaag




ggcctgggtaggggaggtggtgtgagacgctcctgtctctcc




tctatctgcccatcggccctttggggaggaggaatgtgccca




aggactaaaaaaaggccatggagccagaggggcgagggcaac




agacctttcatgggcaaaccttggggccctgctgtctagcat




gccccactacgggtctaggctgcccatgtaaggaggcaaggc




ctggggacacccgagatgcctggttataattaacccagacat




gtggctgcccccccccccccaacacctgctgcctctaaaaat




aaccctgtccctggtggatcgcctgcatgcgaaggatcctcg




aacaaggctgtgggggactgagggcaggctgtaacaggcttg




ggggccagggcttatacgtgcctgggactcccaaagtattac




tgttccatgttcccggcgaagggccagctgtcccccgccagc




tagactcagcacttagtttaggaaccagtgagcaagtcagcc




cttggggcagcccatacaaggccatggggctgggcaagctgc




acgcctgggtccggggtgggcacggtgcccgggcaacgagct




gaaagctcatctgctctcaggggcccctccctggggacagcc




cctcctggctagtcacaccctgtaggctcctctatataaccc




aggggcacaggggctgccctcattctaccaccacctccacag




cacagacagacactcaggagcagccagcgaattcgctagccg




catccaaccccgggccgcggccaacttctctggactggacca




gaagtttctagccggccagttgctacctccctttatctcctc




cttcccctctggcagcgaggaggctatttccagacacttcca




cccctctctggccacgtcacccccgcctttaattcataaagg




tgcccggcgccggcttcccggacacgtcggcggcggagaggg




gcccacggcggcggcccggccagagactcggcgcccggagcc




agcgccccgcacccgcgccccagcgggcagaccccaacccag




cgccaccatgtcagccgcaactcactcccctatgatgcaggt




ggcctctggaaatggcgaccgagaccctctgccccccggatg




ggaaatcaagatcgacccccagaccggctggcctttctttgt




ggaccataatagccgaacaaccacctggaacgacccaagagt




gccatcagaaggacctaaggtgagccgggcccgcggcccgcc




ctggtcggtggcgccacctcgacggcaggcggcggggagtgg




gctgggccggggggacgcgaggcggcggggcccgggggtcgg




cgaaggcccctcgcgggcggacaccggctccgcgccccgcca




cacactcccgctgcgcccggacgagtccccgctccggacccg




cccttgacaggcgtcggggcgaaaggaggccccgggattcgg




tggcccgggaagcgaccccgcagtggctccggtgccgtccac




ggctcgactccagggcggaaggccgggtgtccagcgctggcc




tcgcgctctagggctgggagaggggcggccggCCTGGTCAGC




TCCGGAGGCCCCGGCCCACCGTGGCCCCTGCTGCCCGCTCGT




GCTGTAATGTAGAGGTTGGAGCTGACCCCTGCTCCTGGAGCT




CATCTTCTGATCCGGGTCTCTGAAAATGCGGGCATGGGCAGC




TCTCCTACACTCACCGCTTCCCTCAGTCACCCAGAAAAGAAA




CCTGTCTTACCAGTTTGGAGAATTGGGACCTTTCCTTTGCTT




AACAGATACTTTTGGCTTTCTCCTGATGCCCCTAATTCCTAA




ACTGTTGGCCAAATAGCAACCTCTATGGGGTGGGGGGTTTGG




AGGGTACAGGGGCTGGGAGCTGGCTGACGCTTTGAGGCCCAA




GTCACTCGGGAAGATCACAATGCCAAGCGCCACAGTGTTTCC




TCTGCCAGGAGGGTTCACTTCCCAGTTTCTAACCAGCCTGTG




TTTCTCCACTTTTTATTTCAGGAAACACCATCCAGTGCCAAC




GGACCATCCAGAGAAGGAAGTCGCCTGCCCCCTGCCCGAGAA




GGACACCCAGTCTATCCACAGCTGCGGCCTGGCTATATTCCT




ATTCCCGTGCTGCACGAGGGAGCAGAGAACAGACAGGTGCAC




CCATTCCACGTGTACCCTCAGCCAGGCATGCAGAGGTTTCGC




ACAGAGGCTGCCGCCGCCGCCCCACAGCGGAGCCAGTCCCCA




CTGAGAGGAATGCCTGAGACAACACAGCCAGACAAGCAGTGT




GGACAGGTGGCTGCCGCCGCCGCCGCCCAGCCCCCTGCCAGC




CACGGCCCCGAGCGGAGCCAGAGCCCTGCCGCCTCTGATTGT




AGCTCCTCTAGCTCCTCTGCCAGCCTGCCCAGCTCCGGCAGG




TCTAGCCTGGGCAGCCACCAGCTGCCTCGCGGCTATATCTCC




ATCCCAGTGATCCACGAGCAGAACGTGACAAGACCAGCAGCA




CAGCCCTCCTTCCACCAGGCACAGAAGACCCACTACCCAGCC




CAGCAGGGCGAGTATCAGACACACCAGCCCGTGTACCACAAA




ATTCAGGGCGACGATTGGGAGCCCCGGCCTCTGAGAGCAGCC




AGCCCTTTTCGGTCCTCTGTGCAGGGAGCCAGCTCCAGGGAG




GGCTCCCCAGCCCGCTCTAGCACCCCTCTGCACTCCCCATCT




CCAATCAGAGTGCACACAGTGGTGGACAGACCTCAGCAGCCA




ATGACACACAGGGAGACAGCACCCGTGAGCCAGCCAGAGAAT




AAGCCCGAGTCTAAGCCTGGCCCAGTGGGACCCGAGCTGCCA




CCTGGACACATCCCAATCCAGGTCATCCGCAAGGAGGTGGAT




AGCAAGCCCGTGTCCCAGAAGCCTCCACCCCCTAGCGAGAAG




GTGGAGGTGAAGGTGCCACCTGCACCCGTGCCTTGTCCACCA




CCATCTCCAGGCCCCAGCGCCGTGCCCTCCTCTCCTAAATCT




GTGGCAACAGAGGAGAGAGCAGCACCTTCTACCGCACCAGCA




GAGGCAACACCACCTAAGCCTGGAGAGGCAGAGGCACCACCT




AAGCACCCTGGCGTGCTGAAGGTGGAGGCCATCCTGGAGAAG




GTGCAGGGCCTGGAGCAGGCCGTGGACAACTTCGAGGGCAAG




AAGACCGATAAGAAGTACCTGATGATCGAGGAGTATCTGACA




AAGGAGCTGCTGGCCCTGGACTCTGTGGACCCCGAGGGAAGG




GCAGACGTGCGCCAGGCCCGGAGAGATGGCGTGCGCAAGGTG




CAGACCATCCTGGAGAAGCTGGAGCAGAAGGCCATCGACGTG




CCTGGCCAGGTGCAGGTGTACGAGCTGCAGCCTTCCAATCTG




GAGGCCGATCAGCCACTGCAGGCAATCATGGAGATGGGAGCA




GTGGCAGCAGACAAGGGCAAGAAGAACGCAGGAAATGCAGAG




GACCCCCACACCGAGACACAGCAGCCTGAGGCCACAGCCGCA




GCAACCAGTAATCCATCCAGTATGACAGACACCCCAGGAAAC




CCCGCAGCCCCATAATAATAGCCTCTGCCCTGTAAAAATCAG




ACTCGGAACCGATGTGTGCTTTAGGGAATTTTAAGTTGCATG




CATTTCAGAGACTTTAAGTCAGTTGGTTTTTATTAGCTGCTT




GGTATGCAGTAACTTGGGTGGAGGCAAAACACTAATAAAAGG




GCTAAAAAGGAAAATGATGCTTTTCTTCTATATTCTTACTCT




GTACCAATAAAGAAGTTGCTTGTTGTTTGAGAAGTTTAACCC




CGTTGCTTGTTGTTCTGCAGCCCTGTCTACTTGGGCACCCCC




ACCACCTGTTAGCTGTGGTTGTGCACTGTCTTTTGTAGCTCT




GGACTGGAGGGGTAGATGGGGAGTCAATTACCCATCACATAA




ATATGAAACATTTATCAGAAATGTTGCCATTTTAATGAGATG




ATTTTCTTCATCTCATAATTAAAATACCTGACTTTAGAGAGA




GTAAAATGTGCCAGGAGCCATAGGAATATCTGTATGTTGGAT




GACTTTAATGCTACATTTTAAAAAAAGAAAATAAAGTAATAA




TATAACTCAAAAGCggccgcctcgagctgtgccttctagttg




ccagccatctgttgtttgcccctcccccgtgccttccttgac




cctggaaggtgccactcccactgtcctttcctaataaaatga




ggaaattgcatcgcattgtctgagtaggtgtcattctattct




ggggggtggggtggggcaggacagcaagggggaggattggga




agacaatagcaggcatgctggggagtcgacgcgccggcgtct




agaaggaacccctagtgatggagttggccactccctctctgc




gcgctcgctcgctcactgaggccgggcgaccaaaggtcgccc




gacgcccgggctttgcccgggcggcctcagtgagcgagcgag




cgcgcagagagggagtggccaa










Construct 5 (pTR2-Des-BAG3int1-dual)














SEQ
Elements



ID
(5′ -> 3′)
Nt sequence

















91
Left ITR
TTGGCCACTCCCTCTCTGCGCGCTCG




CTCGCTCACTGAGGCCGGGCGACCAA




AGGTCGCCCGACGCCCGGGCTTTGCC




CGGGCGGCCTCAGTGAGCGAGCGAGC




GCGCAGAGAGGGAGTGGCCAACTCCA




TCACTAGGGGTTCCT





92
Desmin
CCCTGCCCCCACAGCTCCTCTCCTGT



promoter
GCCTTGTTTCCCAGCCATGCGTTCTC




CTCTATAAATACCCGCTCTGGTATTT




GGGGTTGGCAGCTGTTGCTGCCAGGG




AGATGGTTGGGTTGACATGCGGCTCC




TGACAAAACACAAACCCCTGGTGTGT




GTGGGCGTGGGTGGTGTGAGTAGGGG




GATGAATCAGGGAGGGGGCGGGGGAC




CCAGGGGGCAGGAGCCACACAAAGTC




TGTGCGGGGGGGGAGCGCACATAGCA




ATTGGAAACTGAAAGCTAATCAGACC




CTTTCTGGAAATCAGCCCACTGTTTA




TATACTTGAGGCCCCACCCTCGAGAT




AACCAGGGCTGAAAGAGGCCCGCCTG




GGGGCTGCAGACATGCTTGCTGCCTG




CCCTGGCGAAGGATTGGCAGGCTTGC




CCGTCACAGGACCCCCGCTGGCTGAC




TCAGGGGCGCAGGCCTCTTGCGGGGG




AGCTGGCCTCCCCGCCCCCACGGCCA




CGGGCCGCCCTTTCCTGGCAGGACAG




CGGGATCTTGCAGCTGTCAGGGGAGG




GGAGGCGGGGGCTGATGTCAGGAGGG




ATACAAATAGTGCCGACGGCTGGGGG




CCCTGTCTCCCCTCGCCGCATCCACT




CTCCGGCCGGccgcctgcccgccgcc




tcctccgtgcgcccgccagcctcgcc




cgcgccgtca





93
Human
gcatccaaccccgggccgcggccaac



Bag3 exon 1
ttctctggactggaccagaagtttct



non-coding
agccggccagttgctacctcccttta




tctcctccttcccctctggcagcgag




gaggctatttccagacacttccaccc




ctctctggccacgtcacccccgcctt




taattcataaaggtgcccggcgccgg




cttcccggacacgtcggcggcggaga




ggggcccacggcggcggcccggccag




agactcggcgcccggagccagcgccc




cgcacccgcgccccagcgggcagacc




ccaacccagc





94
Kozak
gccacc



sequence






95
Human
atgtcagccgcaactcactcccctat



Bag3 Exon 1
gatgcaggtggcctctggaaatggcg




accgagaccctctgccccccggatgg




gaaatcaagatcgacccccagaccgg




ctggcctttctttgtggaccataata




gccgaacaaccacctggaacgaccca




agagtgccatcagaaggacctaag





96
Human Bag3
gtgagccgggcccgcggcccgccctg



Intron 1
gtcggtggcgccacctcgacggcagg




cggcggggagtgggctgggccggggg




gacgcgaggcggcggggcccgggggt




cggcgaaggcccctcgcgggcggaca




ccggctccgcgccccgccacacactc




ccgctgcgcccggacgagtccccgct




ccggacccgcccttgacaggcgtcgg




ggcgaaaggaggccccgggattcggt




ggcccgggaagcgaccccgcagtggc




tccggtgccgtccacggctcgactcc




agggcggaaggccgggtgtccagcgc




tggcctcgcgctctagggctgggaga




ggggcggccggCCTGGTCAGCTCCGG




AGGCCCCGGCCCACCGTGGCCCCTGC




TGCCCGCTCGTGCTGTAATGTAGAGG




TTGGAGCTGACCCCTGCTCCTGGAGC




TCATCTTCTGATCCGGGTCTCTGAAA




ATGCGGGCATGGGCAGCTCTCCTACA




CTCACCGCTTCCCTCAGTCACCCAGA




AAAGAAACCTGTCTTACCAGTTTGGA




GAATTGGGACCTTTCCTTTGCTTAAC




AGATACTTTTGGCTTTCTCCTGATGC




CCCTAATTCCTAAACTGTTGGCCAAA




TAGCAACCTCTATGGGGTGGGGGGTT




TGGAGGGTACAGGGGCTGGGAGCTGG




CTGACGCTTTGAGGCCCAAGTCACTC




GGGAAGATCACAATGCCAAGCGCCAC




AGTGTTTCCTCTGCCAGGAGGGTTCA




CTTCCCAGTTTCTAACCAGCCTGTGT




TTCTCCACTTTTTATTTCAG





97
Human Bag3
GAAACACCATCCAGTGCCAACGGACC



exon 2 -
ATCCAGAGAAGGAAGTCGCCTGCCCC



exon 4
CTGCCCGAGAAGGACACCCAGTCTAT




CCACAGCTGCGGCCTGGCTATATTCC




TATTCCCGTGCTGCACGAGGGAGCAG




AGAACAGACAGGTGCACCCATTCCAC




GTGTACCCTCAGCCAGGCATGCAGAG




GTTTCGCACAGAGGCTGCCGCCGCCG




CCCCACAGCGGAGCCAGTCCCCACTG




AGAGGAATGCCTGAGACAACACAGCC




AGACAAGCAGTGTGGACAGGTGGCTG




CCGCCGCCGCCGCCCAGCCCCCTGCC




AGCCACGGCCCCGAGCGGAGCCAGAG




CCCTGCCGCCTCTGATTGTAGCTCCT




CTAGCTCCTCTGCCAGCCTGCCCAGC




TCCGGCAGGTCTAGCCTGGGCAGCCA




CCAGCTGCCTCGCGGCTATATCTCCA




TCCCAGTGATCCACGAGCAGAACGTG




ACAAGACCAGCAGCACAGCCCTCCTT




CCACCAGGCACAGAAGACCCACTACC




CAGCCCAGCAGGGCGAGTATCAGACA




CACCAGCCCGTGTACCACAAAATTCA




GGGCGACGATTGGGAGCCCCGGCCTC




TGAGAGCAGCCAGCCCTTTTCGGTCC




TCTGTGCAGGGAGCCAGCTCCAGGGA




GGGCTCCCCAGCCCGCTCTAGCACCC




CTCTGCACTCCCCATCTCCAATCAGA




GTGCACACAGTGGTGGACAGACCTCA




GCAGCCAATGACACACAGGGAGACAG




CACCCGTGAGCCAGCCAGAGAATAAG




CCCGAGTCTAAGCCTGGCCCAGTGGG




ACCCGAGCTGCCACCTGGACACATCC




CAATCCAGGTCATCCGCAAGGAGGTG




GATAGCAAGCCCGTGTCCCAGAAGCC




TCCACCCCCTAGCGAGAAGGTGGAGG




TGAAGGTGCCACCTGCACCCGTGCCT




TGTCCACCACCATCTCCAGGCCCCAG




CGCCGTGCCCTCCTCTCCTAAATCTG




TGGCAACAGAGGAGAGAGCAGCACCT




TCTACCGCACCAGCAGAGGCAACACC




ACCTAAGCCTGGAGAGGCAGAGGCAC




CACCTAAGCACCCTGGCGTGCTGAAG




GTGGAGGCCATCCTGGAGAAGGTGCA




GGGCCTGGAGCAGGCCGTGGACAACT




TCGAGGGCAAGAAGACCGATAAGAAG




TACCTGATGATCGAGGAGTATCTGAC




AAAGGAGCTGCTGGCCCTGGACTCTG




TGGACCCCGAGGGAAGGGCAGACGTG




CGCCAGGCCCGGAGAGATGGCGTGCG




CAAGGTGCAGACCATCCTGGAGAAGC




TGGAGCAGAAGGCCATCGACGTGCCT




GGCCAGGTGCAGGTGTACGAGCTGCA




GCCTTCCAATCTGGAGGCCGATCAGC




CACTGCAGGCAATCATGGAGATGGGA




GCAGTGGCAGCAGACAAGGGCAAGAA




GAACGCAGGAAATGCAGAGGACCCCC




ACACCGAGACACAGCAGCCTGAGGCC




ACAGCCGCAGCAACCAGTAATCCATC




CAGTATGACAGACACCCCAGGAAACC




CCGCAGCCCCA





98
Stop codons
TAATAATAG





99
Human Bag 3
CCTCTGCCCTGTAAAAATCAGACTCG



3′UTR
GAACCGATGTGTGCTTTAGGGAATTT




TAAGTTGCATGCATTTCAGAGACTTT




AAGTCAGTTGGTTTTTATTAGCTGCT




TGGTATGCAGTAACTTGGGTGGAGGC




AAAACACTAATAAAAGGGCTAAAAAG




GAAAATGATGCTTTTCTTCTATATTC




TTACTCTGTACCAATAAAGAAGTTGC




TTGTTGTTTGAGAAGTTTAACCCCGT




TGCTTGTTGTTCTGCAGCCCTGTCTA




CTTGGGCACCCCCACCACCTGTTAGC




TGTGGTTGTGCACTGTCTTTTGTAGC




TCTGGACTGGAGGGGTAGATGGGGAG




TCAATTACCCATCACATAAATATGAA




ACATTTATCAGAAATGTTGCCATTTT




AATGAGATGATTTTCTTCATCTCATA




ATTAAAATACCTGACTTTAGAGAGAG




TAAAATGTGCCAGGAGCCATAGGAAT




ATCTGTATGTTGGATGACTTTAATGC




TACATTTTAAAAAAAGAAAATAAAGT




AATAATATAACTCAAAA





100
bGH poly A
CTGTGCCTTCTAGTTGCCAGCCATCT



signal
GTTGTTTGCCCCTCCCCCGTGCCTTC




CTTGACCCTGGAAGGTGCCACTCCCA




CTGTCCTTTCCTAATAAAATGAGGAA




ATTGCATCGCATTGTCTGAGTAGGTG




TCATTCTATTCTGGGGGGTGGGGTGG




GGCAGGACAGCAAGGGGGAGGATTGG




GAAGACAATAGCAGGCATGCTGGGGA





101
Right ITR
AGGAACCCCTAGTGATGGAGTTGGCC




ACTCCCTCTCTGCGCGCTCGCTCGCT




CACTGAGGCCGGGCGACCAAAGGTCG




CCCGACGCCCGGGCTTTGCCCGGGCG




GCCTCAGTGAGCGAGCGAGCGCGCAG




AGAGGGAGTGGCCAA





102
pTR2-Des-
TTGGCCACTCCCTCTCTGCGCGCTCG



BAG3int1-
CTCGCTCACTGAGGCCGGGCGACCAA



dual
AGGTCGCCCGACGCCCGGGCTTTGCC



ITR-to-
CGGGCGGCCTCAGTGAGCGAGCGAGC



ITR DNA
GCGCAGAGAGGGAGTGGCCAACTCCA



sequence
TCACTAGGGGTTCCTTCTAGAGGCGC




GCCAAGCTTCCCTGCCCCCACAGCTC




CTCTCCTGTGCCTTGTTTCCCAGCCA




TGCGTTCTCCTCTATAAATACCCGCT




CTGGTATTTGGGGTTGGCAGCTGTTG




CTGCCAGGGAGATGGTTGGGTTGACA




TGCGGCTCCTGACAAAACACAAACCC




CTGGTGTGTGTGGGCGTGGGTGGTGT




GAGTAGGGGGATGAATCAGGGAGGGG




GCGGGGGACCCAGGGGGCAGGAGCCA




CACAAAGTCTGTGCGGGGGTGGGAGC




GCACATAGCAATTGGAAACTGAAAGC




TAATCAGACCCTTTCTGGAAATCAGC




CCACTGTTTATATACTTGAGGCCCCA




CCCTCGAGATAACCAGGGCTGAAAGA




GGCCCGCCTGGGGGCTGCAGACATGC




TTGCTGCCTGCCCTGGCGAAGGATTG




GCAGGCTTGCCCGTCACAGGACCCCC




GCTGGCTGACTCAGGGGCGCAGGCCT




CTTGCGGGGGAGCTGGCCTCCCCGCC




CCCACGGCCACGGGCCGCCCTTTCCT




GGCAGGACAGCGGGATCTTGCAGCTG




TCAGGGGAGGGGAGGCGGGGGCTGAT




GTCAGGAGGGATACAAATAGTGCCGA




CGGCTGGGGGCCCTGTCTCCCCTCGC




CGCATCCACTCTCCGGCCGGccgcct




gcccgccgcctcctccgtgcgcccgc




cagcctcgcccgcgccgtcagaattc




gctagccgcatccaaccccgggccgc




ggccaacttctctggactggaccaga




agtttctagccggccagttgctacct




ccctttatctcctccttcccctctgg




cagcgaggaggctatttccagacact




tccacccctctctggccacgtcaccc




ccgcctttaattcataaaggtgcccg




gcgccggcttcccggacacgtcggcg




gcggagaggggcccacggcggcggcc




cggccagagactcggcgcccggagcc




agcgccccgcacccgcgccccagcgg




gcagaccccaacccagcgccaccatg




tcagccgcaactcactcccctatgat




gcaggtggcctctggaaatggcgacc




gagaccctctgccccccggatgggaa




atcaagatcgacccccagaccggctg




gcctttctttgtggaccataatagcc




gaacaaccacctggaacgacccaaga




gtgccatcagaaggacctaaggtgag




ccgggcccgcggcccgccctggtcgg




tggcgccacctcgacggcaggcggcg




gggagtgggctgggccggggggacgc




gaggcggcggggcccgggggtcggcg




aaggcccctcgcgggcggacaccggc




tccgcgccccgccacacactcccgct




gcgcccggacgagtccccgctccgga




cccgcccttgacaggcgtcggggcga




aaggaggccccgggattcggtggccc




gggaagcgaccccgcagtggctccgg




tgccgtccacggctcgactccagggc




ggaaggccgggtgtccagcgctggcc




tcgcgctctagggctgggagaggggc




ggccggCCTGGTCAGCTCCGGAGGCC




CCGGCCCACCGTGGCCCCTGCTGCCC




GCTCGTGCTGTAATGTAGAGGTTGGA




GCTGACCCCTGCTCCTGGAGCTCATC




TTCTGATCCGGGTCTCTGAAAATGCG




GGCATGGGCAGCTCTCCTACACTCAC




CGCTTCCCTCAGTCACCCAGAAAAGA




AACCTGTCTTACCAGTTTGGAGAATT




GGGACCTTTCCTTTGCTTAACAGATA




CTTTTGGCTTTCTCCTGATGCCCCTA




ATTCCTAAACTGTTGGCCAAATAGCA




ACCTCTATGGGGTGGGGGGTTTGGAG




GGTACAGGGGCTGGGAGCTGGCTGAC




GCTTTGAGGCCCAAGTCACTCGGGAA




GATCACAATGCCAAGCGCCACAGTGT




TTCCTCTGCCAGGAGGGTTCACTTCC




CAGTTTCTAACCAGCCTGTGTTTCTC




CACTTTTTATTTCAGGAAACACCATC




CAGTGCCAACGGACCATCCAGAGAAG




GAAGTCGCCTGCCCCCTGCCCGAGAA




GGACACCCAGTCTATCCACAGCTGCG




GCCTGGCTATATTCCTATTCCCGTGC




TGCACGAGGGAGCAGAGAACAGACAG




GTGCACCCATTCCACGTGTACCCTCA




GCCAGGCATGCAGAGGTTTCGCACAG




AGGCTGCCGCCGCCGCCCCACAGCGG




AGCCAGTCCCCACTGAGAGGAATGCC




TGAGACAACACAGCCAGACAAGCAGT




GTGGACAGGTGGCTGCCGCCGCCGCC




GCCCAGCCCCCTGCCAGCCACGGCCC




CGAGCGGAGCCAGAGCCCTGCCGCCT




CTGATTGTAGCTCCTCTAGCTCCTCT




GCCAGCCTGCCCAGCTCCGGCAGGTC




TAGCCTGGGCAGCCACCAGCTGCCTC




GCGGCTATATCTCCATCCCAGTGATC




CACGAGCAGAACGTGACAAGACCAGC




AGCACAGCCCTCCTTCCACCAGGCAC




AGAAGACCCACTACCCAGCCCAGCAG




GGCGAGTATCAGACACACCAGCCCGT




GTACCACAAAATTCAGGGCGACGATT




GGGAGCCCCGGCCTCTGAGAGCAGCC




AGCCCTTTTCGGTCCTCTGTGCAGGG




AGCCAGCTCCAGGGAGGGCTCCCCAG




CCCGCTCTAGCACCCCTCTGCACTCC




CCATCTCCAATCAGAGTGCACACAGT




GGTGGACAGACCTCAGCAGCCAATGA




CACACAGGGAGACAGCACCCGTGAGC




CAGCCAGAGAATAAGCCCGAGTCTAA




GCCTGGCCCAGTGGGACCCGAGCTGC




CACCTGGACACATCCCAATCCAGGTC




ATCCGCAAGGAGGTGGATAGCAAGCC




CGTGTCCCAGAAGCCTCCACCCCCTA




GCGAGAAGGTGGAGGTGAAGGTGCCA




CCTGCACCCGTGCCTTGTCCACCACC




ATCTCCAGGCCCCAGCGCCGTGCCCT




CCTCTCCTAAATCTGTGGCAACAGAG




GAGAGAGCAGCACCTTCTACCGCACC




AGCAGAGGCAACACCACCTAAGCCTG




GAGAGGCAGAGGCACCACCTAAGCAC




CCTGGCGTGCTGAAGGTGGAGGCCAT




CCTGGAGAAGGTGCAGGGCCTGGAGC




AGGCCGTGGACAACTTCGAGGGCAAG




AAGACCGATAAGAAGTACCTGATGAT




CGAGGAGTATCTGACAAAGGAGCTGC




TGGCCCTGGACTCTGTGGACCCCGAG




GGAAGGGCAGACGTGCGCCAGGCCCG




GAGAGATGGCGTGCGCAAGGTGCAGA




CCATCCTGGAGAAGCTGGAGCAGAAG




GCCATCGACGTGCCTGGCCAGGTGCA




GGTGTACGAGCTGCAGCCTTCCAATC




TGGAGGCCGATCAGCCACTGCAGGCA




ATCATGGAGATGGGAGCAGTGGCAGC




AGACAAGGGCAAGAAGAACGCAGGAA




ATGCAGAGGACCCCCACACCGAGACA




CAGCAGCCTGAGGCCACAGCCGCAGC




AACCAGTAATCCATCCAGTATGACAG




ACACCCCAGGAAACCCCGCAGCCCCA




TAATAATAGCCTCTGCCCTGTAAAAA




TCAGACTCGGAACCGATGTGTGCTTT




AGGGAATTTTAAGTTGCATGCATTTC




AGAGACTTTAAGTCAGTTGGTTTTTA




TTAGCTGCTTGGTATGCAGTAACTTG




GGTGGAGGCAAAACACTAATAAAAGG




GCTAAAAAGGAAAATGATGCTTTTCT




TCTATATTCTTACTCTGTACCAATAA




AGAAGTTGCTTGTTGTTTGAGAAGTT




TAACCCCGTTGCTTGTTGTTCTGCAG




CCCTGTCTACTTGGGCACCCCCACCA




CCTGTTAGCTGTGGTTGTGCACTGTC




TTTTGTAGCTCTGGACTGGAGGGGTA




GATGGGGAGTCAATTACCCATCACAT




AAATATGAAACATTTATCAGAAATGT




TGCCATTTTAATGAGATGATTTTCTT




CATCTCATAATTAAAATACCTGACTT




TAGAGAGAGTAAAATGTGCCAGGAGC




CATAGGAATATCTGTATGTTGGATGA




CTTTAATGCTACATTTTAAAAAAAGA




AAATAAAGTAATAATATAACTCAAAA




GCGGCCGCCTCGAGCTGTGCCTTCTA




GTTGCCAGCCATCTGTTGTTTGCCCC




TCCCCCGTGCCTTCCTTGACCCTGGA




AGGTGCCACTCCCACTGTCCTTTCCT




AATAAAATGAGGAAATTGCATCGCAT




TGTCTGAGTAGGTGTCATTCTATTCT




GGGGGGTGGGGTGGGGCAGGACAGCA




AGGGGGAGGATTGGGAAGACAATAGC




AGGCATGCTGGGGAGTCGACGCGCCG




GCGTCTAGAAGGAACCCCTAGTGATG




GAGTTGGCCACTCCCTCTCTGCGCGC




TCGCTCGCTCACTGAGGCCGGGCGAC




CAAAGGTCGCCCGACGCCCGGGCTTT




GCCCGGGCGGCCTCAGTGAGCGAGCG




AGCGCGCAGAGAGGGAGTGGCCAA










Construct 6 (B827-pTR-CBA-Bag3-dual ITR-to-ITR sequence)















Elements



SEQ ID
(5′ -> 3′)
Nt sequence







103
Left ITR
ttggccactccctctctgegcgctcgctcgctcactgaggcc




gggcgaccaaaggtcgcccgacgcccgggctttgcccgggcg




gcctcagtgagcgagcgagcgcgcagagagggagtggccaac




tccatcactaggggttcct





104
CBA
cgttacataacttacggtaaatggcccgcctggctgaccgcc



promoter
caacgacccccgcccattgacgtcaataatgacgtatgttcc




catagtaacgccaatagggactttccattgacgtcaatgggt




ggactatttacggtaaactgcccacttggcagtacatcaagt




gtatcatatgccaagtacgccccctattgacgtcaatgacgg




taaatggcccgcctggcattatgcccagtacatgaccttatg




ggactttcctacttggcagtacatctacgtattagtcatcgc




tattaccatggtcgaggtgagccccacgttctgcttcactct




ccccatctcccccccctccccacccccaattttgtatttatt




tattttttaattattttgtgcagcgatgggggcggggggggg




gggggggcgcgcgccaggcggggcggggggggcgaggggcgg




ggcggggcgaggcggagaggtgcggcggcagccaatcagagc




ggcgcgctccgaaagtttccttttatggcgaggcggcggcgg




cggcggccctataaaaagcgaagcgcgcggcgggcgggagtc




gctgcgacgctgccttcgccccgtgccccgctccgccgccgc




ctcgcgccgcccgccccggctctgactgaccgcgttactccc




acaggtgagcggggggacggcccttctcctccgggctgtaat




tagcgcttggtttaatgacggcttgtttcttttctgtggctg




cgtgaaagccttgaggggctccgggagggccctttgtgcggg




ggggagcggctcggggggtgcgtgcgtgtgtgtgtgcgtggg




gagcgccgcgtgcggcccgcgctgcccggcggctgtgagcgc




tgcgggcgcggcgcggggctttgtgcgctccgcagtgtgcgc




gaggggagcgcggccgggggcggtgccccgcggtgcgggggg




ggctgcgaggggaacaaaggctgcgtgcggggtgtgtgcgtg




ggggggtgagcagggggtgtgggcgcggcggtcgggctgtaa




cccccccctgcacccccctccccgagttgctgagcacggccc




ggcttcgggtgcggggctccgtacggggcgtggcgcggggct




cgccgtgccgggcggggggggcggcaggtgggggtgccgggc




ggggcggggccgcctcgggccggggagggctcgggggagggg




cgcggcggcccccggagcgccggcggctgtcgaggcgcggcg




agccgcagccattgccttttatggtaatcgtgcgagagggcg




cagggacttcctttgtcccaaatctgtgcggagccgaaatct




gggaggcgccgccgcaccccctctagcgggcgcggggcgaag




cggtgcggcgccggcaggaaggaaatgggcggggagggcctt




cgtgcgtcgccgcgccgccgtccccttctccctctccagcct




cggggctgtccgcggggggacggctgccttcgggggggacgg




ggcagggcggggttcggcttctggcgtgtgaccggcggctct




agagcctctgctaaccatgttcatgccttcttctttttccta




cag





105
Human Bag3
GCATCCAACCCCGGGCCGCGGCCAACTTCTCTGGACTGGACC



exon 1
AGAAGTTTCTAGCCGGCCAGTTGCTACCTCCCTTTATCTCCT



non-coding
CCTTCCCCTCTGGCAGCGAGGAGGCTATTTCCAGACACTTCC




ACCCCTCTCTGGCCACGTCACCCCCGCCTTTAATTCATAAAG




GTGCCCGGCGCCGGCTTCCCGGACACGTCGGCGGCGGAGAGG




GGCCCACGGCGGCGGCCCGGCCAGAGACTCGGCGCCCGGAGC




CAGCGCCCCGCACCCGCGCCCCAGCGGGCAGACCCCAACCCA




GC





106
Kozak
GCCACC



sequence






107
Human Bag3
ATGTCTGCTGCCACACACTCTCCAATGATGCAGGTTGCCTCT



exon 1 -
GGCAATGGGGACAGAGATCCTCTGCCTCCTGGCTGGGAGATC



exon 4
AAGATTGATCCTCAGACAGGCTGGCCCTTCTTTGTGGACCAC




AACAGCAGAACCACCACCTGGAATGACCCCAGAGTGCCCTCT




GAGGGCCCTAAAGAAACCCCTAGCTCTGCCAATGGACCCAGC




AGAGAGGGATCTAGACTGCCTCCAGCTAGAGAAGGACACCCT




GTGTATCCTCAGCTCAGGCCTGGCTACATCCCCATTCCAGTT




CTGCATGAAGGGGCTGAGAACAGACAGGTGCACCCCTTCCAT




GTGTACCCACAGCCTGGCATGCAGAGATTCAGAACAGAGGCT




GCTGCTGCAGCCCCTCAGAGATCTCAATCTCCTCTGAGAGGC




ATGCCAGAGACAACACAGCCTGACAAGCAGTGTGGACAGGTG




GCAGCAGCAGCTGCAGCTCAACCTCCTGCTTCTCATGGCCCT




GAGAGAAGCCAGTCTCCTGCTGCCTCTGATTGCAGCAGTTCC




AGCAGCTCTGCCTCTCTGCCATCTTCTGGCAGAAGCAGCCTG




GGATCTCATCAGCTGCCTAGAGGCTACATCAGCATCCCTGTG




ATCCATGAGCAGAATGTGACCAGACCAGCTGCTCAGCCTAGC




TTCCATCAGGCCCAGAAAACACACTACCCTGCTCAGCAAGGG




GAGTACCAGACACACCAGCCTGTGTACCACAAGATCCAAGGG




GATGACTGGGAGCCCAGACCACTGAGAGCTGCCTCTCCATTC




AGATCATCTGTGCAAGGGGCCAGCTCTAGAGAGGGCTCTCCT




GCCAGATCTAGCACACCTCTGCACAGCCCATCTCCAATCAGA




GTGCACACAGTGGTGGACAGACCCCAGCAGCCTATGACACAC




AGAGAAACAGCCCCTGTGTCTCAGCCTGAGAACAAGCCTGAG




TCCAAACCTGGACCTGTGGGCCCTGAACTGCCACCAGGACAC




ATCCCTATCCAAGTGATCAGAAAAGAGGTGGACAGCAAGCCA




GTGTCTCAGAAGCCTCCTCCACCATCTGAGAAAGTGGAAGTG




AAGGTGCCACCAGCTCCAGTGCCTTGTCCTCCTCCATCTCCT




GGACCATCTGCTGTGCCTAGCTCTCCTAAGTCTGTGGCCACT




GAGGAAAGGGCTGCCCCTTCTACAGCTCCTGCTGAGGCTACA




CCTCCTAAGCCTGGGGAAGCTGAAGCCCCTCCTAAACACCCT




GGGGTGCTGAAGGTGGAAGCCATCCTGGAAAAGGTGCAGGGC




CTTGAGCAGGCAGTGGACAACTTTGAGGGCAAGAAAACAGAC




AAGAAATACCTGATGATTGAGGAATACCTGACCAAAGAACTG




CTGGCCCTGGATTCTGTGGACCCTGAGGGCAGAGCAGATGTT




AGACAGGCTAGAAGAGATGGTGTTAGGAAGGTGCAGACCATC




CTTGAGAAGCTGGAACAGAAAGCCATTGATGTGCCTGGACAG




GTCCAAGTGTATGAACTGCAGCCCAGCAACCTGGAAGCTGAC




CAGCCTCTGCAGGCCATCATGGAAATGGGAGCTGTGGCTGCT




GACAAGGGAAAGAAAAATGCTGGCAATGCTGAGGACCCTCAC




ACTGAGACTCAGCAGCCTGAAGCCACAGCAGCTGCCACAAGC




AACCCTAGCAGCATGACAGACACCCCTGGCAACCCTGCTGCT




CCT





108
Stop codons
TAATGATAG





109
Human Bag 3
CCTCTGCCCTGTAAAAATCAGACTCGGAACCGATGTGTGCTT



3′UTR
TAGGGAATTTTAAGTTGCATGCATTTCAGAGACTTTAAGTCA




GTTGGTTTTTATTAGCTGCTTGGTATGCAGTAACTTGGGTGG




AGGCAAAACACTAATAAAAGGGCTAAAAAGGAAAATGATGCT




TTTCTTCTATATTCTTACTCTGTACCAATAAAGAAGTTGCTT




GTTGTTTGAGAAGTTTAACCCCGTTGCTTGTTGTTCTGCAGC




CCTGTCTACTTGGGCACCCCCACCACCTGTTAGCTGTGGTTG




TGCACTGTCTTTTGTAGCTCTGGACTGGAGGGGTAGATGGGG




AGTCAATTACCCATCACATAAATATGAAACATTTATCAGAAA




TGTTGCCATTTTAATGAGATGATTTTCTTCATCTCATAATTA




AAATACCTGACTTTAGAGAGAGTAAAATGTGCCAGGAGCCAT




AGGAATATCTGTATGTTGGATGACTTTAATGCTACATTTTAA




AAAAAGAAAATAAAGTAATAATATAACTCAAAA





110
Poly A
aataaaagatccttattttcattggatctgtgtgttggtttt




ttgtgtg





111
Right ITR
aggaacccctagtgatggagttggccactccctctctgcgcg




ctcgctcgctcactgaggccgggcgaccaaaggtcgcccgac




gcccgggctttgcccgggcggcctcagtgagcgagcgagcgc




gcagagagggagtggccaa





112
B827-pTR-
ttggccactccctctctgcgcgctcgctcgctcactgaggcc



CBA-Bag3-
gggcgaccaaaggtcgcccgacgcccgggctttgcccgggcg



dual
gcctcagtgagcgagcgagcgcgcagagagggagtggccaac



ITR-to-
tccatcactaggggttccttctagaggcgcgccaagcttggt



ITR
accctagttattaatagtaatcaattacggggtcattagttc



sequence
atagcccatatatggagttccgcgttacataacttacggtaa




atggcccgcctggctgaccgcccaacgacccccgcccattga




cgtcaataatgacgtatgttcccatagtaacgccaataggga




ctttccattgacgtcaatgggggactatttacggtaaactgc




ccacttggcagtacatcaagtgtatcatatgccaagtacgcc




ccctattgacgtcaatgacggtaaatggcccgcctggcatta




tgcccagtacatgaccttatgggactttcctacttggcagta




catctacgtattagtcatcgctattaccatggtcgaggtgag




ccccacgttctgcttcactctccccatctcccccccctcccc




acccccaattttgtatttatttattttttaattattttgtgc




agcgatgggggcggggggggggggggggcgcgcgccaggcgg




ggcggggggggcgaggggcggggcggggcgaggcggagaggt




gcggcggcagccaatcagagcggcgcgctccgaaagtttcct




tttatggcgaggcggcggcggcggcggccctataaaaagcga




agcgcgcggcgggcgggagtcgctgcgacgctgccttcgccc




cgtgccccgctccgccgccgcctcgcgccgcccgccccggct




ctgactgaccgcgttactcccacaggtgagcgggcgggacgg




cccttctcctccgggctgtaattagcgcttggtttaatgacg




gcttgtttcttttctgtggctgcgtgaaagccttgaggggct




ccgggagggccctttgtgcgggggggagcggctcggggggtg




cgtgcgtgtgtgtgtgcgtggggagcgccgcgtgcggcccgc




gctgcccggcggctgtgagcgctgcgggcgcggcgcggggct




ttgtgcgctccgcagtgtgcgcgaggggagcgcggccggggg




cggtgccccgcggtgcggggggggctgcgaggggaacaaagg




ctgcgtgcggggtgtgtgcgtgggggggtgagcagggggtgt




gggcgcggcggtcgggctgtaacccccccctgcacccccctc




cccgagttgctgagcacggcccggcttcgggtgcggggctcc




gtacggggcgtggcgcggggctcgccgtgccgggcggggggt




ggcggcaggtgggggtgccgggcggggcggggccgcctcggg




ccggggagggctcgggggaggggcgcggcggcccccggagcg




ccggcggctgtcgaggcgcggcgagccgcagccattgccttt




tatggtaatcgtgcgagagggcgcagggacttcctttgtccc




aaatctgtgcggagccgaaatctgggaggcgccgccgcaccc




cctctagcgggcgcggggcgaagcggtgcggcgccggcagga




aggaaatgggcggggagggccttcgtgcgtcgccgcgccgcc




gtccccttctccctctccagcctcggggctgtccgcgggggg




acggctgccttcgggggggacggggcagggcggggttcggct




tctggcgtgtgaccggcggctctagagcctctgctaaccatg




ttcatgccttcttctttttcctacaggCTAGCGTTTAAACTT




AAGCTTGCATCCAACCCCGGGCCGCGGCCAACTTCTCTGGAC




TGGACCAGAAGTTTCTAGCCGGCCAGTTGCTACCTCCCTTTA




TCTCCTCCTTCCCCTCTGGCAGCGAGGAGGCTATTTCCAGAC




ACTTCCACCCCTCTCTGGCCACGTCACCCCCGCCTTTAATTC




ATAAAGGTGCCCGGCGCCGGCTTCCCGGACACGTCGGCGGCG




GAGAGGGGCCCACGGCGGCGGCCCGGCCAGAGACTCGGCGCC




CGGAGCCAGCGCCCCGCACCCGCGCCCCAGCGGGCAGACCCC




AACCCAGCGCCACCATGTCTGCTGCCACACACTCTCCAATGA




TGCAGGTTGCCTCTGGCAATGGGGACAGAGATCCTCTGCCTC




CTGGCTGGGAGATCAAGATTGATCCTCAGACAGGCTGGCCCT




TCTTTGTGGACCACAACAGCAGAACCACCACCTGGAATGACC




CCAGAGTGCCCTCTGAGGGCCCTAAAGAAACCCCTAGCTCTG




CCAATGGACCCAGCAGAGAGGGATCTAGACTGCCTCCAGCTA




GAGAAGGACACCCTGTGTATCCTCAGCTCAGGCCTGGCTACA




TCCCCATTCCAGTTCTGCATGAAGGGGCTGAGAACAGACAGG




TGCACCCCTTCCATGTGTACCCACAGCCTGGCATGCAGAGAT




TCAGAACAGAGGCTGCTGCTGCAGCCCCTCAGAGATCTCAAT




CTCCTCTGAGAGGCATGCCAGAGACAACACAGCCTGACAAGC




AGTGTGGACAGGTGGCAGCAGCAGCTGCAGCTCAACCTCCTG




CTTCTCATGGCCCTGAGAGAAGCCAGTCTCCTGCTGCCTCTG




ATTGCAGCAGTTCCAGCAGCTCTGCCTCTCTGCCATCTTCTG




GCAGAAGCAGCCTGGGATCTCATCAGCTGCCTAGAGGCTACA




TCAGCATCCCTGTGATCCATGAGCAGAATGTGACCAGACCAG




CTGCTCAGCCTAGCTTCCATCAGGCCCAGAAAACACACTACC




CTGCTCAGCAAGGGGAGTACCAGACACACCAGCCTGTGTACC




ACAAGATCCAAGGGGATGACTGGGAGCCCAGACCACTGAGAG




CTGCCTCTCCATTCAGATCATCTGTGCAAGGGGCCAGCTCTA




GAGAGGGCTCTCCTGCCAGATCTAGCACACCTCTGCACAGCC




CATCTCCAATCAGAGTGCACACAGTGGTGGACAGACCCCAGC




AGCCTATGACACACAGAGAAACAGCCCCTGTGTCTCAGCCTG




AGAACAAGCCTGAGTCCAAACCTGGACCTGTGGGCCCTGAAC




TGCCACCAGGACACATCCCTATCCAAGTGATCAGAAAAGAGG




TGGACAGCAAGCCAGTGTCTCAGAAGCCTCCTCCACCATCTG




AGAAAGTGGAAGTGAAGGTGCCACCAGCTCCAGTGCCTTGTC




CTCCTCCATCTCCTGGACCATCTGCTGTGCCTAGCTCTCCTA




AGTCTGTGGCCACTGAGGAAAGGGCTGCCCCTTCTACAGCTC




CTGCTGAGGCTACACCTCCTAAGCCTGGGGAAGCTGAAGCCC




CTCCTAAACACCCTGGGGTGCTGAAGGTGGAAGCCATCCTGG




AAAAGGTGCAGGGCCTTGAGCAGGCAGTGGACAACTTTGAGG




GCAAGAAAACAGACAAGAAATACCTGATGATTGAGGAATACC




TGACCAAAGAACTGCTGGCCCTGGATTCTGTGGACCCTGAGG




GCAGAGCAGATGTTAGACAGGCTAGAAGAGATGGTGTTAGGA




AGGTGCAGACCATCCTTGAGAAGCTGGAACAGAAAGCCATTG




ATGTGCCTGGACAGGTCCAAGTGTATGAACTGCAGCCCAGCA




ACCTGGAAGCTGACCAGCCTCTGCAGGCCATCATGGAAATGG




GAGCTGTGGCTGCTGACAAGGGAAAGAAAAATGCTGGCAATG




CTGAGGACCCTCACACTGAGACTCAGCAGCCTGAAGCCACAG




CAGCTGCCACAAGCAACCCTAGCAGCATGACAGACACCCCTG




GCAACCCTGCTGCTCCTTAATGATAGCCTCTGCCCTGTAAAA




ATCAGACTCGGAACCGATGTGTGCTTTAGGGAATTTTAAGTT




GCATGCATTTCAGAGACTTTAAGTCAGTTGGTTTTTATTAGC




TGCTTGGTATGCAGTAACTTGGGTGGAGGCAAAACACTAATA




AAAGGGCTAAAAAGGAAAATGATGCTTTTCTTCTATATTCTT




ACTCTGTACCAATAAAGAAGTTGCTTGTTGTTTGAGAAGTTT




AACCCCGTTGCTTGTTGTTCTGCAGCCCTGTCTACTTGGGCA




CCCCCACCACCTGTTAGCTGTGGTTGTGCACTGTCTTTTGTA




GCTCTGGACTGGAGGGGTAGATGGGGAGTCAATTACCCATCA




CATAAATATGAAACATTTATCAGAAATGTTGCCATTTTAATG




AGATGATTTTCTTCATCTCATAATTAAAATACCTGACTTTAG




AGAGAGTAAAATGTGCCAGGAGCCATAGGAATATCTGTATGT




TGGATGACTTTAATGCTACATTTTAAAAAAAGAAAATAAAGT




AATAATATAACTCAAAAGCggccgcgagctccggccgaataa




aagatccttattttcattggatctgtgtgttggttttttgtg




tggcatgcgtcgacgcgccggcgtctagaaggaacccctagt




gatggagttggccactccctctctgcgcgctcgctcgctcac




tgaggccgggcgaccaaaggtcgcccgacgcccgggctttgc




ccgggcggcctcagtgagcgagcgagcgcgcagagagggagt




ggccaa









The Transgene

A transgene may be employed to correct, reduce, eliminate, or otherwise ameliorate gene deficiencies, which may include deficiencies in which normal genes are expressed at less than normal levels, are expressed at normal or near-normal levels but having a gene product with abnormal activity, or deficiencies in which the functional gene product is not expressed. In several embodiments, the transgene sequence encodes a therapeutic protein or polypeptide which is to be expressed in a host cell. Embodiments of the present disclosure also include using multiple transgenes.


BAG3 (BCL-2-associated athanogene 3) has been implicated in selected macroautophagy (aggrephagy), wherein aggregated proteins are degraded. Under stress conditions and during normal cellular aging, BAG3 acts with other molecular chaperones HSP70 and HSPB8, along with ubiquitin receptor p62/SQSTM1 to target aggregated proteins for autophagic degradation. Loss of function of BAG3 can disrupt cellular clearing of protein aggregates which may lead to physiological complications and dysfunction. BAG3 mediated clearance is involved in many cellular processes which require the clearance of aggregate or aggregate prone proteins, and may be associated with age-related neurodegenerative disorders, like Alzheimer's disease (marked by tau-protein), Huntington's disease (involving mutated huntingtin/polyQ proteins), and amyotrophic lateral sclerosis (mutated SOD1). Additionally, BAG3 has been shown to play a role in a variety of other disease states, including cancer and myopathies. BAG3 mutations in cardiomyopathy may significantly increase burdens associated with heart disease and increase severe cardiac events.


Regulatory Elements

In some embodiments, the rAAV vector comprises one or more regions comprising a sequence that facilitates expression of the heterologous nucleic acid, e.g., expression regulatory sequences operatively linked to the heterologous nucleic acid. A promoter drives transcription of the nucleic acid sequence that it regulates, thus, it is typically located at or near the transcriptional start site of a gene. A promoter may have, for example, a length of 100 to 1000 nucleotides. In some embodiments, a promoter is operably linked to a nucleic acid, or a sequence of a nucleic acid (nucleotide sequence). A promoter is considered to be “operably linked” to a sequence of nucleic acid that it regulates when the promoter is in a correct functional location and orientation relative to the sequence such that the promoter regulates (e.g., to control (“drive”) transcriptional initiation and/or expression of) that sequence. Numerous such sequences are known in the art.


Promoters that may be used in accordance with the present disclosure may comprise any promoter that can drive the expression of the transgenes in the heart of the subject. In some embodiments, the promoter may be a tissue-specific promoter. A “tissue-specific promoter”, as used herein, refers to promoters that can only function in a specific type of tissue, e.g., the heart. Thus, a “tissue-specific promoter” is not able to drive the expression of the transgenes in other types of tissues. In some embodiments, the promoter that may be used in accordance with the present disclosure is a cardiac-restricted promoter. Non-limiting examples of Tissue-specific promoters and/or regulatory elements that may be used include (1) desmin, creatine kinase, myogenin, alpha myosin heavy chain, and natriuretic peptide, specific for muscle cells, and (2) albumin, alpha-1-antitrypsin, hepatitis B virus core protein promoters, specific for liver cells. Non-limiting examples of cardiac-restricted promoter selected from cardiac troponin C, cardiac troponin I, and cardiac troponin T (cTnT). In treating cardiomyopathies as provided for herein, cardiac-restricted promoters are advantageous at least due to the reduced possibility of off-target expression of the transgene(s), thereby effectively increasing the delivered dose to the heart and enhancing therapy. Non-limiting examples of expression regulatory sequences include promoters, insulators, silencers, response elements, introns, enhancers, initiation sites, termination signals, and poly (A) tails. Any combination of such regulatory sequences is contemplated herein (e.g., a promoter and an enhancer).


Alternatively, the promoter may be, without limitation, a promoter from one of the following genes: α-myosin heavy chain gene, 6-myosin heavy chain gene, myosin light chain 2v (MLC-2v) gene, myosin light chain 2a gene, CARP gene, cardiac α-actin gene, cardiac m2 muscarinic acetylcholine gene, atrial natriuretic factor gene (ANF), cardiac sarcoplasmic reticulum Ca-ATPase gene, skeletal α-actin gene; or an artificial cardiac promoter derived from MLC-2v gene.


To achieve appropriate expression levels of the nucleic acid, protein, or polypeptide of interest, any of a number of promoters suitable for use in the selected host cell may be employed. The promoter may be, for example, a constitutive promoter, tissue-specific promoter, inducible promoter, or a synthetic promoter. For example, constitutive promoters of different strengths can be used. An rAAV vector described herein may include one or more constitutive promoters, such as viral promoters or promoters from mammalian genes that are generally active in promoting transcription. Non-limiting examples of constitutive viral promoters include the Herpes Simplex virus (HSV), thymidine kinase (TK), Rous Sarcoma Virus (RSV), Simian Virus 40 (SV40), Mouse Mammary Tumor Virus (MMTV), Ad E1A and cytomegalovirus (CMV) promoters. Non-limiting examples of non-viral constitutive promoters include various housekeeping gene promoters, as exemplified by the β-actin promoter, including the chicken β-actin promoter (CBA).


Inducible promoters and/or regulatory elements may also be contemplated for achieving appropriate expression levels of the protein or polypeptide of interest. Non-limiting examples of suitable inducible promoters include those from genes such as cytochrome P450 genes, heat shock protein genes, metallothionein genes, and hormone-inducible genes, such as the estrogen gene promoter. Another example of an inducible promoter is the tetVP16 promoter that is responsive to tetracycline.


Synthetic promoters are also contemplated herein. A synthetic promoter may comprise, for example, regions of known promoters, regulatory elements, transcription factor binding sites, enhancer elements, repressor elements, and the like.


Enhancer elements can function in combination with other regulatory elements to increase the expression of a transgene. In several embodiments, the enhancer elements are upstream (positioned 5′) of the transgene. Non-limiting embodiments of enhancer elements include nucleotide sequences comprising, for example, a 100 base pair element from Simian virus 40 (SV40 late 2XUSE), a 35 base pair element from Human Immunodeficiency Virus 1 (HIV-1 USE), a 39 base pair element from ground squirrel hepatitis virus (GHV USE), a 21 base pair element from adenovirus (Adenovirus L3 USE), a 21 base pair element from human prothrombin (hTHGB USE), a 53 base pair element from human C2 complement gene (hC2 USE), truncations of any of the foregoing, and combinations of the foregoing. In some embodiments the enhancer is derived from the α-myosin heavy chain (αMHC) gene. In some embodiments the αMHC enhancer comprises a nucleic acid sequence having at least about 85%, about 90%, about 95%, about 96%, about 97%, about 98%, about 99%, or 100% sequence identity to:









SEQ ID NO: 55


CCTTCAGATTAAAAATAACTAAGGTAAGGGCCATGTGGGTAGGGGAGGTG





GTGTGAGACGGTCCTGTCTCTCCTCTATCTGCCCATCGGCCCTTTGGGGA





GGAGGAATGTGCCCAAGGACTAAAAAAAGGCCCTGGAGCCAGAGGGGCGA





GGGCAGCAGACCTTTCATGGGCAAACCTCAGGGCTGCTGTC.









Non-limiting polyadenylation signals include nucleotide sequences comprising, for example, a 624 base pair polyadenylation signal from human growth hormone (hGH), a 135 base pair polyadenylation signal from simian virus 40 (sV40 late), a 49 base pair synthetic polyadenylation signal from rabbit beta-globin (SPA), a 250 base pair polyadenylation signal from bovine growth hormone (bGH), truncations of any of the foregoing, and combinations of the foregoing.


In some embodiments of the disclosed rAAV vectors, the two or more transgenes are operably controlled by a single promoter. In some embodiments, each of the two or more transgenes are operably controlled by a distinct promoter.


In some embodiments, the rAAV vectors of the present disclosure further comprise an Internal Ribosome Entry Site (IRES). An IRES is a nucleotide sequence that allows for translation initiation in the middle of a messenger RNA (mRNA) sequence as part of the greater process of protein synthesis. Usually, in eukaryotes, translation can be initiated only at the 5′ end of the mRNA molecule, since 5′ cap recognition is required for the assembly of the initiation complex. In some embodiments, the IRES is located between the transgenes.


In such embodiments, the proteins encoded by different transgenes are translated individually (i.e., versus translated as a fusion protein).


In some embodiments, the rAAV vectors of the present disclosure comprise at least, in order from 5′ to 3′, a first adeno-associated virus (AAV) inverted terminal repeat (ITR) sequence, a promoter operably linked to a first transgene, an IRES operably linked to a second transgene, a polyadenylation signal, and a second AAV inverted terminal repeat (ITR) sequence.


In some embodiments, the rAAV vectors of the present disclosure further comprise a polyadenylation (pA) signal


Expression Cassette

The expression cassette is composed of, at a minimum, a transgene and its regulatory sequences. Where the cassette is designed to be expressed from a rAAV, the expression cassette further contains 5′ and 3′ AAV ITRs. These ITR's may be full-length, or one or both of the ITRs may be truncated. In one embodiment, the rAAV is pseudotyed, i.e., the AAV capsid is from a different source AAV than that the AAV which provides the ITRs. In one embodiment, the ITRs of AAV serotype 2 are used. In additional embodiments, the ITRs of AAV serotype 1 are used. However, ITRs from other suitable sources may be selected.



FIG. 1 depicts an embodiment of a construct described herein. At the 5′ end, an AAV ITR and CMV enhancer is positioned upstream from a chimeric intron, chicken β-actin promoter. Following the promoter, the BAG3 transgene, consisting of exons 1-4 is depicted. The construct further includes a polyadenylated site, SV40 following the BAG3 transgene, as well as additional regulatory sequences M13 ori, NeoR/KanR, and ori. Within the structural sequences described in the aforementioned construct, at least one or a plurality of spacer sequences may be inserted at any point within the construct. Additionally, any number of promoter or regulatory sequences may comprise a construct to alter or change the expression of BAG3.



FIG. 2 depicts an embodiment of a construct described herein. At the 5′ end, an AAV ITR is positioned upstream from a CK8 promoter. Following the promoter, the BAG3 transgene consisting of exons 1-4, interspersed with non-coding elements and introns, is depicted. The construct further includes an untranslated region for exon 4, as well as an AAV ITR. Within the structural sequences described in the aforementioned construct, at least one or a plurality of spacer sequences may be inserted at any point within the construct. Additionally, any number of promoter or regulatory sequences may comprise a construct to alter or change the expression of BAG3. In several embodiments, one or more of the depicted spacer sequences can be removed.



FIG. 3 depicts an embodiment of a construct described herein. At the 5′ end, an AAV ITR is positioned upstream from a mDes promoter. Following the promoter, the BAG3 transgene consisting of exons 1-4, interspersed with non-coding elements and introns, is depicted. The construct further includes an untranslated region for exon 4, as well as an AAV ITR. Within the structural sequences described in the aforementioned construct, at least one or a plurality of spacer sequences may be inserted at any point within the construct. Additionally, any number of promoter or regulatory sequences may comprise a construct to alter or change the expression of BAG3. In several embodiments, one or more of the depicted spacer sequences can be removed.


The Vector

Further provided herein are rAAV viral particles or rAAV preparations containing such particles. In several embodiments, rAAV particles comprise a viral capsid and one or more transgenes as described herein, which is encapsidated by the viral capsid. Methods of producing rAAV particles are known in the art and are commercially available (see, e.g., Zolotukhin el al. Production and purification of serotype 1, 2, and 5 recombinant adeno-associated viral vectors. Methods 28 (2002) 158-167; and U.S. Patent Application Publication Numbers US 2007/0015238 and US 2012/0322861, which are incorporated herein by reference; and plasmids and kits available from ATCC and Cell Biolabs, Inc.). For example, a plasmid containing the rAAV vector may be combined with one or more helper plasmids, e.g., that contain a rep gene (e.g., encoding Rep78, Rep68, Rep52 and Rep40) and a cap gene (encoding VP1, VP2, and VP3, including a modified VP3 region as described herein), and transfected into a producer cell line such that the rAAV particle can be packaged and subsequently purified.


The rAAV particles or particles within an rAAV preparation disclosed herein, may be of any AAV serotype, including any derivative or pseudotype (e.g., 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 2/1, 2/5, 2/8, 2/9, 3/1, 3/5, 3/8, or 3/9). As used herein, the serotype of an rAAV an rAAV particle refers to the serotype of the capsid proteins of the recombinant virus. In some embodiments, the rAAV particle is rAAV6 or rAAV9. In some embodiments, the rAAV particle is AAVrh.74. In a preferred embodiment, the rAAV particle is AAVrh74. In an additional preferred embodiment, the rAAV is AAV9. In several embodiments, an rh74 AAV is mutated to advantageously enhance delivery to cardiac tissue, for example by a tryptophan to arginine mutation at amino acid 505 of VP1 capsid, or other mutations, as described in PCT Publication WO 2019/1784412, which is incorporated in its entirety by reference herein. Non-limiting examples of derivatives, pseudotypes, and/or other vector types include, but are not limited to, AAVrh.10. AAVrh.74, AAV2/1, AAV2/5, AAV2/6, AAV2/8, AAV2/9, AAV2-AAV3 hybrid, AAVhu.14, AAV3a/3b, AAVrh32.33, AAV-HSC15, AAV-HSC17, AAVhu.37, AAVrh.8, CHt-P6, AAV2.5, AAV6.2, AAV218, AAV-HSC15/17, AAVM41, AAV9.45, AAV6 (Y445F/Y731F), AAV2.5T, AAV-HAE1/2, AAV clone 32/83, AAVShHIO, AAV2 (Y->F), AAV8 (Y733F), AAV2.15, AAV2.4, AAVM41, and AA Vr3.45.


Such AAV serotypes and derivatives/pseudotypes, and methods of producing such derivatives/pseudotypes are known in the art (see, e.g., Mol Ther. 2012 April; 20 (4): 699-708. doi: 10.1038/mt.2011.287. Epub 2012 Jan. 24. The AAV vector toolkit: poised at the clinical crossroads. Asokan A1, Schaffer DV, Samulski RJ.). In particular embodiments, the capsid of any of the herein disclosed rAAV particles is of the AAVrh.10 serotype. In a preferred embodiment, the capsid of the rAAV particle is AAVrh10 serotype. In some embodiments, the capsid is of the AAV2/6 serotype. In some embodiments, the rAAV particle is a pseudotyped rAAV particle, which comprises (a) an rAAV vector comprising ITRs from one serotype (e.g., AAV2, AAV3) and (b) a capsid comprised of capsid proteins derived from another serotype (e.g., AAV1, AAV2, AAV3, AAV4, AAV5, AAV6, AAV7, AAV8, AAV9, or AAV10). Methods for producing and using pseudotyped rAAV vectors are known in the art (see, e.g., Duan et al, J. Virol., 75:7662-7671, 2001; Halbert et al, J. Virol., 74:1524-1532, 2000; Zolotukhin et al, Methods, 28:158-167, 2002; and Auricchio et al., Hum. Molec. Genet., 10:3075-3081, 2001). rAAV Gene Therapy for Heart Diseases


In some embodiments, the rAAV vectors of the present disclosure further comprise a polyadenylation (pA) signal. For example, in preferred embodiments the pA signal comprises the following sequence: 17


In some embodiments, the rAAV vectors of the present disclosure comprise at least, in order from 5′ to 3′, a first adeno-associated vims (AAV) inverted terminal repeat (ITR) sequence, a promoter operably linked to a transgene, a polyadenylation signal, and a second AAV inverted terminal repeat (ITR) sequence.


In some embodiments, the rAAV vector genome is circular. In some embodiments, the rAAV vector genome is linear. In some embodiments, the rAAV vector genome is single-stranded. In some embodiments, the rAAV vector genome is double-stranded. In some embodiments, the rAAV genome vector is a self-complementary rAAV vector. In preferred embodiments, the rAAV vector genome is single stranded. In preferred embodiments, the rAAV vector genome is self complementary.


Described herein are non-limiting examples of rAAV vectors. The vectors illustrated below comprise the linearized plasmid sequences set forth as SEQ ID NOs: 1-20, 30-41, 42-53, 79-89, 91-101, or 103-111. The vectors of the disclosure may comprise nucleotide sequences that have at least 70% identity, at least about 80% identity, at least about 90% identity, at least about 95% identity, at least about 96% identity, at least about 97% identity, at least about 98% identity, at least about 99% identity, at least about 99.5% identity, or at least about 99.9% identity to the sequences set forth as SEQ ID NOs: 1-20, 30-41, 42-53, 79-89, 91-101, or 103-111. In several embodiments, the rAAV has 100% identity to the sequences set forth as SEQ ID NOs 1-20, 30-41, 42-53, 79-89, 91-101, or 103-111.


In some embodiments, any of the disclosed rAAV nucleic acid vector sequences comprise truncations at the 5′ or 3′ end relative to the sequences of any one of SEQ ID NOs: 1-20, 30-41, 42-53, 79-89, 91-101, or 103-111. In some embodiments, any of the rAAV vectors comprise a nucleotide sequence that differs from the sequence of any one of SEQ ID NOs: 1-20, 30-41, 42-53, 79-89, 91-101, or 103-111 by 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, or more than 18 nucleotides.


Recombinant Adeno-Associated Virus Vectors and Therapeutic Use Thereof

Many serotypes of AAV have been cloned and sequenced. Serotypes 1 and 6 share >99% amino acid homology in their capsid proteins. Of the first six AAV serotypes, serotype 2 is widely characterized and therefore often used in gene transfer studies, however according to embodiments disclosed herein, other AAV serotypes are also used, such as AAV9, AAV20, AAVrh74, AAVrh10, and the like. In several embodiments, repeat administration of a given serotype that would be expected to elicit a humoral immune response is performed in connection with an immune management regimen. In several embodiments, an immune management regimen comprises administration of one or more agents that function as B-cell depletors, alone, or in conjunction with one or more agents that inhibit one or more aspects of the mTOR pathway. In one embodiment, an antiCD20 antibody is administered and rapamycin is administered. In several embodiments, this allows for the repeat administration of a given serotype rAAV with reduced, limited or no immune response to a subsequent dosing of the rAAV. Further information about immune management can found in U.S. patent application Ser. No. 15/306,139, the entire contents of which is incorporated by reference herein.


The therapeutic rAAV vectors, therapeutic rAAV particles, or the composition comprising the therapeutic rAAV particles of the present disclosure, may be used for gene therapy for heart diseases in a human subject in need thereof, such as cardiomyopathies as provided for herein. Examples of heart disease that may be treated using the methods and compositions of the present disclosure include, but are not limited to, cardiomyopathy and acute ischemia. In some embodiments, cardiomyopathy is hypertrophic cardiomyopathy or dilated cardiomyopathy. In some embodiments, the cardiomyopathy is dilated cardiomyopathy and is caused by or associated with reduced or non-existent expression and/or function of BAG3. The therapeutic rAAV vectors, particles, and compositions comprising the therapeutic rAAV particles may be used for treatment of such heart failure (e.g., heart failure secondary to cardiomyopathy) when administered to a subject in need thereof, e.g., via vascular delivery into the coronary arteries and/or direct injection to the heart. The therapeutic rAAV vectors, particles, and compositions comprising the rAAV particles drive the concurrent expression of BAG3 in the cardiomyocytes of the subject.


In several embodiments, the amino acid sequence of the therapeutic BAG3 encoded by the BAG3 transgene is at least about 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98% or at least 99% sequence identity to one or a combination of the amino acid sequences set forth as SEQ ID NOs: 23-26, 101, or 83-85. In several embodiments, the amino acid sequence of the therapeutic BAG3 encoded by the BAG3 transgene is at least about 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98% or at least 99% sequence identity to the amino acid sequences set forth as SEQ ID NOs: 23-26 arranged in sequence. In several embodiments, the amino acid sequence of the therapeutic BAG3 encoded by the BAG3 transgene is at least about 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98% or at least 99% sequence identity to the amino acid sequences set forth as SEQ ID NO: 101. In several embodiments, the amino acid sequence of the therapeutic BAG3 encoded by the BAG3 transgene is at least about 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98% or at least 99% sequence identity to the amino acid sequences set forth as SEQ ID NOs: 83-85 arranged in sequence.


In some embodiments, there are provided amino acid sequences that correspond to any of the nucleic acids disclosed herein (and/or included in the accompanying sequence listing), while accounting for degeneracy of the nucleic acid code. Furthermore, those sequences (whether nucleic acid or amino acid) that vary from those expressly disclosed herein (and/or included in the accompanying sequence listing), but have functional similarity or equivalency are also contemplated within the scope of the present disclosure. The foregoing includes mutants, truncations, substitutions, or other types of modifications.


In accordance with some embodiments described herein, any of the sequences may be used, or a truncated or mutated form of any of the sequences disclosed herein (and/or included in the accompanying sequence listing) may be used and in any combination.


The promoter driving expression of the therapeutic nucleic acid can be, but is not limited to, a constitutive promoter, an inducible promoter, a tissue-specific promoter, a neuronal-specific promoter, a muscle-specific promoter, or a synthetic promoter. In some embodiments, the promoter is a neuronal-specific promoter or a muscle-specific promoter. A constitutive promoter can be, but is not limited to, a Herpes Simplex virus (HSV) promoter, a thymidine kinase (TK) promoter, a Rous Sarcoma Virus (RSV) promoter, a Simian Virus 40 (SV40) promoter, a Mouse Mammary Tumor Virus (MMTV) promoter, an Adenovirus E1A promoter, a cytomegalovirus (CMV) promoter, a mammalian housekeeping gene promoter, or a β-actin promoter. An inducible promoter can be, but is not limited to, a cytochrome P450 gene promoter, a heat shock protein gene promoter, a metallothionein gene promoter, a hormone-inducible gene promoter, an estrogen gene promoter, or a tetVP16 promoter that is responsive to tetracycline. A muscle-specific promoter can be, but is not limited to, desmin promoter, a creatine kinase promoter, a myogenin promoter, an alpha myosin heavy chain promoter, or a natriuretic peptide promoter.


In some embodiments, the therapeutic rAAV promoter comprises a neuronal- or cardiomuscle-specific promoter.


The therapeutic rAAV can be serotype 1, serotype 2, serotype 3, serotype 4, serotype 5, serotype 6, serotype 7, serotype 8, serotype 9, serotype 10, serotype 11, serotype 12, serotype rh10, or serotype rh74. The therapeutic rAAV can also be a pseudo-type rAAV.


In some embodiments, the therapeutic rAAV has a sequence sharing at least 85% sequence identity to SEQ ID NO: 1-20, 30-41, 42-53, 79-89, 91-101, or 103-111 arranged in sequence.


In some embodiments, the therapeutic rAAV has a sequence sharing at least 95% sequence identity to SEQ ID NO: 1-20, 30-41, 42-53, 79-89, 91-101, or 103-111 arranged in sequence.


Pharmaceutical Formulations and Administration

Compositions described herein may further comprise a pharmaceutical excipient, buffer, or diluent, and may be formulated for administration to host cell ex vivo or in situ in an animal, and particularly a human being. Such compositions may further optionally comprise a liposome, a lipid, a lipid complex, a microsphere, a microparticle, a nanosphere, or a nanoparticle, or may be otherwise formulated for administration to the cells, tissues, organs, or body of a subject in need thereof. Such compositions may be formulated for use in a variety of therapies, such as for example, in the amelioration, prevention, and/or treatment of conditions such as peptide deficiency, polypeptide deficiency, peptide overexpression, polypeptide overexpression, including for example, conditions which result in diseases or disorders as described herein.


Formulations comprising pharmaceutically-acceptable excipients and/or carrier solutions are well-known to those of skill in the art, as is the development of suitable dosing and treatment regimens for using the particular compositions described herein in a variety of treatment regimens, including e.g., oral, parenteral, intravenous, intranasal, intra-articular, and intramuscular administration and formulation.


Typically, these formulations may contain at least about 0.1% of the therapeutic agent (e.g., therapeutic rAAV particle or preparation) or more, although the percentage of the active ingredient(s) may, of course, be varied and may conveniently be between about 1 or 2% and about 70% or 80% or more of the weight or volume of the total formulation. Naturally, the amount of therapeutic agent(s) in each therapeutically-useful composition may be prepared in such a way that a suitable dosage will be obtained in any given unit dose of the compound. Factors such as solubility, bioavailability, biological half-life, route of administration, product shelf life, as well as other pharmacological considerations will be contemplated by one skilled in the art when preparing such pharmaceutical formulations. Additionally, a variety of dosages and treatment regimens may be desirable.


In certain circumstances, it will be desirable to deliver the therapeutic rAAV particles or preparations, in suitably formulated pharmaceutical compositions disclosed herein; either subcutaneously, intracardially, intraocularly, intravitreally, parenterally, subcutaneously, intravenously, intracerebro-ventricularly, intramuscularly, intrathecally, orally, intraperitoneally, by oral or nasal inhalation, or by direct injection to one or more cells (e.g., cardiomyocytes and/or other heart cells), tissues, or organs. In some embodiments, the therapeutic rAAV particles or the composition comprising the therapeutic rAAV particles of the present invention are delivered systemically via intravenous injection, particularly in those for treating a human. In some embodiments, the therapeutic rAAV particles or the composition comprising the therapeutic rAAV particles of the present invention are injected directly into the heart of the subject. Direct injection to the heart may comprise injection into one or more of the myocardial tissues, the cardiac lining, or the skeletal muscle surrounding the heart, e.g., using a needle catheter. In several embodiments, direct injection to human heart is preferred, for example, if delivery is performed concurrently with a surgical procedure whereby access to the heart is improved.


The pharmaceutical formulations of the compositions suitable for injectable use include sterile aqueous solutions or dispersions. In some embodiments, the formulation is sterile and fluid to the extent that easy syringability exists. In some embodiments, the form is stable under the conditions of manufacture and storage, and is preserved against the contaminating action of microorganisms, such as bacteria and fungi. The carrier may be a solvent or dispersion medium containing, for example, water, saline, ethanol, polyol (e.g., glycerol, propylene glycol, and liquid polyethylene glycol, and the like), suitable mixtures thereof, vegetable oils or other pharmaceutically acceptable carriers such as those that are Generally Recognized as Safe (GRAS) by the United States Food and Drug Administration. Proper fluidity may be maintained, for example, by the use of a coating, such as lecithin, by the maintenance of the required particle size in the case of dispersion and by the use of surfactants. In fact, there is virtually no limit to other components that may also be included, as long as the additional agents do not cause a significant adverse effect upon contact with the target cells or host tissues. The therapeutic rAAV particles or preparations may thus be delivered along with various other pharmaceutically acceptable agents as required in the particular instance. Such compositions may be purified from host cells or other biological sources, or alternatively may be chemically synthesized as described herein.


The amount of therapeutic rAAV particle or preparation, and/or therapeutic rAAV vector compositions and time of administration of such compositions will be within the purview of the skilled artisan having benefit of the present teachings. It is likely, however, that the administration of therapeutically-effective amounts of the compositions of the present disclosure may be achieved by a single administration, such as for example, a single injection of sufficient numbers of infectious particles to provide therapeutic benefit to the patient undergoing such treatment. In some circumstances, it may be desirable to provide multiple or successive administrations of the rAAV particle or preparation, and/or rAAV vector compositions, either over a relatively short, or a relatively prolonged period of time, as may be determined by the medical practitioner overseeing the administration of such compositions.


Toxicity and efficacy of the compositions utilized in methods of the present invention may be determined by standard pharmaceutical procedures, using either cells in culture or experimental animals to determine the LD50 (the dose lethal to 50% of the population). The dose ratio between toxicity and efficacy the therapeutic index and it may be expressed as the ratio LD50/ED50. Those compositions that exhibit large therapeutic indices are preferred. While compositions that exhibit toxic side effects may be used, care should be taken to design a delivery system that minimizes the potential damage of such side effects. The dosage of compositions as described herein lies generally within a range that includes an ED50 with little or no toxicity. The dosage may vary within this range depending upon the dosage form employed and the route of administration utilized.


Other aspects of the present disclosure relate to methods and preparations for use with a subject, such as human or non-human subjects, a host cell in situ in a subject, or a host cell derived from a subject. In some embodiments, the subject is a mammal. In some embodiments, the subject is a companion animal. “A companion animal”, as used herein, refers to pets and other domestic animals. Non-limiting examples of companion animals include dogs and cats; livestock such as horses, cattle, pigs, sheep, goats, and chickens; and other animals such as mice, rats, guinea pigs, and hamsters. In some embodiments, the subject is a human subject.


In some embodiments, one or more pharmaceutically acceptable excipients (including vehicles, carriers, diluents, and/or delivery polymers) are added to the pharmaceutical compositions including a therapeutic, thereby forming a pharmaceutical formulation suitable for in vivo delivery to a subject, such as a human.


A pharmaceutical composition or medicament includes a pharmacologically effective amount of at least one of the therapeutic and optionally one or more pharmaceutically acceptable excipients. Pharmaceutically acceptable excipients (excipients) are substances other than the Active Pharmaceutical ingredient (API, therapeutic product) that are intentionally included in the drug delivery system. Excipients do not exert or are not intended to exert a therapeutic effect at the intended dosage. Excipients may act to a) aid in processing of the drug delivery system during manufacture, b) protect, support or enhance stability, bioavailability or patient acceptability of the API, c) assist in product identification, and/or d) enhance any other attribute of the overall safety, effectiveness, of delivery of the API during storage or use. A pharmaceutically acceptable excipient may or may not be an inert substance.


Excipients include, but are not limited to: absorption enhancers, anti-adherents, anti-foaming agents, anti-oxidants, binders, buffering agents, carriers, coating agents, colors, delivery enhancers, delivery polymers, dextran, dextrose, diluents, disintegrants, emulsifiers, extenders, fillers, flavors, glidants, humectants, lubricants, oils, polymers, preservatives, saline, salts, solvents, sugars, suspending agents, sustained release matrices, sweeteners, thickening agents, tonicity agents, vehicles, water-repelling agents, and wetting agents.


The pharmaceutical compositions can contain other additional components commonly found in pharmaceutical compositions. Such additional components can include, but are not limited to: anti-pruritics, astringents, local anesthetics, or anti-inflammatory agents (e.g., antihistamine, diphenhydramine, etc.).


The carrier can be, but is not limited to, a solvent or dispersion medium containing, for example, water, ethanol, polyol (for example, glycerol, propylene glycol, and liquid polyethylene glycol), and suitable mixtures thereof. A carrier may also contain adjuvants such as preservatives, wetting agents, emulsifying agents, and dispersing agents. A carrier may also contain isotonic agents, such as sugars, polyalcohols, sodium chloride, and the like into the compositions.


Pharmaceutically acceptable refers to those properties and/or substances which are acceptable to the subject from a pharmacological/toxicological point of view. The phrase pharmaceutically acceptable refers to molecular entities, compositions, and properties that are physiologically tolerable and do not typically produce an allergic or other untoward or toxic reaction when administered to a subject. In some embodiments, a pharmaceutically acceptable compound is approved by a regulatory agency of the Federal or a state government or listed in the U.S. Pharmacopeia or other generally recognized pharmacopeia for use in animals and more particularly in humans.


The The rAAVs or pharmaceutical compositions as described herein, may be formulated for administration to host cell ex vivo or in situ in an animal, and particularly a human being. The rAAVs or pharmaceutical compositions can be administered by a variety of routes. Administration routes included, but are not limited to, intravenous, intra-arterial, subcutaneous, intramuscular, intrahepatic, intraperitoneal and/or local delivery to a target tissue. In some embodiments, a plurality of injections, or other administration types, are provided, for example 2, 3, 4, 5, 6, 7, 8, 9, 10 or more injections. Routes of administration may be combined, if desired. Depending on the embodiment, the first and second rAAV need not be administered the same number of times (e.g., the first rAAV may be administered 1 time, and the second vector may be administered three times). In some embodiments, the dosing is intramuscular administration.


In some embodiments, the number of rAAV particles administered to a subject may be on the order ranging from about 106 to about 1014 particles/mL or about 103 to about 1013 particles/mL, or any values in between for either range, such as for example, about 106, 107, 108, 109, 1010, 1011, 1012, 1013, or 1014 particles/mL. In some embodiments, the number of rAAV particles administered to a subject may be on the order ranging from about 106 to about 1014 vector genomes (vgs)/mL or 103 to 1015 vgs/mL, or any values in between for either range, such as for example, about 106, 107, 108, 109, 1010, 1011, 1012, 1013, or 1014 vgs/mL. In some embodiments, between about 0.5 and about 5 rAAV vector genomes per cell are administered. In some embodiments, between about 0.5 and about 2 rAAV vector genomes per cell are administered. In some embodiments, between about 1×1013 and about 3×1014 vector genomes per kilo (vgs/kg) are administered. In some embodiments, dosing is based on the mass of the subject's cardiac muscle. In some embodiments, dosing is based on body weight. In some embodiments, dosing is based on body surface area. The rAAV particles can be administered as a single dose, or divided into two or more administrations as may be required to achieve therapy of the particular disease or disorder being treated. In some embodiments, doses ranging from about 0.0001 mL to about 10 mLs are delivered to a subject.


For administration of an injectable aqueous solution, for example, the solution may be suitably buffered, if necessary, and the liquid diluent first rendered isotonic with sufficient saline or glucose. These particular aqueous solutions are especially suitable for intravenous, intramuscular, intravitreal, subcutaneous and intraperitoneal administration. In this connection, a sterile aqueous medium that can be employed will be known to those of skill in the art in light of the present disclosure. For example, one dosage may be dissolved in 1 mL of isotonic NaCl solution and either added to 1000 mL of hypodermoclysis fluid or injected at the proposed site of infusion, (see, for example, “Remington's Pharmaceutical Sciences” 15th Edition, pages 1035-1038 and 1570-1580). Some variation in dosage will necessarily occur depending on the condition of the subject being treated. The person responsible for administration will, in any event, determine the appropriate dose for the individual subject. Moreover, for human administration, preparations should meet sterility, pyrogenicity, and the general safety and purity standards as required by, e.g., FDA Office of Biologies standards. In several embodiments, the rAAV formulation will comprise, consist of, or consist essentially of active rAAV ingredient, a mono-basic buffer (e.g., sodium phosphate mono-basic buffer, a di-basic salt (e.g., sodium phosphate di-basic), a sodium-based tonicifier (e.g., sodium chloride tonicifier), a non-sodium tonicifier (e.g., magnesium chloride hexahydrate tonicifier), a surfactant (e.g., poloxamer 188 surfactant), and water. In several embodiments, the rAAV formulation will comprise, consist of, or consist essentially of active rAAV ingredient, sodium phosphate mono-basic buffer, sodium phosphate di-based, sodium chloride tonicifier, magnesium chloride hexahydrate tonicifier, poloxamer 188 surfactant, and water. In several embodiments, the active rAAV ingredient is present in the formulation according to the vector genome amounts provided for herein. In several embodiments, the mono-basic buffer (e.g., sodium phosphate mono-basic buffer) is present in the formulation at a concentration between about 0.2 mg/mL and about 0.5 mg/mL. In several embodiments, the di-basic salt (e.g., sodium phosphate di-basic) is present in the formulation at a concentration between about 1.5 mg/mL and about 4 mg/mL. In several embodiments, the sodium-based tonicifier (e.g., sodium chloride tonicifier) is present in the formulation at a concentration between about 8 mg/mL and about 12 mg/mL. In several embodiments, the non-sodium tonicifier (e.g., magnesium chloride hexahydrate tonicifier) is present in the formulation at a concentration between about 0.1 mg/mL and about 0.35 mg/mL. In several embodiments, the surfactant (e.g., poloxamer 188 surfactant) is present in the formulation at a concentration between about 0.05 mg/mL and about 0.8 mg/mL. In several embodiments, water is present to bring the volume of the formulation (e.g. a dosage unit) to 1 mL.


Sterile injectable solutions are prepared by incorporating the rAAV particles or preparations, in the required amount in the appropriate solvent with several of the other ingredients enumerated above, as required, followed by filtered sterilization. Generally, dispersions are prepared by incorporating the various sterilized active ingredients into a sterile vehicle that contains the basic dispersion medium and the other ingredients from those enumerated above. In the case of sterile powders for the preparation of sterile injectable solutions, the preferred methods of preparation are vacuum-drying and freeze-drying techniques, which yield a powder of the active ingredient plus any additional desired ingredient from a previously sterile-filtered solution thereof.


The amount of rAAV particle or preparation, and time of administration of such particle or preparation will be within the purview of the skilled artisan having benefit of the present teachings. It is likely, however, that the administration of therapeutically-effective amounts of the AAV particles or preparation of the present disclosure may be achieved by a single administration, such as for example, a single injection of sufficient numbers of infectious particles to provide therapeutic benefit to the patient undergoing such treatment. Alternatively, in some circumstances, it may be desirable to provide multiple or successive administrations of the rAAV particle or preparation, either over a relatively short, or a relatively prolonged period of time, as may be determined by the medical practitioner overseeing the administration of such compositions.


If desired, rAAV particles may be administered in combination with other agents as well, such as, e.g., proteins or polypeptides or various pharmaceutically-active agents, including one or more administrations of therapeutic polypeptides, biologically active fragments, or variants thereof. In fact, there is virtually no limit to other components that may also be included, as long as the additional agents do not cause a significant adverse effect upon contact with the target cells or host tissues. The rAAV particles or preparations may thus be delivered along with various other pharmaceutically acceptable agents as required in the particular instance. Such compositions may be purified from host cells or other biological sources, or alternatively may be chemically synthesized as described herein.


In some embodiments, treatment of a subject with a rAAV particles as described herein achieves one, two, three, four, or more of the following effects, including, for example: (i) reduction or amelioration the severity of disease or symptom associated therewith; (ii) reduction in the duration of a symptom associated with a disease; (iii) protection against the progression of a disease or symptom associated therewith; (iv) regression of a disease or symptom associated therewith; (v) protection against the development or onset of a symptom associated with a disease; (vi) protection against the recurrence of a symptom associated with a disease; (vii) reduction in the hospitalization of a subject; (viii) reduction in the hospitalization length; (ix) an increase in the survival of a subject with a disease; (x) a reduction in the number of symptoms associated with a disease; (xi) an enhancement, improvement, supplementation, complementation, or augmentation of the prophylactic or therapeutic effect(s) of another therapy.


As is apparent to those skilled in the art in view of the teachings of this specification, an effective amount of viral vector to be added can be empirically determined. Administration can be administered in a single dose, a plurality of doses, continuously or intermittently throughout the course of treatment. Methods of determining the most effective means and dosages of administration are well known to those of skill in the art and will vary with the viral vector, the composition of the therapy, the target cells, and the subject being treated. Single and multiple administrations can be carried out with the dose level and pattern being selected by the treating physician.


Kits

Herein are described compositions including one or more of the disclosed rAAV vectors comprised within a kit for diagnosing, preventing, treating or ameliorating one or more symptoms of a heart disease or condition, such as a cardiomyopathy. Such kits may be useful in the diagnosis, prophylaxis, and/or therapy or a human disease, and may be particularly useful in the treatment, prevention, and/or amelioration of one or more symptoms of heart disease, such as a cardiomyopathy. In some embodiments, the heart disease is caused by cardiomyopathy. In some embodiments, the heart disease is caused by hypertrophic cardiomyopathy or dilated cardiomyopathy. In some embodiments, the heart disease is dilated cardiomyopathy.


Kits comprising one or more of the disclosed rAAV vectors (as well as one or more virions, viral particles, transformed host cells or pharmaceutical compositions comprising such vectors); and instructions for using such kits in one or more therapeutic, diagnostic, and/or prophylactic clinical embodiments are also provided according to several embodiments. Such kits may comprise one or more reagents, restriction enzymes, peptides, therapeutics, pharmaceutical compounds, or means for delivery of the composition(s) to host cells, or to an animal (e.g., syringes, injectables, and the like). Depending on the embodiment, kits include those for treating, preventing, or ameliorating the symptoms of a disease, deficiency, dysfunction, and/or injury, or may include components for the large-scale production of the viral vectors themselves.


In some embodiments, a kit comprises one or more containers or receptacles comprising one or more doses of any of the described therapeutic. Such kits may be therapeutic in nature. In some embodiments, the kit contains a unit dosage, meaning a predetermined amount of a composition comprising, for example, a described therapeutic with or without one or more additional agents.


One or more of the components of a kit can be provided in one or more liquid or frozen solvents. The solvent can be aqueous or non-aqueous. The formulation in the kit can also be provided as dried powder(s) or in lyophilized form that can be reconstituted upon addition of an appropriate solvent.


In some embodiments, a kit comprises a label, marker, package insert, bar code and/or reader indicating directions of suitable usage of the kit contents. In some embodiments, the kit may comprise a label, marker, package insert, bar code and/or reader indicating that the kit contents may be administered in accordance with a certain dosage or dosing regimen to treat a subject.


In addition, a kit may also contain various reagents, including, but not limited to, wash reagents, elution reagents, and concentration reagents. Such reagents may be readily selected from among the reagents described herein, and from among conventional concentration reagents.


As used herein, the term “kit” may be used to describe variations of the portable, self-contained enclosure that includes at least one set of components to conduct one or more of the diagnostic or therapeutic methods of the invention.


Combination Therapies

Multiple embodiments of the present disclosure are directed to each individual feature, system, article, material, kit, and/or method described herein. In addition, any combination of two or more such features, systems, articles, materials, kits, and/or methods, if such features, systems, articles, materials, kits, and/or methods are not mutually inconsistent, is included within the inventive scope of the present disclosure.


The compositions of the present disclosure may include rAAV particles or preparations, and/or rAAV vectors, either alone or in combination with one or more additional active ingredients, which may be obtained from natural or recombinant sources or chemically synthesized. In some embodiments, rAAV particles or preparations are administered in combination, either in the same composition or administered as part of the same treatment regimen, with a proteasome inhibitor, such as Bortezomib, or hydroxyurea.


If desired, rAAV particles may be administered in combination with other agents as well, such as, e.g., proteins or polypeptides or various pharmaceutically-active agents. This may, in some embodiments, reflect for example one or more administrations of therapeutic polypeptides, (e.g., a recombinant form of a functional peptide or protein that aids to replace or supplement the rAAV-based production of protein encoded by the transgene) biologically active fragments, or variants thereof. The rAAV particles or preparations may thus be delivered along with various other pharmaceutically acceptable agents as required in the particular instance. Such compositions may be purified from host cells or other biological sources, or alternatively may be chemically synthesized as described herein.


In some embodiments, the additional therapeutic agent comprises an anti-inflammatory agent. The anti-inflammatory agent can be, but is not limited to, a corticosteroid, cortisone hydrocortisone, hydrocortisone-21-monoesters (e.g., hydrocortisone-21-acetate, hydrocortisone-21-butyrate, hydrocortisone-21-propionate, hydrocortisone-21-valerate, etc.), hydrocortisone-17,21-diesters (e.g., hydrocortisone-17,21-diacetate, hydrocortisone-17-acetate-21-butyrate, hydrocortisone-17,21-dibutyrate, etc.), alclometasone, dexamethasone, flumethasone, prednisolone, methylprednisolone, betamethasone, typically as betamethasone benzoate or betamethasone diproprionate; fluocinonide; prednisone; and triamcinolone, typically as triamcinolone acetonide. In some embodiments, the anti-inflammatory agent is a mast cell degranulation inhibitor, such as, without limitation, cromolyn (5,5′-(2-hydroxypropane-1,3-diyl)bis(oxy)bis(4-oxo-4H-chromene-2-carboxylic acid) (also known as cromoglycate), and 2-carboxylatochromon-5′-yl-2-hydroxypropane derivatives such as bis(acetoxymethyl), disodium cromoglycate, nedocromil (9-ethyl-4,6-dioxo-10-propyl-6,9-dihydro-4H-pyrano[3,2-g]quinoline-2,8-dicarboxylic acid) and tranilast (2-{[(2E)-3-(3,4-dimethoxyphenyl) prop-2-enoyl]amino}), and lodoxamide (2-[2-chloro-5-cyano-3-(oxaloamino) anilino]-2-oxoacetic acid). In some embodiments, the anti-inflammatory agent is a nonsteroidal anti-inflammatory drugs (NSAIDs), such as, without limitation, aspirin compounds (acetylsalicylates), non-aspirin salicylates, diclofenac, diflunisal, etodolac, fenoprofen, flurbiprofen, ibuprofen, indomethacin, ketoprofen, meclofenamate, naproxen, naproxen sodium, phenylbutazone, sulindac, and tometin.


In some embodiments, the anti-inflammatory agent comprises an antihistamine. The antihistamine can be, but is not limited to, clemastine, clemastine fumarate (2 (R)-[2-[1-(4-Chlorophenyl)-1-phenyl-ethoxy]ethyl-1-methylpyrrolidine), dexmedetomidine, doxylamine, loratidine, desloratidine and promethazine, and diphenhydramine, or pharmaceutically acceptable salts, solvates or esters thereof. In some embodiments, the antihistamine includes, without limitation, azatadine, azelastine, burfroline, cetirizine, cyproheptadine, doxantrozole, ctodroxizine, forskolin, hydroxyzine, ketotifen, oxatomide, pizotifen, proxicromil, N,N′-substituted piperazines or terfenadine. In some embodiments, the antihistamine is an H1 antagonist, such as, but not limited to, cetirizine, chlorpheniramine, dimenhydrinate, diphenhydramine, fexofenadine, hydroxyzine, orphenadrine, pheniramine, and doxylamine. In some embodiments, the antihistamine is an H2 antagonist, such as, but not limited to, cimetidine, famotidine, lafutidine, nizatidine, ranitidine, and roxatidine.


In some embodiments, the additional therapeutic agent comprises an antiviral agent, including antiretroviral agents. Suitable antiviral agents include, without limitation, remdesivir, acyclovir, famcyclovir, ganciclovir, foscarnet, idoxuridine, sorivudine, trifluorothymidine, valacyclovir, vidarabine, didanosine, dideoxyinosine, stavudine, zalcitabine, zidovudine, amantadine, interferon alpha, ribavirin and rimantadine.


In some embodiments, the additional therapeutic agent comprises an antibiotic. Non-limiting examples of suitable antibiotics include beta-lactams such as penicillins, aminopenicillins (e.g., amoxicillin, ampicillin, hetacillin, etc.), penicillinase resistant antibiotics (e.g., cloxacillin, dicloxacillin, methicillin, nafcillin, oxacillin, etc.), extended spectrum antibiotics (e.g., axlocillin, carbenicillin, mezlocillin, piperacillin, ticarcillin, etc.); cephalosporins (e.g., cefadroxil, cefazolin, cephalixin, cephalothin, cephapirin, cephradine, cefaclor, cefacmandole, cefmetazole, cefonicid, ceforanide, cefotetan, cefoxitin, cefprozil, cefuroxime, loracarbef, cefixime, cefoperazone, cefotaxime, cefpodoxime, ceftazidime, ceftiofur, ceftizoxime, ceftriaxone, moxalactam, etc.); monobactams such as aztreonam; Carbapenems such as imipenem and eropenem; quinolones (e.g., ciprofloxacin, enrofloxacin, difloxacin, orbifloxacin, marbofloxacin, etc.); chloramphenicols (e.g., chloramphenicol, thiamphenicol, florfenicol, etc.); tetracyclines (e.g., chlortetracycline, tetracycline, oxytetracycline, doxycycline, minocycline, etc.); macrolides (e.g., erythromycin, tylosin, tlimicosin, clarithromycin, azithromycin, etc.); lincosamides (e.g., lincomycin, clindamycin, etc.); aminoglycosides (e.g., gentamicin, amikacin, kanamycin, apramycin, tobramycin, neomycin, dihydrostreptomycin, paromomycin, etc.); sulfonamides (e.g., sulfadmethoxine, sfulfamethazine, sulfaquinoxaline, sulfamerazine, sulfathiazole, sulfasalazine, sulfadiazine, sulfabromomethazine, suflaethoxypyridazine, etc.); glycopeptides (e.g., vancomycin, teicoplanin, ramoplanin, and decaplanin; and other antibiotics (e.g., rifampin, nitrofuran, virginiamycin, polymyxins, tobramycin, etc.)).


In some embodiments, the additional therapeutic agent comprises an antifungal agent, such as, but not limited to, itraconazole, ketoconazole, fluoconazole, and amphotericin B. In some embodiments, the therapeutic agent is an antiparasitic agents, such as, but not limited to, the broad spectrum antiparasitic medicament nitazoxanide; antimalarial drugs and other antiprotozoal agents (e.g., artemisins, mefloquine, lumefantrine, tinidazole, and miltefosine); anthelminthics such as mebendazole, thiabendazole, and ivermectin; and antiamoebic agents such as rifampin and amphotericin B.


In some embodiments, the additional therapeutic agent comprises an analgesic agent, including, without limitation, opioid analgesics such as alfentanil, buprenorphine, butorphanol, codeine, drocode, fentanyl, hydrocodone, hydromorphone, levorphanol, meperidine, methadone, morphine, nalbuphine, oxycodone, oxymorphone, pentazocine, propoxyphene, sufentanil, and tramadol; and nonopioid analgesics such as apazone, etodolac, diphenpyramide, indomethacin, meclofenamate, mefenamic acid, oxaprozin, phenylbutazone, piroxicam, and tolmetin.


The disclosure herein of any particular feature, aspect, method, property, characteristic, quality, attribute, element, or the like in connection with an embodiment can be used in all other embodiments set forth herein. Accordingly, it should be understood that various features and aspects of the disclosed embodiments can be combined with or substituted for one another in order to form varying modes of the disclosed inventions. Thus, it is intended that the scope of the present inventions herein disclosed should not be limited by the particular disclosed embodiments described above. Moreover, while the invention is susceptible to various modifications, and alternative forms, specific examples thereof have been shown in the drawings and are herein described in detail. It should be understood, however, that the invention is not to be limited to the particular forms or methods disclosed, but to the contrary, the invention is to cover all modifications, equivalents, and alternatives falling within the spirit and scope of the various embodiments described and the appended claims. Any methods disclosed herein need not be performed in the order recited. The methods disclosed herein include certain actions taken by a practitioner; however, they can also include any third-party instruction of those actions, either expressly or by implication. In addition, where features or aspects of the disclosure are described in terms of Markush groups, those skilled in the art will recognize that the disclosure is also thereby described in terms of any individual member or subgroup of members of the Markush group.


Any titles or subheadings used herein are for organization purposes and should not be used to limit the scope of embodiments disclosed herein.


EXAMPLES

The following examples are illustrative only and are not intended to be a limitation on the scope of the invention.


MATERIALS AND METHODS CONSTRUCT DESIGN

BAG3 cDNA was codon optimized for expression in human tissues and was subcloned into a plasmid backbone suitable for production of AAV. The constructs were engineered to comprise the elements as provided in Tables 1-3 below. Schematic representations of the constructs are provided in FIGS. 1 through 3.









TABLE 1







Construct 1 (pTR-CBA-Bag3; FIG. 1)









Elements

AA sequence (as


(5′ -> 3′)
Nt sequence
applicable)





ITR-L
TTGGCCACTCCCTCTCTGCGCGCTCG




CTCGCTCACTGAGGCCGGGCGACCAA




AGGTCGCCCGACGCCCGGGCTTTGCC




CGGGCGGCCTCAGTGAGCGAGCGAGC




GCGCAGAGAGGGAGTGGCCAACTCCA




TCACTAGGGGTTCCT (SEQ ID




NO: 1)






CMV
CCTAGTTATTAATAGTAATCAATTAC



enhancer
GGGGTCATTAGTTCATAGCCCATATA




TGGAGTTCCGCGTTACATAACTTACG




GTAAATGGCCCGCCTGGCTGACCGCC




CAACGACCCCCGCCCATTGACGTCAA




TAATGACGTATGTTCCCATAGTAACG




CCAATAGGGACTTTCCATTGACGTCA




ATGGGTGGACTATTTACGGTAAACTG




CCCACTTGGCAGTACATCAAGTGTAT




CATATGCCAAGTACGCCCCCTATTGA




CGTCAATGACGGTAAATGGCCCGCCT




GGCATTATGCCCAGTACATGACCTTA




TGGGACTTTCCTACTTGGCAGTACAT




CTACGTATTAGTCATCGCTATTACCA




TG (SEQ ID NO: 3)






CBA
TCGAGGTGAGCCCCACGTTCTGCTTC



promoter
ACTCTCCCCATCTCCCCCCCCTCCCC




ACCCCCAATTTTGTATTTATTTATTT




TTTAATTATTTTGTGCAGCGATGGGG




GCGGGGGGGGGGGGGGGGCGCGCGCC




AGGCGGGGCGGGGCGGGGCGAGGGGC




GGGGCGGGGCGAGGCGGAGAGGTGCG




GCGGCAGCCAATCAGAGCGGCGCGCT




CCGAAAGTTTCCTTTTATGGCGAGGC




GGCGGCGGCGGCGGCCCTATAAAAAG




CGAAGCGCGCGGCGGGCG (SEQ ID




NO: 5)






chimeric
GGAGTCGCTGCGACGCTGCCTTCGCC



intron
CCGTGCCCCGCTCCGCCGCCGCCTCG




CGCCGCCCGCCCCGGCTCTGACTGAC




CGCGTTACTCCCACAGGTGAGCGGGC




GGGACGGCCCTTCTCCTCCGGGCTGT




AATTAGCGCTTGGTTTAATGACGGCT




TGTTTCTTTTCTGTGGCTGCGTGAAA




GCCTTGAGGGGCTCCGGGAGGGCCCT




TTGTGCGGGGGGGAGCGGCTCGGGGG




GTGCGTGCGTGTGTGTGTGCGTGGGG




AGCGCCGCGTGCGGCCCGCGCTGCCC




GGCGGCTGTGAGCGCTGCGGGCGCGG




CGCGGGGCTTTGTGCGCTCCGCAGTG




TGCGCGAGGGGAGCGCGGCCGGGGGC




GGTGCCCCGCGGTGCGGGGGGGGCTG




CGAGGGGAACAAAGGCTGCGTGCGGG




GTGTGTGCGTGGGGGGGTGAGCAGGG




GGTGTGGGCGCGGCGGTCGGGCTGTA




ACCCCCCCCTGCACCCCCCTCCCCGA




GTTGCTGAGCACGGCCCGGCTTCGGG




TGCGGGGCTCCGTACGGGGCGTGGCG




CGGGGCTCGCCGTGCCGGGCGGGGGG




TGGCGGCAGGTGGGGGTGCCGGGCGG




GGCGGGGCCGCCTCGGGCCGGGGAGG




GCTCGGGGGAGGGGCGCGGCGGCCCC




CGGAGCGCCGGCGGCTGTCGAGGCGC




GGCGAGCCGCAGCCATTGCCTTTTAT




GGTAATCGTGCGAGAGGGCGCAGGGA




CTTCCTTTGTCCCAAATCTGTGCGGA




GCCGAAATCTGGGAGGCGCCGCCGCA




CCCCCTCTAGCGGGCGCGGGGCGAAG




CGGTGCGGCGCCGGCAGGAAGGAAAT




GGGCGGGGAGGGCCTTCGTGCGTCGC




CGCGCCGCCGTCCCCTTCTCCCTCTC




CAGCCTCGGGGCTGTCCGCGGGGGGA




CGGCTGCCTTCGGGGGGGACGGGGCA




GGGCGGGGTTCGGCTTCTGGCGTGTG




ACCGGCGGCTCTAGAGCCTCTGCTAA




CCATGTTCATGCCTTCTTCTTTTTCC




TACAG (SEQ ID NO: 6)






exon 1
GCATCCAACCCCGGGCCGCGGCCAAC



non-
TTCTCTGGACTGGACCAGAAGTTTCT



coding
AGCCGGCCAGTTGCTACCTCCCTTTA




TCTCCTCCTTCCCCTCTGGCAGCGAG




GAGGCTATTTCCAGACACTTCCACCC




CTCTCTGGCCACGTCACCCCCGCCTT




TAATTCATAAAGGTGCCCGGCGCCGG




CTTCCCGGACACGTCGGCGGCGGAGA




GGGGCCCACGGCGGCGGCCCGGCCAG




AGACTCGGCGCCCGGAGCCAGCGCCC




CGCACCCGCGCCCCAGCGGGCAGACC




CCAACCCAGCG (SEQ ID NO: 8)






Exon 1
ATGTCTGCTGCCACACACAGCCCTAT
MSAATHSPMMQVASG



GATGCAGGTCGCCTCTGGCAACGGCG
NGDRDPLPPGWEIKI



ACAGAGATCCTTTGCCTCCTGGCTGG
DPQTGWPFFVDHNSR



GAGATCAAGATCGATCCTCAGACCGG
TTTWNDPRVPSEGPK



CTGGCCCTTCTTCGTGGACCACAATA
(SEQ ID NO: 23)



GCAGAACCACCACCTGGAACGACCCC




AGAGTGCCTTCTGAGGGCCCCAAA




(SEQ ID NO: 10)






Exon 2
GAGACACCCAGCTCTGCCAATGGACC
ETPSSANGPSREGSR



CAGCAGAGAGGGAAGCAGACTGCCAC
LPPAREGHPVYPQLR



CAGCTAGAGAAGGACACCCCGTGTAT
GPYIPIPVLHEGAEN



CCACAGCTGAGGCCTGGCTACATCCC
RQVHPFHVYPQPGMQ



CATTCCAGTGCTGCATGAGGGCGCCG
RFRTEAAAAAPQRSQ



AAAACAGACAGGTGCACCCCTTTCAC
SPLRGMPETTQPDKQ



GTGTACCCTCAGCCTGGCATGCAGCG
CGQVAAAAAAQPPAS



GTTTAGAACAGAAGCCGCTGCTGCCG
HGPE (SEQ ID



CTCCTCAGAGATCTCAGTCTCCTCTG
NO: 24)



AGAGGCATGCCCGAGACAACCCAGCC




TGATAAGCAGTGTGGACAGGTGGCAG




CTGCAGCAGCAGCTCAACCTCCTGCT




TCTCACGGCCCCGAA (SEQ ID




NO: 11)






Exon 3
AGAAGCCAATCTCCTGCCGCCTCTGA
RSQSPAASDCSSSSS



TTGCAGCTCCAGCTCTAGCTCTGCCT
SASLPSSGRSSLGSH



CTCTGCCTAGCAGCGGCAGATCTAGC
QLPRGYISIPVIHEQ



CTGGGCTCTCATCAACTGCCCAGAGG
NVTRPAAQPSFHQAQ



CTACATCAGCATCCCTGTGATCCACG
KTHYPAQQGEYQTHQ



AGCAGAACGTGACCAGACCTGCTGCT
PVYHKIQGDDWEPRP



CAGCCCAGCTTCCATCAGGCCCAGAA
LRAASPFRSSVQGAS



AACACACTACCCCGCTCAGCAGGGCG
SREGSPARSSTPLHS



AGTACCAGACACACCAGCCTGTGTAC
PSPIRVHTVVDRPQ



CACAAGATCCAGGGCGACGACTGGGA
(SEQ ID NO: 25)



GCCCAGACCTCTTAGAGCCGCTAGTC




CCTTCAGATCCTCTGTGCAGGGCGCC




AGTTCTAGAGAGGGCTCTCCTGCCAG




AAGCAGCACACCTCTGCACAGCCCAT




CTCCAATCAGAGTGCACACCGTGGTG




GACAGACCCCAG (SEQ ID NO:




12)






Exon 4
CAGCCTATGACACACAGAGAGACAGC
QPMTHRETAPVSQPE



CCCTGTCAGCCAGCCTGAGAACAAGC
NKPESKPGPVGPELP



CTGAGTCCAAGCCAGGACCTGTGGGA
PGHIPIQVIRKEVDS



CCTGAACTGCCTCCAGGACACATCCC
KPVSQKPPPPSEKVE



TATCCAAGTGATCCGGAAAGAGGTGG
VKVPPAPVPCPPPSP



ACAGCAAGCCCGTGTCTCAGAAGCCT
GPSAVPSSPKSVATE



CCTCCACCTAGCGAGAAAGTGGAAGT
ERAAPSTAPAEATPP



GAAAGTGCCTCCTGCTCCTGTGCCTT
KPGEAEAPPKHPGVL



GTCCTCCTCCATCTCCTGGACCATCT
KVEAILEKVQGLEQA



GCCGTGCCTAGCTCTCCTAAAAGCGT
VDNFEGKKTDKKYLM



GGCCACCGAGGAAAGAGCCGCTCCTT
IEEYLTKELLALDSV



CTACAGCTCCTGCCGAGGCCACACCT
DPEGRADVRQARRDG



CCTAAACCTGGCGAAGCTGAAGCCCC
VRKVQTILEKLEQKA



TCCAAAACACCCTGGCGTGCTGAAGG
IDVPGQVQVYELQPS



TGGAAGCCATCCTGGAAAAGGTGCAG
NLEADQPLQAIMEMG



GGACTCGAGCAGGCCGTGGACAACTT
AVAADKGKKNAGNAE



CGAGGGCAAGAAAACCGACAAGAAAT
DPHTETQQPEATAAA



ACCTGATGATCGAGGAATACCTGACC
TSNPSSMTDTPGNPA



AAAGAGCTGCTGGCCCTGGACAGCGT
AP (SEQ ID NO:



TGACCCTGAAGGCAGAGCAGATGTGC
26)



GGCAGGCTAGAAGAGATGGCGTGCGG




AAAGTGCAGACCATCCTCGAGAAGCT




GGAACAGAAAGCCATCGACGTGCCAG




GCCAGGTGCAGGTTTACGAGCTGCAG




CCCTCTAACCTGGAAGCCGATCAGCC




TCTGCAGGCCATCATGGAAATGGGAG




CCGTGGCCGCCGACAAGGGAAAGAAG




AATGCTGGCAACGCCGAGGATCCCCA




CACCGAAACACAGCAGCCTGAAGCTA




CAGCCGCCGCTACCAGCAATCCCAGC




AGCATGACAGACACCCCTGGCAATCC




AGCCGCTCCA (SEQ ID NO: 13)






Exon 4
CCTCTGCCCTGTAAAAATCAGACTCG



UTR
GAACCGATGTGTGCTTTAGGGAATTT




TAAGTTGCATGCATTTCAGAGACTTT




AAGTCAGTTGGTTTTTATTAGCTGCT




TGGTATGCAGTAACTTGGGTGGAGGC




AAAACACTAATAAAAGGGCTAAAAAG




GAAAATGATGCTTTTCTTCTATATTC




TTACTCTGTACCAATAAAGAAGTTGC




TTGTTGTTTGAGAAGTTTAACCCCGT




TGCTTGTTGTTCTGCAGCCCTGTCTA




CTTGGGCACCCCCACCACCTGTTAGC




TGTGGTTGTGCACTGTCTTTTGTAGC




TCTGGACTGGAGGGGTAGATGGGGAG




TCAATTACCCATCACATAAATATGAA




ACATTTATCAGAAATGTTGCCATTTT




AATGAGATGATTTTCTTCATCTCATA




ATTAAAATACCTGACTTTAGAGAGAG




TAAAATGTGCCAGGAGCCATAGGAAT




ATCTGTATGTTGGATGACTTTAATGC




TACATTTTAAAAAAAGAAAATAAAGT




AATAATATAACTCAAAA (SEQ ID




NO: 15)






bGH polyA
CGACTGTGCCTTCTAGTTGCCAGCCA




TCTGTTGTTTGCCCCTCCCCCGTGCC




TTCCTTGACCCTGGAAGGTGCCACTC




CCACTGTCCTTTCCTAATAAAATGAG




GAAATTGCATCGCATTGTCTGAGTAG




GTGTCATTCTATTCTGGGGGGTGGGG




TGGGGCAGGACAGCAAGGGGGAGGAT




TGGGAAGACAA (SEQ ID NO:




17)






bGH polyA
TAGCAGG



signal







ITR-R
AGGAACCCCTAGTGATGGAGTTGGCC




ACTCCCTCTCTGCGCGCTCGCTCGCT




CACTGAGGCCGGGCGACCAAAGGTCG




CCCGACGCCCGGGCTTTGCCCGGGCG




GCCTCAGTGAGCGAGCGAGCGCGCAG




AGAGGGAGTGGCCAA (SEQ ID




NO: 20)
















TABLE 2







Construct 2 (pTR-CK8-Ex1_Intron1-Ex2-Bag3; FIG. 2)









Elements

AA sequence (as


(5′ -> 3′)
Nt sequence
applicable)





ITR-L
TTGGCCACTCCCTCTCTGCGCGCTCG




CTCGCTCACTGAGGCCGGGCGACCAA




AGGTCGCCCGACGCCCGGGCTTTGCC




CGGGCGGCCTCAGTGAGCGAGCGAGC




GCGCAGAGAGGGAGTGGCCAACTCCA




TCACTAGGGGTTCCT (SEQ ID




NO: 30)






CK8
AGACTAGCATGCTGCCCATGTAAGGA




GGCAAGGCCTGGGGACACCCGAGATG




CCTGGTTATAATTAACCCAGACATGT




GGCTGCCCCCCCCCCCCCAACACCTG




CTGCCTCTAAAAATAACCCTGCATGC




CATGTTCCCGGCGAAGGGCCAGCTGT




CCCCCGCCAGCTAGACTCAGCACTTA




GTTTAGGAACCAGTGAGCAAGTCAGC




CCTTGGGGCAGCCCATACAAGGCCAT




GGGGCTGGGCAAGCTGCACGCCTGGG




TCCGGGGTGGGCACGGTGCCCGGGCA




ACGAGCTGAAAGCTCATCTGCTCTCA




GGGGCCCCTCCCTGGGGACAGCCCCT




CCTGGCTAGTCACACCCTGTAGGCTC




CTCTATATAACCCAGGGGCACAGGGG




CTGCCCTCATTCTACCACCACCTCCA




CAGCACAGACAGACACTCAGGAGCCA




GCCAAA (SEQ ID NO: 31)






Exon 1
GAATTCGATATCAAGCTTGCATCCAA



Non-coding
CCCCGGGCCGCGGCCAACTTCTCTGG




ACTGGACCAGAAGTTTCTAGCCGGCC




AGTTGCTACCTCCCTTTATCTCCTCC




TTCCCCTCTGGCAGCGAGGAGGCTAT




TTCCAGACACTTCCACCCCTCTCTGG




CCACGTCACCCCCGCCTTTAATTCAT




AAAGGTGCCCGGCGCCGGCTTCCCGG




ACACGTCGGCGGCGGAGAGGGGCCCA




CGGCGGCGGCCCGGCCAGAGACTCGG




CGCCCGGAGCCAGCGCCCCGCACCCG




CGCCCCAGCGGGCAGACCCCAACCCA




GCGCCACC (SEQ ID NO: 32)






Exon 1
ATGTCTGCTGCCACACACTCTCCAAT
MSAATHSPMMQVASG



GATGCAGGTTGCCTCTGGCAATGGGG
NGDRDPLPPGWEIKI



ACAGAGATCCTCTGCCTCCTGGCTGG
DPQTGWPFFVDHNSR



GAGATCAAGATTGATCCTCAGACAGG
TTTWNDPRVPSEGPK



CTGGCCCTTCTTTGTGGACCACAACA
(SEQ ID NO: 23)



GCAGAACCACCACCTGGAATGACCCC




AGAGTGCCCTCTGAGGGCCCTAAG




(SEQ ID NO: 33)






Intron 1
GTTTCAAGAGCTAGAGGCCCTCCTTG




GAGTGTGGCTCCTCCTAGAAGGCAAG




CTGCTGGCTCTGGACTTGGAAGAGGG




GATGCTAGAAGAAGAGGCCCTGGAGT




TGGAGAGGGCCCCAGTAGAGCTGATA




CAGGATCTGCCCCTAGACACACCCTG




CCTCTGAGGCCTGATGAGAGCCCACT




GAGAACCAGACCTTGACAGGCGTCGG




GGCGAAAGGAGGCCCCGGGATTCGGT




GGCCCGGGAAGCGACCCCGCAGTGGC




TCCGGTGCCGTCCACGGCTCGACTCC




AGGGCGGAAGGCCGGGTGTCCAGCGC




TGGCCTCGCGCTCTAGGGCTGGGAGA




GGGGCGGCCGGCCTGGTCAGCTCCGG




AGGCCCCGGCCCACCGTGGCCCCTGC




TGCCCGCTCGTGCTGTAATGTAGAGG




TTGGAGCTGACCCCTGCTCCTGGAGC




TCATCTTCTGATCCGGGTCTCTGAAA




ATGCGGGCATGGGCAGCTCTCCTACA




CTCACCGCTTCCCTCAGTCACCCAGA




AAAGAAACCTGTCTTACCAGTTTGGA




GAATTGGGACCTTTCCTTTGCTTAAC




AGATACTTTTGGCTTTCTCCTGATGC




CCCTAATTCCTAAACTGTTGGCCAAA




TAGCAACCTCTATGGGGTGGGGGGTT




TGGAGGGTACAGGGGCTGGGAGCTGG




CTGACGCTTTGAGGCCCAAGTCACTC




GGGAAGATCACAATGCCAAGCGCCAC




AGTGTTTCCTCTGCCAGGAGGGTTCA




CTTCCCAGTTTCTAACCAGCCTGTGT




TTCTCCACTTTTTATTTCAG (SEQ




ID NO: 34)






Exon 2
GAGACACCCAGCTCTGCCAATGGACC
ETPSSANGPSREGSR



CAGCAGAGAGGGAAGCAGACTGCCAC
LPPAREGHPVYPQLR



CAGCTAGAGAAGGACACCCCGTGTAT
PGYIPIPVLHEGAEN



CCACAGCTGAGGCCTGGCTACATCCC
RQVHPFHVYPQPGMQ



CATTCCAGTGCTGCATGAGGGCGCCG
RFRTEAAAAAPQRSQ



AAAACAGACAGGTGCACCCCTTTCAC
SPLRGMPETTQPDKQ



GTGTACCCTCAGCCTGGCATGCAGAG
CGQVAAAAAAQPPAS



ATTCAGAACAGAGGCTGCTGCTGCAG
HGPE (SEQ ID



CCCCTCAGAGATCTCAATCTCCTCTG
NO: 24)



AGAGGCATGCCAGAGACAACACAGCC




TGACAAGCAGTGTGGACAGGTGGCAG




CAGCAGCTGCAGCTCAACCTCCTGCT




TCTCATGGCCCTGAG (SEQ ID




NO: 35)






Exon 3
AGAAGCCAGTCTCCTGCTGCTTCTGA
RSQSPAASDCSSSSS



TTGCAGCAGCTCCAGTAGCTCTGCCT
SASLPSSGRSSLGSH



CTCTGCCTAGCTCTGGCAGATCTTCT
QLPRGYISIPVIHEQ



CTGGGCAGCCATCAGCTGCCTAGAGG
NVTRPAAQPSFHQAQ



CTACATCAGCATCCCTGTGATCCATG
KTHYPAQQGEYQTHQ



AGCAGAATGTGACCAGACCAGCTGCT
PVYHKIQGDDWEPRP



CAGCCTAGCTTCCACCAGGCTCAGAA
LRAASPFRSSVQGAS



AACACACTACCCTGCTCAGCAAGGGG
SREGSPARSSTPLHS



AGTACCAGACACACCAGCCAGTGTAC
PSPIRVHTVVDRPQ



CACAAGATCCAAGGGGATGACTGGGA
(SEQ ID NO: 25)



GCCCAGACCACTGAGAGCTGCTAGCC




CCTTTAGAAGCTCTGTGCAAGGGGCC




AGCTCTAGAGAGGGCTCTCCTGCCAG




AAGCAGCACACCTCTGCACAGCCCAT




CTCCAATCAGAGTGCACACCGTGGTG




GACAGACCCCAG (SEQ ID NO:




36)






Exon 4
CAGCCTATGACACACAGAGAGACAGC
QPMTHRETAPVSQPE



CCCTGTCAGCCAGCCTGAGAACAAGC
NKPESKPGPVGPELP



CTGAGTCCAAGCCAGGACCTGTGGGA
PGHIPIQVIRKEVDS



CCTGAACTGCCTCCAGGACACATCCC
KPVSQKPPPPSEKVE



TATCCAAGTGATCCGGAAAGAGGTGG
VKVPPAPVPCPPPSP



ACAGCAAGCCCGTGTCTCAGAAGCCT
GPSAVPSSPKSVATE



CCTCCACCTAGCGAGAAAGTGGAAGT
ERAAPSTAPAEATPP



GAAAGTGCCTCCTGCTCCTGTGCCTT
KPGEAEAPPKHPGVL



GTCCTCCTCCATCTCCTGGACCATCT
KVEAILEKVQGLEQA



GCCGTGCCTAGCTCTCCTAAAAGCGT
VDNFEGKKTDKKYLM



GGCCACCGAGGAAAGAGCCGCTCCTT
IEEYLTKELLALDSV



CTACAGCTCCTGCCGAGGCCACACCT
DPEGRADVRQARRDG



CCTAAACCTGGCGAAGCTGAAGCCCC
VRKVQTILEKLEQKA



TCCAAAACACCCTGGCGTGCTGAAGG
IDVPGQVQVYELQPS



TGGAAGCCATCCTGGAAAAGGTGCAG
NLEADQPLQAIMEMG



GGACTCGAGCAGGCCGTGGACAACTT
AVAADKGKKNAGNAE



CGAGGGCAAGAAAACCGACAAGAAAT
DPHTETQQPEATAAA



ACCTGATGATCGAGGAATACCTGACC
TSNPSSMTDTPGNPA



AAAGAGCTGCTGGCCCTGGACAGCGT
AP (SEQ ID NO:



TGACCCTGAAGGCAGAGCAGATGTGC
26)



GGCAGGCTAGAAGAGATGGCGTGCGG




AAAGTGCAGACCATCCTCGAGAAGCT




GGAACAGAAAGCCATCGACGTGCCAG




GCCAGGTGCAGGTTTACGAGCTGCAG




CCCTCTAACCTGGAAGCCGATCAGCC




TCTGCAGGCCATCATGGAAATGGGAG




CCGTGGCCGCCGACAAGGGAAAGAAG




AATGCTGGCAACGCCGAGGATCCCCA




CACCGAAACACAGCAGCCTGAAGCTA




CAGCCGCCGCTACCAGCAATCCCAGC




AGCATGACAGACACCCCTGGCAATCC




AGCCGCTCCA (SEQ ID NO: 37)






Exon 4 UTR
CCTCTGCCCTGTAAAAATCAGACTCG




GAACCGATGTGTGCTTTAGGGAATTT




TAAGTTGCATGCATTTCAGAGACTTT




AAGTCAGTTGGTTTTTATTAGCTGCT




TGGTATGCAGTAACTTGGGTGGAGGC




AAAACACTAATAAAAGGGCTAAAAAG




GAAAATGATGCTTTTCTTCTATATTC




TTACTCTGTACCAATAAAGAAGTTGC




TTGTTGTTTGAGAAGTTTAACCCCGT




TGCTTGTTGTTCTGCAGCCCTGTCTA




CTTGGGCACCCCCACCACCTGTTAGC




TGTGGTTGTGCACTGTCTTTTGTAGC




TCTGGACTGGAGGGGTAGATGGGGAG




TCAATTACCCATCACATAAATATGAA




ACATTTATCAGAAATGTTGCCATTTT




AATGAGATGATTTTCTTCATCTCATA




ATTAAAATACCTGACTTTAGAGAGAG




TAAAATGTGCCAGGAGCCATAGGAAT




ATCTGTATGTTGGATGACTTTAATGC




TACATTTTAAAAAAAGAAAATAAAGT




AATAATATAACTCAAAA (SEQ ID




NO: 39)






ITR-R
AGGAACCCCTAGTGATGGAGTTGGCC




ACTCCCTCTCTGCGCGCTCGCTCGCT




CACTGAGGCCGGGCGACCAAAGGTCG




CCCGACGCCCGGGCTTTGCCCGGGCG




GCCTCAGTGAGCGAGCGAGCGCGCAG




AGAGGGAGTGGCCAA (SEQ ID




NO: 41)
















TABLE 3







Construct 3 (pTR2-mDes-Ex1-Intron1-


Ex2-Bag3; FIG. 3)









Elements

AA sequence (as


(5′ -> 3′)
Nt sequence
applicable)





ITR-L
TTGGCCACTCCCTCTCTGCGCGCTCG




CTCGCTCACTGAGGCCGGGCGACCAA




AGGTCGCCCGACGCCCGGGCTTTGCC




CGGGCGGCCTCAGTGAGCGAGCGAGC




GCGCAGAGAGGGAGTGGCCAACTCCA




TCACTAGGGGTTCCT (SEQ ID




NO: 42)






mDES
GATCTTACCCCCTGCCCCCCACAGCT




CCTCTCCTGTGCCTTGTTTCCCAGCC




ATGCGTTCTCCTCTATAAATACCCGC




TCTGGTATTTGGGGTTGGCAGCTGTT




GCTGCCAGGGAGATGGTTGGGTTGAC




ATGCGGCTCCTGACAAAACACAAACC




CCTGGTGTGTGTGGGCGTGGGTGGTG




TGAGTAGGGGGATGAATCAGGGAGGG




GGCGGGGGACCCAGGGGGCAGGAGCC




ACACAAAGTCTGTGCGGGGGTGGGAG




CGCACATAGCAATTGGAAACTGAAAG




GTTATCAGACCCTTTCTGGAAATCAG




CCCACTGTTTATAAACTTGAGGCCCC




ACCCTCGAGATAACCAGGGCTGAAAG




AGGCCCGCCTGGGGGCTGGAGACATG




CTTGCTGCCTGCCCTGGCGAAGGATT




GGCAGGCTTGCCCGTCACAGGACCCC




CGCTGGCTGACTCAGGGGCGCAGGCC




TCTTGCGGGGGAGCTGGCCTCCCCGC




CCCCACGGCCACGGGCCGCCCTTTCC




TGGCAGGACAGCGGGATCTTGCAGCT




GTCAGGGGAGGGGAGGCGGGGGCTGA




TGTCAGGAGGGATACAAATAGTGCCG




ACGGCTGGGGGCCCTGTCTCCCCTCG




CCGCATCCACTCTCCGGCCGGCCGCC




TGTCCGCCGCCTCCTCCGTGCGCCCG




CCAGCCTCGCCCGCGCCGTCACCGTG




AGGCACTGGGCAGGTAAGTATCAAAG




TATCAAGGTTACAAGACAGGTTTAAG




GAGACCAATAGAAACTGGGCTTGTCG




AGACAGAGAAGACTCTTGCGTTTCTG




ATAGGCACCTATTGGTCTTACTGACA




TCCACTTTGCCTTTCTCTCCACAGGC




TAG (SEQ ID NO: 43)






Exon 1
GAATTCGATATCAAGCTTGCATCCAA



Non-coding
CCCCGGGCCGCGGCCAACTTCTCTGG




ACTGGACCAGAAGTTTCTAGCCGGCC




AGTTGCTACCTCCCTTTATCTCCTCC




TTCCCCTCTGGCAGCGAGGAGGCTAT




TTCCAGACACTTCCACCCCTCTCTGG




CCACGTCACCCCCGCCTTTAATTCAT




AAAGGTGCCCGGCGCCGGCTTCCCGG




ACACGTCGGCGGCGGAGAGGGGCCCA




CGGCGGCGGCCCGGCCAGAGACTCGG




CGCCCGGAGCCAGCGCCCCGCACCCG




CGCCCCAGCGGGCAGACCCCAACCCA




GCGCCACC (SEQ ID NO: 44)






Exon 1
ATGTCTGCTGCCACACACTCTCCAAT
MSAATHSPMMQVASG



GATGCAGGTTGCCTCTGGCAATGGGG
NGDRDPLPPGWEIKI



ACAGAGATCCTCTGCCTCCTGGCTGG
DPQTGWPFFVDHNSR



GAGATCAAGATTGATCCTCAGACAGG
TTTWNDPRVPSEGPK



CTGGCCCTTCTTTGTGGACCACAACA
(SEQ ID NO: 23)



GCAGAACCACCACCTGGAATGACCCC




AGAGTGCCCTCTGAGGGCCCTAAG




(SEQ ID NO: 45)






Intron 1
GTTTCAAGAGCTAGAGGCCCTCCTTG




GAGTGTGGCTCCTCCTAGAAGGCAAG




CTGCTGGCTCTGGACTTGGAAGAGGG




GATGCTAGAAGAAGAGGCCCTGGAGT




TGGAGAGGGCCCCAGTAGAGCTGATA




CAGGATCTGCCCCTAGACACACCCTG




CCTCTGAGGCCTGATGAGAGCCCACT




GAGAACCAGACCTTGACAGGCGTCGG




GGCGAAAGGAGGCCCCGGGATTCGGT




GGCCCGGGAAGCGACCCCGCAGTGGC




TCCGGTGCCGTCCACGGCTCGACTCC




AGGGCGGAAGGCCGGGTGTCCAGCGC




TGGCCTCGCGCTCTAGGGCTGGGAGA




GGGGCGGCCGGCCTGGTCAGCTCCGG




AGGCCCCGGCCCACCGTGGCCCCTGC




TGCCCGCTCGTGCTGTAATGTAGAGG




TTGGAGCTGACCCCTGCTCCTGGAGC




TCATCTTCTGATCCGGGTCTCTGAAA




ATGCGGGCATGGGCAGCTCTCCTACA




CTCACCGCTTCCCTCAGTCACCCAGA




AAAGAAACCTGTCTTACCAGTTTGGA




GAATTGGGACCTTTCCTTTGCTTAAC




AGATACTTTTGGCTTTCTCCTGATGC




CCCTAATTCCTAAACTGTTGGCCAAA




TAGCAACCTCTATGGGGTGGGGGGTT




TGGAGGGTACAGGGGCTGGGAGCTGG




CTGACGCTTTGAGGCCCAAGTCACTC




GGGAAGATCACAATGCCAAGCGCCAC




AGTGTTTCCTCTGCCAGGAGGGTTCA




CTTCCCAGTTTCTAACCAGCCTGTGT




TTCTCCACTTTTTATTTCAG (SEQ




ID NO: 46)






Exon 2
GAGACACCCAGCTCTGCCAATGGACC
ETPSSANGPSREGSR



CAGCAGAGAGGGAAGCAGACTGCCAC
LPPAREGHPVYPQLR



CAGCTAGAGAAGGACACCCCGTGTAT
PGYIPIPVLHEGAEN



CCACAGCTGAGGCCTGGCTACATCCC
RQVHPFHVYPQPGMQ



CATTCCAGTGCTGCATGAGGGCGCCG
RFRTEAAAAAPQRSQ



AAAACAGACAGGTGCACCCCTTTCAC
SPLRGMPETTQPDKQ



GTGTACCCTCAGCCTGGCATGCAGAG
CGQVAAAAAAQPPAS



ATTCAGAACAGAGGCTGCTGCTGCAG
HGPE (SEQ ID



CCCCTCAGAGATCTCAATCTCCTCTG
NO: 24)



AGAGGCATGCCAGAGACAACACAGCC




TGACAAGCAGTGTGGACAGGTGGCAG




CAGCAGCTGCAGCTCAACCTCCTGCT




TCTCATGGCCCTGAG (SEQ ID




NO: 47)






Exon 3
AGAAGCCAGTCTCCTGCTGCTTCTGA
RSQSPAASDCSSSSS



TTGCAGCAGCTCCAGTAGCTCTGCCT
SASLPSSGRSSLGSH



CTCTGCCTAGCTCTGGCAGATCTTCT
QLPRGYISIPVIHEQ



CTGGGCAGCCATCAGCTGCCTAGAGG
NVTRPAAQPSFHQAQ



CTACATCAGCATCCCTGTGATCCATG
KTHYPAQQGEYQTHQ



AGCAGAATGTGACCAGACCAGCTGCT
PVYHKIQGDDWEPRP



CAGCCTAGCTTCCACCAGGCTCAGAA
LRAASPFRSSVQGAS



AACACACTACCCTGCTCAGCAAGGGG
SREGSPARSSTPLHS



AGTACCAGACACACCAGCCAGTGTAC
PSPIRVHTVVDRPQ



CACAAGATCCAAGGGGATGACTGGGA
(SEQ ID NO: 25)



GCCCAGACCACTGAGAGCTGCTAGCC




CCTTTAGAAGCTCTGTGCAAGGGGCC




AGCTCTAGAGAGGGCTCTCCTGCCAG




AAGCAGCACACCTCTGCACAGCCCAT




CTCCAATCAGAGTGCACACCGTGGTG




GACAGACCCCAG (SEQ ID




NO: 48)






Exon 4
CAGCCTATGACACACAGAGAGACAGC
QPMTHRETAPVSQPE



CCCTGTCAGCCAGCCTGAGAACAAGC
NKPESKPGPVGPELP



CTGAGTCCAAGCCAGGACCTGTGGGA
PGHIPIQVIRKEVDS



CCTGAACTGCCTCCAGGACACATCCC
KPVSQKPPPPSEKVE



TATCCAAGTGATCCGGAAAGAGGTGG
VKVPPAPVPCPPPSP



ACAGCAAGCCCGTGTCTCAGAAGCCT
GPSAVPSSPKSVATE



CCTCCACCTAGCGAGAAAGTGGAAGT
ERAAPSTAPAEATPP



GAAAGTGCCTCCTGCTCCTGTGCCTT
KPGEAEAPPKHPGVL



GTCCTCCTCCATCTCCTGGACCATCT
KVEAILEKVQGLEQA



GCCGTGCCTAGCTCTCCTAAAAGCGT
VDNFEGKKTDKKYLM



GGCCACCGAGGAAAGAGCCGCTCCTT
IEEYLTKELLALDSV



CTACAGCTCCTGCCGAGGCCACACCT
DPEGRADVRQARRDG



CCTAAACCTGGCGAAGCTGAAGCCCC
VRKVQTILEKLEQKA



TCCAAAACACCCTGGCGTGCTGAAGG
IDVPGQVQVYELQPS



TGGAAGCCATCCTGGAAAAGGTGCAG
NLEADQPLQAIMEMG



GGACTCGAGCAGGCCGTGGACAACTT
AVAADKGKKNAGNAE



CGAGGGCAAGAAAACCGACAAGAAAT
DPHTETQQPEATAAA



ACCTGATGATCGAGGAATACCTGACC
TSNPSSMTDTPGNPA



AAAGAGCTGCTGGCCCTGGACAGCGT
AP (SEQ ID NO:



TGACCCTGAAGGCAGAGCAGATGTGC
26)



GGCAGGCTAGAAGAGATGGCGTGCGG




AAAGTGCAGACCATCCTCGAGAAGCT




GGAACAGAAAGCCATCGACGTGCCAG




GCCAGGTGCAGGTTTACGAGCTGCAG




CCCTCTAACCTGGAAGCCGATCAGCC




TCTGCAGGCCATCATGGAAATGGGAG




CCGTGGCCGCCGACAAGGGAAAGAAG




AATGCTGGCAACGCCGAGGATCCCCA




CACCGAAACACAGCAGCCTGAAGCTA




CAGCCGCCGCTACCAGCAATCCCAGC




AGCATGACAGACACCCCTGGCAATCC




AGCCGCTCCA (SEQ ID NO: 49)






Exon 4 UTR
CCTCTGCCCTGTAAAAATCAGACTCG




GAACCGATGTGTGCTTTAGGGAATTT




TAAGTTGCATGCATTTCAGAGACTTT




AAGTCAGTTGGTTTTTATTAGCTGCT




TGGTATGCAGTAACTTGGGTGGAGGC




AAAACACTAATAAAAGGGCTAAAAAG




GAAAATGATGCTTTTCTTCTATATTC




TTACTCTGTACCAATAAAGAAGTTGC




TTGTTGTTTGAGAAGTTTAACCCCGT




TGCTTGTTGTTCTGCAGCCCTGTCTA




CTTGGGCACCCCCACCACCTGTTAGC




TGTGGTTGTGCACTGTCTTTTGTAGC




TCTGGACTGGAGGGGTAGATGGGGAG




TCAATTACCCATCACATAAATATGAA




ACATTTATCAGAAATGTTGCCATTTT




AATGAGATGATTTTCTTCATCTCATA




ATTAAAATACCTGACTTTAGAGAGAG




TAAAATGTGCCAGGAGCCATAGGAAT




ATCTGTATGTTGGATGACTTTAATGC




TACATTTTAAAAAAAGAAAATAAAGT




AATAATATAACTCAAAA (SEQ ID




NO: 51)






ITR-R
AGGAACCCCTAGTGATGGAGTTGGCC




ACTCCCTCTCTGCGCGCTCGCTCGCT




CACTGAGGCCGGGCGACCAAAGGTCG




CCCGACGCCCGGGCTTTGCCCGGGCG




GCCTCAGTGAGCGAGCGAGCGCGCAG




AGAGGGAGTGGCCAA (SEQ ID




NO: 53)









In Silico Derivation of Consensus Kozak Sequence for Enhanced Expression in Cardiac Tissues

An analysis of highly expressed genes in human heart tissues was performed to design a novel synthetic Kozak sequence to enhance transgene expression in the heart. Genes were selected from the Human Protein Atlas and Kozak sequences for each were identified in NCBI, as show in Table 4 below. A consensus sequence was derived using Weblogo (https://weblogo.berkeley.edu/logo.cgi). The consensus sequence (AGCCCCAAC (SEQ ID NO: 56)) was then utilized in the design of selected transgene constructs provided herein.













TABLE 4








Kozak
SEQ



Gene
sequence
ID:









MYH7
GGCACAGCC
63







ACTC1
TGTGCCAAG
64







TNNI3
AGTCTCAGC
65







MYL7
GCAGAGAGA
66







NPPA
TCCAGAGAC
67







NPPB
TCCAGAGAC
68







TNNI2
GACCTCAGG
69







MYBPC3
TCTCTCAGG
70







MYL4
CAAGACAAC
71







MYBPHL
AGGCCCAGC
72







MYH6
AGCACCAAG
73







LRRC10
AGCCTCCGC
74







ACTC1
TGTGCCAAG
75







RD3L
AGGCTAAAA
76







Consensus

AGCCCCAAC

56



Sequence










Self-complementary AAV (scAAV) genomes were designed with various promoters and alternative Kozak sequences, including the in silico derived sequence, as shown below.


1. scAAV with chick beta actin (CBA) promoter and AGCGCCACC Kozak sequence

    • Lower case=5′ mutant ITR
    • Upper case, bold italics=spacer sequences
    • Underlined, uppercase=CBA promoter
    • Upper case, bold=Kozak sequence
    • Upper case=BAG3 cDNA ending with stop sequence (TAATGA)
    • Upper case, bold underlined=PolyA
    • Lower case, underlined=3′ WT ITR









Sequence


(SEQ ID NO: 57)


tccctctctgcgcgctcgctcgctcactgaggccgggcgaccaaaggtcg





cccgacgcccgggctttgcccgggcggcctcagtgagcgagcgagcgcgc





agctgctgCTAGAGGTACTCGAGGTGAGCCCCACGTTCTGCTTCACTCTC





CCCATCTCCCCCCCCTCCCCACCCCCAATTTTGTATTTATTTATTTTTTA





ATTATTTTGTGCAGCGATGGGGGCGGGGGGGGGGGGGGGGCGCGCGCCAG





GCGGGGCGGGGCGGGGCGAGGGGCGGGGCGGGGCGAGGCGGAGAGGTGCG





GCGGCAGCCAATCAGAGCGGCGCGCTCCGAAAGTTTCCTTTTATGGCGAG





GCGGCGGCGGCGGCGGCCCTATAAAAAGCGAAGCGCGCGGCGGGCGAGCG






CCACCATGTCTGCTGCCACACACAGCCCTATGATGCAGGTCGCCTCTGGC






AACGGCGACAGAGATCCTTTGCCTCCTGGCTGGGAGATCAAGATCGATCC





TCAGACCGGCTGGCCCTTCTTCGTGGACCACAATAGCAGAACCACCACCT





GGAACGACCCCAGAGTGCCTTCTGAGGGCCCCAAAGAGACACCCAGCTCT





GCCAATGGACCCAGCAGAGAGGGAAGCAGACTGCCACCAGCTAGAGAAGG





ACACCCCGTGTATCCACAGCTGAGGCCTGGCTACATCCCCATTCCAGTGC





TGCATGAGGGCGCCGAAAACAGACAGGTGCACCCCTTTCACGTGTACCCT





CAGCCTGGCATGCAGCGGTTTAGAACAGAAGCCGCTGCTGCCGCTCCTCA





GAGATCTCAGTCTCCTCTGAGAGGCATGCCCGAGACAACCCAGCCTGATA





AGCAGTGTGGACAGGTGGCAGCTGCAGCAGCAGCTCAACCTCCTGCTTCT





CACGGCCCCGAAAGAAGCCAATCTCCTGCCGCCTCTGATTGCAGCTCCAG





CTCTAGCTCTGCCTCTCTGCCTAGCAGCGGCAGATCTAGCCTGGGCTCTC





ATCAACTGCCCAGAGGCTACATCAGCATCCCTGTGATCCACGAGCAGAAC





GTGACCAGACCTGCTGCTCAGCCCAGCTTCCATCAGGCCCAGAAAACACA





CTACCCCGCTCAGCAGGGCGAGTACCAGACACACCAGCCTGTGTACCACA





AGATCCAGGGCGACGACTGGGAGCCCAGACCTCTTAGAGCCGCTAGTCCC





TTCAGATCCTCTGTGCAGGGCGCCAGTTCTAGAGAGGGCTCTCCTGCCAG





AAGCAGCACACCTCTGCACAGCCCATCTCCAATCAGAGTGCACACCGTGG





TGGACAGACCCCAGCAGCCTATGACACACAGAGAGACAGCCCCTGTCAGC





CAGCCTGAGAACAAGCCTGAGTCCAAGCCAGGACCTGTGGGACCTGAACT





GCCTCCAGGACACATCCCTATCCAAGTGATCCGGAAAGAGGTGGACAGCA





AGCCCGTGTCTCAGAAGCCTCCTCCACCTAGCGAGAAAGTGGAAGTGAAA





GTGCCTCCTGCTCCTGTGCCTTGTCCTCCTCCATCTCCTGGACCATCTGC





CGTGCCTAGCTCTCCTAAAAGCGTGGCCACCGAGGAAAGAGCCGCTCCTT





CTACAGCTCCTGCCGAGGCCACACCTCCTAAACCTGGCGAAGCTGAAGCC





CCTCCAAAACACCCTGGCGTGCTGAAGGTGGAAGCCATCCTGGAAAAGGT





GCAGGGACTCGAGCAGGCCGTGGACAACTTCGAGGGCAAGAAAACCGACA





AGAAATACCTGATGATCGAGGAATACCTGACCAAAGAGCTGCTGGCCCTG





GACAGCGTTGACCCTGAAGGCAGAGCAGATGTGCGGCAGGCTAGAAGAGA





TGGCGTGCGGAAAGTGCAGACCATCCTCGAGAAGCTGGAACAGAAAGCCA





TCGACGTGCCAGGCCAGGTGCAGGTTTACGAGCTGCAGCCCTCTAACCTG





GAAGCCGATCAGCCTCTGCAGGCCATCATGGAAATGGGAGCCGTGGCCGC





CGACAAGGGAAAGAAGAATGCTGGCAACGCCGAGGATCCCCACACCGAAA





CACAGCAGCCTGAAGCTACAGCCGCCGCTACCAGCAATCCCAGCAGCATG





ACAGACACCCCTGGCAATCCAGCCGCTCCATAATGAGCGGCCGCCGGCCG






AATAAAAGATCCTTATTTTCATTGGATCTGTGTGTTGGTTTTTTGT

GTGG









TCGACTCTAG
aggaacccctagtgatggagttggccactccctctctgcg






cgctcgctcgctcactgaggccgggcgaccaaaggtcgcccgacgcccgg





gctttgcccgggcggcctcagtgagcgagcgagcgcgcagagagggagtg





gccaa







2. scAAV with chick beta actin (CBA) promoter and in silico derived Kozak sequence
    • Lower case=5′ mutant ITR
    • Upper case, bold italics=spacer sequences
    • Underlined, uppercase=CBA promoter
    • Upper case, bold=in silico derived Kozak sequence
    • Upper case=BAG3 cDNA ending with stop sequence (TAATGA)
    • Upper case, bold underlined=PolyA
    • Lower case, underlined=3′ WT ITR









Sequence


(SEQ ID NO: 58)


tccctctctgcgcgctcgctcgctcactgaggccgggcgaccaaaggtcg





cccgacgcccgggctttgcccgggcggcctcagtgagcgagcgagcgcgc





agctgctgCTAGAGGTACTCGAGGTGAGCCCCACGTTCTGCTTCACTCTC





CCCATCTCCCCCCCCTCCCCACCCCCAATTTTGTATTTATTTATTTTTTA





ATTATTTTGTGCAGCGATGGGGGCGGGGGGGGGGGGGGGGCGCGCGCCAG





GCGGGGCGGGGGGGGCGAGGGGGGGGCGGGGCGAGGCGGAGAGGTGCGGC





GGCAGCCAATCAGAGCGGCGCGCTCCGAAAGTTTCCTTTTATGGCGAGGC





GGCGGCGGCGGCGGCCCTATAAAAAGCGAAGCGCGCGGCGGGCGAGCCCC






AACATGTCTGCTGCCACACACAGCCCTATGATGCAGGTCGCCTCTGGCAA






CGGCGACAGAGATCCTTTGCCTCCTGGCTGGGAGATCAAGATCGATCCTC





AGACCGGCTGGCCCTTCTTCGTGGACCACAATAGCAGAACCACCACCTGG





AACGACCCCAGAGTGCCTTCTGAGGGCCCCAAAGAGACACCCAGCTCTGC





CAATGGACCCAGCAGAGAGGGAAGCAGACTGCCACCAGCTAGAGAAGGAC





ACCCCGTGTATCCACAGCTGAGGCCTGGCTACATCCCCATTCCAGTGCTG





CATGAGGGCGCCGAAAACAGACAGGTGCACCCCTTTCACGTGTACCCTCA





GCCTGGCATGCAGCGGTTTAGAACAGAAGCCGCTGCTGCCGCTCCTCAGA





GATCTCAGTCTCCTCTGAGAGGCATGCCCGAGACAACCCAGCCTGATAAG





CAGTGTGGACAGGTGGCAGCTGCAGCAGCAGCTCAACCTCCTGCTTCTCA





CGGCCCCGAAAGAAGCCAATCTCCTGCCGCCTCTGATTGCAGCTCCAGCT





CTAGCTCTGCCTCTCTGCCTAGCAGCGGCAGATCTAGCCTGGGCTCTCAT





CAACTGCCCAGAGGCTACATCAGCATCCCTGTGATCCACGAGCAGAACGT





GACCAGACCTGCTGCTCAGCCCAGCTTCCATCAGGCCCAGAAAACACACT





ACCCCGCTCAGCAGGGCGAGTACCAGACACACCAGCCTGTGTACCACAAG





ATCCAGGGCGACGACTGGGAGCCCAGACCTCTTAGAGCCGCTAGTCCCTT





CAGATCCTCTGTGCAGGGCGCCAGTTCTAGAGAGGGCTCTCCTGCCAGAA





GCAGCACACCTCTGCACAGCCCATCTCCAATCAGAGTGCACACCGTGGTG





GACAGACCCCAGCAGCCTATGACACACAGAGAGACAGCCCCTGTCAGCCA





GCCTGAGAACAAGCCTGAGTCCAAGCCAGGACCTGTGGGACCTGAACTGC





CTCCAGGACACATCCCTATCCAAGTGATCCGGAAAGAGGTGGACAGCAAG





CCCGTGTCTCAGAAGCCTCCTCCACCTAGCGAGAAAGTGGAAGTGAAAGT





GCCTCCTGCTCCTGTGCCTTGTCCTCCTCCATCTCCTGGACCATCTGCCG





TGCCTAGCTCTCCTAAAAGCGTGGCCACCGAGGAAAGAGCCGCTCCTTCT





ACAGCTCCTGCCGAGGCCACACCTCCTAAACCTGGCGAAGCTGAAGCCCC





TCCAAAACACCCTGGCGTGCTGAAGGTGGAAGCCATCCTGGAAAAGGTGC





AGGGACTCGAGCAGGCCGTGGACAACTTCGAGGGCAAGAAAACCGACAAG





AAATACCTGATGATCGAGGAATACCTGACCAAAGAGCTGCTGGCCCTGGA





CAGCGTTGACCCTGAAGGCAGAGCAGATGTGCGGCAGGCTAGAAGAGATG





GCGTGCGGAAAGTGCAGACCATCCTCGAGAAGCTGGAACAGAAAGCCATC





GACGTGCCAGGCCAGGTGCAGGTTTACGAGCTGCAGCCCTCTAACCTGGA





AGCCGATCAGCCTCTGCAGGCCATCATGGAAATGGGAGCCGTGGCCGCCG





ACAAGGGAAAGAAGAATGCTGGCAACGCCGAGGATCCCCACACCGAAACA





CAGCAGCCTGAAGCTACAGCCGCCGCTACCAGCAATCCCAGCAGCATGAC





AGACACCCCTGGCAATCCAGCCGCTCCATAATGAGCGGCCGCCGGCCGAA






TAAAAGATCCTTATTTTCATTGGATCTGTGTGTTGGTTTTTTGTGT

GGTC









GACTCTAG
aggaacccctagtgatggagttggccactccctctctgcgcg






ctcgctcgctcactgaggccgggcgaccaaaggtcgcccgacgcccgggc





tttgcccgggcggcctcagtgagcgagcgagcgcgcagagagggagtggc





caa







3. scAAV with chick beta actin (CBA) promoter and CAACCCAGC Kozak sequence
    • Lower case=5′ mutant ITR
    • Upper case, bold italics=spacer sequences
    • Underlined, uppercase=CBA promoter
    • Upper case, bold=Kozak sequence
    • Upper case=BAG3 cDNA ending with stop sequence (TAATGA)
    • Upper case, bold underlined=PolyA
    • Lower case, underlined=3′ WT ITR









Sequence


(SEQ ID NO: 59)


tccctctctgcgcgctcgctcgctcactgaggccgggcgaccaaaggtcg





cccgacgcccgggctttgcccgggggcctcagtgagcgagcgagcgcgca





gctgctgCTAGAGGTACTCGAGGTGAGCCCCACGTTCTGCTTCACTCTCC





CCATCTCCCCCCCCTCCCCACCCCCAATTTTGTATTTATTTATTTTTTAA





TTATTTTGTGCAGCGATGGGGGCGGGGGGGGGGGGGGGGCGCGCGCCAGG





CGGGGCGGGGCGGGGCGAGGGGCGGGGCGGGGCGAGGCGGAGAGGTGCGG





CGGCAGCCAATCAGAGCGGCGCGCTCCGAAAGTTTCCTTTTATGGCGAGG





CGGCGGCGGCGGCGGCCCTATAAAAAGCGAAGCGCGCGGCGGGCGCAACC






CAGCATGTCTGCTGCCACACACAGCCCTATGATGCAGGTCGCCTCTGGCA






ACGGCGACAGAGATCCTTTGCCTCCTGGCTGGGAGATCAAGATCGATCCT





CAGACCGGCTGGCCCTTCTTCGTGGACCACAATAGCAGAACCACCACCTG





GAACGACCCCAGAGTGCCTTCTGAGGGCCCCAAAGAGACACCCAGCTCTG





CCAATGGACCCAGCAGAGAGGGAAGCAGACTGCCACCAGCTAGAGAAGGA





CACCCCGTGTATCCACAGCTGAGGCCTGGCTACATCCCCATTCCAGTGCT





GCATGAGGGCGCCGAAAACAGACAGGTGCACCCCTTTCACGTGTACCCTC





AGCCTGGCATGCAGCGGTTTAGAACAGAAGCCGCTGCTGCCGCTCCTCAG





AGATCTCAGTCTCCTCTGAGAGGCATGCCCGAGACAACCCAGCCTGATAA





GCAGTGTGGACAGGTGGCAGCTGCAGCAGCAGCTCAACCTCCTGCTTCTC





ACGGCCCCGAAAGAAGCCAATCTCCTGCCGCCTCTGATTGCAGCTCCAGC





TCTAGCTCTGCCTCTCTGCCTAGCAGCGGCAGATCTAGCCTGGGCTCTCA





TCAACTGCCCAGAGGCTACATCAGCATCCCTGTGATCCACGAGCAGAACG





TGACCAGACCTGCTGCTCAGCCCAGCTTCCATCAGGCCCAGAAAACACAC





TACCCCGCTCAGCAGGGCGAGTACCAGACACACCAGCCTGTGTACCACAA





GATCCAGGGCGACGACTGGGAGCCCAGACCTCTTAGAGCCGCTAGTCCCT





TCAGATCCTCTGTGCAGGGCGCCAGTTCTAGAGAGGGCTCTCCTGCCAGA





AGCAGCACACCTCTGCACAGCCCATCTCCAATCAGAGTGCACACCGTGGT





GGACAGACCCCAGCAGCCTATGACACACAGAGAGACAGCCCCTGTCAGCC





AGCCTGAGAACAAGCCTGAGTCCAAGCCAGGACCTGTGGGACCTGAACTG





CCTCCAGGACACATCCCTATCCAAGTGATCCGGAAAGAGGTGGACAGCAA





GCCCGTGTCTCAGAAGCCTCCTCCACCTAGCGAGAAAGTGGAAGTGAAAG





TGCCTCCTGCTCCTGTGCCTTGTCCTCCTCCATCTCCTGGACCATCTGCC





GTGCCTAGCTCTCCTAAAAGCGTGGCCACCGAGGAAAGAGCCGCTCCTTC





TACAGCTCCTGCCGAGGCCACACCTCCTAAACCTGGCGAAGCTGAAGCCC





CTCCAAAACACCCTGGCGTGCTGAAGGTGGAAGCCATCCTGGAAAAGGTG





CAGGGACTCGAGCAGGCCGTGGACAACTTCGAGGGCAAGAAAACCGACAA





GAAATACCTGATGATCGAGGAATACCTGACCAAAGAGCTGCTGGCCCTGG





ACAGCGTTGACCCTGAAGGCAGAGCAGATGTGCGGCAGGCTAGAAGAGAT





GGCGTGCGGAAAGTGCAGACCATCCTCGAGAAGCTGGAACAGAAAGCCAT





CGACGTGCCAGGCCAGGTGCAGGTTTACGAGCTGCAGCCCTCTAACCTGG





AAGCCGATCAGCCTCTGCAGGCCATCATGGAAATGGGAGCCGTGGCCGCC





GACAAGGGAAAGAAGAATGCTGGCAACGCCGAGGATCCCCACACCGAAAC





ACAGCAGCCTGAAGCTACAGCCGCCGCTACCAGCAATCCCAGCAGCATGA





CAGACACCCCTGGCAATCCAGCCGCTCCATAATGAGCGGCCGCCGGCCGA






ATAAAAGATCCTTATTTTCATTGGATCTGTGTGTTGGTTTTTTGTGTG

GT









CGACTCTAG
aggaacccctagtgatggagttggccactccctctctgcgc






gctcgctcgctcactgaggccgggcgaccaaaggtcgcccgacgcccggg





ctttgcccgggcggcctcagtgagcgagcgagcgcgcagagagggagtgg





ccaa







4. scAAV with muscle creatine kinase (MCK) promoter and AGCGCCACC Kozak sequence
    • Lower case=5′ mutant ITR
    • Upper case, bold italics=spacer sequences
    • Underlined, uppercase=CBA promoter
    • Upper case, bold=Kozak sequence
    • Upper case=BAG3 cDNA ending with stop sequence (TAATGA)
    • Upper case, bold underlined=PolyA
    • Lower case, underlined=3′ WT ITR









Sequence


(SEQ ID NO: 60)


tccctctctgcgcgctcgctcgctcactgaggccgggcgaccaaaggtcg





cccgacgcccgggctttgcccgggcggcctcagtgagcgagcgagcgcgc





agctgctgCTAGAGGTACCAAGGCTGTGGGGGACTGAGGGCAGGCTGTAA





CAGGCTTGGGGGCCAGGGCTTATACGTGCCTGGGACTCCCAAAGTATTAC





TGTTCCATGTTCCCGGCGAAGGGCCAGCTGTCCCCCGCCAGCTAGACTCA





GCACTTAGTTTAGGAACCAGTGAGCAAGTCAGCCCTTGGGGCAGCCCATA





CAAGGCCATGGGGCTGGGCAAGCTGCACGCCTGGGTCCGGGGTGGGCACG





GTGCCCGGGCAACGAGCTGAAAGCTCATCTGCTCTCAGGGGCCCCTCCCT





GGGGACAGCCCCTCCTGGCTAGTCACACCCTGTAGGCTCCTCTATATAAC





CCAGGGGCACAGGGGCTGCCCTCAGCGCCACCATGTCTGCTGCCACACAC





AGCCCTATGATGCAGGTCGCCTCTGGCAACGGCGACAGAGATCCTTTGCC





TCCTGGCTGGGAGATCAAGATCGATCCTCAGACCGGCTGGCCCTTCTTCG





TGGACCACAATAGCAGAACCACCACCTGGAACGACCCCAGAGTGCCTTCT





GAGGGCCCCAAAGAGACACCCAGCTCTGCCAATGGACCCAGCAGAGAGGG





AAGCAGACTGCCACCAGCTAGAGAAGGACACCCCGTGTATCCACAGCTGA





GGCCTGGCTACATCCCCATTCCAGTGCTGCATGAGGGCGCCGAAAACAGA





CAGGTGCACCCCTTTCACGTGTACCCTCAGCCTGGCATGCAGCGGTTTAG





AACAGAAGCCGCTGCTGCCGCTCCTCAGAGATCTCAGTCTCCTCTGAGAG





GCATGCCCGAGACAACCCAGCCTGATAAGCAGTGTGGACAGGTGGCAGCT





GCAGCAGCAGCTCAACCTCCTGCTTCTCACGGCCCCGAAAGAAGCCAATC





TCCTGCCGCCTCTGATTGCAGCTCCAGCTCTAGCTCTGCCTCTCTGCCTA





GCAGCGGCAGATCTAGCCTGGGCTCTCATCAACTGCCCAGAGGCTACATC





AGCATCCCTGTGATCCACGAGCAGAACGTGACCAGACCTGCTGCTCAGCC





CAGCTTCCATCAGGCCCAGAAAACACACTACCCCGCTCAGCAGGGCGAGT





ACCAGACACACCAGCCTGTGTACCACAAGATCCAGGGCGACGACTGGGAG





CCCAGACCTCTTAGAGCCGCTAGTCCCTTCAGATCCTCTGTGCAGGGCGC





CAGTTCTAGAGAGGGCTCTCCTGCCAGAAGCAGCACACCTCTGCACAGCC





CATCTCCAATCAGAGTGCACACCGTGGTGGACAGACCCCAGCAGCCTATG





ACACACAGAGAGACAGCCCCTGTCAGCCAGCCTGAGAACAAGCCTGAGTC





CAAGCCAGGACCTGTGGGACCTGAACTGCCTCCAGGACACATCCCTATCC





AAGTGATCCGGAAAGAGGTGGACAGCAAGCCCGTGTCTCAGAAGCCTCCT





CCACCTAGCGAGAAAGTGGAAGTGAAAGTGCCTCCTGCTCCTGTGCCTTG





TCCTCCTCCATCTCCTGGACCATCTGCCGTGCCTAGCTCTCCTAAAAGCG





TGGCCACCGAGGAAAGAGCCGCTCCTTCTACAGCTCCTGCCGAGGCCACA





CCTCCTAAACCTGGCGAAGCTGAAGCCCCTCCAAAACACCCTGGCGTGCT





GAAGGTGGAAGCCATCCTGGAAAAGGTGCAGGGACTCGAGCAGGCCGTGG





ACAACTTCGAGGGCAAGAAAACCGACAAGAAATACCTGATGATCGAGGAA





TACCTGACCAAAGAGCTGCTGGCCCTGGACAGCGTTGACCCTGAAGGCAG





AGCAGATGTGCGGCAGGCTAGAAGAGATGGCGTGCGGAAAGTGCAGACCA





TCCTCGAGAAGCTGGAACAGAAAGCCATCGACGTGCCAGGCCAGGTGCAG





GTTTACGAGCTGCAGCCCTCTAACCTGGAAGCCGATCAGCCTCTGCAGGC





CATCATGGAAATGGGAGCCGTGGCCGCCGACAAGGGAAAGAAGAATGCTG





GCAACGCCGAGGATCCCCACACCGAAACACAGCAGCCTGAAGCTACAGCC





GCCGCTACCAGCAATCCCAGCAGCATGACAGACACCCCTGGCAATCCAGC





CGCTCCATAATGAGCGGCCGCCGGCCGAATAAAAGATCCTTATTTTCATT






GGATCTGTGTGTTGGTTTTTTGTGT

GGTCGACTCTAG
aggaacccctagt






gatggagttggccactccctctctgcgcgctcgctcgctcactgaggccg





ggcgaccaaaggtcgcccgacgcccgggctttgcccgggcggcctcagtg





agcgagcgagcgcgcagagagggagtggccaa







5. scAAV with muscle creatine kinase (MCK) promoter and in silico derived Kozak sequence
    • Lower case=5′ mutant ITR
    • Upper case, bold italics=spacer sequences
    • Underlined, uppercase=CBA promoter
    • Upper case, bold=in silico derived Kozak sequence
    • Upper case=BAG3 cDNA ending with stop sequence (TAATGA)
    • Upper case, bold underlined=PolyA
    • Lower case, underlined=3′ WT ITR









Sequence


(SEQ ID NO: 61)


tccctctctgcgcgctcgctcgctcactgaggccgggcgaccaaaggtcg





cccgacgcccgggctttgcccgggggcctcagtgagcgagcgagcgcgca





gctgctgCTAGAGGTACCAAGGCTGTGGGGGACTGAGGGCAGGCTGTAAC





AGGCTTGGGGGCCAGGGCTTATACGTGCCTGGGACTCCCAAAGTATTACT





GTTCCATGTTCCCGGCGAAGGGCCAGCTGTCCCCCGCCAGCTAGACTCAG





CACTTAGTTTAGGAACCAGTGAGCAAGTCAGCCCTTGGGGCAGCCCATAC





AAGGCCATGGGGCTGGGCAAGCTGCACGCCTGGGTCCGGGGTGGGCACGG





TGCCCGGGCAACGAGCTGAAAGCTCATCTGCTCTCAGGGGCCCCTCCCTG





GGGACAGCCCCTCCTGGCTAGTCACACCCTGTAGGCTCCTCTATATAACC





CAGGGGCACAGGGGCTGCCCTCAGCCCCAACATGTCTGCTGCCACACACA





GCCCTATGATGCAGGTCGCCTCTGGCAACGGCGACAGAGATCCTTTGCCT





CCTGGCTGGGAGATCAAGATCGATCCTCAGACCGGCTGGCCCTTCTTCGT





GGACCACAATAGCAGAACCACCACCTGGAACGACCCCAGAGTGCCTTCTG





AGGGCCCCAAAGAGACACCCAGCTCTGCCAATGGACCCAGCAGAGAGGGA





AGCAGACTGCCACCAGCTAGAGAAGGACACCCCGTGTATCCACAGCTGAG





GCCTGGCTACATCCCCATTCCAGTGCTGCATGAGGGCGCCGAAAACAGAC





AGGTGCACCCCTTTCACGTGTACCCTCAGCCTGGCATGCAGCGGTTTAGA





ACAGAAGCCGCTGCTGCCGCTCCTCAGAGATCTCAGTCTCCTCTGAGAGG





CATGCCCGAGACAACCCAGCCTGATAAGCAGTGTGGACAGGTGGCAGCTG





CAGCAGCAGCTCAACCTCCTGCTTCTCACGGCCCCGAAAGAAGCCAATCT





CCTGCCGCCTCTGATTGCAGCTCCAGCTCTAGCTCTGCCTCTCTGCCTAG





CAGCGGCAGATCTAGCCTGGGCTCTCATCAACTGCCCAGAGGCTACATCA





GCATCCCTGTGATCCACGAGCAGAACGTGACCAGACCTGCTGCTCAGCCC





AGCTTCCATCAGGCCCAGAAAACACACTACCCCGCTCAGCAGGGCGAGTA





CCAGACACACCAGCCTGTGTACCACAAGATCCAGGGCGACGACTGGGAGC





CCAGACCTCTTAGAGCCGCTAGTCCCTTCAGATCCTCTGTGCAGGGCGCC





AGTTCTAGAGAGGGCTCTCCTGCCAGAAGCAGCACACCTCTGCACAGCCC





ATCTCCAATCAGAGTGCACACCGTGGTGGACAGACCCCAGCAGCCTATGA





CACACAGAGAGACAGCCCCTGTCAGCCAGCCTGAGAACAAGCCTGAGTCC





AAGCCAGGACCTGTGGGACCTGAACTGCCTCCAGGACACATCCCTATCCA





AGTGATCCGGAAAGAGGTGGACAGCAAGCCCGTGTCTCAGAAGCCTCCTC





CACCTAGCGAGAAAGTGGAAGTGAAAGTGCCTCCTGCTCCTGTGCCTTGT





CCTCCTCCATCTCCTGGACCATCTGCCGTGCCTAGCTCTCCTAAAAGCGT





GGCCACCGAGGAAAGAGCCGCTCCTTCTACAGCTCCTGCCGAGGCCACAC





CTCCTAAACCTGGCGAAGCTGAAGCCCCTCCAAAACACCCTGGCGTGCTG





AAGGTGGAAGCCATCCTGGAAAAGGTGCAGGGACTCGAGCAGGCCGTGGA





CAACTTCGAGGGCAAGAAAACCGACAAGAAATACCTGATGATCGAGGAAT





ACCTGACCAAAGAGCTGCTGGCCCTGGACAGCGTTGACCCTGAAGGCAGA





GCAGATGTGCGGCAGGCTAGAAGAGATGGCGTGCGGAAAGTGCAGACCAT





CCTCGAGAAGCTGGAACAGAAAGCCATCGACGTGCCAGGCCAGGTGCAGG





TTTACGAGCTGCAGCCCTCTAACCTGGAAGCCGATCAGCCTCTGCAGGCC





ATCATGGAAATGGGAGCCGTGGCCGCCGACAAGGGAAAGAAGAATGCTGG





CAACGCCGAGGATCCCCACACCGAAACACAGCAGCCTGAAGCTACAGCCG





CCGCTACCAGCAATCCCAGCAGCATGACAGACACCCCTGGCAATCCAGCC





GCTCCATAATGAGCGGCCGCCGGCCGAATAAAAGATCCTTATTTTCATTG






GATCTGTGTGTTGGTTTTTTGTGTGG

TCGACTCTAG
aggaacccctagtg






atggagttggccactccctctctgcgcgctcgctcgctcactgaggccgg





gcgaccaaaggtcgcccgacgcccgggctttgcccgggcggcctcagtga





gcgagcgagcgcgcagagagggagtggccaa







6. scAAV with muscle creatine kinase (MCK) promoter and CAACCCAGC Kozak sequence
    • Lower case=5′ mutant ITR
    • Upper case, bold italics=spacer sequences
    • Underlined, uppercase=CBA promoter
    • Upper case, bold=Kozak sequence
    • Upper case=BAG3 cDNA ending with stop sequence (TAATGA)
    • Upper case, bold underlined=PolyA
    • Lower case, underlined=3′ WT ITR









Sequence


(SEQ ID NO: 62)


tccctctctgcgcgctcgctcgctcactgaggccgggcgaccaaaggtcg





cccgacgcccgggctttgcccgggcggcctcagtgagcgagcgagcgcgc





agctgctgCTAGAGGTACCAAGGCTGTGGGGGACTGAGGGCAGGCTGTAA





CAGGCTTGGGGGCCAGGGCTTATACGTGCCTGGGACTCCCAAAGTATTAC





TGTTCCATGTTCCCGGCGAAGGGCCAGCTGTCCCCCGCCAGCTAGACTCA





GCACTTAGTTTAGGAACCAGTGAGCAAGTCAGCCCTTGGGGCAGCCCATA





CAAGGCCATGGGGCTGGGCAAGCTGCACGCCTGGGTCCGGGGTGGGCACG





GTGCCCGGGCAACGAGCTGAAAGCTCATCTGCTCTCAGGGGCCCCTCCCT





GGGGACAGCCCCTCCTGGCTAGTCACACCCTGTAGGCTCCTCTATATAAC





CCAGGGGCACAGGGGCTGCCCTCCAACCCAGCATGTCTGCTGCCACACAC





AGCCCTATGATGCAGGTCGCCTCTGGCAACGGCGACAGAGATCCTTTGCC





TCCTGGCTGGGAGATCAAGATCGATCCTCAGACCGGCTGGCCCTTCTTCG





TGGACCACAATAGCAGAACCACCACCTGGAACGACCCCAGAGTGCCTTCT





GAGGGCCCCAAAGAGACACCCAGCTCTGCCAATGGACCCAGCAGAGAGGG





AAGCAGACTGCCACCAGCTAGAGAAGGACACCCCGTGTATCCACAGCTGA





GGCCTGGCTACATCCCCATTCCAGTGCTGCATGAGGGCGCCGAAAACAGA





CAGGTGCACCCCTTTCACGTGTACCCTCAGCCTGGCATGCAGCGGTTTAG





AACAGAAGCCGCTGCTGCCGCTCCTCAGAGATCTCAGTCTCCTCTGAGAG





GCATGCCCGAGACAACCCAGCCTGATAAGCAGTGTGGACAGGTGGCAGCT





GCAGCAGCAGCTCAACCTCCTGCTTCTCACGGCCCCGAAAGAAGCCAATC





TCCTGCCGCCTCTGATTGCAGCTCCAGCTCTAGCTCTGCCTCTCTGCCTA





GCAGCGGCAGATCTAGCCTGGGCTCTCATCAACTGCCCAGAGGCTACATC





AGCATCCCTGTGATCCACGAGCAGAACGTGACCAGACCTGCTGCTCAGCC





CAGCTTCCATCAGGCCCAGAAAACACACTACCCCGCTCAGCAGGGCGAGT





ACCAGACACACCAGCCTGTGTACCACAAGATCCAGGGCGACGACTGGGAG





CCCAGACCTCTTAGAGCCGCTAGTCCCTTCAGATCCTCTGTGCAGGGCGC





CAGTTCTAGAGAGGGCTCTCCTGCCAGAAGCAGCACACCTCTGCACAGCC





CATCTCCAATCAGAGTGCACACCGTGGTGGACAGACCCCAGCAGCCTATG





ACACACAGAGAGACAGCCCCTGTCAGCCAGCCTGAGAACAAGCCTGAGTC





CAAGCCAGGACCTGTGGGACCTGAACTGCCTCCAGGACACATCCCTATCC





AAGTGATCCGGAAAGAGGTGGACAGCAAGCCCGTGTCTCAGAAGCCTCCT





CCACCTAGCGAGAAAGTGGAAGTGAAAGTGCCTCCTGCTCCTGTGCCTTG





TCCTCCTCCATCTCCTGGACCATCTGCCGTGCCTAGCTCTCCTAAAAGCG





TGGCCACCGAGGAAAGAGCCGCTCCTTCTACAGCTCCTGCCGAGGCCACA





CCTCCTAAACCTGGCGAAGCTGAAGCCCCTCCAAAACACCCTGGCGTGCT





GAAGGTGGAAGCCATCCTGGAAAAGGTGCAGGGACTCGAGCAGGCCGTGG





ACAACTTCGAGGGCAAGAAAACCGACAAGAAATACCTGATGATCGAGGAA





TACCTGACCAAAGAGCTGCTGGCCCTGGACAGCGTTGACCCTGAAGGCAG





AGCAGATGTGCGGCAGGCTAGAAGAGATGGCGTGCGGAAAGTGCAGACCA





TCCTCGAGAAGCTGGAACAGAAAGCCATCGACGTGCCAGGCCAGGTGCAG





GTTTACGAGCTGCAGCCCTCTAACCTGGAAGCCGATCAGCCTCTGCAGGC





CATCATGGAAATGGGAGCCGTGGCCGCCGACAAGGGAAAGAAGAATGCTG





GCAACGCCGAGGATCCCCACACCGAAACACAGCAGCCTGAAGCTACAGCC





GCCGCTACCAGCAATCCCAGCAGCATGACAGACACCCCTGGCAATCCAGC





CGCTCCATAATGAGCGGCCGCCGGCCGAATAAAAGATCCTTATTTTCATT






GGATCTGTGTGTTGGTTTTTTGTGTG

GTCGACTCTAG
aggaacccctagt






gatggagttggccactccctctctgcgcgctcgctcgctcactgaggccg





ggcgaccaaaggtcgcccgacgcccgggctttgcccgggcggcctcagtg





agcgagcgagcgcgcagagagggagtggccaa






Additional constructs comprising single stranded and scAAV genomes were designed to include the following alternative promoters:









TNNC1


(SEQ ID NO: 77)


GATCACTGGGACCAGAGGAGGGGCTGGAGGATACTACACGCAGGGGTGGG





CTGGGCTGGGCTGGGCTGGGCCAGGAATGCAGCGGGGCAGGGCTATTTAA





GTCAAGGGCCGGCTGGCAACCCCAGCAAGCTGTCCTGTGAG





MHC


(SEQ ID NO: 78)


CAAGGCTGTGGGGGACTGAGGGCAGGCTGTAACAGGCTTGGGGGCCAGGG





CTTATACGTGCCTGGGACTCCCAAAGTATTACTGTTCCATGTTCCCGGCG





AAGGGCCAGCTGTCCCCCGCCAGCTAGACTCAGCACTTAGTTTAGGAACC





AGTGAGCAAGTCAGCCCTTGGGGCAGCCCATACAAGGCCATGGGGCTGGG





CAAGCTGCACGCCTGGGTCCGGGGTGGGCACGGTGCCCGGGCAACGAGCT





GAAAGCTCATCTGCTCTCAGGGGCCCCTCCCTGGGGACAGCCCCTCCTGG





CTAGTCACACCCTGTAGGCTCCTCTATATAACCCAGGGGCACAGGGGCTG





CCCTC






Aav Production Method

Recombinant AAV (rAAV) particles comprising each of the constructs are made by suspension transfection of Expi293F cells with the BAG3 constructs and other plasmids needed for rAAV production (e.g., comprising rep, cap, and Helper expression cassettes) to generate three groups of rAAV comprising (1) AAV9 capsid proteins (2) rh74 capsid proteins; and (3) rh74 variant capsid proteins comprising a tryptophan to arginine mutation at amino acid 505 of the rh74 VP1 capsid protein. Vector is isolated via affinity chromatography followed by an anion exchange column and purified using a cesium chloride gradient to a titer of 2-5E+13 vg/ml.


Example 1. In Vitro Expression Study

The three groups of rAAV comprising the BAG3 constructs are made as described following the AAV production method above and delivered to HEK293, C2C12 or cardiomyocytes derived from human induced pluripotent stem cells. Whole cell lysates are generated and probed for expression of BAG3 by ELISA and/or immunoblotting.


Example 2. In Vivo Expression Study

The three groups of rAAV comprising the BAG3 constructs are made as described above and administered via the jugular vein to 5-7 weeks old C57BL/6 mice (n=6-10/group) at three different doses: 1E+13 vg/kg, 5E+13 vg/kg or 1+E14 vg/kg. One month after rAAV dosing, heart, diaphragm and skeletal muscle tissues are harvested and whole cell lysates are analyzed for BAG3 expression using ELISA and/or immunoblot.


In a separate experiment, the three groups of rAAV comprising the BAG3 constructs is made as described above and administered via the jugular vein to 5-7 weeks old C57BL/6 mice (n=6-10/group) at three different doses: 1E+13 vg/kg, 5E+13 vg/kg or 1+E14 vg/kg. One month after rAAV dosing, heart, diaphragm and skeletal muscle tissues are harvested and whole cell lysates are analyzed for BAG3 expression using ELISA and/or immunoblot.


Example 3. Restoration of Bag3 Expression In Vivo

Bag3-floxed (Bag-3fl/fl) mice (C57BL/6N-Bag3tm1c(EUCOMM)Hmgu/H: MGI:5760384) are utilized to create tissue-specific BAG3 null animals. To generate BAG3 cardiomyocyte-specific knockout (CKO) mice, Bag3fl/fl mice are crossed with α-myosin heavy chain-transgenic (αMHC-Cre) mice. CKO mice are viable at birth; however, they exhibit premature death consequent to age-dependent dilated cardiomyopathy and heart failure (Fang et al., 2017; Myers et al.,2018).


The BAG3E455K knock-in mouse model (Bag3tm1.1Chen; MGI: 6107852) carries the orthologous mouse mutation for the human E455K (Fang et al, 2017). BAG3 levels in the hearts of BAG3 homozygous E455K mice are comparable to BAG3 protein levels in wild type controls suggesting that the E455K mutation does not impair the expression or stability of BAG3. Homozygous global BAG3 E455K mutants exhibit impaired postnatal growth and premature lethality at 4 weeks of age. Cardiac-specific BAG3 E455K-knockin mice display marked cardiac enlargement and diminished systolic function.


rAAV comprising the BAG3 constructs are made as described above and delivered via a single IV injection to presymptomatic and/or symptomatic BAG3 mutant mice using different doses. Endpoints include survival as well as cardiac function monitored by echocardiography. Upon necropsy, heart tissues are collected and whole tissue lysates are analyzed for AAV biodistribution by ddPCR and for human BAG3 expression by ELISA and/or immunoblot. In addition, tissue sections are analyzed for histopathology. Therapeutic effects of the rAAV are assessed via the measured endpoints and/or histopathology assessments.


Example 4. Transfection of C2C12 Cells Using Selected Constructs

AAVN production was carried out via triple plasmid transfection of adherent HEK293 cells to generate the following:

    • 1. AAV9/CMV-GFP
    • 2. AAV9/Des-BAG3 intron 1
    • 3. AAV9/MHCK7-BAG3 intron 1
    • 4. rh74/Des-BAG3 intron 1
    • 5. rh74/MHCK7-BAG3 intron 1
    • 6. CMV-BAG3 (Negative control with only one plasmid)


Resulting HEK293 cells were harvested, lysed using freeze-thaw cycles, and crude lysate mixture was stored for infection and analysis of BAG3 expression.


C2C12 cells were then plated and differentiated using equine serum and AdMyoD. Differentiated cells were infected using crude lysates as follows (hereinafter selected constructs 1-6):

    • 1. AAV9/CMV-GFP
    • 2. AAV9/Des-BAG3 intron 1
    • 3. AAV9/MHCK7-BAG3 intron 1
    • 4. rh74/Des-BAG3 intron 1
    • 5. rh74/MHCK7-BAG3 intron 1
    • 6. CMV-BAG3 (Negative control with only one plasmid)


Cells were harvested and supplemented with protease and phosphatase inhibitor, then lysed and homogenized using a needle and syringe approach. Resulting transgene expression was then assessed using, for example, western blot analysis. FIG. 4 depicts the presence of human BAG3 transduction with negative controls (selected constructs 1 and 6 (AAV9-CMV-GFP and CMV-GFP plasmid)) compared to selected constructs 2-5 and positive control (recombinant human BAG3 protein). As can be appreciated, signal was observed for BAG3 selected constructs 2-5 and for positive control. It was observed that transgene expression from rh74-MHCK7 (selected construct 5) was higher compared to other vector-promoter combos.


Example 5. Retro Orbital Dose Study in Adult Wt Mice

WT Adult Mice were dosed with the following vectorized constructs: Construct 4 (pTR2-MHCK7-BAG3-dual)-SEQ ID Nos: 79-89 (AAVrh74), Construct 5 (pTR2-Des-BAG3int1)—SEQ ID Nos: 91-101 (AAVrh74), and Construct 6 (B827-pTR-CBA-Bag3-dual)—SEQ ID Nos: 103-111 (AAV9). Administration route was a single retro-orbital (RO) dose in WT C57BI/6J mice. Mice were approximately 5 weeks old at time of dosing, and doses were administered according to table 5, below. The total duration prior to necropsy and tissue harvesting was 28 days.
















TABLE 5











Dosing



Group
No. of

Dose
Conc.
Dose
Regimen
Terminal


No.
Animals/Sex
Test Material
(vg/kg)a
(vg/mL)
Volume
ROA
Time Point







1
3M, 3F
Vehicle
0
0
5 mL/kg
Intravenous
Day 28


2
4M
AAV9-pTR2-CBA-
1.0E+13
1.55E12

via the retro-



3
4M
BAG3-dual
5.0E+13
1.55E12

orbital sinus



4
4M, 4F
AAVrh.74-DES-
1.0E+13
2.03E12





5
4M, 4F
BAG3int1
5.0E+13
9.93E12





6
4M, 4F
AAVrh.74-MHCK7-
1.0E+13
1.99E12





7
4M. 4F
BAG3int1
5.0E+13
9.96E12





ROA = Route of administration;


Conc. = concentration;


M = Male;


F = Female;


TBD = to be determined.



aEstimated dose based on body weight as measured prior to dose on Day 1.








FIGS. 5A and 5B illustrate male and female body weights respectively over the study period. Body weight was assessed weekly, and no test article related changes in body weights were observed. All animals in groups 1-7 survived to scheduled Necroscopy, and no gross observations or clinical observations were observed throughout the study. FIGS. 6-8 illustrate tissue specific expression of BAG3 in specific tissues, including the liver (FIG. 6), heart (FIG. 7), and Gastrocnemius muscle (FIG. 8). Results illustrated that Desmin and MHCK7 promoters provided cardiac specific expression compared to CBA. Further, rh74-MHCK7 constructs show significantly higher cardiac transgene expression compared to AAV9, while AAV9-CBA shows significantly higher transgene expression in the liver compared to either rh74-MHCK7 or rh74-DES driven hBAG3.


Vector copy number analysis was conducted via ddPCR on heart and liver tissue samples as shown in FIGS. 9A and 9B. The data show dose response and overall biodistribution of rh74 constructs compared to AAV9. FIG. 9A shows rh74-Des provided significantly greater heart transduction as compared to AAV9. FIG. 9B shows rh74 constructs were shown to transduce in the liver, but transgene expression was significantly reduced compared to AAV9 delivered constructs in this tissue (See FIG. 6). Results from the study indicated the following: increased BAG3 expression in rh74-MHCK7 groups compared to vehicle, hBAG3 expression in off-target tissues was quantified to be lower than endogenous mouse BAG3, and rh74 delivered constructs had reduced expression in liver tissue compared to AAV9. Dose response behavior was observed in heart with either AAV9 or rh74 vectors, and Des based constructs showed higher copy number compared to Vehicle or MHCK7 construct.

Claims
  • 1. A nucleic acid comprising an expression construct comprising a human BAG3 coding sequence operably linked to a promoter, wherein the expression construct is flanked on each side by an inverted terminal repeat sequence.
  • 2. The nucleic acid of claim 1, wherein the human BAG3 coding sequence is codon-optimized for expression in human cells.
  • 3. The nucleic acid of claim 1, wherein the human BAG3 coding sequence has at least about 85% sequence identity to the sequence of SEQ ID NO: 21, or SEQ ID NO: 101, or SEQ ID Nos: 83-85 in order.
  • 4. The nucleic acid claim 1, wherein the promoter comprises a cardiac specific promotor and/or wherein the nucleic acid further encodes an enhancer element.
  • 5. The nucleic acid of any one of claims 1 to 3, wherein the promoter is selected from the group consisting of: CBA, CK8, MHCK7, Desmin (optionally mDES), CMV, mini-CMV, HSV, TK, RSV, SV40, MMTV, Ad E1A and combinations thereof.
  • 6. The nucleic acid of claim 4, wherein the CBA promoter sequence has at least about 85% sequence identity to the sequence of SEQ ID NO: 5 and wherein the construct further comprises an enhancer element, optionally a CMV enhancer element.
  • 7. The nucleic acid of claim 1, wherein the expression construct has at least about 85% sequence identity to the sequence of SEQ ID NO: 1-20, 30-41, 42-53, 79-89, 91-101, or 103-111 arranged in sequence.
  • 8. The nucleic acid of claim 7, wherein the expression construct comprises the sequence of SEQ ID NO: 1-20, 30-41, 42-53, 79-89, 91-101, or 103-111 arranged in sequence.
  • 9. The nucleic acid of a claim 1, wherein the nucleic acid is a recombinant adeno-associated virus (rAAV) vector.
  • 10. The nucleic acid of claim 9, wherein the nucleic acid is a single-stranded or self-complementary rAAV nucleic acid vector.
  • 11. A recombinant adeno-associated virus (rAAV) particle comprising the nucleic acid of one of claims 1 to 10.
  • 12. The rAAV particle of claim 11, wherein the rAAV particle is an AAV9 particle.
  • 13. The rAAV particle of claim 11, wherein the rAAV particle is an rh74 particle.
  • 14. The rAAV particle of claim 11, wherein the rAAV particle is an rh10 particle.
  • 15. A composition comprising a plurality of the rAAV particle of any one of claim 12, 13, or 14.
  • 16. The composition of claim 15, further comprising a pharmaceutically acceptable carrier.
  • 17. A method of treating dilated cardiomyopathy, the method comprising: administering a therapeutically effective amount of rAAV comprising a nucleic acid expression construct comprising a human BAG3 coding sequence operably linked to a promoter, wherein the expression construct is flanked on each side by an inverted terminal repeat sequence, and wherein said administration results in expression of a therapeutically effective amount of human BAG3, thereby treating the dilated cardiomyopathy.
  • 18. The method of claim 17, wherein the rAAV is administered via intravenous injection, wherein the BAG3 coding sequence has at least about 85% sequence identity to SEQ ID NO: 21, or SEQ ID NO: 101, or SEQ ID Nos: 83-85 in order and wherein, optionally, the nucleic acid expression construct optionally comprises a viral enhancer element.
  • 19. The method of claim 17, wherein between about 0.5 and about 5 rAAV vector genomes per cell are administered.
  • 20. The method of claim 19, wherein between about 0.5 and about 2 rAAV vector genomes per cell are administered.
  • 21. A method of inducing increased expression of human BAG3 in a target cell, comprising: contacting a target cell with a plurality of rAAV particles comprising a nucleic acid expression construct comprising a human BAG3 coding sequence operably linked to a promoter, wherein the expression construct is flanked on each side by an inverted terminal repeat sequence, andwherein said contacting results in the target cell increasing expression of human BAG3 as compared to prior to the contacting, thereby increasing the expression of human BAG3
  • 22. The method of claim 21, wherein the contacting is in vivo, wherein the BAG3 coding sequence has at least about 85% sequence identity to SEQ ID NO: 21, or SEQ ID NO: 101, or SEQ ID Nos: 83-85 in order and wherein, optionally, the nucleic acid expression construct optionally comprises a viral enhancer element.
  • 23. The method of claim 21 or 22, for the treatment of dilated cardiomyopathy.
  • 24. Use of the nucleic acid of any one of claims 1 to 9, the rAAV particle of any one of claims 11 to 14, or the composition of claim 15 or 16 in the manufacture of a medicament for the treatment of dilated cardiomyopathy.
  • 25. Use of the nucleic acid of any one of claims 1 to 9, the rAAV particle of any one of claims 11 to 14, or the composition of claim 15 or 16 for the treatment of dilated cardiomyopathy.
  • 26. A nucleic acid comprising an expression construct comprising a human BAG3 coding sequence and an enhancer element operably linked to a promoter, wherein the expression construct is flanked on each side by an inverted terminal repeat sequence, wherein the expression construct has at least about 85% sequence identity to the sequence of SEQ ID NO: 1-20 arranged in sequence.
  • 27. A nucleic acid comprising an expression construct comprising a human BAG3 coding sequence and an enhancer element operably linked to a promoter, wherein the expression construct is flanked on each side by an inverted terminal repeat sequence, wherein the expression construct has at least about 85% sequence identity to the sequence of SEQ ID NO: 30-41 arranged in sequence.
  • 28. A nucleic acid comprising an expression construct comprising a human BAG3 coding sequence and an enhancer element operably linked to a promoter, wherein the expression construct is flanked on each side by an inverted terminal repeat sequence, wherein the expression construct has at least about 85% sequence identity to the sequence of SEQ ID NO: 42-53 arranged in sequence.
  • 29. A nucleic acid comprising an expression construct comprising a human BAG3 coding sequence and an enhancer element operably linked to a promoter, wherein the expression construct is flanked on each side by an inverted terminal repeat sequence, wherein the expression construct has at least about 85% sequence identity to the sequence of SEQ ID NO: 79-89 arranged in sequence.
  • 30. A nucleic acid comprising an expression construct comprising a human BAG3 coding sequence and an enhancer element operably linked to a promoter, wherein the expression construct is flanked on each side by an inverted terminal repeat sequence, wherein the expression construct has at least about 85% sequence identity to the sequence of SEQ ID NO: 91-101 arranged in sequence.
  • 31. A nucleic acid comprising an expression construct comprising a human BAG3 coding sequence and an enhancer element operably linked to a promoter, wherein the expression construct is flanked on each side by an inverted terminal repeat sequence, wherein the expression construct has at least about 85% sequence identity to the sequence of SEQ ID NO: 103-111 arranged in sequence.
  • 32. The nucleic acid claims 26 to 31, wherein the nucleic acid comprises, or further comprises, an enhancer element, optionally a viral enhancer element, optionally a CMV enhancer element.
  • 33. A nucleic acid comprising an expression construct comprising: a human BAG3 coding sequence;an enhancer element operably linked to a promoter; anda Kozak sequence, wherein the Kozak sequence enhances transgene expression in the heart, wherein the expression construct is flanked on each side by an inverted terminal repeat sequence, wherein the Kozak sequence is non-native with respect to the human BAG3 coding sequence, the cardiac enhancer element, and/or the promoter.
  • 34. The nucleic acid of claim 33, wherein the Kozak sequence is a synthetic sequence and has at least 85% sequence identity to the sequence of SEQ ID NO: 56.
  • 35. The nucleic acid of claim 33, wherein the human BAG3 coding sequence is codon-optimized for expression in human cells.
  • 36. The nucleic acid of claim 33, wherein the human BAG3 coding sequence has at least about 85% sequence identity to the sequence of SEQ ID NO: 21, or SEQ ID NO: 101, or SEQ ID Nos: 83-85 in order.
  • 37. The nucleic acid claim 33, wherein the promoter comprises a cardiac specific promotor.
  • 38. The nucleic acid claim 33, wherein the promoter is selected from the group consisting of: CBA, CK8, MHCK7, Desmin (optionally mDES), CMV, mini-CMV, HSV, TK, RSV, SV40, MMTV, Ad E1A and combinations thereof.
  • 39. The nucleic acid of claim 38, wherein the CBA promoter sequence has at least about 85% sequence identity to the sequence of SEQ ID NO: 5.
  • 40. The nucleic acid of claim 33, wherein the expression construct has at least about 85% sequence identity to the sequence of any of SEQ ID NO: 57-62
  • 41. The nucleic acid of claim 40, wherein the expression construct comprises the sequence of any of SEQ ID NO: 57-62.
  • 42. The nucleic acid claim 33, wherein the nucleic acid is a recombinant adeno-associated virus (rAAV) vector.
  • 43. The nucleic acid of claim 42, wherein the nucleic acid is a single-stranded or self-complementary rAAV nucleic acid vector.
  • 44. A recombinant adeno-associated virus (rAAV) particle comprising the nucleic acid of claim 39 or 40.
  • 45. The rAAV particle of claim 44, wherein the rAAV particle is an AAV9 particle.
  • 46. The rAAV particle of claim 44, wherein the rAAV particle is an rh74 particle.
  • 47. The rAAV particle of claim 44, wherein the rAAV particle is an rh10 particle.
  • 48. The rAAV particle of claim 44, wherein the rAAV particle is a Mut5 AAV particle.
  • 49. A composition comprising a plurality of the rAAV particle of any one of claim 44.
  • 50. The composition of claim 49, further comprising a pharmaceutically acceptable carrier.
CROSS-REFERENCE TO RELATED APPLICATIONS

This application claims the benefit of priority to U.S. Provisional Patent Application No. 63/288,520, filed Dec. 10, 2021, the entire contents of which are incorporated by reference herein.

PCT Information
Filing Document Filing Date Country Kind
PCT/US2022/081323 12/9/2022 WO