SINGLE CHAIN ANTIBODIES AND INTRABODIES TO MISFOLDED TDP-43 AND METHODS OF USE

INCORPORATION OF SEQUENCE LISTING

A computer readable form of the Sequence Listing “P61456PC00 Sequence ListingST25.txt” (154,654 bytes), created on Apr. 29, 2021, is herein incorporated by reference.

FIELD

The present disclosure relates to TDP-43 single chain antibodies and more specifically to intrabodies for targeting intracellular misfolded TDP-43.

BACKGROUND

Transactive response (TAR) element DNA binding protein of 43 kDa (TDP-43) is a 414 amino acid protein comprising an N-terminal ubiquitin-like domain (NTD, residues 1-80), two RNA recognition motifs (RRMs) composed of residues 106-177 (RRM1) and residues 192-259 (RRM2), and a C-terminal domain (CTD, residues 274-414). The NTD flanks a domain that directs nuclear localization (NLS motifs in residues 82-98, NLS1 K82RK84 and K95VKR98). RRM2 includes a nuclear export signal (NES) from residue 239 to 250.

TDP-43 is predominantly a nuclear protein that plays a central role in RNA metabolism. TDP-43 has become a focal point of research in the amyotrophic lateral sclerosis (ALS) and frontotemporal dementia (FTD) disease spectrum, since pathogenic inclusions within affected neurons can contain post-translationally modified TDP-43. The CTD of TDP-43 is particularly relevant to disease, as it is where nearly all familial ALS/FTD-associated mutations are found in TDP-43.

TDP-43 was found to be hyperphosphorylated, ubiquitinated, and fragmented in neuronal inclusions of patients with both sporadic and familial forms of ALS and FTD [4].

Functional TDP-43 can exist as nuclear oligomers that are distinct from cytoplasmic aggregates formed upon cellular stress. Functional TDP-43 oligomerization is required for its RNA-splicing function. NTD-driven TDP-43 oligomerization in the nucleus can inhibit cytoplasmic mislocalization and the formation of pathologic aggregates [9].

Physiological TDP-43 oligomerization is mediated by its N-terminal domain, which can adopt dynamic, solenoid-like structures, as revealed by a 2.1 Å crystal structure in combination with nuclear magnetic resonance spectroscopy and electron microscopy [9].

Aggregates (inclusion bodies) of TDP-43 have now been found in nearly all (approx. 97%) cases of ALS and roughly half (approx. 40%) of the cases of FTD. TDP-43 is one of the main components of the cytoplasmic inclusions found in the motor neurons and glial cells of ALS patients.

Precursors of TDP-43 inclusions may be present at a concentration far below that of functional TDP-43. The low concentration of misfolded TDP-43 makes this target elusive.

Intracerebral injection of brain-derived pathological FTLD-TDP seeds into transgenic mice expressing cytoplasmic human TDP-43, and into non-transgenic mice, has led to the induction of de novo TDP-43 pathology, which spreads through the brain in a time-dependent manner [10].

Antibodies that bind TDP-43 have been described.

WO2012174666 titled METHODS FOR THE PROGNOSTIC AND/OR DIAGNOSTIC OF NEURODEGENERATIVE DISEASE, METHODS TO IDENTIFY CANDIDATE COMPOUNDS AND COMPOUNDS FOR TREATING NEURODEGENERATIVE DISEASE discloses methods for diagnosing neurodegenerative diseases such as ALS and FTD through assessing the interaction between TDP-43 and NF-κB p65 using an anti-TDP-43 antibody.

WO2016086320 titled TDP-43-BINDING POLYPEPTIDES USEFUL FOR THE TREATMENT OF NEURODEGENERATIVE DISEASES discloses antibodies that bind to the RRM1 domain of TDP-43 to disrupt its interaction with NF-κB for the treatment of ALS and FTD.

Antibodies that can be expressed intracellularly and preferentially bind misfolded TDP-43 over natively folded TDP-43 are desirable.

SUMMARY

The inventors have identified single chain intrabodies that can target cytoplasmic misfolded TDP-43 and that can for example increase its degradation when expressed in cells comprising cytoplasmic misfolded TDP-43.

The single chain intrabodies are derived from antibodies that bind a conformational N-terminal epitope that is accessible in misfolded TDP-43 but is unavailable in natively folded non-disease associated TDP-43. Antibodies raised to an immunogen comprising the N-terminal TDP-43 sequence DAGWGNL (SEQ ID NO: 1) preferentially bound misfolded TDP-43 aggregates. Residue W68 was found to be an important residue in conferring antibody specificity for misfolded TDP-43 aggregates.

An aspect includes a single chain antibody that binds misfolded TDP-43 and comprises a heavy chain variable region comprising complementarity determining regions CDR-H1, CDR-H2 and CDR-H3 and a light chain variable region comprising complementarity determining regions CDR-L1, CDR-L2 and CDR-L3, wherein the heavy chain variable region and the light chain variable region are linked by a linker. The orientation of the heavy and light chain variable regions and linker can be heavy chain variable region—linker—light chain variable region or light chain variable region—linker—heavy chain variable region.

In an embodiment, the single chain antibody is an scFv, nanobody or minibody.

A further aspect comprises an immunoconjugate comprising an antibody described herein and a detectable label, such as a positron emitting radionuclide, a fusion tag, such as a FLAG tag or myc tag, or a targeting moiety, such as a lysosomal or autophagy targeting sequence.

A further aspect comprises an isolated nucleic acid encoding the single chain antibody described herein, as well as vectors comprising the nucleic acid, for example, for delivering and/or expressing the single chain antibody described herein.

A further aspect comprises a cell recombinantly expressing a single chain antibody described herein.

A further aspect includes a composition comprising the single chain antibody, immunoconjugate, isolated nucleic acid, vector or a cell described herein.

Further provided is a method of treating a subject with a TDP-43 proteinopathy, the method comprising administering to a subject in need thereof an effective amount of a nucleic acid encoding the single chain antibody or immunoconjugate described herein.

Other features and advantages of the present disclosure will become apparent from the following detailed description. It should be understood, however, that the detailed description and the specific examples while indicating preferred embodiments of the disclosure are given by way of illustration only, since various changes and modifications within the spirit and scope of the disclosure will become apparent to those skilled in the art from this detailed description.

BRIEF DESCRIPTION OF THE DRAWINGS

An embodiment of the present disclosure will now be described in relation to the drawings in which:

FIG. 1 shows immunocytochemistry of cells overexpressing dNLS-TDP43 and LYS-2F7. dNLS-TDP43 was detected using anti-HA, and LYS-2F7 was detected using anti-FLAG. The merge shows levels of co-localization in reference with the nucleus stained with DAPI;

FIG. 2 shows immunocytochemistry of cells overexpressing dNLS-TDP43 and LYS-1H3-1K3. dNLS-TDP43 was detected using anti-HA, and LYS-1H3-1K3 was detected using anti-FLAG. The merge shows levels of co-localization in reference with the nucleus stained with DAPI;

FIG. 3 shows immunocytochemistry of cells overexpressing dNLS-TDP43 and LYS-28H3-28K1. dNLS-TDP43 was detected using anti-HA, and LYS-28H3-28K1 was detected using anti-FLAG. The merge shows levels of co-localization in reference with the nucleus stained with DAPI;

FIG. 4 shows immunocytochemistry of cells overexpressing dNLS-TDP43 and LYS-14H1-14K2. dNLS-TDP43 was detected using anti-HA, and LYS-14H1-14K2 was detected using anti-FLAG. The merge shows levels of co-localization in reference with the nucleus stained with DAPI;

FIG. 5 shows immunocytochemistry of cells overexpressing dNLS-TDP43 and YPTL-2F7. dNLS-TDP43 was detected using anti-HA, and YPTL-2F7 was detected using anti-FLAG. The merge shows levels of co-localization in reference with the nucleus stained with DAPI;

FIG. 6 shows immunocytochemistry of cells overexpressing dNLS-TDP43 and YPTL-1H3-1K3. dNLS-TDP43 was detected using anti-HA, and YPTL-1H3-1K3 was detected using anti-FLAG. The merge shows levels of co-localization in reference with the nucleus stained with DAPI;

FIG. 7 shows immunocytochemistry of cells overexpressing dNLS-TDP43 and YPTL-28H3-28K1. dNLS-TDP43 was detected using anti-HA, and YPTL-28H3-28K1 was detected using anti-FLAG. The merge shows levels of co-localization in reference with the nucleus stained with DAPI;

FIG. 8 shows immunocytochemistry of cells overexpressing dNLS-TDP43 and YPTL-14H1-14K2. dNLS-TDP43 was detected using anti-HA, and YPTL-14H1-14K2 was detected using anti-FLAG. The merge shows levels of co-localization in reference with the nucleus stained with DAPI;

FIG. 9 shows immunocytochemistry of cells overexpressing dNLS-TDP43 and MYCL15H-2F7. dNLS-TDP43 was detected using anti-HA, and MYCL15H-2F7 was detected using anti-MYC. The merge shows levels of co-localization in reference with the nucleus stained with DAPI;

FIG. 10 shows immunocytochemistry of cells overexpressing dNLS-TDP43 and MYCH15L-2F7. dNLS-TDP43 was detected using anti-HA, and MYCH15L-2F7 was detected using anti-MYC. The merge shows levels of co-localization in reference with the nucleus stained with DAPI;

FIG. 11 shows immunocytochemistry of cells overexpressing dNLS-TDP43 and MYCH20L-2F7. dNLS-TDP43 was detected using anti-HA, and MYCH20L-2F7 was detected using anti-MYC. The merge shows levels of co-localization in reference with the nucleus stained with DAPI;

FIG. 12 shows immunocytochemistry of cells overexpressing dNLS-TDP43 and MYCL15H-1H3-1K3. dNLS-TDP43 was detected using anti-HA, and MYCL15H-1H3-1K3 was detected using anti-MYC. The merge shows levels of co-localization in reference with the nucleus stained with DAPI;

FIG. 13 shows immunocytochemistry of cells overexpressing dNLS-TDP43 and MYCL20H-1H3-1K3. dNLS-TDP43 was detected using anti-HA, and MYCL20H-1H3-1K3 was detected using anti-MYC. The merge shows levels of co-localization in reference with the nucleus stained with DAPI;

FIG. 14 shows immunocytochemistry of cells overexpressing dNLS-TDP43 and MYCH15L-28H3-28K1. dNLS-TDP43 was detected using anti-HA, and MYCH15L-28H3-28K1 was detected using anti-MYC. The merge shows levels of co-localization in reference with the nucleus stained with DAPI;

FIG. 15 shows immunocytochemistry of cells overexpressing dNLS-TDP43 and MYCL15H-14H1-14K2. dNLS-TDP43 was detected using anti-HA, and MYCL15H-14H1-14K2 was detected using anti-MYC. The merge shows levels of co-localization in reference with the nucleus stained with DAPI;

FIG. 16 shows immunocytochemistry of cells overexpressing dNLS-TDP43 and MYCH15L-14H1-14K2. dNLS-TDP43 was detected using anti-HA, and MYCH15L-14H1-14K2 was detected using anti-MYC. The merge shows levels of co-localization in reference with the nucleus stained with DAPI;

FIG. 17 shows immunocytochemistry of cells overexpressing dNLS-TDP43 and MYCL20H-14H1-14K2. dNLS-TDP43 was detected using anti-HA, and MYCL20H-14H1-14K2 was detected using anti-MYC. The merge shows levels of co-localization in reference with the nucleus stained with DAPI;

FIG. 18 shows immunocytochemistry of cells overexpressing dNLS-TDP43 and MYCH20L-14H1-14K2. dNLS-TDP43 was detected using anti-HA, and MYCH20L-14H1-14K2 was detected using anti-MYC. The merge shows levels of co-localization in reference with the nucleus stained with DAPI;

FIG. 19 shows a western blot of cells overexpressing dNLS-TDP43 and LYS-2F7, LYS-28H3-28K1, or LYS-14H1-14K2. Anti-HA was used to detect expression levels of dNLS-TDP43. Anti-FLAG was used to detect expression levels of LYS-2F7, LYS-28H3-28K1, and LYS-14H1-14K2. Actin was used as a loading control. EV (empty vector) is a reference negative control plasmid;

FIG. 20 shows a western blot of cells overexpressing dNLS-TDP43 and YPTL-14H1-14K2. Anti-HA was used to detect expression levels of dNLS-TDP43. Anti-FLAG was used to detect expression levels of YPTL-14H1-14K2. Actin was used as a loading control. EV is a reference negative control;

FIG. 21 shows a western blot of cells overexpressing dNLS-TDP43 and MYCL15H-2F7, MYCH15L-2F7, MYCL15H-1H3-1K3, or MYCH20L-14H1-14K2. Anti-HA was used to detect expression levels of dNLS-TDP43. Anti-MYC was used to detect expression levels of MYCL15H-2F7, MYCH15L-2F7, MYCL15H-1H3-1K3, and MYCH20L-14H1-14K2. Actin was used as a loading control. EV is a reference negative control.

DETAILED DESCRIPTION OF THE DISCLOSURE

I. Definitions

As used herein, the term “TDP-43” (transactivation response element (TAR) DNA-binding protein 43), alternately referred to as “TDP43” or “TDP”, unless otherwise qualified, means all forms of TDP-43 including wild type TDP-43, native TDP-43, as well as misfolded forms including mutant forms and analogs thereof from all species, particularly human TDP-43 (i.e. hTDP-43). Human TDP-43 is a protein of typically 414 amino acid residues and the amino acid sequence (e.g. Uniprot Accession number Q13148) and the nucleotide sequence (e.g. Accession number HGNC:11571) have been previously characterized.

“Wild type” as used herein refers to the primary amino acid sequence of non-mutant or naturally occurring protein.

“Native” as used herein refers to the normal three dimensional structure of a specific protein or part thereof. Native TDP-43 is optionally referred to as “natively folded” TDP-43, “normally folded” TDP-43 and/or “healthy” TDP-43. Accordingly the term “native TDP-43”, or “natively folded TDP-43”, herein refers to TDP-43 as natively folded after nascent translation and/or multimers including but not limited to dimeric TDP-43 and trimeric TDP-43, as folded in non-disease states (e.g. healthy cells) with a molecular structure that comprises a non-covalently associated, individual TDP-43 peptide which shows native structure in x-ray crystallography or as reconstructed from nuclear magnetic resonance spectra. Native TDP-43 forms multimers through its NTD and TDP-43 when natively folded is typically nuclear. Misfolded aggregates of TDP-43 can be and are typically cytoplasmic.

“Misfolded” as used herein refers to the secondary and tertiary structure of a polypeptide or part thereof, and indicates that the polypeptide has adopted a conformation that is not normal for that polypeptide in its properly functioning state. Although misfolding can be caused by mutations in a protein, such as amino acid deletion, substitution, or addition, wild-type sequence protein can also be misfolded in disease, and expose disease-specific epitopes for instance, as a result of microenvironmental conditions and/or amino acid modification such as nitration, oxidation, carbonylation or other modification. Other post-translational modifications include aberrant ubiquitination, phosphorylation, acetylation, sumoylation, and cleavage into C-terminal fragments. Misfolded TDP43 can be aggregated and/or cytosolic. In the context of TDP-43, native TDP-43 forms multimers through its NTD. Misfolded multimers (e.g. disease-associated oligomers) typically oligomerize through other regions of the protein, for example its LCD and/or RRM1 domains. Accordingly, “misfolded TDP-43 polypeptide”, or “misfolded TDP-43” when referring to the polypeptide herein includes TDP-43 polypeptide that is oligomerized through its LCD and/or RRM1 domains, non-native dimers and trimers, as well as larger aggregates (e.g. 5 or greater subunits), which is cytosolic and/or is aggregated. Misfolded TDP-43 is prone to the formation of aggregates which results in a loss of protein function, toxicity, possession of amyloid-like features (e.g. congo red staining) and propagation of pathogenic aggregates.

The term “mutant TDP-43” refers to forms of TDP-43, and particularly endogenous forms of TDP-43, that occur as a result of a genetic mutation that results, for instance, in an amino acid substitution, such as those substitutions characteristic of FTD or familial ALS, including for example the mutations described in the bioinformatics tool described in [6].

The term “DAGWGNL (SEQ ID NO: 1)” means the amino acid sequence: aspartic acid, alanine, glycine, tryptophan, glycine, asparagine, and leucine as shown in SEQ ID NO: 1. Similarly GWG refers to the amino acid sequences identified by the 1-letter amino acid code. Depending on the context, the reference of the amino acid sequence can refer to a sequence in TDP-43 or an isolated peptide. The sequence DAGWGNL (SEQ ID NO: 1) corresponds to residues 65-71 in the amino acid primary sequence of TDP-43.
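The residue numbering stated above can be checked with a short sketch. It assumes only what the definition gives (the sequence DAGWGNL and the 1-based start position 65); the function and variable names are illustrative and not part of the disclosure.

```python
# Illustrative sketch: map the DAGWGNL epitope (SEQ ID NO: 1) onto the
# TDP-43 residue numbers stated in the definition (residues 65-71).
EPITOPE = "DAGWGNL"
EPITOPE_START = 65  # 1-based position of the first epitope residue, per the text


def residue_numbers(epitope: str, start: int) -> dict:
    """Return {residue position: one-letter amino acid} for a contiguous epitope."""
    return {start + offset: aa for offset, aa in enumerate(epitope)}


numbering = residue_numbers(EPITOPE, EPITOPE_START)

# Positions of the GWG motif inside the epitope (1-based TDP-43 numbering).
gwg_offset = EPITOPE.index("GWG")
gwg_positions = list(range(EPITOPE_START + gwg_offset,
                           EPITOPE_START + gwg_offset + 3))
```

Under this numbering the tryptophan of the epitope falls at position 68, which agrees with the reference to W68 elsewhere in the disclosure.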

The term “amino acid” includes all of the naturally occurring amino acids as well as modified L-amino acids as well as D-amino acids. The atoms of the amino acid can for example include different isotopes. For example, the amino acids can comprise deuterium substituted for hydrogen, nitrogen-15 substituted for nitrogen-14, and carbon-13 substituted for carbon-12 and other similar changes.

A “conservative amino acid substitution” as used herein, is one in which one amino acid residue is replaced with another amino acid residue without abolishing the protein's desired properties. Suitable conservative amino acid substitutions can be made by substituting amino acids with similar hydrophobicity, polarity, and R-group size for one another. Examples of conservative amino acid substitution include:

Conservative Substitutions

Type of Amino Acid    Substitutable Amino Acids
Hydrophilic           Ala, Pro, Gly, Glu, Asp, Gln, Asn, Ser, Thr
Sulphydryl            Cys
Aliphatic             Val, Ile, Leu, Met
Basic                 Lys, Arg, His
Aromatic              Phe, Tyr, Trp
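As a minimal sketch, the groupings listed above can be encoded as a lookup for deciding whether a proposed substitution stays within one group. The group names and members are taken from the table; the function names are illustrative assumptions. The check is purely a classification and does not establish that a given substitution preserves the desired properties of the protein.

```python
# Illustrative sketch: classify a substitution as conservative when both
# residues belong to the same group in the table above.
CONSERVATIVE_GROUPS = {
    "Hydrophilic": {"Ala", "Pro", "Gly", "Glu", "Asp", "Gln", "Asn", "Ser", "Thr"},
    "Sulphydryl": {"Cys"},
    "Aliphatic": {"Val", "Ile", "Leu", "Met"},
    "Basic": {"Lys", "Arg", "His"},
    "Aromatic": {"Phe", "Tyr", "Trp"},
}


def group_of(residue: str) -> str:
    """Return the table group of a three-letter amino acid code."""
    for name, members in CONSERVATIVE_GROUPS.items():
        if residue in members:
            return name
    raise ValueError(f"residue not listed in the table: {residue!r}")


def is_conservative(original: str, replacement: str) -> bool:
    """True when the two residues differ but fall in the same table group."""
    return original != replacement and group_of(original) == group_of(replacement)
```

For example, `is_conservative("Lys", "Arg")` is true under this table, whereas replacing Trp with Ala crosses groups and is not.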

The term “antibody” as used herein is intended to include monoclonal antibodies, polyclonal antibodies, single chain, humanized and other chimeric antibodies, or fully human antibodies, as well as binding fragments thereof. Also included are vectorized antibodies or intrabodies. The antibody may be from recombinant sources and/or produced in transgenic animals. Also included are human antibodies that can be produced through using biochemical techniques or isolated from a library. Humanized or chimeric antibody may include sequences from one or more than one isotype or class.

The phrase “isolated antibody” refers to antibody produced in vivo or in vitro that has been removed from the source that produced the antibody, for example, an animal, hybridoma or other cell line (such as recombinant cells that produce antibody). The isolated antibody is optionally “purified”, which means at least: 80%, 85%, 90%, 95%, 98% or 99% purity.

The term “intrabody” or “intrabodies” as used herein refers to an antibody that is expressed or can be expressed in a cell and that binds to an intracellular protein; for example, an intrabody is an antibody that has been modified or adapted for intracellular localization and intracellular function. An intrabody comprises a heavy chain variable domain, a light chain variable domain and a linker, optionally in either variable domain orientation, e.g. heavy chain variable domain-linker-light chain variable domain or light chain variable domain-linker-heavy chain variable domain. Depending on the context, the term intrabody may refer to a nucleic acid molecule or a polypeptide molecule.
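The two variable domain orientations described above can be illustrated with a small sketch. The domain strings below are placeholders rather than sequences from the Sequence Listing, and the (GGGGS)x3 linker is an assumption chosen only because it is a commonly used flexible linker; this passage does not specify a linker sequence.

```python
# Illustrative sketch: concatenate variable domains and a linker in either
# orientation. All sequences here are placeholders, not disclosed sequences.
ASSUMED_LINKER = "GGGGS" * 3  # assumption: a commonly used flexible linker


def assemble_single_chain(vh: str, vl: str, linker: str,
                          orientation: str = "VH-VL") -> str:
    """Return the single chain polypeptide string for the chosen orientation."""
    if orientation == "VH-VL":
        return vh + linker + vl
    if orientation == "VL-VH":
        return vl + linker + vh
    raise ValueError("orientation must be 'VH-VL' or 'VL-VH'")


# Placeholder domain strings used only to show the ordering.
vh_placeholder = "<VH>"
vl_placeholder = "<VL>"
heavy_first = assemble_single_chain(vh_placeholder, vl_placeholder, ASSUMED_LINKER)
light_first = assemble_single_chain(vh_placeholder, vl_placeholder, ASSUMED_LINKER,
                                    orientation="VL-VH")
```

Both constructs contain the same three parts; only the order of the variable domains around the linker differs.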

The term “linker” as used herein refers to a synthetic sequence (e.g., an amino acid sequence in a polypeptide or a nucleic acid sequence in a nucleic acid) that connects or links two sequences, e.g., that links two polypeptide domains. The linker can be a “tag linker”, indicating that it is linking a detectable label, or a “targeting moiety linker”, indicating that it is linking a targeting moiety to a polypeptide, which may itself also comprise a linker, as in the case of a heavy chain variable region linked to a light chain variable region.

The term “complementarity determining region” or “CDR” as used herein refers to particular hypervariable regions of antibodies that are commonly understood to define epitope binding. Computational methods for identifying CDR sequences include Kabat, Chothia, and IMGT. A person skilled in the art having regard to the sequences comprised herein would also be able to identify CDR sequences based on Kabat and Chothia etc.

The term “detectable label” as used herein refers to moieties, such as peptide sequences or fluorescent proteins, that can be appended to or introduced into a peptide, antibody or other compound described herein and which are capable of producing, either directly or indirectly, a detectable signal.

The term “epitope selectively presented or accessible in misfolded TDP-43” as used herein refers to an epitope that is selectively presented or antibody-accessible on misfolded TDP-43 as present for example in ALS or FTD (e.g. disease associated misfolded TDP-43) whether in monomeric, dimeric or aggregated forms, but not on the molecular surface of the native, correctly folded, homodimeric form of TDP-43. As shown herein, W68 is selectively presented or accessible in misfolded TDP-43.

The term “greater affinity” as used herein refers to a degree of antibody binding where an antibody X binds to target Y more strongly (k_on) and/or with a smaller dissociation constant (k_off) than to target Z, and in this context antibody X has a greater affinity for target Y than for Z. Likewise, the term “lesser affinity” herein refers to a degree of antibody binding where an antibody X binds to target Y less strongly and/or with a larger dissociation constant than to target Z, and in this context antibody X has a lesser affinity for target Y than for Z. The affinity of binding between an antibody and its target antigen can be expressed as K_A, equal to 1/K_D, where K_D is equal to k_off/k_on. The k_on and k_off values can be measured using surface plasmon resonance (measurable for example using a Biacore system).
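The relationship between the rate constants and the equilibrium constants can be sketched as follows, using the standard convention K_D = k_off/k_on and K_A = 1/K_D. The numerical rate constants are hypothetical values chosen for illustration and are not measurements from the disclosure.

```python
# Illustrative sketch: equilibrium constants from kinetic rate constants,
# using the standard convention K_D = k_off / k_on and K_A = 1 / K_D.
def dissociation_constant(k_on: float, k_off: float) -> float:
    """K_D in molar, given k_on in 1/(M*s) and k_off in 1/s."""
    if k_on <= 0 or k_off <= 0:
        raise ValueError("rate constants must be positive")
    return k_off / k_on


def association_constant(k_on: float, k_off: float) -> float:
    """K_A in 1/M; the reciprocal of K_D."""
    return 1.0 / dissociation_constant(k_on, k_off)


def has_greater_affinity(kd_target_y: float, kd_target_z: float) -> bool:
    """A smaller K_D for target Y than for target Z means greater affinity for Y."""
    return kd_target_y < kd_target_z


# Hypothetical rate constants (assumed values, not disclosed data).
kd_y = dissociation_constant(k_on=1.0e5, k_off=1.0e-3)  # about 1e-8 M
kd_z = dissociation_constant(k_on=1.0e5, k_off=1.0e-1)  # about 1e-6 M
```

With these assumed values the antibody has the smaller K_D, and therefore the greater affinity, for target Y.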

The term “nucleic acid sequence” as used herein refers to a sequence of nucleoside or nucleotide monomers consisting of naturally occurring bases, sugars and intersugar (backbone) linkages. The term also includes modified or substituted sequences comprising non-naturally occurring monomers or portions thereof. The nucleic acid sequences of the present application may be deoxyribonucleic acid (DNA) sequences or ribonucleic acid (RNA) sequences and may include naturally occurring bases including adenine, guanine, cytosine, thymine and uracil. The sequences may also contain modified bases. Examples of such modified bases include aza and deaza adenine, guanine, cytosine, thymine and uracil; and xanthine and hypoxanthine. The nucleic acid can be either double stranded or single stranded, and can represent the sense or antisense strand. Further, the term “nucleic acid” includes the complementary nucleic acid sequences as well as codon optimized or synonymous codon equivalents. The term “isolated nucleic acid sequences” as used herein refers to a nucleic acid substantially free of cellular material or culture medium when produced by recombinant DNA techniques, or chemical precursors, or other chemicals when chemically synthesized. An isolated nucleic acid is also substantially free of sequences which naturally flank the nucleic acid (i.e. sequences located at the 5′ and 3′ ends of the nucleic acid) from which the nucleic acid is derived.

“Operatively linked” is intended to mean that the nucleic acid is linked to regulatory sequences in a manner which allows expression of the nucleic acid. Suitable regulatory sequences may be derived from a variety of sources, including bacterial, fungal, viral, mammalian, or insect genes. Selection of appropriate regulatory sequences is dependent on the host cell chosen and may be readily accomplished by one of ordinary skill in the art. Examples of such regulatory sequences include: a transcriptional promoter and enhancer or RNA polymerase binding sequence, a ribosomal binding sequence, including a translation initiation signal. Additionally, depending on the host cell chosen and the vector employed, other sequences, such as an origin of replication, additional DNA restriction sites, enhancers, and sequences conferring inducibility of transcription may be incorporated into the expression vector.

The term “vector” as used herein comprises any intermediary vehicle for a nucleic acid molecule which enables said nucleic acid molecule, for example, to be introduced into prokaryotic and/or eukaryotic cells and/or integrated into a genome, and include plasmids, phagemids, bacteriophages or viral vectors such as retroviral based vectors, including lentiviral vectors, Adeno Associated viral (AAV) vectors and the like. The term “plasmid” as used herein generally refers to a construct of extrachromosomal genetic material, usually a circular DNA duplex, which can replicate independently of chromosomal DNA.

By “at least moderately stringent hybridization conditions” it is meant that conditions are selected which promote selective hybridization between two complementary nucleic acid molecules in solution. Hybridization may occur to all or a portion of a nucleic acid sequence molecule. The hybridizing portion is typically at least 15 (e.g. 20, 25, 30, 40 or 50) nucleotides in length. Those skilled in the art will recognize that the stability of a nucleic acid duplex, or hybrids, is determined by the Tm, which in sodium containing buffers is a function of the sodium ion concentration and temperature (Tm = 81.5° C. + 16.6 (log10 [Na+]) + 0.41 (% (G+C)) − 600/l, where l is the length of the hybrid in nucleotides, or similar equation). Accordingly, the parameters in the wash conditions that determine hybrid stability are sodium ion concentration and temperature. In order to identify molecules that are similar, but not identical, to a known nucleic acid molecule, a 1% mismatch may be assumed to result in about a 1° C. decrease in Tm; for example, if nucleic acid molecules are sought that have a >95% identity, the final wash temperature will be reduced by about 5° C. Based on these considerations those skilled in the art will be able to readily select appropriate hybridization conditions. In preferred embodiments, stringent hybridization conditions are selected. By way of example the following conditions may be employed to achieve stringent hybridization: hybridization at 5×sodium chloride/sodium citrate (SSC)/5×Denhardt's solution/1.0% SDS at Tm − 5° C. based on the above equation, followed by a wash of 0.2×SSC/0.1% SDS at 60° C. Moderately stringent hybridization conditions include a washing step in 3×SSC at 42° C. It is understood, however, that equivalent stringencies may be achieved using alternative buffers, salts and temperatures.
Additional guidance regarding hybridization conditions may be found in: Current Protocols in Molecular Biology, John Wiley & Sons, N.Y., 2002, and in: Sambrook et al., Molecular Cloning: a Laboratory Manual, Cold Spring Harbor Laboratory Press, 2001.
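The Tm estimate and the mismatch adjustment discussed above can be sketched numerically. The sketch assumes the commonly cited form of the equation, in which the salt term is positive (Tm = 81.5 + 16.6 log10[Na+] + 0.41(%G+C) − 600/l, so that Tm rises with sodium concentration), and the rule of thumb of about 1° C. per 1% mismatch. The function names and input values are illustrative assumptions.

```python
import math


# Illustrative sketch of the duplex stability estimate. Assumes the commonly
# cited form with a positive salt term; inputs below are hypothetical.
def melting_temperature(na_molar: float, gc_percent: float, length: int) -> float:
    """Estimated Tm in degrees C for a hybrid of `length` nucleotides."""
    if na_molar <= 0 or length <= 0:
        raise ValueError("sodium concentration and length must be positive")
    return (81.5
            + 16.6 * math.log10(na_molar)
            + 0.41 * gc_percent
            - 600.0 / length)


def wash_temperature_reduction(min_identity_percent: float) -> float:
    """Degrees C to lower the final wash, assuming about 1 C per 1% mismatch."""
    if not 0.0 <= min_identity_percent <= 100.0:
        raise ValueError("identity must be between 0 and 100 percent")
    return 100.0 - min_identity_percent


# Hypothetical hybrid: 1.0 M sodium, 50% G+C, 600 nucleotides.
tm_example = melting_temperature(na_molar=1.0, gc_percent=50.0, length=600)
reduction_for_95 = wash_temperature_reduction(95.0)
```

Because log10 of a sub-molar sodium concentration is negative, lowering the salt in the wash lowers the estimated Tm, which is why low salt washes are the more stringent ones.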

As used herein “binds” or “specifically binds” in reference to an antibody means that the antibody recognizes its target antigen and binds its target with greater affinity than it does to a structurally different antigen and/or to an antigen with modified or mutated sequence. For example a multivalent antibody binds its target with a K_D of at least 1e-6, at least 1e-7, at least 1e-8, at least 1e-9 or at least 1e-10. Affinities greater than at least 1e-8 are preferred. An antigen binding fragment, such as a Fab fragment comprising one variable domain, may bind its target with 10 fold or 100 fold less affinity than a multivalent interaction with a non-fragmented antibody.

The term “selective” or “preferential” as used herein with respect to an antibody that selectively/preferentially binds a form of TDP-43 (e.g. native, or misfolded protein) means that the binding protein binds the form with at least 3 fold, or at least 5 fold, at least 10 fold, at least 20 fold, at least 100 fold, at least 250 fold, or at least 500 fold or more greater affinity. Accordingly an antibody that is more selective for a particular conformation (e.g. misfolded protein) preferentially binds the particular form of TDP-43 with at least 3 fold, or at least 5 fold, at least 10 fold, at least 20 fold, at least 100 fold, at least 250 fold, or at least 500 fold or more greater affinity compared to another form.
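One way to express the fold difference described above is as a ratio of dissociation constants; this is an assumption made for illustration, since the definition does not prescribe how the fold difference is measured. The threshold values are those recited in the definition, and the K_D inputs are hypothetical.

```python
from typing import Optional

# Illustrative sketch: fold selectivity as a ratio of dissociation constants
# (an assumed measure), compared against the thresholds recited above.
FOLD_THRESHOLDS = (3, 5, 10, 20, 100, 250, 500)


def fold_selectivity(kd_preferred_form: float, kd_other_form: float) -> float:
    """Fold greater affinity for the preferred form (smaller K_D binds tighter)."""
    if kd_preferred_form <= 0 or kd_other_form <= 0:
        raise ValueError("dissociation constants must be positive")
    return kd_other_form / kd_preferred_form


def highest_threshold_met(fold: float) -> Optional[int]:
    """Largest recited threshold that the fold value reaches, or None."""
    met = [t for t in FOLD_THRESHOLDS if fold >= t]
    return max(met) if met else None


# Hypothetical K_D values in nM for misfolded and native TDP-43.
fold_example = fold_selectivity(kd_preferred_form=2.0, kd_other_form=50.0)
threshold_example = highest_threshold_met(fold_example)
```

With these assumed values the antibody binds the misfolded form with 25 fold greater affinity, which meets the "at least 20 fold" level but not the "at least 100 fold" level.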

The term “animal” or “subject” as used herein includes all members of the animal kingdom including mammals, optionally including or excluding humans.

The term “treating” or “treatment” as used herein and as is well understood in the art, means an approach for obtaining beneficial or desired results, including clinical results. Beneficial or desired clinical results can include, but are not limited to, alleviation or amelioration of one or more symptoms or conditions, diminishment of extent of disease, stabilized (i.e. not worsening) state of disease, preventing spread of disease, delay or slowing of disease progression, amelioration or palliation of the disease state, diminishment of the reoccurrence of disease, and remission (whether partial or total), whether detectable or undetectable. “Treating” and “Treatment” can also mean prolonging survival as compared to expected survival if not receiving treatment. “Treating” and “treatment” as used herein also include prophylactic treatment, for example in a subject identified as carrying a mutation associated with familial forms, such as the familial form of ALS. A subject with a TDP-43 proteinopathy such as ALS can be treated to delay or slow disease progression. Subjects can be treated with a compound, antibody (including vectorized antibody or intrabody), immunogen, immunoconjugate, or composition described herein to prevent progression.

In understanding the scope of the present disclosure, the term “consisting” and its derivatives, as used herein, are intended to be close ended terms that specify the presence of stated features, elements, components, groups, integers, and/or steps, and also exclude the presence of other unstated features, elements, components, groups, integers and/or steps.

The recitation of numerical ranges by endpoints herein includes all numbers and fractions subsumed within that range (e.g. 1 to 5 includes 1, 1.5, 2, 2.75, 3, 3.90, 4, and 5). It is also to be understood that all numbers and fractions thereof are presumed to be modified by the term “about.” Further, it is to be understood that “a”, “an” and “the” include plural referents unless the content clearly dictates otherwise. The term “about” means plus or minus 0.1 to 50%, 5-50%, or 10-40%, preferably 10-20%, more preferably 10% or 15%, of the number to which reference is being made.

Further, the definitions and embodiments described in particular sections are intended to be applicable to other embodiments herein described for which they are suitable as would be understood by a person skilled in the art. For example, in the following passages, different aspects of the invention are defined in more detail. Each aspect so defined may be combined with any other aspect or aspects unless clearly indicated to the contrary. In particular, any feature indicated as being preferred or advantageous may be combined with any other feature or features indicated as being preferred or advantageous.

II. Antibodies, Immunoconjugates, Cells and Nucleic Acids

As described in the Examples, single chain antibodies were prepared and vectorized.

The single chain antibodies are directed to an N-terminal epitope that is accessible in misfolded TDP-43 but is unavailable in natively folded non-disease associated TDP-43. Antibodies raised to an immunogen comprising the N-terminal TDP-43 sequence DAGWGNL (SEQ ID NO: 1) preferentially bound misfolded TDP-43 aggregates. Residue W68 was found to be an important residue in conferring antibody specificity for misfolded TDP-43 aggregates.

In one embodiment, the single chain antibodies are intrabodies.

The single chain antibodies, particularly when used as intrabodies, optionally include a lysosomal-targeting or autophagy-targeting signal. The heavy chain and light chain variable regions of several antibodies specific for misfolded TDP-43 were linked using various linkers and in various orientations. The vectorized single chain antibodies, or intrabodies, were expressed in cells along with mutant dNLS-TDP-43, which comprises a deletion of its nuclear localization signal. dNLS-TDP-43 localizes to the cytoplasm where it forms aggregates. As demonstrated herein, the vectorized antibodies were able to co-localize with and induce degradation of intracellular misfolded TDP-43 aggregates. The vectorized antibodies were not toxic to the cells, confirming their lack of interference with normal TDP-43 function.

An aspect includes a single chain antibody that binds W68 in misfolded TDP-43 and comprises a heavy chain variable region comprising complementarity determining regions CDR-H1, CDR-H2 and CDR-H3 and a light chain variable region comprising complementarity determining regions CDR-L1, CDR-L2 and CDR-L3, wherein the heavy chain variable region and the light chain variable region are linked by a linker. The orientation of the heavy and light chain variable regions and linker can be heavy chain variable region-linker-light chain variable region or light chain variable region-linker-heavy chain variable region. In some embodiments, the single chain antibody further comprises a lysosomal or autophagy targeting sequence.
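The two orientations described above can be illustrated with a minimal Python sketch. The function name `assemble_scfv`, the toy variable region strings and the glycine-serine linker are assumptions introduced for illustration only; they are not sequences of the Sequence Listing. Appending the optional targeting sequence at the C-terminus is likewise an assumption, as the passage above does not specify its position.

```python
from typing import Literal

Orientation = Literal["VH-VL", "VL-VH"]


def assemble_scfv(
    vh: str,
    vl: str,
    linker: str,
    orientation: Orientation = "VH-VL",
    targeting_signal: str = "",
) -> str:
    """Concatenate variable regions and linker in the requested orientation.

    targeting_signal is appended C-terminally (hypothetical placement).
    """
    if orientation == "VH-VL":
        core = vh + linker + vl
    elif orientation == "VL-VH":
        core = vl + linker + vh
    else:
        raise ValueError(f"unknown orientation: {orientation!r}")
    return core + targeting_signal


# Toy placeholders, NOT real variable region sequences.
TOY_VH = "HHHH"
TOY_VL = "LLLL"
TOY_LINKER = "GGGGSGGGGSGGGGS"  # illustrative flexible linker (assumption)

heavy_first = assemble_scfv(TOY_VH, TOY_VL, TOY_LINKER, "VH-VL")
light_first = assemble_scfv(TOY_VH, TOY_VL, TOY_LINKER, "VL-VH")
```

Both constructs contain the same three components; only the order of the variable regions around the linker differs.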

In one embodiment, the single chain antibody has CDR sequences comprising:

CDR-H1: GFTFSSYY (SEQ ID NO: 130);

CDR-H2: INSNGGST (SEQ ID NO: 131);

CDR-H3: VRQNYEGAY (SEQ ID NO: 132);

CDR-L1: QSIVHSNGNTY (SEQ ID NO: 133);

CDR-L2: KVS (SEQ ID NO: 134);

and

CDR-L3: FQSSHVPWT (SEQ ID NO: 135).

Single chain antibodies comprising the CDRs of SEQ ID NOs: 130-135 specifically bind W68 in the context of DAGWGNL (SEQ ID NO: 1) in misfolded TDP-43. The single chain antibody comprises CDR sequences of antibody 2F7.

In an embodiment, the single chain antibody comprises a heavy chain variable region comprising: i) an amino acid sequence as set forth in SEQ ID NO: 138, ii) an amino acid sequence with at least 80%, at least 85%, at least 90% or at least 95% sequence identity to SEQ ID NO: 138, wherein the CDR sequences are as set forth in SEQ ID NOs: 130-132, or iii) a conservatively substituted amino acid sequence of i), and/or wherein the single chain antibody comprises a light chain variable region comprising: i) an amino acid sequence as set forth in SEQ ID NO: 139, ii) an amino acid sequence with at least 80%, at least 85%, at least 90% or at least 95% sequence identity to SEQ ID NO: 139, wherein the CDR sequences are as set forth in SEQ ID NOs: 133-135, or iii) a conservatively substituted amino acid sequence of i), optionally wherein the heavy chain variable region amino acid sequence is encoded by a nucleotide sequence as set out in SEQ ID NO: 136 or a codon degenerate or optimized version thereof and/or the light chain variable region amino acid sequence is encoded by a nucleotide sequence as set out in SEQ ID NO: 137 or a codon degenerate or optimized version thereof.
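By way of illustration only, the combination of a sequence identity threshold with unchanged CDRs, as in option ii) above, may be sketched in Python as follows. The sketch assumes the two sequences are already aligned and of equal length (no gaps), which is a simplification; a general identity calculation would use a pairwise alignment. The function names and the short framework placeholders are hypothetical, and the toy reference is not SEQ ID NO: 138; only the three heavy chain CDR strings are taken from the text above.

```python
from typing import Sequence


def percent_identity(a: str, b: str) -> float:
    """Percent of identical positions between two pre-aligned, equal-length sequences."""
    if len(a) != len(b) or not a:
        raise ValueError("sequences must be non-empty and pre-aligned to equal length")
    matches = sum(x == y for x, y in zip(a, b))
    return 100.0 * matches / len(a)


def cdrs_retained(candidate: str, cdrs: Sequence[str]) -> bool:
    """True if every CDR occurs in the candidate, unchanged and in the given order."""
    pos = 0
    for cdr in cdrs:
        idx = candidate.find(cdr, pos)
        if idx < 0:
            return False
        pos = idx + len(cdr)
    return True


def meets_variant_definition(
    candidate: str, reference: str, cdrs: Sequence[str], min_identity: float = 80.0
) -> bool:
    """Identity threshold met AND all CDRs unchanged (sketch of option ii))."""
    return cdrs_retained(candidate, cdrs) and percent_identity(candidate, reference) >= min_identity


# CDR-H1, CDR-H2, CDR-H3 as recited above (SEQ ID NOs: 130-132).
HEAVY_CDRS = ("GFTFSSYY", "INSNGGST", "VRQNYEGAY")

# Hypothetical framework placeholders; NOT the sequence of SEQ ID NO: 138.
toy_reference = "QVQL" + HEAVY_CDRS[0] + "WVRQ" + HEAVY_CDRS[1] + "YYAD" + HEAVY_CDRS[2] + "WGQG"

framework_variant = "E" + toy_reference[1:]                    # one framework substitution
cdr_variant = toy_reference.replace("GFTFSSYY", "GFTFSSYA")    # one CDR substitution
```

In this sketch, `framework_variant` differs from the toy reference at a single framework position and retains all three CDRs, whereas `cdr_variant` has high overall identity but fails the CDR condition.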

In one embodiment, the single chain antibody has CDR sequences comprising:

CDR-H1: GFSLSRYY (SEQ ID NO: 10);

CDR-H2: IIPGGTT (SEQ ID NO: 11);

CDR-H3: AGGPTGNSHFTL (SEQ ID NO: 12);

CDR-L1: ESVYNNNH (SEQ ID NO: 13);

CDR-L2: EAS (SEQ ID NO: 14);

and

CDR-L3: SGYKRVTTDGIA (SEQ ID NO: 15).

Single chain antibodies comprising the CDRs of SEQ ID NOs: 10-15 specifically bind W68 in the context of DAGWGNL (SEQ ID NO: 1) in misfolded TDP-43. The single chain antibody comprises CDR sequences of antibody 1H3-1K3.

In an embodiment, the single chain antibody comprises a heavy chain variable region comprising: i) an amino acid sequence as set forth in SEQ ID NO: 98, ii) an amino acid sequence with at least 80%, at least 85%, at least 90% or at least 95% sequence identity to SEQ ID NO: 98, wherein the CDR sequences are as set forth in SEQ ID NOs: 10-12, or iii) a conservatively substituted amino acid sequence of i), and/or wherein the single chain antibody comprises a light chain variable region comprising: i) an amino acid sequence as set forth in SEQ ID NO: 99, ii) an amino acid sequence with at least 80%, at least 85%, at least 90% or at least 95% sequence identity to SEQ ID NO: 99, wherein the CDR sequences are as set forth in SEQ ID NOs: 13-15, or iii) a conservatively substituted amino acid sequence of i), optionally wherein the heavy chain variable region amino acid sequence is encoded by a nucleotide sequence as set out in SEQ ID NO: 76 or a codon degenerate or optimized version thereof and/or the light chain variable region amino acid sequence is encoded by a nucleotide sequence as set out in SEQ ID NO: 77 or a codon degenerate or optimized version thereof.
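As a non-limiting illustration of a codon degenerate version, two nucleotide sequences are codon degenerate with respect to each other when they differ in codon usage but translate to the same amino acid sequence. The Python sketch below builds the standard genetic code and compares translations. The function names and the short nucleotide strings are hypothetical examples chosen to encode the CDR-L2 tripeptide EAS; they are not excerpts of SEQ ID NO: 76 or 77.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding DNA sequence (length divisible by 3) into amino acids."""
    dna = dna.upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length must be a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


def are_codon_degenerate(seq_a: str, seq_b: str) -> bool:
    """True if both nucleotide sequences encode the same amino acid sequence."""
    return translate(seq_a) == translate(seq_b)


# Hypothetical examples: two different encodings of E-A-S, and one encoding of E-A-C.
version_1 = "GAAGCTTCT"
version_2 = "GAGGCCAGC"
different_protein = "GAAGCTTGT"
```

Here `version_1` and `version_2` differ at several nucleotide positions yet encode the same tripeptide, while `different_protein` does not.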

In one embodiment, the single chain antibody has CDR sequences comprising:

CDR-H1: GFSLSSYN (SEQ ID NO: 120);

CDR-H2: IGTGGIT (SEQ ID NO: 121);

CDR-H3: VRSSGSDWWFHI (SEQ ID NO: 122);

CDR-L1: QSVYNNNN (SEQ ID NO: 123);

CDR-L2: RAS (SEQ ID NO: 124);

and

CDR-L3: QGYFSGFITT (SEQ ID NO: 125).

Single chain antibodies comprising the CDRs of SEQ ID NOs: 120-125 specifically bind W68 in the context of DAGWGNL (SEQ ID NO: 1) in misfolded TDP-43. The single chain antibody comprises CDR sequences of antibody 28H3-28K1.

In an embodiment, the single chain antibody comprises a heavy chain variable region comprising: i) an amino acid sequence as set forth in SEQ ID NO: 128, ii) an amino acid sequence with at least 80%, at least 85%, at least 90% or at least 95% sequence identity to SEQ ID NO: 128, wherein the CDR sequences are as set forth in SEQ ID NOs: 120-122, or iii) a conservatively substituted amino acid sequence of i), and/or wherein the single chain antibody comprises a light chain variable region comprising: i) an amino acid sequence as set forth in SEQ ID NO: 129, ii) an amino acid sequence with at least 80%, at least 85%, at least 90% or at least 95% sequence identity to SEQ ID NO: 129, wherein the CDR sequences are as set forth in SEQ ID NOs: 123-125, or iii) a conservatively substituted amino acid sequence of i), optionally wherein the heavy chain variable region amino acid sequence is encoded by a nucleotide sequence as set out in SEQ ID NO: 126 or a codon degenerate or optimized version thereof and/or the light chain variable region amino acid sequence is encoded by a nucleotide sequence as set out in SEQ ID NO: 127 or a codon degenerate or optimized version thereof.

In one embodiment, the single chain antibody has CDR sequences comprising:

CDR-H1: GFSFSSNYV (SEQ ID NO: 16);

CDR-H2: IWFAGIVDTT (SEQ ID NO: 17);

CDR-H3: ARNPVGSVNL (SEQ ID NO: 18);

CDR-L1: ESVYSNNR (SEQ ID NO: 19);

CDR-L2: YAS (SEQ ID NO: 20);

and

CDR-L3: AGWRGARTDGVD (SEQ ID NO: 21).

Single chain antibodies comprising the CDRs of SEQ ID NOs: 16-21 specifically bind W68 in the context of DAGWGNL (SEQ ID NO: 1) in misfolded TDP-43. In one embodiment, the single chain antibody comprises CDR sequences of antibody 14H1-14K2.

In an embodiment, the single chain antibody comprises a heavy chain variable region comprising: i) an amino acid sequence as set forth in SEQ ID NO: 100, ii) an amino acid sequence with at least 80%, at least 85%, at least 90% or at least 95% sequence identity to SEQ ID NO: 100, wherein the CDR sequences are as set forth in SEQ ID NOs: 16-18, or iii) a conservatively substituted amino acid sequence of i), and/or wherein the single chain antibody comprises a light chain variable region comprising: i) an amino acid sequence as set forth in SEQ ID NO: 101, ii) an amino acid sequence with at least 80%, at least 85%, at least 90% or at least 95% sequence identity to SEQ ID NO: 101, wherein the CDR sequences are as set forth in SEQ ID NOs: 19-21, or iii) a conservatively substituted amino acid sequence of i), optionally wherein the heavy chain variable region amino acid sequence is encoded by a nucleotide sequence as set out in SEQ ID NO: 78 or a codon degenerate or optimized version thereof and/or the light chain variable region amino acid sequence is encoded by a nucleotide sequence as set out in SEQ ID NO: 79 or a codon degenerate or optimized version thereof.
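For convenience, the four CDR sets recited in this section may be collected in a lookup table, sketched below in Python. The CDR strings are copied from the text above; the dictionary layout, the function names and the poly-alanine placeholder frameworks are assumptions made for illustration. Because each CDR-L2 is only three residues long, a plain substring search could match by chance in a real sequence, so the sketch is not a substitute for proper CDR annotation.

```python
from typing import Dict, List, Tuple

# (CDR-H1, CDR-H2, CDR-H3), (CDR-L1, CDR-L2, CDR-L3), keyed by SEQ ID NO range.
CDR_SETS: Dict[str, Tuple[Tuple[str, str, str], Tuple[str, str, str]]] = {
    "SEQ ID NOs: 130-135": (("GFTFSSYY", "INSNGGST", "VRQNYEGAY"),
                            ("QSIVHSNGNTY", "KVS", "FQSSHVPWT")),
    "SEQ ID NOs: 10-15": (("GFSLSRYY", "IIPGGTT", "AGGPTGNSHFTL"),
                          ("ESVYNNNH", "EAS", "SGYKRVTTDGIA")),
    "SEQ ID NOs: 120-125": (("GFSLSSYN", "IGTGGIT", "VRSSGSDWWFHI"),
                            ("QSVYNNNN", "RAS", "QGYFSGFITT")),
    "SEQ ID NOs: 16-21": (("GFSFSSNYV", "IWFAGIVDTT", "ARNPVGSVNL"),
                          ("ESVYSNNR", "YAS", "AGWRGARTDGVD")),
}


def contains_in_order(sequence: str, cdrs: Tuple[str, ...]) -> bool:
    """True if all CDRs occur in the sequence in the given order, without overlap."""
    pos = 0
    for cdr in cdrs:
        idx = sequence.find(cdr, pos)
        if idx < 0:
            return False
        pos = idx + len(cdr)
    return True


def matching_cdr_sets(vh: str, vl: str) -> List[str]:
    """Return the keys of every CDR set fully present in the given VH and VL."""
    return [
        key
        for key, (heavy, light) in CDR_SETS.items()
        if contains_in_order(vh, heavy) and contains_in_order(vl, light)
    ]


# Toy variable regions: poly-alanine placeholders around the SEQ ID NOs: 10-15 CDRs.
toy_vh = "AAAA" + "GFSLSRYY" + "AAAA" + "IIPGGTT" + "AAAA" + "AGGPTGNSHFTL" + "AAAA"
toy_vl = "AAAA" + "ESVYNNNH" + "AAAA" + "EAS" + "AAAA" + "SGYKRVTTDGIA" + "AAAA"
hits = matching_cdr_sets(toy_vh, toy_vl)
```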