Novel nucleic acid compositions, vector systems, modified cells, isolated peptides, isolated nucleic acid sequences and pharmaceutical compositions that encode or express T cell receptor components directed against Jchain are provided herein. These novel components may be used to enhance an immune response in a subject diagnosed with a B cell associated disease or condition. Associated methods for treating such subjects are also provided herein.
Haematological malignancies are cancers that affect the cellular components of blood or immune system. The cancer may begin in blood-forming tissue (e.g. bone marrow), or in the cells of the immune system. Examples of haematological malignancies include forms or leukemia, lymphoma and multiple myeloma (MM).
Multiple myeloma is a malignancy of the bone marrow which is characterized by uncontrolled expansion of malignant plasma cells. Advances in treatment options have extended survival of MM patients, however curative therapies are lacking and are urgently needed(1). Impressive initial response rates to CD19 chimeric antigen receptor (CAR) T cell therapy for acute lymphoblastic leukemia (ALL) and B cell lymphoma demonstrate great promise for engineered T cell therapy for B cell malignancies. Despite the high response rates, antigen escape variants of the tumors may occur. Because CD19 is not expressed by MM, B cell maturation antigen (BCMA) targeting CAR T cells were investigated. BCMA is expressed in a subset of memory B cells and in plasma cells as well as MM. BCMA CAR T cells were effective and safe but long-term complete responses were rare(2-4). Relapses often resulted from heterogenous BCMA expression within tumors, which frequently led to outgrowth of antigen low or negative MM cells (5,6).
To date, allogeneic stem cell transplantation (allo-SCT) has been the only curative therapy for MM but is decreasingly performed due to high toxicity risks. Allo-SCT effectiveness has been attributed to polyclonal T cell responses simultaneously targeting multiple antigens thereby preventing tumor antigen escape. The curative potential of allo-SCT demonstrated that multi-antigen targeting T cell responses can be a curative treatment option for MM. The recent observation that single-antigen-targeting BCMA CAR T cells were insufficient to eradicate all MM cells strengthens the belief that multi-antigen therapy should be developed to prevent antigen escape of heterogenous MM tumors(7).
To generate multi-antigen T cell therapy for MM, novel target antigens need to be identified and validated. To extend CAR T therapy options for MM, alternative CAR targets like SLAMF7 and GPRC5D are currently being explored. However, applicability of these antigens needs to be assessed as the SLAMF7 expression profile causes concerns regarding on-target off-tumor toxicity(8). GPRC5D expression profiles implicate less risk for on-target off-tumor toxicity but efficacy of GPRC5D targeting remains to be determined(9). The requirement for CAR targets to be present on the cell surface largely restricts discovery of new CAR target antigens and thus CAR therapy alone might not result in safe and curative T cell therapy.
In addition to CAR T cells, T cell receptor (TCR) engineered T cells could be of major value since TCRs can recognize peptides derived from any antigen presented in human leukocyte antigen (HLA) molecules on the surface of target cells and therefore an additional pool of MM antigens derived from intracellular proteins can be targeted. Previously, TCRs targeting peptides from the MM associated transcription factor BOB1 presented in HLA-B*07:02 and HLA-B*35:01 have been identified(10, 11). T cells engineered to express BOB1 specific TCRs induced potent lysis of MM cells in vitro. BOB1 targeting is a promising therapy for MM, as the BOB1 protein has been implicated to be a critical factor in MM cell survival and antigen escape is therefore unlikely.
Furthermore, efficacy of TCR engineered T cells for MM was investigated in trials using TCR engineered T cells targeting NY-ESO antigen presented in HLA-A2. T cells were administered after autologous-stem cell transplantation, which resulted in short-term anti-tumor response in 80% of individuals (12, 13). While efficacy should be further investigated in randomized control trials, these initial results demonstrated that further improvements need to be made since antigen escape variants frequently developed. Novel TCRs with different HLA restrictions as well as TCRs targeting multiple antigens will be needed in order to generate multi-antigen targeting therapies for all individuals.
As is clear from the above, there is a need for novel immunotherapies for treating haematological malignancies, including B cell associated haematological malignancies, such as multiple myeloma.
Although most of the disease burden is caused by infiltration of the malignant cells, the immunoglobulins produced by the B-cell malignancies and MM can give rise to severe organ damage which may even be lethal by affecting critical organs like heart, kidney, blood cells or cells from the neural system. This organ damage may be caused by antigen specific targeting of the immunoglobulins or precipitation of immune complexes. This organ damaging production of aberrant immunoglobulins may also be caused by pre-malignant or non-malignant counterparts of these disorders. Such disorders may be categorized as autoimmune disorders. In these cases specific targeting of the damaging immunoglobulin producing cells is necessary and sufficient. Several disorders are caused by specific immunoglobulin subsets including IgA or IgM antibodies.
The inventors recently identified the joining chain (Jchain) as a potential target antigen for immunotherapy of B cell associated diseases or conditions, particularly MM, as it was highly expressed in patient MM samples but not in healthy tissues of non-B cell origins(11). Jchain is a 159 amino acid long protein that links monomers of multimeric IgA and IgM when secreted by plasma cells, but Jchain expression in plasma cells is not exclusive for IgA and IgM secreting plasma cells(14, 15). Additionally, Jchain facilitates transport of dimeric IgA and pentameric IgM across mucosal barriers, poly-Ig receptor (plgR) binds the Jchain on the basolateral side of the membrane, after which transcytosis occurs and the complexes are secreted on the luminal side(16).
The presence of Jchain in MM has previously been reported in multiple studies. Jchain expression was maintained with disease progression even when immunoglobulin production was lost(17). Additionally, Jchain was observed in MM independent of the immunoglobulin isotype produced(18). These observations suggest that the Jchain could be a promising antigen for T cell-based targeting of MM. However, as Jchain is located intracellularly, TCR mediated targeting of Jchain derived peptides presented in HLA on the surface of target cells is required for it to be a bonafide antigen for T cell based targeting of MM.
By performing epitope discovery experiments as described herein, the inventors have now identified several Jchain derived antigens that are presented on MM cells in HLA-A*01:01 (HLA-A1), HLA-A*02:01 (HLA-A2), HLA-A*03:01 (HLA-A3), HLA-A*11:01 (HLA-A11), HLA-A*24:02 (HLA-A24), and HLA-B*40:01 (HLA-B40). Specifically, the inventors identified the Jchain derived peptides YTAVVPLVY (SEQ ID NO: 99), TAVVPLVY (SEQ ID NO: 102), VLAVFIKAVHV (SEQ ID NO: 103), VLAVFIKAV (SEQ ID NO: 104), YTAVVPLV (SEQ ID NO: 105), ISDPTSPLRTR (SEQ ID NO: 101), RIIVPLNNR (SEQ ID NO: 106), CYTAVVPLV (SEQ ID NO: 100) and GETKMVETAL (SEQ ID NO: 107).
The inventors identified that Jchain derived peptides YTAVVPLVY (SEQ ID NO: 99) and TAVVPLVY (SEQ ID NO: 102) are capable of being presented by HLA-A*01:01. Additionally, the inventors identified that Jchain derived peptides VLAVFIKAVHV (SEQ ID NO:103), VLAVFIKAV (SEQ ID NO: 104) and YTAVVPLV (SEQ ID NO: 105) are capable of being presented by HLA-A*02:01. The inventors further identified that Jchain derived peptide ISDPTSPLRTR (SEQ ID NO: 101) is capable of being presented by HLA-A*03:01 and/or HLA-A*11:01, and that the Jchain derived peptide RIIVPLNNR (SEQ ID NO: 106) is capable of being presented by HLA-A*11:01. The inventors further identified that Jchain derived peptide CYTAVVPLV (SEQ ID NO: 100) is capable of being presented by HLA-A*24:02, and that Jchain derived peptide GETKMVETAL (SEQ ID NO: 107) is capable of being presented by HLA-B*40:01.
Advantageously, the above mentioned peptides (e.g. SEQ ID NO:s 99 to 107) can be used as therapeutic agents (e.g. vaccines) to treat or prevent B cell associated diseases or conditions (as described elsewhere herein). The peptides themselves therefore have utility e.g. in isolated form, or when formulated as a pharmaceutical composition. Alternatively, said peptides can be used as a target antigen for treatment of such patients with modified cells described herein (e.g. peripheral blood lymphocytes or tumour-infiltrating lymphocytes (TILs)) having T cell receptors or other binding proteins that specifically recognize one of the specified peptides, for example T cell receptors that specifically recognize one of the specified peptides in the context of a specific HLA as described herein). Further examples of the utility of the peptides described herein are provided in detail elsewhere herein.
Using the methods described herein, the inventors successfully isolated T cells with Jchain specific recognition of these Jchain peptides when presented in HLA-A1, HLA-A3, HLA-A11 or HLA-A24 (as appropriate). Advantageously, upon sequencing and transfer of TCRs from selected T cell clones, in vitro as well as in vivo killing of MM was demonstrated without off-target effects.
The inventors have identified several TCRs that bind to a specific Jchain peptide presented in HLA-A1, HLA-A3, HLA-A11 and/or HLA-A24. Specifically, the inventors identified the following Jchain-specific TCR clones:
Accordingly, provided herein are isolated nucleic acid compositions encoding Jchain antigen-specific binding proteins (as well as corresponding vector systems, modified cells, pharmaceutical compositions etc). The TCR components of clones (i) to (vii) form the basis of the Jchain antigen-specific binding proteins of the invention (e.g. the isolated nucleic acid compositions encoding Jchain antigen-specific binding proteins of the invention) and are described in more detail elsewhere herein.
Advantageously, cells expressing the Jchain antigen-specific binding proteins described herein can be used as an effective immunotherapy in the treatment of B cell associated diseases or conditions, such as MM. Furthermore, they may be used in the treatment of B cell associated autoimmune diseases, as described in more detail below.
The invention therefore provides an isolated nucleic acid composition that encodes a Jchain antigen-specific binding protein having a TCR a chain variable (Vα) domain and a TCR β chain variable (Vβ) domain, the composition comprising:
Suitably, the Jchain antigen may comprise an amino acid sequence selected from the group consisting of: YTAVVPLVY (SEQ ID NO:99), CYTAVVPLV (SEQ ID NO:100), ISDPTSPLRTR (SEQ ID NO:101), TAVVPLVY (SEQ ID NO:102), VLAVFIKAVHV (SEQ ID NO:103), VLAVFIKAV (SEQ ID NO:104), YTAVVPLV (SEQ ID NO:105), RIIVPLNNR (SEQ ID NO:106), and GETKMVETAL (SEQ ID NO:107).
Suitably, the Jchain antigen may comprise an amino acid sequence selected from the group consisting of: YTAVVPLVY (SEQ ID NO:99), CYTAVVPLV (SEQ ID NO:100), and ISDPTSPLRTR (SEQ ID NO:101).
Suitably, the encoded binding protein may be capable of specifically binding to a peptide:HLA complex selected from the group consisting of: a YTAVVPLVY:HLA-A*01:01 complex; a CYTAVVPLV:HLA-A*24:02 complex; a ISDPTSPLRTR:HLA-A*03:01 complex; a ISDPTSPLRTR:HLA-A*11:01 complex; a TAVVPLVY:HLA-A*01:01 complex; a VLAVFIKAVHV:HLA-A*02:01 complex; a VLAVFIKAV:HLA-A*02:01 complex; a YTAVVPLV:HLA-A*02:01 complex; a RIIVPLNNR:HLA-A*11:01 complex; and a GETKMVETAL:HLA-B*40:01 complex.
Suitably, the peptide:HLA complex may be selected from the group consisting of: a YTAVVPLVY:HLA-A*01:01 complex; a CYTAVVPLV:HLA-A*24:02 complex; a ISDPTSPLRTR:HLA-A*03:01 complex; and a ISDPTSPLRTR:HLA-A*11:01 complex.
Suitably, the composition may comprise:
Suitably:
Suitably:
Suitably, the composition may comprise a nucleic acid sequence that encodes a TCR Vα domain comprising a CDR3 amino acid sequence having at least 80% sequence identity to SEQ ID NO:31, or a functional fragment thereof; and a nucleic acid sequence that encodes a TCR Vβ domain comprising a CDR3 amino acid sequence having at least 80% sequence identity to SEQ ID NO:34, or a functional fragment thereof.
Suitably, the CDR3 of the Vα domain may comprise or consist of the amino acid sequence of SEQ ID NO: 31, and the CDR3 of the Vβ domain may comprise or consist of the amino acid sequence of SEQ ID NO:34.
Suitably, the Vα domain may comprise an amino acid sequence having at least 80% sequence identity to, comprising, or consisting of, SEQ ID NO: 35; and (ii) the Vβ domain may comprise an amino acid sequence having at least 80% sequence identity to, comprising, or consisting of, SEQ ID NO: 37.
Suitably, the composition may comprise:
Suitably:
Suitably:
Suitably, the nucleic acid sequence may be codon optimised for expression in a host cell, optionally wherein the host cell is a human cell.
Suitably, the nucleic acid composition may further comprise a TCR a chain constant domain and/or a TCR β chain constant domain.
Suitably, the encoded binding protein may comprise a TCR, an antigen binding fragment of a TCR, a chimeric antigen receptor (CAR), or an ImmTAC.
Suitably, the antigen binding fragment of a TCR may be a single chain TCR (scTCR) or a chimeric TCR dimer in which the antigen binding fragment of the TCR is linked to an alternative transmembrane and intracellular signalling domain.
The invention also provides a vector system comprising a nucleic acid composition as described herein.
Suitably, the vector may be a plasmid, a viral vector, or a cosmid, optionally wherein the vector is selected from the group consisting of a retrovirus, lentivirus, adeno-associated virus, adenovirus, vaccinia virus, canary poxvirus, herpes virus, minicircle vector and synthetic DNA or RNA.
The invention also provides a modified cell comprising a nucleic acid composition as described herein, or a vector system as described herein.
Suitably, the modified cell may be selected from the group consisting of a CD8 T cell, a CD4 T cell, an NK cell, an NK-T cell, a gamma-delta T cell, a hematopoietic stem cell, an inducible pluripotent stem cell, a progenitor cell, a T cell line and a NK-92 cell line.
Suitably, the modified cell may be a human cell.
The invention also provides an isolated peptide comprising or consisting of an amino acid sequence selected from the group consisting of: YTAVVPLVY (SEQ ID NO:99), CYTAVVPLV (SEQ ID NO:100), ISDPTSPLRTR (SEQ ID NO:101), TAVVPLVY (SEQ ID NO:102), VLAVFIKAVHV (SEQ ID NO:103), VLAVFIKAV (SEQ ID NO:104), YTAVVPLV (SEQ ID NO:105), RIIVPLNNR (SEQ ID NO:106), and GETKMVETAL (SEQ ID NO:107).
Suitably, the peptide may be no more than 20 amino acids.
The invention also provides an isolated nucleic acid sequence encoding the peptide described herein. The invention also provides a vector system comprising the nucleic acid sequence.
The invention also provides a pharmaceutical composition comprising a nucleic acid composition as described herein, a vector system as described herein, a modified cell as described herein, an isolated peptide as described herein, or a nucleic acid sequence as described herein, and a pharmaceutically acceptable excipient, adjuvant, diluent and/or carrier.
Suitably, the composition comprises an isolated peptide as described herein, a nucleic acid sequence as described herein on, or a vector system as described herein, wherein the pharmaceutical composition is formulated as a vaccine.
The invention also provides a pharmaceutical composition as described herein for use in inducing or enhancing an immune response in human subject diagnosed with a B cell associated disease or condition.
The invention also provides a pharmaceutical composition as described herein for use in stimulating a cell mediated immune response to a target cell population or tissue in a human subject.
The invention also provides a pharmaceutical composition as described herein for use in providing anti-tumor immunity to a human subject.
The invention also provides a pharmaceutical composition as described herein for use in treating an human subject having a disease or condition associated with an elevated level of HLA-restricted Jchain antigen.
Suitably, the human subject may have at least one tumor.
Suitably, the subject may have been diagnosed with a B cell associated disease or condition. Suitably, the B cell associated disease or condition may be a hematological malignancy or an autoimmune disease or disorder.
Suitably, the hematological malignancy may be selected from the group consisting of: Multiple myeloma, plasma cell leukemia, (AL) Amyloidosis, Acute lymphoblastoid leukemia (ALL), Chronic lymphocytic leukemia (CLL), Waldenstrom macroglobulinemia and B cell lymphoma, optionally wherein the B cell lymphoma is selected from the group consisting of: Diffuse large B cell lymphoma (DLBCL), High grade B cell lymphoma, Mantel cell lymphoma (MCL), Follicular lymphoma (FL), and Burkitt Lymphoma.
Suitably, the hematological malignancy may be multiple myeloma.
Suitably, the autoimmune disease or disorder may be selected from the group consisting of: Rheumatoid arthritis, Multiple sclerosis, Vasculitis including Urticarial vasculitis, systemic vasculitis, renal vasculitis, Systemic lupus erythematosus (SLE), Autoimmune hemolytic anemia and Thrombocytopenia.
The invention also provides a method of generating a binding protein that is capable of specifically binding to a peptide containing a Jchain antigen and does not bind to a peptide that does not contain the Jchain antigen, comprising contacting a nucleic acid composition as described herein with a cell under conditions in which the nucleic acid composition is incorporated and expressed by the cell.
Suitably, the method may be ex vivo.
The invention also provides an isolated nucleic acid sequence comprising or consisting of the nucleotide sequence of any one of SEQ ID NOs: 8, 10, 12, 14, 22, 24, 26, 28, 36, 38, 40, 42, 50, 52, 54, 56, 64, 66, 68, 70, 78, 80, 82, 84, 92, 94, 96 or 98.
The invention also provides an isolated nucleic acid sequence comprising or consisting of the nucleotide sequence of any one of SEQ ID NOs: 8, 10, 12, 14, 22, 24, 26, 28, 36, 38, 40, 42, 50, 52, 54, 56, 64, 66, 68, 70, 78, 80, 82, 84, 92, 94, 96 or 98 for use in therapy.
Throughout the description and claims of this specification, the words “comprise” and “contain” and variations of them mean “including but not limited to”, and they are not intended to (and do not) exclude other moieties, additives, components, integers or steps.
Throughout the description and claims of this specification, the singular encompasses the plural unless the context otherwise requires. In particular, where the indefinite article is used, the specification is to be understood as contemplating plurality as well as singularity, unless the context requires otherwise.
Features, integers, characteristics, compounds, chemical moieties or groups described in conjunction with a particular aspect, embodiment or example of the invention are to be understood to be applicable to any other aspect, embodiment or example described herein unless incompatible therewith.
Various aspects of the invention are described in further detail below.
Embodiments of the invention are further described hereinafter with reference to the accompanying drawings, in which:
The patent, scientific and technical literature referred to herein establish knowledge that was available to those skilled in the art at the time of filing. The entire disclosures of the issued patents, published and pending patent applications, and other publications that are cited herein are hereby incorporated by reference to the same extent as if each was specifically and individually indicated to be incorporated by reference. In the case of any inconsistencies, the present disclosure will prevail.
Various aspects of the invention are described in further detail below.
Nucleic Acid Compositions that Encode Binding Protein Components
An isolated nucleic acid composition that encodes a Jchain antigen-specific binding protein having a TCR α chain variable (Vα) domain and a TCR β chain variable (Vβ) domain is provided herein, the composition comprising:
As would be clear to a person of skill in the art, the CDR3 amino acid sequences described herein specifically bind to their target (in this case a Jchain peptide), when the target (i.e. the appropriate Jchain peptide) is presented in the context of HLA. The binding proteins (and CDR3 sequences specifically described herein) are therefore capable of specifically binding to a Jchain peptide: HLA complex. These complexes are described in more detail elsewhere herein.
Any of the permutations described below for (a) may be combined with the permutations described below for (b) (e.g. to form an appropriate nucleic acid composition that encodes a Jchain antigen-specific binding protein having a TCR α chain variable (Vα) domain and a TCR β chain variable (Vβ) domain).
The invention provides an isolated nucleic acid composition that encodes a binding protein comprising T cell receptor (TCR) components that specifically bind a Jchain antigen. The encoded binding protein is therefore capable of specifically binding to a peptide containing a Jchain antigen (e.g. a Jchain antigen comprising an amino acid sequence selected from the group consisting of: SEQ ID NO: 99 to 107) and does not bind to a peptide that does not contain a Jchain antigen (e.g. it does not bind to a peptide that does not contain Jchain antigen comprising an amino acid sequence selected from the group consisting of: SEQ ID NO: 99 to 107).
The nucleic acid composition comprises (a) a nucleic acid sequence that encodes a TCR Vα domain with the specified features described herein and (b) a nucleic acid sequence that encodes a TCR Vβ domain with the specified features described herein. The encoded TCR components form a Jchain antigen-specific binding protein.
The nucleic acid sequences of (a) and (b) above may be distinct nucleic acid sequences within the nucleic acid composition. The TCR components of the binding protein may therefore be encoded by two (or more) nucleic acid sequences (with distinct nucleotide sequences) which, together, encode all of the TCR components of the binding protein. In other words, some of the TCR components may be encoded by one nucleic acid sequence in the nucleic acid composition, and others may be encoded by another (distinct) nucleic acid sequence within the nucleic acid composition.
Alternatively, the nucleic acid sequences of (a) and (b) may be part of a single nucleic acid sequence. The TCR components of the binding protein may therefore all be encoded by a single nucleic acid sequence (for example with a single open reading frame, or with multiple (e.g. 2 or more, three or more etc.) open reading frames).
Nucleic acid sequences described herein may form part of a larger nucleic acid sequence that encodes a larger component part of a functioning binding protein. For example, a nucleic acid sequence that encodes a TCR Vα domain with the specified features described herein may be part of a larger nucleic acid sequence that encodes a functional TCR α chain (including the constant domain). As another example, a nucleic acid sequence that encodes a TCR Vβ domain with the specified features described herein may be part of a larger nucleic acid sequence that encodes a functional TCR B chain (including the constant domain). As a further example, both nucleic acid sequences (a) and (b) above may be part of a larger nucleic acid sequence that encodes a combination of a functional TCR α chain (including the constant domain) and a functional TCR β chain (including the constant domain), optionally wherein the sequence encoding the functional TCR α chain is separated from the sequence encoding the functional TCR β chain by a linker sequence that enables coordinate expression of two proteins or polypeptides in the same nucleic acid sequence. More details on this are provided below.
The nucleic acid sequences described herein may alternatively encode a small component of a T cell receptor e.g. a TCR Vα domain, or a TCR Vβ domain, only. The nucleic acid sequences may be considered as “building blocks” that provide essential components for peptide binding specificity. The nucleic acid sequences described herein may be incorporated into a distinct nucleic acid sequence (e.g. a vector) that encodes the other elements of a functional binding protein such as a TCR, such that when the nucleic acid sequence described herein is incorporated, a new nucleic acid sequence is generated that encodes e.g. a TCR α chain and/or a TCR β chain that specifically binds to a Jchain antigen (e.g. wherein the Jchain antigen comprises an amino acid sequence selected from the group consisting of: YTAVVPLVY (SEQ ID NO: 99), TAVVPLVY (SEQ ID NO: 102), VLAVFIKAVHV (SEQ ID NO: 103), VLAVFIKAV (SEQ ID NO: 104), YTAVVPLV (SEQ ID NO: 105), ISDPTSPLRTR (SEQ ID NO: 101), RIIVPLNNR (SEQ ID NO: 106), CYTAVVPLV (SEQ ID NO: 100) and GETKMVETAL (SEQ ID NO: 107)). The nucleic acid sequences described herein therefore have utility as essential components that confer binding specificity for a Jchain antigen, and thus can be used to generate a larger nucleic acid sequence encoding a binding protein with the required antigen binding activity and specificity.
The nucleic acid sequences described herein may be codon optimised for expression in a host cell, for example they may be codon optimised for expression in a human cell, such as a cell of the immune system, a inducible pluripotent stem cell (iPSC), a hematopoietic stem cell, a T cell, a primary T cell, a T cell line, a NK cell, or a natural killer T cell (Scholten et al, Clin. Immunol. 119: 135, 2006). The T cell can be a CD4+ or a CD8+ T cell. Codon optimisation is a well-known method in the art for maximizing expression of a nucleic acid sequence in a particular host cell. For instance, one or more cysteine residues may also be introduced into the encoded TCR alpha and beta chain components (e.g. to reduce the risk of mispairing with endogenous TCR chains).
In one example, the nucleic acid sequences described herein are codon optimised for expression in a suitable host cell, and/or are modified to introduce codons encoding one or more cysteine amino acids (e.g. into the constant domain of the encoded TCR alpha chain and/or the encoded TCR beta chain) to reduce the risk of mispairing with endogenous TCR chains. In one example, the nucleic acid sequences described herein are codon optimised for expression in a suitable host cell, optionally wherein the host cell is a human cell.
In certain examples, a TCR constant domain is modified to enhance pairing of desired TCR chains. For example, enhanced pairing between a heterologous TCR α chain and a heterologous TCR β chain due to a modification may result in the preferential assembly of a TCR comprising two heterologous chains over an undesired mispairing of a heterologous TCR chain with an endogenous TCR chain (see, e.g., Govers et al, Trends Mol. Med. 16(2):11 (2010)). Exemplary modifications to enhance pairing of heterologous TCR chains include the introduction of complementary cysteine residues in each of the heterologous TCR α chain and β chain.
A binding protein that is encoded by the nucleic acid compositions described herein is specific for a Jchain antigen and comprises Jchain antigen specific-TCR components. However, the encoded binding protein is not limited to being a TCR. Other appropriate binding proteins that comprise the specified Jchain antigen specific -TCR components are also encompassed. For example, the encoded binding protein may comprise a TCR, an antigen binding fragment of a TCR, a chimeric antigen receptor (CAR), or an immTAC. TCRs, antigen binding fragments thereof, CARs and ImmTACs are well defined in the art. A non-limiting example of an antigen binding fragment of a TCR is a single chain TCR (scTCR) or a chimeric dimer composed of the antigen binding fragments of the TCR α and TCR β chain linked to transmembrane and intracellular domains of a dimeric complex so that the complex is a chimeric dimer TCR (cdTCR). An ImmTAC comprises a TCR connected to an anti-CD3 antibody. ImmTACs are therefore bispecific, combining Jchain-recognizing TCR components with immune activating complexes.
In certain examples, an antigen-binding fragment of a TCR comprises a single chain TCR (scTCR), which comprises both the TCR Vα and TCR Vβ domains, but only a single TCR constant domain. In other examples, an antigen-binding fragment of a TCR comprises a chimeric TCR dimer in which the antigen binding fragment of the TCR is linked to an alternative transmembrane and intracellular signalling domain (where the alternative transmembrane and intracellular signalling domain are not naturally found in TCRs). In further examples, an antigen-binding fragment of a TCR or a chimeric antigen receptor is chimeric (e.g., comprises amino acid residues or motifs from more than one donor or species), humanized (e.g., comprises residues from a non-human organism that are altered or substituted so as to reduce the risk of immunogenicity in a human), or human.
“Chimeric antigen receptor” (CAR) refers to a fusion protein that is engineered to contain two or more naturally-occurring amino acid sequences linked together in a way that does not occur naturally or does not occur naturally in a host cell, which fusion protein can function as a receptor when present on a surface of a cell. CARs described herein include an extracellular portion comprising an antigen binding domain (i.e., obtained or derived from an immunoglobulin or immunoglobulin-like molecule, such as an scFv derived from an antibody or TCR specific for an antigen (e.g. a cancer antigen etc), or an antigen binding domain derived or obtained from a killer immunoreceptor from an NK cell) linked to a transmembrane domain and one or more intracellular signalling domains (optionally containing co-stimulatory domain(s)) (see, e.g., Sadelain et al, Cancer Discov., 3(4):388 (2013); see also Harris and Kranz, Trends Pharmacol. Sci., 37(3):220 (2016), and Stone et al, Cancer Immunol. Immunother., 63(11): 1163 (2014)).
Methods for producing engineered TCRs are described in, for example, Bowerman et al, Mol. Immunol, 5(15):3000 (2009). Methods for making CARs are well known in the art and are described, for example, in U.S. Pat. Nos. 6,410,319; 7,446,191; U.S. Patent Publication No. 2010/065818; U.S. Pat. No. 8,822,647; PCT Publication No. WO 2014/031687; U.S. Pat. No. 7,514,537; and Brentjens et al, 2007, Clin. Cancer Res. 73:5426.
The binding proteins described herein may also be expressed as part of a transgene construct that encodes additional accessory proteins, such as a safety switch protein, a tag, a selection marker, a CD8 co-receptor β-chain, α-chain or both, or any combination thereof.
A T cell receptor (TCR) is a molecule found on the surface of T cells (T lymphocytes) that is responsible for recognising a peptide that is bound to (presented by) a major histocompatibility complex (MHC) molecule on a target cell. The invention is directed to nucleic acid compositions that encode binding proteins comprising TCR components that interact with a particular peptide in the context of the appropriate serotype of MHC, i.e. a Jchain antigen in the context of HLA-A*01:01, HLA-A*02:01, HLA-A*03:01, HLA-A*11:01, HLA-A*24:02 and/or HLA-B*40:01 (in other words, the encoded binding protein is capable of specifically binding to a Jchain antigen: specific HLA complex). In an example, the invention is directed to nucleic acid compositions that encode binding proteins comprising TCR components that interact with a particular peptide in the context of the appropriate serotype of MHC, i.e. YTAVVPLVY (SEQ ID NO: 99) in the context of HLA-A*01:01; TAVVPLVY (SEQ ID NO: 102) in the context of HLA-A*01:01; VLAVFIKAVHV(SEQ ID NO: 103) in the context of HLA-A*02:01; VLAVFIKAV (SEQ ID NO: 104) in the context of HLA-A*02:01; YTAVVPLV (SEQ ID NO: 105) in the context HLA-A*02:01; ISDPTSPLRTR (SEQ ID NO: 101) in the context of HLA-A*03:01 or HLA-A*11:01; RIIVPLNNR (SEQ ID NO: 106) in the context of HLA-A*11:01; CYTAVVPLV (SEQ ID NO: 100) in the context of HLA-A*24:02; or GETKMVETAL (SEQ ID NO: 107) in the context of HLA-B*40:01.
HLA-A*02:01 is a globally common human leukocyte antigen serotype within the HLA-A serotype group. Peptides that are presented by HLA-A*02:01 to TCRs are described as being “HLA-A*02:01 restricted”.
HLA-A*03:01, HLA-A*11:01, HLA-A*01:01, HLA-A*24:02 and HLA-B*40:01 are also common human leukocyte antigen serotypes within the HLA-A and HLA-B serotype groups. Peptides that are presented by HLA-A*03:01 to TCRs are described as being “HLA-A*03:01 restricted”. Similarly, peptides that are presented by HLA-A*11:01 to TCRs are described as being “HLA-A*11:01 restricted”. Similarly, peptides that are presented by HLA-A*01:01 to TCRs are described as being “HLA-A*01:01 restricted”. Similarly, peptides that are presented by HLA-A*24:02 to TCRs are described as being “HLA-A*24:02 restricted”. Finally, peptides that are presented by HLA-A*40:01 to TCRs are described as being “HLA-A*40:01 restricted”.
HLA-A*01:01 is also referred to herein as HLA-A1. Similarly, HLA-A*02:01 is also referred to herein as HLA-A2; HLA-A*03:01 is also referred to herein as HLA-A3; HLA-A*11:01 is also referred to herein as HLA-A11; HLA-A*24:02 is also referred to herein as HLA-A24; and HLA-B*40:01 is also referred to herein as HLA-B40.
As described herein, the inventors have identified several Jchain derived peptides presented on MM cells in HLA-A*01:01 (HLA-A1), HLA-A*02:01 (HLA-A2), HLA-A*03:01 (HLA-A3), HLA-A*11:01 (HLA-A11), HLA-A*24:02 (HLA-A24), and HLA-B*40:01 (HLA-B40). Specifically, the inventors identified the Jchain derived peptides YTAVVPLVY (SEQ ID NO: 99), TAVVPLVY (SEQ ID NO: 102), VLAVFIKAVHV (SEQ ID NO: 103), VLAVFIKAV (SEQ ID NO: 104), YTAVVPLV (SEQ ID NO: 105), ISDPTSPLRTR (SEQ ID NO: 101), RIIVPLNNR (SEQ ID NO: 106), CYTAVVPLV (SEQ ID NO: 100) and GETKMVETAL (SEQ ID NO: 107).
Accordingly, the Jchain antigen specifically bound by a binding protein described herein may comprise an amino acid sequence selected from the group consisting of: YTAVVPLVY (SEQ ID NO: 99), TAVVPLVY (SEQ ID NO: 102), VLAVFIKAVHV (SEQ ID NO: 103), VLAVFIKAV (SEQ ID NO: 104), YTAVVPLV (SEQ ID NO: 105), ISDPTSPLRTR (SEQ ID NO: 101), RIIVPLNNR (SEQ ID NO: 106), CYTAVVPLV (SEQ ID NO: 100) and GETKMVETAL (SEQ ID NO: 107). In one example, the Jchain antigen specifically bound by a binding protein described herein comprises an amino acid sequence selected from the group consisting of: YTAVVPLVY (SEQ ID NO: 99), ISDPTSPLRTR (SEQ ID NO: 101) and CYTAVVPLV (SEQ ID NO: 100). The antigen may be an antigenic fragment (i.e. a portion) of an amino acid sequence selected from the group consisting of: YTAVVPLVY (SEQ ID NO: 99), TAVVPLVY (SEQ ID NO: 102), VLAVFIKAVHV (SEQ ID NO: 103), VLAVFIKAV (SEQ ID NO: 104), YTAVVPLV (SEQ ID NO: 105), ISDPTSPLRTR (SEQ ID NO: 101), RIIVPLNNR (SEQ ID NO: 106), CYTAVVPLV (SEQ ID NO: 100) and GETKMVETAL (SEQ ID NO: 107), it may consist of an amino acid sequence selected from the group consisting of: YTAVVPLVY (SEQ ID NO: 99), TAVVPLVY (SEQ ID NO: 102), VLAVFIKAVHV (SEQ ID NO: 103), VLAVFIKAV (SEQ ID NO: 104), YTAVVPLV (SEQ ID NO: 105), ISDPTSPLRTR (SEQ ID NO: 101), RIIVPLNNR (SEQ ID NO: 106), CYTAVVPLV (SEQ ID NO: 100) and GETKMVETAL (SEQ ID NO: 107), or it may comprise (i.e. include within a longer sequence) an amino acid sequence selected from the group consisting of: YTAVVPLVY (SEQ ID NO: 99), TAVVPLVY (SEQ ID NO: 102), VLAVFIKAVHV (SEQ ID NO: 103), VLAVFIKAV (SEQ ID NO: 104), YTAVVPLV (SEQ ID NO: 105), ISDPTSPLRTR (SEQ ID NO: 101), RIIVPLNNR (SEQ ID NO: 106), CYTAVVPLV (SEQ ID NO: 100) and GETKMVETAL (SEQ ID NO: 107).
Accordingly, in an example, the Jchain antigen comprises an amino acid sequence selected from the group consisting of: YTAVVPLVY (SEQ ID NO: 99), TAVVPLVY (SEQ ID NO: 102), VLAVFIKAVHV (SEQ ID NO: 103), VLAVFIKAV (SEQ ID NO: 104), YTAVVPLV (SEQ ID NO: 105), ISDPTSPLRTR (SEQ ID NO: 101), RIIVPLNNR (SEQ ID NO: 106), CYTAVVPLV (SEQ ID NO: 100) and GETKMVETAL (SEQ ID NO: 107).
In another example, the Jchain antigen comprises an amino acid sequence selected from the group consisting of: YTAVVPLVY (SEQ ID NO: 99), ISDPTSPLRTR (SEQ ID NO: 101) and CYTAVVPLV (SEQ ID NO: 100).
In one example, the Jchain antigen comprises the amino acid sequence YTAVVPLVY (SEQ ID NO: 99). In another example, the Jchain antigen comprises the amino acid sequence ISDPTSPLRTR (SEQ ID NO: 101). In a further example, the Jchain antigen comprises the amino acid sequence CYTAVVPLV (SEQ ID NO: 100).
The inventors identified that the Jchain derived peptides YTAVVPLVY (SEQ ID NO: 99) and TAVVPLVY (SEQ ID NO: 102) are capable of being presented by HLA-A*01:01; that the Jchain derived peptides VLAVFIKAVHV (SEQ ID NO: 103), VLAVFIKAV (SEQ ID NO: 104) and YTAVVPLV (SEQ ID NO: 105) are capable of being presented by HLA-A*02:01; that the Jchain derived peptide ISDPTSPLRTR (SEQ ID NO: 101) is capable of being presented by HLA-A*03:01 and HLA-A*11:01; that the Jchain derived peptide RIIVPLNNR (SEQ ID NO: 106) is capable of being presented by HLA-A*11:01; that the Jchain derived peptide CYTAVVPLV (SEQ ID NO: 100) is capable of being presented by HLA-A*24:02; and that the Jchain derived peptide GETKMVETAL (SEQ ID NO: 107) is capable of being presented by HLA-B*40:01.
Accordingly, in one example, the encoded binding protein is capable of specifically binding to a peptide:HLA complex selected from the group consisting of: a YTAVVPLVY:HLA-A*01:01 complex; a TAVVPLVY:HLA-A*01:01 complex; a VLAVFIKAVHV:HLA-A*02:01 complex; a VLAVFIKAV:HLA-A*02:01 complex; a YTAVVPLV:HLA-A*02:01 complex; a ISDPTSPLRTR:HLA-A*03:01 complex; a RIIVPLNNR:HLA-A*11:01 complex; a ISDPTSPLRTR:HLA-A*11:01 complex; a CYTAVVPLV:HLA-A*24:02 complex; and a GETKMVETAL:HLA-B*40:01 complex.
In another example, the encoded binding protein is capable of specifically binding to a peptide:HLA complex selected from the group consisting of: a YTAVVPLVY:HLA-A*01:01 complex; a ISDPTSPLRTR:HLA-A*03:01 complex; a ISDPTSPLRTR:HLA-A*11:01 complex; and a CYTAVVPLV:HLA-A*24:02 complex.
In a further example, the encoded binding protein is capable of specifically binding to a YTAVVPLVY:HLA-A*01:01 complex. In another example, the encoded binding protein is capable of specifically binding to a ISDPTSPLRTR:HLA-A*03:01 complex. In another example, the encoded binding protein is capable of specifically binding to a ISDPTSPLRTR:HLA-A*11:01 complex. In a further example, the encoded binding protein is capable of specifically binding to a CYTAVVPLV:HLA-A*24:02 complex.
In one example, the Jchain derived peptide of the peptide:HLA complex comprises an antigenic fragment of an amino acid sequence selected from the group consisting of: SEQ ID NO: 99 to 107. In a further example, the Jchain derived peptide of the peptide:HLA complex comprises or consists of an amino acid sequence selected from the group consisting of: SEQ ID NO:99 to 107.
The TCR is composed of two different polypeptide chains. In humans, 95% of TCRs consist of an alpha (α) chain and a beta (β) chain (encoded by TRA and TRB respectively). When the TCR engages with peptide in the context of HLA (e.g. in the context of HLA-A*01:01, HLA-A*02:01, HLA-A*03:01, HLA-A*11:01, HLA-A*24:02 and/or HLA-B*40:01, in particular in the context of HLA-A*01:01, HLA-A*03:01, HLA-A*11:01 and/or HLA-A*24:02 as appropriate), the T cell is activated through signal transduction.
The alpha and beta chains of the TCR are highly variable in sequence. Each chain is composed of two extracellular domains, a variable domain (V) and a constant domain (C). The constant domain is proximal to the T cell membrane followed by a transmembrane region and a short cytoplasmic tail while the variable domain binds to the peptide/HLA complex.
An isolated nucleic acid composition that encodes a Jchain antigen-specific binding protein is provided herein having a TCR α chain variable (Vα) domain and a TCR β chain variable (Vβ) domain. In one example the nucleic acid composition described herein may comprise a TCR α chain constant domain and/or a TCR β chain constant domain.
The variable domain of each chain has three hypervariable regions (also called complementarity determining regions (CDRs)). Accordingly, the TCR alpha variable domain (referred to herein as a TCR Vα domain, TCR V alpha domain, Vα domain or V alpha domain, alpha variable domain etc) comprises a CDR1, a CDR2 and CDR3 region. Similarly, the TCR beta variable domain (referred to herein as a TCR Vβ domain, TCR V beta domain, Vβ domain or V beta domain, beta variable domain etc) also comprises a (different) CDR1, CDR2, and CDR3 region. In each of the alpha and beta variable domains it is CDR3 that is mainly responsible for recognizing the peptide being presented by the HLA molecules.
As will be clear to a person of skill in the art, the phrase “TCR α chain variable domain” refers to the variable (V) domain (extracellular domain) of a TCR alpha chain, and thus includes three hypervariable regions (CDR1, CDR2 and the specified CDR3), as well as the intervening sequences, but does not include the constant (C) domain of the alpha chain, which does not form part of the variable domain.
As will be clear to a person of skill in the art, the phrase “TCR β chain variable domain” refers to the variable (V) domain (extracellular domain) of a TCR beta chain, and thus includes three hypervariable regions (CDR1, CDR2 and the specified CDR3), as well as the intervening sequences, but does not include the constant (C) domain of the beta chain, which does not form part of the variable domain.
The isolated nucleic acid composition described herein encodes a Jchain antigen-specific binding protein. As discussed herein, the inventors have identified several TCRs that specifically bind to a Jchain antigen selected from YTAVVPLVY (SEQ ID NO:99), ISDPTSPLRTR (SEQ ID NO:101) and CYTAVVPLV (SEQ ID NO:100).
(i) Vα Domains that Interact with YTAVVPLVY (SEQ ID NO: 99)
As provided elsewhere herein, the inventors identified TCR clone 4G8.8 which interacts with YTAVVPLVY (SEQ ID NO:99) in the context of HLA-A*01:01. The sequences provided herein that correspond to TCR clone 4G8.8 are SEQ ID NO:s 1 to 14.
As discussed above, an isolated nucleic acid composition that encodes a Jchain antigen-specific binding protein having a TCR α chain variable (Vα) domain and a TCR β chain variable (Vβ) domain is provided, the composition comprising: (a) a nucleic acid sequence that encodes a TCR Vα domain comprising a CDR3 amino acid sequence; and (b) a nucleic acid sequence that encodes a TCR Vβ domain comprising a CDR3 amino acid sequence; wherein the CDR3 amino acid sequences of (a) and (b) together specifically bind to Jchain.
An example of an appropriate TCR Vα domain CDR3 amino acid sequence that confers specific binding to a Jchain antigen (e.g. to YTAVVPLVY (SEQ ID NO: 99)) is shown in SEQ ID NO:3. As would be clear to a person of skill in the art, variants of the amino acid sequence shown in SEQ ID NO:3 may also be functional (i.e. retain their ability to confer specific binding to a Jchain antigen (e.g. to the peptide YTAVVPLVY (SEQ ID NO: 99)) when the CDR3 is part of TCR Vα domain). Such functional variants are therefore encompassed herein.
For example, appropriate (functional) Vα domain CDR3 amino acid sequences may have at least 80% sequence identity to SEQ ID NO: 3, i.e. they may have at least 80%, at least 86%, at least 93%, or 100% sequence identity to SEQ ID NO: 3. Suitably, percent identity is calculated as the percentage of identity to the entire length of the reference sequence (e.g. SEQ ID NO:3). In other words, appropriate (functional) Vα domain CDR3 amino acid sequences may vary from the sequence shown in SEQ ID NO:3 by one or several (e.g. two etc) amino acids.
As stated above, functional variants of SEQ ID NO:3 retain their ability to confer specific binding to a Jchain antigen (e.g. the peptide shown in SEQ ID NO:99) when the CDR3 is part of TCR Vα domain.
Functional variants may be naturally occurring, synthetic, or synthetically improved functional variants of SEQ ID NO:3. The term “variant” also encompasses homologues and fragments. Functional variants will typically contain only conservative substitutions of one, two or more amino acids of SEQ ID NO:3, or substitution, deletion or insertion of non-critical amino acids in non-critical regions of the CDR3.
Non-functional variants are amino acid sequence variants of SEQ ID NO: 3 that do not specifically bind to a Jchain antigen (e.g. the peptide shown in SEQ ID NO:99). Non-functional variants will typically contain a non-conservative substitution, a deletion, or insertion or premature truncation of the amino acid sequence of SEQ ID NO:3 or a substitution, insertion or deletion in critical amino acids or critical regions. Methods for identifying functional and non-functional variants are well known to a person of ordinary skill in the art.
In one example, the CDR3 of the Vα domain comprises or consists of the amino acid sequence of SEQ ID NO: 3. In examples where the TCR Vα domain CDR3 has the amino acid sequence of SEQ ID NO:3, the CDR3 may be encoded by any appropriate nucleic acid sequence.
The encoded TCR Vα domain may comprise, in addition to the specified CDR3, a CDR1 comprising an amino acid sequence of SEQ ID NO: 1, or a functional variant thereof (i.e. wherein the variant retains the ability to specifically bind to the Jchain antigen (e.g. the peptide shown in SEQ ID NO:99)). Such functional variants may be naturally occurring, synthetic, or synthetically improved functional variants of SEQ ID NO:1. The term “variant” also encompasses homologues and fragments. Functional variants will typically contain only conservative substitutions of one or more amino acids of SEQ ID NO:1, or substitution, deletion or insertion of non-critical amino acids in non-critical regions of the protein.
Non-functional variants are amino acid sequence variants of SEQ ID NO: 1 that do not specifically bind to a Jchain antigen (e.g. the peptide shown in SEQ ID NO:99). Non-functional variants will typically contain a non-conservative substitution, a deletion, or insertion or premature truncation of the amino acid sequence of SEQ ID NO:1 or a substitution, insertion or deletion in critical amino acids or critical regions. Methods for identifying functional and non-functional variants are well known to a person of ordinary skill in the art.
For example, appropriate functional Vα domain CDR1 amino acid sequences may have at least 80% sequence identity to SEQ ID NO: 1, i.e. it may have at least 80%, at least 85%, or 100% sequence identity to SEQ ID NO: 1. Suitably, percent identity is calculated as the percentage of identity to the entire length of the reference sequence (e.g. SEQ ID NO:1). In other words, appropriate functional Vα domain CDR1 amino acid sequences may vary from the sequence shown in SEQ ID NO: 1 by one or several amino acids. As stated previously, the variant may comprise an amino acid substitution such as a conservative amino acid substitution compared to the sequence shown in SEQ ID NO:1. As stated above, functional variants of SEQ ID NO: 1 retain the ability to specifically bind to a Jchain antigen (e.g. the peptide shown in SEQ ID NO:99) when the CDR1 is part of TCR Vα domain).
In one example, the CDR1 of the Vα domain comprises or consists of the amino acid sequence of SEQ ID NO:1. In examples where the TCR Vα domain CDR1 has the amino acid sequence of SEQ ID NO:1, the CDR1 may be encoded by any appropriate nucleic acid sequence.
The encoded TCR Vα domain may also comprise, in addition to the specified CDR3 (and optionally the specified CDR1 above), a CDR2 comprising an amino acid sequence of SEQ ID NO:2, or a functional variant thereof (i.e. wherein the variant retains the ability to specifically bind to HLA-A*01:01). Such functional variants may be naturally occurring, synthetic, or synthetically improved functional variants of SEQ ID NO:2. The term “variant” also encompasses homologues and fragments. Functional variants will typically contain only conservative substitutions of one or more amino acids of SEQ ID NO:2, or substitution, deletion or insertion of non-critical amino acids in non-critical regions of the protein.
Non-functional variants are amino acid sequence variants of SEQ ID NO: 2 that do not specifically bind to HLA-A*01:01. Non-functional variants will typically contain a non-conservative substitution, a deletion, or insertion or premature truncation of the amino acid sequence of SEQ ID NO: 2 or a substitution, insertion or deletion in critical amino acids or critical regions. Methods for identifying functional and non-functional variants are well known to a person of ordinary skill in the art.
For example, appropriate functional Vα domain CDR2 amino acid sequences may have at least 80% sequence identity to SEQ ID NO: 2, i.e. it may have at least 80%, at least 87%, or 100% sequence identity to SEQ ID NO: 2. Suitably, percent identity is calculated as the percentage of identity to the entire length of the reference sequence (e.g. SEQ ID NO:2). In other words, appropriate (functional) Vα domain CDR2 amino acid sequences may vary from the sequence shown in SEQ ID NO:2 by one or several amino acids. As stated previously, the variant may comprise an amino acid substitution such as a conservative amino acid substitution compared to the sequence shown in SEQ ID NO:2). As stated above, a functional variant of SEQ ID NO: 2 retains the ability to specifically bind to HLA-A*01:01.
In one example, the CDR2 of the Vα domain comprises or consists of the amino acid sequence of SEQ ID NO: 2. In examples where the TCR Vα domain CDR2 has the amino acid sequence of SEQ ID NO:2, the CDR2 may be encoded by any appropriate nucleic acid sequence.
The encoded TCR Vα domain may therefore comprise the CDRs mentioned in detail above (by SEQ ID specifically i.e. SEQ ID NO:3, SEQ ID NO: 1 and SEQ ID NO: 2, or functional variants thereof), with appropriate intervening sequences between the CDRs.
The encoded TCR Vα domain may comprise an amino acid sequence of SEQ ID NO:7, or a functional variant thereof (i.e. wherein the variant TCR Vα domain retains the ability to specifically bind to a Jchain antigen (e.g. the peptide shown in SEQ ID NO:99) when part of a binding protein described herein). Such functional variants may be naturally occurring, synthetic, or synthetically improved functional variants of SEQ ID NO:7. The term “variant” also encompasses homologues and fragments. Functional variants will typically contain only conservative substitutions of one or more amino acids of SEQ ID NO:7, or substitution, deletion or insertion of non-critical amino acids in non-critical regions of the protein.
Non-functional variants are amino acid sequence variants of SEQ ID NO: 7 that do not specifically bind to a Jchain antigen (e.g. the peptide shown in SEQ ID NO:99). Non-functional variants will typically contain a non-conservative substitution, a deletion, or insertion or premature truncation of the amino acid sequence of SEQ ID NO:7 or a substitution, insertion or deletion in critical amino acids or critical regions. Methods for identifying functional and non-functional variants are well known to a person of ordinary skill in the art.
In one example, the encoded TCR Vα domain may have an amino acid sequence having at least 75%, at least 80%, at least 85% or at least 90% (or at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%) sequence identity to the amino acid sequence of SEQ ID NO: 7, whilst retaining the ability to specifically bind to a Jchain antigen (e.g. the peptide shown in SEQ ID NO:99). In other words, a functional TCR Vα domain with one or several amino acid substitutions compared to the sequence of SEQ ID NO:7 is also encompassed. As stated previously, the amino acid substitution may be a conservative amino acid substitution. The variability in sequence compared to SEQ ID NO:7 may all be in regions of the TCR Vα domain that do not form CDRs (i.e. the variant may have the CDRs of SEQ ID NO: 3, SEQ ID NO: 1 and/or SEQ ID NO: 2, and still have 25% (or less) sequence variability compared to SEQ ID NO:7). In other words, the sequence of the CDRs of SEQ ID NO: 7 may be retained whilst the rest of the sequence is varied, as appropriate within the “at least 75% identity” parameters specified above. Suitably, percent identity can be calculated as the percentage of identity to the entire length of the reference sequence (e.g. SEQ ID NO: 7).
As an example, the encoded TCR Vα domain may comprise an amino acid sequence having at least 75% (e.g. at least 75%, at least 80%, at least 85%, at least 90%, at least 95% etc) sequence identity to the amino acid sequence of SEQ ID NO: 7, wherein the TCR Vα domain comprises a CDR3 having an amino acid sequence of SEQ ID NO: 3. In this example, the TCR Vα domain CDR1 may have an amino acid sequence of SEQ ID NO: 1 and the TCR Vα domain CDR2 may have an amino acid sequence of SEQ ID NO: 2.
As another example, the encoded TCR Vα domain may comprise an amino acid sequence having at the amino acid sequence of SEQ ID NO: 7, with 0 to 10 (or 0 to 5) amino acid substitutions, insertions or deletions), wherein the TCR Vα domain comprises a CDR3 having an amino acid sequence of SEQ ID NO: 3. In this example, the TCR Vα domain CDR1 may have an amino acid sequence of SEQ ID NO: 1 and the TCR Vα domain CDR2 may have an amino acid sequence of SEQ ID NO: 2.
In examples where the TCR Vα domain has the amino acid sequence of SEQ ID NO:7, the TCR Vα domain may be encoded by the nucleic acid sequence of SEQ ID NO:8, or a genetically degenerate sequence thereof (i.e. other nucleic acid sequences that encode the same protein as a result of the degeneracy of the genetic code).
The phrase “genetically degenerate sequence thereof” is used interchangeably with “derivative thereof” herein.
For the avoidance of doubt, the nucleic acid sequence encoding the TCR Vα domain may also encode a TCR α chain constant domain. An example of a suitable constant domain (for either a TCR α chain or a TCR β chain) is encoded in the MP71-TCR-flex retroviral vector. However, the invention is not limited to this specific constant domain, and encompasses any appropriate TCR α chain constant domain. The constant domain may be murine derived, human derived or humanised. Methods for identifying or generating appropriate constant domains are well known to a person of skill in the art and are well within their routine capabilities.
By way of example only, the constant domain may be encoded by or derived from a vector, such as a lentiviral, retroviral or plasmid vector but also adenovirus, adeno-associated virus, vaccinia virus, canary poxvirus or herpes virus vectors in which murine or human constant domains are pre-cloned. Recently, minicircles have also been described for TCR gene transfer (non-viral Sleeping Beauty transposition from minicircle vectors as published by R Monjezi, et al., 2017). Moreover, naked (synthetic) DNA/RNA can also be used to introduce the TCR. As an example, a pMSGV retroviral vector with pre-cloned TCR-Ca and Cb genes as described in LV Coren et al., BioTechniques 2015 may be used to provide an appropriate constant domain. Alternatively, single stranded or double stranded DNA or RNA can be inserted by homologous directed repair into the TCR locus (see Roth et a/2018 Nature vol 559; page 405). As a further option, non-homologous end joining is possible.
An example of a specific TCR α chain amino acid sequence that includes a TCR Vα domain described herein with an appropriate constant domain is shown in SEQ ID NO: 11. Appropriate functional variants of SEQ ID NO:11 are also encompassed (e.g. variants having at least 75% (e.g. at least 75%, at least 80%, at least 85%, at least 90%, at least 95% etc) sequence identity to the amino acid sequence of SEQ ID NO: 11, wherein the variant TCR α chain amino acid sequence retains its ability to specifically bind to a Jchain antigen (e.g. the peptide shown in SEQ ID NO:99) when part of a binding protein described herein). In other words, a functional TCR α chain with one or several amino acid substitutions compared to the sequence of SEQ ID NO:11 is also encompassed. As stated previously, the amino acid substitution may be a conservative amino acid substitution. The variability in sequence compared to SEQ ID NO:11 may all be in regions of the TCR α chain that do not form CDRs (i.e. the variant may have the CDRs of SEQ ID NO: 3, SEQ ID NO: 1 and/or SEQ ID NO: 2, and still have 25% (or less) sequence variability compared to SEQ ID NO:11). In other words, the sequence of the CDRs of SEQ ID NO: 11 may be retained whilst the rest of the sequence is varied, as appropriate within the “at least 75% identity” parameters specified above. Suitably, percent identity can be calculated as the percentage of identity to the entire length of the reference sequence (e.g. SEQ ID NO: 11).
As an example, the encoded TCR α chain may comprise an amino acid sequence having at least 75% (e.g. at least 75%, at least 80%, at least 85%, at least 90%, at least 95% etc) sequence identity to the amino acid sequence of SEQ ID NO: 11, wherein the TCR α chain comprises a CDR3 having an amino acid sequence of SEQ ID NO: 3. In this example, the TCR α chain CDR1 may have an amino acid sequence of SEQ ID NO:1 and the TCR α chain CDR2 may have an amino acid sequence of SEQ ID NO: 2.
In examples where the TCR α chain has the amino acid sequence of SEQ ID NO:11, the TCR α chain may be encoded by the nucleic acid sequence of SEQ ID NO:12, or a genetically degenerate sequence thereof (i.e. other nucleic acid sequences that encode the same protein as a result of the degeneracy of the genetic code). It is noted that SEQ ID NO:12 is the nucleic acid sequence for TCR α chain of clone 4G8.8.
In one example, the nucleic acid composition provided herein comprises a nucleic acid sequence that encodes a TCR Vα domain comprising a CDR3 amino acid sequence having at least 80% sequence identity to SEQ ID NO:3, or a functional fragment thereof.
In another example, the CDR3 of the Vα domain of a nucleic acid composition provided herein comprises or consists of the amino acid sequence of SEQ ID NO: 3.
In another example, the Vα domain of a nucleic acid composition provided herein comprises an amino acid sequence having at least 80% sequence identity to, comprising, or consisting of, SEQ ID NO: 7.
As provided elsewhere herein, the inventors have also identified TCR clone 5D12.9 which interacts with YTAVVPLVY (SEQ ID NO:99) in the context of HLA-A*01:01. The sequences provided herein that correspond to TCR clone 5D12.9 are SEQ ID NO:s 15 to 28.
Accordingly, another example of an appropriate TCR Vα domain CDR3 amino acid sequence that confers specific binding to a Jchain antigen (e.g. to YTAVVPLVY (SEQ ID NO: 99)) is shown in SEQ ID NO:17. As would be clear to a person of skill in the art, variants of the amino acid sequence shown in SEQ ID NO:17 may also be functional (i.e. retain their ability to confer specific binding to a Jchain antigen (e.g. to the peptide YTAVVPLVY (SEQ ID NO: 99)) when the CDR3 is part of TCR Vα domain). Such functional variants are therefore encompassed herein.
For example, appropriate (functional) Vα domain CDR3 amino acid sequences may have at least 80% sequence identity to SEQ ID NO:17, i.e. they may have at least 80%, at least 83%, at least 91%, or 100% sequence identity to SEQ ID NO:17. Suitably, percent identity is calculated as the percentage of identity to the entire length of the reference sequence (e.g. SEQ ID NO:17). In other words, appropriate (functional) Vα domain CDR3 amino acid sequences may vary from the sequence shown in SEQ ID NO:17 by one or several (e.g. two etc) amino acids.
As stated above, functional variants of SEQ ID NO:17 retain their ability to confer specific binding to a Jchain antigen (e.g. the peptide shown in SEQ ID NO:99) when the CDR3 is part of TCR Vα domain.
Functional variants may be naturally occurring, synthetic, or synthetically improved functional variants of SEQ ID NO:17. The term “variant” also encompasses homologues and fragments. Functional variants will typically contain only conservative substitutions of one, two or more amino acids of SEQ ID NO:17, or substitution, deletion or insertion of non-critical amino acids in non-critical regions of the CDR3.
Non-functional variants are amino acid sequence variants of SEQ ID NO:17 that do not specifically bind to a Jchain antigen (e.g. to the peptide shown in SEQ ID NO:99). Non-functional variants will typically contain a non-conservative substitution, a deletion, or insertion or premature truncation of the amino acid sequence of SEQ ID NO:17 or a substitution, insertion or deletion in critical amino acids or critical regions. Methods for identifying functional and non-functional variants are well known to a person of ordinary skill in the art.
In one example, the CDR3 of the Vα domain comprises or consists of the amino acid sequence of SEQ ID NO: 17. In examples where the TCR Vα domain CDR3 has the amino acid sequence of SEQ ID NO:17, the CDR3 may be encoded by any appropriate nucleic acid sequence.
The encoded TCR Vα domain may comprise, in addition to the specified CDR3, a CDR1 comprising an amino acid sequence of SEQ ID NO: 15, or a functional variant thereof (i.e. wherein the variant retains the ability to specifically bind to a Jchain antigen (e.g. the peptide shown in SEQ ID NO:99)). Such functional variants may be naturally occurring, synthetic, or synthetically improved functional variants of SEQ ID NO:15. The term “variant” also encompasses homologues and fragments. Functional variants will typically contain only conservative substitutions of one or more amino acids of SEQ ID NO:15, or substitution, deletion or insertion of non-critical amino acids in non-critical regions of the protein.
Non-functional variants are amino acid sequence variants of SEQ ID NO: 15 that do not specifically bind to a Jchain antigen (e.g. the peptide shown in SEQ ID NO:99). Non-functional variants will typically contain a non-conservative substitution, a deletion, or insertion or premature truncation of the amino acid sequence of SEQ ID NO:15 or a substitution, insertion or deletion in critical amino acids or critical regions. Methods for identifying functional and non-functional variants are well known to a person of ordinary skill in the art.
For example, appropriate functional Vα domain CDR1 amino acid sequences may have at least 80% sequence identity to SEQ ID NO: 15, i.e. it may have at least 80%, at least 87%, or 100% sequence identity to SEQ ID NO: 15. Suitably, percent identity is calculated as the percentage of identity to the entire length of the reference sequence (e.g. SEQ ID NO:15). In other words, appropriate functional Vα domain CDR1 amino acid sequences may vary from the sequence shown in SEQ ID NO: 15 by one or several amino acids. As stated previously, the variant may comprise an amino acid substitution such as a conservative amino acid substitution compared to the sequence shown in SEQ ID NO:15. As stated above, functional variants of SEQ ID NO: 15 retain the ability to specifically bind to a Jchain antigen (e.g. the peptide shown in SEQ ID NO:99) when the CDR1 is part of TCR Vα domain).
In one example, the CDR1 of the Vα domain comprises or consists of the amino acid sequence of SEQ ID NO:15. In examples where the TCR Vα domain CDR1 has the amino acid sequence of SEQ ID NO:15, the CDR1 may be encoded by any appropriate nucleic acid sequence.
The encoded TCR Vα domain may also comprise, in addition to the specified CDR3 (and optionally the specified CDR1 above), a CDR2 comprising an amino acid sequence of SEQ ID NO:16, or a functional variant thereof (i.e. wherein the variant retains the ability to specifically bind to HLA-A*01:01). Such functional variants may be naturally occurring, synthetic, or synthetically improved functional variants of SEQ ID NO:16. The term “variant” also encompasses homologues and fragments. Functional variants will typically contain only conservative substitutions of one or more amino acids of SEQ ID NO:16, or substitution, deletion or insertion of non-critical amino acids in non-critical regions of the protein.
Non-functional variants are amino acid sequence variants of SEQ ID NO:16 that do not specifically bind to HLA-A*01:01. Non-functional variants will typically contain a non-conservative substitution, a deletion, or insertion or premature truncation of the amino acid sequence of SEQ ID NO:16 or a substitution, insertion or deletion in critical amino acids or critical regions. Methods for identifying functional and non-functional variants are well known to a person of ordinary skill in the art.
For example, appropriate functional Vα domain CDR2 amino acid sequences may have at least 80% sequence identity to SEQ ID NO:16, i.e. it may have at least 80%, at least 83%, or 100% sequence identity to SEQ ID NO:16. Suitably, percent identity is calculated as the percentage of identity to the entire length of the reference sequence (e.g. SEQ ID NO:16). In other words, appropriate (functional) Vα domain CDR2 amino acid sequences may vary from the sequence shown in SEQ ID NO:16 by one or several amino acids. As stated previously, the variant may comprise an amino acid substitution such as a conservative amino acid substitution compared to the sequence shown in SEQ ID NO:16). As stated above, a functional variant of SEQ ID NO:16 retains the ability to specifically bind to HLA-A*01:01.
In one example, the CDR2 of the Vα domain comprises or consists of the amino acid sequence of SEQ ID NO: 16. In examples where the TCR Vα domain CDR2 has the amino acid sequence of SEQ ID NO:16, the CDR2 may be encoded by any appropriate nucleic acid sequence.
The encoded TCR Vα domain may therefore comprise the CDRs mentioned in detail above (by SEQ ID specifically i.e. SEQ ID NO:17, SEQ ID NO:15 and SEQ ID NO: 16, or functional variants thereof), with appropriate intervening sequences between the CDRs.
The encoded TCR Vα domain may comprise an amino acid sequence of SEQ ID NO:21, or a functional variant thereof (i.e. wherein the variant TCR Vα domain retains the ability to specifically bind to a Jchain antigen (e.g. the peptide shown in SEQ ID NO:99) when part of a binding protein described herein). Such functional variants may be naturally occurring, synthetic, or synthetically improved functional variants of SEQ ID NO:21. The term “variant” also encompasses homologues and fragments. Functional variants will typically contain only conservative substitutions of one or more amino acids of SEQ ID NO:21, or substitution, deletion or insertion of non-critical amino acids in non-critical regions of the protein.
Non-functional variants are amino acid sequence variants of SEQ ID NO:21 that do not specifically bind to a Jchain antigen (e.g. the peptide shown in SEQ ID NO:99). Non-functional variants will typically contain a non-conservative substitution, a deletion, or insertion or premature truncation of the amino acid sequence of SEQ ID NO:21 or a substitution, insertion or deletion in critical amino acids or critical regions. Methods for identifying functional and non-functional variants are well known to a person of ordinary skill in the art.
In one example, the encoded TCR Vα domain may have an amino acid sequence having at least 75%, at least 80%, at least 85% or at least 90% (or at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%) sequence identity to the amino acid sequence of SEQ ID NO:21, whilst retaining the ability to specifically bind to a Jchain antigen (e.g. the peptide shown in SEQ ID NO:99). In other words, a functional TCR Vα domain with one or several amino acid substitutions compared to the sequence of SEQ ID NO:21 is also encompassed. As stated previously, the amino acid substitution may be a conservative amino acid substitution. The variability in sequence compared to SEQ ID NO:21 may all be in regions of the TCR Vα domain that do not form CDRs (i.e. the variant may have the CDRs of SEQ ID NO: 17, SEQ ID NO:15 and/or SEQ ID NO:16, and still have 25% (or less) sequence variability compared to SEQ ID NO:21). In other words, the sequence of the CDRs of SEQ ID NO: 21 may be retained whilst the rest of the sequence is varied, as appropriate within the “at least 75% identity” parameters specified above. Suitably, percent identity can be calculated as the percentage of identity to the entire length of the reference sequence (e.g. SEQ ID NO: 21).
As an example, the encoded TCR Vα domain may comprise an amino acid sequence having at least 75% (e.g. at least 75%, at least 80%, at least 85%, at least 90%, at least 95% etc) sequence identity to the amino acid sequence of SEQ ID NO: 21, wherein the TCR Vα domain comprises a CDR3 having an amino acid sequence of SEQ ID NO: 17. In this example, the TCR Vα domain CDR1 may have an amino acid sequence of SEQ ID NO: 15 and the TCR Vα domain CDR2 may have an amino acid sequence of SEQ ID NO: 16.
As another example, the encoded TCR Vα domain may comprise an amino acid sequence having at the amino acid sequence of SEQ ID NO: 21, with 0 to 10 (or 0 to 5) amino acid substitutions, insertions or deletions), wherein the TCR Vα domain comprises a CDR3 having an amino acid sequence of SEQ ID NO: 17. In this example, the TCR Vα domain CDR1 may have an amino acid sequence of SEQ ID NO: 15 and the TCR Vα domain CDR2 may have an amino acid sequence of SEQ ID NO: 16.
In examples where the TCR Vα domain has the amino acid sequence of SEQ ID NO:21, the TCR Vα domain may be encoded by the nucleic acid sequence of SEQ ID NO: 22, or a genetically degenerate sequence thereof (i.e. other nucleic acid sequences that encode the same protein as a result of the degeneracy of the genetic code).
For the avoidance of doubt, the nucleic acid sequence encoding the TCR Vα domain may also encode a TCR α chain constant domain. Examples of suitable constant domains are generally discussed above.
An example of a specific TCR α chain amino acid sequence that includes a TCR Vα domain described herein with an appropriate constant domain is shown in SEQ ID NO: 25. Appropriate functional variants of SEQ ID NO: 25 are also encompassed (e.g. variants having at least 75% (e.g. at least 75%, at least 80%, at least 85%, at least 90%, at least 95% etc) sequence identity to the amino acid sequence of SEQ ID NO: 25, wherein the variant TCR α chain amino acid sequence retains its ability to specifically bind to a Jchain antigen (e.g. the peptide shown in SEQ ID NO:99) when part of a binding protein described herein). In other words, a functional TCR α chain with one or several amino acid substitutions compared to the sequence of SEQ ID NO:25 is also encompassed. As stated previously, the amino acid substitution may be a conservative amino acid substitution. The variability in sequence compared to SEQ ID NO:25 may all be in regions of the TCR α chain that do not form CDRs (i.e. the variant may have the CDRs of SEQ ID NO: 17, SEQ ID NO: 15 and/or SEQ ID NO: 16, and still have 25% (or less) sequence variability compared to SEQ ID NO:25). In other words, the sequence of the CDRs of SEQ ID NO: 25 may be retained whilst the rest of the sequence is varied, as appropriate within the “at least 75% identity” parameters specified above. Suitably, percent identity can be calculated as the percentage of identity to the entire length of the reference sequence (e.g. SEQ ID NO: 25).
As an example, the encoded TCR α chain may comprise an amino acid sequence having at least 75% (e.g. at least 75%, at least 80%, at least 85%, at least 90%, at least 95% etc) sequence identity to the amino acid sequence of SEQ ID NO: 25, wherein the TCR α chain comprises a CDR3 having an amino acid sequence of SEQ ID NO: 17. In this example, the TCR α chain CDR1 may have an amino acid sequence of SEQ ID NO:15 and the TCR α chain CDR2 may have an amino acid sequence of SEQ ID NO: 16.
In examples where the TCR α chain has the amino acid sequence of SEQ ID NO:25, the TCR α chain may be encoded by the nucleic acid sequence of SEQ ID NO:26, or a genetically degenerate sequence thereof (i.e. other nucleic acid sequences that encode the same protein as a result of the degeneracy of the genetic code). It is noted that SEQ ID NO:26 is the nucleic acid sequence for TCR α chain of clone 5D12.9.
In one example, the nucleic acid composition provided herein comprises a nucleic acid sequence that encodes a TCR Vα domain comprising a CDR3 amino acid sequence having at least 80% sequence identity to SEQ ID NO:17, or a functional fragment thereof.
In another example, the CDR3 of the Vα domain of a nucleic acid composition provided herein comprises or consists of the amino acid sequence of SEQ ID NO: 17.
In another example, the Vα domain of a nucleic acid composition provided herein comprises an amino acid sequence having at least 80% sequence identity to, comprising, or consisting of, SEQ ID NO: 21.
As provided elsewhere herein, the inventors identified TCR clone 13F6.6 which interacts with YTAVVPLVY (SEQ ID NO:99) in the context of HLA-A*01:01. The sequences provided herein that correspond to TCR clone 13F6.6 are SEQ ID NO:s 85 to 98.
Accordingly, another example of an appropriate TCR Vα domain CDR3 amino acid sequence that confers specific binding to a Jchain antigen (e.g. to YTAVVPLVY (SEQ ID NO: 99)) is shown in SEQ ID NO:87. As would be clear to a person of skill in the art, variants of the amino acid sequence shown in SEQ ID NO:87 may also be functional (i.e. retain their ability to confer specific binding to a Jchain antigen (e.g. to the peptide YTAVVPLVY (SEQ ID NO: 99)) when the CDR3 is part of TCR Vα domain). Such functional variants are therefore encompassed herein.
For example, appropriate (functional) Vα domain CDR3 amino acid sequences may have at least 80% sequence identity to SEQ ID NO: 87, i.e. they may have at least 80%, at least 84%, at least 92%, or 100% sequence identity to SEQ ID NO: 87. Suitably, percent identity is calculated as the percentage of identity to the entire length of the reference sequence (e.g. SEQ ID NO: 87). In other words, appropriate (functional) Vα domain CDR3 amino acid sequences may vary from the sequence shown in SEQ ID NO: 87 by one or several (e.g. two etc) amino acids.
As stated above, functional variants of SEQ ID NO: 87 retain their ability to confer specific binding to a Jchain antigen (i.e. the peptide shown in SEQ ID NO:99) when the CDR3 is part of TCR Vα domain.
Functional variants may be naturally occurring, synthetic, or synthetically improved functional variants of SEQ ID NO: 87. The term “variant” also encompasses homologues and fragments. Functional variants will typically contain only conservative substitutions of one, two or more amino acids of SEQ ID NO: 87, or substitution, deletion or insertion of non-critical amino acids in non-critical regions of the CDR3.
Non-functional variants are amino acid sequence variants of SEQ ID NO: 87 that do not specifically bind to a Jchain antigen (i.e. the peptide shown in SEQ ID NO: 99). Non-functional variants will typically contain a non-conservative substitution, a deletion, or insertion or premature truncation of the amino acid sequence of SEQ ID NO: 87 or a substitution, insertion or deletion in critical amino acids or critical regions. Methods for identifying functional and non-functional variants are well known to a person of ordinary skill in the art.
In one example, the CDR3 of the Vα domain comprises or consists of the amino acid sequence of SEQ ID NO: 87. In examples where the TCR Vα domain CDR3 has the amino acid sequence of SEQ ID NO:87, the CDR3 may be encoded by any appropriate nucleic acid sequence.
The encoded TCR Vα domain may comprise, in addition to the specified CDR3, a CDR1 comprising an amino acid sequence of SEQ ID NO: 85, or a functional variant thereof (i.e. wherein the variant retains the ability to specifically bind to a Jchain antigen (e.g. the peptide shown in SEQ ID NO: 99)). Such functional variants may be naturally occurring, synthetic, or synthetically improved functional variants of SEQ ID NO: 85. The term “variant” also encompasses homologues and fragments. Functional variants will typically contain only conservative substitutions of one or more amino acids of SEQ ID NO: 85, or substitution, deletion or insertion of non-critical amino acids in non-critical regions of the protein.
Non-functional variants are amino acid sequence variants of SEQ ID NO: 85 that do not specifically bind to a Jchain antigen (e.g. the peptide shown in SEQ ID NO: 99). Non-functional variants will typically contain a non-conservative substitution, a deletion, or insertion or premature truncation of the amino acid sequence of SEQ ID NO: 85 or a substitution, insertion or deletion in critical amino acids or critical regions. Methods for identifying functional and non-functional variants are well known to a person of ordinary skill in the art.
For example, appropriate functional Vα domain CDR1 amino acid sequences may have at least 80% sequence identity to SEQ ID NO: 85, i.e. it may have at least 80%, at least 83%, or 100% sequence identity to SEQ ID NO: 85. Suitably, percent identity is calculated as the percentage of identity to the entire length of the reference sequence (e.g. SEQ ID NO: 85). In other words, appropriate functional Vα domain CDR1 amino acid sequences may vary from the sequence shown in SEQ ID NO: 85 by one or several amino acids. As stated previously, the variant may comprise an amino acid substitution such as a conservative amino acid substitution compared to the sequence shown in SEQ ID NO: 85. As stated above, functional variants of SEQ ID NO: 85 retain the ability to specifically bind to a Jchain antigen (e.g. the peptide shown in SEQ ID NO:99) when the CDR1 is part of TCR Vα domain).
In one example, the CDR1 of the Vα domain comprises or consists of the amino acid sequence of SEQ ID NO: 85. In examples where the TCR Vα domain CDR1 has the amino acid sequence of SEQ ID NO:85, the CDR1 may be encoded by any appropriate nucleic acid sequence.
The encoded TCR Vα domain may also comprise, in addition to the specified CDR3 (and optionally the specified CDR1 above), a CDR2 comprising an amino acid sequence of SEQ ID NO: 86, or a functional variant thereof (i.e. wherein the variant retains the ability to specifically bind to HLA-A*01:01). Such functional variants may be naturally occurring, synthetic, or synthetically improved functional variants of SEQ ID NO: 86. The term “variant” also encompasses homologues and fragments. Functional variants will typically contain only conservative substitutions of one or more amino acids of SEQ ID NO: 86, or substitution, deletion or insertion of non-critical amino acids in non-critical regions of the protein.
Non-functional variants are amino acid sequence variants of SEQ ID NO: 86 that do not specifically bind to HLA-A*01:01. Non-functional variants will typically contain a non-conservative substitution, a deletion, or insertion or premature truncation of the amino acid sequence of SEQ ID NO: 86 or a substitution, insertion or deletion in critical amino acids or critical regions. Methods for identifying functional and non-functional variants are well known to a person of ordinary skill in the art.
For example, appropriate functional Vα domain CDR2 amino acid sequences may have at least 80% sequence identity to SEQ ID NO: 86, i.e. it may have at least 80%, at least 87, or 100% sequence identity to SEQ ID NO: 86. Suitably, percent identity is calculated as the percentage of identity to the entire length of the reference sequence (e.g. SEQ ID NO: 86). In other words, appropriate (functional) Vα domain CDR2 amino acid sequences may vary from the sequence shown in SEQ ID NO: 86 by one or several amino acids. As stated previously, the variant may comprise an amino acid substitution such as a conservative amino acid substitution compared to the sequence shown in SEQ ID NO: 86). As stated above, a functional variant of SEQ ID NO: 86 retains the ability to specifically bind to HLA-A*01:01.
In one example, the CDR2 of the Vα domain comprises or consists of the amino acid sequence of SEQ ID NO: 86. In examples where the TCR Vα domain CDR2 has the amino acid sequence of SEQ ID NO: 86, the CDR2 may be encoded by any appropriate nucleic acid sequence.
The encoded TCR Vα domain may therefore comprise the CDRs mentioned in detail above (by SEQ ID specifically i.e. SEQ ID NO:87, SEQ ID NO: 85 and SEQ ID NO: 86, or functional variants thereof), with appropriate intervening sequences between the CDRs.
The encoded TCR Vα domain may comprise an amino acid sequence of SEQ ID NO: 91, or a functional variant thereof (i.e. wherein the variant TCR Vα domain retains the ability to specifically bind to a Jchain antigen (e.g. the peptide shown in SEQ ID NO: 99) when part of a binding protein described herein). Such functional variants may be naturally occurring, synthetic, or synthetically improved functional variants of SEQ ID NO: 91. The term “variant” also encompasses homologues and fragments. Functional variants will typically contain only conservative substitutions of one or more amino acids of SEQ ID NO: 91, or substitution, deletion or insertion of non-critical amino acids in non-critical regions of the protein.
Non-functional variants are amino acid sequence variants of SEQ ID NO: 91 that do not specifically bind to a Jchain antigen (e.g. the peptide shown in SEQ ID NO: 99). Non-functional variants will typically contain a non-conservative substitution, a deletion, or insertion or premature truncation of the amino acid sequence of SEQ ID NO: 91 or a substitution, insertion or deletion in critical amino acids or critical regions. Methods for identifying functional and non-functional variants are well known to a person of ordinary skill in the art.
In one example, the encoded TCR Vα domain may have an amino acid sequence having at least 75%, at least 80%, at least 85% or at least 90% (or at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%) sequence identity to the amino acid sequence of SEQ ID NO: 91, whilst retaining the ability to specifically bind to a Jchain antigen (e.g. the peptide shown in SEQ ID NO:99). In other words, a functional TCR Vα domain with one or several amino acid substitutions compared to the sequence of SEQ ID NO: 91 is also encompassed. As stated previously, the amino acid substitution may be a conservative amino acid substitution. The variability in sequence compared to SEQ ID NO: 91 may all be in regions of the TCR Vα domain that do not form CDRs (i.e. the variant may have the CDRs of SEQ ID NO: 87, SEQ ID NO: 85 and/or SEQ ID NO: 86, and still have 25% (or less) sequence variability compared to SEQ ID NO:91). In other words, the sequence of the CDRs of SEQ ID NO: 91 may be retained whilst the rest of the sequence is varied, as appropriate within the “at least 75% identity” parameters specified above. Suitably, percent identity can be calculated as the percentage of identity to the entire length of the reference sequence (e.g. SEQ ID NO: 91).
As an example, the encoded TCR Vα domain may comprise an amino acid sequence having at least 75% (e.g. at least 75%, at least 80%, at least 85%, at least 90%, at least 95% etc) sequence identity to the amino acid sequence of SEQ ID NO: 91, wherein the TCR Vα domain comprises a CDR3 having an amino acid sequence of SEQ ID NO: 87. In this example, the TCR Vα domain CDR1 may have an amino acid sequence of SEQ ID NO: 85 and the TCR Vα domain CDR2 may have an amino acid sequence of SEQ ID NO: 86.
As another example, the encoded TCR Vα domain may comprise an amino acid sequence having at the amino acid sequence of SEQ ID NO: 91, with 0 to 10 (or 0 to 5) amino acid substitutions, insertions or deletions), wherein the TCR Vα domain comprises a CDR3 having an amino acid sequence of SEQ ID NO: 87. In this example, the TCR Vα domain CDR1 may have an amino acid sequence of SEQ ID NO: 85 and the TCR Vα domain CDR2 may have an amino acid sequence of SEQ ID NO: 86.
In examples where the TCR Vα domain has the amino acid sequence of SEQ ID NO: 91, the TCR Vα domain may be encoded by the nucleic acid sequence of SEQ ID NO: 92, or a genetically degenerate sequence thereof (i.e. other nucleic acid sequences that encode the same protein as a result of the degeneracy of the genetic code).
For the avoidance of doubt, the nucleic acid sequence encoding the TCR Vα domain may also encode a TCR α chain constant domain. Examples of suitable constant domains are generally discussed above.
An example of a specific TCR α chain amino acid sequence that includes a TCR Vα domain described herein with an appropriate constant domain is shown in SEQ ID NO: 95. Appropriate functional variants of SEQ ID NO: 95 are also encompassed (e.g. variants having at least 75% (e.g. at least 75%, at least 80%, at least 85%, at least 90%, at least 95% etc) sequence identity to the amino acid sequence of SEQ ID NO: 95, wherein the variant TCR α chain amino acid sequence retains its ability to specifically bind to a Jchain antigen (e.g. the peptide shown in SEQ ID NO:99) when part of a binding protein described herein). In other words, a functional TCR α chain with one or several amino acid substitutions compared to the sequence of SEQ ID NO: 95 is also encompassed. As stated previously, the amino acid substitution may be a conservative amino acid substitution. The variability in sequence compared to SEQ ID NO: 95 may all be in regions of the TCR α chain that do not form CDRs (i.e. the variant may have the CDRs of SEQ ID NO: 87, SEQ ID NO: 85 and/or SEQ ID NO: 86, and still have 25% (or less) sequence variability compared to SEQ ID NO: 95). In other words, the sequence of the CDRs of SEQ ID NO: 95 may be retained whilst the rest of the sequence is varied, as appropriate within the “at least 75% identity” parameters specified above. Suitably, percent identity can be calculated as the percentage of identity to the entire length of the reference sequence (e.g. SEQ ID NO: 95).
As an example, the encoded TCR α chain may comprise an amino acid sequence having at least 75% (e.g. at least 75%, at least 80%, at least 85%, at least 90%, at least 95% etc) sequence identity to the amino acid sequence of SEQ ID NO: 95, wherein the TCR α chain comprises a CDR3 having an amino acid sequence of SEQ ID NO: 87. In this example, the TCR α chain CDR1 may have an amino acid sequence of SEQ ID NO:85 and the TCR α chain CDR2 may have an amino acid sequence of SEQ ID NO: 86.
In examples where the TCR α chain has the amino acid sequence of SEQ ID NO:95, the TCR α chain may be encoded by the nucleic acid sequence of SEQ ID NO:96, or a genetically degenerate sequence thereof (i.e. other nucleic acid sequences that encode the same protein as a result of the degeneracy of the genetic code). It is noted that SEQ ID NO:96 is the nucleic acid sequence for TCR α chain of clone 13F6.6.
In one example, the nucleic acid composition provided herein comprises a nucleic acid sequence that encodes a TCR Vα domain comprising a CDR3 amino acid sequence having at least 80% sequence identity to SEQ ID NO:87, or a functional fragment thereof.
In another example, the CDR3 of the Vα domain of a nucleic acid composition provided herein comprises or consists of the amino acid sequence of SEQ ID NO: 87.
In another example, the Vα domain of a nucleic acid composition provided herein comprises an amino acid sequence having at least 80% sequence identity to, comprising, or consisting of, SEQ ID NO: 91.
(ii) Vα Domains that Interact with ISDPTSPLRTR (SEQ ID NO: 101)
As provided elsewhere herein, the inventors identified TCR clone 5C8.16 which interacts with ISDPTSPLRTR (SEQ ID NO:101) in the context of HLA-A*03:01. The sequences provided herein that correspond to TCR clone 5C8.16 are SEQ ID NO:s 43 to 56.
An example of an appropriate TCR Vα domain CDR3 amino acid sequence that confers specific binding to a Jchain antigen (e.g. to ISDPTSPLRTR (SEQ ID NO: 101)) is shown in SEQ ID NO: 45. As would be clear to a person of skill in the art, variants of the amino acid sequence shown in SEQ ID NO: 45 may also be functional (i.e. retain their ability to confer specific binding to a Jchain antigen (e.g. to the peptide ISDPTSPLRTR (SEQ ID NO: 101)) when the CDR3 is part of TCR Vα domain). Such functional variants are therefore encompassed herein.
For example, appropriate (functional) Vα domain CDR3 amino acid sequences may have at least 80% sequence identity to SEQ ID NO: 45, i.e. they may have at least 80%, at least 85%, at least 92%, or 100% sequence identity to SEQ ID NO: 45. Suitably, percent identity is calculated as the percentage of identity to the entire length of the reference sequence (e.g. SEQ ID NO: 45). In other words, appropriate (functional) Vα domain CDR3 amino acid sequences may vary from the sequence shown in SEQ ID NO: 45 by one or several (e.g. two etc) amino acids.
As stated above, functional variants of SEQ ID NO: 45 retain their ability to confer specific binding to a Jchain antigen (i.e. the peptide shown in SEQ ID NO: 101) when the CDR3 is part of TCR Vα domain.
Functional variants may be naturally occurring, synthetic, or synthetically improved functional variants of SEQ ID NO: 45. The term “variant” also encompasses homologues and fragments. Functional variants will typically contain only conservative substitutions of one, two or more amino acids of SEQ ID NO: 45, or substitution, deletion or insertion of non-critical amino acids in non-critical regions of the CDR3.
Non-functional variants are amino acid sequence variants of SEQ ID NO: 45 that do not specifically bind to a Jchain antigen (i.e. the peptide shown in SEQ ID NO: 101). Non-functional variants will typically contain a non-conservative substitution, a deletion, or insertion or premature truncation of the amino acid sequence of SEQ ID NO: 45 or a substitution, insertion or deletion in critical amino acids or critical regions. Methods for identifying functional and non-functional variants are well known to a person of ordinary skill in the art.
In one example, the CDR3 of the Vα domain comprises or consists of the amino acid sequence of SEQ ID NO: 45. In examples where the TCR Vα domain CDR3 has the amino acid sequence of SEQ ID NO: 45, the CDR3 may be encoded by any appropriate nucleic acid sequence.
The encoded TCR Vα domain may comprise, in addition to the specified CDR3, a CDR1 comprising an amino acid sequence of SEQ ID NO: 43, or a functional variant thereof (i.e. wherein the variant retains the ability to specifically bind to a Jchain antigen (e.g. the peptide shown in SEQ ID NO: 101)). Such functional variants may be naturally occurring, synthetic, or synthetically improved functional variants of SEQ ID NO: 43. The term “variant” also encompasses homologues and fragments. Functional variants will typically contain only conservative substitutions of one or more amino acids of SEQ ID NO: 43, or substitution, deletion or insertion of non-critical amino acids in non-critical regions of the protein.
Non-functional variants are amino acid sequence variants of SEQ ID NO: 43 that do not specifically bind to a Jchain antigen (e.g. the peptide shown in SEQ ID NO: 101). Non-functional variants will typically contain a non-conservative substitution, a deletion, or insertion or premature truncation of the amino acid sequence of SEQ ID NO: 43 or a substitution, insertion or deletion in critical amino acids or critical regions. Methods for identifying functional and non-functional variants are well known to a person of ordinary skill in the art.
For example, appropriate functional Vα domain CDR1 amino acid sequences may have at least 80% sequence identity to SEQ ID NO: 43, i.e. it may have at least 80%, at least 83%, or 100% sequence identity to SEQ ID NO: 43. Suitably, percent identity is calculated as the percentage of identity to the entire length of the reference sequence (e.g. SEQ ID NO: 43). In other words, appropriate functional Vα domain CDR1 amino acid sequences may vary from the sequence shown in SEQ ID NO: 43 by one or several amino acids. As stated previously, the variant may comprise an amino acid substitution such as a conservative amino acid substitution compared to the sequence shown in SEQ ID NO: 43. As stated above, functional variants of SEQ ID NO: 43 retain the ability to specifically bind to a Jchain antigen (e.g. the peptide shown in SEQ ID NO: 101) when the CDR1 is part of TCR Vα domain).
In one example, the CDR1 of the Vα domain comprises or consists of the amino acid sequence of SEQ ID NO: 43. In examples where the TCR Vα domain CDR1 has the amino acid sequence of SEQ ID NO: 43, the CDR1 may be encoded by any appropriate nucleic acid sequence.
The encoded TCR Vα domain may also comprise, in addition to the specified CDR3 (and optionally the specified CDR1 above), a CDR2 comprising an amino acid sequence of SEQ ID NO: 44, or a functional variant thereof (i.e. wherein the variant retains the ability to specifically bind to HLA-A*03:01). Such functional variants may be naturally occurring, synthetic, or synthetically improved functional variants of SEQ ID NO: 44. The term “variant” also encompasses homologues and fragments. Functional variants will typically contain only conservative substitutions of one or more amino acids of SEQ ID NO: 44, or substitution, deletion or insertion of non-critical amino acids in non-critical regions of the protein.
Non-functional variants are amino acid sequence variants of SEQ ID NO: 44 that do not specifically bind to HLA-A*03:01. Non-functional variants will typically contain a non-conservative substitution, a deletion, or insertion or premature truncation of the amino acid sequence of SEQ ID NO: 44 or a substitution, insertion or deletion in critical amino acids or critical regions. Methods for identifying functional and non-functional variants are well known to a person of ordinary skill in the art.
For example, appropriate functional Vα domain CDR2 amino acid sequences may have at least 80% sequence identity to SEQ ID NO: 44, i.e. it may have at least 80%, at least 87%, or 100% sequence identity to SEQ ID NO: 44. Suitably, percent identity is calculated as the percentage of identity to the entire length of the reference sequence (e.g. SEQ ID NO: 44). In other words, appropriate (functional) Vα domain CDR2 amino acid sequences may vary from the sequence shown in SEQ ID NO: 44 by one or several amino acids. As stated previously, the variant may comprise an amino acid substitution such as a conservative amino acid substitution compared to the sequence shown in SEQ ID NO: 44). As stated above, a functional variant of SEQ ID NO: 44 retains the ability to specifically bind to HLA-A*03:01.
In one example, the CDR2 of the Vα domain comprises or consists of the amino acid sequence of SEQ ID NO: 44. In examples where the TCR Vα domain CDR2 has the amino acid sequence of SEQ ID NO: 44, the CDR2 may be encoded by any appropriate nucleic acid sequence.
The encoded TCR Vα domain may therefore comprise the CDRs mentioned in detail above (by SEQ ID specifically i.e. SEQ ID NO:45, SEQ ID NO: 43 and SEQ ID NO: 44, or functional variants thereof), with appropriate intervening sequences between the CDRs.
The encoded TCR Vα domain may comprise an amino acid sequence of SEQ ID NO: 49, or a functional variant thereof (i.e. wherein the variant TCR Vα domain retains the ability to specifically bind to a Jchain antigen (e.g. the peptide shown in SEQ ID NO: 101) when part of a binding protein described herein). Such functional variants may be naturally occurring, synthetic, or synthetically improved functional variants of SEQ ID NO: 49. The term “variant” also encompasses homologues and fragments. Functional variants will typically contain only conservative substitutions of one or more amino acids of SEQ ID NO: 49, or substitution, deletion or insertion of non-critical amino acids in non-critical regions of the protein.
Non-functional variants are amino acid sequence variants of SEQ ID NO: 49 that do not specifically bind to a Jchain antigen (e.g. the peptide shown in SEQ ID NO: 101). Non-functional variants will typically contain a non-conservative substitution, a deletion, or insertion or premature truncation of the amino acid sequence of SEQ ID NO: 49 or a substitution, insertion or deletion in critical amino acids or critical regions. Methods for identifying functional and non-functional variants are well known to a person of ordinary skill in the art.
In one example, the encoded TCR Vα domain may have an amino acid sequence having at least 75%, at least 80%, at least 85% or at least 90% (or at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%) sequence identity to the amino acid sequence of SEQ ID NO: 49, whilst retaining the ability to specifically bind to a Jchain antigen (e.g. the peptide shown in SEQ ID NO: 101). In other words, a functional TCR Vα domain with one or several amino acid substitutions compared to the sequence of SEQ ID NO: 49 is also encompassed. As stated previously, the amino acid substitution may be a conservative amino acid substitution. The variability in sequence compared to SEQ ID NO: 49 may all be in regions of the TCR Vα domain that do not form CDRs (i.e. the variant may have the CDRs of SEQ ID NO: 45, SEQ ID NO: 43 and/or SEQ ID NO: 44, and still have 25% (or less) sequence variability compared to SEQ ID NO:49). In other words, the sequence of the CDRs of SEQ ID NO: 49 may be retained whilst the rest of the sequence is varied, as appropriate within the “at least 75% identity” parameters specified above. Suitably, percent identity can be calculated as the percentage of identity to the entire length of the reference sequence (e.g. SEQ ID NO: 49).
As an example, the encoded TCR Vα domain may comprise an amino acid sequence having at least 75% (e.g. at least 75%, at least 80%, at least 85%, at least 90%, at least 95% etc) sequence identity to the amino acid sequence of SEQ ID NO: 49, wherein the TCR Vα domain comprises a CDR3 having an amino acid sequence of SEQ ID NO: 45. In this example, the TCR Vα domain CDR1 may have an amino acid sequence of SEQ ID NO: 43 and the TCR Vα domain CDR2 may have an amino acid sequence of SEQ ID NO: 44.
As another example, the encoded TCR Vα domain may comprise an amino acid sequence having at the amino acid sequence of SEQ ID NO: 49, with 0 to 10 (or 0 to 5) amino acid substitutions, insertions or deletions), wherein the TCR Vα domain comprises a CDR3 having an amino acid sequence of SEQ ID NO: 45. In this example, the TCR Vα domain CDR1 may have an amino acid sequence of SEQ ID NO: 43 and the TCR Vα domain CDR2 may have an amino acid sequence of SEQ ID NO: 44.
In examples where the TCR Vα domain has the amino acid sequence of SEQ ID NO: 49, the TCR Vα domain may be encoded by the nucleic acid sequence of SEQ ID NO: 50, or a genetically degenerate sequence thereof (i.e. other nucleic acid sequences that encode the same protein as a result of the degeneracy of the genetic code).
For the avoidance of doubt, the nucleic acid sequence encoding the TCR Vα domain may also encode a TCR α chain constant domain. Examples of suitable constant domains are generally discussed above.
An example of a specific TCR α chain amino acid sequence that includes a TCR Vα domain described herein with an appropriate constant domain is shown in SEQ ID NO: 53. Appropriate functional variants of SEQ ID NO:53 are also encompassed (e.g. variants having at least 75% (e.g. at least 75%, at least 80%, at least 85%, at least 90%, at least 95% etc) sequence identity to the amino acid sequence of SEQ ID NO: 53, wherein the variant TCR α chain amino acid sequence retains its ability to specifically bind to a Jchain antigen (e.g. the peptide shown in SEQ ID NO: 101) when part of a binding protein described herein). In other words, a functional TCR α chain with one or several amino acid substitutions compared to the sequence of SEQ ID NO:53 is also encompassed. As stated previously, the amino acid substitution may be a conservative amino acid substitution. The variability in sequence compared to SEQ ID NO:53 may all be in regions of the TCR α chain that do not form CDRs (i.e. the variant may have the CDRs of SEQ ID NO: 45, SEQ ID NO: 43 and/or SEQ ID NO: 44, and still have 25% (or less) sequence variability compared to SEQ ID NO: 53). In other words, the sequence of the CDRs of SEQ ID NO: 53 may be retained whilst the rest of the sequence is varied, as appropriate within the “at least 75% identity” parameters specified above. Suitably, percent identity can be calculated as the percentage of identity to the entire length of the reference sequence (e.g. SEQ ID NO: 53).
As an example, the encoded TCR α chain may comprise an amino acid sequence having at least 75% (e.g. at least 75%, at least 80%, at least 85%, at least 90%, at least 95% etc) sequence identity to the amino acid sequence of SEQ ID NO: 53, wherein the TCR α chain comprises a CDR3 having an amino acid sequence of SEQ ID NO: 45. In this example, the TCR α chain CDR1 may have an amino acid sequence of SEQ ID NO:43 and the TCR α chain CDR2 may have an amino acid sequence of SEQ ID NO: 44.
In examples where the TCR α chain has the amino acid sequence of SEQ ID NO:53, the TCR α chain may be encoded by the nucleic acid sequence of SEQ ID NO:54, or a genetically degenerate sequence thereof (i.e. other nucleic acid sequences that encode the same protein as a result of the degeneracy of the genetic code). It is noted that SEQ ID NO:54 is the nucleic acid sequence for TCR α chain of clone 5C8.16.
In one example, the nucleic acid composition provided herein comprises a nucleic acid sequence that encodes a TCR Vα domain comprising a CDR3 amino acid sequence having at least 80% sequence identity to SEQ ID NO:45, or a functional fragment thereof.
In another example, the CDR3 of the Vα domain of a nucleic acid composition provided herein comprises or consists of the amino acid sequence of SEQ ID NO: 45.
In another example, the Vα domain of a nucleic acid composition provided herein comprises an amino acid sequence having at least 80% sequence identity to, comprising, or consisting of, SEQ ID NO: 49.
As provided elsewhere herein, the inventors identified TCR clone 16C7.9 which interacts with ISDPTSPLRTR (SEQ ID NO:101) in the context of HLA-A*11:01. The sequences provided herein that correspond to TCR clone 16C7.9 are SEQ ID NO:s 57 to 70.
An example of an appropriate TCR Vα domain CDR3 amino acid sequence that confers specific binding to a Jchain antigen (e.g. to ISDPTSPLRTR (SEQ ID NO: 101)) is shown in SEQ ID NO: 59. As would be clear to a person of skill in the art, variants of the amino acid sequence shown in SEQ ID NO: 59 may also be functional (i.e. retain their ability to confer specific binding to a Jchain antigen (e.g. to the peptide ISDPTSPLRTR (SEQ ID NO: 101)) when the CDR3 is part of TCR Vα domain). Such functional variants are therefore encompassed herein.
For example, appropriate (functional) Vα domain CDR3 amino acid sequences may have at least 80% sequence identity to SEQ ID NO: 59, i.e. they may have at least 80%, at least 86%, at least 93%, or 100% sequence identity to SEQ ID NO: 59. Suitably, percent identity is calculated as the percentage of identity to the entire length of the reference sequence (e.g. SEQ ID NO: 59). In other words, appropriate (functional) Vα domain CDR3 amino acid sequences may vary from the sequence shown in SEQ ID NO: 59 by one or several (e.g. two etc) amino acids.
As stated above, functional variants of SEQ ID NO: 59 retain their ability to confer specific binding to a Jchain antigen (i.e. the peptide shown in SEQ ID NO: 101) when the CDR3 is part of TCR Vα domain.
Functional variants may be naturally occurring, synthetic, or synthetically improved functional variants of SEQ ID NO: 59. The term “variant” also encompasses homologues and fragments. Functional variants will typically contain only conservative substitutions of one, two or more amino acids of SEQ ID NO: 59, or substitution, deletion or insertion of non-critical amino acids in non-critical regions of the CDR3.
Non-functional variants are amino acid sequence variants of SEQ ID NO: 59 that do not specifically bind to a Jchain antigen (i.e. the peptide shown in SEQ ID NO: 101). Non-functional variants will typically contain a non-conservative substitution, a deletion, or insertion or premature truncation of the amino acid sequence of SEQ ID NO: 59 or a substitution, insertion or deletion in critical amino acids or critical regions. Methods for identifying functional and non-functional variants are well known to a person of ordinary skill in the art.
In one example, the CDR3 of the Vα domain comprises or consists of the amino acid sequence of SEQ ID NO: 59. In examples where the TCR Vα domain CDR3 has the amino acid sequence of SEQ ID NO: 59, the CDR3 may be encoded by any appropriate nucleic acid sequence.
The encoded TCR Vα domain may comprise, in addition to the specified CDR3, a CDR1 comprising an amino acid sequence of SEQ ID NO: 57, or a functional variant thereof (i.e. wherein the variant retains the ability to specifically bind to a Jchain antigen (e.g. the peptide shown in SEQ ID NO: 101)). Such functional variants may be naturally occurring, synthetic, or synthetically improved functional variants of SEQ ID NO: 57. The term “variant” also encompasses homologues and fragments. Functional variants will typically contain only conservative substitutions of one or more amino acids of SEQ ID NO: 57, or substitution, deletion or insertion of non-critical amino acids in non-critical regions of the protein.
Non-functional variants are amino acid sequence variants of SEQ ID NO: 57 that do not specifically bind to a Jchain antigen (e.g. the peptide shown in SEQ ID NO: 101). Non-functional variants will typically contain a non-conservative substitution, a deletion, or insertion or premature truncation of the amino acid sequence of SEQ ID NO: 57 or a substitution, insertion or deletion in critical amino acids or critical regions. Methods for identifying functional and non-functional variants are well known to a person of ordinary skill in the art.
For example, appropriate functional Vα domain CDR1 amino acid sequences may have at least 80% sequence identity to SEQ ID NO: 57, i.e. it may have at least 80%, at least 85%, or 100% sequence identity to SEQ ID NO: 57. Suitably, percent identity is calculated as the percentage of identity to the entire length of the reference sequence (e.g. SEQ ID NO: 57). In other words, appropriate functional Vα domain CDR1 amino acid sequences may vary from the sequence shown in SEQ ID NO: 57 by one or several amino acids. As stated previously, the variant may comprise an amino acid substitution such as a conservative amino acid substitution compared to the sequence shown in SEQ ID NO: 57. As stated above, functional variants of SEQ ID NO: 57 retain the ability to specifically bind to a Jchain antigen (e.g. the peptide shown in SEQ ID NO: 101) when the CDR1 is part of TCR Vα domain).
In one example, the CDR1 of the Vα domain comprises or consists of the amino acid sequence of SEQ ID NO: 57. In examples where the TCR Vα domain CDR1 has the amino acid sequence of SEQ ID NO: 57, the CDR1 may be encoded by any appropriate nucleic acid sequence.
The encoded TCR Vα domain may also comprise, in addition to the specified CDR3 (and optionally the specified CDR1 above), a CDR2 comprising an amino acid sequence of SEQ ID NO: 58, or a functional variant thereof (i.e. wherein the variant retains the ability to specifically bind to HLA-A*11:01). Such functional variants may be naturally occurring, synthetic, or synthetically improved functional variants of SEQ ID NO: 58. The term “variant” also encompasses homologues and fragments. Functional variants will typically contain only conservative substitutions of one or more amino acids of SEQ ID NO: 58, or substitution, deletion or insertion of non-critical amino acids in non-critical regions of the protein.
Non-functional variants are amino acid sequence variants of SEQ ID NO: 58 that do not specifically bind to HLA-A*11:01. Non-functional variants will typically contain a non-conservative substitution, a deletion, or insertion or premature truncation of the amino acid sequence of SEQ ID NO: 58 or a substitution, insertion or deletion in critical amino acids or critical regions. Methods for identifying functional and non-functional variants are well known to a person of ordinary skill in the art.
For example, appropriate functional Vα domain CDR2 amino acid sequences may have at least 80% sequence identity to SEQ ID NO: 58, i.e. it may have at least 80%, at least 87%, or 100% sequence identity to SEQ ID NO: 58. Suitably, percent identity is calculated as the percentage of identity to the entire length of the reference sequence (e.g. SEQ ID NO: 58). In other words, appropriate (functional) Vα domain CDR2 amino acid sequences may vary from the sequence shown in SEQ ID NO: 58 by one or several amino acids. As stated previously, the variant may comprise an amino acid substitution such as a conservative amino acid substitution compared to the sequence shown in SEQ ID NO: 58). As stated above, a functional variant of SEQ ID NO: 58 retains the ability to specifically bind to HLA-A*11:01.
In one example, the CDR2 of the Vα domain comprises or consists of the amino acid sequence of SEQ ID NO: 58. In examples where the TCR Vα domain CDR2 has the amino acid sequence of SEQ ID NO: 58, the CDR2 may be encoded by any appropriate nucleic acid sequence.
The encoded TCR Vα domain may therefore comprise the CDRs mentioned in detail above (by SEQ ID specifically i.e. SEQ ID NO:59, SEQ ID NO: 57 and SEQ ID NO: 58, or functional variants thereof), with appropriate intervening sequences between the CDRs.
The encoded TCR Vα domain may comprise an amino acid sequence of SEQ ID NO: 63, or a functional variant thereof (i.e. wherein the variant TCR Vα domain retains the ability to specifically bind to a Jchain antigen (e.g. the peptide shown in SEQ ID NO: 101) when part of a binding protein described herein). Such functional variants may be naturally occurring, synthetic, or synthetically improved functional variants of SEQ ID NO: 63. The term “variant” also encompasses homologues and fragments. Functional variants will typically contain only conservative substitutions of one or more amino acids of SEQ ID NO: 63, or substitution, deletion or insertion of non-critical amino acids in non-critical regions of the protein.
Non-functional variants are amino acid sequence variants of SEQ ID NO: 63 that do not specifically bind to a Jchain antigen (e.g. the peptide shown in SEQ ID NO: 101). Non-functional variants will typically contain a non-conservative substitution, a deletion, or insertion or premature truncation of the amino acid sequence of SEQ ID NO: 63 or a substitution, insertion or deletion in critical amino acids or critical regions. Methods for identifying functional and non-functional variants are well known to a person of ordinary skill in the art.
In one example, the encoded TCR Vα domain may have an amino acid sequence having at least 75%, at least 80%, at least 85% or at least 90% (or at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%) sequence identity to the amino acid sequence of SEQ ID NO: 63, whilst retaining the ability to specifically bind to a Jchain antigen (e.g. the peptide shown in SEQ ID NO: 101). In other words, a functional TCR Vα domain with one or several amino acid substitutions compared to the sequence of SEQ ID NO: 63 is also encompassed. As stated previously, the amino acid substitution may be a conservative amino acid substitution. The variability in sequence compared to SEQ ID NO: 63 may all be in regions of the TCR Vα domain that do not form CDRs (i.e. the variant may have the CDRs of SEQ ID NO: 59, SEQ ID NO: 57 and/or SEQ ID NO: 58, and still have 25% (or less) sequence variability compared to SEQ ID NO:7). In other words, the sequence of the CDRs of SEQ ID NO: 63 may be retained whilst the rest of the sequence is varied, as appropriate within the “at least 75% identity” parameters specified above. Suitably, percent identity can be calculated as the percentage of identity to the entire length of the reference sequence (e.g. SEQ ID NO: 63).
As an example, the encoded TCR Vα domain may comprise an amino acid sequence having at least 75% (e.g. at least 75%, at least 80%, at least 85%, at least 90%, at least 95% etc) sequence identity to the amino acid sequence of SEQ ID NO: 63, wherein the TCR Vα domain comprises a CDR3 having an amino acid sequence of SEQ ID NO: 59. In this example, the TCR Vα domain CDR1 may have an amino acid sequence of SEQ ID NO: 57 and the TCR Vα domain CDR2 may have an amino acid sequence of SEQ ID NO: 58.
As another example, the encoded TCR Vα domain may comprise an amino acid sequence having at the amino acid sequence of SEQ ID NO: 63, with 0 to 10 (or 0 to 5) amino acid substitutions, insertions or deletions), wherein the TCR Vα domain comprises a CDR3 having an amino acid sequence of SEQ ID NO: 59. In this example, the TCR Vα domain CDR1 may have an amino acid sequence of SEQ ID NO: 57 and the TCR Vα domain CDR2 may have an amino acid sequence of SEQ ID NO: 58.
In examples where the TCR Vα domain has the amino acid sequence of SEQ ID NO: 63, the TCR Vα domain may be encoded by the nucleic acid sequence of SEQ ID NO: 64, or a genetically degenerate sequence thereof (i.e. other nucleic acid sequences that encode the same protein as a result of the degeneracy of the genetic code).
For the avoidance of doubt, the nucleic acid sequence encoding the TCR Vα domain may also encode a TCR α chain constant domain. Examples of suitable constant domains are generally discussed above.
An example of a specific TCR α chain amino acid sequence that includes a TCR Vα domain described herein with an appropriate constant domain is shown in SEQ ID NO: 67. Appropriate functional variants of SEQ ID NO:67 are also encompassed (e.g. variants having at least 75% (e.g. at least 75%, at least 80%, at least 85%, at least 90%, at least 95% etc) sequence identity to the amino acid sequence of SEQ ID NO: 67, wherein the variant TCR α chain amino acid sequence retains its ability to specifically bind to a Jchain antigen (e.g. the peptide shown in SEQ ID NO: 101) when part of a binding protein described herein). In other words, a functional TCR α chain with one or several amino acid substitutions compared to the sequence of SEQ ID NO:67 is also encompassed. As stated previously, the amino acid substitution may be a conservative amino acid substitution. The variability in sequence compared to SEQ ID NO:67 may all be in regions of the TCR α chain that do not form CDRs (i.e. the variant may have the CDRs of SEQ ID NO: 59, SEQ ID NO: 57 and/or SEQ ID NO: 58, and still have 25% (or less) sequence variability compared to SEQ ID NO: 67). In otherwords, the sequence of the CDRs of SEQ ID NO: 67 may be retained whilst the rest of the sequence is varied, as appropriate within the “at least 75% identity” parameters specified above. Suitably, percent identity can be calculated as the percentage of identity to the entire length of the reference sequence (e.g. SEQ ID NO: 67).
As an example, the encoded TCR α chain may comprise an amino acid sequence having at least 75% (e.g. at least 75%, at least 80%, at least 85%, at least 90%, at least 95% etc) sequence identity to the amino acid sequence of SEQ ID NO: 67, wherein the TCR α chain comprises a CDR3 having an amino acid sequence of SEQ ID NO: 59. In this example, the TCR α chain CDR1 may have an amino acid sequence of SEQ ID NO:57 and the TCR α chain CDR2 may have an amino acid sequence of SEQ ID NO: 58.
In examples where the TCR α chain has the amino acid sequence of SEQ ID NO: 67, the TCR α chain may be encoded by the nucleic acid sequence of SEQ ID NO: 68, or a genetically degenerate sequence thereof (i.e. other nucleic acid sequences that encode the same protein as a result of the degeneracy of the genetic code). It is noted that SEQ ID NO:68 is the nucleic acid sequence for TCR α chain of clone 16C7.9.
In one example, the nucleic acid composition provided herein comprises a nucleic acid sequence that encodes a TCR Vα domain comprising a CDR3 amino acid sequence having at least 80% sequence identity to SEQ ID NO:59, or a functional fragment thereof.
In another example, the CDR3 of the Vα domain of a nucleic acid composition provided herein comprises or consists of the amino acid sequence of SEQ ID NO: 59.
In another example, the Vα domain of a nucleic acid composition provided herein comprises an amino acid sequence having at least 80% sequence identity to, comprising, or consisting of, SEQ ID NO: 63.
As provided elsewhere herein, the inventors identified TCR clone 13D4.9 which interacts with ISDPTSPLRTR (SEQ ID NO:101) in the context of HLA-A*11:01. The sequences provided herein that correspond to TCR clone 13D4.9 are SEQ ID NO:s 71 to 84.
An example of an appropriate TCR Vα domain CDR3 amino acid sequence that confers specific binding to a Jchain antigen (e.g. to ISDPTSPLRTR (SEQ ID NO: 101)) is shown in SEQ ID NO: 73. As would be clear to a person of skill in the art, variants of the amino acid sequence shown in SEQ ID NO: 73 may also be functional (i.e. retain their ability to confer specific binding to a Jchain antigen (e.g. to the peptide ISDPTSPLRTR (SEQ ID NO: 101)) when the CDR3 is part of TCR Vα domain). Such functional variants are therefore encompassed herein.
For example, appropriate (functional) Vα domain CDR3 amino acid sequences may have at least 80% sequence identity to SEQ ID NO: 73, i.e. they may have at least 80%, at least 85%, at least 92%, or 100% sequence identity to SEQ ID NO: 73. Suitably, percent identity is calculated as the percentage of identity to the entire length of the reference sequence (e.g. SEQ ID NO: 73).
In other words, appropriate (functional) Vα domain CDR3 amino acid sequences may vary from the sequence shown in SEQ ID NO: 73 by one or several (e.g. two etc) amino acids.
As stated above, functional variants of SEQ ID NO: 73 retain their ability to confer specific binding to a Jchain antigen (i.e. the peptide shown in SEQ ID NO: 101) when the CDR3 is part of TCR Vα domain.
Functional variants may be naturally occurring, synthetic, or synthetically improved functional variants of SEQ ID NO: 73. The term “variant” also encompasses homologues and fragments. Functional variants will typically contain only conservative substitutions of one, two or more amino acids of SEQ ID NO: 73, or substitution, deletion or insertion of non-critical amino acids in non-critical regions of the CDR3.
Non-functional variants are amino acid sequence variants of SEQ ID NO: 73 that do not specifically bind to a Jchain antigen (i.e. the peptide shown in SEQ ID NO: 101). Non-functional variants will typically contain a non-conservative substitution, a deletion, or insertion or premature truncation of the amino acid sequence of SEQ ID NO: 73 or a substitution, insertion or deletion in critical amino acids or critical regions. Methods for identifying functional and non-functional variants are well known to a person of ordinary skill in the art.
In one example, the CDR3 of the Vα domain comprises or consists of the amino acid sequence of SEQ ID NO: 73. In examples where the TCR Vα domain CDR3 has the amino acid sequence of SEQ ID NO: 73, the CDR3 may be encoded by any appropriate nucleic acid sequence.
The encoded TCR Vα domain may comprise, in addition to the specified CDR3, a CDR1 comprising an amino acid sequence of SEQ ID NO: 71, or a functional variant thereof (i.e. wherein the variant retains the ability to specifically bind to a Jchain antigen (e.g. the peptide shown in SEQ ID NO: 101)). Such functional variants may be naturally occurring, synthetic, or synthetically improved functional variants of SEQ ID NO: 71. The term “variant” also encompasses homologues and fragments. Functional variants will typically contain only conservative substitutions of one or more amino acids of SEQ ID NO: 71, or substitution, deletion or insertion of non-critical amino acids in non-critical regions of the protein.
Non-functional variants are amino acid sequence variants of SEQ ID NO: 71 that do not specifically bind to a Jchain antigen (e.g. the peptide shown in SEQ ID NO: 101). Non-functional variants will typically contain a non-conservative substitution, a deletion, or insertion or premature truncation of the amino acid sequence of SEQ ID NO: 71 or a substitution, insertion or deletion in critical amino acids or critical regions. Methods for identifying functional and non-functional variants are well known to a person of ordinary skill in the art.
For example, appropriate functional Vα domain CDR1 amino acid sequences may have at least 80% sequence identity to SEQ ID NO: 71, i.e. it may have at least 80%, at least 83%, or 100% sequence identity to SEQ ID NO: 71. Suitably, percent identity is calculated as the percentage of identity to the entire length of the reference sequence (e.g. SEQ ID NO: 71). In other words, appropriate functional Vα domain CDR1 amino acid sequences may vary from the sequence shown in SEQ ID NO: 71 by one or several amino acids. As stated previously, the variant may comprise an amino acid substitution such as a conservative amino acid substitution compared to the sequence shown in SEQ ID NO: 71. As stated above, functional variants of SEQ ID NO: 71 retain the ability to specifically bind to a Jchain antigen (e.g. the peptide shown in SEQ ID NO: 101) when the CDR1 is part of TCR Vα domain).
In one example, the CDR1 of the Vα domain comprises or consists of the amino acid sequence of SEQ ID NO: 71. In examples where the TCR Vα domain CDR1 has the amino acid sequence of SEQ ID NO: 71, the CDR1 may be encoded by any appropriate nucleic acid sequence.
The encoded TCR Vα domain may also comprise, in addition to the specified CDR3 (and optionally the specified CDR1 above), a CDR2 comprising an amino acid sequence of SEQ ID NO: 72, or a functional variant thereof (i.e. wherein the variant retains the ability to specifically bind to HLA-A*11:01). Such functional variants may be naturally occurring, synthetic, or synthetically improved functional variants of SEQ ID NO: 72. The term “variant” also encompasses homologues and fragments. Functional variants will typically contain only conservative substitutions of one or more amino acids of SEQ ID NO: 72, or substitution, deletion or insertion of non-critical amino acids in non-critical regions of the protein.
Non-functional variants are amino acid sequence variants of SEQ ID NO: 72 that do not specifically bind to HLA-A*11:01. Non-functional variants will typically contain a non-conservative substitution, a deletion, or insertion or premature truncation of the amino acid sequence of SEQ ID NO: 72 or a substitution, insertion or deletion in critical amino acids or critical regions. Methods for identifying functional and non-functional variants are well known to a person of ordinary skill in the art.
For example, appropriate functional Vα domain CDR2 amino acid sequences may have at least 80% sequence identity to SEQ ID NO: 72, i.e. it may have at least 80%, at least 85%, or 100% sequence identity to SEQ ID NO: 72. Suitably, percent identity is calculated as the percentage of identity to the entire length of the reference sequence (e.g. SEQ ID NO: 72). In other words, appropriate (functional) Vα domain CDR2 amino acid sequences may vary from the sequence shown in SEQ ID NO: 72 by one or several amino acids. As stated previously, the variant may comprise an amino acid substitution such as a conservative amino acid substitution compared to the sequence shown in SEQ ID NO: 72). As stated above, a functional variant of SEQ ID NO: 72 retains the ability to specifically bind to HLA-A*11:01.
In one example, the CDR2 of the Vα domain comprises or consists of the amino acid sequence of SEQ ID NO: 72. In examples where the TCR Vα domain CDR2 has the amino acid sequence of SEQ ID NO: 72, the CDR2 may be encoded by any appropriate nucleic acid sequence.
The encoded TCR Vα domain may therefore comprise the CDRs mentioned in detail above (by SEQ ID specifically i.e. SEQ ID NO:73, SEQ ID NO: 71 and SEQ ID NO: 72, or functional variants thereof), with appropriate intervening sequences between the CDRs.
The encoded TCR Vα domain may comprise an amino acid sequence of SEQ ID NO: 77, or a functional variant thereof (i.e. wherein the variant TCR Vα domain retains the ability to specifically bind to a Jchain antigen (e.g. the peptide shown in SEQ ID NO: 101) when part of a binding protein described herein). Such functional variants may be naturally occurring, synthetic, or synthetically improved functional variants of SEQ ID NO: 77. The term “variant” also encompasses homologues and fragments. Functional variants will typically contain only conservative substitutions of one or more amino acids of SEQ ID NO: 77, or substitution, deletion or insertion of non-critical amino acids in non-critical regions of the protein.
Non-functional variants are amino acid sequence variants of SEQ ID NO: 77 that do not specifically bind to a Jchain antigen (e.g. the peptide shown in SEQ ID NO: 101). Non-functional variants will typically contain a non-conservative substitution, a deletion, or insertion or premature truncation of the amino acid sequence of SEQ ID NO: 77 or a substitution, insertion or deletion in critical amino acids or critical regions. Methods for identifying functional and non-functional variants are well known to a person of ordinary skill in the art.
In one example, the encoded TCR Vα domain may have an amino acid sequence having at least 75%, at least 80%, at least 85% or at least 90% (or at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%) sequence identity to the amino acid sequence of SEQ ID NO: 77, whilst retaining the ability to specifically bind to a Jchain antigen (e.g. the peptide shown in SEQ ID NO: 101). In other words, a functional TCR Vα domain with one or several amino acid substitutions compared to the sequence of SEQ ID NO: 77 is also encompassed. As stated previously, the amino acid substitution may be a conservative amino acid substitution. The variability in sequence compared to SEQ ID NO: 77 may all be in regions of the TCR Vα domain that do not form CDRs (i.e. the variant may have the CDRs of SEQ ID NO: 73, SEQ ID NO: 71 and/or SEQ ID NO: 72, and still have 25% (or less) sequence variability compared to SEQ ID NO:77). In other words, the sequence of the CDRs of SEQ ID NO: 77 may be retained whilst the rest of the sequence is varied, as appropriate within the “at least 75% identity” parameters specified above. Suitably, percent identity can be calculated as the percentage of identity to the entire length of the reference sequence (e.g. SEQ ID NO: 77).
As an example, the encoded TCR Vα domain may comprise an amino acid sequence having at least 75% (e.g. at least 75%, at least 80%, at least 85%, at least 90%, at least 95% etc) sequence identity to the amino acid sequence of SEQ ID NO: 77, wherein the TCR Vα domain comprises a CDR3 having an amino acid sequence of SEQ ID NO: 73. In this example, the TCR Vα domain CDR1 may have an amino acid sequence of SEQ ID NO: 71 and the TCR Vα domain CDR2 may have an amino acid sequence of SEQ ID NO: 72.
As another example, the encoded TCR Vα domain may comprise an amino acid sequence having at the amino acid sequence of SEQ ID NO: 77, with 0 to 10 (or 0 to 5) amino acid substitutions, insertions or deletions), wherein the TCR Vα domain comprises a CDR3 having an amino acid sequence of SEQ ID NO: 73. In this example, the TCR Vα domain CDR1 may have an amino acid sequence of SEQ ID NO: 71 and the TCR Vα domain CDR2 may have an amino acid sequence of SEQ ID NO: 72.
In examples where the TCR Vα domain has the amino acid sequence of SEQ ID NO: 77, the TCR Vα domain may be encoded by the nucleic acid sequence of SEQ ID NO: 78, or a genetically degenerate sequence thereof (i.e. other nucleic acid sequences that encode the same protein as a result of the degeneracy of the genetic code).
For the avoidance of doubt, the nucleic acid sequence encoding the TCR Vα domain may also encode a TCR α chain constant domain. Examples of suitable constant domains are generally discussed above.
An example of a specific TCR α chain amino acid sequence that includes a TCR Vα domain described herein with an appropriate constant domain is shown in SEQ ID NO: 81. Appropriate functional variants of SEQ ID NO:81 are also encompassed (e.g. variants having at least 75% (e.g. at least 75%, at least 80%, at least 85%, at least 90%, at least 95% etc) sequence identity to the amino acid sequence of SEQ ID NO: 81, wherein the variant TCR α chain amino acid sequence retains its ability to specifically bind to a Jchain antigen (e.g. the peptide shown in SEQ ID NO: 101) when part of a binding protein described herein). In other words, a functional TCR α chain with one or several amino acid substitutions compared to the sequence of SEQ ID NO:81 is also encompassed. As stated previously, the amino acid substitution may be a conservative amino acid substitution. The variability in sequence compared to SEQ ID NO:81 may all be in regions of the TCR α chain that do not form CDRs (i.e. the variant may have the CDRs of SEQ ID NO: 73, SEQ ID NO: 71 and/or SEQ ID NO: 72, and still have 25% (or less) sequence variability compared to SEQ ID NO: 81). In other words, the sequence of the CDRs of SEQ ID NO: 81 may be retained whilst the rest of the sequence is varied, as appropriate within the “at least 75% identity” parameters specified above. Suitably, percent identity can be calculated as the percentage of identity to the entire length of the reference sequence (e.g. SEQ ID NO: 81).
As an example, the encoded TCR α chain may comprise an amino acid sequence having at least 75% (e.g. at least 75%, at least 80%, at least 85%, at least 90%, at least 95% etc) sequence identity to the amino acid sequence of SEQ ID NO: 81, wherein the TCR α chain comprises a CDR3 having an amino acid sequence of SEQ ID NO: 73. In this example, the TCR α chain CDR1 may have an amino acid sequence of SEQ ID NO:71 and the TCR α chain CDR2 may have an amino acid sequence of SEQ ID NO: 72.
In examples where the TCR α chain has the amino acid sequence of SEQ ID NO: 81, the TCR α chain may be encoded by the nucleic acid sequence of SEQ ID NO: 82, or a genetically degenerate sequence thereof (i.e. other nucleic acid sequences that encode the same protein as a result of the degeneracy of the genetic code). It is noted that SEQ ID NO:82 is the nucleic acid sequence for TCR α chain of clone 13D4.9.
In one example, the nucleic acid composition provided herein comprises a nucleic acid sequence that encodes a TCR Vα domain comprising a CDR3 amino acid sequence having at least 80% sequence identity to SEQ ID NO:73, or a functional fragment thereof.
In another example, the CDR3 of the Vα domain of a nucleic acid composition provided herein comprises or consists of the amino acid sequence of SEQ ID NO: 73.
In another example, the Vα domain of a nucleic acid composition provided herein comprises an amino acid sequence having at least 80% sequence identity to, comprising, or consisting of, SEQ ID NO: 77.
(iii) Vα Domains that Interact with CYTAVVPLV (SEQ ID NO: 100)
As provided elsewhere herein, the inventors identified TCR clone 10H11.1 which interacts with CYTAVVPLV (SEQ ID NO:100) in the context of HLA-A*24:02. The sequences provided herein that correspond to TCR clone 10H11.1 are SEQ ID NO:s 29 to 42.
An example of an appropriate TCR Vα domain CDR3 amino acid sequence that confers specific binding to a Jchain antigen (e.g. to CYTAVVPLV (SEQ ID NO: 100)) is shown in SEQ ID NO:31. As would be clear to a person of skill in the art, variants of the amino acid sequence shown in SEQ ID NO:31 may also be functional (i.e. retain their ability to confer specific binding to a Jchain antigen (e.g. to the peptide CYTAVVPLV (SEQ ID NO: 100)) when the CDR3 is part of TCR Vα domain). Such functional variants are therefore encompassed herein.
For example, appropriate (functional) Vα domain CDR3 amino acid sequences may have at least 80% sequence identity to SEQ ID NO: 31, i.e. they may have at least 80%, at least 84%, at least 92%, or 100% sequence identity to SEQ ID NO: 31. Suitably, percent identity is calculated as the percentage of identity to the entire length of the reference sequence (e.g. SEQ ID NO:31). In other words, appropriate (functional) Vα domain CDR3 amino acid sequences may vary from the sequence shown in SEQ ID NO:31 by one or several (e.g. two etc) amino acids.
As stated above, functional variants of SEQ ID NO: 31 retain their ability to confer specific binding to a Jchain antigen (i.e. the peptide shown in SEQ ID NO: 100) when the CDR3 is part of TCR Vα domain.
Functional variants may be naturally occurring, synthetic, or synthetically improved functional variants of SEQ ID NO: 31. The term “variant” also encompasses homologues and fragments. Functional variants will typically contain only conservative substitutions of one, two or more amino acids of SEQ ID NO: 31, or substitution, deletion or insertion of non-critical amino acids in non-critical regions of the CDR3.
Non-functional variants are amino acid sequence variants of SEQ ID NO: 31 that do not specifically bind to a Jchain antigen (i.e. the peptide shown in SEQ ID NO: 100). Non-functional variants will typically contain a non-conservative substitution, a deletion, or insertion or premature truncation of the amino acid sequence of SEQ ID NO: 31 or a substitution, insertion or deletion in critical amino acids or critical regions. Methods for identifying functional and non-functional variants are well known to a person of ordinary skill in the art.
In one example, the CDR3 of the Vα domain comprises or consists of the amino acid sequence of SEQ ID NO: 31. In examples where the TCR Vα domain CDR3 has the amino acid sequence of SEQ ID NO: 31, the CDR3 may be encoded by any appropriate nucleic acid sequence.
The encoded TCR Vα domain may comprise, in addition to the specified CDR3, a CDR1 comprising an amino acid sequence of SEQ ID NO: 29, or a functional variant thereof (i.e. wherein the variant retains the ability to specifically bind to a Jchain antigen (e.g. the peptide shown in SEQ ID NO: 100)). Such functional variants may be naturally occurring, synthetic, or synthetically improved functional variants of SEQ ID NO: 29. The term “variant” also encompasses homologues and fragments. Functional variants will typically contain only conservative substitutions of one or more amino acids of SEQ ID NO: 29, or substitution, deletion or insertion of non-critical amino acids in non-critical regions of the protein.
Non-functional variants are amino acid sequence variants of SEQ ID NO: 29 that do not specifically bind to a Jchain antigen (e.g. the peptide shown in SEQ ID NO: 100). Non-functional variants will typically contain a non-conservative substitution, a deletion, or insertion or premature truncation of the amino acid sequence of SEQ ID NO: 29 or a substitution, insertion or deletion in critical amino acids or critical regions. Methods for identifying functional and non-functional variants are well known to a person of ordinary skill in the art.
For example, appropriate functional Vα domain CDR1 amino acid sequences may have at least 80% sequence identity to SEQ ID NO: 29, i.e. it may have at least 80%, at least 83%, or 100% sequence identity to SEQ ID NO: 29. Suitably, percent identity is calculated as the percentage of identity to the entire length of the reference sequence (e.g. SEQ ID NO: 29). In other words, appropriate functional Vα domain CDR1 amino acid sequences may vary from the sequence shown in SEQ ID NO: 29 by one or several amino acids. As stated previously, the variant may comprise an amino acid substitution such as a conservative amino acid substitution compared to the sequence shown in SEQ ID NO: 29. As stated above, functional variants of SEQ ID NO: 29 retain the ability to specifically bind to a Jchain antigen (e.g. the peptide shown in SEQ ID NO: 100) when the CDR1 is part of TCR Vα domain).
In one example, the CDR1 of the Vα domain comprises or consists of the amino acid sequence of SEQ ID NO: 29. In examples where the TCR Vα domain CDR1 has the amino acid sequence of SEQ ID NO: 29, the CDR1 may be encoded by any appropriate nucleic acid sequence.
The encoded TCR Vα domain may also comprise, in addition to the specified CDR3 (and optionally the specified CDR1 above), a CDR2 comprising an amino acid sequence of SEQ ID NO: 30, or a functional variant thereof (i.e. wherein the variant retains the ability to specifically bind to HLA-A*24:02). Such functional variants may be naturally occurring, synthetic, or synthetically improved functional variants of SEQ ID NO: 30. The term “variant” also encompasses homologues and fragments. Functional variants will typically contain only conservative substitutions of one or more amino acids of SEQ ID NO: 30, or substitution, deletion or insertion of non-critical amino acids in non-critical regions of the protein.
Non-functional variants are amino acid sequence variants of SEQ ID NO: 30 that do not specifically bind to HLA-A*24:02. Non-functional variants will typically contain a non-conservative substitution, a deletion, or insertion or premature truncation of the amino acid sequence of SEQ ID NO: 30 or a substitution, insertion or deletion in critical amino acids or critical regions. Methods for identifying functional and non-functional variants are well known to a person of ordinary skill in the art.
For example, appropriate functional Vα domain CDR2 amino acid sequences may have at least 80% sequence identity to SEQ ID NO: 30, i.e. it may have at least 80%, at least 85%, or 100% sequence identity to SEQ ID NO: 30. Suitably, percent identity is calculated as the percentage of identity to the entire length of the reference sequence (e.g. SEQ ID NO:30). In other words, appropriate (functional) Vα domain CDR2 amino acid sequences may vary from the sequence shown in SEQ ID NO:30 by one or several amino acids. As stated previously, the variant may comprise an amino acid substitution such as a conservative amino acid substitution compared to the sequence shown in SEQ ID NO: 30). As stated above, a functional variant of SEQ ID NO: 30 retains the ability to specifically bind to HLA-A*24:02.
In one example, the CDR2 of the Vα domain comprises or consists of the amino acid sequence of SEQ ID NO: 30. In examples where the TCR Vα domain CDR2 has the amino acid sequence of SEQ ID NO:30, the CDR2 may be encoded by any appropriate nucleic acid sequence.
The encoded TCR Vα domain may therefore comprise the CDRs mentioned in detail above (by SEQ ID specifically i.e. SEQ ID NO: 31, SEQ ID NO: 29 and SEQ ID NO: 30, or functional variants thereof), with appropriate intervening sequences between the CDRs.
The encoded TCR Vα domain may comprise an amino acid sequence of SEQ ID NO: 35, or a functional variant thereof (i.e. wherein the variant TCR Vα domain retains the ability to specifically bind to a Jchain antigen (e.g. the peptide shown in SEQ ID NO: 100) when part of a binding protein described herein). Such functional variants may be naturally occurring, synthetic, or synthetically improved functional variants of SEQ ID NO: 35. The term “variant” also encompasses homologues and fragments. Functional variants will typically contain only conservative substitutions of one or more amino acids of SEQ ID NO: 35, or substitution, deletion or insertion of non-critical amino acids in non-critical regions of the protein.
Non-functional variants are amino acid sequence variants of SEQ ID NO: 35 that do not specifically bind to a Jchain antigen (e.g. the peptide shown in SEQ ID NO: 100). Non-functional variants will typically contain a non-conservative substitution, a deletion, or insertion or premature truncation of the amino acid sequence of SEQ ID NO: 35 or a substitution, insertion or deletion in critical amino acids or critical regions. Methods for identifying functional and non-functional variants are well known to a person of ordinary skill in the art.
In one example, the encoded TCR Vα domain may have an amino acid sequence having at least 75%, at least 80%, at least 85% or at least 90% (or at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%) sequence identity to the amino acid sequence of SEQ ID NO: 35, whilst retaining the ability to specifically bind to a Jchain antigen (e.g. the peptide shown in SEQ ID NO: 100). In other words, a functional TCR Vα domain with one or several amino acid substitutions compared to the sequence of SEQ ID NO: 35 is also encompassed. As stated previously, the amino acid substitution may be a conservative amino acid substitution. The variability in sequence compared to SEQ ID NO: 35 may all be in regions of the TCR Vα domain that do not form CDRs (i.e. the variant may have the CDRs of SEQ ID NO: 31, SEQ ID NO: 29 and/or SEQ ID NO: 30, and still have 25% (or less) sequence variability compared to SEQ ID NO: 35). In other words, the sequence of the CDRs of SEQ ID NO: 35 may be retained whilst the rest of the sequence is varied, as appropriate within the “at least 75% identity” parameters specified above. Suitably, percent identity can be calculated as the percentage of identity to the entire length of the reference sequence (e.g. SEQ ID NO: 35).
As an example, the encoded TCR Vα domain may comprise an amino acid sequence having at least 75% (e.g. at least 75%, at least 80%, at least 85%, at least 90%, at least 95% etc) sequence identity to the amino acid sequence of SEQ ID NO: 35, wherein the TCR Vα domain comprises a CDR3 having an amino acid sequence of SEQ ID NO: 31. In this example, the TCR Vα domain CDR1 may have an amino acid sequence of SEQ ID NO: 29 and the TCR Vα domain CDR2 may have an amino acid sequence of SEQ ID NO: 30.
As another example, the encoded TCR Vα domain may comprise an amino acid sequence having at the amino acid sequence of SEQ ID NO: 35, with 0 to 10 (or 0 to 5) amino acid substitutions, insertions or deletions), wherein the TCR Vα domain comprises a CDR3 having an amino acid sequence of SEQ ID NO: 31. In this example, the TCR Vα domain CDR1 may have an amino acid sequence of SEQ ID NO: 29 and the TCR Vα domain CDR2 may have an amino acid sequence of SEQ ID NO: 30.
In examples where the TCR Vα domain has the amino acid sequence of SEQ ID NO: 35, the TCR Vα domain may be encoded by the nucleic acid sequence of SEQ ID NO: 36, or a genetically degenerate sequence thereof (i.e. other nucleic acid sequences that encode the same protein as a result of the degeneracy of the genetic code).
For the avoidance of doubt, the nucleic acid sequence encoding the TCR Vα domain may also encode a TCR α chain constant domain. Examples of suitable constant domains are generally discussed above.
An example of a specific TCR α chain amino acid sequence that includes a TCR Vα domain described herein with an appropriate constant domain is shown in SEQ ID NO: 39. Appropriate functional variants of SEQ ID NO: 39 are also encompassed (e.g. variants having at least 75% (e.g. at least 75%, at least 80%, at least 85%, at least 90%, at least 95% etc) sequence identity to the amino acid sequence of SEQ ID NO: 39, wherein the variant TCR α chain amino acid sequence retains its ability to specifically bind to a Jchain antigen (e.g. the peptide shown in SEQ ID NO: 100) when part of a binding protein described herein). In other words, a functional TCR α chain with one or several amino acid substitutions compared to the sequence of SEQ ID NO: 39 is also encompassed. As stated previously, the amino acid substitution may be a conservative amino acid substitution. The variability in sequence compared to SEQ ID NO: 39 may all be in regions of the TCR α chain that do not form CDRs (i.e. the variant may have the CDRs of SEQ ID NO: 31, SEQ ID NO: 29 and/or SEQ ID NO: 30, and still have 25% (or less) sequence variability compared to SEQ ID NO: 39). In other words, the sequence of the CDRs of SEQ ID NO: 39 may be retained whilst the rest of the sequence is varied, as appropriate within the “at least 75% identity” parameters specified above. Suitably, percent identity can be calculated as the percentage of identity to the entire length of the reference sequence (e.g. SEQ ID NO: 39).
As an example, the encoded TCR α chain may comprise an amino acid sequence having at least 75% (e.g. at least 75%, at least 80%, at least 85%, at least 90%, at least 95% etc) sequence identity to the amino acid sequence of SEQ ID NO: 39, wherein the TCR α chain comprises a CDR3 having an amino acid sequence of SEQ ID NO: 31. In this example, the TCR α chain CDR1 may have an amino acid sequence of SEQ ID NO: 29 and the TCR α chain CDR2 may have an amino acid sequence of SEQ ID NO: 30.
In examples where the TCR α chain has the amino acid sequence of SEQ ID NO: 39, the TCR α chain may be encoded by the nucleic acid sequence of SEQ ID NO: 40, or a genetically degenerate sequence thereof (i.e. other nucleic acid sequences that encode the same protein as a result of the degeneracy of the genetic code). It is noted that SEQ ID NO:40 is the nucleic acid sequence for TCR α chain of clone 10H11.11
In one example, the nucleic acid composition provided herein comprises a nucleic acid sequence that encodes a TCR Vα domain comprising a CDR3 amino acid sequence having at least 80% sequence identity to SEQ ID NO:31, or a functional fragment thereof.
In another example, the CDR3 of the Vα domain of a nucleic acid composition provided herein comprises or consists of the amino acid sequence of SEQ ID NO: 31.
In another example, the Vα domain of a nucleic acid composition provided herein comprises an amino acid sequence having at least 80% sequence identity to, comprising, or consisting of, SEQ ID NO: 35.
The isolated nucleic acid composition described herein encodes a Jchain antigen-specific binding protein. As discussed herein, the inventors identified several TCRs that specifically bind to a Jchain antigen selected from YTAVVPLVY (SEQ ID NO:99), ISDPTSPLRTR (SEQ ID NO:101) and CYTAVVPLV (SEQ ID NO:100). The sequences for the Vα domains are discussed above, with the corresponding sequences for the Vβ domains discussed below.
(i) Vβ Domains that Interact with YTAVVPLVY (SEQ ID NO: 99)
As provided elsewhere herein, the inventors identified TCR clone 4G8.8 which interacts with YTAVVPLVY (SEQ ID NO:99) in the context of HLA-A*01:01. The sequences provided herein that correspond to TCR clone 4G8.8 are SEQ ID NO:s 1 to 14.
Accordingly, an example of an appropriate TCR Vβ domain CDR3 amino acid sequence that confers specific binding to a Jchain antigen (e.g. to Jchain antigen YTAVVPLVY (SEQ ID NO: 99)) is shown in SEQ ID NO:6. As would be clear to a person of skill in the art, variants of the amino acid sequence shown in SEQ ID NO:6 may also be functional (i.e. retain their ability to confer specific binding to a Jchain antigen (e.g. the peptide shown in SEQ ID NO: 99) when the CDR3 is part of TCR Vβ domain). Such functional variants are therefore encompassed herein.
For example, appropriate (functional) Vβ domain CDR3 amino acid sequences may have at least 80% sequence identity to SEQ ID NO: 6, i.e. they may have at least 80%, at least 84%, at least 92%, or 100% sequence identity to SEQ ID NO: 6. Suitably, percent identity is calculated as the percentage of identity to the entire length of the reference sequence (e.g. SEQ ID NO: 6). In other words, appropriate (functional) Vβ domain CDR3 amino acid sequences may vary from the sequence shown in SEQ ID NO: 6 by one or several (e.g. two) amino acids. As stated above, functional variants of SEQ ID NO: 6 retain their ability to confer specific binding to a Jchain antigen (e.g. the peptide shown in SEQ ID NO: 99) when the CDR3 is part of TCR Vβ domain.
Functional variants may be naturally occurring, synthetic, or synthetically improved functional variants of SEQ ID NO: 6. The term “variant” also encompasses homologues and fragments. Functional variants will typically contain only conservative substitutions of one or more amino acids of SEQ ID NO: 6, or substitution, deletion or insertion of non-critical amino acids in non-critical regions of the CDR3.
Non-functional variants are amino acid sequence variants of SEQ ID NO: 6 that do not specifically bind to a Jchain antigen (e.g. the peptide shown in SEQ ID NO: 99). Non-functional variants will typically contain a non-conservative substitution, a deletion, or insertion or premature truncation of the amino acid sequence of SEQ ID NO: 6 or a substitution, insertion or deletion in critical amino acids or critical regions. Methods for identifying functional and non-functional variants are well known to a person of ordinary skill in the art.
In one example, the CDR3 of the Vβ domain comprises or consists of the amino acid sequence of SEQ ID NO: 6. In examples where the TCR Vβ domain CDR3 has the amino acid sequence of SEQ ID NO:6, the CDR3 may be encoded by Any appropriate nucleic acid sequence.
The encoded TCR Vβ domain may comprise, in addition to the specified CDR3, a CDR1 comprising an amino acid sequence of SEQ ID NO: 4, or a functional variant thereof (i.e. wherein the variant retains the ability to specifically bind to a Jchain antigen (e.g. the peptide shown in SEQ ID NO: 99)). Such functional variants may be naturally occurring, synthetic, or synthetically improved functional variants of SEQ ID NO: 4. The term “variant” also encompasses homologues and fragments. Functional variants will typically contain only conservative substitutions of one or more amino acids of SEQ ID NO: 4, or substitution, deletion or insertion of non-critical amino acids in non-critical regions of the protein.
Non-functional variants are amino acid sequence variants of SEQ ID NO: 4 that do not specifically bind to a Jchain antigen (e.g. the peptide shown in SEQ ID NO: 99). Non-functional variants will typically contain a non-conservative substitution, a deletion, or insertion or premature truncation of the amino acid sequence of SEQ ID NO: 4 or a substitution, insertion or deletion in critical amino acids or critical regions. Methods for identifying functional and non-functional variants are well known to a person of ordinary skill in the art.
For example, appropriate functional Vβ domain CDR1 amino acid sequences may have at least 80% sequence identity to SEQ ID NO: 4, i.e. it may have at least 80%, or 100% sequence identity to SEQ ID NO: 4. Suitably, percent identity is calculated as the percentage of identity to the entire length of the reference sequence (e.g. SEQ ID NO: 4). In other words, appropriate (functional) Vβ domain CDR1 amino acid sequences may vary from the sequence shown in SEQ ID NO:4 by one or several amino acids. As stated previously, the variant may comprise an amino acid substitution such as a conservative amino acid substitution compared to the sequence shown in SEQ ID NO:4). As stated above, functional variants of SEQ ID NO: 4 retain the ability to specifically bind to a Jchain antigen (e.g. the peptide shown in SEQ ID NO: 99) when the CDR1 is part of TCR Vβ domain).
In one example, the CDR1 of the Vβ domain comprises or consists of the amino acid sequence of SEQ ID NO: 4. In examples where the TCR Vα domain CDR1 has the amino acid sequence of SEQ ID NO:4, the CDR1 may be encoded by any appropriate nucleic acid sequence.
The encoded TCR Vβ domain may also comprise, in addition to the specified CDR3 (and optionally the specified CDR1 above), a CDR2 having an amino acid sequence of SEQ ID NO: 5, or a functional variant thereof (i.e. wherein the variant retains the ability to specifically bind to HLA-A*01:01). Such functional variants may be naturally occurring, synthetic, or synthetically improved functional variants of SEQ ID NO: 5. The term “variant” also encompasses homologues and fragments. Functional variants will typically contain only conservative substitutions of one or more amino acids of SEQ ID NO: 5, or substitution, deletion or insertion of non-critical amino acids in non-critical regions of the protein.
Non-functional variants are amino acid sequence variants of SEQ ID NO: 5 that do not specifically bind to HLA-A*01:01. Non-functional variants will typically contain a non-conservative substitution, a deletion, or insertion or premature truncation of the amino acid sequence of SEQ ID NO: 5 or a substitution, insertion or deletion in critical amino acids or critical regions. Methods for identifying functional and non-functional variants are well known to a person of ordinary skill in the art.
For example, appropriate functional Vβ domain CDR2 amino acid sequences may have at least 80% sequence identity to SEQ ID NO: 5, i.e. it may have at least 80%, at least 83%, or 100% sequence identity to SEQ ID NO: 5. Suitably, percent identity is calculated as the percentage of identity to the entire length of the reference sequence (e.g. SEQ ID NO: 5). In other words, appropriate (functional) Vβ domain CDR2 amino acid sequences may vary from the sequence shown in SEQ ID NO: 5 by one or several amino acids. As stated previously, the variant may comprise an amino acid substitution such as a conservative amino acid substitution compared to the sequence shown in SEQ ID NO: 5). As stated above, a functional variant of SEQ ID NO: 5 retains the ability to specifically bind to HLA-A*01:01.
In one example, the CDR2 of the Vβ domain comprises or consists of the amino acid sequence of SEQ ID NO: 5. In examples where the TCR Vβ domain CDR2 has the amino acid sequence of SEQ ID NO:5, the CDR2 may be encoded by any appropriate nucleic acid sequence.
The encoded TCR Vβ domain may therefore comprise the CDRs mentioned in detail above (by SEQ ID specifically i.e. SEQ ID NO:6, SEQ ID NO: 4 and SEQ ID NO: 5, or functional variants thereof), with appropriate intervening sequences between the CDRs.
The encoded TCR Vβ domain may have an amino acid sequence of SEQ ID NO: 9, or a functional variant thereof (i.e. wherein the variant TCR Vβ domain retains the ability to specifically bind to a Jchain antigen (e.g. the peptide shown in SEQ ID NO: 99) when part of a binding protein described herein). Such functional variants may be naturally occurring, synthetic, or synthetically improved functional variants of SEQ ID NO: 9. The term “variant” also encompasses homologues and fragments. Functional variants will typically contain only conservative substitutions of one or more amino acids of SEQ ID NO: 9, or substitution, deletion or insertion of non-critical amino acids in non-critical regions of the protein.
Non-functional variants are amino acid sequence variants of SEQ ID NO: 9 that do not specifically bind to a Jchain antigen (e.g. the peptide shown in SEQ ID NO: 99). Non-functional variants will typically contain a non-conservative substitution, a deletion, or insertion or premature truncation of the amino acid sequence of SEQ ID NO:9 or a substitution, insertion or deletion in critical amino acids or critical regions. Methods for identifying functional and non-functional variants are well known to a person of ordinary skill in the art.
In one example, the encoded TCR Vβ domain may have an amino acid sequence having at least 75%, at least 80%, at least 85% or at least 90% (or at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%) sequence identity to the amino acid sequence of SEQ ID NO: 9, whilst retaining the ability to specifically bind to a Jchain antigen (e.g. the peptide shown in SEQ ID NO: 99). In other words, a functional TCR Vβ domain with one or several amino acid substitutions compared to the sequence of SEQ ID NO: 9 is also encompassed. As stated previously, the amino acid substitution may be a conservative amino acid substitution. The variability in sequence compared to SEQ ID NO:9 may all be in regions of the TCR Vβ domain that do not form CDRs (i.e. the variant may have the CDRs of SEQ ID NO: 6, SEQ ID NO: 4 and/or SEQ ID NO: 5, and still have 25% (or less) sequence variability compared to SEQ ID NO: 9). In otherwords, the sequence of the CDRs of SEQ ID NO: 9 may be retained whilst the rest of the sequence is varied, as appropriate within the “at least 75% identity” parameters specified above. Suitably, percent identity can be calculated as the percentage of identity to the entire length of the reference sequence (e.g. SEQ ID NO: 9).
As an example, the encoded TCR Vβ domain may comprise an amino acid sequence having at least 75% (e.g. at least 75%, at least 80%, at least 85%, at least 90%, at least 95% etc) sequence identity to the amino acid sequence of SEQ ID NO: 9, wherein the TCR Vβ domain comprises a CDR3 having an amino acid sequence of SEQ ID NO: 6. In this example, the TCR Vβ domain CDR1 may have an amino acid sequence of SEQ ID NO:4 and the TCR Vβ domain CDR2 may have an amino acid sequence of SEQ ID NO: 5.
In examples where the TCR Vβ domain has the amino acid sequence of SEQ ID NO:9, the TCR Vβ domain may be encoded by the nucleic acid sequence of SEQ ID NO:10, or a genetically degenerate sequence thereof (i.e. other nucleic acid sequences that encode the same protein as a result of the degeneracy of the genetic code).
For the avoidance of doubt, the nucleic acid sequence encoding the TCR Vβ domain may also encode a TCR β chain constant domain. Examples of suitable constant domains are generally discussed above.
An example of a specific TCR β chain amino acid sequence that includes a TCR Vβ domain described herein and an appropriate constant domain is shown in SEQ ID NO: 13. Appropriate functional variants of SEQ ID NO: 13 are also encompassed (e.g. variants having at least 75% (e.g. at least 75%, at least 80%, at least 85%, at least 90%, at least 95% etc) sequence identity to the amino acid sequence of SEQ ID NO: 13, wherein the variant TCR β chain amino acid sequence retains its ability to specifically bind to a Jchain antigen (e.g. the peptide shown in SEQ ID NO: 99) when part of a binding protein described herein). In other words, a functional TCR β chain with one or several amino acid substitutions compared to the sequence of SEQ ID NO: 13 is also encompassed. As stated previously, the amino acid substitution may be a conservative amino acid substitution. The variability in sequence compared to SEQ ID NO:13 may all be in regions of the TCR β chain that do not form CDRs (i.e. the variant may have the CDRs of SEQ ID NO: 6, SEQ ID NO: 4 and/or SEQ ID NO: 5, and still have 25% (or less) sequence variability compared to SEQ ID NO:13. In other words, the sequence of the CDRs of SEQ ID NO: 13 may be retained whilst the rest of the sequence is varied, as appropriate within the “at least 75% identity” parameters specified above. Suitably, percent identity can be calculated as the percentage of identity to the entire length of the reference sequence (e.g. SEQ ID NO: 13).
As an example, the encoded TCR β chain may comprise an amino acid sequence having at least 75% (e.g. at least 75%, at least 80%, at least 85%, at least 90%, at least 95% etc) sequence identity to the amino acid sequence of SEQ ID NO: 13, wherein the TCR β chain comprises a CDR3 having an amino acid sequence of SEQ ID NO: 6. In this example, the TCR 3 chain CDR1 may have an amino acid sequence of SEQ ID NO: 4 and the TCR β chain CDR2 may have an amino acid sequence of SEQ ID NO: 5.
In examples where the TCR β chain has the amino acid sequence of SEQ ID NO:13, the TCR β chain may be encoded by the nucleic acid sequence of SEQ ID NO:14, or a genetically degenerate sequence thereof (i.e. other nucleic acid sequences that encode the same protein as a result of the degeneracy of the genetic code). It is noted that SEQ ID NO:14 is the nucleic acid sequence for TCR β chain of clone 4G8.8.
In an example, the nucleic acid composition provided herein comprises a nucleic acid sequence that encodes a TCR Vβ domain comprising a CDR3 amino acid sequence having at least 80% sequence identity to SEQ ID NO:6, or a functional fragment thereof.
In another example, the CDR3 of the Vβ domain of a nucleic acid composition provided herein comprises or consists of the amino acid sequence of SEQ ID NO:6.
In a further example, the Vβ domain of a nucleic acid composition provided herein comprises an amino acid sequence having at least 80% sequence identity to, comprising, or consisting of, SEQ ID NO: 9.
The TCR Vβ domain sequences derived from TCR clone 4G8.8 discussed above are particularly compatible with the TCR Vα domain sequences derived from TCR clone 4G8.8 discussed elsewhere herein.
Accordingly, in one example, a nucleic acid composition described herein encodes a Jchain antigen-specific binding protein having TCR Vα domain comprising a CDR3 amino acid sequence having at least 80% sequence identity to SEQ ID NO:3, or a functional fragment thereof; and a nucleic acid sequence that encodes a TCR Vβ domain comprising a CDR3 amino acid sequence having at least 80% sequence identity to SEQ ID NO:6, or a functional fragment thereof.
In a particular example, a nucleic acid composition described herein encodes a Jchain antigen-specific binding protein having a TCR Vα domain with a CDR3 amino acid sequence comprising or consisting of the amino acid sequence of SEQ ID NO: 3; and a TCR Vβ domain with a CDR3 comprising or consisting of the amino acid sequence of SEQ ID NO:6. In addition, the Jchain antigen may comprise or consist of the sequence shown in SEQ ID NO: 99. Furthermore, the TCR Vα domain may be part of a TCR α chain having a constant domain and the TCR Vβ domain may be part of a TCR β chain having a constant domain.
In this particular example, the Vα domain may comprise an amino acid sequence having at least 80% sequence identity to, comprising, or consisting of, SEQ ID NO: 7; and the Vβ domain may comprise an amino acid sequence having at least 80% sequence identity to, comprising, or consisting of, SEQ ID NO: 9. In one example, the Vα domain comprises the amino acid sequence of SEQ ID NO: 7 and the Vβ domain comprises the amino acid sequence of SEQ ID NO: 9. In such cases, the Vα domain may be encoded by a nucleic acid sequence comprising the sequence of SEQ ID NO: 8; and the Vβ domain may be encoded by a nucleic acid sequence comprising the sequence of SEQ ID NO: 10.
In this particular example, the TCR Vα domain may include a CDR1 amino acid sequence comprising or consisting of the amino acid sequence of SEQ ID NO:1 and a CDR2 amino acid sequence comprising or consisting of the amino acid sequence of SEQ ID NO:2. Furthermore, the TCR VB domain may include a CDR1 amino acid sequence comprising or consisting of the amino acid sequence of SEQ ID NO:4 and a CDR2 amino acid sequence comprising or consisting of the amino acid sequence of SEQ ID NO: 5.
For the avoidance of doubt, this particular example encompasses components of TCR clone 4G8.8 exemplified herein. The different components of TCR clone 4G8.8 and their respective SEQ ID Nos are summarised in Table 3 below.
As stated in more detail elsewhere herein, the nucleic acid composition described herein encodes both a TCR Vα domain and a TCR Vβ domain, which form the binding protein that is capable of specifically binding to a Jchain antigen. In examples where the TCR Vα domain and the TCR Vβ domain are encoded by the same nucleic acid sequence, the TCR Vα domain and TCR Vβ domain may be joined together via a linker, e.g. a linker that enables expression of two proteins or polypeptides from the same vector. By way of example, a linker comprising a porcine teschovirus-1 2A (P2A) sequence may be used, such as 2A sequences from foot-and-mouth disease virus (F2A), equine rhinitis A virus (E2A) or Thosea asigna virus (T2A) as published by A. L. Szymczak et al., Nature Biotechnology 22, 589-594 (2004) or 2A-like sequences. 2A and 2A-like sequences are linkers that are cleavable once the nucleic acid molecule has been transcribed and translated. Another example of a linker is an internal ribosomal entry sites (IRES) which enables translation of two proteins or polypeptides from the same transcript. Any other appropriate linker may also be used. As a further example, the nucleic acid sequence encoding the TCR Vα domain and nucleic acid sequence encoding the TCR Vβ domain may be cloned into a vector with dual internal promoters (see e.g. S Jones et al., Human Gene Ther 2009). The identification of appropriate linkers and vectors that enable expression of both the TCR Vα domain and the TCR Vβ domain is well within the routine capabilities of a person of skill in the art.
Additional appropriate polypeptide domains may also be encoded by the nucleic acid sequences that encode the TCR Vα domain and/or the TCR Vβ domain. By way of example only, the nucleic acid sequence may comprise a membrane targeting sequence that provides for transport of the encoded polypeptide to the cell surface membrane of the modified cell. Other appropriate additional domains are well known and are described, for example, in WO2016/071758.
In one example, the nucleic acid composition described herein may encode a soluble TCR. For example, the nucleic acid composition may encode the variable domain of the TCR alpha and beta chains respectively together with an immune-modulator molecule such as a CD3 agonist (e.g. an anti-CD3 scFv). The CD3 antigen is present on mature human T cells, thymocytes and a subset of natural killer cells. It is associated with the TCR and is involved in signal transduction of the TCR. Antibodies specific for the human CD3 antigen are well known. One such antibody is the murine monoclonal antibody OKT3, which is the first monoclonal antibody approved by the FDA. Other antibodies specific for CD3 have also been reported (see e.g. WO2004/106380; U.S. Patent Application Publication No. 2004/0202657; U.S. Pat. No. 6,750,325). Immune mobilising mTCR Against Cancer (ImmTAC; Immunocore Limited, Milton Partk, Abington, Oxon, United Kingdom) are bifunctional proteins that combine affinity monoclonal T cell receptor (mTCR) targeting with a therapeutic mechanism of action (i.e., an anti-CD3 scFv). In another example, a soluble TCR of the invention may be combined with a radioisotope or a toxic drug. Appropriate radioisotopes and/or toxic drugs are well known in the art and are readily identifiable by a person of ordinary skill in the art.
In one example, the nucleic acid composition may encode a chimeric single chain TCR wherein the TCR alpha chain variable domain is linked to the TCR beta chain variable domain and a constant domain which is e.g. fused to the CD3 zeta signalling domain. In this example, the linker is non-cleavable. In an alternative embodiment, the nucleic acid composition may encode a chimeric two chain TCR in which the TCR alpha chain variable domain and the TCR beta chain variable domain are each linked to a CD3 zeta signalling domain or other transmembrane and intracellular domains. Methods for preparing such single chain TCRs and two chain TCRs are well known in the art; see for example RA Willemsen et al, Gene Therapy 2000.
As provided elsewhere herein, the inventors have also identified TCR clone 5D12.9 which interacts with YTAVVPLVY (SEQ ID NO:99) in the context of HLA-A*01:01. The sequences provided herein that correspond to TCR clone 5D12.9 are SEQ ID NO:s 15 to 28.
An example of an appropriate TCR Vβ domain CDR3 amino acid sequence that confers specific binding to a Jchain antigen (e.g. to Jchain antigen YTAVVPLVY (SEQ ID NO: 99)) is shown in SEQ ID NO:20. As would be clear to a person of skill in the art, variants of the amino acid sequence shown in SEQ ID NO:20 may also be functional (i.e. retain their ability to confer specific binding to a Jchain antigen (i.e. the peptide shown in SEQ ID NO: 99) when the CDR3 is part of TCR Vβ domain). Such functional variants are therefore encompassed herein.
For example, appropriate (functional) Vβ domain CDR3 amino acid sequences may have at least 80% sequence identity to SEQ ID NO: 20, i.e. they may have at least 80%, at least 85%, at least 92%, or 100% sequence identity to SEQ ID NO: 20. Suitably, percent identity is calculated as the percentage of identity to the entire length of the reference sequence (e.g. SEQ ID NO: 20). In other words, appropriate (functional) VB domain CDR3 amino acid sequences may vary from the sequence shown in SEQ ID NO: 20 by one or several (e.g. two) amino acids. As stated above, functional variants of SEQ ID NO: 20 retain their ability to confer specific binding to a Jchain antigen (e.g. the peptide shown in SEQ ID NO: 99) when the CDR3 is part of TCR Vβ domain.
Functional variants may be naturally occurring, synthetic, or synthetically improved functional variants of SEQ ID NO: 20. The term “variant” also encompasses homologues and fragments. Functional variants will typically contain only conservative substitutions of one or more amino acids of SEQ ID NO: 20, or substitution, deletion or insertion of non-critical amino acids in non-critical regions of the CDR3.
Non-functional variants are amino acid sequence variants of SEQ ID NO: 20 that do not specifically bind to a Jchain antigen (e.g. the peptide shown in SEQ ID NO: 99). Non-functional variants will typically contain a non-conservative substitution, a deletion, or insertion or premature truncation of the amino acid sequence of SEQ ID NO: 20 or a substitution, insertion or deletion in critical amino acids or critical regions. Methods for identifying functional and non-functional variants are well known to a person of ordinary skill in the art.
In one example, the CDR3 of the Vβ domain comprises or consists of the amino acid sequence of SEQ ID NO: 20. In examples where the TCR Vβ domain CDR3 has the amino acid sequence of SEQ ID NO:20, the CDR3 may be encoded by any appropriate nucleic acid sequence.
The encoded TCR Vβ domain may comprise, in addition to the specified CDR3, a CDR1 comprising an amino acid sequence of SEQ ID NO: 18, or a functional variant thereof (i.e. wherein the variant retains the ability to specifically bind to a Jchain antigen (e.g. the peptide shown in SEQ ID NO: 99)). Such functional variants may be naturally occurring, synthetic, or synthetically improved functional variants of SEQ ID NO: 18. The term “variant” also encompasses homologues and fragments. Functional variants will typically contain only conservative substitutions of one or more amino acids of SEQ ID NO: 18, or substitution, deletion or insertion of non-critical amino acids in non-critical regions of the protein.
Non-functional variants are amino acid sequence variants of SEQ ID NO: 18 that do not specifically bind to a Jchain antigen (e.g. the peptide shown in SEQ ID NO: 99). Non-functional variants will typically contain a non-conservative substitution, a deletion, or insertion or premature truncation of the amino acid sequence of SEQ ID NO: 18 or a substitution, insertion or deletion in critical amino acids or critical regions. Methods for identifying functional and non-functional variants are well known to a person of ordinary skill in the art.
For example, appropriate functional Vβ domain CDR1 amino acid sequences may have at least 80% sequence identity to SEQ ID NO: 18, i.e. it may have at least 80%, or 100% sequence identity to SEQ ID NO: 18. Suitably, percent identity is calculated as the percentage of identity to the entire length of the reference sequence (e.g. SEQ ID NO: 18). In other words, appropriate (functional) Vβ domain CDR1 amino acid sequences may vary from the sequence shown in SEQ ID NO:18 by one or several amino acids. As stated previously, the variant may comprise an amino acid substitution such as a conservative amino acid substitution compared to the sequence shown in SEQ ID NO:18). As stated above, functional variants of SEQ ID NO: 18 retain the ability to specifically bind to a Jchain antigen (e.g. the peptide shown in SEQ ID NO: 99) when the CDR1 is part of TCR Vβ domain).
In one example, the CDR1 of the Vβ domain comprises or consists of the amino acid sequence of SEQ ID NO: 18. In examples where the TCR Vα domain CDR1 has the amino acid sequence of SEQ ID NO:18, the CDR1 may be encoded by any appropriate nucleic acid sequence.
The encoded TCR Vβ domain may also comprise, in addition to the specified CDR3 (and optionally the specified CDR1 above), a CDR2 having an amino acid sequence of SEQ ID NO: 19, or a functional variant thereof (i.e. wherein the variant retains the ability to specifically bind to HLA-A*01:01). Such functional variants may be naturally occurring, synthetic, or synthetically improved functional variants of SEQ ID NO: 19. The term “variant” also encompasses homologues and fragments. Functional variants will typically contain only conservative substitutions of one or more amino acids of SEQ ID NO: 19, or substitution, deletion or insertion of non-critical amino acids in non-critical regions of the protein.
Non-functional variants are amino acid sequence variants of SEQ ID NO: 19 that do not specifically bind to HLA-A*01:01. Non-functional variants will typically contain a non-conservative substitution, a deletion, or insertion or premature truncation of the amino acid sequence of SEQ ID NO: 19 or a substitution, insertion or deletion in critical amino acids or critical regions. Methods for identifying functional and non-functional variants are well known to a person of ordinary skill in the art.
For example, appropriate functional Vβ domain CDR2 amino acid sequences may have at least 80% sequence identity to SEQ ID NO: 19, i.e. it may have at least 80%, at least 83%, or 100% sequence identity to SEQ ID NO: 19. Suitably, percent identity is calculated as the percentage of identity to the entire length of the reference sequence (e.g. SEQ ID NO: 19). In other words, appropriate (functional) VB domain CDR2 amino acid sequences may vary from the sequence shown in SEQ ID NO: 19 by one or several amino acids. As stated previously, the variant may comprise an amino acid substitution such as a conservative amino acid substitution compared to the sequence shown in SEQ ID NO: 19). As stated above, a functional variant of SEQ ID NO: 19 retains the ability to specifically bind to HLA-A*01:01.
In one example, the CDR2 of the Vβ domain comprises or consists of the amino acid sequence of SEQ ID NO: 19. In examples where the TCR Vβ domain CDR2 has the amino acid sequence of SEQ ID NO:19, the CDR2 may be encoded by any appropriate nucleic acid sequence.
The encoded TCR Vβ domain may therefore comprise the CDRs mentioned in detail above (by SEQ ID specifically i.e. SEQ ID NO:20, SEQ ID NO: 18 and SEQ ID NO: 19, or functional variants thereof), with appropriate intervening sequences between the CDRs.
The encoded TCR Vβ domain may have an amino acid sequence of SEQ ID NO: 23, or a functional variant thereof (i.e. wherein the variant TCR Vβ domain retains the ability to specifically bind to a Jchain antigen (e.g. the peptide shown in SEQ ID NO: 99) when part of a binding protein described herein). Such functional variants may be naturally occurring, synthetic, or synthetically improved functional variants of SEQ ID NO: 23. The term “variant” also encompasses homologues and fragments. Functional variants will typically contain only conservative substitutions of one or more amino acids of SEQ ID NO: 23, or substitution, deletion or insertion of non-critical amino acids in non-critical regions of the protein.
Non-functional variants are amino acid sequence variants of SEQ ID NO: 23 that do not specifically bind to a Jchain antigen (e.g. the peptide shown in SEQ ID NO: 99). Non-functional variants will typically contain a non-conservative substitution, a deletion, or insertion or premature truncation of the amino acid sequence of SEQ ID NO:23 or a substitution, insertion or deletion in critical amino acids or critical regions. Methods for identifying functional and non-functional variants are well known to a person of ordinary skill in the art.
In one example, the encoded TCR Vβ domain may have an amino acid sequence having at least 75%, at least 80%, at least 85% or at least 90% (or at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%) sequence identity to the amino acid sequence of SEQ ID NO: 23, whilst retaining the ability to specifically bind to a Jchain antigen (e.g. the peptide shown in SEQ ID NO: 99). In other words, a functional TCR Vβ domain with one or several amino acid substitutions compared to the sequence of SEQ ID NO: 23 is also encompassed. As stated previously, the amino acid substitution may be a conservative amino acid substitution. The variability in sequence compared to SEQ ID NO:23 may all be in regions of the TCR Vβ domain that do not form CDRs (i.e. the variant may have the CDRs of SEQ ID NO: 20, SEQ ID NO: 18 and/or SEQ ID NO: 19, and still have 25% (or less) sequence variability compared to SEQ ID NO: 23). In other words, the sequence of the CDRs of SEQ ID NO: 23 may be retained whilst the rest of the sequence is varied, as appropriate within the “at least 75% identity” parameters specified above. Suitably, percent identity can be calculated as the percentage of identity to the entire length of the reference sequence (e.g. SEQ ID NO: 23).
As an example, the encoded TCR Vβ domain may comprise an amino acid sequence having at least 75% (e.g. at least 75%, at least 80%, at least 85%, at least 90%, at least 95% etc) sequence identity to the amino acid sequence of SEQ ID NO: 23, wherein the TCR Vβ domain comprises a CDR3 having an amino acid sequence of SEQ ID NO: 20. In this example, the TCR Vβ domain CDR1 may have an amino acid sequence of SEQ ID NO:18 and the TCR Vβ domain CDR2 may have an amino acid sequence of SEQ ID NO: 19.
In examples where the TCR Vβ domain has the amino acid sequence of SEQ ID NO:23, the TCR Vβ domain may be encoded by the nucleic acid sequence of SEQ ID NO:24, or a genetically degenerate sequence thereof (i.e. other nucleic acid sequences that encode the same protein as a result of the degeneracy of the genetic code).
For the avoidance of doubt, the nucleic acid sequence encoding the TCR Vβ domain may also encode a TCR β chain constant domain. Examples of suitable constant domains are generally discussed above.
An example of a specific TCR β chain amino acid sequence that includes a TCR Vβ domain described herein and an appropriate constant domain is shown in SEQ ID NO: 27. Appropriate functional variants of SEQ ID NO: 27 are also encompassed (e.g. variants having at least 75% (e.g. at least 75%, at least 80%, at least 85%, at least 90%, at least 95% etc) sequence identity to the amino acid sequence of SEQ ID NO: 27, wherein the variant TCR β chain amino acid sequence retains its ability to specifically bind to a Jchain antigen (e.g. the peptide shown in SEQ ID NO: 99) when part of a binding protein described herein). In other words, a functional TCR β chain with one or several amino acid substitutions compared to the sequence of SEQ ID NO: 27 is also encompassed. As stated previously, the amino acid substitution may be a conservative amino acid substitution. The variability in sequence compared to SEQ ID NO:27 may all be in regions of the TCR B chain that do not form CDRs (i.e. the variant may have the CDRs of SEQ ID NO: 20, SEQ ID NO: 18 and/or SEQ ID NO: 19, and still have 25% (or less) sequence variability compared to SEQ ID NO:27. In other words, the sequence of the CDRs of SEQ ID NO: 27 may be retained whilst the rest of the sequence is varied, as appropriate within the “at least 75% identity” parameters specified above. Suitably, percent identity can be calculated as the percentage of identity to the entire length of the reference sequence (e.g. SEQ ID NO: 27).
As an example, the encoded TCR β chain may comprise an amino acid sequence having at least 75% (e.g. at least 75%, at least 80%, at least 85%, at least 90%, at least 95% etc) sequence identity to the amino acid sequence of SEQ ID NO: 27, wherein the TCR β chain comprises a CDR3 having an amino acid sequence of SEQ ID NO: 20. In this example, the TCR β chain CDR1 may have an amino acid sequence of SEQ ID NO: 18 and the TCR β chain CDR2 may have an amino acid sequence of SEQ ID NO: 19.
In examples where the TCR β chain has the amino acid sequence of SEQ ID NO:27, the TCR β chain may be encoded by the nucleic acid sequence of SEQ ID NO:28, or a genetically degenerate sequence thereof (i.e. other nucleic acid sequences that encode the same protein as a result of the degeneracy of the genetic code). It is noted that SEQ ID NO:28 is the nucleic acid sequence for TCR β chain of clone 5D12.9.
In an example, the nucleic acid composition provided herein comprises a nucleic acid sequence that encodes a TCR Vβ domain comprising a CDR3 amino acid sequence having at least 80% sequence identity to SEQ ID NO:20, or a functional fragment thereof.
In another example, the CDR3 of the Vβ domain of a nucleic acid composition provided herein comprises or consists of the amino acid sequence of SEQ ID NO:20.
In a further example, the Vβ domain of a nucleic acid composition provided herein comprises an amino acid sequence having at least 80% sequence identity to, comprising, or consisting of, SEQ ID NO: 23.
The TCR Vβ domain sequences derived from TCR clone 5D12.9 discussed above are particularly compatible with the TCR Vα domain sequences derived from TCR clone 5D12.9 discussed elsewhere herein.
Accordingly, in one example, a nucleic acid composition described herein encodes a Jchain antigen-specific binding protein having TCR Vα domain comprising a CDR3 amino acid sequence having at least 80% sequence identity to SEQ ID NO:17, or a functional fragment thereof; and a nucleic acid sequence that encodes a TCR Vβ domain comprising a CDR3 amino acid sequence having at least 80% sequence identity to SEQ ID NO:20, or a functional fragment thereof.
In a particular example, a nucleic acid composition described herein encodes a Jchain antigen-specific binding protein having a TCR Vα domain with a CDR3 amino acid sequence comprising or consisting of the amino acid sequence of SEQ ID NO: 17; and a TCR Vβ domain with a CDR3 comprising or consisting of the amino acid sequence of SEQ ID NO:20. In addition, the Jchain antigen may comprise or consist of the sequence shown in SEQ ID NO: 99. Furthermore, the TCR Vα domain may be part of a TCR α chain having a constant domain and the TCR Vβ domain may be part of a TCR β chain having a constant domain.
In this particular example, the Vα domain may comprise an amino acid sequence having at least 80% sequence identity to, comprising, or consisting of, SEQ ID NO: 21; and the Vβ domain may comprise an amino acid sequence having at least 80% sequence identity to, comprising, or consisting of, SEQ ID NO: 23. In one example, the Vα domain comprises the amino acid sequence of SEQ ID NO: 21 and the Vβ domain comprises the amino acid sequence of SEQ ID NO: 23. In such cases, the Vα domain may be encoded by a nucleic acid sequence comprising the sequence of SEQ ID NO: 22; and the Vβ domain may be encoded by a nucleic acid sequence comprising the sequence of SEQ ID NO: 24.
In this particular example, the TCR Vα domain may include a CDR1 amino acid sequence comprising or consisting of the amino acid sequence of SEQ ID NO:15 and a CDR2 amino acid sequence comprising or consisting of the amino acid sequence of SEQ ID NO:16. Furthermore, the TCR Vβ domain may include a CDR1 amino acid sequence comprising or consisting of the amino acid sequence of SEQ ID NO:18 and a CDR2 amino acid sequence comprising or consisting of the amino acid sequence of SEQ ID NO: 19.
For the avoidance of doubt, this particular example encompasses components of TCR clone 5D12.9 exemplified herein. The different components of TCR clone 5D12.9 and their respective SEQ ID Nos are summarised in Table 4 below.
As stated in more detail elsewhere herein, the nucleic acid composition described herein encodes both a TCR Vα domain and a TCR Vβ domain, which form the binding protein that is capable of specifically binding to a Jchain antigen. In examples where the TCR Vα domain and the TCR Vβ domain are encoded by the same nucleic acid sequence, the TCR Vα domain and TCR Vβ domain may be joined together via a linker. Suitable linkers are discussed generally elsewhere herein. Additional appropriate polypeptide domains that may also be encoded by the nucleic acid sequences that encode the TCR Vα domain and/or the TCR Vβ domain are also discussed generally elsewhere herein.
In one example, the nucleic acid composition described herein may encode a soluble TCR or a chimeric single chain TCR wherein the TCR alpha chain variable domain is linked to the TCR beta chain variable domain and a constant domain which is e.g. fused to the CD3 zeta signalling domain. These are discussed generally in more detail elsewhere herein.
As provided elsewhere herein, the inventors identified TCR clone 13F6.6 which interacts with YTAVVPLVY (SEQ ID NO:99) in the context of HLA-A*01:01. The sequences provided herein that correspond to TCR clone 13F6.6 are SEQ ID NO:s 85 to 98.
An example of an appropriate TCR Vβ domain CDR3 amino acid sequence that confers specific binding to a Jchain antigen (e.g. to Jchain antigen YTAVVPLVY (SEQ ID NO: 99)) is shown in SEQ ID NO:90. As would be clear to a person of skill in the art, variants of the amino acid sequence shown in SEQ ID NO:90 may also be functional (i.e. retain their ability to confer specific binding to a Jchain antigen (i.e. the peptide shown in SEQ ID NO: 99) when the CDR3 is part of TCR Vβ domain). Such functional variants are therefore encompassed herein.
For example, appropriate (functional) Vβ domain CDR3 amino acid sequences may have at least 80% sequence identity to SEQ ID NO: 90, i.e. they may have at least 80%, at least 86%, at least 93%, or 100% sequence identity to SEQ ID NO: 90. Suitably, percent identity is calculated as the percentage of identity to the entire length of the reference sequence (e.g. SEQ ID NO: 90). In other words, appropriate (functional) Vβ domain CDR3 amino acid sequences may vary from the sequence shown in SEQ ID NO: 90 by one or several (e.g. two) amino acids. As stated above, functional variants of SEQ ID NO: 90 retain their ability to confer specific binding to a Jchain antigen (e.g. the peptide shown in SEQ ID NO: 99) when the CDR3 is part of TCR Vβ domain.
Functional variants may be naturally occurring, synthetic, or synthetically improved functional variants of SEQ ID NO: 90. The term “variant” also encompasses homologues and fragments. Functional variants will typically contain only conservative substitutions of one or more amino acids of SEQ ID NO: 90, or substitution, deletion or insertion of non-critical amino acids in non-critical regions of the CDR3.
Non-functional variants are amino acid sequence variants of SEQ ID NO: 90 that do not specifically bind to a Jchain antigen (e.g. the peptide shown in SEQ ID NO: 99). Non-functional variants will typically contain a non-conservative substitution, a deletion, or insertion or premature truncation of the amino acid sequence of SEQ ID NO: 90 or a substitution, insertion or deletion in critical amino acids or critical regions. Methods for identifying functional and non-functional variants are well known to a person of ordinary skill in the art.
In one example, the CDR3 of the Vβ domain comprises or consists of the amino acid sequence of SEQ ID NO: 90. In examples where the TCR Vβ domain CDR3 has the amino acid sequence of SEQ ID NO:90, the CDR3 may be encoded by Any appropriate nucleic acid sequence.
The encoded TCR Vβ domain may comprise, in addition to the specified CDR3, a CDR1 comprising an amino acid sequence of SEQ ID NO: 88, or a functional variant thereof (i.e. wherein the variant retains the ability to specifically bind to a Jchain antigen (e.g. the peptide shown in SEQ ID NO: 99)). Such functional variants may be naturally occurring, synthetic, or synthetically improved functional variants of SEQ ID NO: 88. The term “variant” also encompasses homologues and fragments. Functional variants will typically contain only conservative substitutions of one or more amino acids of SEQ ID NO: 88, or substitution, deletion or insertion of non-critical amino acids in non-critical regions of the protein.
Non-functional variants are amino acid sequence variants of SEQ ID NO: 88 that do not specifically bind to a Jchain antigen (e.g. the peptide shown in SEQ ID NO: 99). Non-functional variants will typically contain a non-conservative substitution, a deletion, or insertion or premature truncation of the amino acid sequence of SEQ ID NO: 88 or a substitution, insertion or deletion in critical amino acids or critical regions. Methods for identifying functional and non-functional variants are well known to a person of ordinary skill in the art.
For example, appropriate functional Vβ domain CDR1 amino acid sequences may have at least 80% sequence identity to SEQ ID NO: 88, i.e. it may have at least 80%, or 100% sequence identity to SEQ ID NO: 88. Suitably, percent identity is calculated as the percentage of identity to the entire length of the reference sequence (e.g. SEQ ID NO: 88). In other words, appropriate (functional) Vβ domain CDR1 amino acid sequences may vary from the sequence shown in SEQ ID NO:88 by one or several amino acids. As stated previously, the variant may comprise an amino acid substitution such as a conservative amino acid substitution compared to the sequence shown in SEQ ID NO:88). As stated above, functional variants of SEQ ID NO: 88 retain the ability to specifically bind to a Jchain antigen (e.g. the peptide shown in SEQ ID NO: 99) when the CDR1 is part of TCR Vβ domain).
In one example, the CDR1 of the Vβ domain comprises or consists of the amino acid sequence of SEQ ID NO: 88. In examples where the TCR Vα domain CDR1 has the amino acid sequence of SEQ ID NO:88, the CDR1 may be encoded by any appropriate nucleic acid sequence.
The encoded TCR Vβ domain may also comprise, in addition to the specified CDR3 (and optionally the specified CDR1 above), a CDR2 having an amino acid sequence of SEQ ID NO: 89, or a functional variant thereof (i.e. wherein the variant retains the ability to specifically bind to HLA-A*01:01). Such functional variants may be naturally occurring, synthetic, or synthetically improved functional variants of SEQ ID NO: 89. The term “variant” also encompasses homologues and fragments. Functional variants will typically contain only conservative substitutions of one or more amino acids of SEQ ID NO: 89, or substitution, deletion or insertion of non-critical amino acids in non-critical regions of the protein.
Non-functional variants are amino acid sequence variants of SEQ ID NO: 89 that do not specifically bind to HLA-A*01:01. Non-functional variants will typically contain a non-conservative substitution, a deletion, or insertion or premature truncation of the amino acid sequence of SEQ ID NO: 89 or a substitution, insertion or deletion in critical amino acids or critical regions. Methods for identifying functional and non-functional variants are well known to a person of ordinary skill in the art.
For example, appropriate functional Vβ domain CDR2 amino acid sequences may have at least 80% sequence identity to SEQ ID NO: 89, i.e. it may have at least 80%, at least 83%, or 100% sequence identity to SEQ ID NO: 89. Suitably, percent identity is calculated as the percentage of identity to the entire length of the reference sequence (e.g. SEQ ID NO: 89). In other words, appropriate (functional) Vβ domain CDR2 amino acid sequences may vary from the sequence shown in SEQ ID NO: 89 by one or several amino acids. As stated previously, the variant may comprise an amino acid substitution such as a conservative amino acid substitution compared to the sequence shown in SEQ ID NO: 89). As stated above, a functional variant of SEQ ID NO: 89 retains the ability to specifically bind to HLA-A*01:01.
In one example, the CDR2 of the Vβ domain comprises or consists of the amino acid sequence of SEQ ID NO: 89. In examples where the TCR Vβ domain CDR2 has the amino acid sequence of SEQ ID NO:89, the CDR2 may be encoded by any appropriate nucleic acid sequence.
The encoded TCR Vβ domain may therefore comprise the CDRs mentioned in detail above (by SEQ ID specifically i.e. SEQ ID NO:90, SEQ ID NO: 88 and SEQ ID NO: 89, or functional variants thereof), with appropriate intervening sequences between the CDRs.
The encoded TCR Vβ domain may have an amino acid sequence of SEQ ID NO: 93, or a functional variant thereof (i.e. wherein the variant TCR Vβ domain retains the ability to specifically bind to a Jchain antigen (e.g. the peptide shown in SEQ ID NO: 99) when part of a binding protein described herein). Such functional variants may be naturally occurring, synthetic, or synthetically improved functional variants of SEQ ID NO: 93. The term “variant” also encompasses homologues and fragments. Functional variants will typically contain only conservative substitutions of one or more amino acids of SEQ ID NO: 93, or substitution, deletion or insertion of non-critical amino acids in non-critical regions of the protein.
Non-functional variants are amino acid sequence variants of SEQ ID NO: 93 that do not specifically bind to a Jchain antigen (e.g. the peptide shown in SEQ ID NO: 99). Non-functional variants will typically contain a non-conservative substitution, a deletion, or insertion or premature truncation of the amino acid sequence of SEQ ID NO:93 or a substitution, insertion or deletion in critical amino acids or critical regions. Methods for identifying functional and non-functional variants are well known to a person of ordinary skill in the art.
In one example, the encoded TCR Vβ domain may have an amino acid sequence having at least 75%, at least 80%, at least 85% or at least 90% (or at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%) sequence identity to the amino acid sequence of SEQ ID NO: 93, whilst retaining the ability to specifically bind to a Jchain antigen (e.g. the peptide shown in SEQ ID NO: 99). In other words, a functional TCR Vβ domain with one or several amino acid substitutions compared to the sequence of SEQ ID NO: 93 is also encompassed. As stated previously, the amino acid substitution may be a conservative amino acid substitution. The variability in sequence compared to SEQ ID NO:93 may all be in regions of the TCR Vβ domain that do not form CDRs (i.e. the variant may have the CDRs of SEQ ID NO: 90, SEQ ID NO: 88 and/or SEQ ID NO: 89, and still have 25% (or less) sequence variability compared to SEQ ID NO: 93). In other words, the sequence of the CDRs of SEQ ID NO: 93 may be retained whilst the rest of the sequence is varied, as appropriate within the “at least 75% identity” parameters specified above. Suitably, percent identity can be calculated as the percentage of identity to the entire length of the reference sequence (e.g. SEQ ID NO: 93).
As an example, the encoded TCR Vβ domain may comprise an amino acid sequence having at least 75% (e.g. at least 75%, at least 80%, at least 85%, at least 90%, at least 95% etc) sequence identity to the amino acid sequence of SEQ ID NO: 93, wherein the TCR Vβ domain comprises a CDR3 having an amino acid sequence of SEQ ID NO: 90. In this example, the TCR Vβ domain CDR1 may have an amino acid sequence of SEQ ID NO:88 and the TCR Vβ domain CDR2 may have an amino acid sequence of SEQ ID NO: 89.
In examples where the TCR Vβ domain has the amino acid sequence of SEQ ID NO:93, the TCR Vβ domain may be encoded by the nucleic acid sequence of SEQ ID NO:94, or a genetically degenerate sequence thereof (i.e. other nucleic acid sequences that encode the same protein as a result of the degeneracy of the genetic code).
For the avoidance of doubt, the nucleic acid sequence encoding the TCR Vβ domain may also encode a TCR β chain constant domain. Examples of suitable constant domains are generally discussed above.
An example of a specific TCR β chain amino acid sequence that includes a TCR Vβ domain described herein and an appropriate constant domain is shown in SEQ ID NO: 97. Appropriate functional variants of SEQ ID NO: 97 are also encompassed (e.g. variants having at least 75% (e.g. at least 75%, at least 80%, at least 85%, at least 90%, at least 95% etc) sequence identity to the amino acid sequence of SEQ ID NO: 97, wherein the variant TCR β chain amino acid sequence retains its ability to specifically bind to a Jchain antigen (e.g. the peptide shown in SEQ ID NO: 99) when part of a binding protein described herein). In other words, a functional TCR β chain with one or several amino acid substitutions compared to the sequence of SEQ ID NO: 97 is also encompassed. As stated previously, the amino acid substitution may be a conservative amino acid substitution. The variability in sequence compared to SEQ ID NO:97 may all be in regions of the TCR β chain that do not form CDRs (i.e. the variant may have the CDRs of SEQ ID NO: 90, SEQ ID NO: 88 and/or SEQ ID NO: 89, and still have 25% (or less) sequence variability compared to SEQ ID NO:97. In other words, the sequence of the CDRs of SEQ ID NO: 97 may be retained whilst the rest of the sequence is varied, as appropriate within the “at least 75% identity” parameters specified above. Suitably, percent identity can be calculated as the percentage of identity to the entire length of the reference sequence (e.g. SEQ ID NO: 97).
As an example, the encoded TCR β chain may comprise an amino acid sequence having at least 75% (e.g. at least 75%, at least 80%, at least 85%, at least 90%, at least 95% etc) sequence identity to the amino acid sequence of SEQ ID NO: 97, wherein the TCR B chain comprises a CDR3 having an amino acid sequence of SEQ ID NO: 90. In this example, the TCR β chain CDR1 may have an amino acid sequence of SEQ ID NO: 88 and the TCR β chain CDR2 may have an amino acid sequence of SEQ ID NO: 89.
In examples where the TCR β chain has the amino acid sequence of SEQ ID NO:97, the TCR β chain may be encoded by the nucleic acid sequence of SEQ ID NO:98, or a genetically degenerate sequence thereof (i.e. other nucleic acid sequences that encode the same protein as a result of the degeneracy of the genetic code). It is noted that SEQ ID NO:98 is the nucleic acid sequence for TCR β chain of clone 13F6.6.
In an example, the nucleic acid composition provided herein comprises a nucleic acid sequence that encodes a TCR Vβ domain comprising a CDR3 amino acid sequence having at least 80% sequence identity to SEQ ID NO:90, or a functional fragment thereof.
In another example, the CDR3 of the Vβ domain of a nucleic acid composition provided herein comprises or consists of the amino acid sequence of SEQ ID NO:90.
In a further example, the Vβ domain of a nucleic acid composition provided herein comprises an amino acid sequence having at least 80% sequence identity to, comprising, or consisting of, SEQ ID NO: 93.
The TCR Vβ domain sequences derived from TCR clone 13F6.6 discussed above are particularly compatible with the TCR Vα domain sequences derived from TCR clone 13F6.6 discussed elsewhere herein.
Accordingly, in one example, a nucleic acid composition described herein encodes a Jchain antigen-specific binding protein having TCR Vα domain comprising a CDR3 amino acid sequence having at least 80% sequence identity to SEQ ID NO:87, or a functional fragment thereof; and a nucleic acid sequence that encodes a TCR Vβ domain comprising a CDR3 amino acid sequence having at least 80% sequence identity to SEQ ID NO:90, or a functional fragment thereof.
In a particular example, a nucleic acid composition described herein encodes a Jchain antigen-specific binding protein having a TCR Vα domain with a CDR3 amino acid sequence comprising or consisting of the amino acid sequence of SEQ ID NO: 87; and a TCR Vβ domain with a CDR3 comprising or consisting of the amino acid sequence of SEQ ID NO:90. In addition, the Jchain antigen may comprise or consist of the sequence shown in SEQ ID NO: 99. Furthermore, the TCR Vα domain may be part of a TCR α chain having a constant domain and the TCR Vβ domain may be part of a TCR β chain having a constant domain.
In this particular example, the Vα domain may comprise an amino acid sequence having at least 80% sequence identity to, comprising, or consisting of, SEQ ID NO: 91; and the Vβ domain may comprise an amino acid sequence having at least 80% sequence identity to, comprising, or consisting of, SEQ ID NO: 93. In one example, the Vα domain comprises the amino acid sequence of SEQ ID NO: 91 and the Vβ domain comprises the amino acid sequence of SEQ ID NO: 93. In such cases, the Vα domain may be encoded by a nucleic acid sequence comprising the sequence of SEQ ID NO: 92; and the Vβ domain may be encoded by a nucleic acid sequence comprising the sequence of SEQ ID NO: 94.
In this particular example, the TCR Vα domain may include a CDR1 amino acid sequence comprising or consisting of the amino acid sequence of SEQ ID NO:85 and a CDR2 amino acid sequence comprising or consisting of the amino acid sequence of SEQ ID NO:86. Furthermore, the TCR Vβ domain may include a CDR1 amino acid sequence comprising or consisting of the amino acid sequence of SEQ ID NO:88 and a CDR2 amino acid sequence comprising or consisting of the amino acid sequence of SEQ ID NO: 89.
For the avoidance of doubt, this particular example encompasses components of TCR clone 13F6.6 exemplified herein. The different components of TCR clone 13F6.6 and their respective SEQ ID Nos are summarised in Table 9 below.
As stated in more detail elsewhere herein, the nucleic acid composition described herein encodes both a TCR Vα domain and a TCR Vβ domain, which form the binding protein that is capable of specifically binding to a Jchain antigen. In examples where the TCR Vα domain and the TCR Vβ domain are encoded by the same nucleic acid sequence, the TCR Vα domain and TCR Vβ domain may be joined together via a linker. Suitable linkers are discussed generally elsewhere herein. Additional appropriate polypeptide domains that may also be encoded by the nucleic acid sequences that encode the TCR Vα domain and/or the TCR Vβ domain are also discussed generally elsewhere herein.
In one example, the nucleic acid composition described herein may encode a soluble TCR or a chimeric single chain TCR wherein the TCR alpha chain variable domain is linked to the TCR beta chain variable domain and a constant domain which is e.g. fused to the CD3 zeta signalling domain. These are discussed generally in more detail elsewhere herein.
(ii) Vβ Domains that Interact with ISDPTSPLRTR (SEQ ID NO: 101)
As provided elsewhere herein, the inventors identified TCR clone 5C8.16 which interacts with ISDPTSPLRTR (SEQ ID NO:101) in the context of HLA-A*03:01. The sequences provided herein that correspond to TCR clone 5C8.16 are SEQ ID NO:s 43 to 56.
An example of an appropriate TCR Vβ domain CDR3 amino acid sequence that confers specific binding to a Jchain antigen (e.g. to Jchain antigen ISDPTSPLRTR (SEQ ID NO: 101)) is shown in SEQ ID NO:48. As would be clear to a person of skill in the art, variants of the amino acid sequence shown in SEQ ID NO:48 may also be functional (i.e. retain their ability to confer specific binding to a Jchain antigen (i.e. the peptide shown in SEQ ID NO: 101) when the CDR3 is part of TCR Vβ domain). Such functional variants are therefore encompassed herein.
For example, appropriate (functional) Vβ domain CDR3 amino acid sequences may have at least 80% sequence identity to SEQ ID NO: 48, i.e. they may have at least 80%, at least 86%, at least 93%, or 100% sequence identity to SEQ ID NO: 48. Suitably, percent identity is calculated as the percentage of identity to the entire length of the reference sequence (e.g. SEQ ID NO: 48). In other words, appropriate (functional) Vβ domain CDR3 amino acid sequences may vary from the sequence shown in SEQ ID NO: 48 by one or several (e.g. two) amino acids. As stated above, functional variants of SEQ ID NO: 48 retain their ability to confer specific binding to a Jchain antigen (e.g. the peptide shown in SEQ ID NO: 101) when the CDR3 is part of TCR Vβ domain.
Functional variants may be naturally occurring, synthetic, or synthetically improved functional variants of SEQ ID NO: 48. The term “variant” also encompasses homologues and fragments. Functional variants will typically contain only conservative substitutions of one or more amino acids of SEQ ID NO: 48, or substitution, deletion or insertion of non-critical amino acids in non-critical regions of the CDR3.
Non-functional variants are amino acid sequence variants of SEQ ID NO: 48 that do not specifically bind to a Jchain antigen (e.g. the peptide shown in SEQ ID NO: 101). Non-functional variants will typically contain a non-conservative substitution, a deletion, or insertion or premature truncation of the amino acid sequence of SEQ ID NO: 48 or a substitution, insertion or deletion in critical amino acids or critical regions. Methods for identifying functional and non-functional variants are well known to a person of ordinary skill in the art.
In one example, the CDR3 of the Vβ domain comprises or consists of the amino acid sequence of SEQ ID NO: 48. In examples where the TCR Vβ domain CDR3 has the amino acid sequence of SEQ ID NO:48, the CDR3 may be encoded by Any appropriate nucleic acid sequence.
The encoded TCR Vβ domain may comprise, in addition to the specified CDR3, a CDR1 comprising an amino acid sequence of SEQ ID NO: 46, or a functional variant thereof (i.e. wherein the variant retains the ability to specifically bind to a Jchain antigen (e.g. the peptide shown in SEQ ID NO: 101)). Such functional variants may be naturally occurring, synthetic, or synthetically improved functional variants of SEQ ID NO: 46. The term “variant” also encompasses homologues and fragments. Functional variants will typically contain only conservative substitutions of one or more amino acids of SEQ ID NO: 46, or substitution, deletion or insertion of non-critical amino acids in non-critical regions of the protein.
Non-functional variants are amino acid sequence variants of SEQ ID NO: 46 that do not specifically bind to a Jchain antigen (e.g. the peptide shown in SEQ ID NO: 101). Non-functional variants will typically contain a non-conservative substitution, a deletion, or insertion or premature truncation of the amino acid sequence of SEQ ID NO: 46 or a substitution, insertion or deletion in critical amino acids or critical regions. Methods for identifying functional and non-functional variants are well known to a person of ordinary skill in the art.
For example, appropriate functional Vβ domain CDR1 amino acid sequences may have at least 80% sequence identity to SEQ ID NO: 46, i.e. it may have at least 80%, or 100% sequence identity to SEQ ID NO: 46. Suitably, percent identity is calculated as the percentage of identity to the entire length of the reference sequence (e.g. SEQ ID NO: 46). In other words, appropriate (functional) Vβ domain CDR1 amino acid sequences may vary from the sequence shown in SEQ ID NO:46 by one or several amino acids. As stated previously, the variant may comprise an amino acid substitution such as a conservative amino acid substitution compared to the sequence shown in SEQ ID NO:46). As stated above, functional variants of SEQ ID NO: 46 retain the ability to specifically bind to a Jchain antigen (e.g. the peptide shown in SEQ ID NO: 101) when the CDR1 is part of TCR Vβ domain).
In one example, the CDR1 of the Vβ domain comprises or consists of the amino acid sequence of SEQ ID NO: 46. In examples where the TCR Vα domain CDR1 has the amino acid sequence of SEQ ID NO:46, the CDR1 may be encoded by any appropriate nucleic acid sequence.
The encoded TCR Vβ domain may also comprise, in addition to the specified CDR3 (and optionally the specified CDR1 above), a CDR2 having an amino acid sequence of SEQ ID NO: 47, or a functional variant thereof (i.e. wherein the variant retains the ability to specifically bind to HLA-A*03:01). Such functional variants may be naturally occurring, synthetic, or synthetically improved functional variants of SEQ ID NO: 47. The term “variant” also encompasses homologues and fragments. Functional variants will typically contain only conservative substitutions of one or more amino acids of SEQ ID NO: 47, or substitution, deletion or insertion of non-critical amino acids in non-critical regions of the protein.
Non-functional variants are amino acid sequence variants of SEQ ID NO: 47 that do not specifically bind to HLA-A*03:01. Non-functional variants will typically contain a non-conservative substitution, a deletion, or insertion or premature truncation of the amino acid sequence of SEQ ID NO: 47 or a substitution, insertion or deletion in critical amino acids or critical regions. Methods for identifying functional and non-functional variants are well known to a person of ordinary skill in the art.
For example, appropriate functional Vβ domain CDR2 amino acid sequences may have at least 80% sequence identity to SEQ ID NO: 47, i.e. it may have at least 80%, at least 83%, or 100% sequence identity to SEQ ID NO: 47. Suitably, percent identity is calculated as the percentage of identity to the entire length of the reference sequence (e.g. SEQ ID NO: 47). In other words, appropriate (functional) Vβ domain CDR2 amino acid sequences may vary from the sequence shown in SEQ ID NO: 47 by one or several amino acids. As stated previously, the variant may comprise an amino acid substitution such as a conservative amino acid substitution compared to the sequence shown in SEQ ID NO: 47). As stated above, a functional variant of SEQ ID NO: 47 retains the ability to specifically bind to HLA-A*03:01.
In one example, the CDR2 of the Vβ domain comprises or consists of the amino acid sequence of SEQ ID NO: 47. In examples where the TCR Vβ domain CDR2 has the amino acid sequence of SEQ ID NO:47, the CDR2 may be encoded by any appropriate nucleic acid sequence.
The encoded TCR Vβ domain may therefore comprise the CDRs mentioned in detail above (by SEQ ID specifically i.e. SEQ ID NO:48, SEQ ID NO: 46 and SEQ ID NO: 47, or functional variants thereof), with appropriate intervening sequences between the CDRs.
The encoded TCR Vβ domain may have an amino acid sequence of SEQ ID NO: 51, or a functional variant thereof (i.e. wherein the variant TCR Vβ domain retains the ability to specifically bind to a Jchain antigen (e.g. the peptide shown in SEQ ID NO: 101) when part of a binding protein described herein). Such functional variants may be naturally occurring, synthetic, or synthetically improved functional variants of SEQ ID NO: 51. The term “variant” also encompasses homologues and fragments. Functional variants will typically contain only conservative substitutions of one or more amino acids of SEQ ID NO: 51, or substitution, deletion or insertion of non-critical amino acids in non-critical regions of the protein.
Non-functional variants are amino acid sequence variants of SEQ ID NO: 51 that do not specifically bind to a Jchain antigen (e.g. the peptide shown in SEQ ID NO: 101). Non-functional variants will typically contain a non-conservative substitution, a deletion, or insertion or premature truncation of the amino acid sequence of SEQ ID NO:51 or a substitution, insertion or deletion in critical amino acids or critical regions. Methods for identifying functional and non-functional variants are well known to a person of ordinary skill in the art.
In one example, the encoded TCR Vβ domain may have an amino acid sequence having at least 75%, at least 80%, at least 85% or at least 90% (or at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%) sequence identity to the amino acid sequence of SEQ ID NO: 51, whilst retaining the ability to specifically bind to a Jchain antigen (e.g. the peptide shown in SEQ ID NO: 101). In other words, a functional TCR Vβ domain with one or several amino acid substitutions compared to the sequence of SEQ ID NO: 51 is also encompassed. As stated previously, the amino acid substitution may be a conservative amino acid substitution. The variability in sequence compared to SEQ ID NO:51 may all be in regions of the TCR Vβ domain that do not form CDRs (i.e. the variant may have the CDRs of SEQ ID NO: 48, SEQ ID NO: 46 and/or SEQ ID NO: 47, and still have 25% (or less) sequence variability compared to SEQ ID NO: 51). In other words, the sequence of the CDRs of SEQ ID NO: 51 may be retained whilst the rest of the sequence is varied, as appropriate within the “at least 75% identity” parameters specified above. Suitably, percent identity can be calculated as the percentage of identity to the entire length of the reference sequence (e.g. SEQ ID NO: 51).
As an example, the encoded TCR Vβ domain may comprise an amino acid sequence having at least 75% (e.g. at least 75%, at least 80%, at least 85%, at least 90%, at least 95% etc) sequence identity to the amino acid sequence of SEQ ID NO: 51, wherein the TCR Vβ domain comprises a CDR3 having an amino acid sequence of SEQ ID NO: 48. In this example, the TCR Vβ domain CDR1 may have an amino acid sequence of SEQ ID NO:46 and the TCR Vβ domain CDR2 may have an amino acid sequence of SEQ ID NO: 47.
In examples where the TCR Vβ domain has the amino acid sequence of SEQ ID NO:51, the TCR VB domain may be encoded by the nucleic acid sequence of SEQ ID NO:52, or a genetically degenerate sequence thereof (i.e. other nucleic acid sequences that encode the same protein as a result of the degeneracy of the genetic code).
For the avoidance of doubt, the nucleic acid sequence encoding the TCR Vβ domain may also encode a TCR β chain constant domain. Examples of suitable constant domains are generally discussed above.
An example of a specific TCR β chain amino acid sequence that includes a TCR Vβ domain described herein and an appropriate constant domain is shown in SEQ ID NO: 55. Appropriate functional variants of SEQ ID NO: 55 are also encompassed (e.g. variants having at least 75% (e.g. at least 75%, at least 80%, at least 85%, at least 90%, at least 95% etc) sequence identity to the amino acid sequence of SEQ ID NO: 55, wherein the variant TCR β chain amino acid sequence retains its ability to specifically bind to a Jchain antigen (e.g. the peptide shown in SEQ ID NO: 101) when part of a binding protein described herein). In other words, a functional TCR β chain with one or several amino acid substitutions compared to the sequence of SEQ ID NO: 55 is also encompassed. As stated previously, the amino acid substitution may be a conservative amino acid substitution. The variability in sequence compared to SEQ ID NO:55 may all be in regions of the TCR β chain that do not form CDRs (i.e. the variant may have the CDRs of SEQ ID NO: 48, SEQ ID NO: 46 and/or SEQ ID NO: 47, and still have 25% (or less) sequence variability compared to SEQ ID NO:55. In other words, the sequence of the CDRs of SEQ ID NO: 55 may be retained whilst the rest of the sequence is varied, as appropriate within the “at least 75% identity” parameters specified above. Suitably, percent identity can be calculated as the percentage of identity to the entire length of the reference sequence (e.g. SEQ ID NO: 55).
As an example, the encoded TCR β chain may comprise an amino acid sequence having at least 75% (e.g. at least 75%, at least 80%, at least 85%, at least 90%, at least 95% etc) sequence identity to the amino acid sequence of SEQ ID NO: 55, wherein the TCR β chain comprises a CDR3 having an amino acid sequence of SEQ ID NO: 48. In this example, the TCR 3 chain CDR1 may have an amino acid sequence of SEQ ID NO: 46 and the TCR β chain CDR2 may have an amino acid sequence of SEQ ID NO: 47.
In examples where the TCR β chain has the amino acid sequence of SEQ ID NO:55, the TCR β chain may be encoded by the nucleic acid sequence of SEQ ID NO:56, or a genetically degenerate sequence thereof (i.e. other nucleic acid sequences that encode the same protein as a result of the degeneracy of the genetic code). It is noted that SEQ ID NO:56 is the nucleic acid sequence for TCR B chain of clone 5C8.16.
In an example, the nucleic acid composition provided herein comprises a nucleic acid sequence that encodes a TCR Vβ domain comprising a CDR3 amino acid sequence having at least 80% sequence identity to SEQ ID NO:48, or a functional fragment thereof.
In another example, the CDR3 of the Vβ domain of a nucleic acid composition provided herein comprises or consists of the amino acid sequence of SEQ ID NO:48.
In a further example, the Vβ domain of a nucleic acid composition provided herein comprises an amino acid sequence having at least 80% sequence identity to, comprising, or consisting of, SEQ ID NO: 51.
The TCR Vβ domain sequences derived from TCR clone 5C8.16 discussed above are particularly compatible with the TCR Vα domain sequences derived from TCR clone 5C8.16 discussed elsewhere herein.
Accordingly, in one example, a nucleic acid composition described herein encodes a Jchain antigen-specific binding protein having TCR Vα domain comprising a CDR3 amino acid sequence having at least 80% sequence identity to SEQ ID NO:45, or a functional fragment thereof; and a nucleic acid sequence that encodes a TCR Vβ domain comprising a CDR3 amino acid sequence having at least 80% sequence identity to SEQ ID NO:48, or a functional fragment thereof.
In a particular example, a nucleic acid composition described herein encodes a Jchain antigen-specific binding protein having a TCR Vα domain with a CDR3 amino acid sequence comprising or consisting of the amino acid sequence of SEQ ID NO: 45; and a TCR Vβ domain with a CDR3 comprising or consisting of the amino acid sequence of SEQ ID NO:48. In addition, the Jchain antigen may comprise or consist of the sequence shown in SEQ ID NO: 101. Furthermore, the TCR Vα domain may be part of a TCR α chain having a constant domain and the TCR Vβ domain may be part of a TCR β chain having a constant domain.
In this particular example, the Vα domain may comprise an amino acid sequence having at least 80% sequence identity to, comprising, or consisting of, SEQ ID NO: 49; and the Vβ domain may comprise an amino acid sequence having at least 80% sequence identity to, comprising, or consisting of, SEQ ID NO: 51. In one example, the Vα domain comprises the amino acid sequence of SEQ ID NO: 49 and the Vβ domain comprises the amino acid sequence of SEQ ID NO: 51. In such cases, the Vα domain may be encoded by a nucleic acid sequence comprising the sequence of SEQ ID NO: 50; and the Vβ domain may be encoded by a nucleic acid sequence comprising the sequence of SEQ ID NO: 52.
In this particular example, the TCR Vα domain may include a CDR1 amino acid sequence comprising or consisting of the amino acid sequence of SEQ ID NO: 43 and a CDR2 amino acid sequence comprising or consisting of the amino acid sequence of SEQ ID NO:44. Furthermore, the TCR Vβ domain may include a CDR1 amino acid sequence comprising or consisting of the amino acid sequence of SEQ ID NO:46 and a CDR2 amino acid sequence comprising or consisting of the amino acid sequence of SEQ ID NO: 47.
For the avoidance of doubt, this particular example encompasses components of TCR clone 5C8.16 exemplified herein. The different components of TCR clone 5C8.16 and their respective SEQ ID Nos are summarised in Table 6 below.
As stated in more detail elsewhere herein, the nucleic acid composition described herein encodes both a TCR Vα domain and a TCR Vβ domain, which form the binding protein that is capable of specifically binding to a Jchain antigen. In examples where the TCR Vα domain and the TCR Vβ domain are encoded by the same nucleic acid sequence, the TCR Vα domain and TCR Vβ domain may be joined together via a linker. Suitable linkers are discussed generally elsewhere herein. Additional appropriate polypeptide domains that may also be encoded by the nucleic acid sequences that encode the TCR Vα domain and/or the TCR Vβ domain are also discussed generally elsewhere herein.
In one example, the nucleic acid composition described herein may encode a soluble TCR or a chimeric single chain TCR wherein the TCR alpha chain variable domain is linked to the TCR beta chain variable domain and a constant domain which is e.g. fused to the CD3 zeta signalling domain. These are discussed generally in more detail elsewhere herein.
As provided elsewhere herein, the inventors identified TCR clone 16C7.9 which interacts with ISDPTSPLRTR (SEQ ID NO:101) in the context of HLA-A*11:01. The sequences provided herein that correspond to TCR clone 16C7.9 are SEQ ID NO:s 57 to 70.
An example of an appropriate TCR Vβ domain CDR3 amino acid sequence that confers specific binding to a Jchain antigen (e.g. to Jchain antigen ISDPTSPLRTR (SEQ ID NO: 101)) is shown in SEQ ID NO:62. As would be clear to a person of skill in the art, variants of the amino acid sequence shown in SEQ ID NO:62 may also be functional (i.e. retain their ability to confer specific binding to a Jchain antigen (i.e. the peptide shown in SEQ ID NO: 101) when the CDR3 is part of TCR Vβ domain). Such functional variants are therefore encompassed herein.
For example, appropriate (functional) VB domain CDR3 amino acid sequences may have at least 80% sequence identity to SEQ ID NO: 62, i.e. they may have at least 80%, at least 85%, at least 92%, or 100% sequence identity to SEQ ID NO: 62. Suitably, percent identity is calculated as the percentage of identity to the entire length of the reference sequence (e.g. SEQ ID NO: 62). In other words, appropriate (functional) Vβ domain CDR3 amino acid sequences may vary from the sequence shown in SEQ ID NO: 62 by one or several (e.g. two) amino acids. As stated above, functional variants of SEQ ID NO: 62 retain their ability to confer specific binding to a Jchain antigen (e.g. the peptide shown in SEQ ID NO: 101) when the CDR3 is part of TCR Vβ domain.
Functional variants may be naturally occurring, synthetic, or synthetically improved functional variants of SEQ ID NO: 62. The term “variant” also encompasses homologues and fragments. Functional variants will typically contain only conservative substitutions of one or more amino acids of SEQ ID NO: 62, or substitution, deletion or insertion of non-critical amino acids in non-critical regions of the CDR3.
Non-functional variants are amino acid sequence variants of SEQ ID NO: 62 that do not specifically bind to a Jchain antigen (e.g. the peptide shown in SEQ ID NO: 101). Non-functional variants will typically contain a non-conservative substitution, a deletion, or insertion or premature truncation of the amino acid sequence of SEQ ID NO: 62 or a substitution, insertion or deletion in critical amino acids or critical regions. Methods for identifying functional and non-functional variants are well known to a person of ordinary skill in the art.
In one example, the CDR3 of the Vβ domain comprises or consists of the amino acid sequence of SEQ ID NO: 62. In examples where the TCR Vβ domain CDR3 has the amino acid sequence of SEQ ID NO:62, the CDR3 may be encoded by any appropriate nucleic acid sequence.
The encoded TCR Vβ domain may comprise, in addition to the specified CDR3, a CDR1 comprising an amino acid sequence of SEQ ID NO: 60, or a functional variant thereof (i.e. wherein the variant retains the ability to specifically bind to a Jchain antigen (e.g. the peptide shown in SEQ ID NO: 101)). Such functional variants may be naturally occurring, synthetic, or synthetically improved functional variants of SEQ ID NO: 60. The term “variant” also encompasses homologues and fragments. Functional variants will typically contain only conservative substitutions of one or more amino acids of SEQ ID NO: 60, or substitution, deletion or insertion of non-critical amino acids in non-critical regions of the protein.
Non-functional variants are amino acid sequence variants of SEQ ID NO: 60 that do not specifically bind to a Jchain antigen (e.g. the peptide shown in SEQ ID NO: 101). Non-functional variants will typically contain a non-conservative substitution, a deletion, or insertion or premature truncation of the amino acid sequence of SEQ ID NO: 60 or a substitution, insertion or deletion in critical amino acids or critical regions. Methods for identifying functional and non-functional variants are well known to a person of ordinary skill in the art.
For example, appropriate functional Vβ domain CDR1 amino acid sequences may have at least 80% sequence identity to SEQ ID NO: 60, i.e. it may have at least 80%, or 100% sequence identity to SEQ ID NO: 60. Suitably, percent identity is calculated as the percentage of identity to the entire length of the reference sequence (e.g. SEQ ID NO: 60). In other words, appropriate (functional) Vβ domain CDR1 amino acid sequences may vary from the sequence shown in SEQ ID NO:60 by one or several amino acids. As stated previously, the variant may comprise an amino acid substitution such as a conservative amino acid substitution compared to the sequence shown in SEQ ID NO:60). As stated above, functional variants of SEQ ID NO: 60 retain the ability to specifically bind to a Jchain antigen (e.g. the peptide shown in SEQ ID NO: 101) when the CDR1 is part of TCR Vβ domain).
In one example, the CDR1 of the Vβ domain comprises or consists of the amino acid sequence of SEQ ID NO: 60. In examples where the TCR Vα domain CDR1 has the amino acid sequence of SEQ ID NO:60, the CDR1 may be encoded by any appropriate nucleic acid sequence.
The encoded TCR Vβ domain may also comprise, in addition to the specified CDR3 (and optionally the specified CDR1 above), a CDR2 having an amino acid sequence of SEQ ID NO: 61, or a functional variant thereof (i.e. wherein the variant retains the ability to specifically bind to HLA-A*11:01). Such functional variants may be naturally occurring, synthetic, or synthetically improved functional variants of SEQ ID NO: 61. The term “variant” also encompasses homologues and fragments. Functional variants will typically contain only conservative substitutions of one or more amino acids of SEQ ID NO: 61, or substitution, deletion or insertion of non-critical amino acids in non-critical regions of the protein.
Non-functional variants are amino acid sequence variants of SEQ ID NO: 61 that do not specifically bind to HLA-A*11:01. Non-functional variants will typically contain a non-conservative substitution, a deletion, or insertion or premature truncation of the amino acid sequence of SEQ ID NO: 61 or a substitution, insertion or deletion in critical amino acids or critical regions. Methods for identifying functional and non-functional variants are well known to a person of ordinary skill in the art.
For example, appropriate functional Vβ domain CDR2 amino acid sequences may have at least 80% sequence identity to SEQ ID NO: 61, i.e. it may have at least 80%, at least 83, or 100% sequence identity to SEQ ID NO: 61. Suitably, percent identity is calculated as the percentage of identity to the entire length of the reference sequence (e.g. SEQ ID NO: 61). In other words, appropriate (functional) Vβ domain CDR2 amino acid sequences may vary from the sequence shown in SEQ ID NO: 61 by one or several amino acids. As stated previously, the variant may comprise an amino acid substitution such as a conservative amino acid substitution compared to the sequence shown in SEQ ID NO: 61). As stated above, a functional variant of SEQ ID NO: 61 retains the ability to specifically bind to HLA-A*11:01.
In one example, the CDR2 of the Vβ domain comprises or consists of the amino acid sequence of SEQ ID NO: 61. In examples where the TCR Vβ domain CDR2 has the amino acid sequence of SEQ ID NO:61, the CDR2 may be encoded by any appropriate nucleic acid sequence.
The encoded TCR Vβ domain may therefore comprise the CDRs mentioned in detail above (by SEQ ID specifically i.e. SEQ ID NO:62, SEQ ID NO: 60 and SEQ ID NO: 61, or functional variants thereof), with appropriate intervening sequences between the CDRs.
The encoded TCR Vβ domain may have an amino acid sequence of SEQ ID NO: 65, or a functional variant thereof (i.e. wherein the variant TCR Vβ domain retains the ability to specifically bind to a Jchain antigen (e.g. the peptide shown in SEQ ID NO: 101) when part of a binding protein described herein). Such functional variants may be naturally occurring, synthetic, or synthetically improved functional variants of SEQ ID NO: 65. The term “variant” also encompasses homologues and fragments. Functional variants will typically contain only conservative substitutions of one or more amino acids of SEQ ID NO: 65, or substitution, deletion or insertion of non-critical amino acids in non-critical regions of the protein.
Non-functional variants are amino acid sequence variants of SEQ ID NO: 65 that do not specifically bind to a Jchain antigen (e.g. the peptide shown in SEQ ID NO: 101). Non-functional variants will typically contain a non-conservative substitution, a deletion, or insertion or premature truncation of the amino acid sequence of SEQ ID NO:65 or a substitution, insertion or deletion in critical amino acids or critical regions. Methods for identifying functional and non-functional variants are well known to a person of ordinary skill in the art.
In one example, the encoded TCR Vβ domain may have an amino acid sequence having at least 75%, at least 80%, at least 85% or at least 90% (or at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%) sequence identity to the amino acid sequence of SEQ ID NO: 65, whilst retaining the ability to specifically bind to a Jchain antigen (e.g. the peptide shown in SEQ ID NO: 101). In other words, a functional TCR Vβ domain with one or several amino acid substitutions compared to the sequence of SEQ ID NO: 65 is also encompassed. As stated previously, the amino acid substitution may be a conservative amino acid substitution. The variability in sequence compared to SEQ ID NO:65 may all be in regions of the TCR Vβ domain that do not form CDRs (i.e. the variant may have the CDRs of SEQ ID NO: 62, SEQ ID NO: 60 and/or SEQ ID NO: 61, and still have 25% (or less) sequence variability compared to SEQ ID NO: 65). In other words, the sequence of the CDRs of SEQ ID NO: 65 may be retained whilst the rest of the sequence is varied, as appropriate within the “at least 75% identity” parameters specified above. Suitably, percent identity can be calculated as the percentage of identity to the entire length of the reference sequence (e.g. SEQ ID NO: 65).
As an example, the encoded TCR Vβ domain may comprise an amino acid sequence having at least 75% (e.g. at least 75%, at least 80%, at least 85%, at least 90%, at least 95% etc) sequence identity to the amino acid sequence of SEQ ID NO: 65, wherein the TCR Vβ domain comprises a CDR3 having an amino acid sequence of SEQ ID NO: 62. In this example, the TCR Vβ domain CDR1 may have an amino acid sequence of SEQ ID NO:60 and the TCR Vβ domain CDR2 may have an amino acid sequence of SEQ ID NO: 61.
In examples where the TCR Vβ domain has the amino acid sequence of SEQ ID NO:65, the TCR Vβ domain may be encoded by the nucleic acid sequence of SEQ ID NO: 66, or a genetically degenerate sequence thereof (i.e. other nucleic acid sequences that encode the same protein as a result of the degeneracy of the genetic code).
For the avoidance of doubt, the nucleic acid sequence encoding the TCR Vβ domain may also encode a TCR β chain constant domain. Examples of suitable constant domains are generally discussed above.
An example of a specific TCR β chain amino acid sequence that includes a TCR Vβ domain described herein and an appropriate constant domain is shown in SEQ ID NO: 69. Appropriate functional variants of SEQ ID NO: 69 are also encompassed (e.g. variants having at least 75% (e.g. at least 75%, at least 80%, at least 85%, at least 90%, at least 95% etc) sequence identity to the amino acid sequence of SEQ ID NO: 69, wherein the variant TCR β chain amino acid sequence retains its ability to specifically bind to a Jchain antigen (e.g. the peptide shown in SEQ ID NO: 101) when part of a binding protein described herein). In other words, a functional TCR β chain with one or several amino acid substitutions compared to the sequence of SEQ ID NO: 69 is also encompassed. As stated previously, the amino acid substitution may be a conservative amino acid substitution. The variability in sequence compared to SEQ ID NO:69 may all be in regions of the TCR β chain that do not form CDRs (i.e. the variant may have the CDRs of SEQ ID NO: 62, SEQ ID NO: 60 and/or SEQ ID NO: 61, and still have 25% (or less) sequence variability compared to SEQ ID NO:69. In other words, the sequence of the CDRs of SEQ ID NO: 69 may be retained whilst the rest of the sequence is varied, as appropriate within the “at least 75% identity” parameters specified above. Suitably, percent identity can be calculated as the percentage of identity to the entire length of the reference sequence (e.g. SEQ ID NO: 69).
As an example, the encoded TCR β chain may comprise an amino acid sequence having at least 75% (e.g. at least 75%, at least 80%, at least 85%, at least 90%, at least 95% etc) sequence identity to the amino acid sequence of SEQ ID NO: 69, wherein the TCR β chain comprises a CDR3 having an amino acid sequence of SEQ ID NO: 62. In this example, the TCR β chain CDR1 may have an amino acid sequence of SEQ ID NO: 60 and the TCR β chain CDR2 may have an amino acid sequence of SEQ ID NO: 61.
In examples where the TCR β chain has the amino acid sequence of SEQ ID NO:69, the TCR β chain may be encoded by the nucleic acid sequence of SEQ ID NO:70, or a genetically degenerate sequence thereof (i.e. other nucleic acid sequences that encode the same protein as a result of the degeneracy of the genetic code). It is noted that SEQ ID NO:70 is the nucleic acid sequence for TCR β chain of clone 16C7.9.
In an example, the nucleic acid composition provided herein comprises a nucleic acid sequence that encodes a TCR Vβ domain comprising a CDR3 amino acid sequence having at least 80% sequence identity to SEQ ID NO:62, or a functional fragment thereof.
In another example, the CDR3 of the Vβ domain of a nucleic acid composition provided herein comprises or consists of the amino acid sequence of SEQ ID NO:62.
In a further example, the Vβ domain of a nucleic acid composition provided herein comprises an amino acid sequence having at least 80% sequence identity to, comprising, or consisting of, SEQ ID NO: 65.
The TCR Vβ domain sequences derived from TCR clone 16C7.9 discussed above are particularly compatible with the TCR Vα domain sequences derived from TCR clone 16C7.9 discussed elsewhere herein.
Accordingly, in one example, a nucleic acid composition described herein encodes a Jchain antigen-specific binding protein having TCR Vα domain comprising a CDR3 amino acid sequence having at least 80% sequence identity to SEQ ID NO:59, or a functional fragment thereof; and a nucleic acid sequence that encodes a TCR Vβ domain comprising a CDR3 amino acid sequence having at least 80% sequence identity to SEQ ID NO:62, or a functional fragment thereof.
In a particular example, a nucleic acid composition described herein encodes a Jchain antigen-specific binding protein having a TCR Vα domain with a CDR3 amino acid sequence comprising or consisting of the amino acid sequence of SEQ ID NO: 59; and a TCR Vβ domain with a CDR3 comprising or consisting of the amino acid sequence of SEQ ID NO:62. In addition, the Jchain antigen may comprise or consist of the sequence shown in SEQ ID NO: 101. Furthermore, the TCR Vα domain may be part of a TCR α chain having a constant domain and the TCR Vβ domain may be part of a TCR β chain having a constant domain.
In this particular example, the Vα domain may comprise an amino acid sequence having at least 80% sequence identity to, comprising, or consisting of, SEQ ID NO: 63; and the Vβ domain may comprise an amino acid sequence having at least 80% sequence identity to, comprising, or consisting of, SEQ ID NO: 65. In one example, the Vα domain comprises the amino acid sequence of SEQ ID NO: 63 and the Vβ domain comprises the amino acid sequence of SEQ ID NO: 65. In such cases, the Vα domain may be encoded by a nucleic acid sequence comprising the sequence of SEQ ID NO: 64; and the Vβ domain may be encoded by a nucleic acid sequence comprising the sequence of SEQ ID NO: 66.
In this particular example, the TCR Vα domain may include a CDR1 amino acid sequence comprising or consisting of the amino acid sequence of SEQ ID NO:57 and a CDR2 amino acid sequence comprising or consisting of the amino acid sequence of SEQ ID NO:58. Furthermore, the TCR Vβ domain may include a CDR1 amino acid sequence comprising or consisting of the amino acid sequence of SEQ ID NO:60 and a CDR2 amino acid sequence comprising or consisting of the amino acid sequence of SEQ ID NO: 61.
For the avoidance of doubt, this particular example encompasses components of TCR clone 16C7.9 exemplified herein. The different components of TCR clone 16C7.9 and their respective SEQ ID Nos are summarised in Table 7 below.
As stated in more detail elsewhere herein, the nucleic acid composition described herein encodes both a TCR Vα domain and a TCR Vβ domain, which form the binding protein that is capable of specifically binding to a Jchain antigen. In examples where the TCR Vα domain and the TCR Vβ domain are encoded by the same nucleic acid sequence, the TCR Vα domain and TCR Vβ domain may be joined together via a linker. Suitable linkers are discussed generally elsewhere herein. Additional appropriate polypeptide domains that may also be encoded by the nucleic acid sequences that encode the TCR Vα domain and/or the TCR Vβ domain are also discussed generally elsewhere herein.
In one example, the nucleic acid composition described herein may encode a soluble TCR or a chimeric single chain TCR wherein the TCR alpha chain variable domain is linked to the TCR beta chain variable domain and a constant domain which is e.g. fused to the CD3 zeta signalling domain. These are discussed generally in more detail elsewhere herein.
As provided elsewhere herein, the inventors identified TCR clone 13D4.9 which interacts with ISDPTSPLRTR (SEQ ID NO:101) in the context of HLA-A*11:01. The sequences provided herein that correspond to TCR clone 13D4.9 are SEQ ID NO:s 71 to 84.
An example of an appropriate TCR Vβ domain CDR3 amino acid sequence that confers specific binding to a Jchain antigen (e.g. to Jchain antigen ISDPTSPLRTR (SEQ ID NO: 101)) is shown in SEQ ID NO:76. As would be clear to a person of skill in the art, variants of the amino acid sequence shown in SEQ ID NO:76 may also be functional (i.e. retain their ability to confer specific binding to a Jchain antigen (i.e. the peptide shown in SEQ ID NO: 101) when the CDR3 is part of TCR Vβ domain). Such functional variants are therefore encompassed herein.
For example, appropriate (functional) Vβ domain CDR3 amino acid sequences may have at least 80% sequence identity to SEQ ID NO: 76, i.e. they may have at least 80%, at least 81%, at least 87%, at least 93%, or 100% sequence identity to SEQ ID NO: 76. Suitably, percent identity is calculated as the percentage of identity to the entire length of the reference sequence (e.g. SEQ ID NO: 76). In other words, appropriate (functional) Vβ domain CDR3 amino acid sequences may vary from the sequence shown in SEQ ID NO: 76 by one or several (e.g. two) amino acids.
As stated above, functional variants of SEQ ID NO: 76 retain their ability to confer specific binding to a Jchain antigen (e.g. the peptide shown in SEQ ID NO: 101) when the CDR3 is part of TCR Vβ domain.
Functional variants may be naturally occurring, synthetic, or synthetically improved functional variants of SEQ ID NO: 76. The term “variant” also encompasses homologues and fragments. Functional variants will typically contain only conservative substitutions of one or more amino acids of SEQ ID NO: 76, or substitution, deletion or insertion of non-critical amino acids in non-critical regions of the CDR3.
Non-functional variants are amino acid sequence variants of SEQ ID NO: 76 that do not specifically bind to a Jchain antigen (e.g. the peptide shown in SEQ ID NO: 101). Non-functional variants will typically contain a non-conservative substitution, a deletion, or insertion or premature truncation of the amino acid sequence of SEQ ID NO: 76 or a substitution, insertion or deletion in critical amino acids or critical regions. Methods for identifying functional and non-functional variants are well known to a person of ordinary skill in the art.
In one example, the CDR3 of the Vβ domain comprises or consists of the amino acid sequence of SEQ ID NO: 76. In examples where the TCR Vβ domain CDR3 has the amino acid sequence of SEQ ID NO:76, the CDR3 may be encoded by any appropriate nucleic acid sequence.
The encoded TCR Vβ domain may comprise, in addition to the specified CDR3, a CDR1 comprising an amino acid sequence of SEQ ID NO: 74, or a functional variant thereof (i.e. wherein the variant retains the ability to specifically bind to a Jchain antigen (e.g. the peptide shown in SEQ ID NO: 101)). Such functional variants may be naturally occurring, synthetic, or synthetically improved functional variants of SEQ ID NO: 74. The term “variant” also encompasses homologues and fragments. Functional variants will typically contain only conservative substitutions of one or more amino acids of SEQ ID NO: 74, or substitution, deletion or insertion of non-critical amino acids in non-critical regions of the protein.
Non-functional variants are amino acid sequence variants of SEQ ID NO: 74 that do not specifically bind to a Jchain antigen (e.g. the peptide shown in SEQ ID NO: 101). Non-functional variants will typically contain a non-conservative substitution, a deletion, or insertion or premature truncation of the amino acid sequence of SEQ ID NO: 74 or a substitution, insertion or deletion in critical amino acids or critical regions. Methods for identifying functional and non-functional variants are well known to a person of ordinary skill in the art.
For example, appropriate functional Vβ domain CDR1 amino acid sequences may have at least 80% sequence identity to SEQ ID NO: 74, i.e. it may have at least 80%, or 100% sequence identity to SEQ ID NO: 74. Suitably, percent identity is calculated as the percentage of identity to the entire length of the reference sequence (e.g. SEQ ID NO: 74). In other words, appropriate (functional) Vβ domain CDR1 amino acid sequences may vary from the sequence shown in SEQ ID NO:74 by one or several amino acids. As stated previously, the variant may comprise an amino acid substitution such as a conservative amino acid substitution compared to the sequence shown in SEQ ID NO:74). As stated above, functional variants of SEQ ID NO: 74 retain the ability to specifically bind to a Jchain antigen (e.g. the peptide shown in SEQ ID NO: 101) when the CDR1 is part of TCR Vβ domain).
In one example, the CDR1 of the Vβ domain comprises or consists of the amino acid sequence of SEQ ID NO: 74. In examples where the TCR Vα domain CDR1 has the amino acid sequence of SEQ ID NO:74, the CDR1 may be encoded by any appropriate nucleic acid sequence.
The encoded TCR Vβ domain may also comprise, in addition to the specified CDR3 (and optionally the specified CDR1 above), a CDR2 having an amino acid sequence of SEQ ID NO: 75, or a functional variant thereof (i.e. wherein the variant retains the ability to specifically bind to HLA-A*11:01). Such functional variants may be naturally occurring, synthetic, or synthetically improved functional variants of SEQ ID NO: 75. The term “variant” also encompasses homologues and fragments. Functional variants will typically contain only conservative substitutions of one or more amino acids of SEQ ID NO: 75, or substitution, deletion or insertion of non-critical amino acids in non-critical regions of the protein.
Non-functional variants are amino acid sequence variants of SEQ ID NO: 75 that do not specifically bind to HLA-A*11:01. Non-functional variants will typically contain a non-conservative substitution, a deletion, or insertion or premature truncation of the amino acid sequence of SEQ ID NO: 75 or a substitution, insertion or deletion in critical amino acids or critical regions. Methods for identifying functional and non-functional variants are well known to a person of ordinary skill in the art.
For example, appropriate functional Vβ domain CDR2 amino acid sequences may have at least 80% sequence identity to SEQ ID NO: 75, i.e. it may have at least 80%, at least 83%, or 100% sequence identity to SEQ ID NO: 75. Suitably, percent identity is calculated as the percentage of identity to the entire length of the reference sequence (e.g. SEQ ID NO: 75). In other words, appropriate (functional) Vβ domain CDR2 amino acid sequences may vary from the sequence shown in SEQ ID NO: 75 by one or several amino acids. As stated previously, the variant may comprise an amino acid substitution such as a conservative amino acid substitution compared to the sequence shown in SEQ ID NO: 75). As stated above, a functional variant of SEQ ID NO: 75 retains the ability to specifically bind to HLA-A*11:01.
In one example, the CDR2 of the Vβ domain comprises or consists of the amino acid sequence of SEQ ID NO: 75. In examples where the TCR Vβ domain CDR2 has the amino acid sequence of SEQ ID NO:75, the CDR2 may be encoded by any appropriate nucleic acid sequence.
The encoded TCR Vβ domain may therefore comprise the CDRs mentioned in detail above (by SEQ ID specifically i.e. SEQ ID NO:76, SEQ ID NO: 74 and SEQ ID NO: 75, or functional variants thereof), with appropriate intervening sequences between the CDRs.
The encoded TCR Vβ domain may have an amino acid sequence of SEQ ID NO: 79, or a functional variant thereof (i.e. wherein the variant TCR Vβ domain retains the ability to specifically bind to a Jchain antigen (e.g. the peptide shown in SEQ ID NO: 101) when part of a binding protein described herein). Such functional variants may be naturally occurring, synthetic, or synthetically improved functional variants of SEQ ID NO: 79. The term “variant” also encompasses homologues and fragments. Functional variants will typically contain only conservative substitutions of one or more amino acids of SEQ ID NO: 79, or substitution, deletion or insertion of non-critical amino acids in non-critical regions of the protein.
Non-functional variants are amino acid sequence variants of SEQ ID NO: 79 that do not specifically bind to a Jchain antigen (e.g. the peptide shown in SEQ ID NO: 101). Non-functional variants will typically contain a non-conservative substitution, a deletion, or insertion or premature truncation of the amino acid sequence of SEQ ID NO:79 or a substitution, insertion or deletion in critical amino acids or critical regions. Methods for identifying functional and non-functional variants are well known to a person of ordinary skill in the art.
In one example, the encoded TCR Vβ domain may have an amino acid sequence having at least 75%, at least 80%, at least 85% or at least 90% (or at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%) sequence identity to the amino acid sequence of SEQ ID NO: 79, whilst retaining the ability to specifically bind to a Jchain antigen (e.g. the peptide shown in SEQ ID NO: 101). In other words, a functional TCR Vβ domain with one or several amino acid substitutions compared to the sequence of SEQ ID NO: 79 is also encompassed. As stated previously, the amino acid substitution may be a conservative amino acid substitution. The variability in sequence compared to SEQ ID NO:79 may all be in regions of the TCR VB domain that do not form CDRs (i.e. the variant may have the CDRs of SEQ ID NO: 76, SEQ ID NO: 74 and/or SEQ ID NO: 75, and still have 25% (or less) sequence variability compared to SEQ ID NO: 79). In other words, the sequence of the CDRs of SEQ ID NO: 79 may be retained whilst the rest of the sequence is varied, as appropriate within the “at least 75% identity” parameters specified above. Suitably, percent identity can be calculated as the percentage of identity to the entire length of the reference sequence (e.g. SEQ ID NO: 79).
As an example, the encoded TCR Vβ domain may comprise an amino acid sequence having at least 75% (e.g. at least 75%, at least 80%, at least 85%, at least 90%, at least 95% etc) sequence identity to the amino acid sequence of SEQ ID NO: 79, wherein the TCR Vβ domain comprises a CDR3 having an amino acid sequence of SEQ ID NO: 76. In this example, the TCR Vβ domain CDR1 may have an amino acid sequence of SEQ ID NO:74 and the TCR Vβ domain CDR2 may have an amino acid sequence of SEQ ID NO: 75.
In examples where the TCR Vβ domain has the amino acid sequence of SEQ ID NO:79, the TCR Vβ domain may be encoded by the nucleic acid sequence of SEQ ID NO: 80, or a genetically degenerate sequence thereof (i.e. other nucleic acid sequences that encode the same protein as a result of the degeneracy of the genetic code).
For the avoidance of doubt, the nucleic acid sequence encoding the TCR Vβ domain may also encode a TCR β chain constant domain. Examples of suitable constant domains are generally discussed above.
An example of a specific TCR β chain amino acid sequence that includes a TCR Vβ domain described herein and an appropriate constant domain is shown in SEQ ID NO: 83. Appropriate functional variants of SEQ ID NO: 83 are also encompassed (e.g. variants having at least 75% (e.g. at least 75%, at least 80%, at least 85%, at least 90%, at least 95% etc) sequence identity to the amino acid sequence of SEQ ID NO: 83, wherein the variant TCR β chain amino acid sequence retains its ability to specifically bind to a Jchain antigen (e.g. the peptide shown in SEQ ID NO: 101) when part of a binding protein described herein). In other words, a functional TCR β chain with one or several amino acid substitutions compared to the sequence of SEQ ID NO: 83 is also encompassed. As stated previously, the amino acid substitution may be a conservative amino acid substitution. The variability in sequence compared to SEQ ID NO:83 may all be in regions of the TCR β chain that do not form CDRs (i.e. the variant may have the CDRs of SEQ ID NO: 76, SEQ ID NO: 74 and/or SEQ ID NO: 75, and still have 25% (or less) sequence variability compared to SEQ ID NO:83. In other words, the sequence of the CDRs of SEQ ID NO: 83 may be retained whilst the rest of the sequence is varied, as appropriate within the “at least 75% identity” parameters specified above. Suitably, percent identity can be calculated as the percentage of identity to the entire length of the reference sequence (e.g. SEQ ID NO: 83).
As an example, the encoded TCR β chain may comprise an amino acid sequence having at least 75% (e.g. at least 75%, at least 80%, at least 85%, at least 90%, at least 95% etc) sequence identity to the amino acid sequence of SEQ ID NO: 83, wherein the TCR β chain comprises a CDR3 having an amino acid sequence of SEQ ID NO: 76. In this example, the TCR β chain CDR1 may have an amino acid sequence of SEQ ID NO: 74 and the TCR β chain CDR2 may have an amino acid sequence of SEQ ID NO: 75.
In examples where the TCR β chain has the amino acid sequence of SEQ ID NO:83, the TCR β chain may be encoded by the nucleic acid sequence of SEQ ID NO: 84, or a genetically degenerate sequence thereof (i.e. other nucleic acid sequences that encode the same protein as a result of the degeneracy of the genetic code). It is noted that SEQ ID NO:84 is the nucleic acid sequence for TCR β chain of clone 13D4.9.
In an example, the nucleic acid composition provided herein comprises a nucleic acid sequence that encodes a TCR Vβ domain comprising a CDR3 amino acid sequence having at least 80% sequence identity to SEQ ID NO:76, or a functional fragment thereof.
In another example, the CDR3 of the Vβ domain of a nucleic acid composition provided herein comprises or consists of the amino acid sequence of SEQ ID NO:76.
In a further example, the Vβ domain of a nucleic acid composition provided herein comprises an amino acid sequence having at least 80% sequence identity to, comprising, or consisting of, SEQ ID NO: 79.
The TCR Vβ domain sequences derived from TCR clone 13D4.9 discussed above are particularly compatible with the TCR Vα domain sequences derived from TCR clone 13D4.9 discussed elsewhere herein.
Accordingly, in one example, a nucleic acid composition described herein encodes a Jchain antigen-specific binding protein having TCR Vα domain comprising a CDR3 amino acid sequence having at least 80% sequence identity to SEQ ID NO:73, or a functional fragment thereof; and a nucleic acid sequence that encodes a TCR Vβ domain comprising a CDR3 amino acid sequence having at least 80% sequence identity to SEQ ID NO:76, or a functional fragment thereof.
In a particular example, a nucleic acid composition described herein encodes a Jchain antigen-specific binding protein having a TCR Vα domain with a CDR3 amino acid sequence comprising or consisting of the amino acid sequence of SEQ ID NO: 73; and a TCR Vβ domain with a CDR3 comprising or consisting of the amino acid sequence of SEQ ID NO:76. In addition, the Jchain antigen may comprise or consist of the sequence shown in SEQ ID NO: 101. Furthermore, the TCR Vα domain may be part of a TCR α chain having a constant domain and the TCR Vβ domain may be part of a TCR β chain having a constant domain.
In this particular example, the Vα domain may comprise an amino acid sequence having at least 80% sequence identity to, comprising, or consisting of, SEQ ID NO: 77; and the Vβ domain may comprise an amino acid sequence having at least 80% sequence identity to, comprising, or consisting of, SEQ ID NO: 79. In one example, the Vα domain comprises the amino acid sequence of SEQ ID NO: 77 and the Vβ domain comprises the amino acid sequence of SEQ ID NO: 79. In such cases, the Vα domain may be encoded by a nucleic acid sequence comprising the sequence of SEQ ID NO: 78; and the Vβ domain may be encoded by a nucleic acid sequence comprising the sequence of SEQ ID NO: 80.
In this particular example, the TCR Vα domain may include a CDR1 amino acid sequence comprising or consisting of the amino acid sequence of SEQ ID NO:71 and a CDR2 amino acid sequence comprising or consisting of the amino acid sequence of SEQ ID NO:72. Furthermore, the TCR Vβ domain may include a CDR1 amino acid sequence comprising or consisting of the amino acid sequence of SEQ ID NO:74 and a CDR2 amino acid sequence comprising or consisting of the amino acid sequence of SEQ ID NO: 75.
For the avoidance of doubt, this particular example encompasses components of TCR clone 13D4.9 exemplified herein. The different components of TCR clone 13D4.9 and their respective SEQ ID Nos are summarised in Table 8 below.
As stated in more detail elsewhere herein, the nucleic acid composition described herein encodes both a TCR Vα domain and a TCR Vβ domain, which form the binding protein that is capable of specifically binding to a Jchain antigen. In examples where the TCR Vα domain and the TCR Vβ domain are encoded by the same nucleic acid sequence, the TCR Vα domain and TCR Vβ domain may be joined together via a linker. Suitable linkers are discussed generally elsewhere herein. Additional appropriate polypeptide domains that may also be encoded by the nucleic acid sequences that encode the TCR Vα domain and/or the TCR Vβ domain are also discussed generally elsewhere herein.
In one example, the nucleic acid composition described herein may encode a soluble TCR or a chimeric single chain TCR wherein the TCR alpha chain variable domain is linked to the TCR beta chain variable domain and a constant domain which is e.g. fused to the CD3 zeta signalling domain. These are discussed generally in more detail elsewhere herein.
(iii) Vβ Domains that Interact with CYTAVVPLV (SEQ ID NO: 100)
As provided elsewhere herein, the inventors identified TCR clone 10H11.11 which interacts with CYTAVVPLV (SEQ ID NO:100) in the context of HLA-A*24:02. The sequences provided herein that correspond to TCR clone 10H11.1 are SEQ ID NO:s 29 to 42.
An example of an appropriate TCR Vβ domain CDR3 amino acid sequence that confers specific binding to a Jchain antigen (e.g. to Jchain antigen CYTAVVPLV (SEQ ID NO: 100)) is shown in SEQ ID NO:34. As would be clear to a person of skill in the art, variants of the amino acid sequence shown in SEQ ID NO:34 may also be functional (i.e. retain their ability to confer specific binding to a Jchain antigen (i.e. the peptide shown in SEQ ID NO: 100) when the CDR3 is part of TCR Vβ domain). Such functional variants are therefore encompassed herein.
For example, appropriate (functional) Vβ domain CDR3 amino acid sequences may have at least 80% sequence identity to SEQ ID NO: 34, i.e. they may have at least 80%, at least 81%, at least 87%, at least 93%, or 100% sequence identity to SEQ ID NO: 34. Suitably, percent identity is calculated as the percentage of identity to the entire length of the reference sequence (e.g. SEQ ID NO: 34). In other words, appropriate (functional) Vβ domain CDR3 amino acid sequences may vary from the sequence shown in SEQ ID NO: 34 by one or several (e.g. two) amino acids. As stated above, functional variants of SEQ ID NO: 34 retain their ability to confer specific binding to a Jchain antigen (e.g. the peptide shown in SEQ ID NO: 100) when the CDR3 is part of TCR Vβ domain.
Functional variants may be naturally occurring, synthetic, or synthetically improved functional variants of SEQ ID NO: 34. The term “variant” also encompasses homologues and fragments. Functional variants will typically contain only conservative substitutions of one or more amino acids of SEQ ID NO: 34, or substitution, deletion or insertion of non-critical amino acids in non-critical regions of the CDR3.
Non-functional variants are amino acid sequence variants of SEQ ID NO: 34 that do not specifically bind to a Jchain antigen (e.g. the peptide shown in SEQ ID NO: 100). Non-functional variants will typically contain a non-conservative substitution, a deletion, or insertion or premature truncation of the amino acid sequence of SEQ ID NO: 34 or a substitution, insertion or deletion in critical amino acids or critical regions. Methods for identifying functional and non-functional variants are well known to a person of ordinary skill in the art.
In one example, the CDR3 of the Vβ domain comprises or consists of the amino acid sequence of SEQ ID NO: 34. In examples where the TCR Vβ domain CDR3 has the amino acid sequence of SEQ ID NO:34, the CDR3 may be encoded by any appropriate nucleic acid sequence.
The encoded TCR Vβ domain may comprise, in addition to the specified CDR3, a CDR1 comprising an amino acid sequence of SEQ ID NO: 32, or a functional variant thereof (i.e. wherein the variant retains the ability to specifically bind to a Jchain antigen (e.g. the peptide shown in SEQ ID NO: 100)). Such functional variants may be naturally occurring, synthetic, or synthetically improved functional variants of SEQ ID NO: 32. The term “variant” also encompasses homologues and fragments. Functional variants will typically contain only conservative substitutions of one or more amino acids of SEQ ID NO: 32, or substitution, deletion or insertion of non-critical amino acids in non-critical regions of the protein.
Non-functional variants are amino acid sequence variants of SEQ ID NO: 32 that do not specifically bind to a Jchain antigen (e.g. the peptide shown in SEQ ID NO: 100). Non-functional variants will typically contain a non-conservative substitution, a deletion, or insertion or premature truncation of the amino acid sequence of SEQ ID NO: 32 or a substitution, insertion or deletion in critical amino acids or critical regions. Methods for identifying functional and non-functional variants are well known to a person of ordinary skill in the art.
For example, appropriate functional Vβ domain CDR1 amino acid sequences may have at least 80% sequence identity to SEQ ID NO: 32, i.e. it may have at least 80%, or 100% sequence identity to SEQ ID NO: 32. Suitably, percent identity is calculated as the percentage of identity to the entire length of the reference sequence (e.g. SEQ ID NO: 32). In other words, appropriate (functional) Vβ domain CDR1 amino acid sequences may vary from the sequence shown in SEQ ID NO:32 by one or several amino acids. As stated previously, the variant may comprise an amino acid substitution such as a conservative amino acid substitution compared to the sequence shown in SEQ ID NO:32). As stated above, functional variants of SEQ ID NO: 32 retain the ability to specifically bind to a Jchain antigen (e.g. the peptide shown in SEQ ID NO: 100) when the CDR1 is part of TCR Vβ domain).
In one example, the CDR1 of the Vβ domain comprises or consists of the amino acid sequence of SEQ ID NO: 32. In examples where the TCR Vα domain CDR1 has the amino acid sequence of SEQ ID NO:32, the CDR1 may be encoded by any appropriate nucleic acid sequence.
The encoded TCR Vβ domain may also comprise, in addition to the specified CDR3 (and optionally the specified CDR1 above), a CDR2 having an amino acid sequence of SEQ ID NO: 33, or a functional variant thereof (i.e. wherein the variant retains the ability to specifically bind to HLA-A*24:02). Such functional variants may be naturally occurring, synthetic, or synthetically improved functional variants of SEQ ID NO: 33. The term “variant” also encompasses homologues and fragments. Functional variants will typically contain only conservative substitutions of one or more amino acids of SEQ ID NO: 33, or substitution, deletion or insertion of non-critical amino acids in non-critical regions of the protein.
Non-functional variants are amino acid sequence variants of SEQ ID NO: 33 that do not specifically bind to HLA-A*24:02. Non-functional variants will typically contain a non-conservative substitution, a deletion, or insertion or premature truncation of the amino acid sequence of SEQ ID NO: 33 or a substitution, insertion or deletion in critical amino acids or critical regions. Methods for identifying functional and non-functional variants are well known to a person of ordinary skill in the art.
For example, appropriate functional Vβ domain CDR2 amino acid sequences may have at least 80% sequence identity to SEQ ID NO: 33, i.e. it may have at least 80%, at least 83%, or 100% sequence identity to SEQ ID NO: 33. Suitably, percent identity is calculated as the percentage of identity to the entire length of the reference sequence (e.g. SEQ ID NO: 33). In other words, appropriate (functional) Vβ domain CDR2 amino acid sequences may vary from the sequence shown in SEQ ID NO: 33 by one or several amino acids. As stated previously, the variant may comprise an amino acid substitution such as a conservative amino acid substitution compared to the sequence shown in SEQ ID NO: 33). As stated above, a functional variant of SEQ ID NO: 33 retains the ability to specifically bind to HLA-A*24:02.
In one example, the CDR2 of the Vβ domain comprises or consists of the amino acid sequence of SEQ ID NO: 33. In examples where the TCR Vβ domain CDR2 has the amino acid sequence of SEQ ID NO:33, the CDR2 may be encoded by any appropriate nucleic acid sequence.
The encoded TCR Vβ domain may therefore comprise the CDRs mentioned in detail above (by SEQ ID specifically i.e. SEQ ID NO:34, SEQ ID NO: 32 and SEQ ID NO: 33, or functional variants thereof), with appropriate intervening sequences between the CDRs.
The encoded TCR Vβ domain may have an amino acid sequence of SEQ ID NO: 37, or a functional variant thereof (i.e. wherein the variant TCR Vβ domain retains the ability to specifically bind to a Jchain antigen (e.g. the peptide shown in SEQ ID NO: 100) when part of a binding protein described herein). Such functional variants may be naturally occurring, synthetic, or synthetically improved functional variants of SEQ ID NO: 37. The term “variant” also encompasses homologues and fragments. Functional variants will typically contain only conservative substitutions of one or more amino acids of SEQ ID NO: 37, or substitution, deletion or insertion of non-critical amino acids in non-critical regions of the protein.
Non-functional variants are amino acid sequence variants of SEQ ID NO: 37 that do not specifically bind to a Jchain antigen (e.g. the peptide shown in SEQ ID NO: 100). Non-functional variants will typically contain a non-conservative substitution, a deletion, or insertion or premature truncation of the amino acid sequence of SEQ ID NO:37 or a substitution, insertion or deletion in critical amino acids or critical regions. Methods for identifying functional and non-functional variants are well known to a person of ordinary skill in the art.
In one example, the encoded TCR Vβ domain may have an amino acid sequence having at least 75%, at least 80%, at least 85% or at least 90% (or at least 91%, at least 92%, at least 93%, at least 94%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%) sequence identity to the amino acid sequence of SEQ ID NO: 37, whilst retaining the ability to specifically bind to a Jchain antigen (e.g. the peptide shown in SEQ ID NO: 100). In other words, a functional TCR Vβ domain with one or several amino acid substitutions compared to the sequence of SEQ ID NO: 37 is also encompassed. As stated previously, the amino acid substitution may be a conservative amino acid substitution. The variability in sequence compared to SEQ ID NO:37 may all be in regions of the TCR Vβ domain that do not form CDRs (i.e. the variant may have the CDRs of SEQ ID NO: 34, SEQ ID NO: 32 and/or SEQ ID NO: 33, and still have 25% (or less) sequence variability compared to SEQ ID NO: 37). In other words, the sequence of the CDRs of SEQ ID NO: 37 may be retained whilst the rest of the sequence is varied, as appropriate within the “at least 75% identity” parameters specified above. Suitably, percent identity can be calculated as the percentage of identity to the entire length of the reference sequence (e.g. SEQ ID NO: 37).
As an example, the encoded TCR Vβ domain may comprise an amino acid sequence having at least 75% (e.g. at least 75%, at least 80%, at least 85%, at least 90%, at least 95% etc) sequence identity to the amino acid sequence of SEQ ID NO: 37, wherein the TCR Vβ domain comprises a CDR3 having an amino acid sequence of SEQ ID NO: 34. In this example, the TCR Vβ domain CDR1 may have an amino acid sequence of SEQ ID NO:32 and the TCR Vβ domain CDR2 may have an amino acid sequence of SEQ ID NO: 33.
In examples where the TCR Vβ domain has the amino acid sequence of SEQ ID NO:37, the TCR Vβ domain may be encoded by the nucleic acid sequence of SEQ ID NO:38, or a genetically degenerate sequence thereof (i.e. other nucleic acid sequences that encode the same protein as a result of the degeneracy of the genetic code).
For the avoidance of doubt, the nucleic acid sequence encoding the TCR Vβ domain may also encode a TCR β chain constant domain. Examples of suitable constant domains are generally discussed above.
An example of a specific TCR β chain amino acid sequence that includes a TCR Vβ domain described herein and an appropriate constant domain is shown in SEQ ID NO: 41. Appropriate functional variants of SEQ ID NO: 41 are also encompassed (e.g. variants having at least 75% (e.g. at least 75%, at least 80%, at least 85%, at least 90%, at least 95% etc) sequence identity to the amino acid sequence of SEQ ID NO: 41, wherein the variant TCR β chain amino acid sequence retains its ability to specifically bind to a Jchain antigen (e.g. the peptide shown in SEQ ID NO: 100) when part of a binding protein described herein). In other words, a functional TCR β chain with one or several amino acid substitutions compared to the sequence of SEQ ID NO: 41 is also encompassed. As stated previously, the amino acid substitution may be a conservative amino acid substitution. The variability in sequence compared to SEQ ID NO:41 may all be in regions of the TCR β chain that do not form CDRs (i.e. the variant may have the CDRs of SEQ ID NO: 34, SEQ ID NO: 32 and/or SEQ ID NO: 33, and still have 25% (or less) sequence variability compared to SEQ ID NO:41. In other words, the sequence of the CDRs of SEQ ID NO: 41 may be retained whilst the rest of the sequence is varied, as appropriate within the “at least 75% identity” parameters specified above. Suitably, percent identity can be calculated as the percentage of identity to the entire length of the reference sequence (e.g. SEQ ID NO: 41).
As an example, the encoded TCR β chain may comprise an amino acid sequence having at least 75% (e.g. at least 75%, at least 80%, at least 85%, at least 90%, at least 95% etc) sequence identity to the amino acid sequence of SEQ ID NO: 41, wherein the TCR B chain comprises a CDR3 having an amino acid sequence of SEQ ID NO: 34. In this example, the TCR β chain CDR1 may have an amino acid sequence of SEQ ID NO: 32 and the TCR β chain CDR2 may have an amino acid sequence of SEQ ID NO: 33.
In examples where the TCR β chain has the amino acid sequence of SEQ ID NO:41, the TCR β chain may be encoded by the nucleic acid sequence of SEQ ID NO: 42, or a genetically degenerate sequence thereof (i.e. other nucleic acid sequences that encode the same protein as a result of the degeneracy of the genetic code). It is noted that SEQ ID NO:42 is the nucleic acid sequence for TCR β chain of clone 10H11.11.
In an example, the nucleic acid composition provided herein comprises a nucleic acid sequence that encodes a TCR Vβ domain comprising a CDR3 amino acid sequence having at least 80% sequence identity to SEQ ID NO:34, or a functional fragment thereof.
In another example, the CDR3 of the Vβ domain of a nucleic acid composition provided herein comprises or consists of the amino acid sequence of SEQ ID NO:34.
In a further example, the Vβ domain of a nucleic acid composition provided herein comprises an amino acid sequence having at least 80% sequence identity to, comprising, or consisting of, SEQ ID NO: 37.
The TCR Vβ domain sequences derived from TCR clone 10H11.11 discussed above are particularly compatible with the TCR Vα domain sequences derived from TCR clone 10H11.11 discussed elsewhere herein.
Accordingly, in one example, a nucleic acid composition described herein encodes a Jchain antigen-specific binding protein having TCR Vα domain comprising a CDR3 amino acid sequence having at least 80% sequence identity to SEQ ID NO:31, or a functional fragment thereof; and a nucleic acid sequence that encodes a TCR Vβ domain comprising a CDR3 amino acid sequence having at least 80% sequence identity to SEQ ID NO:34, or a functional fragment thereof.
In a particular example, a nucleic acid composition described herein encodes a Jchain antigen-specific binding protein having a TCR Vα domain with a CDR3 amino acid sequence comprising or consisting of the amino acid sequence of SEQ ID NO: 31; and a TCR Vβ domain with a CDR3 comprising or consisting of the amino acid sequence of SEQ ID NO:34. In addition, the Jchain antigen may comprise or consist of the sequence shown in SEQ ID NO: 100. Furthermore, the TCR Vα domain may be part of a TCR α chain having a constant domain and the TCR Vβ domain may be part of a TCR β chain having a constant domain.
In this particular example, the Vα domain may comprise an amino acid sequence having at least 80% sequence identity to, comprising, or consisting of, SEQ ID NO: 35; and the Vβ domain may comprise an amino acid sequence having at least 80% sequence identity to, comprising, or consisting of, SEQ ID NO: 37. In one example, the Vα domain comprises the amino acid sequence of SEQ ID NO: 35 and the Vβ domain comprises the amino acid sequence of SEQ ID NO: 37. In such cases, the Vα domain may be encoded by a nucleic acid sequence comprising the sequence of SEQ ID NO: 36; and the Vβ domain may be encoded by a nucleic acid sequence comprising the sequence of SEQ ID NO: 38.
In this particular example, the TCR Vα domain may include a CDR1 amino acid sequence comprising or consisting of the amino acid sequence of SEQ ID NO: 29 and a CDR2 amino acid sequence comprising or consisting of the amino acid sequence of SEQ ID NO:30. Furthermore, the TCR Vβ domain may include a CDR1 amino acid sequence comprising or consisting of the amino acid sequence of SEQ ID NO:32 and a CDR2 amino acid sequence comprising or consisting of the amino acid sequence of SEQ ID NO: 33.
For the avoidance of doubt, this particular example encompasses components of TCR clone 10H11.11 exemplified herein. The different components of TCR clone 10H11.11 and their respective SEQ ID Nos are summarised in Table 5 below.
As stated in more detail elsewhere herein, the nucleic acid composition described herein encodes both a TCR Vα domain and a TCR Vβ domain, which form the binding protein that is capable of specifically binding to a Jchain antigen. In examples where the TCR Vα domain and the TCR Vβ domain are encoded by the same nucleic acid sequence, the TCR Vα domain and TCR Vβ domain may be joined together via a linker. Suitable linkers are discussed generally elsewhere herein. Additional appropriate polypeptide domains that may also be encoded by the nucleic acid sequences that encode the TCR Vα domain and/or the TCR Vβ domain are also discussed generally elsewhere herein.
In one example, the nucleic acid composition described herein may encode a soluble TCR or a chimeric single chain TCR wherein the TCR alpha chain variable domain is linked to the TCR beta chain variable domain and a constant domain which is e.g. fused to the CD3 zeta signalling domain. These are discussed generally in more detail elsewhere herein.
A vector system is also provided which includes a nucleic acid composition described herein. The vector system may have one or more vectors. As discussed previously, the binding protein components that are encoded by the nucleic acid composition may be encoded by one or more nucleic acid sequences in the nucleic acid composition. In examples where all of the binding protein components are encoded by a single nucleic acid sequence, the nucleic acid sequence may be present within a single vector (and thus the vector system described herein may comprise of one vector only). In examples where the binding protein components are encoded by two or more nucleic acid sequences (wherein the plurality of nucleic acid sequences, together, encode all of the components of the binding protein) these two or more nucleic acid sequences may be present within one vector (e.g. in different open reading frames of the vector), or may be distributed over two or more vectors. In this example, the vector system will comprise a plurality of distinct vectors (i.e. vectors with different nucleotide sequences).
Accordingly, in one example, a vector system is provided, comprising a nucleic acid composition described herein.
Any appropriate vector can be used. By way of example only, the vector may be a plasmid, a cosmid, or a viral vector, such as a retroviral vector or a lentiviral vector. Adenovirus, adeno-associated virus, vaccinia virus, canary poxvirus, herpes virus, minicircle vectors and naked (synthetic) DNA/RNA may also be used (for details on minicircle vectors, see for example non-viral Sleeping Beauty transposition from minicircle vectors as published by R Monjezi et al., Leukemia 2017). Alternatively, single stranded or double stranded DNA or RNA can be used to transfect lymphocytes with a TCR of interest (see Roth et al 2018 Nature vol 559; page 405).
In one example, the vector is a plasmid, a viral vector, or a cosmid, optionally wherein the vector is selected from the group consisting of a retrovirus, lentivirus, adeno-associated virus, adenovirus, vaccinia virus, canary poxvirus, herpes virus, minicircle vector and synthetic DNA or RNA.
As used herein, the term “vector” refers to a nucleic acid sequence capable of transporting another nucleic acid sequence to which it has been operably linked. The vector can be capable of autonomous replication or it can integrate into a host DNA. The vector may include restriction enzyme sites for insertion of recombinant DNA and may include one or more selectable markers or suicide genes. The vector can be a nucleic acid sequence in the form of a plasmid, a bacteriophage or a cosmid. Preferably the vector is suitable for expression in a cell (i.e. the vector is an “expression vector”). Preferably, the vector is suitable for expression in a human T cell such as a CD8+ T cell or CD4+ T cell, or stem cell, iPS cell, or NK cell. In certain aspects, the vector is a viral vector, such as a retroviral vector, a lentiviral vector or an adeno-associated vector. Optionally, the vector is selected from the group consisting of an adenovirus, vaccinia virus, canary poxvirus, herpes virus, minicircle vector and synthetic DNA or synthetic RNA.
Preferably the (expression) vector is capable of propagation in a host cell and is stably transmitted to future generations.
The vector may comprise regulatory sequences. “Regulatory sequences” as used herein, refers to, DNA or RNA elements that are capable of controlling gene expression. Examples of expression control sequences include promoters, enhancers, silencers, TATA-boxes, internal ribosomal entry sites (IRES), attachment sites for transcription factors, transcriptional terminators, polyadenylation sites etc. Optionally, the vector includes one or more regulatory sequences operatively linked to the nucleic acid sequence to be expressed. Regulatory sequences include those which direct constitutive expression, as well as tissue-specific regulatory and/or inducible sequences.
Optionally, the vector comprises the nucleic acid sequence of interest operably linked to a promoter. “Promoter”, as used herein, refers to the nucleotide sequences in DNA to which RNA polymerase binds to start transcription. The promoter may be inducible or constitutively expressed. Alternatively, the promoter is under the control of a repressor or stimulatory protein. The promoter may be one that is not naturally found in the host cell (e.g. it may be an exogenous promoter). The skilled person in the art is well aware of appropriate promoters for use in the expression of target proteins, wherein the selected promoter will depend on the host cell.
“Operably linked” refers to a single or a combination of the below-described control elements together with a coding sequence in a functional relationship with one another, for example, in a linked relationship so as to direct expression of the coding sequence.
The vector may comprise a transcriptional terminator. “Transcriptional terminator” as used herein, refers to a DNA element, which terminates the function of RNA polymerases responsible for transcribing DNA into RNA. Preferred transcriptional terminators are characterized by a run of T residues preceded by a GC rich dyad symmetrical region.
The vector may comprise a translational control element. “Translational control element”, as used herein, refers to DNA or RNA elements that control the translation of mRNA. Preferred translational control elements are ribosome binding sites. Preferably, the translational control element is from a homologous system as the promoter, for example a promoter and its associated ribozyme binding site. Preferred ribosome binding sites are known, and will depend on the chosen host cell.
The vector may comprise restriction enzyme recognition sites. “Restriction enzyme recognition site” as used herein, refers to a motif on the DNA recognized by a restriction enzyme.
The vector may comprise a selectable marker. “Selectable marker” as used herein, refers to proteins that, when expressed in a host cell, confer a phenotype onto the cell which allows a selection of the cell expressing said selectable marker gene. Generally this may be a protein that confers a new beneficial property onto the host cell (e.g. antibiotic resistance) or a protein that is expressed on the cell surface and thus accessible for antibody binding. Appropriate selectable markers are well known in the art.
Optionally, the vector may also comprise a suicide gene. “Suicide gene” as used herein encodes a protein that induce death of the modified cell upon treatment with specific drugs. By way of example, suicide can be induced in cells modified by the herpes simplex virus thymidine kinase gene upon treatment with specific nucleoside analogs including ganciclovir, cells modified by human CD20 upon treatment with anti-CD20 monoclonal antibody and cells modified with inducible Caspase9 (iCasp9) upon treatment with AP1903 (reviewed by B S Jones, L S Lamb, F Goldman, A Di Stasi; Improving the safety of cell therapy products by suicide gene transfer. Front Pharmacol. (2014) 5:254). Appropriate suicide genes are well known in the art.
Preferably the vector comprises those genetic elements which are necessary for expression of the binding proteins described herein by a host cell. The elements required for transcription and translation in the host cell include a promoter, a coding region for the protein(s) of interest, and a transcriptional terminator.
A person of skill in the art will be well aware of the molecular techniques available for the preparation of (expression) vectors and how the (expression) vectors may be transduced or transfected into an appropriate host cell (thereby generating a modified cell described further below). The (expression) vector system described herein can be introduced into cells by conventional techniques such as transformation, transfection or transduction. “Transformation”, “transfection” and “transduction” refer generally to techniques for introducing foreign (exogenous) nucleic acid sequences into a host cell, and therefore encompass methods such as electroporation, microinjection, gene gun delivery, transduction with retroviral, lentiviral or adeno-associated vectors, lipofection, superfection etc. The specific method used typically depends on both the type of vector and the cell. Appropriate methods for introducing nucleic acid sequences and vectors into host cells such as human cells are well known in the art; see for example Sambrook et al (1989) Molecular Cloning, A Laboratory Manual, Cold Spring Harbor Laboratory, Cold Spring Harbor, N.Y; Ausubel et al (1987) Current Protocols in Molecular Biology, John Wiley and Sons, Inc., NY; Cohen et al (1972) Proc. Natl. Acad. Sci. USA 69, 2110; Luchansky et al (1988) Mol. Microbiol. 2, 637-646. Further conventional methods that are suitable for preparing expression vectors and introducing them into appropriate host cells are described in detail in WO2016/071758 for example.
It is understood that it some examples, the host cell is contacted with the vector system (e.g. viral vector) in vitro, ex vivo, and in some examples, the host cell is contacted with the vector system (e.g. viral vector) in vivo.
The term “host cell” includes any cell into which the nucleic acid composition or vector system described herein may be introduced. Once a nucleic acid molecule or vector system has been introduced into the cell, it may be referred to as a “modified cell” herein. Once the nucleic acid molecule or vector is introduced into the host cell, the resultant modified cell should be capable of expressing the encoded binding protein (and e.g. correctly localising the encoded binding protein for its intended function e.g. transporting the encoded binding protein to the cell surface).
The nucleic acid composition or vector system may be introduced into the cell using any conventional method known in the art. For example, the nucleic acid composition or vector system may be introduced using CRISPR technology. Insertion of the nucleic acid sequences at the endogenous TCR locus by engineering with CRISPR/Cas9 and homologous directed repair (HDR) or non-homologous end joining (NHEJ) is therefore encompassed. Other conventional methods such as transfection, transduction or transformation of the cell may also be used.
The term “modified cell” refers to a genetically altered (e.g. recombinant) cell. The modified cell includes at least one exogenous nucleic acid sequence (i.e. a nucleic acid sequence that is not naturally found in the host cell). In the context of the invention, the exogenous sequence comprises at least one of the T cell receptor component parts described herein for any of clones 4G8.8, 5D12.9, 13F6.6, 10H11.11, 5C8.16, 16C7.9 or 13D4.9 (e.g. the sequences etc that encode the CDR3 sequences that are specific for a Jchain antigen (e.g. the peptide of SEQ ID NO: 99, SEQ ID NO: 100 or SEQ ID NO: 101)). The term “modified cell” refers to the particular subject cell and also to the progeny or potential progeny of such a cell. Because certain modifications may occur in succeeding generations due to either mutation or environmental influences, such progeny may not, in fact, be identical to the parent cell, but are still included within the scope of the term as used herein.
In one example, a modified cell comprises a nucleic acid composition or a vector system provided herein.
The host cell (and thus the modified cell) is typically a eukaryotic cell, and particularly a human cell (e.g. a T cell such as a CD8+ T cell or a CD4+ T cell, or a mixture thereof, or a hematopoietic stem cell, an iPSC, or gamma-delta T cell, or a pluripotent stem cell, or a NK-T cell or NK cell). The host cell (and thus the modified cell) may be an autologous or allogeneic cell (e.g. such as a CD8+ T cell or a CD4+ T cell, or a mixture thereof, or a hematopoietic stem cell, an iPSC, or gamma-delta T cell, or a pluripotent stem cell, or a NK-T cell or NK cell). “Allogeneic cell” refers to a cell derived from a different individual to the individual to which it is later administered. In other words, the host cell (and thus the modified cell) may be an isolated cell from a distinct individual compared to the subject to be treated. “Autologous cell” refers to a cell derived from the individual to which it is also later administered. In other words, the host cell (and thus the modified cell) may be an isolated cell from the subject that is to be treated.
Accordingly, in an example, the modified cell is a human cell.
The host cell (and thus the modified cell) may be any cell that is able to confer anti-tumour immunity after TCR gene transfer. Non limiting examples of appropriate cells include autologous or allogeneic CD8 T cells, CD4 T cells, Natural Killer (NK) cells, NKT cells, gamma-delta T cells, inducible pluripotent stem cells (iPSCs), hematopoietic stem cells or other progenitor cells and any other autologous or allogeneic cell or cell line (NK-92 for example or T cell lines) that is able to confer anti-tumor immunity after TCR gene transfer.
Accordingly, in one example the modified cell is selected from the group consisting of a CD8 T cell, a CD4 T cell, an NK cell, an NK-T cell, a gamma-delta T cell, a hematopoietic stem cell, an inducible pluripotent stem cell, a progenitor cell, a T cell line and a NK-92 cell line.
In the context of the methods of treatment described herein, the host cell (and thus the modified cell) is typically for administration to a HLA-A*01:01, HLA-A*02:01, HLA-A*03:01, HLA-A*11:01, HLA-A*24:02 and/or HLA-B*40:01 positive human subject (e.g. to a HLA-A*01:01, HLA-A*02:01, HLA-A*03:01, HLA-A*11:01 and/or HLA-A*24:02 positive human subject). In view of this, the host cell (and thus the modified cell) is typically HLA-A*01:01, HLA-A*02:01, HLA-A*03:01, HLA-A*11:01, HLA-A*24:02 and/or HLA-B*40:01 positive but needs to be Jchain antigen negative (i.e. modified cells can either be HLA-A*01:01, HLA-A*02:01, HLA-A*03:01, HLA-A*11:01, HLA-A*24:02 and/or HLA-B*40:01 positive or negative).
In the context of the methods of treatment described herein, the host cell (and thus the modified cell) that is to be administered to the subject can either be autologous or allogeneic.
Advantageously, the modified cell is capable of expressing the binding protein encoded by the nucleic acid composition or vector system described herein (i.e. the TCR component parts) such that the modified cell provides an immunotherapy that specifically targets cells that express a Jchain antigen, and thus can be used to treat or prevent B cell associated diseases or conditions in a corresponding HLA-A*01:01, HLA-A*02:01, HLA-A*03:01, HLA-A*11:01 and/or HLA-A*24:02 positive human subject. More details on this use are given below.
The inventors have identified several Jchain derived peptides that are presented on MM cells in HLA-A*01:01 (HLA-A1), HLA-A*02:01 (HLA-A2), HLA-A*03:01 (HLA-A3), HLA-A*11:01 (HLA-A11), HLA-A*24:02 (HLA-A24), and HLA-B*40:01 (HLA-B40). Specifically, the inventors identified the Jchain derived peptide antigens YTAVVPLVY (SEQ ID NO:99),TAVVPLVY (SEQ ID NO:102), VLAVFIKAVHV (SEQ ID NO:103), VLAVFIKAV (SEQ ID NO:104), YTAVVPLV (SEQ ID NO:105), ISDPTSPLRTR (SEQ ID NO:101), RIIVPLNNR (SEQ ID NO:106), CYTAVVPLV (SEQ ID NO:100) and GETKMVETAL (SEQ ID NO:107).
In the context of an isolated peptide, the invention specifically provides for an isolated peptide comprising or consisting of an amino acid sequence selected from the group consisting of: YTAVVPLVY (SEQ ID NO:99), TAVVPLVY (SEQ ID NO: 102), VLAVFIKAVHV (SEQ ID NO: 103), VLAVFIKAV (SEQ ID NO:104), YTAVVPLV (SEQ ID NO:105), ISDPTSPLRTR (SEQ ID NO:101), RIIVPLNNR (SEQ ID NO:106), CYTAVVPLV (SEQ ID NO:100) and GETKMVETAL (SEQ ID NO:107).
In one example, the invention specifically provides for an isolated peptide comprising or consisting of an amino acid sequence selected from the group consisting of: YTAVVPLVY (SEQ ID NO:99), ISDPTSPLRTR (SEQ ID NO:101) and CYTAVVPLV (SEQ ID NO:100).
In a particular example, the invention specifically provides for an isolated peptide comprising or consisting of an amino acid sequence of YTAVVPLVY (SEQ ID NO:99).
In a particular example, the invention specifically provides for an isolated peptide comprising or consisting of an amino acid sequence of ISDPTSPLRTR (SEQ ID NO:101).
In a particular example, the invention specifically provides for an isolated peptide comprising or consisting of an amino acid sequence of CYTAVVPLV (SEQ ID NO:100).
As used herein, an “isolated peptide” refers to a peptide that is not in its natural environment. The peptide may therefore be of synthetic origin (or alternatively, of natural original, but isolated from its natural environment).
The isolated peptide may be relatively short (i.e. no more than 20 amino acids; e.g. no more than 19, 18, 17, 16, 15, 14, 13, 12, 11, or 10 amino acids).
In one example, the peptide may be no more than 20 amino acids long.
The peptide may consist of the amino acid sequence of SEQ ID NO:99, SEQ ID NO:100, SEQ ID NO:101 SEQ ID NO:102, SEQ ID NO:103, SEQ ID NO:104, SEQ ID NO:105, SEQ ID NO:106 or SEQ ID NO:107 only.
In a particular example, the peptide may consist of the amino acid sequence of SEQ ID NO:99, SEQ ID NO:100 or SEQ ID NO:101.
In a particular example, the peptide may consist of the amino acid sequence of SEQ ID NO:99.
In a particular example, the peptide may consist of the amino acid sequence of SEQ ID NO:100.
In a particular example, the peptide may consist of the amino acid sequence of SEQ ID NO:101.
The isolated peptide may be administered to a human subject in order to treat or prevent a B cell associated disease or condition, such as MM. For example, the isolated peptide may be administered to the subject in order to induce or enhance their immune response. The peptide may therefore be administered to the subject to induce T cell activation (e.g. in vivo T cell activation) in the subject, wherein the activated T cells are specific for the peptide (and thus will specifically target Jchain positive cells).
The isolated peptide may be administered as a peptide vaccine for treating or preventing a B cell associated disease or condition, such as MM. The isolated peptide may be administered to induce or enhance activation of T cells specific for Jchain positive cells.
The inventors have shown that YTAVVPLVY (SEQ ID NO:99), TAVVPLVY (SEQ ID NO: 102), VLAVFIKAVHV (SEQ ID NO: 103), VLAVFIKAV (SEQ ID NO:104), YTAVVPLV (SEQ ID NO:105), ISDPTSPLRTR (SEQ ID NO:101), RIIVPLNNR (SEQ ID NO:106), CYTAVVPLV (SEQ ID NO:100) and GETKMVETAL (SEQ ID NO:107) are Jchain derived peptides that are presented at the cell surface in the context of HLA and are recognized by T cells. Binding of these peptides to T cells has been demonstrated herein. These peptides therefore represent bonafide immunogenic Jchain-specific antigens that may be further exploited in the development of personalized vaccines, which may be particularly useful as an adjunct to other therapies (e.g. ACT as described herein). These immunogenic peptides can therefore be used as an immunotherapy in the form of peptide, RNA, DNA, dendritic cell based therapies, and adoptive TCR transgenic T cell-based therapies.
An isolated peptide as described herein may therefore be useful as an immunotherapy. For example, such isolated peptides may be used as an immunotherapy for subjects with, at risk of developing, or suspected of having a B cell associated disease or condition. Nucleic acid sequences and vectors encoding these peptides may also be useful for this purpose. Nucleic acid sequences and vectors encoding such isolated peptides are therefore also provided herein. A general discussion of vectors and nucleic acid sequences is provided elsewhere herein and applies equally to this aspect.
The particular peptide for administration may be chosen based on the HLA-A status of the subject. As explained elsewhere herein, a peptide comprising the sequence of SEQ ID NO:99 or SEQ ID NO: 102 may be particularly suitable for administration to a subject that is HLA-A*01:01 positive, whereas a peptide comprising the sequence of SEQ ID NO:103, SEQ ID NO:104, or SEQ ID NO:105 may be particularly suitable for administration to a subject that is positive for HLA-A*02:01.
Furthermore, a peptide comprising the sequence of SEQ ID NO:100 may be particularly suitable for administration to a subject that is positive for HLA-A*24:02, whereas a peptide comprising the sequence of SEQ ID NO:101 may be particularly suitable for administration to a subject that is positive for HLA-A*03:01 or HLA-A*11:01.
Furthermore, a peptide comprising the sequence of SEQ ID NO:106 may be particularly suitable for administration to a subject that is positive for HLA-A*11:01, whereas a peptide comprising the sequence of SEQ ID NO:107 may be particularly suitable for administration to a subject that is positive for HLA-B*40:01.
Isolated peptides of the invention may also be provided in compositions that comprise more than one of the peptides discussed above. By way of example, an isolated peptide may be provided (and/or administered) as a composition that comprises a mixture of (a) an isolated peptide comprising the amino acid sequence SEQ ID NO:99; and (b) an isolated peptide comprising the amino acid sequence SEQ ID NO:102. This composition may be particularly useful for administration to a subject that is HLA-A*01:01 positive as these peptides are both presented by this HLA-A serotype.
Alternatively, an isolated peptide may be provided (and/or administered) as a composition that comprises a mixture of two or three of: (a) an isolated peptide comprising the amino acid sequence SEQ ID NO:103; (b) an isolated peptide comprising the amino acid sequence SEQ ID NO:104; and (c) an isolated peptide comprising the amino acid sequence SEQ ID NO:105. This composition may be particularly useful for administration to a subject that is HLA-A*02:01 positive as these peptides are all presented by this HLA-A serotype.
In another example, an isolated peptide may be provided (and/or administered) as a composition that comprises a mixture of (a) an isolated peptide comprising the amino acid sequence SEQ ID NO:106; and (b) an isolated peptide comprising the amino acid sequence SEQ ID NO:101. This composition may be particularly useful for administration to a subject that is HLA-A*11:01 positive as these peptides are both presented by this HLA-A serotype.
A nucleic acid composition, vector system, modified cell, isolated peptide or isolated nucleic acid sequence described herein may be provided as part of a pharmaceutical composition. Advantageously, such compositions may be administered to a human subject in need thereof (as described elsewhere herein). A particularly suitable composition may be selected based on the HLA serotype of the human subject, as described in detail elsewhere herein.
A pharmaceutical composition may comprise a nucleic acid composition, vector system, modified cell, isolated peptide or isolated nucleic acid sequence described herein along with a pharmaceutically acceptable excipient, adjuvant, diluent and/or carrier.
Compositions may routinely contain pharmaceutically acceptable concentrations of salt, buffering agents, preservatives, compatible carriers, supplementary immune potentiating agents such as adjuvants and cytokines and optionally other therapeutic agents or compounds.
As used herein, “pharmaceutically acceptable” refers to a material that is not biologically or otherwise undesirable, i.e., the material may be administered to an individual along with the selected nucleic acid composition, vector system, modified cell, isolated peptide or isolated nucleic acid sequence without causing any undesirable biological effects or interacting in a deleterious manner with any of the other components of the pharmaceutical composition in which it is contained.
Excipients are natural or synthetic substances formulated alongside an active ingredient (e.g. a nucleic acid sequence, a nucleic acid composition, vector or vector system, modified cell, isolated peptide or isolated nucleic acid as provided herein), included for the purpose of bulking-up the formulation or to confer a therapeutic enhancement on the active ingredient in the final dosage form, such as facilitating drug absorption or solubility. Excipients can also be useful in the manufacturing process, to aid in the handling of the active substance concerned such as by facilitating powder flowability or non-stick properties, in addition to aiding in vitro stability such as prevention of denaturation over the expected shelf life. Pharmaceutically acceptable excipients are well known in the art. A suitable excipient is therefore easily identifiable by one of ordinary skill in the art. By way of example, suitable pharmaceutically acceptable excipients include water, saline, aqueous dextrose, glycerol, ethanol, and the like.
Adjuvants are pharmacological and/or immunological agents that modify the effect of other agents in a formulation. Pharmaceutically acceptable adjuvants are well known in the art. A suitable adjuvant is therefore easily identifiable by one of ordinary skill in the art.
Diluents are diluting agents. Pharmaceutically acceptable diluents are well known in the art. A suitable diluent is therefore easily identifiable by one of ordinary skill in the art.
Carriers are non-toxic to recipients at the dosages and concentrations employed and are compatible with other ingredients of the formulation. The term “carrier” denotes an organic or inorganic ingredient, natural or synthetic, with which the active ingredient is combined to facilitate the application. Pharmaceutically acceptable carriers are well known in the art. A suitable carrier is therefore easily identifiable by one of ordinary skill in the art.
In one example, the pharmaceutical composition comprises an isolated peptide described herein, an isolated nucleic acid sequence encoding the peptide or a vector system comprising said nucleic acid sequence, wherein the pharmaceutical composition is formulated as a vaccine (e.g. a composition that it used to stimulate the immune system and provide immunity against one or several diseases, where the composition acts as an antigen without inducing the disease).
Pharmaceutical compositions described herein may advantageously be administered to a HLA-A*01:01, HLA-A*02:01, HLA-A*03:01, HLA-A* 11:01, HLA-A*24:02 and/or HLA-B*40:01 positive human subject in need thereof (where certain compositions are more suitable for certain human subjects, based on their HLA status, as described in more detail elsewhere herein). In some examples, the pharmaceutical compositions described herein may advantageously be administered to a HLA-A*01:01, HLA-A*03:01, HLA-A*11:01 or HLA-A*24:02 positive human subject in need thereof.
Typically, the subject in need of treatment has a disease or condition that is associated with an elevated level of HLA-restricted Jchain antigens (i.e. Jchain antigens that are presented at the cell surface in the context of an HLA).
The disease or condition is typically a B cell associated disease or condition. As used herein, a B cell associated disease or condition is a disease or condition that is of plasma B cell, B cell or B cell precursor origin.
In one example, the B cell associated disease or condition may be a hematological malignancy. In other words, it may be a hematological malignancy of plasma cell, B-cell and B-cell precursor origin. Examples of appropriate hematological malignancies are well known in the art, and include, for example Multiple myeloma, plasma cell leukemia, (AL) Amyloidosis, Acute lymphoblastoid leukemia (ALL), Chronic lymphocytic leukemia (CLL), Waldenstrom macroglobulinemia or B cell lymphomas. Examples of B cell lymphomas include but are not limited to: Diffuse large B cell lymphoma (DLBCL), High grade B cell lymphoma, Mantel cell lymphoma (MCL), Follicular lymphoma (FL), and Burkitt Lymphoma.
In one example, the B cell associated disease or condition is multiple myeloma.
In an alternative example, the B cell associated disease or condition may be an autoimmune disease or disorder. In other words, it may be an autoimmune disease or disorder of plasma cell, B-cell and B-cell precursor origin. Examples of appropriate autoimmune diseases or disorders are well known in the art, and include, for example Rheumatoid arthritis, Multiple sclerosis, Vasculitis including Urticarial vasculitis, systemic vasculitis, renal vasculitis, Systemic lupus erythematosus (SLE), Autoimmune hemolytic anemia and Thrombocytopenia.
The B cell associated disease or condition may be a hyperproliferative disease or condition. For example, the B cell associated disease or condition may be a HLA-restricted Jchain antigen expressing tumor or cancer.
In an example, the pharmaceutical composition provided herein is for use in inducing or enhancing an immune response in human subject diagnosed with a B cell associated disease or condition.
As would be clear to a person skilled in the art, an appropriate therapy for subject in need thereof (e.g. an appropriate pharmaceutical composition described herein) may be selected based on the HLA serotype of the subject.
In one example, if the subject in need thereof is HLA-A*01:01 positive, an appropriate therapy (e.g. an appropriate pharmaceutical composition described herein) may comprise (i) components of TCR clone 4G8.8 exemplified herein; (ii) components of TCR clone 5D12.9 exemplified herein; or (iii) components of TCR clone 13F6.6 exemplified herein. Accordingly, TCRs comprising components of (i) TCR clone 4G8.8 exemplified herein; (ii) TCR clone 5D12.9 exemplified herein; or (iii) TCR clone 13F6.6 exemplified herein are particularly suitable for administration, or treating, stimulating, providing appropriate immunity (e.g. anti-tumor immunity etc) in HLA-A*01:01 positive human subjects. In a further example, an appropriate therapy (e.g. an appropriate pharmaceutical composition described herein) for a HLA-A*01:01 positive subject in need thereof may comprise an isolated peptide comprising or consisting of the amino acid sequence YTAVVPLVY (SEQ ID NO:99), an isolated nucleic acid encoding said peptide or a vector system encoding said nucleic acid.
In one example, if the subject in need thereof is HLA-A*24:02 positive, an appropriate therapy (e.g. an appropriate pharmaceutical composition described herein) may comprise components of TCR clone 10H11.11 exemplified herein. Accordingly, TCRs comprising components of TCR clone 10H11.11 exemplified herein are particularly suitable for administration, or treating, stimulating, providing appropriate immunity (e.g. anti-tumor immunity etc) in HLA-A*24:02 positive human subjects. In a further example, an appropriate therapy (e.g. an appropriate pharmaceutical composition described herein) for a HLA-A*24:02 positive subject in need thereof may comprise an isolated peptide comprising or consisting of the amino acid sequence CYTAVVPLV (SEQ ID NO:100), an isolated nucleic acid encoding said peptide or a vector system encoding said nucleic acid.
In one example, if the subject in need thereof is HLA-A*03:01 positive, an appropriate therapy (e.g. an appropriate pharmaceutical composition described herein) may comprise components of TCR clone 5C8.16 exemplified herein. Accordingly, TCRs comprising components of TCR clone 5C8.16 exemplified herein are particularly suitable for administration, or treating, stimulating, providing appropriate immunity (e.g. anti-tumor immunity etc) in HLA-A*03:01 positive human subjects. In a further example, an appropriate therapy (e.g. an appropriate pharmaceutical composition described herein) for a HLA-A*03:01 positive subject in need thereof may comprise an isolated peptide comprising or consisting of the amino acid sequence ISDPTSPLRTR (SEQ ID NO:101), an isolated nucleic acid encoding said peptide or a vector system encoding said nucleic acid.
In one example, if the subject in need thereof is HLA-A*11:01 positive, an appropriate therapy (e.g. an appropriate pharmaceutical composition described herein) may comprise (i) components of TCR clone 16C7.9 exemplified herein; or (ii) components of TCR clone 13D4.9 exemplified herein. Accordingly, TCRs comprising components of (i) TCR clone 16C7.9 exemplified herein; or (ii) TCR clone 13D4.9 exemplified herein are particularly suitable for administration, or treating, stimulating, providing appropriate immunity (e.g. anti-tumor immunity etc) in HLA-A*11:01 positive human subjects. In a further example, an appropriate therapy (e.g. an appropriate pharmaceutical composition described herein) for a HLA-A*11:01 positive subject in need thereof may comprise an isolated peptide comprising or consisting of the amino acid sequence ISDPTSPLRTR (SEQ ID NO:101), an isolated nucleic acid encoding said peptide or a vector system encoding said nucleic acid.
In a further example, an appropriate therapy (e.g. an appropriate pharmaceutical composition described herein) for a HLA-A*01:01 positive subject in need thereof may comprise an isolated peptide comprising or consisting of the amino acid sequence TAVVPLVY (SEQ ID NO:102), an isolated nucleic acid encoding said peptide or a vector system encoding said nucleic acid.
In a further example, an appropriate therapy (e.g. an appropriate pharmaceutical composition described herein) for a HLA-A*02:01 positive subject in need thereof may comprise an isolated peptide comprising or consisting of the amino acid sequence VLAVFIKAVHV (SEQ ID NO:103), an isolated nucleic acid encoding said peptide or a vector system encoding said nucleic acid.
In a further example, an appropriate therapy (e.g. an appropriate pharmaceutical composition described herein) for a HLA-A*02:01 positive subject in need thereof may comprise an isolated peptide comprising or consisting of the amino acid sequence VLAVFIKAV (SEQ ID NO:104), an isolated nucleic acid encoding said peptide or a vector system encoding said nucleic acid.
In a further example, an appropriate therapy (e.g. an appropriate pharmaceutical composition described herein) for a HLA-A*02:01 positive subject in need thereof may comprise an isolated peptide comprising or consisting of the amino acid sequence YTAVVPLV (SEQ ID NO:105), an isolated nucleic acid encoding said peptide or a vector system encoding said nucleic acid.
In a further example, an appropriate therapy (e.g. an appropriate pharmaceutical composition described herein) for a HLA-A*11:01 positive subject in need thereof may comprise an isolated peptide comprising or consisting of the amino acid sequence RIIVPLNNR (SEQ ID NO:106), an isolated nucleic acid encoding said peptide or a vector system encoding said nucleic acid.
In a further example, an appropriate therapy (e.g. an appropriate pharmaceutical composition described herein) for a HLA-B*40:01 positive subject in need thereof may comprise an isolated peptide comprising or consisting of the amino acid sequence GETKMVETAL (SEQ ID NO:107), an isolated nucleic acid encoding said peptide or a vector system encoding said nucleic acid.
The phrase “induced or enhanced immune response” refers to an increase in the immune response (e.g. a cell mediated immune response such as a T cell mediated immune response) of the subject during or after treatment compared to their immune response prior to treatment. An “induced or enhanced” immune response therefore encompasses any measurable increase in the immune response that is directly or indirectly targeted to the disease or condition being treated (or prevented).
In another example, the pharmaceutical composition may be for use in stimulating a cell mediated immune response to a target cell population or tissue in a human subject.
In another example, the pharmaceutical composition may be for use in stimulating a cell mediated immune response to a target cell population or tissue in a human subject. In such an example, the target cell population or tissue may be a HLA-restricted Jchain antigen expressing target cell population or tissue. Typically, it is a HLA-restricted Jchain antigen expressing target cell population or tissue. For example, it may be a target cell population or tissue comprising a HLA-restricted Jchain antigen expressing tumor or cancer.
The pharmaceutical composition may also be for use in providing anti-tumor immunity to a human subject.
In another example, the pharmaceutical composition may be for use in treating a human subject having a disease or condition associated with an elevated level of HLA-restricted Jchain antigen. Typically, the disease or condition associated with an elevated level of HLA-restricted Jchain antigen may be a B cell associated disease or condition.
A person of skill in the art will be fully aware of B cell associated diseases or conditions that may be treated in accordance with the invention. Appropriate examples of such diseases or conditions are discussed elsewhere herein.
As would be clear to a person skilled in the art, the B cell associated diseases or conditions may comprise at least one tumor (particularly, at least one HLA-restricted Jchain antigen expressing tumor).
As used herein, the terms “treat”, “treating” and “treatment” are taken to include an intervention performed with the intention of preventing the development or altering the pathology of a condition, disorder or symptom (e.g. a B cell associated disease or condition). Accordingly, “treatment” refers to both therapeutic treatment and prophylactic or preventative measures, wherein the object is to prevent or slow down (lessen) the targeted condition, disorder or symptom. “Treatment” therefore encompasses a reduction, slowing or inhibition of the amount or concentration of target cells, for example as measured in a sample obtained from the subject, of at least 5%, 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 90% or 100% when compared to the amount or concentration of target cells before treatment. Methods of measuring the amount or concentration of target cells include, for example, qRT-PCR, and quantification of disease specific biomarkers in a sample obtained from the subject.
As used herein the term “subject” refers to an individual, e.g., a human, having or at risk of having a specified condition, disorder or symptom. The subject may be a patient i.e. a subject in need of treatment in accordance with the invention. The subject may have received treatment for the condition, disorder or symptom. Alternatively, the subject has not been treated prior to treatment in accordance with the present invention.
The compositions described herein can be administered to the subject by any conventional route, including injection or by gradual infusion over time. The administration may, for example, be by infusion or by intramuscular, intravascular, intracavity, intracerebral, intralesional, rectal, subcutaneous, intradermal, epidural, intrathecal, percutaneous administration.
The compositions described herein may be in any form suitable for the above modes of administration. For example, compositions comprising modified cells may in any form suitable for infusion. As further examples, suitable forms for parenteral injection (including, subcutaneous, intramuscular, intravascular or infusion) include a sterile solution, suspension or emulsion.
Alternatively, the route of administration may be by direct injection into the target area, or by regional delivery or by local delivery. The identification of suitable dosages of the compositions of the invention is well within the routine capabilities of a person of skill in the art.
Advantageously, the compositions described herein may be formulated for use in T cell receptor (TCR) gene transfer, an approach that is rapid, reliable and capable of generating large quantities of T cells with specificity for the Jchain antigenic peptides (e.g. the peptides shown in any one of SEQ ID NO:99 to 107), regardless of the patient's pre-existing immune repertoire. Using TCR gene transfer, modified cells suitable for infusion may be generated within a few days.
The compositions described herein are for administration in an effective amount. An “effective amount” is an amount that alone, or together with further doses, produces the desired (therapeutic or non-therapeutic) response. The effective amount to be used will depend, for example, upon the therapeutic (or non-therapeutic) objectives, the route of administration, and the condition of the patient/subject. For example, the suitable dosage of the composition of the invention for a given patient/subject will be determined by the attending physician (or person administering the composition), taking into consideration various factors known to modify the action of the composition of the invention for example severity and type of haematological malignancy, body weight, sex, diet, time and route of administration, other medications and other relevant clinical factors. The dosages and schedules may be varied according to the particular condition, disorder or symptom the overall condition of the patient/subject. Effective dosages may be determined by either in vitro or in vivo methods.
The pharmaceutical compositions described herein are advantageously presented in unit dosage form.
Methods of Generating Binding Proteins (e.g. TCRs)
A method of generating a binding protein that is capable of specifically binding to a peptide containing a Jchain antigen and does not bind to a peptide that does not contain the Jchain antigen is also provided, comprising contacting a nucleic acid composition (or vector system) described herein with a cell under conditions in which the nucleic acid composition is incorporated and expressed by the cell.
In the context of the binding proteins described herein, the Jchain antigen comprises or consists of a sequence comprising an amino acid sequence selected from the group consisting of: SEQ ID NO:99 to 107, or a functional fragment or variant thereof.
The method may be carried out on the (host) cell ex vivo or in vitro. Alternatively, the method may be performed in vivo, wherein the nucleic acid composition (or vector system) is administered to the subject and is contacted with the cell in vivo, under conditions in which the nucleic acid sequence is incorporated and expressed by the cell to generate the binding protein. In one example, the method is not a method of treatment of the human or animal body.
Appropriate in vivo, in vitro and ex vivo methods for contacting a nucleic acid sequence (or vector systems) with a cell under conditions in which the nucleic acid sequence (or vector) is incorporated and expressed by the cell are well known, as described elsewhere herein.
As stated elsewhere herein, the binding protein comprise a TCR, an antigen binding fragment of a TCR, a ImmTAC or a chimeric antigen receptor (CAR). Further details are provided elsewhere herein.
The binding proteins described herein may be used therapeutically, as described elsewhere herein. Furthermore, the binding proteins may be used in a diagnostic setting, e.g. to detect the presence of Jchain presented in the context of an appropriate HLA at the cell surface of diseased/malignant tissues.
As used herein “nucleic acid sequence”, “polynucleotide”, “nucleic acid” and “nucleic acid molecule” are used interchangeably to refer to an oligonucleotide sequence or polynucleotide sequence. The nucleotide sequence may be of genomic, synthetic or recombinant origin, and may be double-stranded or single-stranded (representing the sense or antisense strand). The term “nucleotide sequence” includes genomic DNA, cDNA, synthetic DNA, and RNA (e.g. mRNA) and analogs of the DNA or RNA generated, e.g., by the use of nucleotide analogs. In one example, the nucleotide sequence lacks introns. In other words, it is an intronless nucleic acid sequence. For example, the nucleotide sequence may be a DNA sequence that does not comprise intron sequences.
As used herein, “isolated nucleic acid sequence” or “isolated nucleic acid composition” refers to a nucleic acid sequence that is not in its natural environment when it is linked to its naturally associated sequence(s) that is/are also in its/their natural environment. In otherwords, an isolated nucleic acid sequence/composition is not a native nucleotide sequence/composition, wherein “native nucleotide sequence/composition” means an entire nucleotide sequence that is in its native environment and when operatively linked to an entire promoter with which it is naturally associated, which promoter is also in its native environment. Such a nucleic acid could be part of a vector and/or such nucleic acid or polypeptide could be part of a composition {e.g., a cell lysate), and still be isolated in that such vector or composition is not part of the natural environment for the nucleic acid or polypeptide. The term “gene” means the segment of DNA involved in producing a polypeptide chain; it includes regions preceding and following the coding region (“leader and trailer”) as well as intervening sequences (introns) between individual coding segments (exons).
The nucleic acid sequences of the invention may be a non-naturally occurring nucleic acid sequence (e.g. it may be that the entire sequence does not occur in its entirety in nature). For example, the nucleic acid sequence of the invention may be operably linked to a promoter, wherein the promoter is not naturally associated with equivalent human nucleic acid sequences in nature (e.g. human TCR sequences or fragments thereof); i.e. it is not the entire promoter that is naturally associated with the nucleic acid in its natural environment. In this context, such promoters may be considered exogenous promoters. Examples of appropriate promoters are described elsewhere.
As used herein “specifically binds” or “specific for” refers to an association or union of a binding protein (e.g., TCR receptor) or a binding domain (or fusion protein thereof) to a target molecule with an affinity or Ka (i.e., an equilibrium association constant of a particular binding interaction with units of 1/M) equal to or greater than 105M−1 (which equals the ratio of the on-rate [kon] to the off-rate [koff] for this association reaction), while not significantly associating or uniting with any other molecules or components in a sample. Binding proteins or binding domains (or fusion proteins thereof) may be classified as “high affinity” binding proteins or binding domains (or fusion proteins thereof) or as “low affinity” binding proteins or binding domains (or fusion proteins thereof). “High affinity” binding proteins or binding domains refer to those binding proteins or binding domains having a Ka of at least 107 M−1, at least 108 M−1, at least 109 M−1, at least 1010 M−1, at least 1011 M−1, at least 1012 M−1, or at least 1013 M−1. Low affinity” binding proteins or binding domains refer to those binding proteins or binding domains having a Ka of up to 107 M−1 up to 106 M−1, up to 105 M−1. Alternatively, affinity can be defined as an equilibrium dissociation constant (Kd) of a particular binding interaction with units of M (e.g., 10−5 M to 10−13 M).
In certain embodiments, a receptor or binding domain may have “enhanced affinity,” which refers to selected or engineered receptors or binding domains with stronger binding to a target antigen than a wild type (or parent) binding domain. For example, enhanced affinity may be due to a Ka (equilibrium association constant) for the target antigen that is higher than the wild type binding domain, due to a Kd (dissociation constant) for the target antigen that is less than that of the wild type binding domain, due to an off-rate (koff) for the target antigen that is less than that of the wild type binding domain, or a combination thereof. In certain embodiments, enhanced affinity TCRs can be codon optimized to enhance expression in a particular host cell, such as a cell of the immune system, a inducible pluripotent stem cell (iPSC), a hematopoietic stem cell, a T cell, a primary T cell, a T cell line, a NK cell, or a natural killer T cell (Scholten et al, Clin. Immunol. 119: 135, 2006). The T cell can be a CD4+ or a CD8+ T cell, or gamma-delta T cell.
As used herein, the term “Jchain antigen” or “Jchain peptide antigen” or “Jchain-containing peptide antigen” refers to a naturally or synthetically produced peptide portion of a Jchain protein ranging in length from about 7 amino acids, about 8 amino acids, about 9 amino acids, about 10 amino acids, up to about 20 amino acids, which can form a complex with a MHC (e.g., HLA) molecule, and a binding protein of this disclosure specific for a Jchain peptide:MHC (e.g., HLA) complex can specifically bind to such as complex. Typically, for the purposes of this disclosure, the Jchain peptide antigen comprises or consists of an amino acid sequence selected from the group consisting of: YTAVVPLVY (SEQ ID NO:99), TAVVPLVY (SEQ ID NO:102), VLAVFIKAVHV (SEQ ID NO:103), VLAVFIKAV (SEQ ID NO:104), YTAVVPLV (SEQ ID NO:105), ISDPTSPLRTR (SEQ ID NO:101), RIIVPLNNR (SEQ ID NO:106), CYTAVVPLV (SEQ ID NO:100) and GETKMVETAL (SEQ ID NO:107). In some examples, the Jchain peptide antigen comprises or consists of an amino acid sequence selected from the group consisting of: YTAVVPLVY (SEQ ID NO:99), ISDPTSPLRTR (SEQ ID NO:101) and CYTAVVPLV (SEQ ID NO:100). Additionally, for the purposes of this disclosure, the Jchain peptide antigen:HLA complex typically comprises a peptide:HLA complex selected from the group consisting of: a YTAVVPLVY:HLA-A*01:01 complex; a TAVVPLVY:HLA-A*01:01 complex; a VLAVFIKAVHV:HLA-A*02:01 complex; a VLAVFIKAV:HLA-A*02:01 complex; a YTAVVPLV:HLA-A*02:01 complex; a ISDPTSPLRTR:HLA-A*03:01 complex; a RIIVPLNNR:HLA-A*11:01 complex; a ISDPTSPLRTR:HLA-A*11:01 complex; a CYTAVVPLV:HLA-A*24:02 complex; and a GETKMVETAL:HLA-B*40:01 complex. In some examples, the Jchain peptide antigen:HLA complex comprises a peptide:HLA complex selected from the group consisting of: a YTAVVPLVY:HLA-A*01:01 complex; a ISDPTSPLRTR:HLA-A*03:01 complex; a ISDPTSPLRTR:HLA-A*11:01 complex; and a CYTAVVPLV:HLA-A*24:02 complex.
The term “Jchain-specific binding protein,” as used herein, refers to a protein or polypeptide, such as a TCR or CAR, that specifically binds to a Jchain peptide antigen (or to a Jchain peptide antigen:HLA complex, e.g., on a cell surface), and does not bind a peptide sequence that does not include the Jchain peptide antigen. Typically, for the purposes of this disclosure, the Jchain peptide antigen comprises or consists of an amino acid sequence selected from the group consisting of: YTAVVPLVY (SEQ ID NO:99), TAVVPLVY (SEQ ID NO:102), VLAVFIKAVHV (SEQ ID NO:103), VLAVFIKAV (SEQ ID NO:104), YTAVVPLV (SEQ ID NO:105), ISDPTSPLRTR (SEQ ID NO:101), RIIVPLNNR (SEQ ID NO:106), CYTAVVPLV (SEQ ID NO:100) and GETKMVETAL (SEQ ID NO:107), and the Jchain peptide antigen:HLA complex comprises a peptide:HLA complex selected from the group consisting of: a YTAVVPLVY:HLA-A*01:01 complex; a TAVVPLVY:HLA-A*01:01 complex; a VLAVFIKAVHV:HLA-A*02:01 complex; a VLAVFIKAV:HLA-A*02:01 complex; a YTAVVPLV:HLA-A*02:01 complex; a ISDPTSPLRTR:HLA-A*03:01 complex; a RIIVPLNNR:HLA-A*11:01 complex; a ISDPTSPLRTR:HLA-A*11:01 complex; a CYTAVVPLV:HLA-A*24:02 complex; and a GETKMVETAL:HLA-B*40:01 complex, as appropriate.
In certain embodiments, a Jchain-specific binding protein specifically binds to a Jchain peptide antigen (or a Jchain peptide antigen:HLA complex) with a Kd of less than about 10−8 M, less than about 10−9 M, less than about 10−10 M, less than about 10−11 M, less than about 10−12 M, or less than about 10−13 M, or with an affinity that is about the same as, at least about the same as, or is greater than at or about the affinity exhibited by an exemplary Jchain-specific binding protein provided herein, such as any of the Jchain-specific TCRs provided herein, for example, as measured by the same assay. In certain embodiments, a Jchain-specific binding protein comprises a Jchain-specific immunoglobulin superfamily binding protein or binding portion thereof. Typically, for the purposes of this disclosure, the Jchain peptide antigen comprises or consists of an amino acid sequence selected from the group consisting of: YTAVVPLVY (SEQ ID NO:99), TAVVPLVY (SEQ ID NO:102), VLAVFIKAVHV (SEQ ID NO:103), VLAVFIKAV (SEQ ID NO:104), YTAVVPLV (SEQ ID NO:105), ISDPTSPLRTR (SEQ ID NO:101), RIIVPLNNR (SEQ ID NO:106), CYTAVVPLV (SEQ ID NO:100) and GETKMVETAL (SEQ ID NO:107), and the Jchain peptide antigen:HLA complex comprises a peptide:HLA complex selected from the group consisting of: a YTAVVPLVY:HLA-A*01:01 complex; a TAVVPLVY:HLA-A*01:01 complex; a VLAVFIKAVHV:HLA-A*02:01 complex; a VLAVFIKAV:HLA-A*02:01 complex; a YTAVVPLV:HLA-A*02:01 complex; a ISDPTSPLRTR:HLA-A*03:01 complex; a RIIVPLNNR:HLA-A*11:01 complex; a ISDPTSPLRTR:HLA-A*11:01 complex; a CYTAVVPLV:HLA-A*24:02 complex; and a GETKMVETAL:HLA-B*40:01 complex, as appropriate.
The selective binding may be in the context of Jchain antigen presentation by HLA-A*01:01, HLA-A*02:01, HLA-A*03:01, HLA-A*11:01, HLA-A*24:02 and/or HLA-B*40:01, in particular in the context of HLA-A*01:01, HLA-A*03:01, HLA-A*11:01 and/or HLA-A*24:02. In other words, in certain embodiments, a binding protein that “specifically binds to a Jchain antigen” may only do so when it is being presented (i.e. it is bound by) by a specific HLA or is in an equivalent structural formation as when it is being presented by the specific HLA. As discussed elsewhere herein, the inventors identified that Jchain derived peptides YTAVVPLVY (SEQ ID NO:99) and TAVVPLVY (SEQ ID NO:102) are capable of being presented by HLA-A*01:01; that Jchain derived peptides VLAVFIKAVHV (SEQ ID NO:103), VLAVFIKAV (SEQ ID NO:104) and YTAVVPLV (SEQ ID NO:105) are capable of being presented by HLA-A*02:01; that Jchain derived peptide ISDPTSPLRTR (SEQ ID NO:101) is capable of being presented by HLA-A*03:01 and HLA-A*11:01; that Jchain derived peptide RIIVPLNNR (SEQ ID NO:106) is capable of being presented by HLA-A*11:01; that Jchain derived peptide CYTAVVPLV (SEQ ID NO:100) is capable of being presented by HLA-A*24:02; and that Jchain derived peptide GETKMVETAL (SEQ ID NO:107) is capable of being presented by HLA-B*40:01.
Accordingly, for example, in certain examples, a binding protein that “specifically binds to a Jchain antigen”, in particular YTAVVPLVY (SEQ ID NO:99), may only do so when it is being presented (i.e. it is bound by) HLA-A*01:01 or is in an equivalent structural formation as when it is being presented by HLA-A*01:01. In another example, a binding protein that “specifically binds to a Jchain antigen”, in particular ISDPTSPLRTR (SEQ ID NO:101) may only do so when it is being presented (i.e. it is bound by) HLA-A*11:01 and/or HLA-A*03:01, or is in an equivalent structural formation as when it is being presented by HLA-A*11:01 and/or HLA-A*03:01. In another example, a binding protein that “specifically binds to a Jchain antigen”, in particular CYTAVVPLV (SEQ ID NO:100) may only do so when it is being presented (i.e. it is bound by) HLA-A*24:02 or is in an equivalent structural formation as when it is being presented by HLA-A*24:02.
By “specifically bind(s) to” as it relates to a T cell receptor, or as it refers to a recombinant T cell receptor, nucleic acid fragment, variant, or analog, or a modified cell, such as, for example, the Jchain T cell receptors, and Jchain-expressing modified cells herein, is meant that the T cell receptor, or fragment thereof, recognizes, or binds selectively to a Jchain antigen (e.g. wherein the Jchain antigen comprises an amino acid sequence selected from the group consisting of: YTAVVPLVY (SEQ ID NO:99), TAVVPLVY (SEQ ID NO:102), VLAVFIKAVHV (SEQ ID NO:103), VLAVFIKAV (SEQ ID NO:104), YTAVVPLV (SEQ ID NO:105), ISDPTSPLRTR (SEQ ID NO:101), RIIVPLNNR (SEQ ID NO:106), CYTAVVPLV (SEQ ID NO:100) and GETKMVETAL (SEQ ID NO:107)). Under certain conditions, for example, in an immunoassay, for example an immunoassay discussed herein, the T cell receptor binds to JChain (e.g. an amino acid sequence selected from the group consisting of: YTAVVPLVY (SEQ ID NO:99), TAVVPLVY (SEQ ID NO:102), VLAVFIKAVHV (SEQ ID NO:103), VLAVFIKAV (SEQ ID NO:104), YTAVVPLV (SEQ ID NO:105), ISDPTSPLRTR (SEQ ID NO:101), RIIVPLNNR (SEQ ID NO:106), CYTAVVPLV (SEQ ID NO:100) and GETKMVETAL (SEQ ID NO:107)) and does not bind in a significant amount to other polypeptides. Thus the T cell receptor may bind to Jchain (e.g. an amino acid sequence selected from the group consisting of: YTAVVPLVY (SEQ ID NO:99), TAVVPLVY (SEQ ID NO:102), VLAVFIKAVHV (SEQ ID NO:103), VLAVFIKAV (SEQ ID NO:104), YTAVVPLV (SEQ ID NO:105), ISDPTSPLRTR (SEQ ID NO:101), RIIVPLNNR (SEQ ID NO:106), CYTAVVPLV (SEQ ID NO:100) and GETKMVETAL (SEQ ID NO:107)) with at least 10, 100, or 1000, fold more affinity than to a control antigenic polypeptide. This binding may also be determined indirectly in the context of a modified T cell that expresses a Jchain TCR. In assays such as, for example, an assay discussed herein, the modified T cell is specifically reactive against a multiple myeloma cell line and at least one malignant B cell lines such as, for example, ALL, CLL and mantle cell lymphoma cell lines. Thus, the modified Jchain-expressing T cell binds to a multiple myeloma cell line or a malignant B cell line with at least 10, 100, or 1000, fold more reactivity when compared to its reactivity against a control cell line that is not a multiple myeloma cell line or a malignant B cell line.
A “non-essential” (or “non-critical”) amino acid residue is a residue that can be altered from the wild-type sequence of (e.g., the sequence identified by SEQ ID NO herein) without abolishing or, more preferably, without substantially altering a biological activity, whereas an “essential” (or “critical”) amino acid residue results in such a change. For example, amino acid residues that are conserved are predicted to be particularly non-amenable to alteration, except that amino acid residues within the hydrophobic core of domains can generally be replaced by other residues having approximately equivalent hydrophobicity without significantly altering activity.
A “conservative amino acid substitution” is one in which the amino acid residue is replaced with an amino acid residue having a similar side chain. Families of amino acid residues having similar side chains have been defined in the art. These families include amino acids with basic side chains (e.g., lysine, arginine, histidine), acidic side chains (e.g., aspartic acid, glutamic acid), uncharged polar side chains (e.g., glycine, asparagine, glutamine, serine, threonine, tyrosine, cysteine), non-polar side chains (e.g., alanine, valine, leucine, isoleucine, proline, phenylalanine, methionine, tryptophan), beta-branched side chains (e.g., threonine, valine, isoleucine) and aromatic side chains (e.g., tyrosine, phenylalanine, tryptophan, histidine). Thus, a nonessential (or non-critical) amino acid residue in a protein is preferably replaced with another amino acid residue from the same side chain family. Alternatively, in another embodiment, mutations can be introduced randomly, and the resultant mutants can be screened for activity to identify mutants that retain activity.
Calculations of sequence homology or identity (the terms are used interchangeably herein) between sequences are performed as follows.
To determine the percent identity of two amino acid sequences, or of two nucleic acid sequences, the sequences are aligned for optimal comparison purposes (e.g., gaps can be introduced in one or both of a first and a second amino acid or nucleic acid sequence for optimal alignment and non-homologous sequences can be disregarded for comparison purposes). In a preferred embodiment, the length of a reference sequence aligned for comparison purposes is at least 30%, preferably at least 40%, more preferably at least 50%, even more preferably at least 60%, and even more preferably at least 70%, 75%, 80%, 82%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% of the length of the reference sequence. The amino acid residues or nucleotides at corresponding amino acid positions or nucleotide positions are then compared. When a position in the first sequence is occupied by the same amino acid residue or nucleotide as the corresponding position in the second sequence, then the molecules are identical at that position (as used herein amino acid or nucleic acid “identity” is equivalent to amino acid or nucleic acid “homology”). The percent identity between the two sequences is a function of the number of identical positions shared by the sequences, taking into account the number of gaps, and the length of each gap, which need to be introduced for optimal alignment of the two sequences.
The comparison of sequences and determination of percent identity between two sequences can be accomplished using a mathematical algorithm. In a preferred embodiment, the percent identity between two amino acid sequences is determined using the Needleman et al. (1970) J. Mol. Biol. 48:444-453) algorithm which has been incorporated into the GAP program in the GCG software package (available at http://www.gcg.com), using either a BLOSUM 62 matrix or a PAM250 matrix, and a gap weight of 16, 14, 12, 10, 8, 6, or 4 and a length weight of 1, 2, 3, 4, 5, or 6. In yet another preferred embodiment, the percent identity between two nucleotide sequences is determined using the GAP program in the GCG software package (available at http://www.gcg.com), using a NWSgapdna.CMP matrix and a gap weight of 40, 50, 60, 70, or 80 and a length weight of 1, 2, 3, 4, 5, or 6. A particularly preferred set of parameters (and the one that should be used if the practitioner is uncertain about what parameters should be applied to determine if a molecule is within a sequence identity or homology limitation of the invention) are a BLOSUM 62 scoring matrix with a gap penalty of 12, a gap extend penalty of 4, and a frameshift gap penalty of 5.
Alternatively, the percent identity between two amino acid or nucleotide sequences can be determined using the algorithm of Meyers et al. (1989) CABIOS 4:11-17) which has been incorporated into the ALIGN program (version 2.0), using a PAM120 weight residue table, a gap length penalty of 12 and a gap penalty of 4.
The nucleic acid and protein sequences described herein can be used as a “query sequence” to perform a search against public databases to, for example, identify other family members or related sequences. Such searches can be performed using the NBLAST and XBLAST programs (version 2.0) of Altschul, et al. (1990) J. Mol. Biol. 215:403-410). BLAST nucleotide searches can be performed with the NBLAST program, score=100, wordlength=12 to obtain nucleotide sequences homologous to nucleic acid molecules of the invention. BLAST protein searches can be performed with the XBLAST program, score=50, wordlength=3 to obtain amino acid sequences homologous to protein molecules of the invention. To obtain gapped alignments for comparison purposes, gapped BLAST can be utilized as described in Altschul et al. (1997, Nucl. Acids Res. 25:3389-3402). When using BLAST and gapped BLAST programs, the default parameters of the respective programs (e.g., XBLAST and NBLAST) can be used. See <http://www.ncbi.nlm.nih.gov>.
The polypeptides and nucleic acid molecules described herein can have amino acid sequences or nucleic acid sequences sufficiently or substantially identical to the sequences identified by SEQ ID NO. The terms “sufficiently identical” or “substantially identical” are used herein to refer to a first amino acid or nucleotide sequence that contains a sufficient or minimum number of identical or equivalent (e.g. with a similar side chain) amino acid residues or nucleotides to a second amino acid or nucleotide sequence such that the first and second amino acid or nucleotide sequences have a common structural domain or common functional activity. In other words, amino acid sequences or nucleic acid sequences having one or several (e.g. two, three, four etc) amino acid or nucleic acid substitutions compared to the corresponding sequences identified by SEQ ID NO may be sufficiently or substantially identical to the sequences identified by SEQ ID NO (provided that they retain the requisite functionality). In such examples, the one or several (e.g. two, three, four etc) amino acid or nucleic acid substitutions may be conservative substitutions. For example, amino acid or nucleotide sequences that contain a common structural domain having at least about 60%, or 65% identity, likely 75% identity, more likely 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% identity are defined herein as sufficiently or substantially identical.
TCR sequences are defined according to IMGT. See the LeFranc references herein for further details i.e. [1]Lefranc M.-P. “Unique database numbering system for immunogenetic analysis” Immunology Today, 18: 509 (1997). [2]Lefranc M.-P. “The IMGT unique numbering for immunoglobulins, T cell Receptors and Ig-like domains” The immunologist, 7, 132-136 (1999). [3]Lefranc M.-P. et al. “IMGT unique numbering for immunoglobulin and Tcell receptor variable domains and Ig superfamily V-like domains” Dev. Comp. Immunol., 27, 55-77 (2003). [4]Lefranc M.-P. et al. “IMGT unique numbering for immunoglobulin and T cell receptor constant domains and Ig superfamily C-like domains” Dev. Comp. Immunol., 2005, 29, 185-203 PMID: 15572068.
As used herein, the term “ex vivo” refers to “outside” the body. The term “in vitro” can be used to encompass “ex vivo” components, compositions and methods.
Unless defined otherwise herein, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this invention pertains. For example, Singleton and Sainsbury, Dictionary of Microbiology and Molecular Biology, 2d Ed., John Wiley and Sons, NY (1994); and Hale and Marham, The Harper Collins Dictionary of Biology, Harper Perennial, NY (1991) provide those of skill in the art with a general dictionary of many of the terms used in the invention. Although any methods and materials similar or equivalent to those described herein find use in the practice of the present invention, the preferred methods and materials are described herein. Accordingly, the terms defined immediately below are more fully described by reference to the Specification as a whole. Also, as used herein, the singular terms “a”, “an,” and “the” include the plural reference unless the context clearly indicates otherwise. Unless otherwise indicated, nucleic acids are written left to right in 5′ to 3′ orientation; amino acid sequences are written left to right in amino to carboxy orientation, respectively. It is to be understood that this invention is not limited to the particular methodology, protocols, and reagents described, as these may vary, depending upon the context they are used by those of skill in the art.
Aspects of the invention are demonstrated by the following non-limiting examples.
The inventors recently identified the joining chain (Jchain) as a potential target antigen for MM, as it was highly expressed in patient MM samples but not in healthy tissues of non-B cell origins(11). As Jchain is located intracellularly, TCR mediated targeting of the Jchain derived peptides presented in HLA on the surface of target cells is required for them to be bonafide antigens for T cell based targeting of MM.
As described herein, the inventors identified Jchain derived peptides presented on MM cells in HLA-A*01:01 (HLA-A1), HLA-A*02:01 (HLA-A2), HLA-A*03:01 (HLA-A3), HLA-A*11:01 (HLA-A11), HLA-A*24:02 (HLA-A24), and HLA-B*40:01 (HLA-B40) and subsequently successfully isolated T cells with Jchain specific recognition of peptides presented in HLA-A1, HLA-A3, HLA-A11 and HLA-A24. To allow identification of potent T cells recognizing non-mutated self-peptides the inventors performed a method previously described by their group(10, 11). In this method, the inventors made use of the immunogenicity of peptides in the HLA mismatched setting, for which the inventors previously demonstrated that healthy donor PBMCs from individuals not expressing a certain HLA allele can be used to isolate potent T cell clones recognizing non-mutated peptide sequences presented in this HLA allele(10, 11). Upon sequencing and transfer of TCRs from selected T cell clones, in vitro as well as in vivo killing of MM was demonstrated without off-target effects, demonstrating that the TCRs identified herein are useful in the treatment of B cell associated diseases or conditions such as MM.
In a previous study, the inventors identified target genes for cellular therapy of B cell malignancies using Illumina microarray data(11, 19). The Jchain, was identified as a promising target since it was highly expressed in 4 out of 5 patient derived MM samples and expression in healthy tissues of non-B cell origin was absent (
bHLA-I origin of peptides according to binding motifs of netMHC4.0 and HLA typing of MM cells from which peptides were eluted.
cAffinity (nM) for the respective HLA-I allele according to NetMHC 4.0. SB: strong binder, WB: weak binder, <WB: less than weak binder.
dUM9 cells (HLA-A1, -A11, -B7, -B35, -C3 and -C7 positive) were HLA-A2 transduced and U266 cells (HLA-A2, -A3, -B7, -B40, -C3 and -C7 positive) were HLA-A24 transduced.
Jchain Specific T Cell Clones Identified from HLA Mismatched Healthy Donors
As previously described, pHLA-tetramers were used to isolate T cells from healthy donors which were selected to not express target HLA molecules(11). PBMCs, isolated from complete buffy coats, were incubated with PE-labelled pHLA-tetramers (Table 1) and PE magnetic activated cell sorting (MACS) was performed, after which pHLA-tetramerpos, CD8pos T cells were single-cell sorted and clonally expanded. In total, 26 buffy coats were used, each pHLA-tetramer was included between 4 and 19 times for T cell isolations (data not shown). Overall, 29000 pHLA-tetramerpos, CD8pos T cells were sorted of which 17000 T cell clones expanded. For expanded clones, peptide specificity followed by recognition of endogenous antigen was tested in high throughput screenings (data not shown). Previous studies demonstrated that recognition of target gene transduced cells is a good predictor for recognition of malignant cells naturally expressing the target antigen. Therefore, the 10 most potent T cell clones were selected based on recognition of Jchain Td K562 cells, these T cell clones were specific for Jchain peptides YTAVVPLVY (SEQ ID NO:99) in HLA-A1 (YTA-A1), CYTAVVPLV (SEQ ID NO:100) (CYT-A24), and ISDPTSPLRTR (SEQ ID NO:101) in HLA-A3 (ISD-A3) as well as in HLA-A11 (ISD-A11) (
Target-peptide specific recognition was demonstrated in peptide titration experiments. Jchain negative K562 cells were separately transduced with HLA alleles and loaded with decreasing concentration of Jchain peptides. All selected T cell clones exhibited peptide specific recognition, defined by IFN-γ production after overnight co-culture, which decreased with lower peptide concentrations while unloaded K562 cells were not recognized (
Safety Screenings Revealed T Cell Clones with Restricted On-Target Recognition
As TCRs have the intrinsic characteristics that pHLA recognition is often promiscuous, Jchain pHLA specificity of selected T cell clones was investigated to identify potentially harmful cross-reactivities. To uncover cross-reactivity with peptides presented in other HLA alleles, T cell clones were stimulated with a panel of EBV transformed lymphoblastoid cell lines (EBV-LCLs) designed, per HLA restriction, to express as many common HLA-I alleles as possible without expressing the target restriction allele (
In short, Jchain HLA-A1 specific T cell clones did not recognize any EBV-LCLs or cell lines of the non-B cell panel, demonstrating absence of general cross-reactivity for these T cell clones (
To determine the relevance of the identified T cell clones, recognition and killing of the Jchain expressing MM cell line U266 was investigated. Jchain expression in U266 cells was approximately 10-fold higher than expression of HKGs as measured by qPCR (
TCR Sequencing and Transfer Installs Jchain Specific Effector Functions onto CD8 T Cells
To investigate the potential for TCR gene transfer, TCRs of selected T cell clones were sequenced and retrovirally introduced in CD4 and CD8 T cells. For all four TCRs, introduction into healthy donor CD4pos and CD8pos T cells followed by TCR enrichment resulted in TCR cell surface expression as demonstrated by pH LA-tetramer binding (
Peptide titration experiments and recognition of endogenously processed and presented antigen using Jchain Td K562 cells demonstrated functionality of all Jchain TCR Td CD8 T cells (
To assess off-tumor targeting, Jchain TCR Td T cells were co-cultured with healthy subsets of hematopoietic and non-hematopoietic origin. For each subset, cells from multiple donors positive and negative for HLA-A1, -A3, -A11 and -A24 were included. Jchain TCR Td CD8 T cells did not recognize primary immature dendritic cells (DCs), mature DCs, PHA activated T cells, fibroblasts, and keratinocytes (
To gain further insight in the relevance of Jchain targeting TCR gene therapy for patients with MM, in vitro lysis of target HLA positive patient derived MM bone marrow (BM) samples was investigated. Due to the low percentage of malignant cells in the BM samples, FACS based killing assays were performed. MM cells were visualized using a marker panel staining for CD3, CD19, CD56, CD38 and CD45. MM cells were defined as CD3neg, CD19neg, CD45low, CD56pos and CD38pos. Bone marrow samples were incubated overnight with CD8 T cells Td with HLA-A1, -24, -A3 or -All Jchain targeting TCRs or an irrelevant CMV TCR as negative control. Jchain expression in sorted MM cells ranged from 0.03 to 416 times relative to housekeeping genes, 2 out of 8 MM samples were negative for Jchain (<0.1 relative to HKG) (
Finally, the inventors investigated in vivo targeting of MM cells using a previously established U266 xenograft model. NSG mice were separately inoculated with HLA-A1 Td, HLA-A24 Td, HLA-All Td or WT HLA-A3pos U266 cells 3 weeks prior to T cell infusion to generate a model for established MM.
Subsequently, mice were infused with 3-6×10{circumflex over ( )}6 Jchain HLA-A1, -A24, -A3 or -A11 TCR or irrelevant CMV TCR Td CD8 T cells after which tumor outgrowth was followed measuring bioluminescence at regular intervals (
In addition to T cell clones 5D12.9 and 4G8.8, a third promising YTA-A1 Jchain specific T cell clone was identified. For practical reasons, the recognition profile of this T cell clone was less extensively studied. Nevertheless, safety as well as potency screenings demonstrated comparable and promising functionality. T cell clone 13F6.6 did not demonstrate any off-target recognition while Jchain Td K562 cells were highly recognized (
The reader's attention is directed to all papers and documents which are filed concurrently with or previous to this specification in connection with this application and which are open to public inspection with this specification, and the contents of all such papers and documents are incorporated herein by reference.
All of the features disclosed in this specification (including any accompanying claims, abstract and drawings), and/or all of the steps of any method or process so disclosed, may be combined in any combination, except combinations where at least some of such features and/or steps are mutually exclusive.
Each feature disclosed in this specification (including any accompanying claims, abstract and drawings), may be replaced by alternative features serving the same, equivalent, or similar purpose, unless expressly stated otherwise. Thus, unless expressly stated otherwise, each feature disclosed is one example only of a generic series of equivalent or similar features.
The invention is not restricted to the details of any foregoing embodiments. The invention extends to any novel one, or any novel combination, of the features disclosed in this specification (including any accompanying claims, abstract and drawings), or to any novel one, or any novel combination, of the steps of any method or process so disclosed.
Number | Date | Country | Kind |
---|---|---|---|
2030990 | Feb 2022 | NL | national |
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/NL2023/050073 | 2/16/2023 | WO |