The instant application contains a Sequence Listing which is being submitted in ASCII format via EFS-WEB and is hereby incorporated by reference in its entirety. Said ASCII copy, created on Aug. 7, 2020, is named P20411029USSEQ.txt and is 104 kb.
The present invention relates to the field of biomedicine, in particular to an EGFRvIII antibody and a conjugate thereof, a preparation method therefor and the use thereof.
Human epidermal growth factor receptor (EGFR, also known as her-1 or Erb-B1) is a 170-kDa transmembrane protein receptor encoded by proto-oncogene c-erbB, and has a tyrosine kinase activity in the intramembrane region (Modjtahedi et al., Br. J. Cancer 73: 228-235, 1996; Herbst and Shin, Cancer 94: 1593-1611, 2002). The full-length sequence of EGFR has the entry number P00533 in SwissProt database. Through tyrosine kinase-mediated signal transduction pathways, EGFR regulates a variety of cell physiological processes, mainly including cell proliferation and differentiation, cell survival and apoptosis, angiogenesis, and mitosis and cell transfer (Atalay et al., Ann. Oncology 14: 1346-1363, 2003; Tsao and Herbst, Signal 4: 4-9, 2003; Herbst and Shin, Cancer 94: 1593-1611, 2002; Modjtahedi et al., Br. J. Cancer 73: 228-235, 1996).
Ligands of EGFR include EGF, TGFA/TGF-alpha, amphiregulin, epigen/EPGN, BTC/betacellulin, epiregulin/EREG and HBEGF/heparin-binding EGF. Receptor-ligand binding would trigger EGFR to form a homodimer or heterodimer, thereby resulting in autophosphorylation in the intracellular region and further activating the complex downstream signaling cascade reaction, which mainly includes the following signaling pathways: RAS-RAF-MEK-ERK signaling pathway, phosphatidylinositol 3-kinase (PI3K) signaling pathway, PLC gamma-PKC signaling pathway and STATs modules signaling pathway.
Overexpression of EGFR has been found in a variety of tumors, including bladder cancer, brain cancer, head and neck cancer, pancreatic cancer, lung cancer, breast cancer, ovarian cancer, colon cancer, prostate cancer, renal cancer, etc. (Atalay et al., Ann. Oncology 14: 1346-1363, 2003; Herbst and Shin, Cancer 94: 1593-1611, 2002; and Modjtahedi et al., Br. J. Cancer 73: 228-235, 1996). In many cases, overexpression of EGFR is associated with poor prognosis of patients (Herbst and Shin, Cancer 94: 1593-1611, 2002; Modjtahedi et al., Br. J. Cancer 73: 228-235, 1996). EGFR is also expressed in normal tissues and has relatively high expression levels in epithelial tissues of skin, liver and gastrointestinal tract; however, its expression level in normal tissues is much lower than that in tumor tissues (Herbst and Shin, Cancer 94: 1593-1611, 2002).
EGFR gene amplification and mutation can be detected simultaneously in a large portion of tumor samples. EGFRvIII is one of the mutant types, and is also known as de2-7 EGFR, ΔEGFR, or Δ2-7 (Olapade-Olaopa et al., Br. J. Cancer. 82, 186-94, 2000). In a mature EGFRvIII mRNA, 801 nucleotides of exons 2-7 are deleted; correspondingly, in an EGFRvIII protein, 267 amino acids (6-273) are deleted and a glycine residue is inserted, forming an unique linker peptide (Wong et al., Proc. Natl. Acad. Sci. U.S.A. 89, 2965-9, 1992; Yamazaki et al., Jpn. J. Cancer Res. 81, 773-9, 1990; Yamazaki et al., Mol. Cell. Biol. 8, 1816-20, 1988; and Sugawa et al., Proc. Natl. Acad. Sci. U.S.A. 87, 8602-6, 1990).
Expression of EGFRvIII has been reported in various tumor types, including glioma, breast cancer, lung cancer, ovarian cancer and prostate cancer (Wikstrand et al., Cancer Res. 57, 4130-40, 1997; Olapade-Olaopa et al., Br. J. Cancer. 82, 186-94, 2000; Wikstrand, et al., Cancer Res. 55, 3140-8, 1995; Garcia de Palazzo et al., Cancer Res. 53, 3217-20, 1993). Although EGFRvIII cannot bind to a ligand, it is in a continuous low activation state. In a nude mouse tumor-xenograft model, the expression of EGFRvIII can significantly promote the growth of glioma cells (Nishikawa et al., Proc. Natl. Acad. Sci. U.S.A. 91, 7727-31, 1994). In addition, the expression of EGFRvIII can transform NIH3T3 cells and MCF-7 cells to produce carcinogenicity (Batra et al., Cell Growth Differ. 6, 1251-9, 1995). The mechanism of action of EGFRvIII in gliomas is not completely clear; however, according to existing reports, EGFRvIII can reduce the apoptosis of glioma cells and slightly increase the proliferation of glioma cells (Nagane et al., Cancer Res. 56, 5079-86, 1996). EGFRvIII is specifically expressed in tumor tissues but not in normal tissues, so it is a highly specific target in antibody therapies.
Monoclonal antibodies have been developed into new therapeutic drugs due to the targeting property, specificity, selectivity, high affinity, and other advantages. However, early clinical trials have revealed that the use of non-human monoclonal antibodies in human bodies often results in severe immune responses due to human anti-mouse antibody (HAMA) and human anti-rat antibody (HARA) responses, and therefore, the non-human monoclonal antibodies are quickly cleared. Subsequently, the antibodies having a low immunogenicity are developed, including chimeric antibodies, humanized antibodies, and complete human antibodies. According to the degree of humanization, therapeutic monoclonal antibody drugs can be divided into 4 types: murine antibody (having no humanized amino acid sequence), chimeric antibody (having 60% to 70% of humanized amino acid sequences), CDR-grafted antibody (having 90% to 95% of humanized amino acid sequences), and complete human antibody (having 100% of humanized amino acid sequences). As the degree of humanization increases, non-murine monoclonal antibodies may alleviate human anti-murine antibody responses (HAMA and HARA responses) during the treatment in human body, gradually eliminate the issue of causing immunogenicity by heterologous antibodies, and improve the pharmacokinetics of the antibodies while maintaining a high affinity for antigens; clinically, these antibody drugs have been widely used for targeted therapies.
An antibody-drug conjugate (ADC) is a new biotherapy method, which produces a specific killing effect on tumor cells by linking a biologically active small-molecule drug to a monoclonal antibody via a chemical linker, wherein the monoclonal antibody is acted as a carrier for targeting transport of the small-molecule drug to the surface of target tumor cells, and the small-molecule drug is released after the ADC is endocytosed by the tumor cells.
In the past 100 years, antibody-based immunotherapies and chemical drug-based chemotherapies have been the two major clinical strategies for cancer treatment. Antibodies target antigens that are overexpressed on tumor cells, and various therapeutic monoclonal antibodies have achieved great success clinically. In clinical practice, although therapeutic antibodies have a good targeting, but the killing effect thereof is limited. Although small-molecule chemical drugs have an efficient killing effect on cancer cells, they also cause the same damage to non-cancer cells. Therefore, the antibody drugs and small-molecule drugs have their own clinical limitations, which puts forward new requirements for drug research and development. A new generation of antibody-drug conjugates uses the specific binding ability of antibodies to target cells to deliver highly cytotoxic chemical drugs, thereby achieving targeted and efficient killing effect on cancer cells. With the emergence of new chemical linking techniques, antibody-drug conjugates began to enter clinical research in the late 1980s, and currently, there are 4 ADC drugs approved by the FDA for marketing.
The development of the ADC drugs involves several aspects such as screening of drug targets, preparation of recombinant antibodies, development of linker techniques, and screening and optimization of highly cytotoxic compounds. EGFRvIII is merely expressed in tumor tissues and not in normal tissues, so it is a highly specific target in antibody therapies.
The technical problem to be solved in the present invention is for overcoming the existing deficiency of lacking an EGFRvIII antibody, thus providing an EGFRvIII antibody with a high affinity and a strong specificity, and a preparation method therefor and the use thereof. The EGFRvIII antibody has a high affinity with an EGFRvIII protein, and can enter cells after coupling with small molecule toxins such as MMAF and has a cytotoxic killing effect on EGFRvIII positive cells. The antibody therefore can be used for preparing drugs for treating tumors and other diseases.
The inventors take the EGFRvIII protein or a recombinant cell line overexpressing the EGFRvIII protein as an immunogen, and use a traditional hybridoma preparation technique. This technique was established by Kohler and Milstein 40 years ago (Kohler and Milstein1975, Nature 256: 495). Through a series of adjustments and improvements, a leading antibody of the EGFRvIII antibody is obtained. Then through preliminary production, purification and detection and identification of the leading antibody, the EGFRvIII antibody having excellent biological properties, such as a high antibody affinity and a cytotoxic killing effect on EGFRvIII positive cells produced by entering the cells after coupling with small-molecule toxin MMAF, is obtained. Subsequently, the amino acid sequence of the heavy chain variable region of the EGFRvIII antibody and the light chain variable region of the EGFRvIII antibody are obtained by means of sequencing using molecular biological methods.
In the art, the binding regions of antibodies or antigens contain a light chain variable region and a heavy chain variable region, each variable region containing three domains: CDR1, CDR2, and CDR3.
The present invention provides an EGFRvIII antibody, which comprises complementarity determining regions (CDRs): one or more of heavy chain CDR1, heavy chain CDR2 and heavy chain CDR3, and/or one or more of light chain CDR1, light chain CDR2 and light chain CDR3, wherein the amino acid sequence of the heavy chain CDR1 is as shown in SEQ ID No. 2, SEQ ID No. 10, SEQ ID No. 18, SEQ ID No. 26, SEQ ID No. 34, SEQ ID No. 42, SEQ ID No. 50, SEQ ID No. 58, SEQ ID No. 66, SEQ ID No. 74, SEQ ID No. 82, SEQ ID No. 90 or SEQ ID No. 98 in the sequence listing; the amino acid sequence of the heavy chain CDR2 is as shown in SEQ ID No. 3, SEQ ID No. 11, SEQ ID No. 19, SEQ ID No. 27, SEQ ID No. 35, SEQ ID No. 43, SEQ ID No. 51, SEQ ID No. 59, SEQ ID No. 67, SEQ ID No. 75, SEQ ID No. 83, SEQ ID No. 91, SEQ ID No. 99, SEQ ID NO. 184 or SEQ ID NO. 186 in the sequence listing; the amino acid sequence of the heavy chain CDR3 is as shown in SEQ ID No. 4, SEQ ID No. 12, SEQ ID No. 20, SEQ ID No. 28, SEQ ID No. 36, SEQ ID No. 44, SEQ ID No. 52, SEQ ID No. 60, SEQ ID No. 68, SEQ ID No. 76, SEQ ID No. 84, SEQ ID No. 92 or SEQ ID No. 100 in the sequence listing; the amino acid sequence of the light chain CDR1 is as shown in SEQ ID No. 6, SEQ ID No. 14, SEQ ID No. 22, SEQ ID No. 30, SEQ ID No. 38, SEQ ID No. 46, SEQ ID No. 54, SEQ ID No. 62, SEQ ID No. 70, SEQ ID No. 78, SEQ ID No. 86, SEQ ID No. 94 or SEQ ID No. 102 in the sequence listing; the amino acid sequence of the light chain CDR2 is as shown in SEQ ID No. 7, SEQ ID No. 15, SEQ ID No. 23, SEQ ID No. 31, SEQ ID No. 39, SEQ ID No. 47, SEQ ID No. 55, SEQ ID No. 63, SEQ ID No. 71, SEQ ID No. 79, SEQ ID No. 87, SEQ ID No. 95 or SEQ ID No. 103 in the sequence listing; the amino acid sequence of the light chain CDR3 is as shown in SEQ ID No. 8, SEQ ID No. 16, SEQ ID No. 24, SEQ ID No. 32, SEQ ID No. 40, SEQ ID No. 48, SEQ ID No. 56, SEQ ID No. 64, SEQ ID No. 72 or SEQ ID No. 80, SEQ ID No. 88, SEQ ID No. 96 or SEQ ID No. 104 in the sequence listing;
or the amino acid sequence of the heavy chain CDR1 is as shown in an amino acid sequence which has at least 80% sequence homology with the amino acid sequence as shown in SEQ ID No. 2, SEQ ID No. 10, SEQ ID No. 18, SEQ ID No. 26, SEQ ID No. 34, SEQ ID No. 42, SEQ ID No. 50, SEQ ID No. 58, SEQ ID No. 66, SEQ ID No. 74, SEQ ID No. 82, SEQ ID No. 90 and SEQ ID No. 98 in the sequence listing; the amino acid sequence of the heavy chain CDR2 is as shown in an amino acid sequence which has at least 80% sequence homology with the amino acid sequence as shown in SEQ ID No. 3, SEQ ID No. 11, SEQ ID No. 19, SEQ ID No. 27, SEQ ID No. 35, SEQ ID No. 43, SEQ ID No. 51, SEQ ID No. 59, SEQ ID No. 67, SEQ ID No. 75, SEQ ID No. 83, SEQ ID No. 91, SEQ ID No. 99, SEQ ID NO. 184 or SEQ ID NO. 186 in the sequence listing; the amino acid sequence of the heavy chain CDR3 is as shown in an amino acid sequence which has at least 80% sequence homology with the amino acid sequence as shown in SEQ ID No. 4, SEQ ID No. 12, SEQ ID No. 20, SEQ ID No. 28, SEQ ID No. 36, SEQ ID No. 44, SEQ ID No. 52, SEQ ID No. 60, SEQ ID No. 68, SEQ ID No. 76, SEQ ID No. 84, SEQ ID No. 92 and SEQ ID No. 100 in the sequence listing; the amino acid sequence of the light chain CDR1 is as shown in an amino acid sequence which has at least 80% sequence homology with the amino acid sequence as shown in SEQ ID No. 6, SEQ ID No. 14, SEQ ID No. 22, SEQ ID No. 30, SEQ ID No. 38, SEQ ID No. 46, SEQ ID No. 54, SEQ ID No. 62, SEQ ID No. 70, SEQ ID No. 78, SEQ ID No. 86, SEQ ID No. 94 and SEQ ID No. 102 in the sequence listing; the amino acid sequence of the light chain CDR2 is as shown in an amino acid sequence which has at least 80% sequence homology with the amino acid sequence as shown in SEQ ID No. 7, SEQ ID No. 15, SEQ ID No. 23, SEQ ID No. 31, SEQ ID No. 39, SEQ ID No. 47, SEQ ID No. 55, SEQ ID No. 63, SEQ ID No. 71, SEQ ID No. 79, SEQ ID No. 87, SEQ ID No. 95 and SEQ ID No. 103 in the sequence listing; the amino acid sequence of the light chain CDR3 is as shown in an amino acid sequence which has at least 80% sequence homology with the amino acid sequence as shown in SEQ ID No. 8, SEQ ID No. 16, SEQ ID No. 24, SEQ ID No. 32, SEQ ID No. 40, SEQ ID No. 48, SEQ ID No. 56, SEQ ID No. 64, SEQ ID No. 72, SEQ ID No. 80, SEQ ID No. 88, SEQ ID No. 96 and SEQ ID No. 104 in the sequence listing; preferably, the amino acid sequence of the heavy chain CDR1 is as shown in an amino acid sequence which has at least 90% sequence homology with the amino acid sequence as shown in SEQ ID No. 2, SEQ ID No. 10, SEQ ID No. 18, SEQ ID No. 26, SEQ ID No. 34, SEQ ID No. 42, SEQ ID No. 50, SEQ ID No. 58, SEQ ID No. 66, SEQ ID No. 74, SEQ ID No. 82, SEQ ID No. 90 and SEQ ID No. 98 in the sequence listing; the amino acid sequence of the heavy chain CDR2 is as shown in an amino acid sequence which has at least 90% sequence homology with the amino acid sequence as shown in SEQ ID No. 3, SEQ ID No. 11, SEQ ID No. 19, SEQ ID No. 27, SEQ ID No. 35, SEQ ID No. 43, SEQ ID No. 51, SEQ ID No. 59, SEQ ID No. 67, SEQ ID No. 75, SEQ ID No. 83, SEQ ID No. 91, SEQ ID No. 99, SEQ ID NO. 184 or SEQ ID NO. 186 in the sequence listing; the amino acid sequence of the heavy chain CDR3 is as shown in an amino acid sequence which has at least 90% sequence homology with the amino acid sequence as shown in SEQ ID No. 4, SEQ ID No. 12, SEQ ID No. 20, SEQ ID No. 28, SEQ ID No. 36, SEQ ID No. 44, SEQ ID No. 52, SEQ ID No. 60, SEQ ID No. 68, SEQ ID No. 76, SEQ ID No. 84, SEQ ID No. 92 and SEQ ID No. 100 in the sequence listing; the amino acid sequence of the light chain CDR1 is as shown in an amino acid sequence which has at least 90% sequence homology with the amino acid sequence as shown in SEQ ID No. 6, SEQ ID No. 14, SEQ ID No. 22, SEQ ID No. 30, SEQ ID No. 38, SEQ ID No. 46, SEQ ID No. 54, SEQ ID No. 62, SEQ ID No. 70, SEQ ID No. 78, SEQ ID No. 86, SEQ ID No. 94 and SEQ ID No. 102 in the sequence listing; the amino acid sequence of the light chain CDR2 is as shown in an amino acid sequence which has at least 90% sequence homology with the amino acid sequence as shown in SEQ ID No. 7, SEQ ID No. 15, SEQ ID No. 23, SEQ ID No. 31, SEQ ID No. 39, SEQ ID No. 47, SEQ ID No. 55, SEQ ID No. 63, SEQ ID No. 71, SEQ ID No. 79, SEQ ID No. 87, SEQ ID No. 95 and SEQ ID No. 103 in the sequence listing; the amino acid sequence of the light chain CDR3 is as shown in an amino acid sequence which has at least 90% sequence homology with the amino acid sequence as shown in SEQ ID No. 8, SEQ ID No. 16, SEQ ID No. 24, SEQ ID No. 32, SEQ ID No. 40, SEQ ID No. 48, SEQ ID No. 56, SEQ ID No. 64, SEQ ID No. 72, SEQ ID No. 80, SEQ ID No. 88, SEQ ID No. 96 and SEQ ID No. 104 in the sequence listing; more preferably, the amino acid sequence of the heavy chain CDR1 is as shown in an amino acid sequence which has at least 95% sequence homology with the amino acid sequence as shown in SEQ ID No. 2, SEQ ID No. 10, SEQ ID No. 18, SEQ ID No. 26, SEQ ID No. 34, SEQ ID No. 42, SEQ ID No. 50, SEQ ID No. 58, SEQ ID No. 66, SEQ ID No. 74, SEQ ID No. 82, SEQ ID No. 90 and SEQ ID No. 98 in the sequence listing; the amino acid sequence of the heavy chain CDR2 is as shown in an amino acid sequence which has at least 95% sequence homology with the amino acid sequence as shown in SEQ ID No. 3, SEQ ID No. 11, SEQ ID No. 19, SEQ ID No. 27, SEQ ID No. 35, SEQ ID No. 43, SEQ ID No. 51, SEQ ID No. 59, SEQ ID No. 67, SEQ ID No. 75, SEQ ID No. 83, SEQ ID No. 91, SEQ ID No. 99, SEQ ID NO. 184 or SEQ ID NO. 186 in the sequence listing; the amino acid sequence of the heavy chain CDR3 is as shown in an amino acid sequence which has at least 95% sequence homology with the amino acid sequence as shown in SEQ ID No. 4, SEQ ID No. 12, SEQ ID No. 20, SEQ ID No. 28, SEQ ID No. 36, SEQ ID No. 44, SEQ ID No. 52, SEQ ID No. 60, SEQ ID No. 68, SEQ ID No. 76, SEQ ID No. 84, SEQ ID No. 92 and SEQ ID No. 100 in the sequence listing; the amino acid sequence of the light chain CDR1 is as shown in an amino acid sequence which has at least 95% sequence homology with the amino acid sequence as shown in SEQ ID No. 6, SEQ ID No. 14, SEQ ID No. 22, SEQ ID No. 30, SEQ ID No. 38, SEQ ID No. 46, SEQ ID No. 54, SEQ ID No. 62, SEQ ID No. 70, SEQ ID No. 78, SEQ ID No. 86, SEQ ID No. 94 and SEQ ID No. 102 in the sequence listing; the amino acid sequence of the light chain CDR2 is as shown in an amino acid sequence which has at least 95% sequence homology with the amino acid sequence as shown in SEQ ID No. 7, SEQ ID No. 15, SEQ ID No. 23, SEQ ID No. 31, SEQ ID No. 39, SEQ ID No. 47, SEQ ID No. 55, SEQ ID No. 63, SEQ ID No. 71, SEQ ID No. 79, SEQ ID No. 87, SEQ ID No. 95 and SEQ ID No. 103 in the sequence listing; the amino acid sequence of the light chain CDR3 is as shown in an amino acid sequence which has at least 95% sequence homology with the amino acid sequence as shown in SEQ ID No. 8, SEQ ID No. 16, SEQ ID No. 24, SEQ ID No. 32, SEQ ID No. 40, SEQ ID No. 48, SEQ ID No. 56, SEQ ID No. 64, SEQ ID No. 72, SEQ ID No. 80, SEQ ID No. 88, SEQ ID No. 96 and SEQ ID No. 104 in the sequence listing; further more preferably, the amino acid sequence of the heavy chain CDR1 is as shown in an amino acid sequence which has at least 99% sequence homology with the amino acid sequence as shown in SEQ ID No. 2, SEQ ID No. 10, SEQ ID No. 18, SEQ ID No. 26, SEQ ID No. 34, SEQ ID No. 42, SEQ ID No. 50, SEQ ID No. 58, SEQ ID No. 66, SEQ ID No. 74, SEQ ID No. 82, SEQ ID No. 90 and SEQ ID No. 98 in the sequence listing; the amino acid sequence of the heavy chain CDR2 is as shown in an amino acid sequence which has at least 99% sequence homology with the amino acid sequence as shown in SEQ ID No. 3, SEQ ID No. 11, SEQ ID No. 19, SEQ ID No. 27, SEQ ID No. 35, SEQ ID No. 43, SEQ ID No. 51, SEQ ID No. 59, SEQ ID No. 67, SEQ ID No. 75, SEQ ID No. 83, SEQ ID No. 91, SEQ ID No. 99, SEQ ID NO. 184 or SEQ ID NO. 186 in the sequence listing; the amino acid sequence of the heavy chain CDR3 is as shown in an amino acid sequence which has at least 99% sequence homology with the amino acid sequence as shown in SEQ ID No. 4, SEQ ID No. 12, SEQ ID No. 20, SEQ ID No. 28, SEQ ID No. 36, SEQ ID No. 44, SEQ ID No. 52, SEQ ID No. 60, SEQ ID No. 68, SEQ ID No. 76, SEQ ID No. 84, SEQ ID No. 92 and SEQ ID No. 100 in the sequence listing; the amino acid sequence of the light chain CDR1 is as shown in an amino acid sequence which has at least 99% sequence homology with the amino acid sequence as shown in SEQ ID No. 6, SEQ ID No. 14, SEQ ID No. 22, SEQ ID No. 30, SEQ ID No. 38, SEQ ID No. 46, SEQ ID No. 54, SEQ ID No. 62, SEQ ID No. 70, SEQ ID No. 78, SEQ ID No. 86, SEQ ID No. 94 and SEQ ID No. 102 in the sequence listing; the amino acid sequence of the light chain CDR2 is as shown in an amino acid sequence which has at least 99% sequence homology with the amino acid sequence as shown in SEQ ID No. 7, SEQ ID No. 15, SEQ ID No. 23, SEQ ID No. 31, SEQ ID No. 39, SEQ ID No. 47, SEQ ID No. 55, SEQ ID No. 63, SEQ ID No. 71, SEQ ID No. 79, SEQ ID No. 87, SEQ ID No. 95 and SEQ ID No. 103 in the sequence listing; the amino acid sequence of the light chain CDR3 is as shown in an amino acid sequence which has at least 99% sequence homology with the amino acid sequence as shown in SEQ ID No. 8, SEQ ID No. 16, SEQ ID No. 24, SEQ ID No. 32, SEQ ID No. 40, SEQ ID No. 48, SEQ ID No. 56, SEQ ID No. 64, SEQ ID No. 72, SEQ ID No. 80, SEQ ID No. 88, SEQ ID No. 96 and SEQ ID No. 104 in the sequence listing.
It should be noted that although the antibodies used in antibody drugs are monoclonal antibodies obtained from the clones of single antibody-producing cells, the antibodies are biological macromolecules with a very complicated structure. Therefore, various post-translational modifications and degradation reactions occur during production, transportation, storage, and in vivo use, such as N-terminal cyclization, glycosylation, deamidation, isomerization, oxidation, fragmentation, and disulfide bond mismatch. These quality attributes may affect the antibody stability, biological activity and bioavailability of final products. Therefore, it is very important to control the stability and consistency of the product quality. The amino acid sequences having “at least 80% (or 90%, 95%, 99%) sequence homology” described in the present invention are obtained by performing insertion, deletion or substitution on the amino acid sequences shown in the preceding sequences, thereby overcoming the above-mentioned problem of possible instability. For example, the computer structure simulation analysis is performed on the sequences, and the site analysis is performed on potential post-translational modifications (PTMs), especially in CDR regions, including analysis and substitution on antibody aggregation, asparagine deamidation sensitive sites (NG, NS, NH, etc.), aspartic acid isomerization sensitive sites (DG and DP), N glycosylation sensitive site (N-{P}S/T) and oxidation sensitive sites.
Among them, deamidation refers to the reaction of converting the amide on the side chain of asparagine and aspartic acid into carboxylic acid; although the deamidation speed of glutamine is usually one-tenth that of asparagine, the mechanism is the same. Isomerization means that the carboxyl group located on the side chain of asparagine and aspartic acid is attacked by a nitrogen atom electron pair located on the residues of the C-terminal side, resulting in deamidation of asparagine or dehydration of aspartic acid to form an unstable cyclic imide intermediate, wherein through cleavage, most of the intermediate becomes isoaspartic acid, and the remaining part becomes aspartic acid. In addition, it has been reported that the deamidation reaction is particularly likely to occur at the adjacent site (Asn-Gly) of asparagine and glycine (Geiger et al., (J. Biol. Chem. 1987, 262: 785-794)). It has also been reported that aspartic acid undergoes a peptide chain breakage through a hydrolysis reaction, and especially the sequence (Asp-Pro) containing proline at the C-terminal side is susceptible to decomposition under acidic conditions (Segalas et al., FEBS Letters, 1995, 371: 171-175). Deamidation and isomerization reactions that occur during storage of these antibodies and proteins become the cause of product heterogeneity, so it is desired to be controlled as much as possible. In order to suppress deamidation, isomerization and hydrolysis, amino acid modifications of removing glutamyl residues and aspartoyl residues that are used as sites undergoing deamidation can be appropriately performed. As an non-limiting scheme of removing particularly effective deamidation sites, it is preferred to list sites promoting the deamidation reaction; i.e., glycine residues, asparagine residues, or glutamine residues in motifs represented by NG and QG sequences, wherein the substitution of any of the amino acid residues (N, Q or G) can significantly inhibit the deamidation reaction (WO 2003/057881, WO 2005/067620, etc.). In addition, by adjusting the method for culturing antibody-producing cells or by appropriately using a deamidation reaction, a method for inhibiting antibodies can be obtained.
Preferably, the amino acid sequence of the heavy chain CDR1 is as shown in SEQ ID No. 2 in the sequence listing, the amino acid sequence of the heavy chain CDR2 is as shown in SEQ ID No. 3 in the sequence listing, and the amino acid sequence of the heavy chain CDR3 is as shown in SEQ ID No. 4 in the sequence listing; the amino acid sequence of the heavy chain CDR1 is as shown in SEQ ID No. 10 in the sequence listing, the amino acid sequence of the heavy chain CDR2 is as shown in SEQ ID No. 11 in the sequence listing, and the amino acid sequence of the heavy chain CDR3 is as shown in SEQ ID No. 12 in the sequence listing; the amino acid sequence of the heavy chain CDR1 is as shown in SEQ ID No. 18 in the sequence listing, the amino acid sequence of the heavy chain CDR2 is as shown in SEQ ID No. 19 in the sequence listing, and the amino acid sequence of the heavy chain CDR3 is as shown in SEQ ID No. 20 in the sequence listing; the amino acid sequence of the heavy chain CDR1 is as shown in SEQ ID No. 26 in the sequence listing, the amino acid sequence of the heavy chain CDR2 is as shown in SEQ ID No. 27 in the sequence listing, and the amino acid sequence of the heavy chain CDR3 is as shown in SEQ ID No. 28 in the sequence listing; the amino acid sequence of the heavy chain CDR1 is as shown in SEQ ID No. 34 in the sequence listing, the amino acid sequence of the heavy chain CDR2 is as shown in SEQ ID No. 35 in the sequence listing, and the amino acid sequence of the heavy chain CDR3 is as shown in SEQ ID No. 36 in the sequence listing; the amino acid sequence of the heavy chain CDR1 is as shown in SEQ ID No. 42 in the sequence listing, the amino acid sequence of the heavy chain CDR2 is as shown in SEQ ID No. 43 in the sequence listing, and the amino acid sequence of the heavy chain CDR3 is as shown in SEQ ID No. 44 in the sequence listing; the amino acid sequence of the heavy chain CDR1 is as shown in SEQ ID No. 50 in the sequence listing, the amino acid sequence of the heavy chain CDR2 is as shown in SEQ ID No. 51 in the sequence listing, and the amino acid sequence of the heavy chain CDR3 is as shown in SEQ ID No. 52 in the sequence listing; the amino acid sequence of the heavy chain CDR1 is as shown in SEQ ID No. 58 in the sequence listing, the amino acid sequence of the heavy chain CDR2 is as shown in SEQ ID No. 59 in the sequence listing, and the amino acid sequence of the heavy chain CDR3 is as shown in SEQ ID No. 60 in the sequence listing; the amino acid sequence of the heavy chain CDR1 is as shown in SEQ ID No. 66 in the sequence listing, the amino acid sequence of the heavy chain CDR2 is as shown in SEQ ID No. 67 in the sequence listing, and the amino acid sequence of the heavy chain CDR3 is as shown in SEQ ID No. 68 in the sequence listing; the amino acid sequence of the heavy chain CDR1 is as shown in SEQ ID No. 74 in the sequence listing, the amino acid sequence of the heavy chain CDR2 is as shown in SEQ ID No. 75 in the sequence listing, and the amino acid sequence of the heavy chain CDR3 is as shown in SEQ ID No. 76 in the sequence listing; the amino acid sequence of the heavy chain CDR1 is as shown in SEQ ID No. 82 in the sequence listing, the amino acid sequence of the heavy chain CDR2 is as shown in SEQ ID No. 83 in the sequence listing, and the amino acid sequence of the heavy chain CDR3 is as shown in SEQ ID No. 84 in the sequence listing; the amino acid sequence of the heavy chain CDR1 is as shown in SEQ ID No. 90 in the sequence listing, the amino acid sequence of the heavy chain CDR2 is as shown in SEQ ID No. 91 in the sequence listing, and the amino acid sequence of the heavy chain CDR3 is as shown in SEQ ID No. 92 in the sequence listing; the amino acid sequence of the heavy chain CDR1 is as shown in SEQ ID No. 98 in the sequence listing, the amino acid sequence of the heavy chain CDR2 is as shown in SEQ ID No. 99 in the sequence listing, and the amino acid sequence of the heavy chain CDR3 is as shown in SEQ ID No. 100 in the sequence listing; the amino acid sequence of the heavy chain CDR1 is as shown in SEQ ID No. 2 in the sequence listing, the amino acid sequence of the heavy chain CDR2 is as shown in SEQ ID No. 184 in the sequence listing, and the amino acid sequence of the heavy chain CDR3 is as shown in SEQ ID No. 4 in the sequence listing; or the amino acid sequence of the heavy chain CDR1 is as shown in SEQ ID No. 2 in the sequence listing, the amino acid sequence of the heavy chain CDR2 is as shown in SEQ ID No. 186 in the sequence listing, and the amino acid sequence of the heavy chain CDR3 is as shown in SEQ ID No. 4 in the sequence listing;
the amino acid sequence of the light chain CDR1 is as shown in SEQ ID No. 6 in the sequence listing, the amino acid sequence of the light chain CDR2 is as shown in SEQ ID No. 7 in the sequence listing, and the amino acid sequence of the light chain CDR3 is as shown in SEQ ID No. 8 in the sequence listing; the amino acid sequence of the light chain CDR1 is as shown in SEQ ID No. 14 in the sequence listing, the amino acid sequence of the light chain CDR2 is as shown in SEQ ID No. 15 in the sequence listing, and the amino acid sequence of the light chain CDR3 is as shown in SEQ ID No. 16 in the sequence listing; or the amino acid sequence of the light chain CDR1 is as shown in SEQ ID No. 22 in the sequence listing, the amino acid sequence of the light chain CDR2 is as shown in SEQ ID No. 23 in the sequence listing, and the amino acid sequence of the light chain CDR3 is as shown in SEQ ID No. 24 in the sequence listing; the amino acid sequence of the light chain CDR1 is as shown in SEQ ID No. 30 in the sequence listing, the amino acid sequence of the light chain CDR2 is as shown in SEQ ID No. 31 in the sequence listing, and the amino acid sequence of the light chain CDR3 is as shown in SEQ ID No. 32 in the sequence listing; the amino acid sequence of the light chain CDR1 is as shown in SEQ ID No. 38 in the sequence listing, the amino acid sequence of the light chain CDR2 is as shown in SEQ ID No. 39 in the sequence listing, and the amino acid sequence of the light chain CDR3 is as shown in SEQ ID No. 40 in the sequence listing; the amino acid sequence of the light chain CDR1 is as shown in SEQ ID No. 46 in the sequence listing, the amino acid sequence of the light chain CDR2 is as shown in SEQ ID No. 47 in the sequence listing, and the amino acid sequence of the light chain CDR3 is as shown in SEQ ID No. 48 in the sequence listing; the amino acid sequence of the light chain CDR1 is as shown in SEQ ID No. 54 in the sequence listing, the amino acid sequence of the light chain CDR2 is as shown in SEQ ID No. 55 in the sequence listing, and the amino acid sequence of the light chain CDR3 is as shown in SEQ ID No. 56 in the sequence listing; the amino acid sequence of the light chain CDR1 is as shown in SEQ ID No. 62 in the sequence listing, the amino acid sequence of the light chain CDR2 is as shown in SEQ ID No. 63 in the sequence listing, and the amino acid sequence of the light chain CDR3 is as shown in SEQ ID No. 64 in the sequence listing; the amino acid sequence of the light chain CDR1 is as shown in SEQ ID No. 70 in the sequence listing, the amino acid sequence of the light chain CDR2 is as shown in SEQ ID No. 71 in the sequence listing, and the amino acid sequence of the light chain CDR3 is as shown in SEQ ID No. 72 in the sequence listing; the amino acid sequence of the light chain CDR1 is as shown in SEQ ID No. 78 in the sequence listing, the amino acid sequence of the light chain CDR2 is as shown in SEQ ID No. 79 in the sequence listing, and the amino acid sequence of the light chain CDR3 is as shown in SEQ ID No. 80 in the sequence listing; the amino acid sequence of the light chain CDR1 is as shown in SEQ ID No. 86 in the sequence listing, the amino acid sequence of the light chain CDR2 is as shown in SEQ ID No. 87 in the sequence listing, and the amino acid sequence of the light chain CDR3 is as shown in SEQ ID No. 88 in the sequence listing; the amino acid sequence of the light chain CDR1 is as shown in SEQ ID No. 94 in the sequence listing, the amino acid sequence of the light chain CDR2 is as shown in SEQ ID No. 95 in the sequence listing, and the amino acid sequence of the light chain CDR3 is as shown in SEQ ID No. 96 in the sequence listing; or the amino acid sequence of the light chain CDR1 is as shown in SEQ ID No. 102 in the sequence listing, the amino acid sequence of the light chain CDR2 is as shown in SEQ ID No. 103 in the sequence listing, and the amino acid sequence of the light chain CDR3 is as shown in SEQ ID No. 104 in the sequence listing.
More preferably, the EGFRvIII antibody comprises a heavy chain variable region and/or a light chain variable region containing the CDRs, wherein the amino acid sequence of the heavy chain variable region is as shown in SEQ ID No. 1, SEQ ID No. 9, SEQ ID No. 17, SEQ ID No. 25, SEQ ID No. 33, SEQ ID No. 41, SEQ ID No. 49, SEQ ID No. 57, SEQ ID No. 65, SEQ ID No. 73, SEQ ID No. 81, SEQ ID No. 89, SEQ ID No. 97, SEQ ID No. 179 or SEQ ID No. 180 in the sequence listing; and the amino acid sequence of the light chain variable region is as shown in SEQ ID No. 5, SEQ ID No. 13, SEQ ID No. 21, SEQ ID No. 29, SEQ ID No. 37, SEQ ID No. 45, SEQ ID No. 53, SEQ ID No. 61, SEQ ID No. 69, SEQ ID No. 77, SEQ ID No. 85, SEQ ID No. 93 or SEQ ID No. 101 in the sequence listing.
Preferably, the amino acid sequence of the heavy chain variable region is as shown in SEQ ID No. 1 in the sequence listing, and the amino acid sequence of the light chain variable region is as shown in SEQ ID No. 5 in the sequence listing; the amino acid sequence of the heavy chain variable region is as shown in SEQ ID No. 9 in the sequence listing, and the amino acid sequence of the light chain variable region is as shown in SEQ ID No. 13 in the sequence listing; the amino acid sequence of the heavy chain variable region is as shown in SEQ ID No. 17 in the sequence listing, and the amino acid sequence of the light chain variable region is as shown in SEQ ID No. 21 in the sequence listing; the amino acid sequence of the heavy chain variable region is as shown in SEQ ID No. 25 in the sequence listing, and the amino acid sequence of the light chain variable region is as shown in SEQ ID No. 29 in the sequence listing; the amino acid sequence of the heavy chain variable region is as shown in SEQ ID No. 33 in the sequence listing, and the amino acid sequence of the light chain variable region is as shown in SEQ ID No. 37 in the sequence listing; the amino acid sequence of the heavy chain variable region is as shown in SEQ ID No. 41 in the sequence listing, and the amino acid sequence of the light chain variable region is as shown in SEQ ID No. 45 in the sequence listing; the amino acid sequence of the heavy chain variable region is as shown in SEQ ID No. 49 in the sequence listing, and the amino acid sequence of the light chain variable region is as shown in SEQ ID No. 53 in the sequence listing; the amino acid sequence of the heavy chain variable region is as shown in SEQ ID No. 57 in the sequence listing, and the amino acid sequence of the light chain variable region is as shown in SEQ ID No. 61 in the sequence listing; the amino acid sequence of the heavy chain variable region is as shown in SEQ ID No. 65 in the sequence listing, and the amino acid sequence of the light chain variable region is as shown in SEQ ID No. 69 in the sequence listing; the amino acid sequence of the heavy chain variable region is as shown in SEQ ID No. 73 in the sequence listing, and the amino acid sequence of the light chain variable region is as shown in SEQ ID No. 77 in the sequence listing; the amino acid sequence of the heavy chain variable region is as shown in SEQ ID No. 81 in the sequence listing, and the amino acid sequence of the light chain variable region is as shown in SEQ ID No. 85 in the sequence listing; the amino acid sequence of the heavy chain variable region is as shown in SEQ ID No. 89 in the sequence listing, and the amino acid sequence of the light chain variable region is as shown in SEQ ID No. 93 in the sequence listing; the amino acid sequence of the heavy chain variable region is as shown in SEQ ID No. 97 in the sequence listing, and the amino acid sequence of the light chain variable region is as shown in SEQ ID No. 101 in the sequence listing; the amino acid sequence of the heavy chain variable region is as shown in SEQ ID No. 179 in the sequence listing, and the amino acid sequence of the light chain variable region is as shown in SEQ ID No. 5 in the sequence listing; or the amino acid sequence of the heavy chain variable region is as shown in SEQ ID No. 180 in the sequence listing, and the amino acid sequence of the light chain variable region is as shown in SEQ ID No. 5 in the sequence listing.
In summary, the numbers of the above-mentioned amino acid sequences are as shown in table 1:
The numbers in table 1 are the sequence numbers in the sequence listing. For example, the amino acid sequence of the heavy chain protein variable region of 75G7C6 is SEQ ID No. 1, and the amino acid sequence of the CDR1 in the heavy chain protein variable region of 75G7C6 is SEQ ID No. 2.
Preferably, the EGFRvIII antibody further comprises a framework region (or also known as a frame region or a backbone region), wherein the framework region further comprises a heavy chain framework region and/or a light chain framework region; preferably, the heavy chain framework region is a human or murine antibody heavy chain framework region, and/or the light chain framework region is a human or murine antibody light chain framework region; more preferably, the heavy chain framework region is a human antibody heavy chain framework region, and the light chain framework region is a human antibody light chain framework region. The framework residues are a part of the light chain variable region or the heavy chain variable region, and are residues of the antibody variable regions other than hypervariable residues (mostly referring to complementarity determining regions or CDRs) or CDR residues, wherein the residues of the antibody variable regions serve as a scaffold for the antigen-binding loop (CDR) of the variable domain. The framework residues may be derived from naturally occurring human antibodies, for example, the framework regions of human antibodies substantially similar to those of the murine anti-EGFRvIII antibody 75G7C6 or 63A10A7. Artificial framework sequences that represent consensus sequences among individual sequences can also be used. Once humanized framework regions are selected and used, sequences widely presented in humans may be superior to less common sequences. Additional mutations in the human framework receptor sequences may be prepared to restore murine residues thought to be involved in antigen contact and/or residues involved in the structural integrity of the antigen-binding sites, or to improve antibody expression. Peptide structure prediction can be used to analyze humanized heavy chain variable region sequences and light chain variable region sequences so as to identify and avoid post-translational protein modification sites introduced by humanized design.
More preferably, when the EGFRvIII antibody of the present invention is a humanized antibody or complete human antibody, the heavy chain framework region is a human antibody heavy chain framework region, and the residues of the human antibody light chain framework region may comprise sequences encoded by germlines IGKV1-12, IGKV1-13, IGKV1-16, IGKV1-17, IGKV1-22, IGKV1-27, IGKV1-32, IGKV1-37, IGKV1-39, IGKV1-5, IGKV1-8, IGKV1-9, IGKV3-34, IGKV2-36, IGKV3-11, IGKV2-40, IGKV3-25, IGKV6-21, IGKV7-3, IGKV5-2, IGKV4-1, IGKV2-4, IGKV2-19, IGKV2-18, IGKV3-20, IGKV2-26 and IGKV2-24, particularly FR1, FR2 and FR3 of these germlines; and Jk fragments Jk1, Jk2, Jk3, Jk4 and Jk5, particularly FR4 of these germlines. The residues of the human antibody heavy chain framework region may comprise sequences encoded by germlines IGHV7-81, IGHV4-80, IGHV3-79, IGHV(II)-78-1, IGHV5-78, IGHV3-76, IGHV3-73, IGHV3-72, IGHV2-70, IGHV1-69-2, IGHV2-70D, IGHV3-49, IGHV3-43, IGHV1-2, IGHV1-3, IGHV1-8, IGHV1-18, IGHV1-24, IGHV1-45, IGHV1-46, IGHV2-5, IGHV2-21, IGHV2-26, IGHV2-70, IGHD3-7, IGHV3-9, IGHV3-11, IGHV3-7, IGHV3-7 and IGHD3-9, particularly FR1, FR2 and FR3 of these germlines, and JH fragments JH1, JH2, JH3, JH4, JH4b, JH5 and JH6, particularly FR4 of these germlines; or consensus sequences of the heavy chain framework region. Such framework region sequences can be obtained from public DNA databases or published references that include germline antibody gene sequences. For example, the germline DNA sequences of human heavy and light chain variable region genes can be obtained from the “VBase” human germline sequence database (www.mrcco8.com.ac.uk/vbase), and found in Kabat, E. A et al., 1991 Sequences of Proteins of Immunological Interest, 5th edition. In a more preferred embodiment of the humanized antibody of the present invention, the human acceptable sequences of the humanized EGFRvIII antibody are selected from human germline exons VH, JH, Vk and Jk sequences, wherein the template of the antibody heavy chain variable region is preferably IGHV1-46*01 of the human germline antibody heavy chain exon VH, and JH-4 of the exon JH; or IGHV1-46*01 of the human germline antibody heavy chain exon VH, and JH-6b of the exon JH. The template of the antibody light chain variable region is preferably IGKV1-39*01 of the human germline antibody light chain exon VK, and JK-2 of the exon JK; or IGKV3-11*01 of the human germline antibody light chain exon VK, and JK-4 of the exon JK.
Preferably, the protein further comprises an antibody heavy chain constant region and/or an antibody light chain constant region, wherein the antibody heavy chain constant region is conventional in the art, preferably a mouse antibody heavy chain constant region or a human antibody heavy chain constant region, and more preferably a human antibody heavy chain constant region. The antibody light chain constant region is conventional in the art, preferably a mouse light chain antibody constant region or a human antibody light chain constant region, and more preferably a human antibody light chain constant region.
The heavy chain variable region having an amino acid sequence as shown in SEQ ID NO. 1, 9, 17, 25 or 179 and the light chain variable region having an amino acid sequence as shown in SEQ ID NO. 5, 13, 21 or 29 can form a murine EGFRvIII antibody with a murine heavy chain constant region, and also can form an EGFRvIII chimeric antibody with a human heavy chain constant region and a human light chain constant region (wherein the amino acid sequence shown in SEQ ID NO. 179 is obtained by mutating the amino acid sequence shown in SEQ ID NO. 1, with NG in the CDR2 region of the latter mutated to NA).
Further, in the above-mentioned chimeric antibody, the heavy chain CDR having a sequence as shown in SEQ ID NOs 2, 184 and 4 (corresponding to CDR1, 2 and 3, respectively) or SEQ ID NOs 10 to 12, determined according to the definition of Kabat, in the heavy chain variable region having an amino acid sequence as shown in SEQ ID NO. 179 or 9, and the light chain CDR having a sequence as shown in SEQ ID NOs 6 to 8 or SEQ ID NOs 14 to 16, determined according to the definition of Kabat, in the light chain variable region having an amino acid sequence as shown in SEQ ID NO. 5 or 13 are respectively grafted into the selected human germline template, thereby replacing the CDR regions of the human germline template to obtain a humanized antibody; the light chain framework region and the heavy chain framework region in the germline template are as described above; upon selection of the human antibody variable region frameworks, the light chain FR sequences on the antibody light chain variable region are derived from human germline light chain sequences comprising a combination of human antibody light chain framework segments of 1) FR1, FR2 and FR3 of IGKV1-39*01 or IGKV3-11*01, and 2) FR4 of JK-2 or JK-4; the heavy chain FR sequences on the antibody heavy chain variable region are derived from human germline heavy chain sequences comprising a combination of human antibody heavy chain framework segments of 1) FR1, FR2 and FR3 of IGHV1-46*01, and 2) FR4 of JH-4 or JH-6b. In general, the framework regions of a human receptor should be selected similar to those of the donor antibody or most similar to the consensus sequences of the variable region subfamilies. After graft, sequence mutations can be made in the donor and/or receptor sequences to optimize antigen binding, functionality, codon usage, expression levels, etc., including the introduction of non-human residues into the framework regions.
Preferably, after the above-mentioned CDR regions are grafted into the selected human germline template and the framework region residues are subjected to back mutation, the amino acid sequence of the heavy chain variable region is preferably as shown in SEQ ID NO. 133, 134, 135, 136, 143, 144, 145, 146, 147, 141, 142 or 147; and the amino acid sequence of the light chain variable region is preferably as shown in SEQ ID NO. 137, 138, 139, 140, 148, 149, 150, 151, 152 or 153.
The humanized EGFRvIII antibody of the present invention preferably comprises at least one heavy chain variable region and/or at least one light chain variable region, wherein the amino acid sequence of the heavy chain variable region is as shown in SEQ ID NO. 133, SEQ ID NO. 134, SEQ ID NO. 135, SEQ ID NO. 136, SEQ ID NO. 141, SEQ ID NO. 142, SEQ ID NO. 143, SEQ ID NO. 144, SEQ ID NO. 145, SEQ ID NO. 146 or SEQ ID NO. 147 in the sequence listing; the sequence of the light chain variable region is as shown in SEQ ID NO. 137, SEQ ID NO. 138, SEQ ID NO. 139, SEQ ID NO. 140, SEQ ID NO. 148, SEQ ID NO. 149, SEQ ID NO. 150, SEQ ID NO. 151, SEQ ID NO. 152 or SEQ ID NO. 153 in the sequence listing;
or the amino acid sequence of the heavy chain variable region is as shown in an amino acid sequence which has at least 80% sequence homology with the amino acid sequence as shown in SEQ ID NO. 133, SEQ ID NO. 134, SEQ ID NO. 135, SEQ ID NO. 136, SEQ ID NO. 141, SEQ ID NO. 142, SEQ ID NO. 143, SEQ ID NO. 144, SEQ ID NO. 145, SEQ ID NO. 146 or SEQ ID NO. 147 in the sequence listing; the sequence of the light chain variable region is as shown in an amino acid sequence which has at least 80% sequence homology with the amino acid sequence as shown in SEQ ID NO. 137, SEQ ID NO. 138, SEQ ID NO. 139, SEQ ID NO. 140, SEQ ID NO. 148, SEQ ID NO. 149, SEQ ID NO. 150, SEQ ID NO. 151, SEQ ID NO. 152 or SEQ ID NO. 153 in the sequence listing; more preferably, the amino acid sequence of the heavy chain variable region is an amino acid sequence which has 90% sequence homology with the amino acid sequence encoded by the nucleotide sequence as shown in SEQ ID No. 154, SEQ ID No. 155, SEQ ID No. 156, SEQ ID No. 157, SEQ ID No. 162, SEQ ID No. 163, SEQ ID No. 164, SEQ ID No. 165, SEQ ID No. 166, SEQ ID No. 167 or SEQ ID No. 168 in the sequence listing; the amino acid sequence of the light chain variable region is an amino acid sequence which has 90% sequence homology with the amino acid sequence encoded by the nucleotide sequence as shown in SEQ ID No. 158, SEQ ID No. 159, SEQ ID No. 160, SEQ ID No. 161, SEQ ID No. 169, SEQ ID No. 170, SEQ ID No. 171, SEQ ID No. 172, SEQ ID No. 173 or SEQ ID No. 174 in the sequence listing; further more preferably, the amino acid sequence of the heavy chain variable region is an amino acid sequence which has 95% sequence homology with the amino acid sequence encoded by the nucleotide sequence as shown in SEQ ID No. 154, SEQ ID No. 155, SEQ ID No. 156, SEQ ID No. 157, SEQ ID No. 162, SEQ ID No. 163, SEQ ID No. 164, SEQ ID No. 165, SEQ ID No. 166, SEQ ID No. 167 or SEQ ID No. 168 in the sequence listing; the amino acid sequence of the light chain variable region is an amino acid sequence which has 95% sequence homology with the amino acid sequence encoded by the nucleotide sequence as shown in SEQ ID No. 158, SEQ ID No. 159, SEQ ID No. 160, SEQ ID No. 161, SEQ ID No. 169, SEQ ID No. 170, SEQ ID No. 171, SEQ ID No. 172, SEQ ID No. 173 or SEQ ID No. 174 in the sequence listing; most preferably, the amino acid sequence of the heavy chain variable region is an amino acid sequence which has 99% sequence homology with the amino acid sequence encoded by the nucleotide sequence as shown in SEQ ID No. 154, SEQ ID No. 155, SEQ ID No. 156, SEQ ID No. 157, SEQ ID No. 162, SEQ ID No. 163, SEQ ID No. 164, SEQ ID No. 165, SEQ ID No. 166, SEQ ID No. 167 or SEQ ID No. 168 in the sequence listing; the amino acid sequence of the light chain variable region is an amino acid sequence which has 99% sequence homology with the amino acid sequence encoded by the nucleotide sequence as shown in SEQ ID No. 158, SEQ ID No. 159, SEQ ID No. 160, SEQ ID No. 161, SEQ ID No. 169, SEQ ID No. 170, SEQ ID No. 171, SEQ ID No. 172, SEQ ID No. 173 or SEQ ID No. 174 in the sequence listing.
Preferably, the amino acid sequence of the heavy chain variable region is an amino acid sequence which has 80% sequence homology with the amino acid sequence encoded by the nucleotide sequence as shown in SEQ ID No. 154, SEQ ID No. 155, SEQ ID No. 156, SEQ ID No. 157, SEQ ID No. 162, SEQ ID No. 163, SEQ ID No. 164, SEQ ID No. 165, SEQ ID No. 166, SEQ ID No. 167 or SEQ ID No. 168 in the sequence listing; the amino acid sequence of the light chain variable region is an amino acid sequence which has 80% sequence homology with the amino acid sequence encoded by the nucleotide sequence as shown in SEQ ID No. 158, SEQ ID No. 159, SEQ ID No. 160, SEQ ID No. 161, SEQ ID No. 169, SEQ ID No. 170, SEQ ID No. 171, SEQ ID No. 172, SEQ ID No. 173 or SEQ ID No. 174 in the sequence listing; more preferably, the amino acid sequence of the heavy chain variable region is an amino acid sequence which has 90% sequence homology with the amino acid sequence encoded by the nucleotide sequence as shown in SEQ ID No. 154, SEQ ID No. 155, SEQ ID No. 156, SEQ ID No. 157, SEQ ID No. 162, SEQ ID No. 163, SEQ ID No. 164, SEQ ID No. 165, SEQ ID No. 166, SEQ ID No. 167 or SEQ ID No. 168 in the sequence listing; the amino acid sequence of the light chain variable region is an amino acid sequence which has 90% sequence homology with the amino acid sequence encoded by the nucleotide sequence as shown in SEQ ID No. 158, SEQ ID No. 159, SEQ ID No. 160, SEQ ID No. 161, SEQ ID No. 169, SEQ ID No. 170, SEQ ID No. 171, SEQ ID No. 172, SEQ ID No. 173 or SEQ ID No. 174 in the sequence listing; further more preferably, the amino acid sequence of the heavy chain variable region is an amino acid sequence which has 95% sequence homology with the amino acid sequence encoded by the nucleotide sequence as shown in SEQ ID No. 154, SEQ ID No. 155, SEQ ID No. 156, SEQ ID No. 157, SEQ ID No. 162, SEQ ID No. 163, SEQ ID No. 164, SEQ ID No. 165, SEQ ID No. 166, SEQ ID No. 167 or SEQ ID No. 168 in the sequence listing; the amino acid sequence of the light chain variable region is an amino acid sequence which has 95% sequence homology with the amino acid sequence encoded by the nucleotide sequence as shown in SEQ ID No. 158, SEQ ID No. 159, SEQ ID No. 160, SEQ ID No. 161, SEQ ID No. 169, SEQ ID No. 170, SEQ ID No. 171, SEQ ID No. 172, SEQ ID No. 173 or SEQ ID No. 174 in the sequence listing; most preferably, the amino acid sequence of the heavy chain variable region is an amino acid sequence which has 99% sequence homology with the amino acid sequence encoded by the nucleotide sequence as shown in SEQ ID No. 154, SEQ ID No. 155, SEQ ID No. 156, SEQ ID No. 157, SEQ ID No. 162, SEQ ID No. 163, SEQ ID No. 164, SEQ ID No. 165, SEQ ID No. 166, SEQ ID No. 167 or SEQ ID No. 168 in the sequence listing; the amino acid sequence of the light chain variable region is an amino acid sequence which has 99% sequence homology with the amino acid sequence encoded by the nucleotide sequence as shown in SEQ ID No. 158, SEQ ID No. 159, SEQ ID No. 160, SEQ ID No. 161, SEQ ID No. 169, SEQ ID No. 170, SEQ ID No. 171, SEQ ID No. 172, SEQ ID No. 173 or SEQ ID No. 174 in the sequence listing.
Preferably, the amino acid sequence of the heavy chain variable region is a sequence as shown in SEQ ID No. 133 in the sequence listing, and the amino acid sequence of the light chain variable region is a sequence as shown in SEQ ID No. 137 in the sequence listing; the amino acid sequence of the heavy chain variable region is a sequence as shown in SEQ ID No. 133 in the sequence listing, and the amino acid sequence of the light chain variable region is a sequence as shown in SEQ ID No. 138 in the sequence listing; the amino acid sequence of the heavy chain variable region is a sequence as shown in SEQ ID No. 133 in the sequence listing, and the amino acid sequence of the light chain variable region is a sequence as shown in SEQ ID No. 139 in the sequence listing; the amino acid sequence of the heavy chain variable region is a sequence as shown in SEQ ID No. 133 in the sequence listing, and the amino acid sequence of the light chain variable region is a sequence as shown in SEQ ID No. 140 in the sequence listing; the amino acid sequence of the heavy chain variable region is a sequence as shown in SEQ ID No. 134 in the sequence listing, and the amino acid sequence of the light chain variable region is a sequence as shown in SEQ ID No. 137 in the sequence listing; the amino acid sequence of the heavy chain variable region is a sequence as shown in SEQ ID No. 134 in the sequence listing, and the amino acid sequence of the light chain variable region is a sequence as shown in SEQ ID No. 138 in the sequence listing; the amino acid sequence of the heavy chain variable region is a sequence as shown in SEQ ID No. 134 in the sequence listing, and the amino acid sequence of the light chain variable region is a sequence as shown in SEQ ID No. 139 in the sequence listing; the amino acid sequence of the heavy chain variable region is a sequence as shown in SEQ ID No. 134 in the sequence listing, and the amino acid sequence of the light chain variable region is a sequence as shown in SEQ ID No. 140 in the sequence listing; the amino acid sequence of the heavy chain variable region is a sequence as shown in SEQ ID No. 135 in the sequence listing, and the amino acid sequence of the light chain variable region is a sequence as shown in SEQ ID No. 137 in the sequence listing; the amino acid sequence of the heavy chain variable region is a sequence as shown in SEQ ID No. 135 in the sequence listing, and the amino acid sequence of the light chain variable region is a sequence as shown in SEQ ID No. 138 in the sequence listing; the amino acid sequence of the heavy chain variable region is a sequence as shown in SEQ ID No. 135 in the sequence listing, and the amino acid sequence of the light chain variable region is a sequence as shown in SEQ ID No. 139 in the sequence listing; the amino acid sequence of the heavy chain variable region is a sequence as shown in SEQ ID No. 135 in the sequence listing, and the amino acid sequence of the light chain variable region is a sequence as shown in SEQ ID No. 140 in the sequence listing; the amino acid sequence of the heavy chain variable region is a sequence as shown in SEQ ID No. 136 in the sequence listing, and the amino acid sequence of the light chain variable region is a sequence as shown in SEQ ID No. 137 in the sequence listing; the amino acid sequence of the heavy chain variable region is a sequence as shown in SEQ ID No. 136 in the sequence listing, and the amino acid sequence of the light chain variable region is a sequence as shown in SEQ ID No. 138 in the sequence listing; the amino acid sequence of the heavy chain variable region is a sequence as shown in SEQ ID No. 136 in the sequence listing, and the amino acid sequence of the light chain variable region is a sequence as shown in SEQ ID No. 139 in the sequence listing; the amino acid sequence of the heavy chain variable region is a sequence as shown in SEQ ID No. 136 in the sequence listing, and the amino acid sequence of the light chain variable region is a sequence as shown in SEQ ID No. 140 in the sequence listing; the amino acid sequence of the heavy chain variable region is a sequence as shown in SEQ ID No. 143 in the sequence listing, and the amino acid sequence of the light chain variable region is a sequence as shown in SEQ ID No. 149 in the sequence listing; the amino acid sequence of the heavy chain variable region is a sequence as shown in SEQ ID No. 144 in the sequence listing, and the amino acid sequence of the light chain variable region is a sequence as shown in SEQ ID No. 149 in the sequence listing; the amino acid sequence of the heavy chain variable region is a sequence as shown in SEQ ID No. 145 in the sequence listing, and the amino acid sequence of the light chain variable region is a sequence as shown in SEQ ID No. 149 in the sequence listing; the amino acid sequence of the heavy chain variable region is a sequence as shown in SEQ ID No. 146 in the sequence listing, and the amino acid sequence of the light chain variable region is a sequence as shown in SEQ ID No. 149 in the sequence listing; the amino acid sequence of the heavy chain variable region is a sequence as shown in SEQ ID No. 147 in the sequence listing, and the amino acid sequence of the light chain variable region is a sequence as shown in SEQ ID No. 149 in the sequence listing; the amino acid sequence of the heavy chain variable region is a sequence as shown in SEQ ID No. 143 in the sequence listing, and the amino acid sequence of the light chain variable region is a sequence as shown in SEQ ID No. 148 in the sequence listing; the amino acid sequence of the heavy chain variable region is a sequence as shown in SEQ ID No. 143 in the sequence listing, and the amino acid sequence of the light chain variable region is a sequence as shown in SEQ ID No. 150 in the sequence listing; the amino acid sequence of the heavy chain variable region is a sequence as shown in SEQ ID No. 143 in the sequence listing, and the amino acid sequence of the light chain variable region is a sequence as shown in SEQ ID No. 151 in the sequence listing; the amino acid sequence of the heavy chain variable region is a sequence as shown in SEQ ID No. 143 in the sequence listing, and the amino acid sequence of the light chain variable region is a sequence as shown in SEQ ID No. 152 in the sequence listing; the amino acid sequence of the heavy chain variable region is a sequence as shown in SEQ ID No. 143 in the sequence listing, and the amino acid sequence of the light chain variable region is a sequence as shown in SEQ ID No. 153 in the sequence listing; the amino acid sequence of the heavy chain variable region is a sequence as shown in SEQ ID No. 141 in the sequence listing, and the amino acid sequence of the light chain variable region is a sequence as shown in SEQ ID No. 149 in the sequence listing; the amino acid sequence of the heavy chain variable region is a sequence as shown in SEQ ID No. 141 in the sequence listing, and the amino acid sequence of the light chain variable region is a sequence as shown in SEQ ID No. 150 in the sequence listing; the amino acid sequence of the heavy chain variable region is a sequence as shown in SEQ ID No. 141 in the sequence listing, and the amino acid sequence of the light chain variable region is a sequence as shown in SEQ ID No. 151 in the sequence listing; the amino acid sequence of the heavy chain variable region is a sequence as shown in SEQ ID No. 141 in the sequence listing, and the amino acid sequence of the light chain variable region is a sequence as shown in SEQ ID No. 152 in the sequence listing; the amino acid sequence of the heavy chain variable region is a sequence as shown in SEQ ID No. 141 in the sequence listing, and the amino acid sequence of the light chain variable region is a sequence as shown in SEQ ID No. 153 in the sequence listing; the amino acid sequence of the heavy chain variable region is a sequence as shown in SEQ ID No. 142 in the sequence listing, and the amino acid sequence of the light chain variable region is a sequence as shown in SEQ ID No. 149 in the sequence listing; the amino acid sequence of the heavy chain variable region is a sequence as shown in SEQ ID No. 142 in the sequence listing, and the amino acid sequence of the light chain variable region is a sequence as shown in SEQ ID No. 150 in the sequence listing; the amino acid sequence of the heavy chain variable region is a sequence as shown in SEQ ID No. 142 in the sequence listing, and the amino acid sequence of the light chain variable region is a sequence as shown in SEQ ID No. 151 in the sequence listing; the amino acid sequence of the heavy chain variable region is a sequence as shown in SEQ ID No. 142 in the sequence listing, and the amino acid sequence of the light chain variable region is a sequence as shown in SEQ ID No. 152 in the sequence listing; the amino acid sequence of the heavy chain variable region is a sequence as shown in SEQ ID No. 142 in the sequence listing, and the amino acid sequence of the light chain variable region is a sequence as shown in SEQ ID No. 153 in the sequence listing; the amino acid sequence of the heavy chain variable region is a sequence as shown in SEQ ID No. 145 in the sequence listing, and the amino acid sequence of the light chain variable region is a sequence as shown in SEQ ID No. 148 in the sequence listing; the amino acid sequence of the heavy chain variable region is a sequence as shown in SEQ ID No. 146 in the sequence listing, and the amino acid sequence of the light chain variable region is a sequence as shown in SEQ ID No. 148 in the sequence listing; or the amino acid sequence of the heavy chain variable region is a sequence as shown in SEQ ID No. 147 in the sequence listing, and the amino acid sequence of the light chain variable region is a sequence as shown in SEQ ID No. 148 in the sequence listing.
In summary, the numbers of the above-mentioned amino acid sequences are as shown in table 1-1.
The numbers in table 1-1 are the sequence numbers in the sequence listing. For example, the amino acid sequence of the heavy chain protein variable region of h75G7C6-1 is SEQ ID No. 133, and the amino acid sequence of the light chain protein variable region of h75G7C6-1 is SEQ ID No. 137.
The CDR1, CDR2 and CDR3 of the heavy chain variable region of c75G7C6-1 are as shown in SEQ ID NO. 2, SEQ ID NO. 184 and SEQ ID NO. 4 in the sequence listing, respectively.
The CDR1, CDR2 and CDR3 of the heavy chain variable region of c75G7C6-2 are as shown in SEQ ID NO. 2, SEQ ID NO. 186 and SEQ ID NO. 4 in the sequence listing, respectively.
Preferably, the humanized anti-EGFRvIII antibody further comprises a human antibody heavy chain constant region and/or a human antibody light chain constant region. The heavy chain variable region and the light chain variable region form a full-length humanized antibody protein with the human heavy chain constant region and the human light chain constant region. The human antibody heavy chain constant region described therein is conventional in the art, and may comprise a constant region derived from a human constant region, which further comprises a heavy chain constant region of human IgG1, IgG2, IgG3, IgG4 or a variant thereof; the human antibody light chain constant region described therein is conventional in the art, and may comprise a constant region derived from a human constant region, which further comprises a light chain constant region of human κ and λ chains or a variant thereof.
The humanized anti-EGFRvIII antibody of the present invention is prepared by using any of various methods, including veneering and graft of complementarity determining regions (CDRs), graft of shortened CDRs, graft of specificity determining regions (SDRs), and Frankenstein assembly. Specificity determining regions (SDRs) are the residues within CDRs that interact directly with antigens. SDRs correspond to hypervariable residues. See Padlan et al. (1995) FASEB J. 9: 133-139).
The preparation of the representative humanized anti-EGFRvIII antibody of the present invention is described in example 1.
The human antibodies of the present invention are also a class of chimeric antibodies in a broad sense, in which the variable region residues responsible for antigen binding include complementarity determining regions derived from non-human species, shortened complementarity determining regions, or any other residues involved in antigen binding; the remaining variable region residues, such as framework region residues and constant region residues, are derived at least in part from human antibody sequences. A subset of the framework region residues and constant region residues of the humanized antibody may be derived from non-human sources. The variable regions of the humanized antibody are also described as a humanized light chain variable region and/or heavy chain variable region. Non-human species are generally species used for immunization with antigens, such as mice, rats, rabbits, non-human primates, or other non-human mammal species. Humanized antibodies are generally less immunogenic than traditional chimeric antibodies and show improved stability after administration to humans.
The humanized antibodies also include super-humanized antibodies, which are used in a method for preparing humanized antibodies. This method does not rely on human framework sequences as analysis points, but on the comparison of the canonical CDR structure types of non-human antibodies and CDR structure types of human antibodies, especially human antibodies encoded by human germline sequences, from which candidate human antibody sequences that can be used to obtain suitable human framework sequences are identified. For example, human residues can replace non-human residues in CDRs, where one or more changes have been introduced into the CDRs. One premise of veneering is that the immunogenicity of the murine antibody variable regions originates from the surface residues thereof, and the mobility of the residues and the accessibility of solvents are the basic conditions thereof to become an antigenic determinant. According to the analysis results of the existing antibody crystal structure data, the fidelity of the relative solvent accessibility distribution of human and murine antibody variable region residues at the sequence pairing position reaches 98%, indicating that the residues that induce the immune response among the heterologous species are caused by the remaining species-specific solvent accessible surface residues. Therefore, by replacing the murine-specific surface residues with human-specific surface residues, the surface profile of human antibodies can be simulated to escape the recognition of the human immune system and achieve the purpose of humanization. In short, veneering is based on the concept of reducing the potential immunogenic amino acid sequences in rodent or other non-human antibodies by reconstructing a solvent-accessible surface of the antibody with human amino acid sequences. See Padlan (1991) Mol. Immunol. 28: 489-980. Veneering is performed by identifying the external framework residues of the solvent accessible residues exposed on the surface in non-human antibodies (the residues are different from those at the same position in the framework regions of human antibodies), and replacing the identified residues with those amino acids occupying the same positions in human antibodies; that is to say, the surface residues of the veneered antibodies are mainly human sequences, while the residues wrapped inside are mainly original murine sequences. The graft of CDRs is performed by replacing one or more CDRs of a receptor antibody (e.g., a human antibody or other antibodies containing the desired framework residues) with the CDRs of a donor antibody (e.g., a non-human antibody). The receptor antibody may be selected based on the similarity of framework residues between a candidate receptor antibody and donor antibody. For instance, according to the Frankenstein method, human framework regions having substantial sequence homology with each framework region of the relevant non-human antibody are identified, and the CDRs of the non-human antibody are grafted onto the complex of these different human framework regions. The above-mentioned methods may be combined to produce the anti-EGFRvIII antibody having any desired sequence.
The EGFRvIII antibody of the present invention is preferably one or more of an antibody full-length protein, antigen-antibody binding domain protein fragment, bispecific antibody, multispecific antibody, single chain antibody (scFv), single domain antibody (sdAb) and single-domain antibody, and a monoclonal antibody or polyclonal antibody prepared from the above-mentioned antibodies. The monoclonal antibody may be developed by various ways and techniques, including a hybridoma technique, phage display technique, single lymphocyte gene cloning technique, etc, wherein the monoclonal antibody is mainly prepared from wild-type or transgenic mice by hybridoma technique.
The antibody full-length protein is a conventional full-length antibody protein in the art, comprising a heavy chain variable region, light chain variable region, heavy chain constant region, and light chain constant region. The heavy chain variable region and light chain variable region of the protein form a complete human antibody full-length protein with a human heavy chain constant region and a human light chain constant region. Preferably, the full-length antibody protein is IgG1, IgG2, IgG3 or IgG4.
The single chain antibody is a conventional single chain antibody in the art, comprising a heavy chain variable region, light chain variable region and short peptide of 15 to 20 amino acids.
The antigen-antibody binding domain protein fragment is a conventional antigen-antibody binding domain protein fragment in the art, comprising a light chain variable region, light chain constant region, and Fd segment of the heavy chain constant region. Preferably, the antigen-antibody binding domain protein fragment is Fab and F(ab′).
The single domain antibody is a conventional single domain antibody in the art, comprising a heavy chain variable region and heavy chain constant region.
The single-domain antibody is a conventional single-domain antibody in the art, only comprising a heavy chain variable region.
The EGFRvIII antibody of the present invention also includes a super-humanized antibody, diabody, etc.
The method for preparing the EGFRvIII antibody is a conventional preparation method in the art. The preparation method preferably comprises: obtaining the antibody by isolation from an expression transformant recombinantly expressing the protein or by artificial synthesis of sequences of the protein. The method for obtaining the antibody by isolation from an expression transformant recombinantly expressing the protein preferably comprises: cloning a nucleic acid molecule encoding the EGFRvIII antibody and having a point mutation into a recombinant vector, transforming the obtained recombinant vector into a transformant to obtain a recombinant expression transformant, and culturing the obtained recombinant expression transformant to obtain the EGFRvIII antibody by isolation and purification.
The present invention further provides a nucleic acid encoding the above-mentioned EGFRvIII antibody.
Preferably, the nucleotide sequence of the nucleic acid encoding the heavy chain variable region is as shown in SEQ ID No. 105, SEQ ID No. 107, SEQ ID No. No. 109, SEQ ID No. 111, SEQ ID No. 113, SEQ ID No. 115, SEQ ID 117, SEQ ID No. 119, SEQ ID No. 121, SEQ ID No. 123, SEQ ID No. 125, SEQ ID No. 127, SEQ ID No. 129, SEQ ID No. 185, SEQ ID No. 154, SEQ ID No. 155, SEQ ID No. 156, SEQ ID No. 157, SEQ ID No. 162, SEQ ID No. 163, SEQ ID No. 164, SEQ ID No. 165, SEQ ID No. 166, SEQ ID No. 167 or SEQ ID No. 168 in the sequence listing; and/or the nucleotide sequence of the nucleic acid encoding the light chain variable region is as shown in SEQ ID No. 106, SEQ ID No. 108, SEQ ID No. 110, SEQ ID No. 112, SEQ ID No. 114, SEQ ID No. 116, SEQ ID No. 118, SEQ ID No. 120, SEQ ID No. 122, SEQ ID No. 124, SEQ ID No. 126, SEQ ID No. 128, SEQ ID No. 130, SEQ ID No. 158, SEQ ID No. 159, SEQ ID No. 160, SEQ ID No. 161, SEQ ID No. 169, SEQ ID No. 170, SEQ ID No. 171, SEQ ID No. 172, SEQ ID No. 173 or SEQ ID No. 174 in the sequence listing.
More preferably, the nucleotide sequence of the nucleic acid encoding the heavy chain variable region is as shown in SEQ ID No. 105 in the sequence listing, and the nucleotide sequence of the nucleic acid encoding the light chain variable region is as shown in SEQ ID No. 106 in the sequence listing; the nucleotide sequence of the nucleic acid encoding the heavy chain variable region is as shown in SEQ ID No. 107 in the sequence listing, and the nucleotide sequence of the nucleic acid encoding the light chain variable region is as shown in SEQ ID No. 108 in the sequence listing; the nucleotide sequence of the nucleic acid encoding the heavy chain variable region is as shown in SEQ ID No. 109 in the sequence listing, and the nucleotide sequence of the nucleic acid encoding the light chain variable region is as shown in SEQ ID No. 110 in the sequence listing; the nucleotide sequence of the nucleic acid encoding the heavy chain variable region is as shown in SEQ ID No. 111 in the sequence listing, and the nucleotide sequence of the nucleic acid encoding the light chain variable region is as shown in SEQ ID No. 112 in the sequence listing; the nucleotide sequence of the nucleic acid encoding the heavy chain variable region is as shown in SEQ ID No. 113 in the sequence listing, and the nucleotide sequence of the nucleic acid encoding the light chain variable region is as shown in SEQ ID No. 114 in the sequence listing; the nucleotide sequence of the nucleic acid encoding the heavy chain variable region is as shown in SEQ ID No. 115 in the sequence listing, and the nucleotide sequence of the nucleic acid encoding the light chain variable region is as shown in SEQ ID No. 116 in the sequence listing; the nucleotide sequence of the nucleic acid encoding the heavy chain variable region is as shown in SEQ ID No. 117 in the sequence listing, and the nucleotide sequence of the nucleic acid encoding the light chain variable region is as shown in SEQ ID No. 118 in the sequence listing; the nucleotide sequence of the nucleic acid encoding the heavy chain variable region is as shown in SEQ ID No. 119 in the sequence listing, and the nucleotide sequence of the nucleic acid encoding the light chain variable region is as shown in SEQ ID No. 120 in the sequence listing; the nucleotide sequence of the nucleic acid encoding the heavy chain variable region is as shown in SEQ ID No. 121 in the sequence listing, and the nucleotide sequence of the nucleic acid encoding the light chain variable region is as shown in SEQ ID No. 122 in the sequence listing; the nucleotide sequence of the nucleic acid encoding the heavy chain variable region is as shown in SEQ ID No. 123 in the sequence listing, and the nucleotide sequence of the nucleic acid encoding the light chain variable region is as shown in SEQ ID No. 124 in the sequence listing; the nucleotide sequence of the nucleic acid encoding the heavy chain variable region is as shown in SEQ ID No. 125 in the sequence listing, and the nucleotide sequence of the nucleic acid encoding the light chain variable region is as shown in SEQ ID No 126 in the sequence listing; the nucleotide sequence of the nucleic acid encoding the heavy chain variable region is as shown in SEQ ID No. 127 in the sequence listing, and the nucleotide sequence of the nucleic acid encoding the light chain variable region is as shown in SEQ ID No. 128 in the sequence listing; the nucleotide sequence of the nucleic acid encoding the heavy chain variable region is as shown in SEQ ID No. 129 in the sequence listing, and the nucleotide sequence of the nucleic acid encoding the light chain variable region is as shown in SEQ ID No. 130 in the sequence listing; the nucleotide sequence of the nucleic acid encoding the heavy chain variable region is as shown in SEQ ID No. 185 in the sequence listing, and the nucleotide sequence of the nucleic acid encoding the light chain variable region is as shown in SEQ ID No. 106 in the sequence listing; the nucleotide sequence of the nucleic acid encoding the heavy chain variable region is as shown in SEQ ID No. 154 in the sequence listing, and the nucleotide sequence of the nucleic acid encoding the light chain variable region is as shown in SEQ ID No. 158 in the sequence listing; the nucleotide sequence of the nucleic acid encoding the heavy chain variable region is as shown in SEQ ID No. 154 in the sequence listing, and the nucleotide sequence of the nucleic acid encoding the light chain variable region is as shown in SEQ ID No. 159 in the sequence listing; the nucleotide sequence of the nucleic acid encoding the heavy chain variable region is as shown in SEQ ID No. 154 in the sequence listing, and the nucleotide sequence of the nucleic acid encoding the light chain variable region is as shown in SEQ ID No. 160 in the sequence listing; the nucleotide sequence of the nucleic acid encoding the heavy chain variable region is as shown in SEQ ID No. 154 in the sequence listing, and the nucleotide sequence of the nucleic acid encoding the light chain variable region is as shown in SEQ ID No. 161 in the sequence listing; the nucleotide sequence of the nucleic acid encoding the heavy chain variable region is as shown in SEQ ID No. 155 in the sequence listing, and the nucleotide sequence of the nucleic acid encoding the light chain variable region is as shown in SEQ ID No. 158 in the sequence listing; the nucleotide sequence of the nucleic acid encoding the heavy chain variable region is as shown in SEQ ID No. 155 in the sequence listing, and the nucleotide sequence of the nucleic acid encoding the light chain variable region is as shown in SEQ ID No. 159 in the sequence listing; the nucleotide sequence of the nucleic acid encoding the heavy chain variable region is as shown in SEQ ID No. 155 in the sequence listing, and the nucleotide sequence of the nucleic acid encoding the light chain variable region is as shown in SEQ ID No. 160 in the sequence listing; the nucleotide sequence of the nucleic acid encoding the heavy chain variable region is as shown in SEQ ID No. 155 in the sequence listing, and the nucleotide sequence of the nucleic acid encoding the light chain variable region is as shown in SEQ ID No. 161 in the sequence listing; the nucleotide sequence of the nucleic acid encoding the heavy chain variable region is as shown in SEQ ID No. 156 in the sequence listing, and the nucleotide sequence of the nucleic acid encoding the light chain variable region is as shown in SEQ ID No. 158 in the sequence listing; the nucleotide sequence of the nucleic acid encoding the heavy chain variable region is as shown in SEQ ID No. 156 in the sequence listing, and the nucleotide sequence of the nucleic acid encoding the light chain variable region is as shown in SEQ ID No. 159 in the sequence listing; the nucleotide sequence of the nucleic acid encoding the heavy chain variable region is as shown in SEQ ID No. 156 in the sequence listing, and the nucleotide sequence of the nucleic acid encoding the light chain variable region is as shown in SEQ ID No. 160 in the sequence listing; the nucleotide sequence of the nucleic acid encoding the heavy chain variable region is as shown in SEQ ID No. 156 in the sequence listing, and the nucleotide sequence of the nucleic acid encoding the light chain variable region is as shown in SEQ ID No. 161 in the sequence listing; the nucleotide sequence of the nucleic acid encoding the heavy chain variable region is as shown in SEQ ID No. 157 in the sequence listing, and the nucleotide sequence of the nucleic acid encoding the light chain variable region is as shown in SEQ ID No. 158 in the sequence listing; the nucleotide sequence of the nucleic acid encoding the heavy chain variable region is as shown in SEQ ID No. 157 in the sequence listing, and the nucleotide sequence of the nucleic acid encoding the light chain variable region is as shown in SEQ ID No. 159 in the sequence listing; the nucleotide sequence of the nucleic acid encoding the heavy chain variable region is as shown in SEQ ID No. 157 in the sequence listing, and the nucleotide sequence of the nucleic acid encoding the light chain variable region is as shown in SEQ ID No. 160 in the sequence listing; the nucleotide sequence of the nucleic acid encoding the heavy chain variable region is as shown in SEQ ID No. 157 in the sequence listing, and the nucleotide sequence of the nucleic acid encoding the light chain variable region is as shown in SEQ ID No. 161 in the sequence listing; the nucleotide sequence of the nucleic acid encoding the heavy chain variable region is as shown in SEQ ID No. 164 in the sequence listing, and the nucleotide sequence of the nucleic acid encoding the light chain variable region is as shown in SEQ ID No. 170 in the sequence listing; the nucleotide sequence of the nucleic acid encoding the heavy chain variable region is as shown in SEQ ID No. 165 in the sequence listing, and the nucleotide sequence of the nucleic acid encoding the light chain variable region is as shown in SEQ ID No. 170 in the sequence listing; the nucleotide sequence of the nucleic acid encoding the heavy chain variable region is as shown in SEQ ID No. 166 in the sequence listing, and the nucleotide sequence of the nucleic acid encoding the light chain variable region is as shown in SEQ ID No. 170 in the sequence listing; the nucleotide sequence of the nucleic acid encoding the heavy chain variable region is as shown in SEQ ID No. 167 in the sequence listing, and the nucleotide sequence of the nucleic acid encoding the light chain variable region is as shown in SEQ ID No. 170 in the sequence listing; the nucleotide sequence of the nucleic acid encoding the heavy chain variable region is as shown in SEQ ID No. 168 in the sequence listing, and the nucleotide sequence of the nucleic acid encoding the light chain variable region is as shown in SEQ ID No. 170 in the sequence listing; the nucleotide sequence of the nucleic acid encoding the heavy chain variable region is as shown in SEQ ID No. 164 in the sequence listing, and the nucleotide sequence of the nucleic acid encoding the light chain variable region is as shown in SEQ ID No. 169 in the sequence listing; the nucleotide sequence of the nucleic acid encoding the heavy chain variable region is as shown in SEQ ID No. 164 in the sequence listing, and the nucleotide sequence of the nucleic acid encoding the light chain variable region is as shown in SEQ ID No. 171 in the sequence listing; the nucleotide sequence of the nucleic acid encoding the heavy chain variable region is as shown in SEQ ID No. 164 in the sequence listing, and the nucleotide sequence of the nucleic acid encoding the light chain variable region is as shown in SEQ ID No. 172 in the sequence listing; the nucleotide sequence of the nucleic acid encoding the heavy chain variable region is as shown in SEQ ID No. 164 in the sequence listing, and the nucleotide sequence of the nucleic acid encoding the light chain variable region is as shown in SEQ ID No. 173 in the sequence listing; the nucleotide sequence of the nucleic acid encoding the heavy chain variable region is as shown in SEQ ID No. 164 in the sequence listing, and the nucleotide sequence of the nucleic acid encoding the light chain variable region is as shown in SEQ ID No. 174 in the sequence listing; the nucleotide sequence of the nucleic acid encoding the heavy chain variable region is as shown in SEQ ID No. 162 in the sequence listing, and the nucleotide sequence of the nucleic acid encoding the light chain variable region is as shown in SEQ ID No. 170 in the sequence listing; the nucleotide sequence of the nucleic acid encoding the heavy chain variable region is as shown in SEQ ID No. 162 in the sequence listing, and the nucleotide sequence of the nucleic acid encoding the light chain variable region is as shown in SEQ ID No. 171 in the sequence listing; the nucleotide sequence of the nucleic acid encoding the heavy chain variable region is as shown in SEQ ID No. 162 in the sequence listing, and the nucleotide sequence of the nucleic acid encoding the light chain variable region is as shown in SEQ ID No. 172 in the sequence listing; the nucleotide sequence of the nucleic acid encoding the heavy chain variable region is as shown in SEQ ID No. 162 in the sequence listing, and the nucleotide sequence of the nucleic acid encoding the light chain variable region is as shown in SEQ ID No. 173 in the sequence listing; the nucleotide sequence of the nucleic acid encoding the heavy chain variable region is as shown in SEQ ID No. 162 in the sequence listing, and the nucleotide sequence of the nucleic acid encoding the light chain variable region is as shown in SEQ ID No. 174 in the sequence listing; the nucleotide sequence of the nucleic acid encoding the heavy chain variable region is as shown in SEQ ID No. 163 in the sequence listing, and the nucleotide sequence of the nucleic acid encoding the light chain variable region is as shown in SEQ ID No. 170 in the sequence listing; the nucleotide sequence of the nucleic acid encoding the heavy chain variable region is as shown in SEQ ID No. 163 in the sequence listing, and the nucleotide sequence of the nucleic acid encoding the light chain variable region is as shown in SEQ ID No. 171 in the sequence listing; the nucleotide sequence of the nucleic acid encoding the heavy chain variable region is as shown in SEQ ID No. 163 in the sequence listing, and the nucleotide sequence of the nucleic acid encoding the light chain variable region is as shown in SEQ ID No. 172 in the sequence listing; the nucleotide sequence of the nucleic acid encoding the heavy chain variable region is as shown in SEQ ID No. 163 in the sequence listing, and the nucleotide sequence of the nucleic acid encoding the light chain variable region is as shown in SEQ ID No. 173 in the sequence listing; the nucleotide sequence of the nucleic acid encoding the heavy chain variable region is as shown in SEQ ID No. 163 in the sequence listing, and the nucleotide sequence of the nucleic acid encoding the light chain variable region is as shown in SEQ ID No. 174 in the sequence listing; the nucleotide sequence of the nucleic acid encoding the heavy chain variable region is as shown in SEQ ID No. 166 in the sequence listing, and the nucleotide sequence of the nucleic acid encoding the light chain variable region is as shown in SEQ ID No. 169 in the sequence listing; the nucleotide sequence of the nucleic acid encoding the heavy chain variable region is as shown in SEQ ID No. 167 in the sequence listing, and the nucleotide sequence of the nucleic acid encoding the light chain variable region is as shown in SEQ ID No. 169 in the sequence listing; or the nucleotide sequence of the nucleic acid encoding the heavy chain variable region is as shown in SEQ ID No. 168 in the sequence listing, and the nucleotide sequence of the nucleic acid encoding the light chain variable region is as shown in SEQ ID No. 169 in the sequence listing.
In summary, the numbers of the above-mentioned nucleotide sequences are as shown in tables 1-2 and 2.
The numbers in table 1-2 are the sequence numbers in the sequence listing. For example, the nucleotide sequence of the heavy chain protein variable region of h75G7C6-1 is SEQ ID No. 154, and the nucleotide sequence of the light chain protein variable region of h75G7C6-1 is SEQ ID No. 158.
The method for preparing the nucleic acid is a conventional preparation method in the art, and preferably, the method comprises the following steps: using a gene cloning technique to obtain a nucleic acid molecule encoding the above-mentioned humanized anti-EGFRvIII antibody, or using an artificial complete sequence synthesis method to obtain a nucleic acid molecule encoding the above-mentioned humanized anti-EGFRvIII antibody.
Those of skill in the art is aware that the nucleic acid encoding the above-mentioned humanized anti-EGFRvIII antibody can be appropriately introduced a substitution, deletion, alteration, insertion or addition to provide a homolog of a polynucleotide. The homologue of a polynucleotide in the present invention can be prepared by performing substitution, deletion or addition on one or more nucleotides of the nucleic acid encoding the humanized anti-EGFRvIII antibody within the range of maintaining antibody activity.
The numbers in table 2 are the sequence numbers in the sequence listing. For example, the nucleotide sequence encoding the heavy chain protein variable region of 75G7C6 is SEQ ID No. 105.
The complementarity determining regions (CDRs) of the present invention are the residues of antibody variable regions involved in antigen binding. Several numbering systems for identifying CDRs are commonly used, including for example, Kabat definition, Chothia definition, and AbM definition. To sum up, the Kabat definition is based on sequence variability; the Chothia definition is based on the location of structural loop regions; and the AbM definition is a compromise between the Kabat method and the Chothia method. According to the Kabat, Chothia, or AbM algorithm, the light chain variable region has three CDR regions, where CDR1 is located at amino acids at positions 24-34 (CDR1-L), CDR2 located at amino acids at positions 50-56 (CDR2-L), and CDR3 located at amino acids at positions 89-97 (CDR3-L). Due to the change in length of the variable regions, in different centers or different subgroups, there may be 1-6 amino acids at position 27 and there may also be 1-6 amino acids at position 95; and the amino acids are numbered by adding English letters based on the original numbering, such as 27A, 27B, 95A and 95B. According to the Kabat definition, the CDRs of the heavy chain variable region are defined by residues at positions 31 and 35B (CDR1-H), positions 50 and 65 (CDR2-H), and positions 95 and 102 (CDR3-H) (according to Kabat numbering). According to the Chothia definition, the CDRs of the heavy chain variable region are defined by residues at positions 26 and 32 (CDR1-H), positions 52 and 56 (CDR2-H), and positions 95 and 102 (CDR3-H) (according to Chothia numbering). According to the AbM definition, the CDRs of the heavy chain variable region are defined by residues at positions 26 and 35B (CDR1-H), positions 50 and 58 (CDR2-H), and positions 95 and 102 (CDR3-H) (according to Kabat numbering). Similar to the light chain variable region, there may also be multiple amino acids at positions 35, 52, 82, and 100, numbered A, B, C, etc. See Martin et al., (1989) Proc. Nat 1. Acad. Sci. USA 86: 9268-9272; Martin et al., (1991) Methods Enzymol. 203: 121-153; Pedersen et al., (1992) Immunomethods 1: 126; Protein Structure Prediction, Oxford University Press, Oxford, pp. 141-172. In the present invention, the CDRs of the following murine antibodies are numbered by Chothia numbering system, and those of the subsequent humanized antibodies are numbered by Kabat numbering system.
The nucleotide sequence encoding CDR1 in the heavy chain protein variable region of 75G7C6 is position 76 to position 96 of SEQ ID No. 105 in the sequence listing;
the nucleotide sequence encoding CDR2 in the heavy chain protein variable region of 75G7C6 is position 154 to position 171 of SEQ ID No. 105 in the sequence listing;
the nucleotide sequence encoding CDR3 in the heavy chain protein variable region of 75G7C6 is position 295 to position 339 of SEQ ID No. 105 in the sequence listing;
the nucleotide sequence encoding CDR1 in the light chain protein variable region of 75G7C6 is position 70 to position 105 of SEQ ID No. 106 in the sequence listing;
the nucleotide sequence encoding CDR2 in the light chain protein variable region of 75G7C6 is position 151 to position 171 of SEQ ID No. 106 in the sequence listing;
the nucleotide sequence encoding CDR3 in the light chain protein variable region of 75G7C6 is position 268 to position 294 of SEQ ID No. 106 in the sequence listing;
the nucleotide sequence encoding CDR1 in the heavy chain protein variable region of 63A10A7 is position 76 to position 96 of SEQ ID No. 107 in the sequence listing;
the nucleotide sequence encoding CDR2 in the heavy chain protein variable region of 63A10A7 is position 154 to position 171 of SEQ ID No. 107 in the sequence listing;
the nucleotide sequence encoding CDR2 in the heavy chain protein variable region of c75G7C6-1 are as shown in position 154 to position 171 of SEQ ID No. 185 in the sequence listing;
the nucleotide sequence encoding CDR2 in the heavy chain protein variable region of c75G7C6-2 are as shown in position 154 to position 171 of SEQ ID No. 187 in the sequence listing.
the nucleotide sequence encoding CDR3 in the heavy chain protein variable region of 63A10A7 is position 295 to position 318 of SEQ ID No. 107 in the sequence listing;
the nucleotide sequence encoding CDR1 in the light chain protein variable region of 63A10A7 is position 70 to position 102 of SEQ ID No. 108 in the sequence listing;
the nucleotide sequence encoding CDR2 in the light chain protein variable region of 63A10A7 is position 148 to position 168 of SEQ ID No. 108 in the sequence listing;
the nucleotide sequence encoding CDR3 in the light chain protein variable region of 63A10A7 is position 265 to position 291 of SEQ ID No. 108 in the sequence listing.
The nucleotide sequence encoding CDR1 in the heavy chain protein variable region of 64F1F8 is position 76 to position 96 of SEQ ID No. 109 in the sequence listing;
the nucleotide sequence encoding CDR2 in the heavy chain protein variable region of 64F1F8 is position 154 to position 171 of SEQ ID No. 109 in the sequence listing;
the nucleotide sequence encoding CDR3 in the heavy chain protein variable region of 64F1F8 is position 295 to position 318 of SEQ ID No. 109 in the sequence listing;
the nucleotide sequence encoding CDR1 in the light chain protein variable region of 64F1F8 is position 70 to position 99 of SEQ ID No. 110 in the sequence listing;
the nucleotide sequence encoding CDR2 in the light chain protein variable region of 64F1F8 is position 145 to position 165 of SEQ ID No. 110 in the sequence listing;
the nucleotide sequence encoding CDR3 in the light chain protein variable region of 64F1F8 is position 262 to position 288 of SEQ ID No. 110 in the sequence listing.
The nucleotide sequence encoding CDR1 in the heavy chain protein variable region of 7E5-2A9 is position 76 to position 96 of SEQ ID No. 111 in the sequence listing;
the nucleotide sequence encoding CDR2 in the heavy chain protein variable region of 7E5-2A9 is position 154 to position 171 of SEQ ID No. 111 in the sequence listing;
the nucleotide sequence encoding CDR3 in the heavy chain protein variable region of 7E5-2A9 is position 295 to position 309 of SEQ ID No. 111 in the sequence listing;
the nucleotide sequence encoding CDR1 in the light chain protein variable region of 7E5-2A9 is position 70 to position 120 of SEQ ID No. 112 in the sequence listing;
the nucleotide sequence encoding CDR2 in the light chain protein variable region of 7E5-2A9 is position 166 to position 186 of SEQ ID No. 112 in the sequence listing;
the nucleotide sequence encoding CDR3 in the light chain protein variable region of 7E5-2A9 is position 283 to position 309 of SEQ ID No. 112 in the sequence listing.
The nucleotide sequence encoding CDR1 in the heavy chain protein variable region of 43H6A1 is position 127 to position 150 of SEQ ID No. 113 in the sequence listing;
the nucleotide sequence encoding CDR2 in the heavy chain protein variable region of 43H6A1 is position 208 to position 222 of SEQ ID No. 113 in the sequence listing;
the nucleotide sequence encoding CDR3 in the heavy chain protein variable region of 43H6A1 is position 346 to position 366 of SEQ ID No. 113 in the sequence listing;
the nucleotide sequence encoding CDR1 in the light chain protein variable region of 43H6A1 is position 70 to position 102 of SEQ ID No. 114 in the sequence listing;
the nucleotide sequence encoding CDR2 in the light chain protein variable region of 43H6A1 is position 148 to position 168 of SEQ ID No. 114 in the sequence listing;
the nucleotide sequence encoding CDR3 in the light chain protein variable region of 43H6A1 is position 265 to position 291 of SEQ ID No. 114 in the sequence listing;
The nucleotide sequence encoding CDR1 in the heavy chain protein variable region of 46C4A6 is position 76 to position 96 of SEQ ID No. 115 in the sequence listing;
The nucleotide sequence encoding CDR2 in the heavy chain protein variable region of 46C4A6 is position 154 to position 171 of SEQ ID No. 115 in the sequence listing;
The nucleotide sequence encoding CDR3 in the heavy chain protein variable region of 46C4A6 is position 295 to position 306 of SEQ ID No. 115 in the sequence listing;
the nucleotide sequence encoding CDR1 in the light chain protein variable region of 46C4A6 is position 70 to position 117 of SEQ ID No. 116 in the sequence listing;
the nucleotide sequence encoding CDR2 in the light chain protein variable region of 46C4A6 is position 163 to position 183 of SEQ ID No. 116 in the sequence listing;
the nucleotide sequence encoding CDR3 in the light chain protein variable region of 46C4A6 is position 280 to position 306 of SEQ ID No. 116 in the sequence listing;
the nucleotide sequence encoding CDR1 in the heavy chain protein variable region of 49G12G5 is position 76 to position 96 of SEQ ID No. 117 in the sequence listing;
the nucleotide sequence encoding CDR2 in the heavy chain protein variable region of 49G12G5 is position 154 to position 171 of SEQ ID No. 117 in the sequence listing;
the nucleotide sequence encoding CDR3 in the heavy chain protein variable region of 49G12G5 is position 295 to position 318 of SEQ ID No. 117 in the sequence listing;
the nucleotide sequence encoding CDR1 in the light chain protein variable region of 49G12G5 is position 70 to position 105 of SEQ ID No. 118 in the sequence listing;
the nucleotide sequence encoding CDR2 in the light chain protein variable region of 49G12G5 is position 151 to position 171 of SEQ ID No. 118 in the sequence listing;
the nucleotide sequence encoding CDR3 in the light chain protein variable region of 49G12G5 is position 268 to position 294 of SEQ ID No. 118 in the sequence listing.
The nucleotide sequence encoding CDR1 in the heavy chain protein variable region of 51G7C2 is position 76 to position 96 of SEQ ID No. 119 in the sequence listing;
the nucleotide sequence encoding CDR2 in the heavy chain protein variable region of 51G7C2 is position 154 to position 171 of SEQ ID No. 119 in the sequence listing;
the nucleotide sequence encoding CDR3 in the heavy chain protein variable region of 51G7C2 is position 295 to position 318 of SEQ ID No. 119 in the sequence listing;
the nucleotide sequence encoding CDR1 in the light chain protein variable region of 51G7C2 is position 70 to position 105 of SEQ ID No. 120 in the sequence listing;
the nucleotide sequence encoding CDR2 in the light chain protein variable region of 51G7C2 is position 151 to position 171 of SEQ ID No. 120 in the sequence listing;
the nucleotide sequence encoding CDR3 in the light chain protein variable region of 51G7C2 is position 268 to position 294 of SEQ ID No. 120 in the sequence listing.
The nucleotide sequence encoding CDR1 in the heavy chain protein variable region of 53D8H4 is position 76 to position 96 of SEQ ID No. 121 in the sequence listing;
the nucleotide sequence encoding CDR2 in the heavy chain protein variable region of 53D8H4 is position 154 to position 171 of SEQ ID No. 121 in the sequence listing;
the nucleotide sequence encoding CDR3 in the heavy chain protein variable region of 53D8H4 is position 295 to position 318 of SEQ ID No. 121 in the sequence listing;
the nucleotide sequence encoding CDR1 in the light chain protein variable region of 53D8H4 is position 70 to position 102 of SEQ ID No. 122 in the sequence listing;
the nucleotide sequence encoding CDR2 in the light chain protein variable region of 53D8H4 is position 148 to position 168 of SEQ ID No. 122 in the sequence listing;
the nucleotide sequence encoding CDR3 in the light chain protein variable region of 53D8H4 is position 265 to position 291 of SEQ ID No. 122 in the sequence listing.
The nucleotide sequence encoding CDR1 in the heavy chain protein variable region of 54D2G10 is position 76 to position 96 of SEQ ID No. 123 in the sequence listing;
the nucleotide sequence encoding CDR2 in the heavy chain protein variable region of 54D2G10 is position 154 to position 171 of SEQ ID No. 123 in the sequence listing;
the nucleotide sequence encoding CDR3 in the heavy chain protein variable region of 54D2G10 is position 295 to position 315 of SEQ ID No. 123 in the sequence listing;
the nucleotide sequence encoding CDR1 in the light chain protein variable region of 54D2G10 is position 70 to position 99 of SEQ ID No. 124 in the sequence listing;
the nucleotide sequence encoding CDR2 in the light chain protein variable region of 54D2G10 is position 145 to position 165 of SEQ ID No. 124 in the sequence listing;
the nucleotide sequence encoding CDR3 in the light chain protein variable region of 54D2G10 is position 262 to position 288 of SEQ ID No. 124 in the sequence listing.
The nucleotide sequence encoding CDR1 in the heavy chain protein variable region of 56F10G2 is position 76 to position 96 of SEQ ID No. 125 in the sequence listing;
the nucleotide sequence encoding CDR2 in the heavy chain protein variable region of 56F10G2 is position 154 to position 171 of SEQ ID No. 125 in the sequence listing;
The nucleotide sequence encoding CDR3 in the heavy chain protein variable region of 56F10G2 is position 295 to position 315 of SEQ ID No. 125 in the sequence listing;
the nucleotide sequence encoding CDR1 in the light chain protein variable region of 56F10G2 is position 70 to position 99 of SEQ ID No. 126 in the sequence listing;
the nucleotide sequence encoding CDR2 in the light chain protein variable region of 56F10G2 is position 145 to position 165 of SEQ ID No. 126 in the sequence listing;
the nucleotide sequence encoding CDR3 in the light chain protein variable region of 56F10G2 is position 262 to position 288 of SEQ ID No. 126 in the sequence listing.
The nucleotide sequence encoding CDR1 in the heavy chain protein variable region of 59G3F12 is position 76 to position 99 of SEQ ID No. 127 in the sequence listing;
the nucleotide sequence encoding CDR2 in the heavy chain protein variable region of 59G3F12 is position 157 to position 171 of SEQ ID No. 127 in the sequence listing;
the nucleotide sequence encoding CDR3 in the heavy chain protein variable region of 59G3F12 is position 295 to position 318 of SEQ ID No. 127 in the sequence listing;
the nucleotide sequence encoding CDR1 in the light chain protein variable region of 59G3F12 is position 70 to position 99 of SEQ ID No. 128 in the sequence listing;
the nucleotide sequence encoding CDR2 in the light chain protein variable region of 59G3F12 is position 145 to position 165 of SEQ ID No. 128 in the sequence listing;
the nucleotide sequence encoding CDR3 in the light chain protein variable region of 59G3F12 is position 262 to position 288 of SEQ ID No. 128 in the sequence listing.
The nucleotide sequence encoding CDR1 in the heavy chain protein variable region of 64B11F2 is position 76 to position 96 of SEQ ID No. 129 in the sequence listing;
the nucleotide sequence encoding CDR2 in the heavy chain protein variable region of 64B11F2 is position 154 to position 171 of SEQ ID No. 129 in the sequence listing;
the nucleotide sequence encoding CDR3 in the heavy chain protein variable region of 64B11F2 is position 295 to position 315 of SEQ ID No. 129 in the sequence listing;
the nucleotide sequence encoding CDR1 in the light chain protein variable region of 64B11F2 is position 70 to position 102 of SEQ ID No. 130 in the sequence listing;
the nucleotide sequence encoding CDR2 in the light chain protein variable region of 64B11F2 is position 148 to position 168 of SEQ ID No. 130 in the sequence listing;
the nucleotide sequence encoding CDR3 in the light chain protein variable region of 64B11F2 is position 265 to position 291 of SEQ ID No. 130 in the sequence listing.
The method for preparing the nucleic acid is a conventional preparation method in the art, and preferably, the method comprises the following steps: using a gene cloning technique to obtain a nucleic acid molecule encoding the above-mentioned protein, or using an artificial complete sequence synthesis method to obtain a nucleic acid molecule encoding the above-mentioned protein.
Those of skill in the art is aware that the base sequence encoding the amino acid sequence of the above-mentioned protein can be appropriately introduced a substitution, deletion, alteration, insertion or addition to provide a homolog of a polynucleotide. The homolog of a polynucleotide in the present invention can be prepared by performing substitution, deletion or addition on one or more bases of the gene encoding the sequence of protein within the range of maintaining antibody activity.
The present invention further provides a recombinant expression vector containing the nucleic acid,
wherein the recombinant expression vector can be obtained by conventional methods in the art, that is, it is constructed by linking the nucleic acid molecule of the present invention to various expression vectors. The expression vectors are various conventional vectors in the art, as long as it can carry the aforementioned nucleic acid molecule. The vector preferably comprises: various plasmids, cosmids, phage or viral vectors, etc.
The present invention further provides a recombinant expression transformant comprising the above-mentioned recombinant expression vector, wherein the method for preparing the recombinant expression transformant is a conventional preparation method in the art, and preferably: it is obtained by transforming the above-mentioned recombinant expression vector into host cells. The host cells are various conventional host cells in the art, as long as it meets the following requirements: the above-mentioned recombinant expression vector is stably replicated by itself, and the carried nucleic acid can be efficiently expressed. Preferably, the host cells are E. coli TG1 or BL21 cells (expressing a single chain antibody or Fab antibody), or CHO-K1 cells (expressing a full-length IgG antibody). By transforming the aforementioned recombinant expression plasmid into the host cells, a preferred recombinant expression transformant of the present invention can be obtained, wherein the transformation method is a conventional transformation method in the art, preferably a chemical transformation method, a heat shock method or an electric transformation method.
The present invention further provides a method for preparing an EGFRvIII antibody, comprising the following steps: culturing the above-mentioned recombinant expression transformant, and obtaining the EGFRvIII antibody from the culture.
The present invention further provides an immunoconjugate, comprising the above-mentioned EGFRvIII antibody covalently attached to a cytotoxic agent.
Preferably, in the immunoconjugate, the aforementioned 1 equivalent of the protein is linked to y equivalent of the cytotoxic agent via x equivalent of the linker, which has the structure of formula 1,
Ab-(L)x-(D)y formula 1
where Ab is the above-mentioned EGFRvIII antibody; L is a linker; D is a cytotoxic agent; the x represents a conventional degree of cross linking in the art, with x being a natural number, preferably 1 to 20; y is a natural number greater than, preferably an integer of 1 to 20; x and y are each independently and preferably 2w, with w being an integer of 1 to 5, further preferably 3 and 4; and the ratio of x and y is preferably 1:1.
The L is a conventional linker (or cross-linking agent or coupling agent) in the art. The L contains 2 functional groups, i.e., a group that reacts with an antibody and a group that reacts with a drug (such as aldehyde or ketone).
The drug is coupled to the aforementioned EGFRvIII antibody via a linker molecule. The L is released after entering cells, including but not limited to the following functional groups: active esters, carbonates, carbamates, imidophosphates, oximes, hydrazones, acetals, orthoesters, aminos, small peptide segments or nucleotide fragments, and for another example, maleimidocaproyl (MC), maleimidocaproyl-L-valine-L-citrulline-p-aminobenzyl alcohol (MC-VC-PAB) or 4-(N-maleimidomethyl)cyclohexane-1-carboxylic acid succinimide ester (SMCC).
Preferably, the L mainly comprises the structure of formula 2, which is the remaining part in L after the leaving group leaves:
(CO-Alk1-Sp1-Ar-Sp2-Alk2-C(Z1)=Q-Sp) formula 2
where Alk1 and Alk2 are independently bonds or branched or unbranched (C1-C10) alkylene chains; Sp1 is —S—, —O—, —CONH—, —NHCO—, —NR′—, —N(CH2CH2)2N—, or —X—Ar′—Y—(CH2)n—Z, where X, Y and Z are independently bonds, —NR′—, —S— or —O—, provided that when n=0, at least one of Y and Z must be a bond, and Ar′ is 1,2-, 1,3- or 1,4-phenylene optionally substituted with 1, 2 or 3 groups selected from (C1-C5)alkyl, (C1-C4)alkoxy, (C1-C4)thioalkoxy, halogen, nitro, —COOR′, —CONHR′, —(CH2)nCOOR′, S(CH2)nCOOR′, —O(CH2)nCONHR′ or —S(CH2)nCONHR′, with n being an integer of 0 to 5, provided that when Alk1 is a bond, Sp1 is a bond; R′ is a branched or unbranched (C1-C5) chain optionally substituted with 1 or 2 groups selected from —OH, (C1-C4)alkoxy, (C1-C4)thioalkoxy, halogen, nitro, (C1-C3)dialkylamino, or (C1-C3)trialkylammonium-A, where A is a pharmaceutically acceptable anion to accomplish the formation of the salt; Ar is 1,2-, 1,3- or 1,4-phenylene optionally substituted with 1, 2 or 3 groups selected from (C1-C6)alkyl, (C1-C5)alkoxy, (C1-C4)thioalkoxy, halogen, nitro, —COOR′, —CONHR′, —O(CH2)nCOOR′, —S(CH2)nCOOR′, —O(CH2)nCONHR″ or —S(CH2)nCONHR′, where n and R′ are as defined above, or Ar is 1,2-, 1,3-, 1,4-, 1,5-, 1,6-, 1,7-, 1,8-, 2,3-, 2,6- or 2,7-naphthylene, wherein naphthylene or phenothiazine each is optionally substituted with 1, 2, 3 or 4 groups selected from (C1-C6)alkyl, (C1-C5)alkoxy, (C1-C4)thioalkoxy, halogen, nitro, —COOR′, —CONHR′, —O(CH2)nCOOR′, —S(CH2)nCOOR′, or —S(CH2)nCONHR′, where n and R′ are as defined above, provided that when Ar is phenothiazine, Sp1 is a bond linked to nitrogen only;
Sp2 is a bond, —S— or —O—, provided that when Alk2 is a bond, Sp2 is a bond;
Z1 is H, (C1-C5)alkyl, or phenyl optionally substituted with 1, 2, or 3 groups selected from (C1-C5)alkyl, (C1-C5)alkoxy, (C1-C4)thioalkoxy, halogen, nitro, —COOR′, —CONHR′, —O(CH2)nCOOR′, —S(CH2)nCOOR′, —O(CH2)nCONHR′ or —S(CH2)nCONHR′, where n and R′ are as defined above;
Sp is a linear or branched divalent or trivalent (C1-C18) group, divalent or trivalent aryl or heteroaryl group, divalent or trivalent (C3-C18) cycloalkyl or heterocycloalkyl group, divalent or trivalent aryl or heteroaryl-aryl(C1-C18) group, divalent or trivalent cycloalkyl or heterocycloalkyl-alkyl(C1-C18) group, or divalent or trivalent (C2-C18) unsaturated alkyl group, wherein the heteroaryl is preferably furyl, thienyl, N-methylpyrrolyl, pyridyl, N-methylimidazolyl, oxazolyl, pyrimidinyl, quinolinyl, isoquinolinyl, N-methylcarbazolyl, aminocoumarin, or phenazinyl, and wherein if Sp is a trivalent group, the Sp may also be optionally substituted with a lower (C1-C5)dialkylamino group, lower (C1-C5)alkoxy group, hydroxyl, or lower (C1-C5)alkylthio group; and Q is ═NHNCO—, ═NHNCS—, ═NHNCONH—, ═NHNCSNH— or ═NHO—.
Preferably, Alk1 is a branched or unbranched (C1-C5)alkylene chain, and Sp1 is a bond, —S—, —O—, —CONH—, —NHCO— or —NR′, where R′ is as defined above, provided that when Alk′ is a bond, Sp′ is a bond;
Ar is 1,2-, 1,3- or 1,4-phenylene optionally substituted with 1, 2 or 3 groups selected from (C1-C6)alkyl, (C1-C5)alkoxy, (C1-C4)thioalkoxy, halogen, nitro, —COOR′, —CONHR′, —O(CH2)nCOOR′, —S(CH2)nCOOR′, —O(CH2)nCONHR′ or —S(CH2)nCONHR′, where n and R′ are as defined above, or Ar is 1,2-, 1,3-, 1,4-, 1,5-, 1,6-, 1,7-, 1,8-, 2,3-, 2,6- or 2,7-naphthylene each optionally substituted with 1, 2, 3 or 4 groups selected from (C1-C6)alkyl, (C1-C5)alkoxy, (C1-C4)thioalkoxy, halogen, nitro, —COOR′, —CONHR′, —O(CH2)nCOOR′, —S(CH2)nCOOR′, —O(CH2)nCONHR′ or —S(CH2)nCONHR′.
Z1 is (C1-C5)alkyl, or phenyl optionally substituted with 1, 2, or 3 groups selected from (C1-C5)alkyl, (C1-C4)alkoxy, (C1-C4)thioalkoxy, halogen, nitro, —COOR′, —CONHR′, —O(CH2)nCOOR′, —S(CH2)nCOOR′, —O(CH2)nCONHR′ or —S(CH2)nCONHR′; Alk2 and Sp2 are both bonds; and Sp and Q are only as defined above. The aforementioned bonds have the meaning of covalent bonds.
The L is maleimidocaproyl (MC).
The D may be a conventional cytotoxic agent in the art, preferably selected from a cytotoxin, chemotherapeutic agent, radioisotope, therapeutic nucleic acid, immunomodulator, anti-angiogenic agent, anti-proliferative and pro-apoptotic agent or cytolytic enzyme.
The cytotoxin is a conventional cytotoxin in the art, and generally refers to an active agent that inhibits or prevents cell functions and/or causes cell destruction. Preferably, the cytotoxin is selected from an antibiotic, inhibitor of tubulin polymerization, alkylating agent, protein synthesis inhibitor, protein kinase inhibitor, phosphatase inhibitor, topoisomerase inhibitor, protein kinase, phosphatase, topoisomerase or cyclin. More preferably, the cytotoxin is selected from doxorubicin, daunorubicin, idarubicin, aclarubicin, zorubicin, mitoxantrone, epirubicin, carubicin, nogalamycin, menogaril, pirarubicin, valrubicin, cytarabine, gemcitabine, trifluridine, ancitabine, enocitabine, azacitidine, doxifluridine, pentostatin, broxuridine, capecitabine, cladribine, decitabine, floxuridine, fludarabine, gougerotin, puromycin, tegafur, tiazofurin, adriamycin, cisplatin, carboplatin, cyclophosphamide, dacarbazine, vinblastine, vincristine, bleomycin, nitrogen mustard, prednisone, methyhydrazine, methotrexate, fluorouracil, etoposide, taxol, taxol analog, platinum (such as cisplatin and carboplatin), mitomycin, thiotepa, taxane, daunorubicin, actinomycin, anthramycin, azaserine, tamoxifen, dolastatin, auristatin and a derivative thereof, hemiasterlin, esperamicin or maytansine compounds, and further more preferably selected from monomethylauristatin E (MMAE), monomethylauristatin F (MMAF) or N2′-deacetyl-N2′-3-mercapto-1 oxopropyl-maytansine (DM1). Most preferably, the cytotoxin is monomethylauristatin F (MMAF) or monomethylauristatin E (MMAE).
The chemotherapeutic agent is a conventional chemotherapeutic agent in the art, preferably selected from an alkylating agent, alkyl sulfonate chemotherapeutic agent, aziridine chemotherapeutic agent, vinylamide and methylmelamine chemotherapeutic agents, nitrogen mustard, nitrourea chemotherapeutic agent, antibiotic, antimetabolite, folic acid chemotherapeutic agent, purine analog, pyrimidine analog, androgen, anti-adrenaline, folic acid supplement, maytansinol, polysaccharide complex, taxane, platinum analog or retinoid, or a pharmaceutically acceptable salt, acid and derivative thereof.
The alkylating agent is a conventional alkylating agent in the art, preferably selected from thiotepa or cyclophosphamide. The alkyl sulfonate chemotherapeutic agent is a conventional alkyl sulfonate chemotherapeutic agent in the art, preferably selected from busulfan, improsulfan or piposulfan. The aziridine chemotherapeutic agent is a conventional aziridine chemotherapeutic agent in the art, preferably selected from aziridines such as carboquone, meturedepa or uredepa. The vinylamide and methylmelamine chemotherapeutic agents are conventional vinylamide and methylmelamine chemotherapeutic agents in the art, preferably selected from hexamethylmelamine, triethylenemelamine, triethylenephosphoramide, triethylenethiophosphoramide or trimethylolmelamine. The nitrogen mustard is a conventional nitrogen mustard in the art, preferably selected from chlorambucil, chlornaphazine, estramustine, ifosfamide, nitrogen mustard, nitromin hydrochloride, phenylalanine mustard, novembichin, phenesterine, prednimustine, trofosfamide or uramustine. The nitrourea chemotherapeutic agent is a conventional nitrourea chemotherapeutic agent in the art, preferably selected from carmustine, chlorozotocin, fotemustine, lomustine, nimustine or ranimustine. The antibiotic is a conventional antibiotic in the art, preferably selected from clarithromycin, actinomycin, anthramycin, azaserine, bleomycin, actinomycin c, calicheamicin, carubicin, carminomycin, carcinophylin, chromomycin, dactinomycin, daunorubicin, detorubicin, 6-diazo-5-oxo-L-norleucine, doxorubicin, esorubicin, idarubicin, narbomycin, mitomycin, mycophenolic acid, nogalamycin, olivomycin, peplomycin, porfiromycin, puromycin, triferricdoxorubicin, rodorubicin, streptonigrin, streptozocin, tuberculocidin, ubenimex, zinostatin or zorubicin. The antimetabolite is a conventional antimetabolite in the art, preferably selected from methotrexate or 5-fluorouracil (5-FU). The folic acid chemotherapeutic agent is a conventional folic acid chemotherapeutic agent in the art, preferably selected from denopterin, pteropterin or trimetrexate. The purine analog is a conventional purine analog in the art, preferably selected from fludarabine, 6-mercaptopurine, thiamiprine or thioguanine. The pyrimidine analog is a conventional pyrimidine analog in the art, preferably selected from ancitabine, azacitidine, 6-azauridine, carmofur, cytarabine, dideoxyuridine, doxifluridine, enocitabine, floxuridine or 5-EU. The androgen is a conventional androgen in the art, preferably selected from calusterone, methyl androstanone propionate, epithioandrostanol, mepitiostane or testolactone. The anti-adrenaline is a conventional anti-adrenaline in the art, preferably selected from aminoglutethimide, mitotane or trilostane. The folic acid supplement is a conventional folic acid supplement in the art, preferably selected from folinic acid, aceglatone, aldophosphamide glycoside, aminolevulinic acid, amsacrine, atrimustine, bisantrene, edatrexate, defosfamide, demecolcine, diaziquone, eflornithine, elliptinium acetate, epothilone, etoglucid, gallium nitrate, hydroxyurea, lentinan or lonidamine. The maytansinol is a conventional maytansinol in the art, preferably selected from maytansine, ansamitocin, mitoguazone, mitoxantrone, mopidamol, nitracrine, pentostatin, phenamet, pirarubicin, losoxantrone, podophyllic acid, 2-ethyl hydrazide or procarbazine. The polysaccharide complex is a conventional polysaccharide complex in the art, preferably selected from razoxane, rhizomycin, sizofiran, spirogermanium, tenuazonic acid, triaziquone, 2,2′,2″-trichlorotriethylamine, trichothecene mycotoxins, urethane, vindesine, dacarbazine, mannomustine, dibromannitol, dibromodulcitol, pipobroman, gacytosine, cytarabine, cyclophosphamide or thiotepa. More preferably, the polysaccharide complex is selected from T-2 toxin, verrucarine A, bacillosporin A or anguidine. The taxane is a conventional taxane in the art, preferably selected from taxol, unhydrogenated castor oil, an albumin engineered nanoparticle preparation (American Pharmaceutical Partners, Schaumberg, Ill.) of taxol, docetaxel, chlorambucil, gemcitabine, 6-thioguanine, mercaptopurine or methotrexate. The platinum analog is a conventional platinum analog in the art, preferably selected from cisplatin, carboplatin, vinblastine, etoposide, ifosfamide, mitoxantrone, vincristine, novantrone, teniposide, edatrexate, daunorubicin, aminopterin, capecitabine ibandronate, CPT-11, topoisomerase inhibitor RFS 2000 or difluoromethylornithine. The retinoid is a retinoid in the art, preferably retinoic acid.
The radioisotope is a conventional radioisotope in the art, and preferably it directly binds to the above-mentioned EGFRvIII antibody, or binds to the above-mentioned EGFRvIII antibody through a chelating agent. More preferably, it directly binds to the cysteine residue of the EGFRvIII antibody. Preferably, the radioisotope is selected from an α-emitter, β-emitter and Auger electron suitable for radiotherapy, and a positron emitter or γ-emitter suitable for diagnosis. More preferably, the radioisotope is selected from 18fluorine, 64copper, 65copper, 67gallium, 68gallium, 77bromine, 80mbromine, 95 ruthenium, 97ruthenium, 103ruthenium, 105 ruthenium, 99mtechnetium, 107mercury, 203mercury, 123iodine, 124iodine, 125iodine, 126iodine, 131iodine, 133iodine, 111indium, 113indium, 90mrhenium, 105 rhenium, 101 rhenium, 186rhenium, 188rhenium, 121mtellurium, 99technetium, 122mtellurium, 125mtellurium, 165thulium, 167thulium, 168thulium, 90yttrium, 213bismuth, 213lead or 225actinium, or nitrides or oxides derived therefrom.
The therapeutic nucleic acid is a conventional nucleic acid in the art, and preferably is a gene encoding an immunomodulator, anti-angiogenic agent, anti-proliferative agent or pro-apoptotic agent. The therapeutic agent includes the therapeutic agent described herein, a derivative thereof, and a pharmaceutically acceptable salt, acid and derivative of the therapeutic agent.
The immunomodulator is a conventional immunomodulator in the art, which is an agent that triggers an immune response, including a humoral immune response (e.g., the production of antigen-specific antibodies) and a cell-mediated immune response (e.g., lymphocyte proliferation). Preferably, the immunomodulator is selected from a cytokine, growth factor, hormone, anti-hormonal drug, immunosuppressive agent or corticosteroid. The cytokine is a conventional cytokine in the art, preferably selected from xanthine, interleukin or interferon. The growth factor is a conventional growth factor in the art, preferably selected from TNF, CSF, GM-CSF or G-CSF. The hormone is a conventional hormone in the art, preferably selected from estrogen, androgen or progestogen. More preferably, the estrogen is diethylstilbestrol or estradiol. More preferably, the androgen is testosterone or fluoxymesterone. More preferably, the progestogen is megestrol acetate or medroxyprogesterone acetate. The corticosteroid is a conventional corticosteroid in the art, preferably selected from prednisone, dexamethasone or hydrocortisone. The anti-hormonal drug is a conventional anti-hormonal drug in the art, which can block the effects of hormones on tumors, inhibit the production of cytokines, down-regulate the expression of autoantigens, or mask the immunosuppressive agent of MHC antigens. Preferably, the anti-hormonal drug is selected from an anti-estrogen drug, anti-androgen drug or anti-adrenaline drug. More preferably, the anti-estrogen drug is selected from tamoxifen, raloxifene, aromatase inhibitory 4 (5)-imidazoles, 4-hydroxy tamoxifen, trioxifene or toremifene. The anti-androgen drug is selected from flutamide, nilutamide, bicalutamide, leuprolide or goserelin. The immunosuppressive agent is a conventional immunosuppressive agent in the art, preferably selected from 2-amino-6 aryl-5 substituted pyrimidines, azathioprine, cyclophosphamide, bromocriptine, danazol, dapsone, glutaraldehyde, anti-idiotypic antibodies against MHC antigens and MHC fragments, cyclosporin A, steroids for example glucocorticosteroids, streptokinase, TGFb, rapamycin, T cell receptors, T cell receptor fragments, cytokine receptor antagonists or T cell receptor antibodies. More preferably, the cytokine receptor antagonists are selected from an anti-interferon antibody, anti-IL10 antibody, anti-TNFa antibody or anti-IL2 antibody.
The anti-angiogenic agent is a conventional anti-angiogenic agent in the art, preferably selected from a farnesyl transferase inhibitor, COX-2 inhibitor, VEGF inhibitor, bFGF inhibitor, steroid sulfatase inhibitor, interleukin-24, thrombospondin, metallospondin protein, type I interferon, interleukin 12, protamine, angiostatin, laminin, endostatin or prolactin fragment. More preferably, the anti-angiogenic agent is 2-methoxyestradiol disulfmate (2-MeOE2bisMATE).
The anti-proliferative and pro-apoptotic agent is a conventional anti-proliferative and pro-apoptotic agent in the art, preferably selected from a PPAR-γ activator, retinoid, triterpenoid compound, EGF receptor inhibitor, telomere terminal transferase inhibitor, iron chelator, apoptin, Bcl-2 and Bcl-X (L) inhibitors, TNF-α/FAS ligand/TNF-related apoptosis-inducing ligand and a signal transduction activator thereof, or PI3K-Akt survival pathway signaling inhibitor. The PPAR-γ activator is a conventional PPAR-γ activator in the art, and preferably is cyclopentenone prostaglandins (cyPGs). The triterpenoid compound is a conventional triterpenoid compound in the art, preferably selected from cycloartane, lupinane, ursane, oleanane, friedelane, dammarane, cucurbitacine, limonin analog or other triterpenoid compounds. The EGF receptor inhibitor is a conventional EGF receptor inhibitor in the art, preferably selected from HER4, rapamycin or 1,25-dihydroxycholecalciferol (vitamin D). The iron chelator is a conventional iron chelator in the art, and preferably is 3-aminopyridine-2-carbaldehyde thiosemicarbazone. The apoptin is a conventional apoptin in the art, and preferably is a viral protein 3-VP3 of the chicken anemia virus. The PI3K-Akt survival pathway signaling inhibitor is a conventional PI3K-Akt survival pathway signaling inhibitor in the art, and preferably is UCN-01 or geldanamycin.
The cytolytic enzyme is a conventional cytolytic enzyme in the art, and preferably is RNase.
Preferably, in the present invention, x=y=n in formula 1; the D is a tubulin synthetase inhibitor—monomethylauristatin F (MMAF), and the linker L is maleimidocaproyl (MC), and the structure of the immunoconjugate is as shown in formula 3-1 or 3-2,
where mAb is the above-mentioned EGFRvIII antibody; where n is a natural number, preferably an integer of 1 to 20, and more preferably 2w, with w being an integer of 1 to 5, further preferably 3 and 4;
in the formula 3-2, m is 1 to 10, preferably m is 5, and L is maleimidocaproyl-L-valine-L-citrulline-p-aminobenzyl alcohol; D is monomethylauristatin E (MMAE); where n is a natural number, preferably an integer of 1 to 20, and more preferably 2w, with w being an integer of 1 to 5, further preferably 3 and 4.
The method for preparing the immunoconjugate is conventional in the art, and preferably the preparation method described in Doronina, 2006, Bioconjugate Chem. 17, 114-124 is used. Preferably, by the preparation method, an immunoconjugate with a minimum low coupling fraction (LCF) of less than 10% is produced.
In a preferred embodiment of the present invention, the preparation method comprises the following steps: the above-mentioned EGFRvIII antibody is dialyzed with a sodium borate buffer at pH 6.5 to 8.5, and then tris(2-carboxyethyl)phosphine (TCEP) is added thereto, with the molar ratio of the TCEP to the above-mentioned EGFRvIII antibody being 2 to 10; reduction is performed for 2 to 4 hours at room temperature, excess TCEPs are removed through a G25 desalting filler; and a certain proportion of MC-MMAF (with drug/antibody ratio being 5-20) is added and reacted for 4 hours. Then cysteine is added to neutralize excess drugs, and excess small molecules are removed through G25. A purified antibody-drug conjugate is afforded (for the coupling method, see Doronina, 2006, Bioconjugate Chem. 17, 114-124).
The immunoconjugate may exist in any physical form known in the art, and preferably is a clear solution.
The present invention further provides a pharmaceutical composition, comprising the above-mentioned EGFRvIII antibody or immunoconjugate, and a pharmaceutically acceptable carrier.
The pharmaceutically acceptable carrier is a conventional carrier in the art, which may be any suitable physiologically or pharmaceutically acceptable pharmaceutical auxiliary material. The pharmaceutical auxiliary material is a conventional pharmaceutical auxiliary material in the art, preferably comprising a pharmaceutically acceptable excipient, filler, diluent, etc. More preferably, the pharmaceutical composition comprises 0.01% to 99.99% of the above-mentioned EGFRvIII antibody or the above-mentioned immunoconjugate, and 0.01% to 99.99% of the pharmaceutically acceptable carrier, wherein the percentage is the mass percentage accounting for the pharmaceutical composition.
The administration route of the pharmaceutical composition of the present invention is preferably parenteral administration, injection administration or oral administration. The injection administration preferably includes intravenous injection, intramuscular injection, intraperitoneal injection, intradermal injection, or subcutaneous injection. The pharmaceutical composition is in various conventional dosage forms in the art, preferably in the form of solid, semisolid or liquid, namely, the pharmaceutical composition may be an aqueous solution, non-aqueous solution or suspension, and more preferably in the form of a tablet, capsule, granule, injection, infusion, etc. More preferably, the pharmaceutical composition is administered intravascularly, subcutaneously, intraperitoneally, or intramuscularly. Preferably, the pharmaceutical composition may also be administered as an aerosol or crude spray, i.e., administered nasally; or administered intrathecally, intramedullarily, or intraventricularly. More preferably, the pharmaceutical composition may further be administered transdermally, percutaneously, topically, enterally, intravaginally, sublingually or rectally.
The administration dose level of the pharmaceutical composition of the present invention can be adjusted according to the amount of the composition achieving the desired diagnosis or treatment result. The administration schedule may also be a single injection or multiple injections, or be adjusted. The selected dosage level and schedule are reasonably adjusted depending on various factors including the activity and stability (i.e., half-life) of the pharmaceutical composition, formulation, administration route, combination with other drugs or therapies, diseases or conditions to be detected and/or treated, and health status and previous medical history of a subject to be treated.
The therapeutically effective dose for the pharmaceutical composition of the present invention can be estimated initially in cell culture experiments or animal models such as rodents, rabbits, dogs, pigs, and/or primates. Animal models may also be used to determine the appropriate concentration range and route of administration. The models may then be used to determine useful doses and routes for administration in humans. In general, the determination and adjustments of the effective amount or dosage administered and the evaluation of when and how such adjustments are made are known to those of skill in the art.
For combination therapies, the above-mentioned EGFRvIII antibody, the above-mentioned immunoconjugate, and/or additional therapeutic or diagnostic agents can each be used as a single agent within any time range suitable for performing the intended treatment or diagnosis. Therefore, these single agents can be administered substantially simultaneously (i.e., administered as a single formulation or administered within a few minutes or hours) or administered continuously in sequence. For example, these single agents may be administered within one year, or within 10, 8, 6, 4, or 2 months, or within 4, 3, 2, or 1 week, or within 5, 4, 3, 2, or 1 day.
For additional guidance on formulation, dosage, administration schedule, and measurable treatment results, see Berkow et al., (2000) The Merck Manual of Medical Information and Merck & Co. Inc., Whitehouse Station, N.J.; Ebadi (1998) CRC Desk Reference of Clinical Pharmacology and other works.
The present invention further provides the use of the above-mentioned antibody, the above-mentioned conjugate or the above-mentioned pharmaceutical composition in the preparation of an anti-tumor drug.
The present invention further provides a method for detecting a cell overexpressing an EGFRvIII protein, comprising the following steps: contacting the above-mentioned EGFRvIII antibody with a sample to be detected in vitro, then detecting the binding of the EGFRvIII antibody to the sample to be detected.
The present invention further provides a composition for detecting a cell overexpressing an EGFRvIII protein, comprising the above-mentioned EGFRvIII antibody as an active ingredient.
The present invention further provides the use of the above-mentioned antibody, immunoconjugate or pharmaceutical composition in the preparation of a drug for preventing or treating a disease related to abnormal expression or function of the EGFRvIII; preferably, the disease related to abnormal expression or function of the EGFRvIII is a tumor, wherein the tumor is preferably bladder cancer, brain cancer, head and neck cancer, pancreatic cancer, lung cancer, breast cancer, ovarian cancer, colon cancer, prostate cancer, or renal cancer.
On the basis of meeting common knowledge in the art, the above-mentioned various preferred conditions can be combined in any form, such that various preferred examples of the present invention are obtained.
Reagents and raw materials used in the present invention are all commercially available.
The positive effect of the present invention lies in that the protein of the present invention is an EGFRvIII antibody, which has a high affinity with an EGFRvIII protein and can bind to the extracellular region of the EGFRvIII protein receptor at the protein level and the cellular level. The antibody cross-linking drug obtained by coupling the EGFRvIII antibody with MC-MMAF and other small-molecule compounds can effectively perform a cytotoxic killing effect on EGFRvIII positive cells. The EGFRvIII antibody brings the small-molecule toxin (e.g., MMAF) into cells via endocytosis, and degrades and releases small-molecule compounds in the cells, thereby playing the role of cytotoxicity. Therefore, the EGFRvIII antibody can be used for preparing an antibody cross-linking drug and effectively killing tumor cells, or can be used in the preparation of a drug for treating tumors.
The following examples further illustrate the present invention, but the present invention is not limited thereto. In experimental methods in the following examples where no specific conditions are indicated, choices can be made according to conventional methods and conditions or commodity instructions.
The following examples further illustrate the present invention, but the present invention is not limited thereto. In experimental methods in the following examples where no specific conditions are indicated, choices can be made according to conventional methods and conditions or commodity instructions.
The room temperature described in the examples is a conventional room temperature in the art, generally is 10° C. to 30° C.
The nucleotide sequences containing the amino acid sequences (NCBI: NP 005219.2, with deletion of amino acids at positions 30-297) encoding the extracellular region of the human EGFRvIII protein are cloned into a pCpC vector (purchased from Invitrogen, V044-50) with a human IgG Fc fragment (hFc); and plasmids are prepared according to established standard molecular biology methods, and for specific methods, please see Sambrook, J., Fritsch, E. F., and Maniatis, T. (1989). Molecular Cloning: A Laboratory Manual, Second Edition (Plainview, N.Y.: Cold Spring Harbor Laboratory Press). HEK293 cells (purchased from Invitrogen) are subjected to transient transfection (PEI, Polysciences) and an expanded culture is performed using FreeStyle™ 293 (Invitrogen) at 37° C. After 4 days, the cell culture broth is collected, and the cell components are removed by centrifugation to obtain a culture supernatant containing the extracellular region of the EGFRvIII protein. The culture supernatant is loaded onto a protein A affinity chromatography column (Mabselect Sure, purchased from GE Healthcare), and the change in ultraviolet absorption values (A280 nm) is monitored with an ultraviolet (UV) detector. After loading, the protein A affinity chromatography column is washed with PBS phosphate buffer (pH 7.2) until the UV absorption value returns back to the baseline, and then eluted with 0.1 M glycine hydrochloric acid (pH 2.5) to collect the hFc-tagged EGFRvIII protein (EGFRvIII-hFc) eluted from the protein A affinity chromatography column, which is dialyzed with the PBS phosphate buffer (pH 7.2) in the refrigerator at 4° C. overnight. The dialyzed protein is sterile filtered through 0.22 microns, and then packaged and stored at −80° C. to obtain a purified immunogen A.
The nucleotide sequence encoding the full-length amino acid sequence of the human EGFRvIII (NCBI: NP_005219.2, with deletion of amino acids at positions 30-297) and the nucleotide sequence encoding the full-length amino acid sequence of the human EGFR (NCBI: NP_005219.2) are cloned into a pIRES vector (purchased from Clontech), and plasmids are prepared. The HEK293 cell line, U87MG cell line and CHO-K1 cell line (all purchased from Invitrogen) are subjected to plasmid transfection (PEI, purchased from Polysciences), and then selectively cultured for 2 weeks in a DMEM medium containing 0.5 μg/ml of 10% (w/w) fetal bovine serum. The culture is subcloned in a 96-well culture plate by using a limiting dilution method, and cultured at 37° C., 5% (v/v) CO2; after about 2 weeks, a part of monoclonal wells are selected and amplified into a 6-well plate. The amplified clones are screened by a flow cytometry analysis method using a known EGFRvIII antibody (purchased from Absoluteantibody, #Ab00184-1.1). The monoclonal cell line with better growth vigour and higher fluorescence intensity is selected for further expanded culture and same was cryopreserved in liquid nitrogen to obtain an immunogen B. The specific selection results are as shown in table 3 and
A. 6 to 8 week-old BALB/cAnNCrl mice or SJL/JorllcoCrl mice (purchased from Shanghai SLAC Laboratory Animal Co., Ltd.) are used in the immunization with immunogen A, and the mice are raised under SPF conditions. For the primary immunization, an immunogen A protein is emulsified with a Freund's complete adjuvant and each mouse is intraperitoneally injected at 0.25 ml, namely, each mouse is injected with 50 μg of the immunogen A protein. For the booster immunization, an immunogen A protein is emulsified with a Freund's incomplete adjuvant and each mouse is intraperitoneally injected at 0.25 ml, namely, each mouse is injected with 50 μg of the immunogen A protein. The interval between the primary immunization and the first booster immunization is 2 weeks, and the interval between subsequent booster immunizations is 3 weeks. Blood is collected 1 week after each booster immunization, and the antibody titer and specificity of immunogen A in the serum are detected by ELISA and FACS, with the results as shown in
B. 6 to 8 week-old BALB/cAnNCrl mice or SJL/JorllcoCrl mice (purchased from Shanghai SLAC Laboratory Animal Co., Ltd.) are used in the immunization with immunogen B, and the mice are raised under SPF conditions. The HEK293 cell line is transfected with the pIRES plasmid containing the nucleotide sequence encoding the full-length amino acid sequence of human EGFRvIII [see step (II) in example 1] to obtain an HEK293 stable cell line 293F-hEGFRvIII 1C10 containing human EGFRvIII (using X-treme GENE HP DNA Transfection Reagent, which is purchased from Roche, Cat #06 366 236 001, and operating according to the instructions); the cell line is subjected to an expanded culture to 90% confluence in a T-75 cell culture flask; the medium is sucked out, and the cells are washed twice with a DMEM basal medium (purchased from Invitrogen), and then treated with an enzyme-free cell dissociation solution (purchased from Invitrogen) at 37° C. until cells can be fallen off from the wall of the culture dish, and the cells are collected. The cells are washed twice with the DMEM basal medium and are counted, and then the cells are diluted with a phosphate buffer (pH 7.2) to 2×107 cells/ml. Each mouse is injected intraperitoneally with 0.5 ml of the cell suspension at each immunization. The interval between the first immunization and the second immunization is 2 weeks, and the interval between subsequent immunizations is 3 weeks. In addition to the first immunization, blood is collected 1 week after each immunization, and the antibody titer and specificity in the serum are detected by FACS. After the second booster immunization, the serum antibody titer reaches at least 1:1000 detected by FACS.
Before the completion of steps A and B, each selected mouse is the last time immunized intraperitoneally with 100 μg of the purified immunogen A (mice immunized with the immunogen A) or immunogen B (mice immunized with the immunogen B); after 5 days, the mice are sacrificed and spleen cells are collected. NH4OH is added to a final concentration of 1% (w/w) and red blood cells incorporated in the spleen cells are lysed to obtain a spleen cell suspension. The cells are washed with DMEM basic medium and centrifuged at 1000 rpm for 3 times, then are mixed at a 5:1 ratio of living cell number with mouse myeloma cells SP2/0 (purchased from ATCC), then are subjected to cell fusion by using an efficient electrical fusion method (see METHODS IN ENZYMOLOGY, VOL. 220). The fused cells are diluted into a DMEM medium containing 20% fetal bovine serum and 1×HAT, with the percentage being the mass percentage. Then the cells at 1×105/200 μL/well are added to a 96-well cell culture plate and placed in an incubator with 5% CO2 and 37° C., with the percentage being the volume percentage. After 14 days, the cell fusion plate supernatant is screened by ELISA and Acumen (a microplate cell detection method), and the positive clones with OD450 nm>1.0 in ELISA and MFI value >100 in Acumen are amplified to a 24-well plate, and an expanded culture is performed in DMEM (invitrogen) containing 10% (w/w) HT fetal bovine serum under the conditions of 37° C. and 5% (v/v) CO2. After culturing for 3 days, the culture broth from the expanded culture in the 24-well plate is centrifuged; the supernatant is collected and analyzed for antibody subtypes; and the binding activity in the EGFRvIII protein and EGFRvIII positive cells (for the detection method of the binding activity, please refer to example 3A and example 3B respectively), and the anti-mouse antibody-MMAF indirect cytotoxic killing experiment (for detection method of the indirect cytotoxic killing activity, please refer to example 4) are determined by ELISA and FACS.
According to the screening results of the 24-well plate, hybridoma cells with OD450 nm>1.0 in the ELISA experiment, with the MFI value >50 in the FACS experiment, and with the killing rate of the hybridoma cell culture supernatant for the EGFRvIII-positive cells reaching 50% in the indirect cytotoxicity killing experiment are selected as eligible positive clones, and eligible hybridoma cells are subcloned in the 96-well plate by a limiting dilution method and cultured in a DMEM medium (purchased from invitrogen) containing 10% (w/w) FBS under conditions of 37° C. and 5% (v/v) CO2. 10 days after subcloning, preliminary screening is performed by ELISA and Acumen, and a single positive monoclone is selected and amplified to a 24-well plate for further cultivation. 3 days later, the FACS is used to determine the positive antigen binding, and the anti-mouse antibody-MMAF indirect cytotoxicity killing experiment is used to evaluate the biological activity (evaluation criteria: OD450 nm>1.0 in the ELISA experiment, the MFI value >50 in the FACS experiment, and the killing rate of the hybridoma cell culture supernatant for the EGFRvIII-positive cells reaching 50% in the indirect cytotoxicity killing experiment).
Based on the detection results of the samples in the 24-well plate, the best clones are selected, subjected to an expanded culture in the DMEM medium (purchased from invitrogen) containing 10% (w/w) FBS under conditions of 37° C. and 5% (v/v) CO2, and then cryopreserved in liquid nitrogen to obtain the hybridoma cells of the present invention, which can be used for subsequent antibody production and purification.
The antibody produced by the hybridoma cells has a relatively low concentration which is only about 1 to 10 μg/ml, and the concentration varied greatly. Moreover, various proteins produced by cell culture in the culture medium and fetal bovine serum components contained in the culture medium interfered with many biological activity analysis methods in varying degrees, and therefore, it is necessary to perform a small-scale (1 to 5 mg) antibody production and purification.
The hybridoma cells obtained in example 1 are inoculated into a T-75 cell culture flask and acclimated and passaged for 3 generations by using a production medium (Hybridoma serum free medium, purchased from Invitrogen). When the cells have a good growing status, they are inoculated into a roller bottle for tissue culture. 200 ml of the production medium is added to each 2 liter of the culture roller bottle, with the cell density inoculated being 1.0×105/ml. The bottle cap is screwed down, and the roller bottle is placed on a rotary machine in a 37° C. incubator at a speed of 3 rpm/min. After a continuous rotating culture for 14 days, the cell culture broth is collected and filtered to remove cells, and the culture supernatant is filtered with a 0.45 micron filter membrane until it is clear. The clear culture supernatant may be purified immediately or cryopreserved at −30° C.
Monoclonal antibodies in the clear culture supernatant (200 mL) of hybridoma cells are purified by using a 2 mL protein A column (purchased from GE Healthcare). The protein G column is first equilibrated with an equilibration buffer (PBS phosphate buffer, pH 7.4), and then the clear culture supernatant is loaded onto the protein A column, with the flow rate controlled at 3 mL/min. After loading, the protein G column is washed with the equilibration buffer at a volume of 4 column bed volumes of the protein G column. The EGFRvIII antibody bound to the protein A column is eluted with an eluent (0.1 M sodium citrate buffer, pH 3.5), and the elution status is monitored with an ultraviolet detector (A280 ultraviolet absorption peak). The eluted antibodies are collected, and 10% 1.0 M Tris-HCl buffer is added to neutralize the pH, with the percentage being the volume percentage, and then immediately dialyzed with the PBS phosphate buffer overnight. The next day, the dialysate is changed once and is further dialyzed for 3 hours. The dialyzed EGFRvIII antibody is collected, aseptically filtered with a 0.22 micron filter, and stored aseptically to obtain a purified EGFRvIII antibody.
The purified EGFRvIII antibody is detected and analyzed for protein concentration (A280/1.4), purity, endotoxin (Lonza kit), etc. The results as shown in table 9 indicate that the endotoxin concentration of the final antibody product is within 1.0 EU/mg.
The purified EGFRvIII antibody obtained in example 2 is reacted with a human EGFRvIII-hFc protein.
The purified immunogen A (EGFRvIII-hFc) obtained in example 1 (for the preparation method, see step (I) in example 1) is diluted with PBS to a final concentration of 1.0 μg/mL, and then added to a 96-well ELISA plate at 100 μl/well. The plate is sealed with a plastic film and incubated at 4° C. overnight. The next day, the plate is washed twice with a plate washing solution [PBS+0.01% (v/v) Tween20], and added with a blocking solution [PBS+0.01% (v/v) Tween20+1% (w/w) BSA] for blocking 2 hours at room temperature. The blocking solution is discarded, and the purified EGFRvIII antibody obtained in example 2 is added at 100 μl/well. The plate is incubated for 2 hours at 37° C., and then washed 3 times with the plate washing solution [PBS+0.01% (v/v) Tween20]. The plate is added with an HRP (horseradish peroxidase) labeled secondary antibody (purchased from Sigma), incubated at 37° C. for 2 hours, and then washed 3 times with the plate washing solution [PBS+0.01% (v/v) Tween20]. The plate is added with 100 μl of TMB substrate/well, incubated for 30 minutes at room temperature, and then added with 100 μl of stopping solution (1.0N HCl)/well. A450 nm value is read using an ELISA plate reader (SpectraMax 384plus, purchased from Molecular Device), and the results are shown in
The desired cells are subjected to an expanded culture to 90% confluence in a T-75 cell culture flask; the medium is sucked out, and the cells are washed twice with a HBSS buffer (Hanks Balanced Salt Solution) (purchased from Invitrogen), and then treated with an enzyme-free cell dissociation solution (Versene solution: purchased from Life Technology) and collected. The cells are washed twice with the HBSS buffer and are counted, and then the cells are diluted with the HBSS buffer to 2×106 cells/ml, added with 1% goat serum blocking solution, with the percentage being the mass percentage, and incubated on ice for 30 minutes, and then centrifugally washed twice with the HBSS buffer. The collected cells are suspended to 2×106 cells/mL with an FACS buffer (HBSS+1% BSA, with the percentage being the mass percentage), added to a 96-well FACS reaction plate at 100 μL/well, then added with 100 μL/well of the purified EGFRvIII antibody (samples to be tested) obtained in example 2, and incubated on ice for 2 hours. The plate is centrifugally washed twice with the FACS buffer, added with 100 μL/well of a fluorescence (Alexa 488) labeled secondary antibody (purchased from Invitrogen), and then incubated on ice for 1 hour. The plate is centrifugally washed 3 times with the FACS buffer, added with 100 μL/well of a fixing solution [4% (v/v) paraformaldehyde] for suspending the cells, and centrifugally washed twice with the FACS buffer after 10 minutes. The cells are suspended in 100 μL of the FACS buffer, and the results are detected and analyzed by FACS (FACS Calibur, purchased from BD). Data analysis is performed by using the software (CellQuest) to obtain the mean fluorescence intensity (MFI) of the cells. Data is further analyzed by using the software (GraphPad Prism5) to perform data fitting and calculate EC50. The analysis results as shown in table 11 and
The antibody is dialyzed with a sodium borate buffer at Ph 6.5 to 8.5, and then a certain proportion of TCEP is added thereto (with TCEP/antibody ratio being 2-10); reduction is performed for 2 to 4 hours at room temperature or 37° C., and excess TCEPs are removed through a G25 desalting filler, and a certain proportion of MC-MMAF (with drug/antibody ratio being 5-20) is added and reacted for 4 hours. Then cysteine is added to neutralize excess drugs, and excess small molecules are removed through G25. A purified antibody-drug conjugate is afforded (for the coupling method, see Doronina, 2006, Bioconjugate Chem. 17, 114-124). The cytotoxic activity is analyzed after HIC analysis of drug cross-linking rate, purity and other parameters. For comparison, the drug cross-linking rate of all antibody conjugates is 8.
The resulting antibody conjugates are respectively subjected to gradient dilution with a complete medium, and a 96-well cell culture plate is added with 90 μL of the U87MG-EGFRvIII cell suspension at 1000 cells/well, and to each well is further added 10 μL of different concentrations of antibody-drug conjugate dilutions; after continuing the culture for 5 days, the cell viability is detected using the CellTiter-Glo kit (purchased from Promega, and the usage method refers to the product instructions).
The results are as shown in table 12 and
In order to identify the binding sites of antibodies to antigens, EGFRvIII antibodies are grouped using a competitive ELISA method.
The purified antibodies to be tested are diluted with PBS to 1 μg/mL, coated onto a 96-well high-adsorption ELISA plate at 50 μL/well at 4° C. overnight, and then blocked with 250 μL of the blocking solution [PBS containing 0.01% (v/v) Tween20 and 1% (w/w) BSA] for one hour at room temperature, and each well is added with 0.05 μg/mL of the biotin-labeled recombinant EGFRvIII protein. Simultaneously, 5 μg/mL of the competitive antibody (i.e., the purified EGFRvIII antibody obtained in example 2) is added thereto, and incubated at 25° C.−37° C. for 1 to 2 hours. The plate is washed 3 times with the plate washing solution [PBS containing 0.01% (v/v) Tween20], and added with HRP (horseradish peroxidase) labeled streptavidin (purchased from Sigma). The plate is incubated for 0.5 hour at 37° C., and then washed 3 times with the plate washing solution [PBS containing 0.01% (v/v) Tween20]. The plate is added with 100 μL of TMB substrate/well, incubated for 30 minutes at room temperature, and then added with 100 μL of stopping solution (1.0N HCl)/well. A450 nm value is read using an ELISA plate reader (SpectraMax 384 plus, purchased from Molecular Device), and the results are shown in
Competitive ELISA results show that epitopes of control antibody 02 and murine anti-7E5, -63A10 and -64F1 are competitive and should be the same or similar epitopes. Since the murine anti-75G7 is not competitive with control antibody 02, the epitope of the murine anti-75G7 should be different from that of the control antibody 02, wherein the control antibody 01 is AMG-595 (purchased from Amgen), and the control antibody 02 is ABT-414 (purchased from Abbvie).
In order to determine the specific antibody binding epitopes of 4 candidate antibodies of the present invention, a polypeptide dot matrix ELISA method is used to analyze the antigen binding epitopes of the antibodies. A polypeptide of 16 amino acids in length is synthesized by using amino acids at positions 1 to 50 of the EGFRvIII protein as a template, starting from the N-terminus, and moving in a manner of overlapping 15 amino acids. The C-terminus of the polypeptide is added with a biotin label. The polypeptide is synthesized by GL Biochem, with 35 biotin labeled polypeptides synthesized in total. The detailed sequences are shown in table 13-2.
The results of the polypeptide dot matrix ELISA show that the control antibody 01 and murine anti-7E5, murine anti-64F1 and murine anti-63A10 have the same or similar epitope, i.e., CGADSYEMEEDGVRKC. Since the murine anti-75G7 does not bind to the 35 polypeptides, the epitope of the murine anti-75G7 should be different from that of the control antibody 01 and that of the control antibody 02, wherein the control antibody 01 is AMG-595 (purchased from Amgen), and the control antibody 02 is ABT-414 (purchased from Abbvie). Therefore, the epitope of the murine anti-75G7 needs to be further verified by experiments.
The EGFRvIII antibody can specifically recognize the activated EGFR, i.e., the mutant EGFRvIII and overexpressed wild-type EGFR in tumor tissues, but does not recognize the wild-type EGFR in normal tissues. Therefore, the antibody has an excellent target specificity, which can effectively avoid the harm of the antibody and conjugates thereof to human normal tissues, and effectively gather the antibody and conjugates thereof at the location of the tumor, thereby accurately and effectively killing tumor tissues, and significantly reducing the toxic and side effects of drugs on other normal organs.
Seven human normal tissue cell lines are respectively selected from different tissues, and see table 14 for details. The binding of the EGFRvIII antibody to the seven human normal tissue cell lines is detected and analyzed by FACS. The results as shown in
The human glioma tissue chip (Cat #GL805d) and human multi-organ normal tissue chip (Cat #FDA999n) are both purchased from Alenabio, Co., Ltd. (Xi'an).
The experimental procedure of the immunohistochemical staining is as follows:
1) Baking the slices at 60° C. for 2 hours. The purpose of baking the slices is to firmly stick the tissue slices with paraffin on the slide to prevent the slices from falling off during the staining. The purpose is achieved by placing the slices in an incubator at 56° C. to 60° C. for at least 1 hour; however, the usual temperature of baking slices is 8° C. to 60° C. and the time thereof is 2 to 6 hours. Since high-temperature drying may accelerate the oxidation of antigens in the tissues, baking slices at a high temperature has a destructive effect on antigens.
2) Performing deparaffinage and hydration. Soaking is carried out in xylene I and II for 15 to 30 minutes to remove the paraffin from the tissues. However, the xylene entering human tissues is immiscible with a water-soluble staining solution, which makes it necessary to gradually replace the xylene in the tissues with a descending gradient of ethanol. The slices are soaked respectively in 100% ethanol I and II for 5 minutes and respectively in 95%, 90%, 80% and 70% ethanol for 2 minutes, rinsed with PBS 3 times, with 3 minutes each time, and placed in distilled water for later use.
3) Antigen repair. The tissue chip is placed in an EDTA solution (pH 9.0) and repair is performed at high temperature for 5 minutes.
4) Blocking endogenous peroxidase. The tissue chip is placed in 3% H2O2 at room temperature for 5 minutes.
5) Blocking non-immune serum. The tissue chip is placed in 5% FBS at room temperature for 15 minutes. Antibodies can be adsorbed by charged collagen and connective tissue components in tissue slices, which causes background coloring, and in order to prevent this situation, it is best to select the non-immune serum derived from the same species as the secondary antibody to block the charge before treating the slices with specific antibodies, which prevents the primary antibody from binding to the charge, and inhibits the non-specific background coloring. A common method is to use 2% to 10% sheep serum or 2% to 5% bovine serum albumin for reaction at room temperature for 10 to 30 minutes.
6) Incubating the primary antibody and placing the primary antibody at room temperature for 1 hour.
7) Incubating the secondary antibody and placing the secondary antibody at room temperature for 0.5 hour.
8) Allowing DAB to develop color and placing same at room temperature for 5 minutes.
9) Counterstaining nucleus with hematoxylin and placing same at room temperature for 10 seconds.
10) Sealing for detection.
The results of the immunohistochemical staining are shown in
It is proved in the FACS and IHC experiments that the EGFRvIII antibody can specifically recognize the EGFR in tumor tissues, but does not recognize the EGFR in normal tissues. Therefore, the antibody has an excellent target specificity, which can effectively avoid the harm of the antibody and conjugates thereof to human normal tissues, and effectively gather the antibody and conjugates thereof at the location of the tumor, thereby accurately and effectively killing tumor tissues, and significantly reducing the toxic and side effects of drugs on other normal organs.
Total RNA isolation: 5×107 hybridoma cells obtained in example 1 are collected by centrifugation, added with 1 mL Trizol and mixed evenly, transferred to a 1.5 mL centrifuge tube, and placed at room temperature for 5 minutes. The tube is added with 0.2 mL of chloroform, shaken for 15 seconds, standing for 10 minutes, and then centrifuged at 4° C. and at 12,000 g for 5 minutes, and the supernatant is taken and transferred to a new 1.5 mL centrifuge tube. 0.5 mL isopropanol is added, the liquid in the tube is mixed homogeneously and gently, then the tube is standing at room temperature for 10 minutes, is centrifuged at 4° C. and at 12000 g for 15 minutes, then the supernatant is discarded. After adding 1 mL of 75% (v/v) ethanol, the precipitate is gently washed; then the tube is centrifuged at 4° C. and at 12000 g for 5 minutes to discard the supernatant; the precipitate is dried in the air, and then the DEPC-treated H2O is added to perform dissolution (promoting the dissolution at 55° C. in water bath for 10 minutes) so as to obtain total RNA.
Reverse transcription and PCR: 1 μg of total RNA is taken to formulate a 20 μL system, to which a reverse transcriptase is added, for reacting at 42° C. for 60 minutes and at 7° C. for 10 minutes so as to stop the reaction. A 50 μL PCR system is formulated, comprising 1 μL cDNA, 25 pmol of each primer, 1 μL DNA polymerase, a corresponding buffer system, and 250 μmol dNTPs; the PCR program is set: pre-denaturating at 95° C. for 3 minutes; denaturating at 95° C. for 30 seconds; annealing at 55° C. for 30 seconds; extending at 72° C. for 35 seconds; after 35 cycles, further extending at 72° C. for 5 minutes to obtain a PCR product. The kit used for reverse transcription is PrimeScript RT Master Mix, purchased from Takara (Cat #RR036); and the kit used for PCR includes a Q5 ultra-fidelity enzyme, purchased from NEB (Cat #M0492).
Cloning and sequencing: 5 μL of the PCR product is taken for an agarose gel electrophoresis detection, and the column recovery kit is used to purify the positive test samples, wherein the recovery kit is NucleoSpin® Gel & PCR Clean-up, purchased from MACHEREY-NAGEL (Cat #740609). The ligation reaction is performed as follows: 50 ng of the sample, 50 ng of the T vector, 0.5 μL of the ligase, 1 μL of the buffer, and 10 μL of the reaction system are reacted at 16° C. for half an hour to obtain a ligation product. The ligated kit is a T4 DNA ligase, purchased from NEB (Cat #M0402); 5 μL of the ligation product is added to 100 μL of competent cells (Ecos 101 competent cells, purchased from Yeasten, Cat #FYE607), put into an ice bath for 5 minutes, then heat shocked in 42° C. water bath for 1 minute, put back on ice for 1 minute, then added with 650 μL of the antibiotic-free SOC medium, and recovered at a speed of 200 RPM on a shaker at 37° C. for 30 minutes. 200 μL of the mixture is taken and coated on an LB solid medium containing antibiotics, and incubated in a 37° C. incubator overnight. The next day, primers M13F and M13R on the T vector are used to formulate 30 μL of the PCR system for a colony PCR, wherein the colony is dipped with a pipette tip to the PCR reaction system, and then is blown and sucked; and 0.5 μL is sucked out and placed on another LB solid culture dish containing 100 nM ampicillin to preserve the strain. After the PCR reaction is completed, 5 μL is taken for an agarose gel electrophoresis detection, and the positive samples are sequenced and analyzed (wherein the CDR is divided according to the Chothia definition system). The sequencing results are shown in tables 16 and 17.
The numbers in table 17 are the sequence numbers in the sequence listing. For example, the amino acid sequence of the heavy chain protein variable region of 43H6A1 is SEQ ID No. 1, and the amino acid sequence of the CDR1 in the heavy chain protein variable region of 43H6A1 is SEQ ID No. 2.
The numbers in table 18 are the sequence numbers in the sequence listing. For example, the nucleotide sequence encoding the heavy chain protein variable region of 75G7C6 is SEQ ID No. 105.
1) Plasmid construction and preparation: The heavy chain variable region sequence of the above-mentioned murine lead antibody is recombined into an expression vector containing a signal peptide and a human heavy chain antibody IgG1 constant region (wherein the expression vector is purchased from Invitrogen, and the recombination steps are also completed by Shanghai ChemPartner Co., Ltd.), and the light chain variable region sequence of the EGFRvIII antibody is recombined into an expression vector containing a signal peptide and a human antibody light chain kappa constant region (wherein the expression vector is purchased from Invitrogen, and the recombination steps are also completed by Shanghai ChemPartner Co., Ltd), thereby obtaining recombinant plasmids (for the experimental principles and steps of the above-mentioned plasmid recombination, see “Molecular Cloning: A Laboratory Manual (Third Edition)”, (US) J. Sambrook et al.) which are verified by sequencing. An alkaline lysis kit (purchased from MACHEREY-NAGEL) is used for a medium-amount extraction of recombinant plasmids having a high purity, wherein the recombinant plasmids have a mass of at least 500 μg and filtered through a 0.22 μm filter membrane (purchased from Millopore) for transfection.
2) Cell transfection: 293E cells (purchased from Invitrogen) are cultured in a Freestyle 293 expression medium (purchased from Invitrogen). The shaker is set to 37° C., 130 RPM and 8% CO2 (v/v). During transfection, the Freestyle 293 expression medium is added with 10% (v/v) F68 (purchased from Invitrogen) to a final F68 concentration of 0.1% (v/v), thereby obtaining a Freestyle 293 expression medium containing 0.1% (v/v) F68 (i.e., a medium A). 5 mL of the medium A and 200 μg/mL PEI (purchased from Sigma) are mixed evenly to obtain a medium B. 5 mL of the medium A and 100 μg of the recombinant plasmids (herein referred to the mixed recombinant plasmids obtained by mixing the above-mentioned heavy chain recombinant plasmid and light chain recombinant plasmid in a conventional equal proportion) obtained in step 1) are mixed evenly to obtain a medium C. After 5 minutes, the medium B and the medium C are combined and mixed evenly, and allowed to stand for 15 minutes to obtain a mixed solution D. 10 mL of the mixed solution D is slowly added to 100 mL of the Freestyle 293 expression medium containing 293E cells until the cell density of 293E is 1.5×106 cells/mL, wherein the addition is performed with shaking to avoid excessive concentration of PEI, and the mixture is placed in a shaker for culture. The next day peptone is added to a final concentration of 0.5% (w/v). On days 5 to 7, the antibody titer of the culture broth is measured. On days 6 and 7, the supernatant is collected by centrifugation (3500 RPM, 30 minutes) and filtered through a 0.22 μm filter membrane to obtain the filtered cell supernatant for purification.
3) Antibody purification: The continuously produced endotoxin-free chromatography columns and protein A fillers are treated with 0.1 M NaOH for 30 minutes or rinsed with 5 column volumes of 0.5 M NaOH; the long-term unused column materials and chromatography columns are soaked with 1 M NaOH for at least 1 hour, rinsed with endotoxin-free water to neutrality, and washed with 10-fold column volumes of 1% Triton X100. 5 column volumes of PBS are used for equilibration, and the filtered cell supernatant is loaded onto the column, and the flow-through fluid is collected if necessary. After loaded onto the column, 5-fold column volumes of PBS are used for washing. 5-fold column volumes of 0.1 M Glycine-HCl (pH 3.0) is used for eluting, and the eluate is collected and neutralized with 1/10 volume of 1 M Tris-HCl (1.5 M NaCl) (pH 8.5). The antibodies are harvested, and then dialyzed in 1×PBS overnight to avoid endotoxin contamination. After dialysis, the concentration is measured using a spectrophotometer or a kit; the purity of the antibodies is measured using HPLC-SEC; and the endotoxin content of the antibodies is detected using an endotoxin detection kit (purchased from Lonza).
Purified EGFRvIII chimeric antibodies 75G7, 63A10, 64F1 and 7E5 are obtained respectively.
The purified EGFRvIII chimeric antibody is detected and analyzed for protein concentration (A280/1.4), purity, endotoxin (Lonza kit), etc. The results as shown in table 18 indicate that the endotoxin concentration of the final antibody product is within 1.0 EU/mg.
The purified EGFRvIII chimeric antibody obtained in example 2 is reacted with a human EGFRvIII-hFc protein.
The purified immunogen A (EGFRvIII-hFc) obtained in example 1 (for the preparation method, see step (I) in example 1) is diluted with PBS to a final concentration of 1.0 μg/mL, and then added to a 96-well ELISA plate at 100 μl/well. The plate is sealed with a plastic film and incubated at 4° C. overnight. The next day, the plate is washed twice with a plate washing solution [PBS+0.01% (v/v) Tween20], and added with a blocking solution [PBS+0.01% (v/v) Tween20+1% (w/w) BSA] for blocking 2 hours at room temperature. The blocking solution is discarded, and the purified EGFRvIII chimeric antibody obtained in example 2 is added at 100 μl/well. The plate is incubated for 2 hours at 37° C., and then washed 3 times with the plate washing solution [PBS+0.01% (v/v) Tween20]. The plate is added with an HRP (horseradish peroxidase) labeled secondary antibody (purchased from Sigma), incubated at 37° C. for 2 hours, and then washed 3 times with the plate washing solution [PBS+0.01% (v/v) Tween20]. The plate is added with 100 μl of TMB substrate/well, incubated for 30 minutes at room temperature, and then added with 100 μl of stopping solution (1.0N HCl)/well. A450 nm value is read using an ELISA plate reader (SpectraMax 384plus, purchased from Molecular Device), and the results are shown in
The desired cells are subjected to an expanded culture to 90% confluence in a T-75 cell culture flask; the medium is sucked out, and the cells are washed twice with a HBSS buffer (Hanks Balanced Salt Solution) (purchased from Invitrogen), and then treated with an enzyme-free cell dissociation solution (Versene solution: purchased from Life Technology) and collected. The cells are washed twice with the HBSS buffer and are counted, and then the cells are diluted with the HBSS buffer to 2×106 cells/ml, added with 1% goat serum blocking solution, with the percentage being the mass percentage, and incubated on ice for 30 minutes, and then centrifugally washed twice with the HBSS buffer. The collected cells are suspended to 2×106 cells/mL with an FACS buffer (HBSS+1% BSA, with the percentage being the mass percentage), added to a 96-well FACS reaction plate at 100 μL/well, then added with 100 μL/well of the purified EGFRvIII antibody (samples to be tested) obtained in example 2, and incubated on ice for 2 hours. The plate is centrifugally washed twice with the FACS buffer, added with 100 μL/well of a fluorescence (Alexa 488) labeled secondary antibody (purchased from Invitrogen), and then incubated on ice for 1 hour. The plate is centrifugally washed 3 times with the FACS buffer, added with 100 μL/well of a fixing solution [4% (v/v) paraformaldehyde] for suspending the cells, and centrifugally washed twice with the FACS buffer after 10 minutes. The cells are suspended in 100 μL of the FACS buffer, and the results are detected and analyzed by FACS (FACS Calibur, purchased from BD). Data analysis is performed by using the software (CellQuest) to obtain the mean fluorescence intensity (MFI) of the cells. Data is further analyzed by using the software (GraphPad Prism5) to perform data fitting and calculate EC50. The analysis results as shown in table 20 and
The antibody is dialyzed with a sodium borate buffer at Ph 6.5 to 8.5, and then a certain proportion of TCEP is added thereto (with TCEP/antibody ratio being 2-10); reduction is performed for 2 to 4 hours at room temperature or 37° C., and excess TCEPs are removed through a G25 desalting filler, and a certain proportion of MC-MMAF (with drug/antibody molar ratio being 4.5) is added and reacted for 4 hours. Then cysteine is added to neutralize excess drugs, and excess small molecules are removed through G25. A purified antibody-drug conjugate is afforded (for the coupling method, see Doronina, 2006, Bioconjugate Chem. 17, 114-124). The cytotoxic activity is analyzed after HIC analysis of drug cross-linking rate, purity and other parameters. For comparison, the drug cross-linking rate of all antibody conjugates is 8 [the DAR (drug-antibody coupling ratio) is adjusted through experiments to reach the preset value, and the exact DAR is detected and determined later].
The resulting antibody conjugates are respectively subjected to gradient dilution with a complete medium, and a 96-well cell culture plate is added with 90 μL of the U87MG-EGFRvIII cell suspension at 1000 cells/well, and to each well is further added 10 μL of different concentrations of antibody-drug conjugate dilutions; after continuing the culture for 5 days, the cell viability is detected using the CellTiter-Glo kit (purchased from Promega, and the usage method refers to the product instructions).
The results are as shown in table 21 and
Since the expression of EGFRvIII genes may gradually be lost during the in vitro cultivation of tumor cells, there is no endogenous tumor cell line that stably expresses the EGFRvIII gene. Therefore, the U87MG-EGFRvIII recombinant cell line is constructed and the U87MG-EGFRvIII subcutaneous xenograft tumor model is established to test the in vivo efficacy of the chimeric antibody-drug conjugate.
See example 1 for the construction method of the U87MG-EGFRvIII recombinant cell line. 200 μl (1×106 cells) of the U87MG-EGFRvIII cell suspension is inoculated on the right back of Balb/c nude mice; after 7 days, tumor grew to 150-250 mm3, and then tumor-bearing mice with appropriate tumor volume are selected, randomly grouped according to the tumor volume before administering drugs per the established schedule. See table 22 for specific research plan and administration dose.
The experimental indicators are to investigate whether tumor growth may be inhibited, delayed or cured. The next day the diameter of the tumor is measured with a vernier caliper. The computational formula of the tumor volume is: V=0.5×a×b2, where a and b represent the long diameter and short diameter of the tumor, respectively.
The tumor suppressive effect (TGI) of the compounds is evaluated by TGI (%). TGI %=1−(TRTVn−TRTV1)/(CRTVn−CRTV1)×100% (TRTVn: RTV on the last day of the treatment group; CRTV: RTV in the negative control group). The computational formula of the relative tumor volume RTV is RTV=Vt/V0, wherein V0 is the tumor volume measured during administration in different cages (Day 0), and Vt is the tumor volume at each measurement. The percentage value of T/C (%) reflects the tumor growth inhibition rate; according to guiding principles regarding anti-tumor drugs of Center for Drug Evaluation, TGI %≤58% indicates that the drug is effective.
The results are shown in
By analyzing the antibody sequences of 75G7C6 and 63A10A7, it is found that there is a possibility of deamidation reaction of NG at positions 54 and 55 of the CDR2 (SEQ ID No. 3) and isomerization of DG at positions 98 and 99 of the CDR3 (SEQ ID No. 4) in the heavy chain variable region (SEQ ID No. 1) of the 75G7C6 antibody. In order to suppress deamidation, isomerization and hydrolysis, upon calculation and analysis, the NG located in the CDR2 of the 75G7C6 antibody is mutated to NA, and the amino acid sequence of the heavy chain variable region of the 75G7C6 antibody after mutation is as shown in SEQ ID No. 179; or the NG is mutated to QG, and the amino acid sequence of the heavy chain variable region of the 75G7C6 antibody after mutation is as shown in SEQ ID No. 180. Moreover, the DG located in the CDR3 is mutated to SG, and the amino acid sequence of the heavy chain variable region of the 75G7C6 antibody after mutation is as shown in SEQ ID No. 181 (the corresponding nucleic acid sequence is as shown in SEQ ID No. 188); or the DG is mutated to EG, and the amino acid sequence of the heavy chain variable region of the 75G7C6 antibody after mutation is as shown in SEQ ID No. 182 (the corresponding nucleic acid sequence is as shown in SEQ ID No. 131); or the DG is mutated to DA, and the amino acid sequence of the heavy chain variable region of the 75G7C6 antibody after mutation is as shown in SEQ ID No. 183 (the corresponding nucleic acid sequence is as shown in SEQ ID No. 132). Amino acid modifications of removing asparagine residues and aspartoyl residues that are used as sites undergoing deamidation, are intended to be performed by means of the above-mentioned site-directed mutations.
After the site-directed mutant sequences of the heavy chain variable region of the chimeric 75G7C6 antibody are genetically synthesized, plasmid construction, cell transfection, and antibody purification are performed as described in example 8. The purified antibodies are identified for the mutant chimeric antibody binding activity by ELISA and FACS as described in example 9. ELISA identification results are shown in
FACS identification results are as shown in
As can be seen from table 24 and
As can be seen from table 25 and
As can be seen from the above-mentioned ELISA and FACS results, the NG located at the CDR2 in the heavy chain variable region of the c75G7C6 can be mutated to NA or QG, and the binding activity in antigens of the mutated antibodies is not affected; however, whether the DG located in CDR3 is mutated to EG, SG or DA, it affects the binding of the antibodies to antigens, in which case mutation is not suitable. Therefore, the antibody c75G7C6-1 with NG of the CDR2 mutated to NA will be selected for follow-up study.
The heavy and light chain variable region templates of human germline antibodies that best match the non-CDR regions of the above-mentioned chimeric antibodies 63A10A7 and 75G7C6 are selected from the Germline database. The human acceptable sequences of the humanized EGFRvIII antibody are selected from human germline exons VH, JH, Vk and Jk sequences. The template of the heavy chain variable region of the 63A10A7 antibody is IGHV1-46*01 of the human germline antibody heavy chain exon VH, and JH-4 of the exon JH; the template of the light chain variable region is IGKV1-39*01 of the human germline antibody light chain exon VK, and JK-2 of the exon JK. The template of the heavy chain variable region of the 75G7C6 antibody is IGHV1-46*01 of the human germline antibody heavy chain exon VH, and JH-6b of the exon JH; and the template of the light chain variable region is IGKV3-11*01 of the human germline antibody light chain exon VK, and JK-4 of the exon JK.
The heavy chain and light chain CDRs of the chimeric antibodies 63A10A7 and 75G7C6 determined according to the Kabat definition are grafted into the selected human germline templates, respectively, to replace the CDR regions of the human germline templates so as to obtain a humanized antibody. Then, based on the three-dimensional structure of the murine antibody, the residues that embed residues and directly interact with the CDR regions, and the residues in framework regions that have an important influence on the conformations of VL and VH are subjected to back mutation to obtain a humanized antibody. Briefly, synthetic overlapping oligonucleotides across the humanized VH or VL domain are generated, and the PCR overlap extension is used to assemble each domain. The restriction site incorporated into the PCR product is used to directionally clone the VH domain into an expression vector containing a signal peptide and a human antibody heavy chain IgG1 constant region, and to directionally clone the VL domain into an expression vector containing a signal peptide and a human antibody light chain kappa constant region, thereby obtaining recombinant plasmids which are verified by sequencing; an alkaline lysis kit (purchased from MACHEREY-NAGEL) is used for a medium-amount extraction of the recombinant plasmids having a high purity, wherein the recombinant plasmids have a mass of at least 500 μg and filtered through a 0.22 μm filter membrane (purchased from Millopore) for transfection.
The sequence alignments of the heavy and light chain variable regions of the humanized anti-EGFRVIII antibody variants with the human germline heavy and light chain variable regions and the heavy and light chain variable regions of chimeric antibodies are as shown in
Several framework positions are selected to reintroduce mouse donor residues. Several humanized 75G7C6 and 63A10A7 variants may be generated by incorporating different combinations of mouse framework donor residues into the VH domain or of human CDR residues into the VL domain. Table 26 and table 27 summarize these variants. Table 26 and table 27 show the variable regions but not constant regions of these variants. In tables 26 and 24, beginning with c represents a chimeric antibody, and beginning with h represents a humanized antibody; in the column of donor framework residues and back mutations, for example, “Q1E” in the heavy chain variable region h75G7C6.VH of the humanized anti-EGFRvIII antibody 75G7C6 and variants thereof means that the amino acid at position 1 as shown in
cDNAs (i.e., the sequences respectively shown in SEQ ID NO. 154, 155, 156, 157, 158, 159, 160, 161, 162, 163, 164, 165, 166, 167, 168, 169, 170, 171, 172, 173 and 174 in the sequence listing) are synthesized based on the amino acid sequence of the light chain variable region and heavy chain variable region of each humanized antibody; the heavy chain cDNA is digested with FspAI and AfeI, and the light chain cDNA is digested with FspAI and BsiwI; then the cDNA fragments are inserted into an expression vector containing a signal peptide and a human heavy chain antibody IgG1 constant region and an expression vector containing a signal peptide and a human antibody light chain kappa constant region (wherein the expression vectors are purchased from Invitrogen, and the recombination steps are also completed by Shanghai ChemPartner Co., Ltd) through FspAI/AfeI or FspAI/BsiwI restriction enzyme sites, respectively, thereby obtaining recombinant plasmids which are verified by sequencing; an alkaline lysis kit (purchased from MACHEREY-NAGEL) is used for a medium-amount extraction of the recombinant plasmids having a high purity, wherein the recombinant plasmids have a mass of at least 500 μg and filtered through a 0.22 μm filter membrane (purchased from Millopore) for transfection.
Before transfection, 293E cells (purchased from Invitrogen) are cultured in a Freestyle 293 expression medium (purchased from Invitrogen). During transfection, the Freestyle 293 expression medium is added with 10% (v/v) F68 (purchased from Invitrogen) to a final F68 concentration of 0.1% (v/v), thereby obtaining a Freestyle 293 expression medium containing 0.1% (v/v) F68 (i.e., a medium A). 5 mL of the medium A and 200 μg/mL PEI (purchased from Sigma) are mixed evenly to obtain a medium B. 5 mL of the medium A and 100 μg of the heavy and light chain recombinant plasmids (the mass ratio of the heavy chain recombinant plasmid to the light chain recombinant plasmid is in the range of 1:1-1:3) are mixed evenly to obtain a medium C. After 5 minutes, the medium B and the medium C are combined and mixed evenly, and allowed to stand for 15 minutes to obtain a mixed solution D. 10 mL of the mixed solution D was slowly added to 100 mL of the Freestyle 293 expression medium containing 293E cells until the cell density of 293E is 1.5×106 cells/mL, wherein the addition was performed with shaking to avoid excessive concentration of PEI, and the mixture was placed in a shaker for culture, with the shaker being set to 37° C., 130 RPM and 8% CO2 (v/v). The next day peptone is added to a final concentration of 0.5% (w/v). On days 5 to 7, the antibody titer of the culture broth is measured. On days 6 and 7, the supernatant is collected by centrifugation (3500 RPM, 30 minutes) and filtered through a 0.22 μm filter membrane to obtain the filtered cell supernatant for purification.
During antibody purification, the continuously produced endotoxin-free chromatography columns and protein A fillers (purchased from GE) are rinsed with 5 column volumes of 0.5 M NaOH. Then 5 column volumes of PBS (a PBS buffer, pH 7.4) are used for equilibration to neutrality, and then the filtered cell supernatant is loaded onto the column, and the flow-through fluid is collected if necessary. After loaded onto the column, 5-fold column volumes of PBS are used for washing. 5-fold column volumes of 0.1 M Glycine-HCl (pH 3.0) is used for eluting, and the eluate is collected, and immediately added with 0.1-fold volume of 1 M Tris-HCl (1.5 M NaCl) (pH 8.5) to neutralize the EGFRvIII antibody. All the solutions used above need to be freshly prepared. The EGFRvIII antibody is harvested, and then dialyzed in 1×PBS for 4 hours to avoid endotoxin contamination. After dialysis, the concentration is measured using a spectrophotometer or a kit; the purity of the antibodies is measured using HPLC-SEC; and the endotoxin content of the antibodies is detected using an endotoxin detection kit (purchased from Lonza). The obtained EGFRvIII antibody is subjected to characterization (the operation steps are as described in example 13 below).
The purified humanized anti-EGFRvIII antibody obtained in example 12 is reacted with a human EGFRvIII-hFc protein.
The purified immunogen A (EGFRvIII-hFc) obtained in example 1 (for the preparation method, see step (I) in example 1) is diluted with PBS to a final concentration of 1.0 μg/mL, and then added to a 96-well ELISA plate at 100 μl/well. The plate is sealed with a plastic film and incubated at 4° C. overnight. The next day, the plate is washed twice with a plate washing solution [PBS+0.01% (v/v) Tween20], and added with a blocking solution [PBS+0.01% (v/v) Tween20+1% (w/w) BSA] for blocking 2 hours at room temperature. The blocking solution is discarded, and the purified EGFRvIII chimeric antibody obtained in example 2 is added at 100 μl/well. The plate is incubated for 2 hours at 37° C., and then washed 3 times with the plate washing solution [PBS+0.01% (v/v) Tween20]. The plate is added with an HRP (horseradish peroxidase) labeled secondary antibody (purchased from Sigma), incubated at 37° C. for 2 hours, and then washed 3 times with the plate washing solution [PBS+0.01% (v/v) Tween20]. The plate is added with 100 μl of TMB substrate/well, incubated for 30 minutes at room temperature, and then added with 100 μl of stopping solution (1.0N HCl)/well. A450 nm value is read using an ELISA plate reader (SpectraMax 384plus, purchased from Molecular Device). The results are shown in
The preparation of U87MG-EGFRvIII required for FACS detection is as described in the preparation of the immunogen B in example 1. Normal human primary hepatocytes are purchased from BioreclamationIVT, and A431 tumor cells are purchased from ATCC. The recovered hepatocytes are directly used for FACS detection, and the recovered U87MG-EGFRvIII and A431 cells are subjected to an expanded culture to 90% confluence in a T-75 cell culture flask; the medium is sucked out, and the cells are washed twice with a HBSS buffer (Hanks Balanced Salt Solution) (purchased from Invitrogen), and then treated with an enzyme-free cell dissociation solution (Versene solution: purchased from Life Technology) and collected. The cells are washed twice with the HBSS buffer and are counted, and then the cells are diluted with the HBSS buffer to 2×106 cells/ml, added with 1% goat serum blocking solution, with the percentage being the mass percentage, and incubated on ice for 30 minutes, and then centrifugally washed twice with the HBSS buffer. The collected cells are resuspended to 2×106 cells/mL with an FACS buffer (HBSS+2% FBS, with the percentage being the volume percentage), added to a 96-well FACS reaction plate at 100 μL/well, then added with 100 μL/well of the purified EGFRvIII antibody (samples to be tested) obtained in example 12, and incubated on ice for 2 hours. The plate is centrifugally washed twice with the FACS buffer, added with 100 μL/well of a 1:1000 diluted fluorescence (Alexa 488) labeled secondary antibody (purchased from Invitrogen), and incubated on ice for 1 hour. The plate is centrifugally washed 3 times with the FACS buffer, added with 100 μL/well of a fixing solution [4% (v/v) paraformaldehyde] to resuspend the cells, and centrifugally washed twice with the FACS buffer after 10 minutes. The cells are resuspended in 100 μL of the FACS buffer, and the results are detected and analyzed by FACS (FACS Calibur, purchased from BD). Data analysis is performed by using the software (CellQuest) to obtain the mean fluorescence intensity (MFI) of the cells. Data is further analyzed by using the software (GraphPad Prism5) to perform data fitting and calculate EC50.
The analysis results are as shown in tables 30 and 31, and
In addition, humanized 75G7C6 and 63A10A7 antibody variants can also weakly bind to the wild-type EGFR. A431 cells (human epidermal cell cancer cell line) overexpress a large amount of wild-type EGFR proteins, and normal human primary hepatocytes also have a certain amount of wild-type EGFR protein expressed on the surface thereof; as shown in
Similarly, each variant of the humanized 63A10A7 antibody has a different binding ability regarding the EGFR protein on the surfaces of A431 and normal human primary hepatocytes. As shown in
In order to evaluate the binding specificity and affinity of the humanized anti-EGFRvIII antibody to the EGFRvIII and EGFR proteins, on an OctetRED (purchased from Pall) instrument, an anti-human Fc biosensor AHC (purchased from ForteBio) is used to bind the humanized h75G7C6 antibody variant to be tested, which then binds to the EGFRvIII-his (purchased from Sino Biological) or EGFR-his (purchased from Sino Biological) proteins, and detected by means of bio-layer interferometry (BLI) technology. BLI technology involves: detecting the interaction of the molecules fixed on the sensor with the molecules in the solution, which results in that the biofilm thickness is increased, and the interference spectrum curve is shifted along the direction of increased wavelength, wherein the phase shift of light waves can be detected by workstation in real time, and analyzed to quantitatively obtain the changes in the number of molecules on the sensor surface and related concentration and kinetic data. The binding rate (Ka), dissociation rate (Kdis) and binding affinity (KD) of the humanized h75G7C6 antibody variant to the EGFRvIII and EGFR proteins are as shown in tables 34 and 35, wherein the chimeric antibody c75G7C6 is used as a control.
OctetRED results show that the binding affinity to the EGFRvIII and EGFR proteins of the humanized h75G7C6 antibody variant is very close to that of the chimeric antibody c75G7C6, which verifies that the humanized h75G7C6 antibody does not significantly reduce the antigen binding activity compared to the chimeric antibody. Furthermore, the binding affinity of the humanized h75G7C6 antibody variant to the EGFRvIII protein is about 1.5 nM, and the binding affinity thereof to the EGFR protein is about 0.2 uM.
Although the specific embodiments of the present invention have been described above, it will be understood by those skilled in the art that these are merely illustrative, and that various alterations or modifications can be made to these embodiments without departing from the principle and essence of the present invention. Therefore, the scope of protection of the present invention is defined by the appended claims.
Number | Date | Country | Kind |
---|---|---|---|
201711329680.1 | Dec 2017 | CN | national |
The present application is a 371 of PCT/CN2018/120959, filed Dec. 13, 2018, which is based upon and claims priority to Chinese patent application CN 201711329680.1 filed on Dec. 13, 2017, both of which are hereby incorporated by reference in their entireties.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/CN2018/120959 | 12/13/2018 | WO | 00 |