The present invention relates to generation of antibodies and proteins which combine two or more segments of amino acid sequence from a human antibody or protein within the final antibody or protein molecule. In particular, the present invention provides such combinations of sequence segments such that the number of T cell epitopes in the final antibody or protein molecule is reduced or avoided. The invention particularly relates to the generation of antibodies and proteins for use as pharmaceutical agents in humans or as in vivo diagnostic agents.
The last 20 years has seen great advances in the generation of recombinant monoclonal antibodies for use as potential pharmaceuticals in man. The techniques of chimerization, humanization and human antibody cloning either by phage display or transgenic mice have provided antibodies which are generally well tolerated when administered to man with less immunogenicity than with non-human monoclonal antibodies. However, several antibodies generated by these techniques have been shown to elicit immunogenicity in patients even where the genetic origins of such antibodies are human. For example, the human antibody Humira®, elicits immunogenicity in 12% of rheumatoid arthritis patients and the humanized antibody CAMPATH® elicits immunogenicity in about 50% of patients. Such induction of immunogenicity is likely to result from the presence, within the antibody variable region, of tracts of non-self amino acid sequences which, in some cases, can create T cell epitopes which induce T cell responses resulting in immunogenicity. There is therefore a need for improved techniques and antibody compositions which have a high human origin but which avoid, as much as possible, creation of sequences which might induce T cell responses.
The present invention provides methods and resultant antibody compositions whereby, for therapeutic use, such antibodies (herein termed “composite antibodies”) combine two or more segments of amino acid sequence from a human antibody within the final antibody molecule. Thus, in a first aspect, the present invention provides modified antibody or antigen-binding fragment thereof wherein the heavy and light chain variable regions of the modified antibody or antigen-binding fragment are each composed of two or more segments of amino acid sequence from one or more other antibodies or antigen-binding fragments, whereby the segments are neither whole CDRs nor framework regions.
In the context of the present invention, the term “segments” refers to contiguous amino acid sequence found within an antibody molecule, such segments ranging in size from 2 to 125 amino acids long, preferably ranging from 2 to 31 amino acids long where such segments are neither whole CDRs nor whole framework regions. For therapeutic use, composite antibodies of the present invention will typically combine two or more segments of amino acid sequence from different human antibodies within the variable regions of the composite antibody. In particular, the present invention relates to composite antibody heavy and light chain variable regions (VH and VL respectively) where each VH and VL is composed entirely of segments of sequence from two or more human antibody variable regions and where typically each composite VH and VL includes segments of human variable region sequence positions corresponding to their positions in the source human antibody VHs and VLs, for example amino acids 1 to 10 in the composite VH sequence will derive from amino acids 1 to 10 in a human antibody. Alternatively, segments of human VH or VL sequence in the composite antibody may be positioned at any sequence location irrespective of the sequence position in the source human antibody VH or VL. The source human antibody VHs and VLs will be any existing human antibody variable (V) region amino acid sequence, for example as provided in databases of human monoclonal antibody V region sequences, and may include sequences from affinity-matured antibodies with V region somatic mutations and other variations differing from germ-line, sequences from germ-line V regions, sequences from artificially constructed antibody V regions created from segments of sequence from antibodies of the species such as antibodies with a set of fixed V region frameworks but with variable CDRs, sequences selected from human antibody libraries such as phage display libraries, and sequences of human antibodies derived from transgenic animals expressing genes encoding human antibodies or antibody fragments.
In a preferred embodiment of the present invention, composite antibodies of the invention for therapeutic use are constructed by combining multiple human VH and VL sequence segments in combinations which limit or avoid human T cell epitopes in the final composite antibody V regions.
Human T cell epitopes in this respect are amino acid sequences which can bind to human MHC class II molecules and, through presentation to CD4+ T cells, induce a helper T cell response. Human VH and VL sequence segments and combinations of segments can be chosen which limit or avoid T cell epitopes in the final composite antibody. This can be achieved by use of segments which do not contain T cell epitopes, such as from human germ-line sequences, and by joining of adjacent segments to create a new sequence which does not contain T cell epitopes, for example by creation of a non-MHC binding sequence at the junction of two segments, by creation of another human germ-line sequence, or by creation of a sequence which does not induce a helper T cell response despite a non-germ-line sequence.
In another preferred embodiment of the present invention, additional amino acid sequences can be added or created within the composite antibody molecules which provide for one or more regulatory T cell epitopes (“Tr epitopes”). For the purpose of the invention, Tr epitopes are MHC binding peptides which stimulate CD4+ CD25+ T cells with the ability to regulate immune responses by the secretion of inhibitory cytokines such as IL-10 and TGF-β, as well as contact dependent mechanisms. As such, within the scope of the invention, regulatory T cell epitopes can include peptides shown to induce one or more activities in vitro or in vivo which could contribute to regulation of immune responses under certain conditions. For example, regulatory T cell epitopes will include peptides with the action of inducing or activating CD4+ CD25+ T cells, with the action of inducing release of inhibitory cytokines such as IL-10 and/or TGF-β, or with other measurable immunosuppression-related activities either in vitro or in vivo, in all cases where the actions are related to the action of CD4+ CD25+ T cells. Thus, such Tr epitopes can provide an additional measure to limit or avoid immunogenicity in the composite antibody. Tr epitopes can be introduced into the composite antibody VH or VL by incorporation of segments of human VH and VL containing these epitopes or by creation of such epitopes via combination of two or more human sequence segments or by screening for new Tr epitopes, for example from peptides corresponding to segments of human antibody or protein sequence, for induction or activation of CD4+ CD25+ T cells, for example by measurement of release of inhibitory cytokines such as IL-10 and/or TGF-β (e.g. Hall et al., Blood, vol. 100 (2002) p 4529-36). Alternatively, known Tr epitopes can be incorporated within composite antibody V regions at positions within VH and/or VL which do not inhibit binding or function or expression of the composite antibody or can be incorporated at one terminus of the composite VH or VL sequence, for example at the N terminus of VH. Alternatively, Tr epitopes can be incorporated into one or both constant regions of a composite antibody at locations which do not interfere with function of the composite antibody (e.g. within the hinge regions) or cause some other deleterious effect such as lack of expression. Alternatively, for one or both of composite VH and VL's within antibody fusion proteins, antibody conjugates, Fab and Fv-type forms (including single chain antibodies (SCAs) with VH and VL linked), single domain antibodies, or homodimeric antibodies, Tr epitopes can be incorporated at locations which do not interfere with function of the composite antibody or cause some other deleterious effect such as lack of expression. For example, in SCAs, an especially preferred location for a Tr epitope is within the linker region joining VH and VL. Optimally, Tr epitopes will be flanked by appropriate sequences to optimise the release and presentation of regulatory T cell epitopes on MHC class II molecules, for example by flanking the epitope with sequences that are sensitive to the action of endocytic proteases. Typically, flanking residues at positions ranging from P-20 to P30 (with the core nonomer defined as P1-P9) that will target the action of proteases during antigen processing are introduced, if necessary using additional segments of human antibody sequence.
As discussed herein, the present invention also provides methods for the production of modified or composite antibodies. Thus, in another aspect, the present invention provides a method for producing a modified antibody comprising the steps;
(1) preparing antibody variable region genes by combining segments of amino acid sequence from a range of other antibody variable regions in order to generate a library of different variable region genes
(2) cloning the library of antibody variable region genes into an expression vector
(3) screening the library of antibody variable regions and recovering members of the library with desirable properties
In a first preferred method ‘A’ of the present invention, a library of composite human antibodies is generated and screened for antibodies with desirable properties such as binding to a specific antigen. This method involves 6 steps as follows;
(1) design of composite VH and VL genes
(2) cloning of composite VH and VL genes
(3) expression of composite VH and VL genes
(4) screening and selection of composite antibodies with desirable properties
(5) optimisation of lead composite antibodies
(6) (optional) avoidance of T cell epitopes
For step (1), the library of composite VH and VL sequences are designed by selecting segments of VH and VL sequence from known human V region sequences such as those available in the Kabat antibody database (www.bioinf.org.uk/abs/simkab.html), the NCBI database (www.ncbi.nlm.nih.gov) and from protein databases, such as UniProt (www.ebi.uniprot.org) and PRF/SEQDB (www.prf.or.jp). In addition, these can be supplemented by collection of human VH and VL sequences by direct sequencing of amplified VH and VL mRNA from one or more individual donors. Various combinations of sequence segments can be considered for design of VH and VL genes. One method used is to fix the length of the composite VH and VL sequences and to design these using fixed length sequence segments from corresponding Kabat numbering positions in different human V regions.
For example, the library would comprise VH and VL regions of 121 and 107 amino acids respectively and would include, for example, an assortment of different segments for VH amino acids 1-27 using Kabat numbering. For VH with CDRs corresponding to Kabat numbering CDR1:30-35, CDR2:50-66 and CDR3:95-106, sequence segments for the following Kabat positions are used as one option: 1-27, 28-31, 32-36, 37-42, 43-50, 51-56, 57-60, 61-63, 64-69, 70-82a, 82b-96, 97-98, 99-101, 102-117. For VL with CDRs corresponding to Kabat numbering CDR1:24-34, CDR2: 50-56, CDR3:89-97, sequence segments for the following Kabat positions are used as one option: 1-22, 23-27, 28-30, 31-33, 34-35, 36-47, 48-52, 53-55, 56-59, 60-87, 88-92, 93-94, 95-107. Therefore, in this example, composite VHs are composed of 14 human segments and composite VLs are composed of 13 human segments. In practice, a computer program is used to generate combinations of these segments. Preferably, the program includes an algorithm to avoid non-preferred combinations of certain segments which might, for example, avoid certain canonical structures of CDRs or which might disrupt VH and/or VL folding or VH/VL interaction. As an optional addition, the program could include an algorithm to limit the number of T cell epitopes formed by the combination of sequence segments (see in silico methods in step (6) below).
For step (2), having designed a library of composite human sequences, composite VH and VL genes are then generated preferably using synthetic oligonucleotides. Typically, synthetic oligonucleotides encoding longer segments of V region sequence will be ligated to a mixture of oligonucleotides which encode two or more consecutive segments of V region sequence. Alternatively, composite V regions could be assembled by other methods such as overlapping PCR or other amplification techniques using existing human VH and VL genes as templates. For example, using PCR, small segments of V regions can be amplified separately and then joined by overlapping PCR reactions.
In other methods, mixed synthetic oligonucleotides can be produced to create a range of sequence segments preferably using doping methods to enrich for sequences encoding specific V region segments. Composite human VH and VL genes with extensive variability of human V region segment representation can be assembled in many ways using techniques known to those skilled in the art such as those described in Molecular Cloning: A Laboaratory Manual; 3rd Ed., vols. 1-3 (2001) Cold Spring Harbor Laboratory Press and using standard PCR methods for immunoglobulins such as those described in Orlandi et al., Proc Natl Acad Sci USA., 86 (1989) 3833-3837.
For step (3), once composite human VH and VL genes are generated, these can be cloned into a variety of expression vectors for production of either complete antibody molecules or antigen-binding fragments such as Fv's, Fab's, Fab2, SCAs, single domain antibodies (e.g. comprising VHs only) and multimeric derivatives of each of these. Alternatively, VH and VL genes can be fused to genes encoding other molecules to generate fusion proteins. Also included might be sequences encoding detectable markers such as poly-histidine tags at the C terminus of one chain of an Fv or Fab. Expression vectors include those for expression in mammalian cells, bacterial cells, bacteriophage, yeast, fungus and other micro-organisms. Such vectors also include those for expression in vivo from transgenic animals and those for expression using in vitro systems such as in vitro translation using ribosome preparations.
For step (4), screening of libraries of composite human antibodies is usually for binding to one or more specific antigens of interest. There are many screening methods known to those skilled in the art, the selection of which will depend on the form of expression of the composite human antibodies and the composition of the antibody molecules i.e. complete antibody or Fab, Fv, SCA, single domain antibody etc. In some cases where an existing antibody is available which binds to the antigen of interest, either VH or VL from this antibody may be combined with the composite human VL or VH respectively and tested for binding.
Screening methods will range from immobilising individual members of the library or pools of such members on a solid phase to immobilising the antigen of interest either individually or in pools. Where antibodies are immobilised, the antigen of interest is then added and is either detectable directly or indirectly by addition of one or more additional reagents. For example, if the antigen is a fusion protein or conjugate with an enzyme such as alkaline phosphatase, detection can be achieved by subsequent addition from a wide range of substrates which produce colour, fluorescence or chemiluminescent signals. Where antibody pools are immobilised in one location (e.g. the well of a microtitre dish) and a signal results from addition of antigen, this pool can then be dereplicated prior to rescreening of either individual members of the pool or smaller pools. Where the antigen of interest is immobilised, the composite antibody library may be screened in several ways ranging from addition of individual antibodies to the antigen of interest which is immobilised at a specific location, to addition of pools of antibody, to addition of the whole composite library and subsequent recovery of antibodies bound to the antigen of interest. In the last case, a common strategy is to immobilise antigen on a solid phase such as in a column or on beads, to add the library, to subsequently wash the solid phase for example with a low salt buffer (to detach loosely associated members of the library), and to then elute antibodies which bind to the antigen using, for example, a high salt buffer. Common formats for expression of members of the library for this purpose are phage display, yeast display, ribosome display and bead display, in each case where nucleic acid encoding composite VH and VL chains remains attached to the composite V region which binds to the antigen.
Screening methods will also include functional or biological tests which may be substituted for direct antigen binding tests where a functional or biological activity is measured such as in vitro tests involving cell growth, cell growth inhibition, cell differentiation or cell migration, or alternative in vivo tests involving measuring responses to the antigen at the level of the whole organism, for example changes in blood cell counts in a mouse or growth inhibition of a transplanted tumour.
For step (5), following selection of one or more “lead” composite human antibodies with desirable properties such as binding to an antigen of interest, optionally the properties of the lead antibody may be improved, for example by increasing affinity for binding to the antigen or fusing the antibody to an additional moiety. Increased affinity may be achieved by mutagenesis of composite variable region sequences in order to select for mutations in the selected composite V region sequences which increase or alter binding in a desirable way. The present invention includes novel methods for mutagenesis of variable region sequence by replacing one or more individual V region sequence segments from the lead antibody with corresponding sequence segments from one or more human antibody sequences. In particular, segments overlapping with or within CDR region may be replaced by one or more alternative segments from other human antibodies including segments of different lengths. Within the scope of the invention, specific segments may be included from human antibodies with related properties to the selected lead antibody, for example from antibodies which bind to the same antigen, or from non-human antibodies with related properties, or from human antibodies with sequence segments which retain certain key amino acids which appear important for function in a non-human antibody with related sequence. One or more composite human antibodies subject to such mutagenesis can then be screened for improved properties.
For the optional step (6), following selection of a lead composite human antibody, T cell epitopes are limited or avoided by, where required, exchanging V region segments contributing to or encoding a T cell epitope with alternative segments which avoid T cell epitopes. Such T cell epitopes can be detected by a range of methods. For example, peptides corresponding to one or more loci in the composite V region sequence can be synthesised and tested in T cell assays to determine the presence of T cell epitopes. Typically such peptides will be 15 amino acids in length and, where it is desirable to test a longer contiguous V region sequence, overlapping peptides from the sequence such as 15mers with 12 amino acid overlaps are used. For detection of T cell epitopes, a range of different T cell assays can be used for measurement of activation or proliferation of CD4+ T cells such as those measuring cytokine release, proliferation (for example, by uptake of 3H-thymidine), Ca2+ flux, surface marker expression, gene transcription etc.
Alternatively, overlapping peptides corresponding to the composite V region sequences are analysed for binding to human MHC class II molecules either using in vitro methods or in silico methods, in each case to determine potential T cell epitopes i.e. MHC binding peptides which may induce a T cell response. In silico methods will include methods involving modelling of peptide-MHC class II binding interactions, methods involving identification of motifs common for binding to MHC class II and methods using databases of peptides or specific amino acids within peptides with known in vitro MHC binding properties. Other methods can be used such as producing longer peptides from composite V region sequences or whole antibodies containing composite V region sequences and testing these in T cell assays or in MHC binding assays, for example by testing for MHC-peptide tetramers, or by searching the proposed or constructed sequences in a database of known human T cell epitopes. Avoidance of T cell epitopes in composite human V regions can also be assisted by avoidance of MHC class II binding motifs or avoidance of particular amino acids which anchor the binding of peptides to MHC class II. In the preferred method for avoidance of T cell epitopes from one or more lead composite human antibodies, in silico methods are initially applied to analyse the composite human antibody V regions for potential T cell epitopes and, where these are identified, new segments of human VH or VL sequence are introduced to avoid these epitopes and to avoid introduction of new T cell epitopes.
Following any such introduction of new human V region segments and rescreening of such modified lead composite human antibodies for desirable properties, one or more final lead composite human V region can then be further tested in human T cell assays either by testing overlapping peptides typically of 15 to 45 amino acids in length, for example 15mer peptides with 12 amino acid overlaps from the composite human V region sequences (whole V regions or parts thereof) or by testing whole composite human antibodies directly in human T cell assays. A final analysis using T cell assays for testing whole composite human antibody is preferred allowing for direct testing for T cell activation against the whole antibody.
In a second preferred method ‘B’ of the present invention, a library of composite human antibodies is generated to include desirable amino acids from one or more reference antibodies with desirable properties. This method involves 7 steps as follows;
In step (1), typical reference antibodies will be rodent, especially mouse, with properties and/or binding specificities which are desirable in a human form of antibody. Where one or more reference antibody V region sequences are available, these are analysed to determine sequences of the CDRs and to identify amino acids which might be important for the desirable properties of the antibody such as binding specificity. For a reference antibody, such analysis is performed, for example, by alignment of the reference V region sequences with other sequences of the same species and also, if the reference antibody is non-human, human V region sequences. Such alignments are performed, for example, using the program CLUSTAL (Thompson et al., Nucleic Acids Res. 22 (1994) p 4673-80). Such alignments can identify unusual or rare amino acids in the V region of the reference antibody and homologous V region families. In addition; conserved V region structures such as canonical structures of the CDRs can be identified using, for example, the Protein Data Bank (Berman et al.: The Protein Data Bank, Nucleic Acids Research, 28 (2000) 235-242). In addition, the reference antibody variable regions can be modelled, where a structure is not known, using modelling software such as MODELLER (SalI and Blundell, J. Mol. Biol. 234 (1993) p 779-815) and, in some cases, models of antibody-antigen interactions can be generated. Such analyses of the reference antibody V regions are used to guide on selection of segments of human V region sequence for the composite human antibody.
For step (2), having determined amino acids which might be important for the desirable properties of the composite human antibody, segments of human V region sequences are then selected to include some or all of these amino acids. A library of composite human V region sequences is thereby designed including selected segments with typically one or more alternative human V region segments at particular loci where the effect of such segments on properties of the composite human antibody is uncertain. Such composite human antibody sequences can be further analysed as with the reference antibody(s) by alignment with other human antibody sequences and conserved structures and, in addition, further modelling of the structure of composite human antibody V regions can be undertaken in order to refine, as required, the combinations of human V region segments used in the composite human antibodies to avoid defects in protein structure, intermolecular and intramolecular interactions within composite V regions, and incorrect structural orientations of important amino acids.
For the optional step (3), as an additional criteria for selection of segments, those segments or combinations of segments which limit or avoid T cell epitopes in the final composite human V regions are selected. T cell epitopes are analysed by the methods described in method A, step (6) above using in silico or in vitro methods, preferably by use of in silico methods at the stage of designing composite human V region sequences.
For step (4), having designed a library of composite human sequences, composite VH and VL genes are then generated preferably using synthetic oligonucleotides. Typically, synthetic oligonucleotides encoding longer segments of V region sequence will ligated to a mixture of oligonucleotides which encode alternative segments of sequence to generate different members of the library of composite human V regions. Alternatively, each member of the library of composite human V regions will be generated separately using oligonucleotides encoding the sequence of the specific human V region. Alternatively, composite V regions can be assembled by other methods such as overlapping PCR or other amplification techniques using existing human VH and VL genes as templates or using one or more reference antibody V region genes as template.
Steps (5) and (6) for method B are as described in method A, steps (4) and (5).
Optional step (7) will be employed as in method A, step (6) where further avoidance of T cell epitopes is required in the lead composite human antibody(s). A final analysis using T cell assays for testing whole composite human antibody is preferred allowing for direct testing for T cell activation from the whole antibody.
It will be understood to those skilled in the art that, in addition to methods A and B, there will be other methods for creating and testing composite human antibodies and for optimising the properties of such antibodies. Composite human antibodies of the present invention are new and, as a result of the total human origin of the V regions, should be less immunogenic in humans than other antibodies containing non-human sequences. Additional optional features of composite human antibodies, namely the avoidance of T cell epitopes and/or the addition of Tr epitopes, may also contribute to lower immunogenicity. It will be understood by those skilled in the art that the object of lower immunogenicity may be achieved using less preferred composite antibodies containing V regions without all human sequence segments, for example composite human antibodies including segments at sequence positions in the composite antibody different from their sequence positions in the source human antibody, composite antibodies with only partial incorporation of segments of human V region sequence, composite antibodies with segments of non-human sequence, or composite antibodies with human sequence which has been mutated, for example to increase binding affinity to an antigen or to avoid a T cell eptiope.
It will be understood that V region sequence segments and their combinations within composite human antibodies might be selected to meet a range of criteria including the optional avoidance of T cell epitopes as above. For example, segments of human V region sequence and combinations thereof can be selected for avoidance of B cell epitopes and other epitopes such as MHC class I-restricted epitopes, for avoidance of amino acid sequences which might be deleterious to expression of composite antibodies, for avoidance of sequences which might direct inappropriate modification of composite antibodies such as N-glycosylation, for inclusion of certain functions such as inclusion of helper T cell epitopes and/or B cell epitopes (for example, in vaccine applications), for subsequent conjugation to other moieties such as one or more surface lysine residues, and for a range of other criteria.
It will also be understood by those skilled in the art that, in addition to human, composite antibodies with V region segments derived from other species either wholly or in part can be generated and should be considered within the scope of the invention. For example, for studies in mice, composite mouse antibodies can be generated comprising V region sequence segments wholly or partly of mouse origin.
The present invention also applies to proteins other than antibodies whereby, for therapeutic use, such proteins (herein termed “composite proteins”) combine two or more segments of amino acid sequence from a human protein within the final protein molecule.
Thus, in a further aspect, the present invention provides a modified protein having improved immunogenicity through insertion of one or more segments of amino acid sequence.
In relation to proteins, the term “segments” refers to contiguous amino acid sequence found within a protein molecule, such segments ranging in size from 2 to 250 amino acids long. For therapeutic use, composite proteins of the present invention will typically combine two or more segments of amino acid sequence from different human proteins within the composite protein. In particular, the present invention relates to composite proteins with insertions composed entirely of segments of sequence from two or more human proteins. Where human proteins exist with homology to the composite protein or with homologous regions to regions of the composite protein, segments of human protein sequence at sequence positions in the composite protein sequences corresponding to their sequence positions in the source human protein may be used, for example amino acids 1 to 10 in the composite protein sequence will derive from amino acids 1 to 10 in a source human protein. Alternatively, segments of human protein sequence may be positioned in the composite protein at any sequence location in the composite protein irrespective of the sequence position in the source human protein. The source human proteins will be any existing human protein amino acid sequence, for example as provided in databases of human protein sequences, and may include sequences from naturally mutated or rearranged forms of the human protein and other variations differing from germ-line, sequences from artificially constructed human-derived proteins and sequences derived from human genes or RNA whether the corresponding proteins are expressed or not.
In a preferred embodiment of this aspect of the present invention, composite proteins for therapeutic use are constructed by combining or inserting human protein sequence segments in combinations which limit or avoid human T cell epitopes in the final composite protein. A preferred aspect of the invention as applied to composite proteins is to modify an existing reference protein such as a non-human protein by insertion of human protein sequence segments in order to limit or avoid T cell epitopes in the final composite protein.
In a preferred method of the present invention for generation of composite proteins, a library of composite human proteins is generated to include desirable amino acids from one or more reference proteins with desirable properties such as an absence of T cell epitopes. This method involves 7 steps as follows;
In step (1), typical reference proteins will be non-human with properties which are desirable in a composite protein. For therapeutic application, typically the reduction or elimination of immunogenicity in the composite protein will be an objective. Where one or more reference protein sequences are available, these are analysed to identify amino acids which might be important for the desirable properties of the protein. In addition, any known structure of the reference protein can be analysed or, alternatively, a structure modelled using modelling software. Where homologues of the reference protein are available, either interspecies or intraspecies, these can be sometimes be used to determine relationships between sequence differences and differences in properties between homologues. Where the protein interacts with another molecule, models of this interaction can sometimes be generated and amino acids important for the interaction determined. As an optional addition to step 1, the sequence location of T cell epitopes in the reference protein are determined, in particular using in vitro human T cell assays as detailed for composite human antibodies above. Alternatively, in silico methods for analysing T cell epitopes can be used. Such analyses of the reference proteins are used to guide on segments of human protein sequence selected for the composite protein. For composite proteins where a reduction or elimination of immunogenicity compared to a reference protein, especially non-human, is the objective, commonly one or more human sequence segments corresponding to locations of T cell epitopes will be used in the composite protein in combination with segments of sequence from the reference protein from other locations without T cell epitopes.
For step (2), having determined amino acids which might be important for the desirable properties of the composite protein, segments of protein sequences are then selected to include some or all of these amino acids. A library of composite human protein sequences is thereby designed including selected segments with typically one or more alternative human protein segments at particular loci where the effect of such segments on properties of the composite protein is uncertain. Such composite protein sequences can be further analysed as with the reference protein by alignment with any homologues or by modelling of the structure of composite proteins or by other analyses in order to refine, as required, the combinations of human protein segments used in the composite human proteins to avoid defects in protein structure and incorrect structural orientations of important amino acids.
For the optional step (3), as an additional criteria or only criteria for selection of segments, those segments or combinations of segments which limit or avoid T cell epitopes in the final composite proteins are selected. T cell epitopes are analysed by the methods described for composite human antibodies above using in silico or in vitro methods.
For step (4), having designed a library of composite proteins, composite protein genes are then generated preferably using synthetic oligonucleotides. Typically, synthetic oligonucleotides encoding longer segments of protein sequence will ligated to a mixture of oligonucleotides which encode alternative segments of sequence to generate different members of the library of composite proteins. Alternatively, each member of the library of composite proteins will be generated separately using oligonucleotides encoding the sequence of the specific composite protein. Alternatively, composite proteins can be assembled by other methods such as overlapping PCR or other amplification techniques using existing human protein genes as templates or using one or more reference protein genes as template.
For step (5), screening of libraries of composite proteins is usually for one or more desirable properties of the composite protein. There are many screening methods known to those skilled in the art, the selection of which will depend on the form of expression of the composite proteins and the protein function. Screening methods will range from immobilising individual members of the library or pools of such members on a solid phase, to screening member of the library in solution phase, to immobilising another molecule with which the composite protein is designed to interact by binding either individually or in pools. Screening methods may also include functional or biological tests where a functional or biological activity is measured such as in vitro tests involving cell growth, cell growth inhibition, cell differentiation or cell migration, or alternative in vivo tests involving measuring responses to the composite protein at the level of the whole organism, for example changes in blood cell counts in a mouse or growth inhibition of a transplanted tumour.
For step (6), following selection of one or more “lead” composite proteins with desirable properties, optionally the properties of the lead protein may be improved, for example by increasing the specific activity of an enzyme or by increasing the binding of a protein ligand to a receptor. An improvement in properties may be achieved by mutagenesis of composite protein sequences in order to select for mutations which alter properties of the composite protein in a desirable way. The present invention includes novel methods for mutagenesis of a protein sequence by replacing one or more individual protein sequence segments from the protein with sequence segments from one or more human protein sequences. One or more composite proteins subject to such mutagenesis can then be screened for improved properties.
For the optional step (7), following selection of a lead composite protein, T cell epitopes are limited or avoided by, where required, exchanging protein sequence segments contributing to or encoding a T cell epitope with alternative segments which avoid T cell epitopes. Such T cell epitopes can be detected by a range of methods. For example, peptides corresponding to one or more loci in the composite protein can be synthesised and tested in T cell assays to determine the presence of T cell epitopes. Typically such peptides will be 15 amino acids in length and, where it is desirable to test a longer contiguous sequence, overlapping peptides from the sequence such as 15mers with 12 amino acid overlaps are used. Alternatively, overlapping peptides corresponding to the composite protein sequences are analysed for binding to human MHC class II molecules either using in vitro methods or in silico methods, in each case to determine potential T cell epitopes i.e. MHC binding peptides which may induce a T cell response. In silico methods will include methods involving modelling of peptide-MHC class II binding interactions, methods involving identification of motifs common for binding to MHC class II and methods using databases of peptides or specific amino acids within peptides with known in vitro MHC binding properties. Other methods can be used such as producing longer peptides from composite protein sequences or whole composite proteins and testing these in T cell assays or in MHC binding assays on antigen presenting cells. Avoidance of T cell epitopes in composite proteins can also be assisted by avoidance of MHC class II binding motifs or avoidance of particular amino acids which anchor the binding of peptides to MHC class II. In the preferred method for avoidance of T cell epitopes from one or more lead composite proteins, in silico methods are initially applied to analyse the composite protein for potential T cell epitopes and, where these are determined, new segments of human protein sequence are introduced to avoid these epitopes and to avoid introduction of new T cell epitopes. Following such introduction of new human segments if required to avoid T cell epitopes and rescreening for modified lead composite proteins for desirable properties, one or more final lead composite proteins can optionally tested in human T cell assays either by testing overlapping peptides typically of 15 to 45 amino acids in length, for example 15mer peptides with 12 amino acid overlaps from the composite protein sequences (whole proteins or parts thereof) or by testing whole composite proteins directly in human T cell assays. A final analysis using T cell assays for testing whole composite protein is preferred allowing for direct testing for T cell activation from the whole protein.
It will be understood to those skilled in the art that there will be other methods for creating and testing composite proteins and for optimising the properties of such proteins. Composite proteins of the present invention are new and, where used for therapeutic purposes, the human origin of some or all protein sequence segments should render the composite protein less immunogenic in humans than other comparable or non-human reference proteins containing non-human sequences. Additional optional features of composite proteins, namely the avoidance of T cell epitopes and/or the addition of Tr epitopes, may also contribute to lower immunogenicity. It will be understood by those skilled in the art that the object of lower immunogenicity may be achieved using composite proteins without all human sequence segments and may also include composite proteins with human sequence segments which have been mutated to eliminate a T cell eptiope or segments of non-human protein homologous to the reference protein. It will be understood that protein segments and their combinations within composite proteins might be selected to meet a range of criteria including the optional avoidance of T cell epitopes. For example, segments of human protein sequence and combinations thereof can be selected for avoidance of B cell epitopes and other epitopes such as MHC class I-restricted epitopes, for avoidance of amino acid sequences which might be deleterious to expression of composite proteins, for avoidance of sequences which might direct inappropriate modification of composite proteins such as N-glycosylation, for inclusion of certain functions such as inclusion of helper T cell epitopes and/or B cell epitopes (for example, in vaccine applications), for subsequent conjugation to other moieties, and for a range of other criteria.
It will also be understood by those skilled in the art that, in addition to human, composite proteins with sequence segments derived from other species either wholly or in part can be generated and should be considered within the scope of the invention. For example, for studies in mice, composite proteins including mouse protein sequence segments can be generated. It will also be understood that composite proteins can include protein sequence segments from one species combined with other protein sequence segments from homologous proteins within the same species. For example, the invention will include construction of plant type I RIPs (ribosome inhibitory proteins) where a RIP is assembled using sequence segments from the numerous plant type I RIP sequences available. Such composite RIPs would be assembled by introducing combinations of sequence segments which would retain RIP activity and, if for use in humans, would include avoidance of human T cell epitopes in the final composite sequence.
As in the case of antibodies, the invention includes the option of further modifications to the composite protein sequences by random, semi-random or directed mutagenesis of the composite protein to achieve further improvement in one or more other properties of the final protein. It will be understood that the invention is particularly suitable to producing proteins with low immunogenicity when used in humans or used by humans such as proteins for pharmaceutical use, or proteins for use in food, detergents, cosmetics and other consumer items where allergic responses are limited or eliminated by use of compositions of the present invention. It will be understood that the invention is particularly suitable to producing proteins with low allergenicity in humans especially by producing proteins with allergy associated T cell epitopes removed or replaced by non-allergy associated epitopes (e.g. TH2 for TH1 T cell-inducing epitopes) and/or by addition of Tr epitopes to suppress immune responses in allergic individuals. It will be understood that the invention is particularly suitable to producing proteins with reduced inflammatory properties in humans especially by producing proteins with inflammation associated T cell epitopes removed or replaced by non-inflammation associated epitopes (e.g. TH1 for TH2 T cell-inducing epitopes) and/or by addition of Tr epitopes to suppress inflammatory responses.
As discussed herein, the modified/composite proteins and antibodies of the invention are useful in treating disease and exhibit less immunogenicity. Thus, in yet a further aspect, the present invention provides a pharmaceutical formulation comprising a modified antibody, antigen-binding fragment or protein as defined in any one of claims 1 to 18, optionally together with one or more pharmaceutically acceptable excipients, carriers or diluents.
The compositions of the invention may be presented in unit dose forms containing a predetermined amount of each active ingredient per dose. Such a unit may be adapted to provide 5-100 mg/day of the compound, preferably either 5-15 mg/day, 10-30 mg/day, 25-50 mg/day 40-80 mg/day or 60-100 mg/day. For compounds of formula I, doses in the range 100-1000 mg/day are provided, preferably either 100-400 mg/day, 300-600 mg/day or 500-1000 mg/day. Such doses can be provided in a single dose or as a number of discrete doses. The ultimate dose will of course depend on the condition being treated, the route of administration and the age, weight and condition of the patient and will be at the doctor's discretion.
The compositions of the invention may be adapted for administration by any appropriate route, for example by the oral (including buccal or sublingual), rectal, nasal, topical (including buccal, sublingual or transdermal), vaginal or parenteral (including subcutaneous, intramuscular, intravenous or intradermal) route. Such formulations may be prepared by any method known in the art of pharmacy, for example by bringing into association the active ingredient with the carrier(s) or excipient(s).
Pharmaceutical formulations adapted for oral administration may be presented as discrete units such as capsules or tablets; powders or granules; solutions or suspensions in aqueous or non-aqueous liquids; edible foams or whips; or oil-in-water liquid emulsions or water-in-oil liquid emulsions.
Pharmaceutical formulations adapted for transdermal administration may be presented as discrete patches intended to remain in intimate contact with the epidermis of the recipient for a prolonged period of time. For example, the active ingredient may be delivered from the patch by iontophoresis as generally described in Pharmaceutical Research, 3(6), 318 (1986).
Pharmaceutical formulations adapted for topical administration may be formulated as ointments, creams, suspensions, lotions, powders, solutions, pastes, gels, sprays, aerosols or oils.
For applications to the eye or other external tissues, for example the mouth and skin, the formulations are preferably applied as a topical ointment or cream. When formulated in an ointment, the active ingredient may be employed with either a paraffinic or a water-miscible ointment base. Alternatively, the active ingredient may be formulated in a cream with an oil-in-water cream base or a water-in-oil base.
Pharmaceutical formulations adapted for topical administration to the eye include eye drops wherein the active ingredient is dissolved or suspended in a suitable carrier, especially an aqueous solvent.
Pharmaceutical formulations adapted for topical administration in the mouth include lozenges, pastilles and mouth washes.
Pharmaceutical formulations adapted for rectal administration may be presented as suppositories or enemas.
Pharmaceutical formulations adapted for nasal administration wherein the carrier is a solid include a coarse powder having a particle size for example in the range 20 to 500 microns which is administered in the manner in which snuff is taken, i.e. by rapid inhalation through the nasal passage from a container of the powder held close up to the nose. Suitable formulations wherein the carrier is a liquid, for administration as a nasal spray or as nasal drops, include aqueous or oil solutions of the active ingredient.
Pharmaceutical formulations adapted for administration by inhalation include fine particle dusts or mists which may be generated by means of various types of metered dose pressurised aerosols, nebulizers or insufflators.
Pharmaceutical formulations adapted for vaginal administration may be presented as pessaries, tampons, creams, gels, pastes, foams or spray formulations.
Pharmaceutical formulations adapted for parenteral administration include aqueous and non-aqueous sterile injection solutions which may contain anti-oxidants, buffers, bacteriostats and solutes which render the formulation isotonic with the blood of the intended recipient; and aqueous and non-aqueous sterile suspensions which may include suspending agents and thickening agents. The formulations may be presented in unit-dose or multi-dose containers, for example sealed ampoules and vials, and may be stored in a freeze-dried (lyophilized) condition requiring only the addition of the sterile liquid carrier, for example water for injections, immediately prior to use. Extemporaneous injection solutions and suspensions may be prepared from sterile powders, granules and tablets.
Preferred unit dosage formulations are those containing a daily dose or sub-dose, as herein above recited, or an appropriate fraction thereof, of an active ingredient.
It should be understood that in addition to the ingredients particularly mentioned above, the formulations may also include other agents conventional in the art having regard to the type of formulation in question, for example those suitable for oral administration may include flavouring agents.
The following examples should not be considered limiting for the scope of the invention. The figures and tables relate to the examples below and are as follows;
FIG. 1/2—Sequence of VH (
FIG. 3—Inhibition of proliferation of human SK-BR-3 cells after 8 days incubation with chimeric 4D5 IgG1/kappa, Composite Human anti-HER2 antibody and epitope avoided anti-HER2 “EACHAB” with chimeric anti-IgE control (see example 4)
FIGS. 4/5—Sequence of VH (
FIGS. 6/7—Sequence of VH (
FIG. 8—Sequence of VH and VL genes used for Composite Mouse anti-human TNFα antibody including avoidance of human T cell epitopes
FIG. 9—ELISA for binding to human TNFα by Composite Mouse and chimeric anti-human TNFα antibody
FIG. 10—V region sequences of anti-TNFα antibody A2
FIG. 11—Sequences of composite human anti-TNFα VH variants
FIG. 12—Sequences of composite human anti-TNFα VL variants
FIGS. 13/14—Oligonucleotides for construction of chimeric mouse:human anti-TNFα VH (
FIGS. 15/16—Oligonucleotides for construction of primary composite human anti-TNFα VH (
FIG. 18—WEHI-164 protection Assay for composite human anti-TNFα antibodies
FIG. 19—Time-Course human T cell assay of lead composite human anti-TNFα
FIG. 20—Activity of composite bouganin molecules with inserted human sequence segments
Tables 1-3—CDRs used in Composite Human antibody scFv library comprising 186×9 residue-long VH CDR3s (table 1), 77×8 residue-long VL CDR3s (table 2), and 153×10 residue-long VL CDR3s (table 3).
Table 4—Human sequence segments used for primary composite human anti-TNFα VH and VL variants
Table 5—Activity of composite human anti-TNFα variants
Table 6—Immunogenic peptide sequences of bouganin and replacement human segments in bouganin variants
For creation of a human variable region sequence segment library, amino acid sequences from a range of human immunoglobulins were collected into a single database comprising the in silico human variable region sequence library including heavy (VH) and light (VL) chain variable region sequences. Sources of sequences included the NCBI Igblast database (www.ncbi.nih.gov), Kabat databases (Kabat et al., Sequences of Proteins of Immunological Interest, NIH publication 91-3242, 5th ed. (1991) (and later updates)), Vbase (www.mrc-cpe.cam.ac.uk/imt.doc), Genbank (Benson et al., Nucl. Acids Res. 25 (1997) p 1-6 or via www.bioinf.org.uk/abs) databases. The reference antibody variable region sequences used was a humanised anti-HER2 antibody known as Herceptin® (Carter et al., Proc. Nat. Acad. Sci. USA, vol 89 (1992) p 4285, U.S. Pat. No. 5,821,337). Segments from the in silico human variable region sequence library were selected for identity to the corresponding amino acids in the Herceptin® variable region sequence and combined to produce the composite human VH and VL sequences as shown in
Recombinant DNA techniques were performed using methods well known in the art and, as appropriate, supplier instructions for use of enzymes used in these methods. Sources of general methods included Molecular Cloning, A Laboratory Manual, 3rd edition, vols 1-3, eds. Sambrook and Russel (2001) Cold Spring Harbor Laboratory Press, and Current Protocols in Molecular Biology, ed. Ausubel, John Wiley and Sons. Detailed laboratory methods are also described in example 7 below. Composite human VH and VL sequences corresponding to Herceptin® were created using, for each chain, eight synthetic oligonucleotides of 30-60 amino acids in length encoding the entire composite human VH and VL sequences. In parallel, as a control reagent, a chimeric form of the mouse monoclonal antibody 4D5 (Hudziak et al., Mol. Cell. Biol., (March 1989) p 1165-1172)), was also created using eight synthetic oligonucleotides per chain. Separate VH and VL oligonucleotides were first phosphorylated, mixed at equal molar ratios, heated to 94° C. for 5 min in a thermal cycler followed by cooling to 65° C. and incubation at 65° C. for 2 min. Incubations were then continued at 45° C. for 2 min., 35° C. for 2 min., 25° C. for 2 min and 4° C. for 30 min. Oligonucleotides were then ligated using T4 DNA ligase (Life Technologies, Paisley UK) at 14° C. for 18 hours.
To each of the VH and VL oligonucleotide mixtures, additional oligonucleotides encoding a 5′ flanking sequence, including a Kozak sequence, the leader signal peptide sequence and the leader intron, and 3′ flanking sequence, including the splice site and intron sequence, were added and annealed as above. The Composite Human VH and VK and the 4D5 expression cassettes produced were cloned as HindIII to BamHI fragments into the plasmid vector pUC19 and the entire DNA sequence was confirmed. These were transferred to the expression vectors pSVgpt and pSVhyg which include human IgG1 (VH) or Kappa (VK) constant regions respectively and markers for selection in mammalian cells. The DNA sequence was confirmed to be correct for the Composite Human VH and VK and 4D5 VH and VK in the expression vectors.
The host cell line for antibody expression was NS0, a non-immunoglobulin producing mouse myeloma, obtained from the European Collection of Animal Cell Cultures, Porton, UK (ECACC No 85110503). The heavy and light chain expression vectors were co-transfected into NS0 cells by electroporation. Colonies expressing the gpt gene were selected in Dulbecco's Modified Eagle's Medium (DMEM) supplemented with 10% foetal bovine serum, 0.8 μg/ml mycophenolic acid and 250 μg/ml xanthine. Transfected cell clones were screened for production of human antibody by ELISA for human IgG. Cell lines secreting antibody were expanded and the highest producers selected and frozen down in liquid nitrogen. The modified antibodies were purified using Prosep®-A (Bioprocessing Ltd, Northumberland, UK). The concentration was determined by ELISA for human IgGK antibody.
The Composite Human antibody and chimeric 4D5 antibodies were tested for inhibition of proliferation of the HER2+ human breast tumour cell line SK-BR-3 in conjunction with a negative control non-Her-2 binding human IgG1/Kappa antibody exactly as described by Hudziak et al. (ibid). The results (
In order to test the epitope avoidance option in the invention, the sequences of the Composite Human heavy and light chain variable regions were analysed for non-self human MHC class II binders using Peptide Threading (www.csd.abdn.ac.uk/˜gjlk/MHC-thread). This software predicts favourable interactions between amino acid side chains of the peptide and specific binding pockets within the MHC class II binding groove. All overlapping 13mers from the Composite Human heavy and light chain variable sequences were threaded through a database of MHC class II allotypes and scored based on their fit and interactions with the MHC class II molecules. Peptides predicted to bind MHC class II were 13mers beginning at residues 16 and 67 in VH, and 9 and 44 in VL. As a result, new segments of the human variable region sequence library were chosen instead of those used in the Composite Human sequences of
T cell proliferation assays were carried out to compare the immunogenicity of the Composite Human anti-HER2 antibody, the EACHAB variant and the chimaeric 4D5 antibody (see example 1). These antibodies were prepared from NS0 cells grown in serum-free, animal derived component-free, protein-free medium, HyClone HyQ®ADCF-Mab™ (Hyclone Cat No: Cat no: SH30349) supplemented with HyQ®LS 1000 Lipid Supplement (Hyclone Cat No: SH30554) and sodium pyruvate (Gibco Cat No: 11360-039). After buffer exchange into 50 mM MES pH6 on a Sephadex G25 (PD10 column), the antibodies were each passed through a cation exchange column (Mono-S 10/10) and eluted with a sodium chloride gradient (0 to 0.5M). The antibody containing fractions were then applied to a Superdex 200 preparative column (XK16/60) run in PBS. Peak fractions were pooled and stored at 4° C. The antibody concentrations were determined by ELISA for human IgG.
Immunogenicity analysis was performed using PBMCs (peripheral blood mononuclear cells) that were isolated from healthy human donor blood and cryopreserved in liquid nitrogen. Each donor was tissue-typed using an Allsee™ PCR based tissue-typing kit (Dynal, Wirral, UK) and 20 healthy donors were selected according to individual MHC haplotypes. 2 ml bulk cultures containing 4×106 PBMC in AIM V (Invitrogen, Paisley, UK) were incubated in a 24 well tissue-culture plate with test peptides (5 μM final concentration) and proliferation was assessed on days 5, 6, 7, and 8 by gently resuspending the bulk cultures and transferring triplicate 100 μl samples of PBMC to a U-bottomed 96 well plate. Cultures were harvested onto glass fibre filter mats using a Tomtec Mach III plate harvester (Receptor Technologies, UK) and counts per minute (cpm) values determined by scintillation counting using a Wallac Microbeta TriLux plate reader (using a paralux high efficiency counting protocol). For each test antibody, the stimulation index (SI) was calculated as the ratio of counts per minute (cpm) of the test antibody:cpm of the negative control with SI>2 considered a significant T cell epitope response. The results showed that the chimaeric 4D5 antibody induced significant proliferative responses on at least one of the four days of proliferation tested (SI greater than 2) in five of twenty healthy donors tested (25%), the Composite Human anti-HER2 antibody induced SI>2 in three of twenty donors (15%) whilst the EACHAB anti-HER2 antibody induced SI>2 in none of twenty donors (0%). These results indicated an order of immunogenicity of chimeric 4D5>Composite Human anti-HER2>EACHAB anti-HER2 with the latter showing no evidence of immunogenicity in any donor blood sample tested.
A Composite Human antibody specific for sialylated Lewis Y antigen was constructed as described in example 1 using, as the reference antibody variable region sequences, the humanised 3S193 antibody (Scott et al.; Cancer Res., 60 (2000) p 3254-3261, U.S. Pat. No. 5,874,060). Segments from the in silico human variable region sequence library were selected for identity to the corresponding amino acids in the humanised 3S193 variable region sequence and combined to produce the Composite Human VH and VL sequences as shown in
A Composite Human Anti-IgE antibody was constructed as described in example 1 using, as the reference antibody variable region sequences, the humanised anti-IgE antibody known as Xolair® (Presta et al., J. Immunol., 151(5) (1993) p 2623-2632). Segments from the in silico human variable region sequence library were selected for identity to the corresponding amino acids in the Xolair® variable region sequence and combined to produce the Composite Human VH and VL sequences as shown in
The specificity of the Fabs was further characterized by surface plasmon resonance (BIAcore 2000, Biacore AB, Uppsala, Sweden). Recombinant human IgE Fab was produced as described by Flicker et al., J. Immunol., 165 (2000) p 3849-3859. Test antibodies were purified and immobilized onto flow cells of a CM chip using a NHS/EDC kit (Biacore) to obtain 2010 RU for chimeric anti-IgE and 2029 RU for Composite Human anti-IgE. 10 and 25 nM recombinant human IgE Fab in Hepes-buffered saline (10 mM Hepes, 3.4 mM EDTA, 150 mM NaCl, 0.05% (v/v) surfactant P20, pH 7.4) was passed over the test antibodies at a flow rate of 5 μl/min for 10 minutes. The results showed that for both 10 and 25 nM IgE Fab, an equivalent SPR (surface plasmon resonance) curve was detected for the chimeric anti-IgE and the Composite Human anti-IgE antibodies showing that the latter had successfully achieved binding efficiency equivalent to the reference anti-IgE antibody.
The strategy for initial construction of the human scFv (single-chain Fv) library was to construct seven consensus human VH and four consensus human VL (kappa) genes as detailed in Knappik et al., J. Mol. Biol., 296 (2000) 57-86 and to clone into these a large number of VH and VL CDR3 segments from databases of human variable regions. This list of CDR3s is shown in table 1 for VH CDR3s, table 2 for VL CDR3s of 8 amino acids and table 3 for VL CDR3s of 10 amino acids. For the master VH and VL construction, 6 overlapping synthetic oligonucleotides encoding VH and VL up to the end of framework 3 were synthesised as detailed by Knappik et al., ibid, and subjected to recursive PCR (Prodromou and Pearl, Protein Engineering, 5 (1992) 827-829). These were ligated into EcoRV digested pZero-1 vector (Invitrogen, Paisley, UK). For addition of CH1 and C kappa, both initially with 4D5 CDR3s (Carter et al, Bio/Technology, 10 (1992) 163-167), the protocol of Knappik et al., ibid, was followed except that the VH-CH1 SapI-EcoRI and VL-C kappa NsiI-SphI fragments were both blunt-end cloned into EcoRV digested pZero-1.
For insertion of CDR3s, single oligonucleotides encoding each of the CDR3s of table H from the plus strand were synthesised with 12 homologous nucleotides added to each termini for annealing to the consensus VH and VL genes. In addition to these CDR sequences, CDRs from the antibody E25 (see example 4) were included. These primers were extended and secondary primers were added to introduce directly adjacent to the N and C termini of the VH and VL genes (without C regions) 5′NotI-3′XbaI sites for VH and 5′SpeI-3′BamHI for VL. Prior to cloning, a further pair of complimentary primers was used to insert the linker sequence (Gly4Ser)3 between VH and VL whilst maintaining XbaI and SpeI sites. Full-sized VH-linker-VL fragments were digested with NotI and BamHI and were cloned into NotI-BamHI digested pBluescript II KS(+/−) (Stratagene, Amsterdam, Netherlands).
Individual Bluescript clones were picked, plasmid DNA was purified and dispensed robotically into 96 well plates as described in WO99/11777. DNAs were then subjected to IVTT including tRNA-biotinyl-lysine and further robotically arrayed onto a streptavidin surface as described in WO99/11777. The immobilised initial scFv library of 10,000 independent clones was then screened by incubation with recombinant human IgE Fab (see example 4). Wells were blocked with PBS/3% BSA at room temperature for 1 hour, washed three times in PBS and treated with 5 ug/ml human IgE Fab in PBS/3% BSA for 1 hour. Wells were then washed a further three times in PBS and treated with 5 ug/ml alkaline phosphatase-labelled chimeric anti-IgE (example 4) in PBS/3% BSA for 1.5 hrs. Wells were further washed five times in PBS and colour developed using the substrates 5-bromo-1-chloro-3-indolyl phosphate and nitro blue tetrazolium (Roche Molecular) for visualization. A strong signal observed at a frequency of 1 of 9600 wells was shown to derive from a VH and VL pair both containing E25 CDR3's.
A mouse variable region sequence library was created as described in example 1 for the human library using NCBI Igblast, Kabat and Genbank databases. The reference antibody variable region sequences used was a chimeric anti-TNFα antibody known as Remicade® (Le et al., U.S. Pat. No. 6,277,969) using the variable regions of the mouse cA2 antibody. Segments from the in silico mouse variable region sequence library were selected partly corresponding amino acids in the Remicade® variable region but including variations designed to avoid human T cell epitopes in the sequence in the form of non-self human MHC class II binders measured as in example 1. Composite mouse VH and VL sequences compared to sequences used in the chimeric antibody are shown in
The Composite Mouse and chimeric anti-TNFα antibodies were generated as described in example 1. Comparison of purified antibodies for binding to immobilised human TNFα in a standard ELISA (described in WO 03/042247A2) showed that the Composite Mouse antibody retained the full binding capacity of the chimeric anti-TNFα antibody (
The reference mouse variable region heavy and light chain sequences of antibody A2 directed against human TNFα was obtained from U.S. Pat. No. 5,656,272 (
Human V region sequence segments that included as many of these residues as possible were selected (table 4) to create full-length VH and VL sequences. Alterations were made to these sequences to include all the identified structurally important residues to create sequences to serve as a template for epitope avoidance and Composite Human Antibody design. A preferred sequence for each composite VH and VL was designed to include important residues from the reference mouse antibody. These variable heavy and light chain amino acid sequences are shown in
The composite heavy and light chain variable region sequences were scanned for the presence of potential T cell epitopes using a variety of in silico methods (e.g. Propred [http://imtech.res.in/raghava/propred/index.html], Peptide Threading [www.csd.abdn.ac.uk/˜gjlk/MHC-thread], SYFPEITHI (www.syfpeithi.de), MHCpred (www.jenner.ac.uk/MHCPred/) and compared to homologous human germ-line framework region sequences in conjunction with reference mouse CDRs.
The following heavy chain variable region variants were made (see
SEQ. ID. No. 6 contains the following changes with respect to SEQ. ID. No. 3: T82aN+R83K+D82bS
SEQ. ID. No. 7 contains the following changes with respect to SEQ. ID. No. 3: T82aN+R83K+D82bS+V23A.
SEQ. ID. No. 8 contains the following changes with respect to SEQ. ID. No. 3: T82aN+R83K+D82bS+V23A+V78A
The following light chain variable region variants were made (see
SEQ. ID. No. 10 contains the following changes with respect to SEQ. ID. No. 4: I10T+N103R+S80A.
SEQ. ID. No. 11 contains the following changes with respect to SEQ. ID. No. 4: I10T+N103R+S80A+N41D.
For construction of a control chimeric antibody, the nucleotide sequences that translate to give SEQ. IDs No. 1 and No. 2 were constructed using a series of overlapping 40mer synthetic oligonucleotides. The V region sequences were flanked by additional 5′ and 3′ sequences to facilitate cloning into mammalian expression vectors. The sequences of the oligonucleotides are shown in
Oligonucleotides were purchased from Sigma-Genosys (Poole, UK) and resuspended at a concentration of 100 μM. 1 μl of each of the heavy chain sense strand oligonucleotides, except the most 5′ oligonucleotide, were mixed together and 1.50 (approx. 1 μg) of the mix was treated with Polynucleotide Kinase (PNK, Invitrogen, Paisley UK) in a 200 reaction containing additionally: 20 μl× PNK buffer, 2 μl 10 mM ATP, 14 μl H2O, 0.5 μl (5 units) PNK. The reaction was incubated at 37° C. for 30 min and the enzyme inactivated by heating at 70° C. for 20 min. The heavy chain antisense, light chain sense and antisense oligonucleotides were similarly phosphorylated. The 5′ oligonucleotide from each set was diluted 1 in 9 with H2O and 1.5 μl added to the appropriate reaction mix. Each reaction was then diluted to 0.5 ml and spin dialyzed in an Amicon microcon YM3 concentrator for 90 min at 8000 rpm until the volume was not more than 44 μl.
The sense and antisense mixes for the heavy chain, and those for the light chain, were combined and made up to 88 μl with H2O. 10 μl 10× Ligase Chain Reaction (LCR) buffer and 2 μl Pfu ligase (8 units, Stratagene, Cambridge UK) were added to each reaction and incubated as follows in a programmable heating block: 94° C. for 4 min, then 60° C. for 3 min for 1 cycle followed by 20 cycles of 94° C. for 39 sec. then 60° C. for 2 min. Finally the reactions were incubated for 5 min at 60° C. 10 μl of each LCR was run through a 1% agarose gel stained with ethidium bromide and compared to 1 Kb ladder markers (Invitrogen). A smear of ligated DNA was observed in each lane, surrounding a faint specific band of approximately 400 bp.
The heavy and light chain LCRs were amplified via PCR using as primers SEQ. ID. No.s 12 and 22 for the heavy chain and SEQ. ID. No.s 33 and 43 for the light chain. The following were included in each reaction: 5 μl LCR, 5 μl 10× Expand HiFi buffer (Roche, Lewes UK), 1 μl 10 mM NTP mix, 0.25 μl each primer (from 100 μM stocks), 0.5 μl Expand HiFi polymerase (3 units, Roche) and 38 μl H2O. The reactions were cycled as follows: 94° C. 2 min followed by 20 cycles of 94° C. for 30 sec, 60° C. for 30 sec and 72° C. for 30 sec. Finally the reaction was incubated for 5 min at 72° C. The yield and specificity of the reaction was confirmed by agarose gel electrophoresis, as above. Specific, sharp bands at approximately 400 bp were observed for each reaction.
The reaction products were purified using a Qiagen PCR purification kit and each eluted in 30 μl H2O. The heavy chain product was digested in a standard reaction with restriction enzymes Mlu 1 and Hind III and the light chain product was digested with BssH II and BamH I. The reaction products were again purified using a Qiagen PCR purification kit and each eluted in 30 μl H2O.
The light chain expression vector pANT08 was based upon a pAT153 backbone and contains in the following order: CMV immediate/early enhancer promoter −590 to +7, a 30 nt 5′ UTR derived from a highly expressed mouse antibody light chain RNA, a mouse consensus light chain signal sequence incorporating a BssH II restriction site near the variable region start codon, a short linker (in place of a variable region) to a human composite intron continuing 33 nt from the variable region splice site to a BamH I restriction site followed by a fragment of human genomic DNA containing 343 nt of the intron preceding the human constant Kappa (CK) region gene, the CK gene and CK polyA.
The heavy chain expression vector pANT09 was similar to pANT08 through the promoter region, which is followed by: a 62 nt 5′ UTR derived from the heavy chain counterpart of that described above, a mouse heavy chain consensus signal sequence that incorporates a Mlu I restriction site near the variable region start codon, a short linker (in place of a variable region) to the variable region splice site immediately followed by a fragment of human genomic DNA from a Hind III restriction site located in the intron 211 nt upstream of the CH1 gene, to the end of the CH region poly A site. This fragment includes the CH1, hinge, CH2 and CH3 introns and exons of human IgG1. This vector also included a gene for dihydrofolate reductase, controlled by an SV40 promoter and polyA signal, for resistance to methotrexate.
2 μg each vector was digested with the relevant restriction enzymes in standard reactions in a total volume of 30 μl. Each reaction was run through a 1% agarose gel, as above, and the vector specific bands (6.0 Kbp heavy chain and 4.2 Kbp light chain) were excised from the gel and purified using a Qiagen gel extract kit and eluted in 30 μl H2O.
1 μl each digested vector was ligated to 3 μl of the corresponding digested variable gene PCR product using a Ligafast kit (Promega, Southampton UK). 2.5 μl each ligation reaction was transformed into sub-cloning efficiency competent XL1-blue (Stratagene), as instructed by the manufacturer, and plated onto LB agar plates containing 100 μg/ml ampicillin and incubated overnight at 37° C. Ten bacterial colonies from each ligation were inoculated into 10 ml 2× YT broth containing 100 μg/ml ampicillin and grown overnight at 37° C. with shaking. Plasmid was purified from 1.5 ml each overnight culture using a Qiagen plasmid preparation kit and each eluted in 50 μl H2O. The plasmids were sent to a contract sequencing facility and sequenced with a standard CMV promoter primer and clones with the correct V region gene sequence identified.
For construction of Composite Human Antibodies, the nucleotide sequences that translate to give SEQ. IDs No. 3 and No. 4 were constructed using a series of overlapping 40mer synthetic oligonucleotides. The sequences of the oligonucleotides are shown in
The two purified fragments were joined in a PCR reaction using oligonucleotide primers SEQ. ID. No.s 53 and 63 using PCR conditions as described above, except that the template used was 1 μl 295 bp product and 1 μl 126 bp product, hence the amount of H2O was reduced to 410. The joined PCR product of 396 bp was purified using a Qiagen PCR purification kit and was eluted in 30 μl H2O.
The nucleotide sequence that translates to give SEQ. ID. No. 6 was constructed via overlap PCR using oligonucleotide primers SEQ. ID. No.s 96 and 97 (
The nucleotide sequence that translates to give SEQ. ID. No. 7 was constructed via overlap PCR using oligonucleotide primers SEQ. ID. No.s 98 and 99 (
The nucleotide sequence that translates to give SEQ. ID. No. 8 was constructed via overlap PCR using oligonucleotide primers SEQ. ID. No.s 100 and 101 (
Each of the above PCR products was digested with Mlu I and Hind III and ligated into similarly digested pANT09. The ligations were transformed and plated, colonies picked, plasmids prepared and sequenced all as described above.
The nucleotide sequence that translates to give SEQ. ID. No. 9 was constructed via PCR using oligonucleotide primers SEQ. ID. No.s 102 and 103 (
The nucleotide sequence that translates to give SEQ. ID. No. 10 was constructed via overlap PCR using oligonucleotide primers SEQ. ID. No.s 104 and 105 (
The nucleotide sequence that translates to give SEQ. ID. No. 11 was constructed via overlap PCR using oligonucleotide primers SEQ. ID. No.s 106 and 107 (
Each of the above PCR products was digested with BssH II and BamH I and ligated into similarly digested pANT08. The ligations were transformed and plated, colonies picked, plasmids prepared and sequenced all as described above.
CHO-K1 cells (ATCC# CCL-61) were propagated in high glucose DMEM containing 10% FCS, L-glutamine, sodium pyruvate and L-proline. Near confluent cultures were harvested for transfection using Lipofectamine 2000 as instructed by the manufacturer (Invitrogen). Transfections were done in 48 well plates seeded with 200 μl cells at 3×105 cells/ml using a total of 0.5 μg plasmid DNA comprising 0.3 μg heavy chain construct and 0.2 μg light chain construct.
Transfections were incubated at 37° C./5% CO2 for 48 to 72 h before harvesting the supernatants. Antibody expression was quantified by ELISA using: a mouse monoclonal anti-human IgG capture antibody, human IgG1/Kappa standards and HRP conjugated goat anti-human Kappa light chains as detection antibody (all reagents from Sigma).
All combinations of heavy and light chains were transfected (i.e. 6 heavy chains×5 light chains=30 transfections). Antibody expression levels were generally in the range of 0.5 to 2.0 μg/ml, however no expression was observed with heavy chain SEQ. ID. No. 8.
The expressed antibodies were tested for their ability to neutralize the activity of human TNFα using TNF-sensitive WEHI-164 cells (Espevik et al., J. Immunol. Methods 1986, 95, 99-105). Cells were plated in 1 μg/ml actinomycin D at 5×104 cells per well in 96-well microtiter plates for 3-4 hours. Cells were exposed to 40 pM human TNFα and varying concentrations of the chimeric antibody (range 1 ng/ml to 500 ng/ml) to create a standard curve. The various combinations of heavy and light chains were tested at a single concentration point of 25 ng/ml that had previously been determined as the ED50 of the chimeric antibody. All assays were done in triplicate.
The mixtures were incubated overnight at 37° C. Cell viability was determined by adding 3-[4,5-dimethyl-thiazol-2-yl]-2,5-diphenyltetrazoliumbromide dye (MTT) to a final concentration of 0.5 mg/ml, incubating for 4 hours at 37° C., lysing the cells in 0.1M HCl, 0.1% SDS and measuring the optical density at 550 nm wavelength.
The optical densities from the heavy and light chain combinations were used to calculate the apparent antibody concentrations from the standard curve. The apparent concentration of the chimeric was divided by that of each of the variant combinations to give a fold difference value. Values lower than that for the chimeric indicated that those combinations were more effective at protecting the cells from TNFα cytotoxicity, whereas higher values indicated that they were less effective. The values for all combinations are shown in Table 5.
The following Composite Human Antibody heavy and light chain combinations gave fold differences close to 1.0: SEQ. ID. No.s 5/10, SEQ. ID. No.s 7/4, SEQ. ID. No.s 7/9, SEQ. ID. No.s 7/10. These combinations were selected for further study.
The expression plasmids carrying the sequences selected above were transfected into NS0 cells (ECACC No. 85110503). The cells were grown in high glucose DMEM containing L-glutamine, sodium pyruvate, 5% ultra low IgG FCS and pen/strep. Cells were harvested during log phase of growth, spun down and resuspended at 5×106 cells/ml in fresh growth media. 750 μl cells were mixed with a total of 30 μg of each plasmid pair, which had been linearised with Ssp I, in 50 μl H2O. The cell/plasmid mixture was transferred to a 4 mm gap cuvette and electroporated using an Equibio Easyject Plus at 250V, 1500 μF, infinite resistance. The electroporate was immediately transferred to 25 ml pre-warmed growth media and plated out in 5×96 well flat bottomed plates at 100 μl/well. The plates were incubated at 37° C./5% CO2. 48 h post-electroporation, 50 μl media containing 300 nM methotrexate was added to each well to give a final concentration of 100 nM. 7 days post-electroporation a further 50 μl of media containing 100 nM methotrexate was added to each well.
Approximately 2 week post-electroporation, the media in some wells began to turn yellow, indicating transfected colony growth. Media from these wells were tested for antibody expression using the anti-human IgG Fc capture/anti-human Ig Kappa light chain HRP conjugate detection ELISA. The test samples were compared to a human IgG1/Kappa standard and antibody expression levels estimated. Colonies expressing useful amounts of antibody were expanded in media containing 200 nM methotrexate.
Antibodies were purified from 500 ml culture media via protein A affinity chromatography followed by size exclusion chromatography using Sephacryl S200.
The purified antibodies were quantified by UV absorbance at 280 nm, assuming that OD2801=1.4 mg/ml.
Purified chimeric and composite antibodies were tested for activity via the WEHI-164 protection assay described in example 4 above. Each antibody was tested over the full concentration range previously used to create the standard curve (see
Therefore since Composite Human Antibody 7/10 was predicted to have the most favourable MHC class II binding profile and was the most active variant, this was selected for testing in a time course T cell proliferation assay. Human PBMCs were prepared from buffy coats derived from human blood donations via two rounds of Ficoll density centrifugation. The prepared PBMC were resuspended at a density of 3×107 cells/ml in 1 ml aliquots in 90% human AB serum/10% DMSO, and stored under liquid nitrogen. PBMC were tissue typed using a Dynal Allset® PCR typing kit.
The lead Composite Human Antibody was compared to the chimeric antibody in whole protein T cell assays using human PBMC from 20 healthy donors. PBMC from each donor were thawed, washed and resuspended in AIM V serum free lymphocyte growth media. On day 1, 50 μg protein was added to 2 ml bulk cultures of 4×106 PBMC in 24 well plates, and triplicate 100 μl aliquots were removed and transferred to 96 well plates on days 6 to 9. Each aliquot was pulsed with 75 μl media containing 1 μCi tritiated thymidine for 24 h, before harvesting and measuring incorporation of radioactivity. Results were normalised by calculation of the Stimulation Index (SI). Coverage of a wide range of HLA DR allotypes was achieved by selecting donors according to individual MHC haplotypes.
The results of the time-course assay are shown in
Composite variants of the plant type I Ribosome Inhibitory Protein (RIP) bouganin (derived from Bougainvillea spectabilis) were generated using methods described in WO2005090579. The location of T cell epitopes in bouganin was tested by analysis of overlapping 15mer peptides as in WO2005090579 and the peptides of SEQ ID 11-13 in table 6 (corresponding to residues 121-135, 130-144 and 148-162) were identified as epitopes. Bouganin was cloned from leaf tissue from a Bougainvillea spectabilis plant. mRNA was extracted using a polyA Tract System 1000 kit (Promega) from 100 mg tissue as instructed by the manufacturer. cDNA was synthesised from the mRNA template using an AccessQuick RT-PCR system (Promega) with the following primers: ATGTACAACACTGTGTCATTTAAC and TTATTTGGAGCTTTTAAACTTAAGGATACC. The first primer additionally contains an ATG start codon and the second primer additionally contains a TAA stop codon. The PCR product was cloned using a T/A cloning system (pGEM T Easy, Promega) and several clones were sequenced to identify a correct clone orientated with the transcription direction of the T7 promoter contained within the vector.
121AKVDRKDLELGVYKL135
130LGVYKLEFSIEAIHG144
148NGQEIAKFFLIVIQM162
A series of variants were made containing the human sequence segments identified as shown in table 6. These were constructed using overlap PCR with a high fidelity polymerase (Expand Hi-Fi, Roche). The 5′ and 3′ primers were as above and the PCR products were cloned into the T/A cloning vector, as above, and correct clones identified that were orientated with the transcription direction of the T7 promoter. Clones were assayed for activity in a coupled transcription and translation reaction that included a control DNA expressing a luciferase gene (Luciferase T7 Control, Promega). Since bouganin is a ribosome inactivating protein, it significantly reduces the levels of translation of the luciferase gene and this reduction is conveniently assayed using a luciferase detection system such as Steady-Glo (Promega). Purified wild type or mutant bouganin plasmids were linearised with Not I and diluted to 10 ng/μl. Luciferase T7 Control DNA was diluted to 125 ng/μl. 1 μl each DNA was mixed with 10 μl TnT mix (Promega), 0.25 μl Methionine and 0.25 μl nuclease free water (supplied in TnT kit). Controls were wt bouganin and Luciferase T7 Control only. Reactions were undertaken in triplicate and incubated for 1 hour at 30° C. 5 μl each reaction was transferred to a black walled 96 well luminometer plate and mixed with 45 μl water and 50 μl Steady-Glo reagent. Luminescence was read in a Wallac Microbeta Trilux luminometer. Activity was expressed as a percentage of the luminescence observed from the Luciferase T7 Control DNA alone.
Peptides containing the human sequence segments within the active COMB variant corresponding to residues 121-135, 130-144 and 148-162 were synthesised and compared to the corresponding wild type peptides in a time-course T cell assay with human PBMCs from 20 healthy donors as described in example 7. The results showed that peptides containing human sequence segments induced no T cell proliferation in any donor at any time point whilst each of the wild type peptides induced proliferation with SI>2 in >10% of all donors for at least one time point.
Composite variants of the thrombin inhibitor hirudin (derived from Hirudo medicinalis) were generated using methods described in WO2004113386 using the protein with SEQ ID No 14 in table 7 as wild type. The location of T cell epitopes in hirudin was tested by analysis of overlapping 15mer peptides as in WO2004113386 and the peptide 27-41 CILGSDGEKNQCVTG was shown to give a significant T cell response with human PBMCs from 20 healthy donors. The human sequence segment KCRH from human melanoma-associated antigen (AAN40505.1) was used to replace the hirudin residues at 26-29 using overlap PCR as in example 8 resulting in a variant hirudin molecule with 28/29IL changed to 28/29RH which retained full activity of the wild type hirudin using assays described in WO2004113386. The modified peptide 27-41 CRHGSDGEKNQCVTG was tested together with the wild type peptide 27-41 CILGSDGEKNQCVTG in T cell assays as in example 8 demonstrating the loss of T cell epitope activity in the modified peptide.
VH and VL genes from the Composite Human Anti-IgE antibody of example 4 were cloned according to standard polymerase chain reaction (PCR) methods from Orlandi et al., ibid into separate plasmid vectors as templates for a VL- and VH-specific PCR using oligonucleotide primer pairs. Overlapping complementary sequences were introduced into the PCR products that combined during the subsequent fusion PCR to form the coding sequence either of a 20 amino acid (G4S1)4 linker or, alternatively, a 20 amino acid sequence GGSNNLSCLTIPASANNGGS containing a 10 amino acid Tr epitope from the hepatitis C core protein (P19, MacDonald et al., Journal of Infectious Diseases, 185 (2002) p 720-727) flanked each side by two asparagines residues and a GGS triplet. This final amplification step was performed with primer pairs for subsequent cleavage with the restriction enzymes EcoRV and BspE1 and cloning into the bluescript KS vector (Stratagene). Dimeric forms of the Composite Human anti-IgE single chain antibodies (scFvs) were then constructed by the method of Mack et al., Proc Natl Acad Sci USA., 92 (1995) p 7021-7025. The dimeric VL-linker-VH-VL-linker-VH fragment was subcloned into the EcoRI/SalI sites of the expression vector pEF-DHFR. (Mack et al., ibid) and transfected into DHFR-deficient Chinese hamster ovary (CHO) cells by electroporation. Selection, gene amplification, and protein production were performed as described by Mach et al., ibid). The dimeric scFv's were purified via the C-terminal histidine tails by affinity chromatography on a nickel-nitrilotriacetic acid (Ni-NTA) column (Qiagen) to give dimeric Fvs designated CHABIgEG4S1x4 ((G4S1)4 linker between VL and VH) and CHABIgEHCVP19 (HCV Tr epitope between VL and VH).
Dimeric scFvs were subsequently tested in human T cell assays at 50 μg/ml exactly as described by Hall et al., Blood 100 (2002) p 4529-4536 using PBMCs from 20 healthy donors. The results showed no significant proliferation of T cell for either CHABIgEG4S1x4 or CHABIgEHCVP19 but showed a significant level of IL-10 production (SI>2) from 4 out of 20 donors stimulated with CHABIgEHCVP19 but not with CHABIgEG4S1x4 (SI>2 in 0 of 20 donors). This demonstrates the effect of a Tr epitope included within the antibody molecule for the induction of the immunosuppressive cytokine IL-10.
Number | Date | Country | Kind |
---|---|---|---|
0502201.7 | Feb 2005 | GB | national |
0503190.1 | Feb 2005 | GB | national |
0506945.5 | Apr 2005 | GB | national |
This application is a continuation of U.S. Ser. No. 11/815,507, filed Aug. 31, 2007, entitled HUMAN ANTIBODIES AND PROTEINS, which is the National Stage Entry of PCT/GB2006/000355, filed Feb. 3, 2006, the contents of each of which are hereby incorporated herein by reference.
Number | Date | Country | |
---|---|---|---|
Parent | 11815507 | Aug 2007 | US |
Child | 14324520 | US |