The present disclosure relates to methods and materials for determining an isoelectric point of a protein including, for example, a binding molecule such as an antibody. The isoelectric points may be used in methods for the preparation of proteins. The methods of the present disclosure may be used for selecting and utilizing a buffer for purification of a protein, preparing a protein formulation, purifying a protein and/or stabilizing a protein in solution.
The isoelectric point (pL) of a molecule is the pH at which it has no net electrical charge. Biological molecules such as proteins are comprised of amino acids which may be positive, negative, neutral or polar in nature, and together give a protein its overall charge. At a pH below its pl, a protein carries a net positive charge while at a pH above its pl it carries a net negative charge. Lack of charge may have certain consequences on a protein. For example, proteins are often minimally soluble in water or buffers near their pl, which can lead to difficulties in the purification and/or formulation of therapeutics (Mosavi et al. (2003) Protein Engineering 16(10):739-745) and often precipitate out of solution.
The pl of a protein may be determined mathematically by several methods of calculation including, for example by using the Henderson-Hasselbalch equation. The pl of a protein may be computed by this equation by taking into account the acid-dissociation constant (pKa) of nine different chemical groups, including the side chains of seven amino acids, aspartic acid, glutamic acid, lysine, histidine, arginine, tyrosine and cysteine as well as the amino and carboxy terminal amino acid residues of the protein. Alternatively, the pl of a protein may be determined experimentally using isoelectic focusing. For example, when a protein is in a pH region below its isoelectric point (pl), it will be positively charged and so will migrate towards a cathode. As it migrates, however, the charge will decrease until the protein reaches the pH region that corresponds to its pl. At this point it has no net charge and so migration ceases. As a result, the proteins become focused into sharp stationary bands with each protein positioned at a point in the pH gradient corresponding to its pl. The technique is capable of extremely high resolution with proteins differing by a single charge being fractionated into separate bands. However, isoelectric focusing, although accurate in its determination of a protein's pl, may be time consuming and require laboratory resources making it not practical for widespread use. In contrast, pl values calculated mathematically can be determined quickly but may not make accurate predications of a protein's pl. Accordingly, improved methods are desired for the determination of a protein's pl that may have an accuracy more similar to isoelectric focusing but that are mathematically based.
The present disclosure relates to methods and materials for determining isoelectric points of proteins including, for example, binding molecules such as an antibodies. The isoelectric points may be used in methods for the preparation of proteins. The proteins may be prepared, for example, by identifying surface exposed amino acid residues in a sequence of amino acid residues of the protein; assigning a pKa value to the surface exposed amino acid residues; calculating the isoelectric point (pl) of the protein from the pKa values assigned to the surface exposed amino acid residues; preparing the protein by at least one of: selecting a buffer with a pH not equal to the calculated isoelectric point of the protein and utilizing the selected buffer for purification of the protein; and preparing a formulation of the protein with a pH not equal to the calculated isoelectric point of the protein.
The present disclosure provides methods for determining an isoelectric point of a protein by identifying surface exposed amino acid residues in a sequence of amino acid residues of the protein, assigning a pKa value to the surface exposed amino acid residues, and calculating the isoelectric point (pl) of the protein from the pKa values assigned to the surface exposed amino acid residues.
The present disclosure provides methods for selecting and utilizing a buffer for purification of a protein by identifying surface exposed amino acid residues in a sequence of amino acid residues of the protein, assigning a pKa value to the surface exposed amino acid residues, calculating an isoelectric point for the protein from the pKa values assigned to the surface exposed amino acid residues, selecting a buffer with a pH not equal to the calculated isoelectric point of the protein and utilizing the selected buffer for purification of the protein.
The present disclosure also provides methods of preparing a protein formulation by identifying amino acid residues that are exposed on the surface of the protein in a sequence of amino acid residues of the protein, assigning a pKa value to the surface exposed amino acid residues, calculating an isoelectric point for the protein from the pKa values assigned to the surface exposed amino acid residues, and preparing the formulation with a pH not equal to the calculated isoelectric point of the protein.
The present disclosure also provides method for purifying a protein from a heterogeneous population of proteins and/or other non-protein molecules and/or other contaminants by identifying amino acid residues that are exposed on the surface of the protein in a sequence of amino acid residues of the protein, assigning a pKa value to the surface exposed amino acid residues, calculating an isoelectric point (pl) for the protein from the pKa values assigned to the surface exposed amino acid residues, and utilizing the calculated pl to isolate the protein from the heterogeneous population of proteins.
The present disclosure also provides methods for stabilizing a protein in solution by identifying amino acid residues that are exposed on the surface of the protein in a sequence of amino acid residues of the protein, assigning a pKa value to the surface exposed amino acid residues, calculating an isoelectric point for the protein from the pKa values assigned to the surface exposed amino acid residues, preparing a formulation with a pH not equal to the calculated isoelectric point of the protein, and placing the protein in the prepared formulation.
The present disclosure also provides methods for determining an isoelectric point of a protein, the method comprising: receiving data indicative of a sequence of amino acid residues of the protein via an input device of a computing device; identifying surface exposed amino acid residues in the sequence of amino acid residues; assigning a pKa value to the surface exposed amino acid residues; calculating the isoelectric point (pl) of the protein from the pKa values assigned to the surface exposed amino acid residues using the computing device; and transferring the isoelectric point to an output device associated with the computing device.
The present disclosure also provides methods for determining an isoelectric point of an antibody, the method comprising: receiving data indicative of a sequence of amino acid residues of the antibody via an input device of a computing device; identifying surface exposed amino acid residues in the sequence of amino acid residues by aligning the sequence of amino acids residues of the antibody to a second antibody sequence of amino acid residues that are fixed to an IsoX line and are assigned an isoX value as shown in
In some embodiments of any of the disclosed methods, the protein is a binding molecule such as an antibody or antibody fragment. In some embodiments of any of the disclosed methods, the antibody or antibody fragment is an IgG, a Fab or a scFv.
In some embodiments of any of the disclosed methods, the pKa values are assigned to the surface exposed amino acid residues by the system of EMBOSS, DTASelect, Solomon, Sillero, Rodwell, Patrickios or Wikipedia.
In some embodiments of any of the disclosed methods, all of the surface exposed amino acid residues are assigned a pKa value.
In some embodiments of any of the disclosed methods, the pl is calculated using the Henderson-Hasselbalch equation. In some embodiments of any of the disclosed methods, the pl is calculated using the method of EMBOSS, DTASelect, Solomon, Sillero, Rodwell, Patrickios or Wikipedia.
In some embodiments of any of the disclosed methods, the surface exposed amino acid residues are identified as those amino acid residues with an ASA value equal to or greater than 2. In some embodiments of any of the disclosed methods, the ASA values represent measured exposures for each amino acid residue. In some embodiments of any of the disclosed methods, the ASA values represent estimated exposures for each amino acid residue.
In some embodiments of any of the disclosed methods, the surface exposed amino acid residues are identified by aligning the sequence of amino acid residues of the antibody to a second antibody sequence of amino acid residues that are fixed to the “expo” line and are assigned an expo value as shown in
In some embodiments of any of the disclosed methods, the surface exposed amino acid residues are identified by aligning the sequence of amino acids residues of the antibody to a second antibody sequence of amino acid residues that are fixed to an IsoX line and are assigned an isoX value as shown in
In some embodiments of any of the disclosed methods, the buffer/formulation is used for pharmaceutical administration.
In some embodiments of any of the disclosed methods, the pH of the selected buffer/formulation is greater than the pl of the protein. In some embodiments of any of the disclosed methods, the pH of the selected buffer/formulation is less than the pl of the protein.
Additional features and advantages are described herein, and will be apparent from, the following Detailed Description and the Figures.
The present disclosure provides methods and materials for determining an isoelectric point for a protein including, for example, a binding molecule such as an antibody (e.g., an IgG, a Fab or a scFv). An isoelectric point determined by any of the disclosed methods or materials may be used in methods to prepare a protein, including an antibody. The protein may be prepared, for example, by identifying surface exposed amino acid residues in a sequence of amino acid residues of the protein; assigning a pKa value to the surface exposed amino acid residues; calculating the isoelectric point (pl) of the protein from the pKa values assigned to the surface exposed amino acid residues; preparing the protein by at least one of: selecting a buffer with a pH not equal to the calculated isoelectric point of the protein and utilizing the selected buffer for purification of the protein; and preparing a formulation of the protein with a pH not equal to the calculated isoelectric point of the protein. Surprisingly, it has been found that the pl of a protein calculated from amino acid residues located on the surface of a protein (referred to herein as “surface exposed amino acid residues) approaches the pl of the protein as determined by isoelectric focusing. Such methods may be used to determine the isoelectric point of a protein, select and utilize a buffer for purification of a protein, prepare a protein formulation, purify a protein from a heterogeneous population of proteins and/or stabilize a protein in solution.
Methods provided by the present disclosure may be used for determining an isoelectric point of protein including, for example, a binding molecule such as an antibody or binding fragment thereof by identifying amino acid residues that are surface exposed, assigning a pKa value to the surface exposed amino acid residues, and calculating the isoelectric point of the protein from the pKa values assigned to the surface exposed amino acid residues and optionally but preferably the pKa values assigned to amino and carboxy terminal amino acid residues. Amino acid residues that are surface exposed may be identified by determining their ASA value. Alternatively, amino acid residues that are surface exposed may be identified using the “expo” line as shown in
The present disclosure also provides methods for determining an isoelectric point of an antibody by identifying amino acid residues in the antibody with an ASA value equal to or greater than 2 as surface exposed, assigning a pKa value to the surface exposed amino acid residues, and calculating the isoelectric point (pl) of the antibody from the pKa values assigned to the surface exposed amino acid residues.
The present disclosure also provides methods for determining an isoelectric point of an antibody or binding fragment thereof by aligning the sequence of amino acid residues of the antibody to a second antibody sequence of amino acid residues that are fixed to the “expo” line and are assigned an expo value as shown in
The present disclosure also provides methods for determining an isoelectric point of an antibody or binding fragment thereof by aligning the sequence of amino acids residues of the antibody to a second antibody sequence of amino acid residues that are fixed to an IsoX line and are assigned an isoX value as shown in
The present disclosure provides methods for selecting and utilizing a buffer for purification of a protein including, for example, a binding molecule such as an antibody by identifying surface exposed amino acid residues in a sequence of amino acid residues of the protein, assigning a pKa value to the surface exposed amino acid residues, calculating an isoelectric point for the protein from the pKa values assigned to the surface exposed amino acid residues, selecting a buffer with a pH not equal to the calculated isoelectric point of the protein and utilizing the selected buffer for purification of the protein.
The present disclosure also provides methods of preparing a protein including, for example, a binding molecule such as an antibody formulation by identifying amino acid residues that are exposed on the surface of the protein in a sequence of amino acid residues of the protein, assigning a pKa value to the surface exposed amino acid residues, calculating an isoelectric point for the protein from the pKa values assigned to the surface exposed amino acid residues, and preparing the formulation with a pH not equal to the calculated isoelectric point of the protein.
The present disclosure also provides method for purifying a protein including, for example, a binding molecule such as an antibody from a heterogeneous population of proteins and/or other non-protein molecules and/or other contaminants by identifying amino acid residues that are exposed on the surface of the protein in a sequence of amino acid residues of the protein, assigning a pKa value to the surface exposed amino acid residues, calculating an isoelectric point (pl) for the protein from the pKa values assigned to the surface exposed amino acid residues, and utilizing the calculated pl to isolate the protein from the heterogeneous population of proteins.
The present disclosure also provides methods for stabilizing a protein including, for example, a binding molecule such as an antibody in solution by identifying amino acid residues that are exposed on the surface of the protein in a sequence of amino acid residues of the protein, assigning a pKa value to the surface exposed amino acid residues, calculating an isoelectric point for the protein from the pKa values assigned to the surface exposed amino acid residues, preparing a formulation with a pH not equal to the calculated isoelectric point of the protein, and placing the protein in the prepared formulation.
The present disclosure also provides methods for determining an isoelectric point of a protein including, for example, a binding molecule such as an antibody, the method comprising: receiving data indicative of a sequence of amino acid residues of the protein via an input device of a computing device; identifying surface exposed amino acid residues in the sequence of amino acid residues; assigning a pKa value to the surface exposed amino acid residues; calculating the isoelectric point (pl) of the protein from the pKa values assigned to the surface exposed amino acid residues using the computing device; and transferring the isoelectric point to an output device associated with the computing device.
The present disclosure also provides methods for determining an isoelectric point of an antibody, the method comprising: receiving data indicative of a sequence of amino acid residues of the antibody via an input device of a computing device; identifying surface exposed amino acid residues in the sequence of amino acid residues by aligning the sequence of amino acids residues of the antibody to a second antibody sequence of amino acid residues that are fixed to an IsoX line and are assigned an isoX value as shown in
In referring to a pH “not equal to” the calculated isoelectric point, the present disclosure contemplates that a range of pH values may be utilized which differ (e.g., greater than, less than) from the calculated isoelectric point. For example, a pH “not equal to” the calculated isoelectric point may represent a numerical difference in pH values (e.g., 6.5 versus 6.0), a functional difference in protein solubility (e.g., when selecting a buffer for purification of a protein and/or preparing a formulation of a protein), or preferably both. Preferably, the pH should differ from (e.g., not equal to) the calculated isoelectric point, so as to reduce or prevent aggregation or precipitation of the protein, such as for example in selecting a buffer for purification of the protein and/or preparing a formulation of the protein.
In some embodiments, the pH may be at least about 0.2 pH units, at least about 0.3 pH units, at least about 0.4 pH units, at least about 0.5 pH units, at least about 0.6 pH units, at least about 0.7 pH units, at least about 0.8 pH units, at least about 0.9 pH units, at least about 1.0 pH units, at least about 1.2 pH units, at least about 1.5 pH units, or at least about 2.0 pH units greater than or less than the calculated isoelectric point as disclosed herein. Alternatively or in addition, in some embodiments, the pH may be at least about 2%, at least about 3%, at least about 4%, at least about 5%, at least about 6%, at least about 7%, at least about 8%, at least about 9%, at least about 10%, at least about 12%, at least about 15%, or at least about 20% greater than or less than the calculated isoelectric point as disclosed herein.
The present disclosure provides novel methods for identifying one or more surface exposed amino acid residues including, for example, each surface exposed amino acid residue in a sequence of amino acid residues from a protein of interest (e.g., an antibody or binding fragment thereof, such as an IgG, Fab or scFv). Surface exposed amino acid residues may be identified by their ASA value, by using the “expo” line as shown in
An ASA value for each amino acid position in a protein may be used to identify those amino acid residues that are surface exposed (see, e.g., http://www.netasa.org/asaview/, referred to herein as “Netasa web server” and Ahmad et al. (2004) BMC Bioinformatics 5:51). Surface exposed amino acid residues may be identified as those amino acid residues with an ASA value equal to or greater than 2. ASA values for a protein may be viewed in the form of a bar graph as shown by the Netasa web server, in which a linear amino-acid sequence may be plotted along the horizontal axis, and the degree of solvent exposure for each residue represented by the height of a vertical bar, whose color-coding distinguishes the sidechain as nonpolar (e.g., grey) or polar (e.g., green) or negative (e.g., red) or positive (e.g., blue) or cysteine (e.g., yellow). These bar graphs may depict groups of exposed (e.g., tall bar) or buried (e.g., short bar) amino acid residues, as well as the linear distribution of polarity and charge. Additionally or alternatively, ASA values for a protein can be obtained in numerical form as a text-only file and exported to programs that allow manipulation of the data (e.g., Microsoft Word or Excel). ASA values may be represented as single digit (from “0” to “9”), corresponding to the “tens digit” of the exposure percentage (ranging from 0% to 100% exposed). Thus, for example, 37.1% exposure is coded as “3”, while 52.7% is coded as “5”. Note that 4.6% is coded as “0”, since it represents 04.6%. Also, to preserve the single-digit scheme, 100.0% is coded as “9”, since it is nearly equivalent to 99.9%.
When the crystal structure of the protein is known ASA values represent measured exposures for each amino acid residue (see, e.g., Ahmad et al. (2002) Bioinformatics 18:819-824). ASA values obtained from a crystal protein structure are obtained as “.asa” files (see, e.g., Table 1) from the Netasa web server and may be represented on a text line. Text lines may display information including, for example, information about the surface exposure of an amino acid residue in a protein. For example, on a text line such as “E83 27.2”, (“E”) is the one-letter amino-acid code for glutamic acid, (“83”) is the non-Kabat position number in the linear sequence and (“27.2”) is the ASA coefficient of surface exposure to solvent percent exposure of the residue's total surface area.
Alternatively, when a protein's crystal structure is not known, ASA values may represent estimated exposures for each amino acid residue based upon the statistical frequencies of various linear amino-acid fragments among a large group of crystallized proteins (Ahmad et al. (2003) Bioinformatics 19:1849-1851). ASA values obtained from a protein in which the crystal structure is unknown are obtained as “.rvp” files (see, e.g., Table 1) from the Netasa web server and may be represented on a text line. Text lines may display information including, for example, information about the surface exposure of an amino acid residue in a protein. For example, on a text line such as “83 E 27.2 47.6 E”, (“83”) is the non-Kabat position number in the linear protein sequence, (“E”) is the one-letter amino-acid code for glutamic acid, (“27.2”) is the ASA (RVP) statistical estimate of surface exposure to solvent, (“47.6”) is the AA2 value in square angstroms of the amount of exposed surface area and (“E”) is the one-letter category-designation for buried or exposed, based on a threshold percentage.
In certain cases, ASA view may provide an incorrect one letter amino acid code at a position in protein. For example, cysteine residues in the variable domain of an antibody may be represented as an amino acid other than a (C). Also, in some instances ASA view inserts amino acids (e.g., using two letter codes) at various amino acid positions in the protein sequence. Accordingly, it may be useful to manually edit the ASA test file before processing.
ASA values for proteins in which the complete three-dimensional structure is known may be calculated using programs such as ACCESS, DSSP, ASC, NACCESS, or GETAREA. Furthermore, the ASA values can also be obtained directly from the DSSP database, if the corresponding PDB code is known.
Surface exposed amino acid residues may also be identified by using the asa line as shown in
In those instances when a protein's crystal structure is not known, surface exposed amino acid residues may be determined by using the “expo” line of
In another exemplary method, when the crystal structure of a protein, is unknown, surface exposed amino acid residues in the protein may be identified by using the “IsoX” line as shown in
Non-conserved amino acid residues in an antibody or binding fragment of interest including, for example, complementarity determining regions (CDRs) or mutations, that do not match a corresponding residue in the sequence of amino acid residues fixed to the “IsoX” line may be considered as surface exposed. Alternatively, non-conserved amino acid residues in an antibody of binding fragment including, for example, complementarity determining regions (CDRs) or mutations, that do not match a corresponding residue in the sequence fixed to the IsoX line may be considered as buried. In some embodiments, amino acid residues from the antibody or binding fragment of interest that are in the CDRs and do not match a corresponding residue in the sequence fixed to the “IsoX” line may be considered as surface exposed while all other amino acid residue mismatches are considered as buried residues.
Without wishing to be bound by a theory of the invention, it is believed that the identification of surface exposed amino acid residues from the “IsoX” line are likely to be more precise than the ASA statistical estimates because they represent the conserved structural features of antibody molecules. However, when an antibody's crystal structure is known, the ASA-View coefficients may be more precise than the average conserved exposures represented by the “IsoX” line.
Moreover, surface exposed amino acid residues may be identified by using tables based on short peptide fragments (e.g., 3 to 5 amino acids in length) from proteins with known and well-characterized crystal structures (Ahmad et al. (2003) Genome Informatics 14:482-483). Table entries may contain the statistical frequencies of exposure or burial for the middle residue (“X”) in each short fragment (O-X-O or O-O-X-O-O), as a function of its close neighbors (“O”) on either side. Additionally, Fourier transform mass spectrometry may be employed to detect the reactivity of side-chain groups to chemical modification, such as acetylation of primary amines (Novak et al. (2004) J. Mass Spectrom. 39:322-328). Side chain that are more reactive to chemical modification may be indicated as exposed.
The isoelectric point of a protein, for example, an antibody such as an scFv may be calculated mathematically by using acid-dissociation constant (“pKa”) values assigned to certain individual amino acid residues.
In an exemplary method an isoelectric point for a protein may be determined by using nine different chemical groups, including the sidechains of seven amino acids and their amino and carboxy termini. These amino acids may include: cysteine (Cys, C), aspartic acid (Asp, D), glutamic acid (Glu, E), histidine (H is, H), lysine (Lys, K), arginine (Arg, R) and tyrosine (Tyr, Y). The pl of a protein may be computed using the Henderson-Hasselbalch equation which takes into account the logarithm of the pKa for each of the nine chemical groups. In a protein, each of the nine chemical groups may be present in zero or more copies (“N”) per molecule, all of which contribute proportionally to the final pl. Thus, for example, the Henderson-Hasselbalch contribution of lysine must be multiplied by NK=7 in a protein containing seven lysines.
An exemplary algorithm utilizes a formula for the total concentration of charges associated with each amino acid, both for anionic [A−] species (e.g., D, E, Y, C, or the carboxy terminus) and for cationic [HA+] species (e.g., K, H, R, or the amino terminus). The mathematical basis for algorithms for calculation of pl involves converting the Henderson-Hasselbalch equation from logarithmic to exponential form as shown below:
pKa=pH+log([HA]/[A−])
pKa=pH+log([HA+]/[A])
pKa=pH+log([HA]/[A−])
pKa=pH−log([A]/[HA+])
pKa=pH+log([HA]/[A−])
−pKa=−pH+log([A]/[HA+])
10̂(pKa−pH)=([HA]/[A−])
10̂(pH−pKa)=([A]/[HA+])
1+(10̂(pKa−pH))=1+[HA]/[A−])
1+(10̂(pH−pKa))=1+[A]/[HA+])
1+(10̂(pKa−pH))=(([HA]+[A])/[A])
1+(10̂(pH−pKa))=(([A]+[HA+])/[HA+])
Next, a separate equation may be set out for the total charge C contributed by N copies of each positive or negative amino-acid species:
C=−N[A
−]/([HA]+[A−])
C=+N [HA
+]/([A]+[HA+])
Rearranging this gives:
(([HA]+[A−])/[A−])=((−N)/C)
(([A]+[HA+])/[HA+])=((+N)/C)
Substituting this into the Henderson-Hasselbalch equation eliminates the references to concentrations:
1+(10̂(pKa−pH))=((−N)/C)
1+(10̂(pH−pKa))=((+N)/C)
Solving for the charge C gives:
C=−N/(1+(10̂(pKa−pH)))
C=N/(1+(10̂(pH−pKa)))
Finally, nine separate versions of these two equations are generated, each with a different chemical group represented by the subscript “i”—either anionic (e.g., D, E, Y, C, or carboxy) in the top equation, or cationic (e.g., K, H, R, or amino) in the bottom equation:
C
i
=−N
i/(1+(10̂(pKai−pH)))
C
i
=N
i/(1+(10̂(pH−pKai)))
The total charge T contributed by all nine species is:
T=C
D
+C
E
+C
K
+C
H
+C
R
+C
Y
+C
C
+C
amino
+C
carboxy
The sum T of all charges from all the different amino-acid species equals zero at the isoelectric point, which may be somewhere between pH 0 and pH 14. To begin the iterative process, a trial pH may be chosen in the middle at pH 7, and this value then plugged into the equation to determine whether the total charge T is positive or negative or zero at the trial pH. On the one hand, if this charge T is positive, then the pl must be greater than the trial pH. It must lie between the trial value (pH 7) and the highest untested value (pH 14), so a new trial pH is chosen in the middle at pH 10.5. On the other hand, if this charge T is negative, then the pl must be less than the trial pH. It must lie between the lowest untested value (pH 0) the and trial value (pH 7), so a new trial pH is chosen in the middle at pH 3.5. Each time this “binary search” cycle is repeated, the remaining range of possible untested pl values will be cut in half (or “bisected”), and the calculation will quickly converge to the correct pl value, when the total charge T finally becomes zero.
Computer programs may be employed to determine the pl of a protein (see, e.g., Sillero et al. (2006) Comput Biol Med. 36(2):157-66; Hennig (2001) Prep Biochem Biotechnol. 31(2):201-207; Ribeiro et al. (1991) Comput Biol Med. 21(3):131-141; Ribeiro et al. (1990) Comput Biol Med. 20(4):235-42; Tabb's DTASelect algorithm at “http://fields.scripps.edu/DTASelect/20010710-pl-Algorithm.pdf; and the QT4 version of the isoelectric point calculator at “http://isoelectric.ovh.org/files/isoelectric-point-windows.zip). Although most algorithms consider the protonation or deprotonation of each ionizable residue in isolation, others may account for the influence of the local chemical environment generated by neighboring residues in the primary sequence. For example, one method based on a 5000-peptide database takes into account the effect of adjacent amino acids on the pl value (see, e.g., Cargile et al. (2008) Electrophoresis 29(13):2768-2778.
Minor variations of the algorithm derived above include, for example, EMBOSS, DTASelect, Solomon, Sillero, Rodwell, Patrikios or Wikipedia. Such methods accept the linear amino-acid sequence of a protein, without utilizing any additional structural information (e.g., surface exposure) to direct their calculations. However, they disagree about the pKa values associated with the various amino acids and termini. PKa values assigned to the nine chemical groups by these methods are shown in Table 1.
Methods for assigning a pKa value to an amino acid residue may take into account the interaction between a particular residue and the local environment created by surrounding residues. For example, pKa values may be assigned to amino acid residues based on experimental pKa values determined in protein chains with known structures (He, et al. (2007) Proteins 69(1):75-82). Other methods for calculating the pKa values of ionizable groups in proteins may be based on a distance and position dependent screening of the electrostatic potential (see, e.g., Sandberg et al. (1999) Proteins 36(4):474-483). Additionally, methods based on experimental isoelectric points and amino acid compositional data may uses linear regression to estimate pKa values for ionizable alpha and beta positions of acidic or basic amino-acid residues (Patrickios et al. (1995) Anal Biochem. 231(1):82-91).
A flowchart of an example process 500 for displaying an isoelectric point associated with an amino acid sequence of an antibody is presented in
A computing device begins the example process 500 by receiving an alphabetic string indicative of an amino acid sequence (block 502). For example, a user may enter the alphabetic string using an input device such as a keyboard, or the user may retrieve the alphabetic string from a database, such as a database stored on the computing device or a network device (e.g., the IMGT germ line sequence database, the Kabat database, etc.). The amino acid sequence represented by the alphabetic string may include a variable region and/or a constant region of a heavy chain and/or a light chain of an antibody (e.g., an antibody or fragment thereof such as an IgG, a Fab or a scFv). In some embodiments, the alphabetic string may include a partial or full-length heavy and/or light chain of an antibody. In some embodiments, the alphabetic string may include a variable region of a heavy and/or light chain of an antibody. In some embodiments, the alphabetic string may include a variable region of a heavy chain and/or one or more constant regions of a heavy chain (e.g., CH1, CH2 and/or CH3) and/or a variable region of a light chain and/or a constant region of a light chain (e.g., CL) of an antibody. In some embodiments, the alphabetic string may include two full-length heavy chains and/or two full-length light chains of an antibody.
Once the computing device receives the alphabetic string indicative of the amino acid sequence, the computing device preferably displays an indication of surface exposure (block 504). For example, the computing device may display different symbols adjacent to the alphabetic string to indicate a level of surface exposure. In the example of
Finally, the computing device calculates the isoelectric point 614 associated with the amino acid sequence based on the surface exposure and transfers the isoelectric point 614 to an output device such as a display (block 506). For example, the computing device may identify which amino acids in the amino acid sequence are near a surface of the antibody and which amino acids are not near the surface of the antibody (e.g., based on the data used to display the surface exposure row 612 generated by block 504). The isoelectric point 614 of the amino acid sequence may then be calculated using only the amino acids that are at and/or near a surface of the antibody (e.g., a surface pl). For example, the isoelectric point 614 may be calculated using just the amino acids associated with an outward exposure as indicated by the “+” symbol in the surface exposure row 612. Alternatively, the isoelectric point 614 may be calculated using just the amino acids associated with a partial exposure as indicated by the “o” symbol in the surface exposure row 612. In yet another example, the isoelectric point 614 may be calculated using just the amino acids associated with an outward exposure and a partial exposure as indicated respectively by the “+” symbol and the “o” symbol in the surface exposure row 612.
Another screen shot 700 of an example user interface for displaying alphabetic strings and associated chemical property predictions is shown in
Yet another screen shot 800 of an example user interface for displaying alphabetic strings and associated chemical property predictions is shown in
The present disclosure provides methods and materials for determining an isoelectric point of a protein, which isoelectric point may be used for the preparation of a protein, including for purification (such as to select and utilize one or more buffers for purification of the protein) and/or for formulation. The methods may include one or more steps to purify the protein from a heterogeneous population of proteins and/or non-protein macromolecules (e.g., nucleic acids, endotoxin) and/or other contaminants. Such buffers may be used to stabilize a protein in solution.
A variety of methods are known in the art for purification of proteins, including, for example, purification of binding molecules such as antibodies and antibody fragments (see, e.g., Protein Purification: Principles, High-Resolution Methods, and Applications, 2nd Edition, 1997, Janson, J.-C., and Rydén. L. (Eds.), Wiley; Isolation and Purification of Proteins, 2003, Hatti-Kaul, R. and Mattiasson, B. (Eds.), CRC Press; Protein Purification Techniques: A Practical Approach, 2nd Edition, 2001, Roe, S. (Ed.), Oxford University Press; Huse et al., 2002, J. Biochem. Biophys. Methods 51:217-231; Low et al., 2007, J. Chromatography 848:48-63; Hober et al., 2007, J. Chromatography 848:40-47; Aldington et al., 2007, J. Chromatography 848:64-78). Purification methods may include one or more chromatographic purification steps, wherein a purification step may involve one or more buffers. Chromatographic purification steps may include, for example, Protein A chromatography, ion exchange chromatography (e.g., cation exchange, anion exchange), hydrophobic interaction chromatography, ceramic hydroxyapetite chromatography, affinity chromatography and/or size exclusion chromatography. Proteins subjected to purification may be “crude” preparations of protein (e.g., microbial or mammalian cell culture supernatants, cell lysates) or partially purified preparations of protein previously subjected to one or more purification steps. Optionally, crude preparations of protein may be subjected to one or more steps of clarification to remove cell debris (e.g., centrifugation, filtration) concentration (e.g., tangential flow filtration), and or treatment with a nuclease (e.g., benzonase) to digest nucleic acids.
Ion exchange chromatography involves one or more buffers and separates compounds, such as proteins, based on the nature and degree of their ionic charge. In the case of proteins, ion exchange chromatography generally involves the binding of a protein to a charged matrix or resin under conditions where other protein or non-protein contaminants (e.g., nucleic acids, endotoxin) are not bound, followed by elution of the protein from the charges of the resin. The ion exchanger may comprise, a cationic exchanger, such as for example, a sulphopropyl cation exchanger, a carboxymethyl cation exchanger, a sulfonic acid exchanger, a methyl sulfonate cation exchanger, an SO3-exchanger, or an ion exchanger such as for example, a DEAE, TMAE, and DMAE. Non-limiting examples of commercially available ion exchangers useful in the purification of proteins include DEAE-Sepharose Fast Flow, TSKgel SP-2SW, DEAE-Toyopearl 650S, TSKgel SuperQ-5PW, Q-Sepharose Fast Flow, TSKgel Q-STAT, Resource Q, TSKgel DNA-STAT, Mono Q, CM-Sepharose FF, TSKgel SP-STAT, CM-Toyopearl 650S, SP-Toyopearl 650S, S-Sepharose FF and the like. Protein A chromatography involves one or more buffers and involves the specific binding the Fc region of antibodies, but not most non-IgG contaminants, to immobilized protein A resin.
An important factor for binding of the protein in chromatographic purification steps such as Protein A and ion exchange chromatography is the pH of the buffer used to equilibrate and load the protein. Important factors for the elution are pH and/or ionic strength. Generally the selection of appropriate buffer conditions (e.g. pH) for use in purification will take into consideration the isoelectric point of the particular protein. Selection of a buffer pH that is the same as or very close to the isoelectric point of the protein may lead to undesirable aggregation or precipitation of the purified protein. Aggregation of proteins, including, for example, binding molecules such as antibodies and antibody fragments may be monitored, by a variety of methods, including as non-limiting examples by SEC-HPLC and/or light scattering measurement. In contrast, a buffer pH that is too different from the isoelectric point of the protein may not provide sufficient purification of the protein away from other protein or non-protein contaminants. Thus, it is important to accurately determine the isoelectric point of a protein in order to select and utilize a buffer for purification of the protein.
The present disclosure provide methods and materials for determining an isoelectric point of a protein, which isoelectric point may be used to select a pH for the preparation of a formulation of the protein. A variety of methods are known in the art for formulation of proteins, including, for example, where the proteins are binding molecules such as antibodies and antibody fragments (see, e.g., Protein formulation and delivery, 2nd Edition, 2007, McNally, E. J., and Hastedt J. E. (Eds.), Drugs and the Pharmaceutical Sciences Series, Vol. 175, Taylor & Francis, Inc.; Carpenter et al., 2002, Pharm Biotechnol. 13:109-33; Patro et al., 2002, Biotechnol. Annu. Rev. 8:55-84; Forkjaer et al., 2005, Nat. Rev. Drug Discov. 4:298-306; Wang, 1999, Int. J. Pharma. 185:129-188). For example, for liquid formulations an isoelectric point of the protein as determined by the methods described herein may be used to select a pH for the formulation. The pH of the formulation may be selected to be above or below the isoelectric point of the protein, so as to stabilize the protein (e.g., decrease protein aggregation and/or increase protein solubility.
This disclosure is further illustrated by the following examples which are provided to facilitate the practice of the disclosed methods. These examples are not intended to limit the scope of the disclosure in any way.
Calculated isoelectric points of proteins including, for example, an antibody may be determined as represented in
In an exemplary method, calculated isoelectric points of two exemplary antibodies, including a first antibody comprising a heavy chain (Genbank Accession No. CAC10540) and a kappa light chain (Genbank Accession No. BAC01559) and a second antibody comprising a heavy chain (Genbank Accession No. CAC10540) and a lambda light chain (Genbank Accession No. CAE18238) was determined. Each of the heavy, kappa and lambda chains (e.g., bottom string of amino acid residues in
While the present disclosure has been described and illustrated herein by references to various specific materials, procedures and examples, it is understood that the disclosure is not restricted to the particular combinations of materials and procedures selected for that purpose. Numerous variations of such details can be implied as will be appreciated by those skilled in the art. It is intended that the specification and examples be considered as exemplary, only, with the true scope and spirit of the disclosure being indicated by the following claims. All references, patents, and patent applications referred to in this application are herein incorporated by reference in their entirety.
This application claims priority to U.S. Provisional Application No. 61/138,408, filed on Dec. 17, 2008 and U.S. Provisional Application No. 61/138,411, filed on Dec. 17, 2008, each of which is hereby incorporated by reference in its entirety.
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/US09/68531 | 12/17/2009 | WO | 00 | 9/6/2011 |
Number | Date | Country | |
---|---|---|---|
Parent | 61138408 | Dec 2008 | US |
Child | 13140554 | US | |
Parent | 61138411 | Dec 2008 | US |
Child | 61138408 | US |