The present disclosure generally relates to the field of immunology, and more specifically to antibody analysis and assessment of the antibody repertoire.
Antibodies are one of the main effectors of the adaptive immune system. Their ability to bind an antigen has been harnessed in order to generate several tools for diagnostic, research and clinical applications. Antibodies are among the fastest developing biomolecules in clinical trials with a worldwide market estimated at 130.9 billion for 2020 and estimated to grow to 223.7 billion by the end of 2025 (https://www.marketdataforecast.com/market-reports/antibodies-market).
Antibodies are classified as glycoproteins. Antibodies are composed of four polypeptide chains: two identical copies of both a heavy (H, 55 kDa), and a light chain (L, 25 kDa), held together by a disulfide bridge. The basic profile is similar to a “Y” shape. Each polypeptide chain has a constant region, which is conserved across antibodies, and a variable region, which is specific to each antibody thus mainly responsible for conferring affinity to an antigen.
From an application point of view, antibodies can be divided into two main categories: monoclonal antibodies (mAbs), which are produced by a single type of plasma cell, and have the same heavy and light chain sequence and bind to a unique epitope on a given antigen. On the other hand, polyclonal antibodies (pAb) consist of a pool of different antibodies from various B cells recognizing both similar and different parts of the same antigens; those regions are named epitopes.
Antibodies are secreted by plasma cells, which are differentiated mature B lymphocytes (Pioli 2019). Plasma cells produce multiple antibodies in response to an antigen. In 1975, hybridoma technology was developed to immortalize individual plasma cells (Packer 2021). The antibodies produced by a population of homogeneous immortalized plasma cell, denoted as a clone, are called monoclonal antibodies (mAbs). In contrast, antibodies produced from plasma cells in an animal are called polyclonal antibodies (pAbs). Monoclonal antibodies have the same heavy (H) and light (L) chain sequence and bind to a unique epitope on a given antigen. On the other hand, pAbs recognize different parts or epitopes of the same antigen and are a mixture of antibodies having different sequences.
Monoclonal antibodies are developed and used for several biological applications, diagnostics and clinicals use such as treatments for autoimmune disease, cancer and infectious diseases to name a few. However, mAb therapeutics are simplistic versions of the much more complex native immune response, which involves several different antibodies targeting different epitopes (Wang et al 2013).
On the other hand, pAb treatments are available for specific cases such as rabies. Treatments are based on either the use of human rabies immunoglobulin (HRIG), or a cheaper and safe horse version (ERIG). Both are obtained from pooled sera of human donors or horses vaccinated against rabies. Other issues associated with HRIG usage include potential health risks, batch-to-batch variability, and limited global supply in certain regions of the world or under of an unexpected mass exposure. Bakker et al. showed that recapitulation of a pAb mixture by combining a few different monoclonal antibodies targeting different epitopes of the rabies virus glycoproteins could be efficient against different street rabies viruses and against escape mutants of the virus (Bakker, 2005).
Choosing to generate or work with mAbs or pAbs is dictated by different factors, including application type, production cost and time, and technical expertise. Both mAbs and pAbs have different advantages and disadvantages. Polyclonal antibodies can be generated more rapidly than mAbs with less demanding technical skills as all that is required is animal being inoculated with a target antigen and adjuvant. In addition, pAbs are heterogenous in nature, which ensures they can recognize a given epitope under different conformations or with small changes pAbs are also more flexible in term of buffer use and epitope changes. Furthermore, pAb generated from different animals will exhibit variation in their affinity and drastic variation within an animal can be expected.
Unlike pAbs, mAbs are homogeneous, and therefore have low batch-to-batch variability, but their high specificity can limit their use in some cases; for instance, a small change in the structure of the epitope can drastically affect antibody-antigen affinity. Moreover, because mAbs are generated from identical immortalized B-cells, often by fusing a B cell and a myeloma cell, their production can be sustained in vitro with a given specificity.
However, hybridoma cells can be subjected to gene loss, gene mutation, additional chains found in hybridoma and cell line drift. This later problem identified in mAbs can be overcome using recombinant antibodies (rAbs). For recombinant antibody rAbs, the antibody sequence has to be determined in order to synthesize the immunoglobulin (Ig) L and H chain genes and generate expression constructs. The constructs are then transfected into a high yield cell such as CHO or HEK293.
A similar strategy could be applied to pAbs. Being able to sequence several antibodies from a polyclonal mixture, at least some of its main dominant forms, to generate recombinants to produce a simpler complex mixture such as a recombinant oligoclonal mixture is therefore an attractive solution, which combines advantages from both the mAbs and rAbs side (removal of batch-to-batch variability, circumvention of loss of hybridoma), and pAbs (a response closer to that of the natural immune system). To date, very few efforts have been attempted to sequence pAbs.
In an approach developed by Cheung et al., antibodies were first enriched from an immunized animal and analyzed using a standard proteomics mass spectrometry (MS)-based approach searched against a reference database created by Next Generation Sequencing (NGS) of the B cell Ig repertoire of the immunized animal. Pairing of the H and L chains was performed by exploring all possible combinations of H and L chains. A relatively similar approach was proposed by Wine et al. (2013).
Guthals et al 2016, provided an example of sequencing a mixture derived from a polyclonal mixture purified by glycoprotein B antigen affinity from a cytomegalovirus-exposed individual; they managed to sequence several H and L chains and concentrate their efforts on the top four LC and seven HC. They had to generate all possible pairwise combinations expressed in mammalian cells to validate antigen binding. Like previous attempts, the main issue they encountered was that the pairing was blindly explored through a combinatorial approach; such an approach of H and L pairing is feasible for a low number of antibodies but is rather impossible or too demanding for large datasets as it is the multiplication product of the number of identified L chains by the number of identified H chains.
The importance of proper pairing has been discussed in different works. Czerwinski et al (1998) highlighted the significance of proper pairing by screening a library of chimeric antibody antigen-binding domains (Fab). Brandon et al (2013) addressed the H/L pairing by performing single B-cell sequencing. Jared Shaw (2020), proposed a direct chain pairing using top down and middle down mass spectrometry approaches; their proposed approaches rely on the complete knowledge of the different sequences to identify within a pair, and can be performed on intact fragments only.
Most of the proposed strategies used to identify an antibody or several antibodies from a large population are conducted either through proteomics or transcriptomics; the sequencing strategy is performed on samples where H and L chain are under reduced separate conditions (i.e. disulfide bonds disrupted and the molecules being fragmented), and consequently the H/L pairing information is lost during the sequencing procedure. One of the main strategies often used to identify the proper H/L pairs simply consist of generating all permutations by expressing all possible assemblies, and performing a functionality assay, which can be a tedious task for complex samples.
There is thus a need for novel approaches to identify light and heavy chain pairs in complex antibody mixtures such as in polyclonal antibody mixtures, or at least to reduce the number of possible valid pairing to consider.
The present description refers to a number of documents, the content of which is herein incorporated by reference in their entirety.
The present disclosure provides the following items 1 to 32:
Other objects, advantages and features of the present disclosure will become more apparent upon reading of the following non-restrictive description of specific embodiments thereof, given by way of example only with reference to the accompanying drawings.
In the appended drawings:
The use of the terms “a” and “an” and “the” and similar referents in the context of describing the technology (especially in the context of the following claims) are to be construed to cover both the singular and the plural, unless otherwise indicated herein or clearly contradicted by context.
The terms “comprising”, “having”, “including”, and “containing” are to be construed as open-ended terms (i.e., meaning “including, but not limited to”) unless otherwise noted.
All methods described herein can be performed in any suitable order unless otherwise indicated herein or otherwise clearly contradicted by context.
The use of any and all examples, or exemplary language (“e.g.”, “such as”) provided herein, is intended merely to better illustrate embodiments of the claimed technology and does not pose a limitation on the scope unless otherwise claimed.
No language in the specification should be construed as indicating any non-claimed element as essential to the practice of embodiments of the claimed technology.
Herein, the term “about” has its ordinary meaning. The term “about” is used to indicate that a value includes an inherent variation of error for the device or the method being employed to determine the value, or encompass values close to the recited values, for example within 10% of the recited values (or range of values).
Recitation of ranges of values herein are merely intended to serve as a shorthand method of referring individually to each separate value falling within the range, unless otherwise indicated herein, and each separate value is incorporated into the specification as if it were individually recited herein. All subsets of values within the ranges are also incorporated into the specification as if they were individually recited herein.
Where features or aspects of the disclosure are described in terms of Markush groups or list of alternatives, those skilled in the art will recognize that the disclosure is also thereby described in terms of any individual member, or subgroup of members, of the Markush group or list of alternatives.
Unless specifically defined otherwise, all technical and scientific terms used herein shall be taken to have the same meaning as commonly understood by one of ordinary skill in the art (e.g., in stem cell biology, cell culture, molecular genetics, immunology, immunohistochemistry, protein chemistry, and biochemistry).
Unless otherwise indicated, the recombinant protein, cell culture, and immunological techniques utilized in the present disclosure are standard procedures, well known to those skilled in the art. Such techniques are described and explained throughout the literature in sources such as, J. Perbal, A Practical Guide to Molecular Cloning, John Wiley and Sons (1984), J. Sambrook et al., Molecular Cloning: A Laboratory Manual, Cold Spring Harbour Laboratory Press (1989), T. A. Brown (editor), Essential Molecular Biology: A Practical Approach, Volumes 1 and 2, IRL Press (1991), D. M. Glover and B. D. Hames (editors), DNA Cloning: A Practical Approach, Volumes 1-4, IRL Press (1995 and 1996), and F. M. Ausubel et al. (editors), Current Protocols in Molecular Biology, Greene Pub. Associates and Wiley-Interscience (1988, including all updates until present), Ed Harlow and David Lane (editors) Antibodies: A Laboratory Manual, Cold Spring Harbour Laboratory, (1988), and J. E. Coligan et al. (editors) Current Protocols in Immunology, John Wiley & Sons (including all updates until present).
The present disclosure provides a method for pairing the Heavy (H) and Light (L) chains of an antibody present in an antibody mixture that does not necessitate the complete exact L and H chain sequences and allows pairing of specific short H chain fragments with specific short L chain fragments in particular their respective CDR3. The method relies on either separating the antibody mixture under non-reducing condition using chromatography (HIC or reverse phase or any separation technique) or gel (native gel or denaturing gel, 1D- or 2D-gel), or using cross-linking reagents and protease digestion to identify close proximity signature peptides from both H and L chains. Different data analysis strategies may be used to confirm the right pairings or to reduce the number of possible pairings.
The present disclosure provides a method for determining heavy and light chain pairing of an antibody or an antibody fragment in an antibody and/or antibody fragment mixture, the method comprising (a) submitting the mixture to a separation step under non-reducing condition to obtain isolated antibodies or antibody fragments; (b) digesting the isolated antibodies or antibody fragments (e.g., with a chemical cleavage reagent such as cyanogen bromide (CNBr) and/or one or more proteases) to obtain antibody or antibody fragment peptides; (c) analyzing the antibody or antibody fragment peptides; and (d) determining heavy chain and light chain pairing of the antibody or antibody fragment based on the analysis.
The present disclosure also provides a method for determining heavy and light chain pairing of an antibody or an antibody fragment in an antibody and/or antibody fragment mixture, the method comprising (a) contacting the mixture with a cross-linking agent to obtain cross-linked antibodies or antibody fragments; (b) digesting the cross-linked antibodies or antibody fragments (e.g., with a chemical cleavage reagent such as CNBr and/or one or more proteases) to obtain antibody or antibody fragment peptides; (c) analyzing the antibody or antibody fragment peptides; and (d) determining heavy chain and light chain pairing of the antibody or antibody fragment based on the analysis.
The expression “determining heavy and light chain pairing of an antibody or an antibody fragment in an antibody and/or antibody fragment mixture” means (a) identifying the exact heavy and light chain pairing of an antibody or antibody fragment, or (b) reducing the number of possible heavy and light chain pairing. For example, if a mixture comprises 10 antibodies (i.e., 10 light and 10 heavy chains sequenced), using to the method of Guthals et al. 2016, it would be necessary to recombinantly produce in cells and validate antigen binding for all 100 possible pairwise combinations to determine the proper pairing. However, the method of the present disclosure may permit to directly identify the proper pairing without the need to recombinantly produce in cells and validate antigen binding for all 100 possible pairwise combinations, or may limit the number of possible pairwise combinations to be produced by identifying light and heavy chains that cannot pair together because they were clearly found in distinct bands or fractions following separation under non-reducing conditions. Only light and heavy chains that were found in (c) under non-reducing conditions (i.e., in overlapping bands or fractions) would require further analysis (e.g., recombinant production in cells and assessment of antigen binding) to identify or confirm proper heavy and light chain pairing.
The present disclosure provides methods that allow for direct determination of heavy chain and light chain pairing of an antibody or antibody fragment in a mixture, for example, in a sample containing intact polyclonal antibodies (pAbs), a sample containing intact monoclonal antibodies (mAbs), or a sample containing antibody fragments (which can be generated from pAbs or mAbs (e.g., Fab fragments and/or F(ab′)2 fragments)).
The separation of the antibodies or antibody fragments under non-reducing conditions may be performed using any known methods of separation of proteins under non-reducing conditions (conditions that maintain the disulfide bridges between the light and heavy chains, e.g., in the absence of reducing agents such as beta-mercaptoethanol (β-ME), dithiothreitol (DTT) or tris (2-carboxyethyl) phosphine (TCEP)), including chromatography-based methods such as hydrophobic interaction chromatography (HIC), ion exchange chromatography (anion or cation exchange chromatography), reverse phase (RP) or size exclusion chromatography, capillary electrophoresis (e.g., imaged capillary isoelectric focusing, CEF), as well as gel-based methods such as native polyacrylamide gel electrophoresis (PAGE), and agarose gel electrophoresis (denaturing 2D agarose gel). This step permits to divide the antibodies or antibody fragments present in the mixture into several fractions, with each fraction (or gel band) comprising either a single antibody or antibody fragment, or in the case of mixtures comprising antibodies or antibody fragments having similar migration patterns, one or more of the fractions or gel bands may comprise more than one antibody or antibody fragment (e.g., 2, 3 or 4 antibodies or antibody fragments).
In an embodiment, the method of the present disclosure further comprises contacting the antibody and/or antibody fragment mixture with an agent that modifies the charge of a given amino acid molecule in a sequence specific manner (thus changing the migration pattern of the antibodies or antibody fragments. Such agents are well known in the art and include agents that add positive or negative charges on certain neutral residues, agents that react with negatively charged residues (e.g., glutamic or aspartic acids) to remove the negative charge or to replace it by a positive charge, as well as agents that react with positively charged residues (e.g., lysine or arginine) to remove the positive charge or to replace it with a negative charge. Examples of such agents include citraconic anhydride (CA), which reacts with primary amines (e.g., in the side chain of lysine) to form an amide bond with a terminal carboxyl group, as well as sulfo-NHS-acetate (SNA), which also reacts with primary amines and forms a stable, covalent amide bond.
In an alternative embodiment of the method, the antibody and/or antibody fragment mixture is contacted with a protein cross-linking agent to obtain cross-linked antibodies or antibody fragments. The objective of this step is to generate covalent bonds between the heavy and light chains. The protein cross-linking agent may be a bifunctional reagent or heterofunctional reagent. The reactive function of the protein cross-linking agent can be an NHS ester compound (that reacts with primary amine), a maleimide compound (that reacts with sulfhydryl containing molecule), an hydrazide compound (that reacts with aldehyde containing molecule), or a carbodiimide based compound such as EDC (that reacts with carboxylate containing molecule). The two functional groups may be separated by a spacer, such as an alkyl or polyethylene glycol (PEG) chain. Agents to induce protein cross-linking are well known in the art and include, for example, glutaraldehyde, DSG, disuccinimidyl suberate (DSS), Bis(sulfosuccinimidyl) suberate (BS3), Bis(succinimidyl) penta(ethylene glycol (BS(PEG)5), TSAT, DSP, DTSSP, DST, BSOCOES, EGS, Sulfo-EGS, DMA, DMP, DMS, DTBP, DFDNB, SIA, SMAP, SIAB, Sulfo-SIAB, AMAS, BMPS, GMBS, Sulfo-GMBS, MBS, Sulfo-MBS, SMCC, Sulfo-SMCC, SMBP, Sulfo-SMBP, SMPH, LC-SMCC, Sulfo-KMUS, SPDP, LC-SPDP, Sulfo-LC-SPDP, or SMPT.
In an embodiment, in cases where at least the partial amino acid sequence of the antibodies or antibody fragments is not known, the method of the present disclosure further comprises a step of determining the amino acid sequence of at least a portion of the light and heavy chains of the antibodies or antibody fragments present in the mixture (e.g., at least the heavy chain CDR3 and light chain CDR3). The sequences of the light and heavy chains of the antibodies or antibody fragments may be determined using methods well known in the art, such as de novo MS/MS sequencing (Guthals et al. 2016), high throughout Ig sequencing (Georgiou et al., 2014), proteogenomics-based sequencing (Hashimoto et al., 2020; Cheung et al., 2012, Wine et al., 2013), and single-cell sequencing (Meijer et al., 2006). Sequences of the heavy chain CDR3 may be determined using the method described in PCT application No. PCT/CA2021/050791. The method may also include collecting additional information such as the germline/allele, if sequencing successfully the entire antibody is not possible
After the separation of the antibodies or antibody fragments under non-reducing conditions, one or more of the fractions or gel bands (or the cross-linked antibodies or antibody fragments) are digested (under reduced and/or non-reduced condition) with a suitable agent to generate digested peptides proper for LC-MS analysis. Agents to cleave proteins include chemical agents such cyanogen bromide (CNBr) that cleaves at methionine (Met) residues; BNPS-skatole that cleaves at tryptophan (Trp) residues; formic acid that cleaves at aspartic acid-proline (Asp-Pro) peptide bonds; hydroxylamine that cleaves at asparagine-glycine (Asn-Gly) peptide bonds, and 2-nitro-5-thiocyanobenzoic acid (NTCB) that cleaves at cysteine (Cys) residues, as well as proteases. Any suitable protease or combination of proteases may be used to digest the isolated fractions or gel bands, such as trypsin, chymotrypsin, AspN, GluC, ArgC, LysC, LysN, pepsin or any combination thereof. In the case of cross-linked antibodies or antibody fragments, the digestion will generate peptides of both the light and heavy chains. The digestion may be performed with a combination of chemical agent(s) and protease(s).
Prior to and/or after the digesting step, the sample may be subjected to various treatments including alkylation and deglycosylation.
The antibody or antibody fragment peptides are then submitted to a suitable analysis to identify the composition of the heavy and light chain in each fraction (or gel band), and thus to determine the proper heavy and light chain pairing in the mixture. For example, the peptides could be resolved by reverse phase chromatography and in-line nanoelectrospray ionization/high-resolution tandem mass spectrometry (MS/MS). In an embodiment, the analysis comprises submitting the antibody or antibody fragment peptides to mass spectrometry (MS), such as liquid chromatography-mass spectrometry (LC-MS), preferably tandem MS. The MS/MS profile of the antibody or antibody fragment peptides may be compared to databases of antibody peptide sequences, or de novo sequencing of at least a portion of the antibodies or antibody fragments in the mixture may be performed.
The analysis may also comprise performing a quantitative analysis of the peak intensity of the antibody or antibody fragment peptides. The analysis comprises performing a clustering analysis or cross-correlation analysis to identify peptides that are found at similar level/frequency in the same fraction or gel band. Suppose there are n fractions or gel bands. Each peptide P(i) is then associated with a quantification vector X(i)=(x1(i), x2(i), . . . , xn(i), where each x) is the normalized quantity of peptide P(i) in fraction (or gel band) j. Since the initial fractionation is at the protein level, peptides from the same antibody or antibody fragment should have very similar quantification vectors. In contrast, the quantification vectors of peptides from different antibodies or antibody fragments are usually different because their corresponding proteins are likely separated into different fractions (or gel bands) or appear in different fractions (or gel bands) at different concentration. As such, a clustering analysis using the quantification vector may cluster the peptides according to their corresponding proteins. The peptides belonging to the same cluster are then used together to assemble one or more protein sequences. Such clustering can also be used to pair the corresponding H and L chains of the same antibody or antibody fragment. Essentially, the chains whose peptides mostly belong to the same cluster are paired together.
The clustering analysis can be carried out with any standard clustering algorithms widely known to the data analysis community, such as K-means clustering and Hierarchical clustering that are available through software packages such as R (r-project.org) and scikit learn (scikit-learn.org). A standard principal component analysis (PCA) can also be used to reduce the dimensionality of the quantification vectors before the clustering analysis. PCA is available in many standard data analysis software packages such as R and scikit learn. When the number of antibodies or antibody fragments is small (e.g., less than 5), a PCA can also be used to project the vectors to a two- or three-dimensions space so the data can be visualized to determine the clustering or to pair the H and L chains. In an embodiment, the clustering analysis is a principal component analysis (PCA).
Analysis of cross-linked peptides may be performed using suitable tools such as pLink (Fan et al. 2015), ECL (Yu et al., 2016), xQuest (Rinner et al., 2008; Walzthoeni et al., 2012), ProteinProspector (Chu et al., 2010; Trnka et al., 2014), Kojak (Hoopmann et al., 2015), OpenPepXL (Netz et al., 2020) or MS-Annika (Pirklbauer et al., 2021).
In an embodiment, the method further comprises assessing the binding of the antibody or antibody fragment to its target antigen. This step may be used to confirm that the putative heavy and light chain pair identified by the method form a functional antibody or antibody fragment, or in the case where more than one putative/candidate heavy and light chain pairs are identified (e.g., when two or more antibodies or antibody fragments in the mixture have similar migration patterns), this step permits to confirm which of the putative/candidate heavy and light chain pairs forms a functional antibody or antibody fragment able to bind to the target antigen. This may be done, for example, by recombinantly expressing an antibody or antibody fragment comprising the putative heavy and light chain pair identified, and assessing the binding of the antibody or antibody fragment to its target antigen. This may be achieved by introducing nucleic acids encoding the heavy and light chains into a suitable expression system, such as CHO or HEK293 cells, culturing the cells into conditions suitable for the production of the antibody or antibody fragment, and assessing the binding of the antibody or antibody fragment to the antigen, for example by immunoassay (e.g., ELISA). In an embodiment, the binding of the antibody or antibody fragment to its target antigen may be assessed by expressing the antibody or antibody fragment at the surface of a phage and assessing the binding of the phages to the antigen (phage panning).
The present disclosure is illustrated in further details by the following non-limiting examples.
The aim of this study was to properly pair two mouse IgG2a from an artificial mixture using hydrophobic interaction chromatography (HIC). The two antibodies used were two mouse IgG2a recognizing two different antigens (Absolute Antibody, referred to as P13 and P14).
The HIC column used is a Propac 2.1 mm×100 mm Thermo Fisher #063653. 50 μg of each intact antibody were mixed and separated on a 35-minute gradient. In
Two different analyses were performed to confirm the proper H and L chain pairing from that mixture.
Analysis 1: To identify the composition of the H and L chain in each fraction, the mass spectrometry peptides were compared to an internal database of all in-house antibodies.
Conclusion based on PSM and peptides specificity: analysis of the number of hits per peak show that Peak 1 at 15.7 min contain mostly the H and L chain of P13 while Peak 2 show a higher content for the H and L chain for P14 (as shown by the number in italics). Thus, HIC can separate antibodies and its peptide spectrum matches can be used to pair correctly the heavy and light chain of P14 and similar for P13.
Analysis 2: To identify the proper pairing of the heavy and light chain in each fraction, principal component analysis (PCA) of the peptides trend across the 4 fractions (2 fractions per peak) was performed.
The same experimental dataset described above was analyzed using a principal component analysis. Suppose there are n fractions. Each peptide P(O is then associated with a quantification vector X(i)=(x1(i), x2(i), . . . , xn(i)), where each x) is the normalized quantity of peptide P(i) in fraction j. Since the fractionation is at the protein level, peptides from the same mAb protein should have very similar quantification vectors. In contrast, the quantification vectors of peptides from different mAb proteins are usually different because their corresponding proteins are likely separated into different fractions or appear in different fractions at different concentration. As such, a clustering analysis using the quantification vector may cluster the peptides according to their corresponding proteins. The peptides belonging to the same cluster are then used together to assemble one or more protein sequences. Such clustering can also be used to pair the corresponding H and L chains of the same mAb. Essentially, the chains whose peptides mostly belong to the same cluster are paired together.
The clustering analysis can be carried out with any standard clustering algorithms widely known to the data analysis community, such as K-means clustering and Hierarchical clustering that are available through software packages such as R (r-project.org) and scikit learn (scikit-learn.org). A standard PCA can also be used to reduce the dimensionality of the quantification vectors before the clustering analysis. PCA is available in many standard data analysis software packages such as R and scikit learn. When the number of mAbs is small (e.g., less than 5), a PCA can also be used to project the vectors to a two- or three-dimensions space so the data can be visualized to determine the clustering or to pair the H and L chains.
For example,
In the PCA plot, there are no clustering of the two Light or the two Heavy chains but instead two main clusters of one pair each of H with L chains, antibody 1/P13 H and L chains on the left and antibody 2/P14 H and L chains on the right. In PC2, a homogeneous distribution of H and L for both antibodies is seen.
Proteins loaded on polyacrylamide gel electrophoresis (PAGE) separate according to their charge, shape and molecular weight. Antibody proteins have similar molecular weights and shapes but could differ from each other by their charge. Several factors contribute to the overall charge of the Ig molecule, including the abundance of amino acids with basic and acidic side chains, and the amount and composition of glycans. Pure mAbs subjected to native PAGE will thus exhibit varying degrees of migration into the gel. In addition, mAbs band pattern will often contain multiple bands, presumably reflecting the degree of glycosylation.
The pairing strategy in this example takes advantage of the distinctive migration pattern of rabbit mAbs on native PAGE. The strategy comprises separating the rabbit mAb mixture under native (non-denaturing, non-reducing) conditions on PAGE, digesting the proteins in the cut gel bands with the protease, followed by the LC-MS/MS analysis of the peptides and the computer algorithm matching the unique peptides originating from the H and L chains found in the same gel band.
Four rabbit IgG recombinant antibodies (anti-beta 3 integrin (P17), anti-ERBB2 (P18), anti-IL-18 (P19), and anti-alpha-Vp5 integrin (P20), each at 1 mg/mL) from Absolute Antibody with known amino acid sequences were mixed in equal proportions. Twenty μg of the mixture was mixed with the 2× native loading dye and loaded on the 7.5% PAGE. In parallel, pure antibodies (10 μg each) were loaded on the same gel for comparison. The separation was performed at 130V for 1 H 15 min. The gel was then stained with Coomassie Blue dye and de-stained with Methanol/Acetic acid to visualize the antibodies band pattern (
The MS analysis was performed on Orbitrap Fusion™ Tribrid™ Mass Spectrometer coupled to Evosep One LC system.
The data were analyzed using PCA as described above in Example 1, analysis 2. The consistent finding of the unique peptides from H and L chains in the same fraction identifies a cognate H-L chain pair. Unique peptides to each antibody have a distinctive pattern across the different fractions. The best correlation pattern between a given H and L chain allows for proper pairing.
Complex mixture of mAbs with similar migration patterns on native PAGE may not be efficiently separated into individual mAbs and thus may not be accurately paired by PCA. In addition, some mAbs do not migrate well in the gel resulting in the inability to generate data for pairing. In this example, a pairing strategy that takes advantage of mAb labeling that modifies the charge of the protein molecule in a sequence specific manner. The main principle consists of changing the migration pattern of the mAb on the native PAGE. This labelling strategy was used to solve these issues and pair up to 5 different mouse recombinant Abs. Two compounds, citraconic anhydride (CA) and sulfo-NHS-acetate (SNA), reacting with primary amines were selected as modifying agents.
Citraconic anhydride (or 2-methyl maleic anhydride) reacts with primary amines to form terminal carboxylate under neutral pH. Under acidic conditions (pH 3-4) the reaction is reversible. Introduction of a carboxylate group contributes a negative charge and results in a net charge change of −2 per modified amine group. It was found that the reaction efficiency depends on the pH and the amount of citraconic anhydride added. If the product of CA reaction is then separated on the native PAGE, the observed migration pattern would be different depending on the specific conditions of the reaction. Two labeling conditions produced recognizable and distinct migration patterns for mAbs mixture on the native PAGE: reaction with 0.5 μl CA and reaction with 1 μl CA. The reversible nature of the reaction is advantageous, since peptide separation prior to MS analysis is done under acidic conditions that favor the release of the free amine group. Thus, no specific amino acid modification needs to be considered at the data analysis level.
Sulfo-NHS-acetate is commonly used to block or protect primary amines. The reaction with amine groups of proteins at neutral pH results in the formation of non-reversible acetamide modifications. Acetylation of lysines and free primary amines, reduce the overall protein charge under neutral and acid condition. Indeed, the reaction with sulfo-NHS acetate also changes the migration pattern of the modified protein on the native PAGE. The acetylation of lysine residues adds 42.01056 amu, which is taken into account at the data analysis.
Both CA and sulfo-NHS-acetate modify N-terminus of the protein as well as lysine residues. Since trypsin, Lys-C and in some instances Lys-N, does not cut after modified lysine residues, none of those enzymes were chosen for the enzymatic digestion of the modified proteins. Instead, pepsin or chymotrypsin were used.
Five mouse IgG2A mAbs recognizing different antigens (referred to as P12, P13, P14, P15 and P16, each at 1 mg/mL, all from Absolute Antibodies) with known amino acid sequences were mixed in the equal proportions by mixing 75 μl aliquot of each mAb. The resulting mixture was split into 4 treatments:
All treated samples (conditions 2-4) were then incubated at 25° C. for 2 hours with mixing, followed by overnight incubation at +4° C. 125 μl aliquot (50 μg) of each treated sample was desalted on Zeba desalting column with 7.5 kDa MWCO and dried under low pressure centrifugation (SpeedVac™). The dry pellet was reconstituted in 25 μl of 50 mM ammonium bicarbonate buffer and mixed with 25 μl of 2× native loading dye. 30 μl (30 μg) of the mixture was then loaded on the 7.5% polyacrylamide gel. In parallel, 20 μg of untreated mixture as well as 10 μg of each individual mAb were loaded on the same gel. The native PAGE separation was performed as described in Example 2. Upon the separation on the gel, the bands were cut out of the untreated and treated mAbs mixtures as follows: bands 1-8 were cut from untreated mixture, 9-13 were cut from sample treated with 0.5 μl of CA, 14-19—from sample treated with 1 μl of CA, 20-24—from sample treated with sulfo-NHS-acetate. The cut gel bands were then digested with chymotrypsin and analyzed with LC-MS/MS as described in the Example 2.
For this example, a natural rabbit pAbs directed against the JC region adjacent to the CDR3 human H chain (referred to as internally as a rabbit polyclonal antibody with the naming “PD025”) was used.
As a proof of concept, the natural rabbit pAb PD025 was separated under non-reducing condition using a 2D gel, followed by in-gel trypsin digestion. In this natural pAb, 2 main antibodies, named R1 and R3, were identified. Their sequence alignments are shown in
As stated earlier, it is assumed in that example that R1 and R3 H and L chains were assembled and only the right pairing was missing. As shown in
Assuming the different H and L chains are either fully assembled or specific proteotypic peptide identified for each chain (i.e., the respective L and H chains sequences are complete or partly complete but not paired), the proposed strategy includes identifying unique peptides to each H and L chains. The mixture was separated under non-reducing condition using 2D gel (
The antibody PD025 was separated using a 2D gel under non-reducing condition as following. For sample cleanup, 150 μg was precipitated using TCA (25%) at 4° C. overnight to remove glycerol and salt. The sample was centrifuged at 27,237×g at 4° C. for 30 min. The pellet was washed with cold acetone twice, centrifuged at 27237×g at 4° C. for 10 min between each wash step. Then, the pellet was resuspended in 130 μl DeStreak™ Rehydration Solution (Cytiva) with 1.3 μl BioLyte® 3/10 Ampholyte (Bio-Rad). The resuspended sample was used to passively rehydrate a 7 cm IPG strip (pH3-10NL, Bio-Rad) overnight at room temperature Next day, isoelectric focusing was conducted in a PROTEAN® IEF Cell (Bio-Rad) at 250 V 20 min, 4,000 V 2 h, and 4,000 V to 10,000 V-h with the current set at 50 μA per gel. When the IEF separation was completed, the IPG strip was removed from the focusing tray and transferred side up onto a blotting filter paper. For non-reducing sample, the IPG strip was equilibrated in SDS Equilibration buffer [6M urea; 0.375M Tris-HCl, pH 8.8; 2% SDS; 20% glycerol] for 20 min at room temperature. Next, the IPG strip was equilibrated in 1× Tris-glycine-SDS running buffer before it was sealed onto a 4-20% Mini-PROTEAN® TGX™ Precast gel (Bio-Rad) using agarose solution [0.5% low melting point agarose in 1× Tris-glycine-SDS and 0.003% bromophenol blue]. The gel was stained using Coomassie Brilliant Blue R-250 (see
The peak intensities for the 9 selected peptides described in Table 2 were evaluated for fractions 1 to 12 (see
Crosslink approach has been used in the field of IgG characterization for a while to map antibody/antigen contact. Maibom-Thomsen et al. (2019) have shown using crosslink that IgG has a compact structure with a hidden Fc domain. Most of the work done on using crosslink and IgG are either toward coupling covalently the antibody to protein A/G beads or to study the epitope sites.
100 μg of each of R1 and R3 were combined, followed by addition of 4 μL 50U of the protease IdeS derived from Streptococcus pyogenes (Promega), and dilution up to 100 μL with 0.01M PBS. The mixture was incubated at 37° C. for 1 hour. 60 μL of protein A slurry was washed with 500 μL 0.01M PBS for R1/R3 mix and incubated for 1 hour at room temperature and mix. The supernatant, containing F(ab′)2 fragments, was retained. 60 μg of F(ab′)2 was dried down using centrifuge under low pressure. 30 μg R1 and 30 μg R3 undigested (“intact” fraction) were combined, then dried down under low pressure. R1/R3 F(ab′)2 were reconstituted in 20 μL 400 μM bis(sulfosuccinimidyl)suberate (BS3, Thermo Fisher) dissolved in 25 mM sodium phosphate (20× molar excess of BS3 to R1/R3 F(ab′)2). Intact R1/R3 mix was reconstituted in 50 μL 280 μM BS3 dissolved in 25 mM sodium phosphate (35× molar excess of BS3 to R1/R3 mix). The crosslinking reaction was allowed to take place at room temperature on shaker for 1 hour. The reaction was quenched by adding 1 M tris to a final concentration of 60 mM. F(ab′)2 and intact R1/R3 were diluted to 50 μL using 0.01M PBS followed by addition of 1.5 μL 1 M DTT to each and incubation at 95° C. for 15 minutes. 17.2 μL was removed for cysteine modification. 30 μL of 0.5M 2-Bromoethylamine hydrobromide (BEA) solution as well as 10 μL 1M tris pH 8 were added to volume set aside for cysteine modification (Cet). The reaction was incubated at room temperature for 4 hours, and 1M tris was added every hour to maintain reaction pH at 7. The remaining 32.8 μL was alkylated with 5 μL 0.5M iodoacetamide for 30 minutes at room temperature in dark. 170 μL acetone was added to iodoacetamide-treated samples, and 300 μL acetone was added to cysteine-modified samples for 1 hour at −20° C. The mixtures were centrifuged at 27,237×g at 4° C. for 10 minutes, and the acetone was decanted. Any remaining acetone was dried using centrifuge under low pressure. The pellet was reconstituted in 4 μL 4M urea in 37° C. shaker for 10 minutes. Cysteine-modified sample was then diluted up to 20 μL using HPLC grade water, and 30 μL 50 mM ammonium bicarbonate was added. 1 μg trypsin was and incubated overnight at 37° C. The iodoacetamide-treated sample was diluted up to 40 μL using HPLC grade water then split 20 μL into two tubes. 30 μL 50 mM ammonium bicarbonate and 1 μg of chymotrypsin were added to one tube and 25 μL 0.04 μg/μL pepsin (1 ug) plus 2 μL HCl were added to the second tube. Pepsin digest was performed at 37° C. for 15 minutes, followed by inactivation at 95° C. for 3 minutes, and the mixture was dried down with centrifuge under low pressure. Following overnight digestion, samples were dried down under low pressure centrifugation (Speedvac™). Digests were reconstituted in 40 μL 0.1% FA to achieve sample concentration of 0.5 μg/μL. The digest was loaded on Evotips following manufacturer's instruction.
5 μg of each digest (pepsin, chymotrypsin and cysteine modified followed by trypsin digestion) were loaded for both F(Ab′)2 and intact samples in 6 tips, followed by centrifugation at 1000×g for 1 minute. The tips were washed by adding 25 μL 0.1% FA in water and spinning at 700×g for 1 minute. The tips were left in 200 μL 0.1% FA by spinning down briefly. The samples were loaded on Q Exactive mass spectrometer using 44-minute LC-MS method.
The identification of BS3 cross-linked and mono-linked peptides was performed using pLink v2.3.9 software (Fan et al. 2015). The search parameters were set as follow: Trypsin (Try) or Pepsin (Pep) was selected as the protease, cysteines were set as fixed modification either carbamidomethyl-C or CEthy. As variable modifications, deamidated-N, deamidated-Q, oxidation-M, pyro-Glu at any Q N-term, and/or carbamidomethyl at any N-term were selected. The range of peptide mass and peptide length was set as 400-9000 Da and 4-90 aa, respectively. The cross-linked and mono-linked peptides were filtered using a <=5% FDR cut-off at PSM level and a pLink score <0.01.
The proof of concept of this study was to show that by mixing 2 known antibodies (from same species) and crosslinking the antibodies, it is possible to detect a unique crosslink peptide allowing the pairing of a given heavy chain with its corresponding light chain. By using a mixture, the importance of possible artefact (i.e., wrong crosslink pairs) was also evaluated. In Table 3, at least 7 crosslink peptides were identified for proper H/L pairing, 5 for R1 and 2 for R3. All other crosslinks were intrachain which do not allow to resolve H/L pairing. No interchain crosslink were found between R1 and R3.
The numbers in parentheses correspond to the position of the residues involved in the cross-linking. 1=cross-linking via the amino-terminal end.
Four rabbit monoclonal antibodies (rabbit monoclonal mAbs internal naming: R1, P17, P18, P19) were mixed at the same amount (40 μg of each). The sample was desalted using 3 kDa MWCO Amicon® Ultra-0.5 mL centrifugal filter devices (Merck Millipore, Germany) against a buffer containing 20 mM sodium phosphate, pH 7.4.
iCIEF experiment was performed at Advanced Electrophoresis Solutions Ltd (Cambridge, ON, Canada). For preparative iCIEF, 1 mg/ml of the sample in 4% UH 3-10 AESlytes, 6 M urea was mixed with deionised water for a total volume of 40 μl. CEInfinite system with UV detector (280 nm) was used for fraction collection. The focusing was 1 min at 1000 V, 1 min at 2000 V, 10 min at 3000 V, and 3000 V during mobilisation (0.16 μl/min). A total of 6 fractions were collected based on peaks profile controlled and acquired by CEInsight software. The experiment was repeated three more times and each fraction was collected in the same tube.
Sample fractions were reduced, alkylated and deglycosylated using PNGaseF (Promega, WI, US). ProteaseMax Surfactant (Promega, WI, US) was added at a final concentration of 0.01% during trypsin digestion. The digests were loaded onto Evotip Pure™ for mass spectrometry using Whisper method (Orbitrap Exploris 240).
Although the present invention has been described hereinabove by way of specific embodiments thereof, it can be modified, without departing from the spirit and nature of the subject invention as defined in the appended claims. In the claims, the word “comprising” is used as an open-ended term, substantially equivalent to the phrase “including, but not limited to”. The singular forms “a”, “an” and “the” include corresponding plural references unless the context clearly dictates otherwise.
The present application claims the benefits of U.S. provisional application Ser. No. 63/260,027, filed on Aug. 6, 2021, which is incorporated herein by reference in its entirety.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/CA2022/051194 | 8/5/2022 | WO |
Number | Date | Country | |
---|---|---|---|
63260027 | Aug 2021 | US |