The present disclosure relates generally to the field of cancer diagnosis and therapy. More particularly, it concerns the use of citrullinated proteins in the diagnosis and therapy of cancer.
Despite great progress in the fields of cancer diagnosis and therapy, about 600,000 residents of the United States were projected to die of cancer in 2020.
Citrullination is the post-translational conversion of the amino acid arginine to citrulline. Protein citrullination is brought about by the action of protein arginine deiminase (PADI) family members. Dysregulated protein citrullination by PADI family members has been associated with autoimmune diseases. To date, no enzyme has been identified that can reverse protein citrullination. The role of protein citrullination has been best investigated in the context of rheumatoid arthritis (RA), where elevated protein citrullination, notably of keratins, filaggrin, vimentin, actin, histones, nucleophosmin, and nuclear lamin C, has been shown to elicit an autoimmune response. Autoimmunity in RA is considered to be principally facilitated through MHC class II mediated presentation of citrullinated peptides that elicit a B-cell response.
It would be desirable to have additional diagnostic, therapeutic, and prophylactic modalities against cancer.
The following presents a simplified summary of the disclosure in order to provide a basic understanding of some aspects of the disclosure. This summary is not an exhaustive overview of the disclosure. It is not intended to identify key or critical elements of the disclosure or to delineate the scope of the disclosure. Its sole purpose is to present some concepts in a simplified form as a prelude to the more detailed description that is discussed later.
In one aspect, provided is a method of detecting cancer and/or diagnosing a subject with cancer comprising using a citrullinated peptide or protein for detection of autoantibodies present in a biological sample from the subject, wherein said autoantibodies specifically bind to the citrullinated peptide or protein. In another aspect, provided is a method of detecting cancer and/or diagnosing a subject with cancer comprising detecting a citrullinated peptide or proteins in a biological sample from the subject. In another aspect, provided are methods of treating a subject diagnosed with cancer having elevated levels of a citrullinated peptide or protein or having autoantibodies against a citrullinated peptide or protein comprising administering an anti-cancer agent. In another aspect, provided is a method of reducing the risk of or preventing cancer in a subject comprising administering a composition comprising at least one citrullinated peptide or protein associated with the cancer to the subject.
In one embodiment, the present disclosure relates to a method, comprising providing a plasma sample from a patient suffering from a cancer or suspected of suffering from the cancer; incubating the plasma sample with at least one citrullinated protein selected from the group consisting of the proteins encoded by genes listed in Table 1, the proteins encoded by genes listed in Table 2, and the proteins encoded by genes listed in Table 10 under conditions sufficient for any autoantibodies against the citrullinated protein(s) that may be present in the plasma sample to bind to the citrullinated protein(s); incubating the citrullinated protein(s) and any bound autoantibodies against it/them with a detectable label, under conditions in which the detectable label will bind to the bound autoantibodies and will substantially not bind to other molecules; detecting the detectable label bound to the bound autoantibodies; classifying the patient as suffering from the cancer in response to the detected amount of the detectable label bound to the bound autoantibodies being equal to or greater than a threshold; and classifying the patient as not suffering from the cancer in response to the detected amount of the detectable label bound to the bound autoantibodies being less than the threshold. In some embodiments, wherein the patient as classified as suffering from the cancer, the method further comprises administering at least one anti-cancer agent to the patient.
In one embodiment, the present disclosure relates to a method of treating a patient, the method comprising administering at least one anti-cancer agent to the patient, wherein the patient has a level of autoantibodies against at least one citrullinated protein selected from the group consisting of the proteins encoded by genes listed in Table 1, the proteins encoded by genes listed in Table 2, and the proteins encoded by genes listed in Table 10 that is greater to or equal than a threshold.
In one embodiment, the present disclosure relates to a kit, comprising a substrate; at least one citrullinated protein selected from the group consisting of the proteins encoded by genes listed in Table 1, the proteins encoded by genes listed in Table 2, and the proteins encoded by genes listed in Table 10; and instructions for performing the method described above.
In one embodiment, the present disclosure relates to a method, comprising providing a tissue sample from a patient suffering from a cancer or suspected of suffering from the cancer; assaying the tissue sample for at least one citrullinated protein selected from the group consisting of the proteins encoded by genes listed in Table 1, the proteins encoded by genes listed in Table 2, and the proteins encoded by genes listed in Table 10; classifying the patient as suffering from the cancer in response to the tissue sample containing an amount of the citrullinated protein(s) equal to or greater than a threshold; and classifying the patient as not suffering from the cancer in response to the tissue sample containing an amount of the citrullinated protein(s) less than the threshold.
In one embodiment, the present disclosure relates to a kit, comprising a cell lysis agent; and instructions for performing the method described above.
In one embodiment, the present disclosure relates to a method, comprising providing a tumor sample from a patient suffering from a cancer; assaying the tumor sample for at least one citrullinated amino acid sequence selected from the group consisting of sequences and corresponding modifications listed in Table 2, Table 9, and/or Table 10, and sequences having at least 70% identity to the sequences listed in Table 2, Table 9, and/or Table 10 and comprising at least one arginine residue; and presenting to the immune system of the patient at least one peptide, wherein each peptide comprises at least one of the citrullinated amino acid sequences, in response to the tumor sample containing an amount of the citrullinated amino acid sequence(s) equal to or greater than a threshold.
In one embodiment, the present disclosure relates to a kit, comprising at least one peptide, wherein each peptide comprises at least one citrullinated amino acid sequence selected from the group consisting of sequences and corresponding modifications listed in Table 2, Table 9, and/or Table 10, and sequences having at least 70% identity to the sequences listed in Table 2, Table 9, and/or Table 10 and comprising at least one arginine residue; and instructions for performing the method described above.
In one embodiment, the present disclosure relates to a method, comprising providing a tumor sample from a patient suffering from a cancer; assaying the tumor sample for a citrullinated protein encoded by a gene selected from the group consisting of genes listed in Table 1 as having a plasma membrane location; and administering to the patient an anti-cancer agent targeting the citrullinated protein, in response to the tumor sample containing an amount of the citrullinated protein equal to or greater than a threshold.
In one embodiment, the present disclosure relates to a kit, comprising an anti-cancer agent targeting a citrullinated protein encoded by a gene selected from the group consisting of genes listed in Table 1 as having a plasma membrane location; and instructions for performing the method described above.
In one embodiment, the present disclosure relates to an isolated peptide comprising at least 70% sequence identity to a peptide selected from the group consisting of peptides listed in Table 2, Table 9, and Table 10.
In embodiments, the present disclosure may allow the diagnosis and/or therapy of cancer.
The following drawings form part of the present specification and are included to further demonstrate certain aspects of the present disclosure. The disclosure may be better understood by reference to one or more of these drawings in combination with the detailed description of specific embodiments presented herein. The drawings do not limit the scope of the compositions and methods, unless the written description expressly indicates that such is the case.
While the subject matter disclosed herein is susceptible to various modifications and alternative forms, specific embodiments thereof have been shown by way of example in the drawings and are herein described in detail. It should be understood, however, that the description herein of specific embodiments is not intended to limit the disclosure to the particular forms disclosed, but on the contrary, the intention is to cover all modifications, equivalents, and alternatives falling within the spirit and scope of the disclosure as defined by the appended claims. Moreover, the stylized depictions illustrated in the drawings are not drawn to any absolute scale.
The present disclosure is based in part on the discovery by the inventors that protein arginine deaminase (PADI) family member PADI2 is highly expressed in several cancer types, including breast cancer. As detailed in the Examples herein, immunohistochemical analysis of breast tumor tissues revealed increased expression of PADI2 in tumors, along with a positive correlation between PADI2 protein expression and peptidyl-citrulline staining. PADI2 expression exhibited strong positive correlations with a B-cell immune signature and with MHC-Ul bound citrullinated peptides. Provided herein are diagnostic, therapeutic, and prophylactic compositions and methods for detecting, treating, and/or reducing the risk of or preventing cancer, based in part on neoantigens involved in the citrullinome immune response in cancer.
Dysregulated protein citrullination by PADI family members has been associated with autoimmune diseases, with recent interest in its relevance to cancer given the occurrence of autoimmunity as a manifestation of cancer (refs. 1 and 2). PADI comprises a family of enzymes that, in the presence of calcium ions, catalyze the post-translational modification of proteins via the deamination of arginine to citrulline. In total, 5 PADI family members are known, with sequence homology ranging from 70% to 95% (ref. 2). To date, no enzyme has been identified that can reverse protein citrullination. The role of protein citrullination has been best investigated in the context of rheumatoid arthritis (RA), where elevated protein citrullination, notably of keratins, filaggrin, vimentin, actin, histones, nucleophosmin, and nuclear lamin C, has been shown to elicit an autoimmune response (refs. 3 and 4). Autoimmunity in RA is considered to be principally facilitated through MHC class II mediated presentation of citrullinated peptides that elicit a B-cell response (refs. 5 and 6). There is also currently increased interest in MHC-H neoantigens as shaping tumor immunity (ref. 7).
Comprehensive assessment of the expression of PADI family members in cancer has been limited. PADI4 has been investigated largely with respect to its interactions with histone H3, ING4, p53, and HDAC2 (refs. 8-11). Moreover, the extent to which protein citrullination in tumors induces an immune response is largely unexplored. Studies aimed at interrogating antitumor immunity against citrulline-peptides of vimentin and a-enolase have demonstrated that these peptides can trigger a CD4+ T cell response (refs. 12 and 13), providing evidence that protein citrullination in tumors may be immunogenic. Given the antigenicity of citrullinated proteins, exploration of PADI family members among cancer types and their impact on citrullination and immune response has potential for the development of tumor vaccines with citrullinated antigens or for identification of citrullinated antigens as biomarkers for cancer detection or prediction of response to immunotherapy (refs. 7 and 14-18).
To date, PADI4 has been the most investigated among family members in the context of cancer. PADI4 is the only PADI member known to encompass a nuclear transport sequence to citrullinate nuclear proteins including histones (ref. 47). As demonstrated in the Examples and described herein, the protein expression of PADI family members was investigated among 196 cancer cell lines reflective of 12 common cancer types by proteomic profiling as well as by analysis of mRNA expression datasets from The Cancer Genome Atlas (TCGA) for 9,721 human tumors consisting of 32 different cancer types. It was found that with respect to overall expression levels, PADI2 is preferentially expressed at the protein level in cancer compared with other PADI family members. Expression of PADI2 in breast cancer cell lines was recapitulated in breast cancer TMA.
Mass spectrometry was used to further explore the association of PADI2 with the citrullinome of 28 breast cancer cell lines, and the findings of PADI2 expression and citrullination in 422 breast tumors were confirmed using immunohistochemistry (IHC). Citrullination is contingent on PADI expression and intracellular Ca2+ levels. In RA, cytosolic Ca2+ levels are increased compared with normal cellular concentrations (ref. 48), and in tumors, aberrant levels of Ca2+ channels and pumps were expressed (ref. 49) which may change the intercellular Ca2+ flux preferable to PADIs. PADI2-mediated citrullination was found to be elevated in breast tumor tissue compared with adjacent non-tumor tissue or normal mammary gland tissue and other organ sites, and IgG bound citrullinated proteins were shown to be elevated in the plasma of patients with newly diagnosed breast cancer compared with controls. In some embodiments, provided herein are methods of detecting, diagnosing, treating, and reducing the risk of or preventing cancer using autoantibodies against citrullinated proteins as cancer biomarkers and/or using citrullinated proteins and/or peptides as neoantigens and/or biomarkers.
PADI2-mediated citrullination was also found to be associated with a distinct tumor immunophenotype using TCGA gene expression datasets and immunohistochemical analysis of human breast cancer tumors. In contrast to limited prior studies of protein citrullination in cancer and their impact on immune response, the occurrence of anti-citrulline autoantibodies in RA driven by a B-cell response is well documented (ref. 5). As demonstrated herein, PADI2 exhibited statistically significant positive correlations with tumor infiltrating B cells indicative of similarities with autoimmune disease. MHC analysis of citrullinated peptides in breast cancer cell lines yielded peptides in the MHC-U binding sequence length, supporting a B cell-mediated immune response. Of interest is the occurrence of a substantial fraction of citrullinated proteins as derived from the nuclear compartment pointing to a similarity between breast cancer and autoimmune diseases, for which elevated levels of circulating antinuclear antibodies are diagnostic markers (refs. 50 and 51). Also demonstrated herein is evidence for circulating autoantibodies against citrullinated tumor-associated proteins in cancer. It is likely that autoantibodies to citrullinated proteins are likely to be cancer type and subtype specific. Network analysis yielded a cytokeratin complex in common among breast cancer subtypes whereas distinct features of hormone receptors-centered network were observed in Luminal A, and a MYC centered network was observed in TNBC.
Given the current interest in cancer vaccine development and in immunotherapy, the findings described herein point to the importance of citrullination for antigenicity. Interestingly, several citrullinated proteins were identified that have been targeted for vaccine development, including vimentin and a-enolase (refs. 12 and 13). The importance of citrullination of a-enolase in inducing anti-cancer immunity has been recently demonstrated (ref. 13). Provided herein are methods for reducing the risk of or preventing cancer using citrullinated peptides (e.g., in cancer vaccine development) to induce cancer immunity.
Various illustrative embodiments of the disclosure are described below. In the interest of clarity, not all features of an actual implementation are described in this specification. It will of course be appreciated that in the development of any such actual embodiment, numerous implementation-specific decisions must be made to achieve the developers' specific goals, such as compliance with system-related, regulatory, and business-related constraints, which will vary from one implementation to another. Moreover, it will be appreciated that such a development effort might be complex and time-consuming but would nevertheless be a routine undertaking for those of ordinary skill in the art having the benefit of this disclosure. The description is to be read from the perspective of one of ordinary skill in the art; therefore, information well known to the skilled artisan is not necessarily included.
The present subject matter will now be described with reference to the attached figures. Various structures, systems, and devices are schematically depicted in the drawings for purposes of explanation only and so as to not obscure the present disclosure with details that are well known to those skilled in the art. Nevertheless, the attached drawings are included to describe and explain illustrative examples of the present disclosure.
The words and phrases used herein should be understood and interpreted to have a meaning consistent with the understanding of those words and phrases by those skilled in the relevant art. No special definition of a term or phrase, i.e., a definition that is different from the ordinary and customary meaning as understood by those skilled in the art, is intended to be implied by consistent usage of the term or phrase herein. To the extent that a term or phrase is intended to have a special meaning, i.e., a meaning other than that understood by skilled artisans, such a special definition will be expressly set forth in the specification in a definitional manner that directly and unequivocally provides the special definition for the term or phrase.
As used herein the specification, “a” or “an” may mean one or more. As used herein in the claim(s), when used in conjunction with the word “comprising,” the words “a” or “an” may mean one or more than one.
The use herein of the terms “including,” “comprising,” or “having,” and variations thereof, is meant to encompass the elements listed thereafter and equivalents thereof as well as additional elements. Embodiments recited as “including,” “comprising,” or “having” certain elements are also contemplated as “consisting essentially of” and “consisting of” those certain elements. As used herein, “and/or” refers to and encompasses any and all possible combinations of one or more of the associated listed items, as well as the lack of combinations where interpreted in the alternative (“or”).
As used herein, the transitional phrase “consisting essentially of” (and grammatical variants) is to be interpreted as encompassing the recited materials or steps “and those that do not materially affect the basic and novel characteristic(s)” of the claimed invention. See, In re Herz, 537 F.2d 549, 551-52, 190 U.S.P.Q. 461, 463 (CCPA 1976) (emphasis in the original); see also MPEP § 2111.03. Thus, the term “consisting essentially of” as used herein should not be interpreted as equivalent to “comprising.”
Recitation of ranges of values herein are merely intended to serve as a shorthand method of referring individually to each separate value falling within the range, unless otherwise indicated herein, and each separate value is incorporated into the specification as if it were individually recited herein. For example, if a concentration range is stated as 1% to 50%, it is intended that values such as 2% to 40%, 10% to 30%, or 1% to 3%, etc., are expressly enumerated in this specification. These are only examples of what is specifically intended, and all possible combinations of numerical values between and including the lowest value and the highest value enumerated are to be considered to be expressly stated in this disclosure.
The terms “about” and “approximately” as used herein shall generally mean an acceptable degree of error for the quantity measured given the nature or precision of the measurements. Exemplary degrees of error are within 20% (%); preferably, within 10%; and more preferably, within 5% of a given value or range of values. Any reference to “about X” or “approximately X” specifically indicates at least the values X, 0.95X, 0.96X, 0.97X, 0.98X, 0.99X, 1.01X, 1.02X, 1.03X, 1.04X, and 1.05X. Thus, expressions “about X” or “approximately X” are intended to teach and provide written support for a claim limitation of, for example, “0.98X.” Alternatively, in biological systems, the terms “about” and “approximately” may mean values that are within an order of magnitude, preferably within 5-fold, and more preferably within 2-fold of a given value. Numerical quantities given herein are approximate unless stated otherwise, meaning that the term “about” or “approximately” can be inferred when not expressly stated. When “about” is applied to the beginning of a numerical range, it applies to both ends of the range.
“Polypeptide,” “peptide,” and “protein” are used interchangeably herein to refer to a polymer of amino acid residues. As used herein, the terms encompass amino acid chains of any length, including full-length proteins, wherein the amino acid residues are linked by covalent peptide bonds.
The amino acids in the polypeptides described herein can be any of the 20 naturally occurring amino acids, D-stereoisomers of the naturally occurring amino acids, unnatural amino acids and chemically modified amino acids. Unnatural amino acids (that is, those that are not naturally found in proteins) are also known in the art, as set forth in, for example, Zhang et al. “Protein engineering with unnatural amino acids,” Curr. Opin. Struct. Biol. 23(4): 581-587 (2013); Xie et al. “Adding amino acids to the genetic repertoire,” 9(6): 548-54 (2005)); and all references cited therein. Beta and gamma amino acids are known in the art and are also contemplated herein as unnatural amino acids. Unless otherwise indicated, a particular amino acid sequence also implicitly encompasses conservatively modified variants thereof as well as the sequence explicitly indicated.
As used herein, a chemically modified amino acid refers to an amino acid whose side chain has been chemically modified. For example, a side chain can be modified to comprise a signaling moiety, such as a fluorophore or a radiolabel. A side chain can also be modified to comprise a new functional group, such as a thiol, carboxylic acid, or amino group. Post-translationally modified amino acids are also included in the definition of chemically modified amino acids.
The term “identity” or “substantial identity,” as used in the context of a polypeptide sequence described herein, refers to a sequence that has at least 60% sequence identity to a reference sequence. Alternatively, percent identity can be any integer from 60% to 100%. Exemplary embodiments include at least: 60%, 65%, 70%, 75%, 80%, 85%, 88%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99%, as compared to a reference sequence using the programs described herein; preferably BLAST using standard parameters, as described below. One of skill will recognize that these values can be appropriately adjusted to determine corresponding identity of proteins encoded by two nucleotide sequences by taking into account codon degeneracy, amino acid similarity, reading frame positioning and the like.
For sequence comparison, typically one sequence acts as a reference sequence to which test sequences are compared. When using a sequence comparison algorithm, test and reference sequences are entered into a computer, subsequence coordinates are designated, if necessary, and sequence algorithm program parameters are designated. Default program parameters can be used, or alternative parameters can be designated. The sequence comparison algorithm then calculates the percent sequence identities for the test sequences relative to the reference sequence, based on the program parameters.
A “comparison window,” as used herein, includes reference to a segment of any one of the number of contiguous positions selected from the group consisting of from 20 to 600, usually about 50 to about 200, more usually about 100 to about 150 in which a sequence may be compared to a reference sequence of the same number of contiguous positions after the two sequences are optimally aligned. Methods of alignment of sequences for comparison are well-known in the art. Optimal alignment of sequences for comparison may be conducted by the local homology algorithm of Smith and Waterman Add. APL. Math. 2:482 (1981), by the homology alignment algorithm of Needleman and Wunsch J. Mol. Biol. 48:443 (1970), by the search for similarity method of Pearson and Lipman Proc. Natl. Acad. Sci. (U.S.A.) 85: 2444 (1988), by computerized implementations of these algorithms (e.g., BLAST), or by manual alignment and visual inspection.
Algorithms that are suitable for determining percent sequence identity and sequence similarity are the BLAST and BLAST 2.0 algorithms, which are described in Altschul et al. (1990) J. Mol. Biol. 215: 403-410 and Altschul et al. (1977) Nucleic Acids Res. 25: 3389-3402, respectively. Software for performing BLAST analyses is publicly available through the National Center for Biotechnology Information (NCBI) web site. The algorithm involves first identifying high scoring sequence pairs (HSPs) by identifying short words of length W in the query sequence, which either match or satisfy some positive-valued threshold score T when aligned with a word of the same length in a database sequence. T is referred to as the neighborhood word score threshold (Altschul et al. supra). These initial neighborhood word hits acts as seeds for initiating searches to find longer HSPs containing them. The word hits are then extended in both directions along each sequence for as far as the cumulative alignment score can be increased. Cumulative scores are calculated using, for nucleotide sequences, the parameters M (reward score for a pair of matching residues; always >0) and N (penalty score for mismatching residues; always <0). For amino acid sequences, a scoring matrix is used to calculate the cumulative score. Extension of the word hits in each direction are halted when: the cumulative alignment score falls off by the quantity X from its maximum achieved value; the cumulative score goes to zero or below, due to the accumulation of one or more negative-scoring residue alignments; or the end of either sequence is reached. The BLAST algorithm parameters W, T, and X determine the sensitivity and speed of the alignment. The BLASTN program (for nucleotide sequences) uses as defaults a word size (W) of 28, an expectation (E) of 10, M=1, N=−2, and a comparison of both strands. For amino acid sequences, the BLASTP program uses as defaults a word size (W) of 3, an expectation (E) of 10, and the BLOSUM62 scoring matrix (see Henikoff & Henikoff, Proc. Natl. Acad. Sci. USA 89:10915 (1989)).
The BLAST algorithm also performs a statistical analysis of the similarity between two sequences (see, e.g., Karlin & Altschul, Proc. Nat'l. Acad. Sci. USA 90:5873-5787 (1993)). One measure of similarity provided by the BLAST algorithm is the smallest sum probability (P(N)), which provides an indication of the probability by which a match between two nucleotide or amino acid sequences would occur by chance. For example, a nucleic acid is considered similar to a reference sequence if the smallest sum probability in a comparison of the test nucleic acid to the reference nucleic acid is less than about 0.01, more preferably less than about 10−5, and most preferably less than about 10−20.
The term “autoantibody”, as used herein, refers to an antibody that specifically binds to an endogenous molecule in a subject that produces said autoantibody. The level of such antibody is typically elevated compared to the average of any other antibodies binding specifically to such an endogenous molecule. The endogenous molecule may be an autoantigen. An autoantigen is defined as a peptide, protein, or protein complex (and sometimes DNA or RNA) that is recognized by the immune system (e.g., through autoantibodies) of a subject suffering from a specific disease or disorder. These antigens should not be, under normal conditions, the target of the immune system, but T cells instead attack cells expressing the autoantigens.
The term “cancer” refers to a disease characterized by the uncontrolled growth of aberrant cells. The term includes all known cancers and neoplastic conditions, whether characterized as malignant, soft tissue, or solid, and cancers of all stages and grades including pre- and post-metastatic cancers, as well as recurrent cancer. Examples of different types of cancer include, but are not limited to, digestive and gastrointestinal cancers such as gastric cancer (e.g., stomach cancer), colorectal cancer, gastrointestinal stromal tumors, gastrointestinal carcinoid tumors, colon cancer, rectal cancer, anal cancer, bile duct cancer, small intestine cancer, esophageal cancer, breast cancer, lung cancer (e.g., non-small cell lung cancer), gallbladder cancer, liver cancer, pancreatic cancer, appendix cancer, prostate cancer, ovarian cancer, cervical cancer, uterine cancer, renal cancer, cancer of the central nervous system, skin cancer (e.g., melanoma), lymphomas, gliomas, choriocarcinomas, head and neck cancers, osteogenic sarcomas, and blood cancers.
The term “suffering from cancer,” as used herein in relation to a subject, indicates that a cancer is detectable in the subject's body using any diagnostic technique presently known or to be discovered. “Suffering” does not require the subject to be in pain from or have any naturally-perceptible symptoms of cancer.
The term “suspected of suffering from the cancer,” as used herein in relation to a subject, indicates that the subject has one or more symptoms that, in the judgment of a physician, may indicate that the subject suffers from cancer.
A “biological sample,” as used herein, generally refers to a bodily tissue or fluid obtained from a human, preferably a mammalian subject. Exemplary subjects include, but are not limited to humans, non-human primates such as monkeys, dogs, cats, mice, rats, cows, horses, camels, goats, and sheep. In some embodiments, the subject is a human. Non-limiting examples of biological samples include blood, blood fractions or blood products (e.g., serum, plasma, platelets, red blood cells, peripheral blood mononuclear cells and the like), sputum or saliva, stool, urine, other biological fluids (e.g., lymph, prostatic fluid, gastric fluid, intestinal fluid, renal fluid, lung fluid, cerebrospinal fluid, and the like). Additionally, solid tissues, for example, tissue biopsies (e.g., tumor tissue) may be used. A biological sample may be processed prior to use in a detection assay including dilution, addition of buffer or preservative, concentration, purification, or partial purification.
In one aspect, provided herein are methods of detecting cancer, diagnosing a subject with cancer, treating a subject with cancer, and/or reducing the risk of or preventing cancer in a subject. In some embodiments, the methods comprise detecting a citrullinated peptide or an autoantibody that specifically binds to a citrullinated peptide. In some embodiments, detecting an autoantibody that specifically binds to a citrullinated peptide comprises use of a citrullinated protein or peptide (e.g., as described herein). In some embodiments, the citrullinated peptide is a sequence corresponding to a portion of a protein that is known to be expressed in cancer cells. In some embodiments, the citrullinated peptide is a sequence corresponding to a portion of a protein that is highly expressed by cancer cells. In some embodiments, the protein that is highly expressed in cancer cells is not expressed or minimally expressed by corresponding normal (i.e., non-cancerous) tissues. In some embodiments, the citrullinated peptide is a sequence corresponding to a portion of a protein that is known to be expressed on the cell surface of cancer cells. In some embodiments, the citrullinated peptide is a sequence corresponding to a portion of a protein that has been targeted in cancer immunotherapy (e.g., any of the peptides listed in Table 10). In some embodiments, the citrullinated peptide is a sequence corresponding to a portion of a protein encoded by any of the genes listed in Table 1, any of the genes listed in Table 2, and/or any of the genes listed in Table 10. In some embodiments, the citrullinated peptide has at least 70% identity to any of the sequences listed in Table 2, Table 9, and/or Table 10. In some embodiments, the citrullinated peptide is a
In some embodiments of the methods provided herein, the subject is suspected of having cancer or at elevated risk of having cancer. In some embodiments, the subject is displaying symptoms of having cancer. In some embodiments, the subject has a family history of cancer. In some embodiments, the subject has one or more genetic factors (e.g., mutations) that are associated with an increased risk of having or developing cancer. In some embodiments, the subject was previously treated for cancer. In some embodiments, the subject is in remission. In some embodiments, the subject has an elevated level of any of the biomarkers described herein.
The methods of detecting cancer and/or diagnosing a subject with cancer provided herein can comprise various techniques using a citrullinated peptide for detection of autoantibodies (e.g., autoantibodies that specifically bind to citrullinated peptides or proteins, also referred to herein as citrullinated protein-specific autoantibodies). As described above and demonstrated in the Examples herein, the presence of autoantibodies in a subject that bind to citrullinated proteins may indicate that the subject has cancer. For each of the citrullinated peptides described herein, the entire peptide can used in the provided methods, a fragment of the peptide may be used, a variant of the peptide may be used, or a combination of two or more of the full length peptide, a fragment, or a variant thereof as described in this disclosure may be used. For example, citrullinated peptides can be used in an immunoassay to detect citrullinated protein-specific autoantibodies in a biological sample from a subject. Citrullinated peptides used in an immunoassay can be in a cell lysate (such as, for example, a whole cell lysate or a cell fraction), or purified citrullinated peptides or fragments thereof can be used provided at least one antigenic site recognized by citrullinated protein-specific autoantibodies remains available for binding.
Generally, the biological sample is assessed for the presence of citrullinated protein-specific autoantibodies by contacting the biological sample with a citrullinated peptide or fragment or variant thereof. In some embodiments, the citrullinated peptide or fragment thereof is present in a solid tissue such as a tissue section. For example, a tissue sample comprising citrullinated peptides or fragments may be used, which may be in the form of a tissue section fixed on a carrier, for example a glass slide for microscopic analysis. For example, the solid tissue can be a tumor tissue or a tissue sample comprising tumor tissue and adjacent normal tissue. In some embodiments, the citrullinated peptide or fragment thereof is present in a sample from a mammal. Tissue sections used in immunohistochemistry are well known in the art and are commercially available from a number of companies (e.g., Asterand, Inc. (Detroit, Michigan); Euroimmun (Morris Plains, New Jersey); and Imgenex (San Diego, California)). In other embodiments, the citrullinated peptide or fragment thereof is in a cell lysate, blood, serum, cerebrospinal fluid (CSF), or urine.
In some embodiments, a liquid sample comprising citrullinated protein-specific autoantibodies from a subject may be used to practice the methods provided herein. Exemplary liquid samples include cell lysate, blood, serum, cerebrospinal fluid (CSF), and urine. A step of contacting a liquid sample comprising citrullinated protein-specific autoantibodies with a citrullinated peptide or fragment or variant thereof may be carried out by incubating an immobilized form of said peptide in the presence of the liquid sample under conditions that are compatible with the formation of a complex comprising said peptides and said citrullinated protein-specific autoantibodies. Optionally, one or more washing steps may be contemplated.
In some embodiments, the citrullinated peptide or fragment thereof is an isolated, purified citrullinated peptide or fragment thereof as discussed below. In some embodiments, the citrullinated peptide or fragment thereof is in a phage display or eukaryotic cell display library. In some embodiments, the citrullinated peptide or fragments thereof is heterologously-expressed on the surface of a cell.
In some embodiments, the biological sample is contacted with a citrullinated peptide or fragment thereof and a secondary antibody. As is well known in the art, the secondary antibody is an antibody raised against the IgG of the animal species in which the primary antibody originated. Secondary antibodies bind to the primary antibody to assist in detection, sorting and purification of target antigens to which a specific primary antibody is first bound. The secondary antibody must have specificity both for the antibody species as well as the isotype of the primary antibody being used. If a citrullinated protein-specific autoantibodies is present in the biological sample, under appropriate conditions, a complex is formed between the citrullinated peptide or fragment thereof, the citrullinated protein-specific autoantibody in the biological sample, and the secondary antibody.
A complex comprising the citrullinated protein-specific autoantibodies and citrullinated peptides or fragments may be detected using a variety of methods known to the person skilled in the art, for example immunofluorescence microscopy or spectroscopy, luminescence, NMR spectroscopy, immunodiffusion, radioactivity, chemical crosslinking, surface plasmon resonance, native gel electrophoresis, or enzymatic activity. Depending on the nature of the sample, either or both immunoassays and immunocytochemical staining techniques may be used. Enzyme-linked immunosorbent assays (ELISA), Western blot, and radioimmunoassays are methods used in the art, and can be used as described herein to detect the presence of citrullinated protein-specific autoantibodies in a biological sample. While some of these methods allow for the direct detection of the complex, in some embodiments, the second antibody is labeled such that the complex may be detected specifically owing to intrinsic properties of the label such as, for example, fluorescence, radioactivity, enzymatic activity, visibility in NMR, or MRI spectra or the like. In some embodiments, the detection method may include any of Western blot, dot blot, protein microarray, ELISA, line blot radioimmune assay, immunoprecipitation, indirect immunofluorescence microscopy, radioimmunoassay, radioimmunodiffusion, ouchterlony immunodiffusion, rocket immunoelectrophoresis, immunohistostaining, complement fixation assay, FACS, and protein chip, but is not limited thereto. Methods and compositions are described herein that can be used for detecting, by immunohistochemistry, the presence of citrullinated protein-specific autoantibodies in a biological sample. Immunohistochemical methods are well known in the art, and non-limiting exemplary methods are described in U.S. Pat. Nos. 5,073,504; 5,225,325; and 6,855,552. See also Dabbs, Diagnostic Immunohistochemistry, 2nd Ed., 2006, Churchill Livingstone; and Chu & Weiss, Modern Immunohistochemistry, 2009, Cambridge University Press. It would be understood by those skilled in the art that immunohistochemistry routinely includes steps that are not necessarily discussed herein in detail such as washing the tissue samples to remove unbound secondary antibodies and the parallel staining experiments with proper controls. Exemplary detection methods are described in the Examples of this disclosure. While particular protocols are described below, variations of these assays are routine and known in the art.
In some instances, the secondary antibody is conjugated to a detectable label. Detectable labels are well known in the art and include, without limitation, a fluorescent label, an enzymatic label, a radioactive label, a luminescent label, or an affinity tag such as biotin or streptavidin. Exemplary fluorescent dyes include water-soluble rhodamine dyes, fluoresceins, 2′,7′-dichlorofluoresceins, fluorescein isothiocyanate (FITC), DyLight™ 488, phycoerythrin (PE), propidium iodide (PI), PerCP, PE-Alexa Fluor® 700, Cy5, allophycocyanin, Cy7, benzoxanthene dyes, and energy transfer dyes, as disclosed in the following references: Handbook of Molecular Probes and Research Reagents, 8th ed. (2002), Molecular Probes, Eugene, OR; U.S. Pat. Nos. 6,191,278, 6,372,907, 6,096,723, 5,945,526, 4,997,928, and 4,318,846; and Lee et al., 1997, Nucleic Acids Research 25:2816-2822. Exemplary enzymatic labels include but are not limited to alkaline phosphatase (AP) and horseradish peroxidase (HP)). Luminescent labels include, e.g., any of a variety of luminescent lanthanide (e.g., europium or terbium) chelates. For example, suitable europium chelates include the europium chelate of diethylene triamine pentaacetic acid (DTPA) or tetraazacyclododecane-1,4,7,10-tetraacetic acid (DOTA). Suitable radioactive labels include, for example, 32P, 33P, 14C, 125I, 131I, 35S, and 3H. In some instances, the detectable label can be a heterologous polypeptide such as an antigenic tag such as, for example, FLAG, polyhistidine, hemagglutinin (HA), glutathione-S-transferase (GST), or maltose-binding protein (MBP)) for use in purifying the citrullinated peptide or antigenic fragments or variants thereof. In some instances, the detectable label can be a heterologous polypeptide that is useful as diagnostic or detectable marker such as, for example, luciferase, a fluorescent protein (such as a green fluorescent protein (GFP)), or chloramphenicol acetyl transferase (CAT). Another labeling technique which may result in greater sensitivity is the coupling the antibodies to low molecular weight haptens. These haptens can then be specifically altered by means of a second reaction. For example, it is common to use haptens such as biotin, which reacts with avidin, or dinitrophenol, pyridoxal, or fluorescein, which can react with specific anti-hapten antibodies.
In some embodiments, the method comprises contacting a citrullinated peptide or fragment or variant thereof with a biological sample from a subject and a secondary antibody having a suitable label thereon under conditions in which a complex is formed between the citrullinated peptide or antigenic fragment or variant thereof, a corresponding citrullinated protein-specific autoantibody in the biological sample, if present, and the secondary antibody, and detecting the complex formed, if formed, by detecting the label of the secondary antibody, wherein the presence of the secondary antibody is indicative of the presence of a citrullinated protein-specific autoantibody in the biological sample, and wherein the absence of the secondary antibody is indicative of the absence of a citrullinated protein-specific autoantibody in the biological sample. In some instances, the secondary antibody is detectably-labeled. Immobilization of the citrullinated peptide or antigenic fragment or variant thereof on a solid carrier (also referred to herein as a substrate) can facilitate the method of citrullinated protein-specific autoantibody detection as discussed below.
In some instances, the method comprises contacting a citrullinated peptide or antigenic fragment or variant thereof having a suitable label thereon with a biological sample from a subject, and immunoprecipitating any complex formed between the citrullinated peptide or antigenic fragment or variant thereof and a corresponding citrullinated protein-specific autoantibody in the biological sample, and monitoring for said label on any of said complexes, wherein the presence of said label is indicative of the presence of a citrullinated protein-specific autoantibody in the biological sample and the absence of said label is indicative of the absence of a citrullinated protein-specific autoantibody in the biological sample.
In some instances, the method comprises a combination of immunoprecipitation and Western blot analysis to detect the presence of a citrullinated protein-specific autoantibody in a biological sample from a subject. For example, the method may comprise contacting a citrullinated peptide or fragment or variant thereof with a biological sample from a subject under conditions in which a complex is formed between the citrullinated peptide or fragment or variant thereof and a corresponding citrullinated protein-specific autoantibody in the biological sample, if present; immunoprecipitating any complex formed between the citrullinated peptide or antigenic fragment or variant thereof and a corresponding citrullinated protein-specific autoantibody in the biological sample to produce an immunoprecipitate comprising any such complex formed; separating components of the immunoprecipitate from each other (e.g., by electrophoresis), said components comprising the citrullinated peptide or antigenic fragment or variant thereof and a corresponding citrullinated protein-specific autoantibody in the biological sample, if present; and contacting the components of the immunoprecipitate with a secondary antibody having a suitable label thereon that specifically binds to a constant region of the citrullinated protein-specific autoantibody, if present; and detecting the complex formed, if formed, by detecting the label of the secondary antibody, wherein the presence of the secondary antibody is indicative of the presence of a citrullinated protein-specific autoantibody in the biological sample, and wherein the absence of the secondary antibody is indicative of the absence of a citrullinated protein-specific autoantibody in the biological sample. For example, immunoprecipitation assay may be performed to detect the presence of citrullinated protein-specific autoantibodies in a subject by contacting recombinant citrullinated proteins with a biological sample from the subject, such as serum. Exemplary labels include any of the detectable labels described in this disclosure including, for example, fluorescent dyes and radioactive labels.
In some embodiments, an isolated, purified citrullinated peptide or antigenic fragment or variant thereof may be used in the provided methods. Protein expression and purification methods are well known in the art. However, the teachings of the present invention may not only be carried out using peptides, in particular any citrullinated peptides having the exact amino acid sequence and modifications listed in Table 2, Table 9, and/or Table 10 herein, but also using fragments or variants of such peptides. Thus, modified citrullinated peptides and antigenic fragments or variants thereof are also contemplated, such as those in which one or more amino acid residues are substituted or modified (such as with glutaraldehyde).
An “isolated” or “purified” polypeptide, or portion thereof, is substantially or essentially free from components that normally accompany or interact with the polypeptide or protein as found in its naturally occurring environment. Thus, an isolated or purified polypeptide or protein is substantially free of other cellular material, or culture medium when produced by recombinant techniques, or substantially free of chemical precursors or other chemicals when chemically synthesized. A protein that is substantially free of cellular material includes preparations of protein having less than about 30%, 20%, 10%, 5%, or 1% (by dry weight) of contaminating protein. When the citrullinated peptide or antigenic portion thereof is recombinantly produced, optimally culture medium represents less than about 30%, 20%, 10%, 5%, or 1% (by dry weight) of chemical precursors or non-protein-of-interest chemicals.
The term “fragment” with regard to a citrullinated protein or peptide refers to an amino acid residue sequence of a portion of the full-length protein or peptide, encompassing, for example, an amino acid residue sequence that is truncated at one or both termini by one or more amino acids. The citrullinated peptide fragment retains its antigenicity such that it is bound specifically under appropriate binding conditions by a citrullinated protein-specific autoantibody that would bind specifically to the corresponding full-length citrullinated protein or peptide under appropriate binding conditions. An antigenic portion of the citrullinated protein or peptide can be a polypeptide that is, for example, 10, 25, 50, 100, 150, 200, 250 or more amino acid residues in length of the full length citrullinated protein or peptide. Alternatively or in addition, such peptide sequence may comprise one or more internal deletions of one or more amino acid residues. Thereby the residual length of the fragment equals or exceeds the length of one or more continuous or conformational epitopes, e.g., 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 21, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 55, 60, 65, 70, 75, 80, 85, 90, 95, 100, or more amino acid residues.
The person of skill in the art is familiar with guidelines used to design peptides having sufficient immunogenicity such as, for example, those described in Jackson, D. C., et al., Vaccine 18(3-4): 355-361 (1999) and Black, M., et al., Expert Rev. Vaccines, 9(2): 157-173 (2010). Briefly, it is desirable that the peptide meets as many as possible of the following requirements: (a) it has a high degree of hydrophilicity, (b) it comprises one or more residues selected from the group comprising aspartate, proline, tyrosine, and phenylalanine, (c) is has, for higher specificity, no or little homology with other known peptides or polypeptides, (d) it is sufficiently soluble, (e) it comprises no glycosylation or phosphorylation sites unless required for specific reasons, and (f) it contains at least one arginine residue that can be citrullinated (e.g., by PADI family enzymes). Alternatively, bioinformatics approaches may be followed such as, for example, those described by Moreau, V., et al., BMC Bioinformatics 2008, 9:71 (2008). Such biologically active portions can be prepared by recombinant techniques and evaluated for pesticidal activity.
The term “variant” of a citrullinated protein or peptide, or fragments thereof, refers to a polypeptide comprising an amino acid residue sequence that is at least 70, 75, 80, 85, 90, 92, 94, 95, 96, 97, 98 or 99% identical to the normal sequence of the citrullinated protein, peptide, or fragment thereof. Within the context of this disclosure, a variant of a citrullinated protein or peptide, or a fragment thereof, retains its antigenicity such that it is bound specifically under appropriate conditions by a citrullinated protein-specific autoantibody that would specifically bind to the corresponding full length citrullinated protein or peptide under appropriate conditions. In some instances, variants are modified at amino acid residues other than those essential for the biological activity, for example the ability of an antigen to bind specifically to a citrullinated protein-specific antibody, such as a citrullinated protein-specific autoantibody. In some instances, one or more such essential amino acid residues may optionally be replaced in a conservative manner or additional amino acid residues may be inserted such that the biological activity (i.e. antigenicity) of the variant polypeptide is preserved.
Such variants of citrullinated proteins or peptides and fragments thereof may be prepared, for example, by introducing deletions, insertions or substitutions in nucleic acid sequences encoding them, or by chemical synthesis or modification. Moreover, variants of citrullinated proteins or peptides and fragments thereof may also be generated by fusion with other known polypeptides or variants thereof and encompass active portions or domains, preferably having a sequence identity of at least 70, 75, 80, 85, 90, 92, 94, 95, 96, 97, 98 or 99%/o when aligned with the active portion of the reference sequence, wherein the term “active portion”, as used herein, refers to an amino acid sequence, which is less than the full length amino acid sequence or, in the case of a nucleic acid sequence, codes for less than the full length amino acid sequence, respectively, but retains at least some of the biological activity. For example, an active portion an antigenic polypeptide retains the ability to bind to an antibody or autoantibody and, preferably, when administered to mammals, causes an immune response to occur.
The one or more citrullinated proteins or peptides and fragments and variants thereof may be provided in any form and at any degree of purification, from tissues or cells comprising said polypeptides in an endogenous form, such as cells overexpressing the polypeptide and crude or enriched lysates of such cells, to purified and/or isolated polypeptides that are essentially pure. In embodiments, the one or more citrullinated proteins or peptides or antigenic fragments or variants thereof have a native configuration, wherein the term “native configuration”, as used herein, refers to a folded polypeptide, such as a folded polypeptide purified from tissues or cells, such as mammalian cells or tissues or from non-recombinant tissues or cells. In another embodiment, the one or more citrullinated proteins or peptides or antigenic fragments or variants thereof are recombinant proteins, wherein the term “recombinant”, as used herein, refers to a polypeptide produced using genetic engineering approaches at any stage of the production process, for example by fusing a nucleic acid encoding the polypeptide to a strong promoter for overexpression in cells or tissues or by engineering the sequence of the polypeptide itself. In another embodiment, the one or more citrullinated proteins or peptides or antigenic fragments or variants thereof are synthetic (chemically synthesized). Such techniques are well known in the art.
In some instances, the one or more citrullinated peptides or antigenic fragments or variants thereof can be denatured such as by heating, freezing or ultraviolet ray, or chemical treatments such as a surfactant or a denaturant. For example, such a denatured form may be prepared by treating them with sodium dodecyl sulfate (SDS) or dithiothreitol (DTT). Citrullinated peptides or antigenic fragments or variants thereof that are included in a kit or a panel as described herein can be provided within a cell, in a solution in which they are soluble, or the citrullinated peptides or fragments or variants thereof can be provided in a lyophilized form.
In some embodiments, the one or more citrullinated peptides or antigenic fragments or variants thereof can be immobilized on a solid carrier insoluble in an aqueous solution, such as via a covalent bond, electrostatic interactions, encapsulation or entrapment, for example by denaturing a globular polypeptide in a gel, or via hydrophobic interactions such as via one or more covalent bonds. Various suitable carriers, for example paper, metal, silicon or glass surfaces, microfluidic channels, membranes, beads such as magnetic beads, column chromatography media, biochips, polyacrylamide gels and the like have been described in the literature, for example in Kim, D., Herr, A. E. (2013), Protein immobilization techniques for microfluidic assays, Biomicrofluidics 7(4), 041501. This way, the immobilized molecule, together with the insoluble carrier, may be separated from an aqueous solution in a straightforward manner, for example by filtration, centrifugation or decanting. An immobilized molecule may be immobilized in a reversible or irreversible manner. For example, the immobilization is reversible if the molecule interacts with the carrier via ionic interactions that can be masked by addition of a high concentration of salt or if the molecule is bound via a cleavable covalent bond such as a disulfide bridge which may be cleaved by addition of thiol-containing reagents. By contrast, the immobilization is irreversible if the molecule is tethered to the carrier via a covalent bond that cannot be cleaved in aqueous solution, for example a bond formed by reaction of an epoxide group and an amine group as frequently used to couple lysine side chains to affinity columns. The protein may be indirectly immobilized, for example by immobilizing an antibody or other entity having affinity to the molecule, followed by formation of a complex to the effect that the molecule-antibody complex is immobilized. Various ways to immobilize molecules are described in the literature such as, for example, in Kim and Herr (2013). In addition, various reagents and kits for immobilization reactions are commercially available such as, for example, from Pierce Biotechnology.
In some embodiments, the citrullinated peptide or fragment thereof is present in a tissue section, and the method comprises contacting a tissue section with a biological sample and a detectably-labeled secondary antibody under conditions in which a complex is formed between citrullinated peptides in the tissue section, a corresponding citrullinated protein-specific autoantibody in the biological sample, if present, and the detectably-labeled secondary antibody; and (b) identifying a pattern of complex formation in the tissue sample by detecting the detectably-labeled secondary antibody, wherein the presence of a pattern of complex formation is indicative of the presence of citrullinated protein-specific autoantibodies in the biological sample, and wherein the absence of a pattern of complex formation is indicative of the absence of citrullinated protein-specific autoantibodies in the biological sample.
In some embodiments, the three components—the tissue section, the biological sample, and the detectably-labeled secondary antibody—are combined under conditions in which a complex is formed between citrullinated peptides in the tissue section, and a corresponding citrullinated protein-specific autoantibody in the biological sample, if present, and the detectably-labeled secondary antibody. Using the detectable label and appropriate detection means, the pattern of complex formation within the tissue sections is identified. The pattern of complex formation within the tissue sections is directly related to the cellular location(s) of the antigen (e.g., an antigenic citrullinated peptide) bound by an autoantibody, when present, in the biological sample. As described herein, the presence of a particular pattern of complex formation in one or more types of tissue indicates the presence of citrullinated protein-specific autoantibodies in the biological sample.
The methods of detecting cancer and/or diagnosing a subject with cancer provided herein can comprise using a protein detection and/or quantification method for detection of citrullinated peptides or proteins (i.e., rather than detecting autoantibodies that bind to citrullinated proteins). As described above and demonstrated in the Examples herein, the presence of citrullinated proteins or peptides in a subject (e.g., in a biological sample from the subject) may indicate that the subject has cancer. Methods of detecting and quantifying proteins are known to those of skill in the art and include, but are not limited to, mass spectrometry, immunoassays (e.g., ELISA), Western blots, fluorescence microscopy, and immunohistochemistry.
In some embodiments, the citrullinated peptide that is detected and/or quantified is a sequence corresponding to a portion of a protein that is known to be expressed in cancer cells. In some embodiments, the citrullinated peptide is a sequence corresponding to a portion of a protein that is highly expressed by cancer cells. In some embodiments, the protein that is highly expressed in cancer cells is not expressed or minimally expressed by corresponding normal (i.e., non-cancerous) tissues. In some embodiments, the citrullinated peptide is a sequence corresponding to a portion of a protein that is known to be expressed on the cell surface of cancer cells. In some embodiments, the citrullinated peptide is a sequence corresponding to a portion of a protein that has been targeted in cancer immunotherapy (e.g., any of the peptides listed in Table 10). In some embodiments, the citrullinated peptide is a sequence corresponding to a portion of a protein encoded by any of the genes listed in Table 1, any of the genes listed in Table 2, and/or any of the genes listed in Table 10. In some embodiments, the citrullinated peptide has at least 70% identity to any of the sequences listed in Table 2, Table 9, and/or Table 10.
In some instances, more than one of the detection methods described above may be used in a complementary manner for more reliable results. In some embodiments, other immunoassays can be performed either in alternative to or before and/or after the immunohistochemistry methods. For example, a Western blot may be performed using, for example, a panel of known antigens associated with autoantibodies, the panel including a citrullinated peptide or antigenic fragment or variants thereof, the results of which may warrant further evaluation using, for example, the immunohistochemistry methods described herein. In another example, an immunohistochemistry method as described herein may be performed, followed by a Western blot in order to, for example, further confirm the specific antigens, including the citrullinated peptide, recognized by the autoantibodies in the biological sample.
Any data demonstrating the presence or absence of a citrullinated protein-specific autoantibody and the citrullinated peptide or antigenic fragment or variant thereof may be correlated with reference data. For example, detection of a citrullinated protein-specific autoantibody may indicate that the subject who provided the sample analyzed has cancer (e.g., a specific type or subtype of cancer). If the subject has been previously diagnosed, the amount of citrullinated protein-specific autoantibodies detected at the time of prior diagnosis and in the present time may be correlated to find out about the progression of the disease and/or the success of a treatment. For example, if the amount of citrullinated protein-specific autoantibodies is found to increase, it may be concluded that the disease is progressing and/or that any treatment attempted is unsuccessful.
Also provided herein are methods of treating cancer. In some embodiments, the methods comprise targeting anti-cancer agents to a citrullinated protein or peptide. In some embodiments, the citrullinated protein or peptide is expressed by a cancer cell. Some embodiments of the diagnostic methods provided herein (e.g., the illustrative embodiments depicted in
In some embodiments, anti-cancer agents that may be used comprise immunotherapeutic agents. As used throughout, immunotherapy is a therapy that uses the subject's own immune system to treat cancer in the subject. Examples of cancer immunotherapy include, but are not limited to, monoclonal antibodies, chimeric antigen receptors, antibody-drug conjugates, bispecific antibodies (e.g., bispecific T engagers, bispecific NK cell engagers, etc.), immune checkpoint inhibitors, cancer vaccines, cytokines and interferons. In some embodiments, the immunotherapy comprises administering to the subject an antibody that specifically binds to a citrullinated protein or peptide (e.g., as described herein). In some embodiments, the antibody is a human antibody, chimeric antibody, humanized antibody, an F(ab)′2, an Fab, an Fv, a single domain antibody, a bispecific antibody, a helix-stabilized antibody, a single-chain antibody molecule, a disulfide stabilized antibody, or a domain antibody.
In some embodiments, the immunotherapy comprises administering to the subject a T cell, natural killer cell, or macrophage comprising a chimeric antigen receptor that specifically binds to a citrullinated protein or peptide (e.g., as described herein). Chimeric antigen receptors (CARs, also known as chimeric T cell receptors) are designed to be expressed in host effector cells, e.g., T cells, NK cells, or macrophages, and to induce an immune response against a specific target antigen (e.g., a citrullinated peptide as described herein) and cells expressing that antigen (e.g., cancer cells expressing citrullinated proteins or peptides as described herein). Adoptive T cell immunotherapy, in which a patient's own T lymphocytes are engineered to express CARs, has shown great promise in treating hematological malignancies. CARs can be engineered and used as described, for example, in Sadelain et al., 2013, Cancer Discov. 3:388-398. A CAR typically comprises an extracellular target-binding module, a transmembrane (TM) domain, and an intracellular signaling domain (ICD). The CAR domains can be joined via flexible hinge and/or spacer regions. The extracellular target-binding module generally comprises an antibody or antigen binding fragment thereof (e.g., an antibody or antigen binding fragment thereof that specifically binds to citrullinated proteins or peptides as described herein).
Immunotherapy can comprise the administration of monoclonal antibodies. Monoclonal antibodies are designed to attach to a specific target. The monoclonal antibodies used to treat liver cancer affect a tumor's ability to form new blood vessels, also known as angiogenesis. These therapeutics are often referred to angiogenesis inhibitors and include: Bevacizumab (Avastin), which can be used in conjunction with the immunotherapy drug atezolizumab (Tecentriq); Ramucirumab (Cyramza).
In some embodiments of the methods for treating a subject with cancer provided herein, an additional cancer therapy is administered to the subject. In some embodiments, the additional therapy comprises surgical treatment for the cancer. For example, the patient may receive surgical resection (removal of the tumor with surgery). Small tumors may also be treated with other types of treatment such as ablation or radiation. Ablation is treatment that destroys tumors without removing them. These techniques can be used in patients with a few small tumors and when surgery is not a good option. They are less likely to cure the cancer than surgery, but they can still be very helpful for some people. Ablation is best used for tumors no larger than 3 cm across. For slightly larger tumors (1 to 2 inches, or 3 to 5 cm across), it may be used along with embolization. Because ablation often destroys some of the normal tissue around the tumor, it might not be a good choice for treating tumors near major blood vessels, the diaphragm, or major bile ducts. In some embodiments, the ablation is radiofrequency ablation (RFA). In some embodiments, the ablation is microwave ablation (MWA). In some embodiments, the ablation is cryoablation (cryotherapy). In some embodiments, the ablation is ethanol (alcohol) ablation, e.g., percutaneous ethanol injection (PEI).
In some embodiments, a patient with cancer is also treated using radiation therapy. Radiation therapy uses high-energy rays, or particles to destroy cancer cells. Radiation can be helpful, e.g., in treating cancer that cannot be removed by surgery, cancer that cannot be treated with ablation or did not respond well to such treatment; cancer that has spread to areas such as the brain or bones; patients experiencing severe pain due to large cancers; and patients having a tumor thrombus.
In some embodiments, a patient with cancer is also treated using drug therapy, e.g., targeted drug therapy or chemotherapy. Targeted drugs work differently from standard chemotherapy drugs and include, e.g., kinase inhibitors; Sorafenib (Nexavar), lenvatinib (Lenvima), Regorafenib (Stivarga), and cabozantinib (Cabometyx). Common chemotherapy drugs for treating cancer include, for example: Gemcitabine (Gemzar); Oxaliplatin (Eloxatin); Cisplatin; Doxorubicin (pegylated liposomal doxorubicin); 5-fluorouracil (5-FU); Capecitabine (Xeloda); Mitoxantrone (Novantrone), or combinations thereof. Chemotherapy can be regional when drugs are inserted into an artery that leads to the part of the body with the tumor. thereby focusing the chemotherapy on the cancer cells in that area of the body and reducing side effects by limiting the amount of drug reaching the rest of the body. For example, hepatic artery infusion (HAI), or chemo given directly into the hepatic artery, is an example of a regional chemotherapy that can be used for liver cancer.
In some embodiments, the additional therapy is a monoclonal antibody therapy, checkpoint inhibitor therapy, oncolytic virus therapy, or thermal therapy.
Thus, in one aspect, provided herein is a method for treating cancer in a subject comprising administering an effective amount of an immunotherapeutic agent targeting a citrullinated peptide as described herein and at least one additional cancer treatment.
Also provided herein are methods of reducing the risk of or preventing cancer in a subject. In some embodiments, the methods comprise inducing a protective immune response in the subject by administering a composition comprising at least one citrullinated peptide or protein associated with the cancer to the subject. As used herein, a “protective immune response” refers to an immune response induced after administration of a vaccine composition to a subject where, upon exposure to the source of the antigenic component of the vaccine (e.g., a citrullinated peptide or cell expressing the antigen), the clinical symptoms elicited by the source are diminished. In some embodiments, inducing a protective immune response comprises administering a vaccine to the subject, wherein the vaccine comprises at least one citrullinated peptide or protein associated with the cancer.
In some embodiments, the vaccine comprises a cancer neoantigen. In some embodiments, the cancer neoantigen comprises a citrullinated peptide or protein. In some embodiments, the citrullinated peptide or protein is immunogenic (i.e., able to induce a protective immune response). In some embodiments, the citrullinated peptide is a sequence corresponding to a portion of a protein that is known to be expressed in cancer cells. In some embodiments, the citrullinated peptide is a sequence corresponding to a portion of a protein that is highly expressed by cancer cells. In some embodiments, the protein that is highly expressed in cancer cells is not expressed or minimally expressed by corresponding normal (i.e., non-cancerous) tissues. In some embodiments, the citrullinated peptide is a sequence corresponding to a portion of a protein that is known to be expressed on the cell surface of cancer cells. In some embodiments, the citrullinated peptide is a sequence corresponding to a portion of a protein that has been targeted in cancer immunotherapy (e.g., any of the peptides listed in Table 10). In some embodiments, the citrullinated peptide is a sequence corresponding to a portion of a protein encoded by any of the genes listed in Table 1, any of the genes listed in Table 2, and/or any of the genes listed in Table 10. In some embodiments, the citrullinated peptide has at least 70% identity to any of the sequences listed in Table 2, Table 9, and/or Table 10. In some embodiments, the citrullinated peptide is optimized for vaccination according to known methods for peptide vaccine selection and preparation (e.g., as described in Example 7 herein).
Cancer neoantigens (e.g., citrullinated peptides or proteins as described herein) can be recognized by tumor-infiltrating cytotoxic CD8+ T cells, and increased immune cell infiltration and the related cytotoxicity signatures have been observed in tumors with a higher neoantigen load. Accordingly, neoantigen presentation and load have been positively correlated with prognosis in patients with a variety of cancers and with benefit from immune-checkpoint inhibitors (ICIs) in patients with melanoma, non-small-cell lung cancer (NSCLC) or colorectal cancer with mismatch-repair deficiency. Together, these studies highlight the potential therapeutic benefit of developing immunotherapies that specifically “train” the immune system to target neoantigens.
Vaccines have traditionally been used for the prevention of infectious diseases, however, the ability of such agents to elicit and amplify antigen-specific immune responses has long been recognized as a potentially valuable tool for the treatment of cancer. Early therapeutic vaccination strategies focused on self-antigens abnormally expressed or overexpressed in tumors, termed tumor-associated antigens (TAAs), were largely unsuccessful in generating clinically effective antitumor immune responses, probably owing to the TAA-specific T cells being subject to central and/or peripheral tolerance. Such TAAs can also be expressed to some extent in nonmalignant tissues, which raises the risk of vaccine-induced autoimmune toxicities. Thus, these early studies highlighted the lack of tumor specificity and poor immunogenicity as fundamental issues to overcome in developing cancer vaccines.
Vaccines predicated on neoantigens rather than traditionally used TAAs have several advantages. First, neoantigens are exclusively expressed by tumor cells and can, therefore, elicit truly tumor-specific T cell responses, thereby preventing “off-target” damage to nonmalignant tissues. Second, the neoantigens described herein are de novo epitopes derived from citrullination of proteins expressed in cancer cells, which presents the possibility to circumvent T cell central tolerance of self-epitopes and thus induce immune responses to tumors. Personalized neoantigen-based vaccines therefore afford the opportunity to boost tumor-specific immune responses and add an additional tool to the immunotherapy toolbox. Furthermore, the potential of these vaccine-boosted neoantigen-specific T cell responses to persist and provide post-treatment immunological memory presents the possibility of long-term protection against disease recurrence.
Exemplary embodiments are described below with reference to
In some embodiments, the patient is a human. In embodiments, the present method may be performed in a veterinary context. That is, the patient may be any non-human mammal suffering from a cancer. The non-human mammal may be a research animal, a pet, livestock, a working animal, a racing animal (e.g., a horse, a dog, a camel, etc.), an animal at stud (e.g., a bull, a retired racing stallion, etc.) or any other non-human mammal for which it is desired to treat its cancer. For convenience, the present disclosure will typically refer to human patients. However, the person of ordinary skill in the art having the benefit of the present disclosure will readily be able to adapt the teachings of the present disclosure to a veterinary context.
The cancer may be any cancer known to the person of ordinary skill in the art having the benefit of the present disclosure. Generally, the present disclosure considers cancers characterized by increased protein citrullination to be of greatest interest. Examples of such cancers include, but are not limited to, breast cancers, lung cancers, skin cancers, endometrial cancers, ovarian cancers, and colorectal cancers.
The plasma sample may be taken from the patient's bloodstream and purified of intact cells by any appropriate technique known to the person of ordinary skill in the art having the benefit of the present disclosure.
The first method 1100 also comprises incubating (at 1120) the plasma sample with at least one citrullinated protein selected from the group consisting of the proteins encoded by genes listed in Table 1, the proteins encoded by genes listed in Table 2, and the proteins encoded by genes listed in Table 10, under conditions sufficient for any autoantibodies against the citrullinated protein(s) that may be present in the plasma sample, to bind to the citrullinated protein(s).
Though not to be bound by theory, as demonstrated in the Examples herein, cancers characterized by increased protein citrullination may elicit autoantibodies, i.e., the patient's immune system may develop antibodies against proteins commonly citrullinated in the patient's cancer cells, and such antibodies may circulate in the patient's plasma. On the other hand, if the patient is suspected of suffering from the cancer but is not actually suffering from the cancer, the patient's plasma will have very few, if any, autoantibodies against citrullination of proteins commonly citrullinated in the cancer.
In one embodiment, the citrullinated protein(s) is/are selected from the group consisting of citrullinated vimentin and citrullinated α-enolase.
Incubating (at 1120) may be performed by any appropriate technique. Such techniques can be performed as a routine matter by the person of ordinary skill in the art having the benefit of the present disclosure. The conditions sufficient for autoantibodies in the plasma sample, if any, to bind to the citrullinated protein(s) can be established by the person of ordinary skill in the art as a routine matter. In embodiments, the conditions may comprise immobilizing the citrullinated protein(s) on a substrate, such as, e.g., the wells of a 96-well plate, among others.
The first method 1100 also comprises incubating (at 1130) the citrullinated protein(s) and any bound autoantibodies against it/them with a detectable label, under conditions wherein the detectable label will bind to the bound autoantibodies and will substantially not bind to other molecules.
Detectable labels generally comprise (i) a binding moiety that will bind to a molecule of interest, in this case, the autoantibodies (if any) bound to the citrullinated protein(s) and (ii) a moiety that produces a detectable signal, either constitutive or induced. Examples of binding moieties that can be used in the detectable label of the first method 1100 include non-human antibodies against human immunoglobulins, among others. Examples of moieties producing detectable signals include moieties comprising radioisotopes, moieties comprising chromophores, moieties comprising fluorophores, enzymatic active sites catalyzing the conversion of a molecule to a product having a different detectable signal than the molecule, among others. Detectable labels are generally known to the person of ordinary skill in the art and need not be described in detail. Conditions wherein the detectable label will bind to the bound autoantibodies and will substantially not bind to other molecules can be implemented as a routine matter.
In one embodiment, the detectable label is an antibody against the autoantibody, wherein the antibody comprises a fluorescent moiety.
The first method 1100 also comprises detecting (at 1140) the label bound to the bound autoantibodies. The precise implementation of detecting (at 1140) will depend on the label's moiety producing a detectable signal, and is a routine matter for the person of ordinary skill in the art.
In one embodiment, the incubating (at 1120), incubating (at 1130), and detecting (at 1140) may be performed as part of an enzyme-linked immunosorbent assay (ELISA), which is a well-known and long-established assay.
The intensity of the signal detected (at 1140) can be processed to provide an amount of the label bound to the bound autoantibodies. The detected amount of the label is then considered in a decision block (1150). If the amount of the label detected is greater than or equal to a threshold, a relatively large number of autoantibodies is present, from which it can be inferred that the patient has a relatively large amount of the citrullinated protein(s) in his/her body, which is a marker of a cancer characterized by increased protein citrullination. Hence, flow passes from the decision block 1150 along the YES path to classifying (at 1160) the patient as suffering from the cancer, in response to the detected amount of the label bound to the bound autoantibodies being equal to or greater than a threshold.
On the other hand, if the amount of the label detected is less than the threshold, few if any autoantibodies are present, from which it can be inferred that the patient has a relatively low amount of the citrullinated protein(s) in his/her body, which indicates that the patient is free of a cancer characterized by increased protein citrullination. In this circumstance, flow passes from the decision block 1150 to classifying (at 1170) the patient as not suffering from the cancer, in response to the detected amount of the label bound to the bound autoantibodies being less than the threshold.
After classifying (at 1160), the patient may receive a cancer treatment, such as surgical resection, chemotherapy, monoclonal antibody therapy, checkpoint inhibitor therapy, oncolytic virus therapy, thermal therapy (e.g., RFA, microwave ablation, and/or cryotherapy), radiotherapy, and two or more thereof, among others. In the other circumstance, after classifying (at 1170), the patient may be spared the expense and inconvenience of unnecessary cancer treatment, and/or the patient may be tested for other medical conditions, including other cancers not characterized by increased protein citrullination.
The cancer may be any cancer known to the person of ordinary skill in the art having the benefit of the present disclosure. Generally, the present disclosure considers cancers characterized by increased protein citrullination to be of greatest interest. Examples of such cancers include, but are not limited to, breast cancers, lung cancers, skin cancers, endometrial cancers, ovarian cancers, and colorectal cancers.
The second method 1200 also comprises assaying (at 1220) the tissue sample for at least one citrullinated protein selected from the group consisting of the proteins encoded by genes listed in Table 1, the proteins encoded by genes listed in Table 2, and the proteins encoded by genes listed in Table 10. “Assaying” may involve any qualitative, semi-qualitative, or quantitative evaluation of the tissue sample's citrullinated protein(s) content. When two or more citrullinated proteins are assayed, they may be evaluated individual or in the aggregate.
Assaying (at 1220) may comprise any known techniques. In one embodiment, the assaying (at 1220) comprises lysing cells of the tissue sample, to yield a tissue sample cell lysate, and isolating a protein fraction from the tissue sample cell lysate. Citrullinated protein(s) in the isolated protein fraction may be identified and quantified as a routine matter by the person of ordinary skill in the art having the benefit of the present disclosure. In one embodiment, the assaying (at 1220) may comprise liquid chromatography-mass spectrometry (LC-MS).
In one embodiment, the citrullinated protein(s) is/are selected from the group consisting of citrullinated vimentin and citrullinated α-enolase.
The results of the assaying (at 1220) are considered at decision block 1230. If the tissue sample contains an amount of the citrullinated protein(s) equal to or greater than a threshold, flow passes to classifying (at 1240) the patient as suffering from the cancer. If the amount of the citrullinated protein(s) in the tissue sample is less than the threshold, flow instead passes to classifying (at 1250) the patient as not suffering from the cancer.
After classifying (at 1240), the patient may receive a cancer treatment, such as surgical resection, chemotherapy, monoclonal antibody therapy, checkpoint inhibitor therapy, oncolytic virus therapy, thermal therapy (e.g., RFA, microwave ablation, and/or cryotherapy), radiotherapy, a cancer vaccine comprising the citrullinated protein(s), a targeted therapy against the citrullinated proteins (e.g., CAR-T therapy), and two or more thereof, among others. In the other circumstance, after classifying (at 1250), the patient may be spared the expense and inconvenience of unnecessary cancer treatment, and/or the patient may be tested for other medical conditions, including other cancers not characterized by increased protein citrullination.
Turning to
The patient, the cancer, the tumor sample, and techniques for the provision thereof have been described above and/or are well-known in the art, and need not be described further.
The third method 1300 also comprises assaying (at 1320) the tumor sample for at least one citrullinated amino acid sequence selected from the group consisting of sequences and corresponding modifications listed in Table 2, Table 9, and/or Table 10, and sequences having at least 70% identity to the sequences listed in Table 2, Table 9, and/or Table 10 and comprising at least one arginine residue. In one embodiment, the citrullinated amino acid sequences(s) is/are selected from the group consisting of GVMVSHR*SGETEDTF (SEQ ID NO:43), LAQANGWGVMVSHR*SGETEDTF (SEQ ID NO:44), and AVEKGVPLYR*HIADLAGNS (SEQ ID NO:45), wherein R* is citrulline.
Assaying (at 1320) may generally be performed as the assaying action 1220 described above with reference to
The results of assaying (at 1320) are considered at decision block 1330. If the tumor sample contains an amount of (a) citrullinated amino acid sequence(s) equal to or greater than a threshold, then the third method 1300 flows to presenting (at 1340) to the immune system of the patient at least one peptide, wherein each peptide comprises at least one of the citrullinated amino acid sequences present in an amount at or above the threshold. By presenting (at 1410), the patient's immune system may develop antibodies against the citrullinated amino acid sequence, i.e., the citrullinated amino acid sequence may be considered a cancer vaccine.
The peptide(s) comprising the citrullinated amino acid sequence(s) that are presented (at 1340) may be derived from the tumor sample of the patient, may be synthesized to comprise the citrullinated amino acid sequence(s), or a combination of thereof. The peptide(s) may be provided as-is and/or may be presented by and/or encoded by an oncolytic virus.
Cancer vaccine techniques are undergoing rapid development of which the person of ordinary skill in the art having the benefit of the present disclosure will be aware.
The cancer vaccine arising from the presenting (at 1340) may be sufficient treatment for the patient's cancer. Even so, in embodiments, the third method 1300 may further comprise administering (at 1350), to the patient, an additional cancer therapy selected from the group consisting of surgical resection, chemotherapy, monoclonal antibody therapy, checkpoint inhibitor therapy, oncolytic virus therapy, thermal therapy, radiotherapy, and two or more thereof.
The additional cancer therapies referred to above are well-known and need not be described in detail. They may be implemented as a routine matter by the person of ordinary skill in the art having the benefit of the present disclosure.
Returning to the decision block 1330, if the tumor sample contains an amount of (a) citrullinated amino acid sequence(s) equal to or greater than a threshold, then flow passes out the NO node, to one of two destinations. In one embodiment, flow passes to administering (at 1350) the additional therapy/ies discussed above. Alternatively, the third method 1300 may terminate (at 1399).
Turning to
The fourth method 1400 also comprises assaying (at 1420) the tumor sample for a citrullinated protein encoded by a gene selected from the group consisting of genes listed in Table 1 as having a plasma membrane location.
The assaying (at 1420) may be substantially as described above relating to other methods. By “plasma membrane location” is meant that a preponderance of the citrullinated protein in the tumor sample is exposed on the cell surface of cancer cells.
After assaying (at 1420), flow passes to decision block 1430. If the amount of citrullinated protein is greater than or equal to a threshold amount, it may be reasonably inferred that the citrullinated protein has a significant presence on the cell surface of cancer cells. Not to be bound by theory, but it is unlikely that the citrullinated protein has any significant presence on the cell surface of healthy cells. Accordingly, the citrullinated protein may provide a target for homing in an anti-cancer agent to cancer cells possessing the citrullinated protein on their cell surfaces, with little if any attack by the anti-cancer agent on healthy tissues.
Accordingly, flow in this circumstance may follow the YES path to administering (at 1440) to the patient an anti-cancer agent targeting the citrullinated protein, in response to the tumor sample containing an amount of the citrullinated protein equal to or greater than a threshold.
Similarly to the third method 1300, the fourth method 1400 may further comprise administering (at 1450), to the patient, an additional cancer therapy not targeting the citrullinated protein, as described below. The administering (at 1450) may be performed with administering (at 1440), i.e., the fourth method 1400 may provide a combination therapy; and/or the administering (at 1450) may be performed if the assay (at 1420) discovered insufficient amounts of the citrullinated protein, thereby leading flow from decision block 1430 along the NO path. Alternatively, the NO path may lead to termination of the fourth method 1400 (at 1499).
In the detection and diagnostic methods provided herein, the presence of cancer in a subject is determined by detecting levels of biomarkers, e.g., citrullinated protein-specific autoantibody biomarkers or citrullinated protein or peptide biomarkers, in a biological sample. As used herein, a “biomarker” refers to a molecule whose level in a biological sample, e.g., a blood sample such as a plasma sample, is correlated with the presence or absence of cancer (e.g., a specific type or subtype of cancer). The levels of each of the biomarkers need not be correlated with the cancer status in all subjects; rather, a correlation will exist at the population level, such that the level is sufficiently correlated within the overall population of individuals with cancer that it can be combined with the levels of other biomarkers, in any of a number of ways, as described elsewhere herein, and used to determine the cancer status. The values used for the measured level of the individual biomarkers can be determined in any of a number of ways, including direct readouts from relevant instruments or assay systems, e.g., as described below and/or using means known to those of skill in the art. In some embodiments, the readout values of the biomarkers are compared to the readout value of a reference or control, a peptide or other molecule whose level does not vary according to cancer status and whose level is measured at the same time as the biomarkers.
The term “correlating” generally refers to determining a relationship between one random variable with another. In various embodiments, correlating a given biomarker level with the presence or absence of cancer comprises determining the presence, absence or amount of at least one biomarker in a subject with the same outcome. In specific embodiments, a set of biomarker levels, absences or presences is correlated to a particular outcome, using receiver operating characteristic (ROC) curves.
In some embodiments, as demonstrated in the Examples herein, AUC values are used as a measure of the ability of a biomarker or combination of biomarkers to determine the cancer status of an individual. The “area under curve” or “AUC” refers to area under a ROC curve. AUC under a ROC curve is a measure of accuracy. An area of 1 represents a perfect test, whereas an area of 0.5 represents an insignificant test. For suitable biomarkers as described herein, the AUC may be between 0.700 and 1. For example, the AUC may be at least about 0.700, at least about 0.750, at least about 0.800, at least about 0.810, at least about 0.820, at least about 0.830, at least about 0.840, at least about 0.850, at least about 0.860, at least about 0.870, at least about 0.880, at least about 0.890, at least about 0.900, at least about 0.910, at least about 0.920, at least about 0.930, at least about 0.940, at least about 0.950, at least about 0.960, at least about 0.970, at least about 0.980, at least about 0.990, or at least about 0.995.
Additional cancer biomarkers can be assessed and identified using any standard analysis method or metric, e.g., by analyzing data from biological samples taken from subjects with or without a diagnosis of cancer, as described in more detail elsewhere herein and as illustrated, e.g., in the Examples. In some embodiments, differences in data between groups (e.g., between samples from cancer patients and samples from healthy patients without cancer) can be evalutated using two groups comparison tests, e.g., Student's T-tests, Welch's T-tests, or the Mann-Whitney U test, or multiple comparison tests, e.g., Kruskal Wallis Tests or Analysis of Variance (ANOVA) tests. In some embodiments, principal component analysis (PCA) or partial least squares discriminate analysis (PLS-DA) can be performed. Receiver operating characteristic (ROC) curves can be generated, e.g., using the pROC package in R, and the AUCs calculated with a 95% confidence interval as well as sensitivity and specificity values. Different learning algorithms, including deep learning, gradient boosting machine, auto-machine learning, iterative random forest, LASSO regularization, and bionomial logistic regression analysis can be performed, e.g., for the analysis of combinations of multiple variables.
To determine the presence of cancer (i.e., the “cancer status”) in an individual (i.e., a subject or patient), the measured biomarker levels in a sample obtained from the individual are generally compared to reference levels, e.g., levels taken from a healthy individual without cancer. The reference control levels can be measured at the same time as the biomarker levels, i.e., using the same sample, or can be a level determined based on previous measurements.
When using multiple biomarkers, it is not necessary that all of the biomarkers are elevated or depressed relative to control levels in a sample, e.g., a plasma sample, from a given subject to give rise to a determination of cancer. For example, for a given biomarker level there can be some overlap between individuals falling into different probability categories. However, collectively the combined levels for all of the biomarkers included in the assay gives rise to an AUC score that indicates a high probability of, e.g., the presence of cancer.
In some embodiments, the levels of the selected biomarkers are quantified and compared to one or more preselected or threshold levels. Threshold values can be selected that provide an ability to predict the presence or absence of cancer. Such threshold values can be established, e.g., by calculating receiver operating characteristic (ROC) curves using a first population with cancer and a second population without cancer.
In some embodiments, measuring the levels of biomarkers described herein (e.g., citrullinated protein-specific autoantibodies or citrullinated peptides) comprises the detection and quantification (e.g., semi-quantification) of the selected biomarkers in the sample. In some embodiments, the measured biomarker levels are adjusted relative to one or more standard level(s) (“normalized”). As known in the art, normalizing is done to remove technical variability inherent to a platform to give a quantity or relative quantity.
In some embodiments, the measurement of differential levels of specific biomarkers from biological samples may be accomplished using a range of technologies, reagents, and methods. These include any of the methods of measurement as described elsewhere herein.
The biomarker levels are typically normalized following detection and quantification as appropriate for the particular platform using methods routinely practiced by those of ordinary skill in the art.
Threshold or cut-off values can be adjusted to change test performance, e.g., test sensitivity and specificity. For example, the threshold for cancer may be intentionally lowered to increase the sensitivity of the test for cancer, if desired.
Determining the accuracy of detection and/or diagnosis may involve the use of accuracy measures such as sensitivity, specificity, positive predictive value (PPV), negative predictive value (NPV), accuracy, and area under the curve (AUC) of a Receiver Operating Characteristic (ROC) curve corresponding to the diagnostic accuracy of detecting or predicting cancer.
It will be appreciated that for any particular biomarker, a distribution of biomarker levels for subjects with and without cancer may overlap. Under such conditions, a test does not absolutely distinguish the two populations (i.e., with or without cancer) with 100% accuracy, and the area of overlap indicates where the test cannot distinguish them. A threshold value is selected, above which the test is considered to be “positive” and below which the test is considered to be “negative.” The area under the ROC curve (AUC) provides the C-statistic, which is a measure of the probability that the perceived measurement will allow correct identification of a condition (see, e.g., Hanley et al., Radiology 143: 29-36 (1982)).
In some embodiments, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30 or more biomarkers are selected to discriminate between subjects with cancer and subjects without cancer with at least about 70%, 75%, 80%, 85%, 90%, 95% accuracy or having a C-statistic of at least about 0.70, 0.75, 0.80, 0.85, 0.90, 0.95.
The phrases “assessing the likelihood” and “determining the likelihood,” as used herein, refer to methods by which the skilled artisan can predict the presence or absence of a condition (e.g., cancer) in a patient. The skilled artisan will understand that this phrase includes within its scope an increased probability that a condition (e.g., cancer) is present or absent in a patient; that is, that a condition is more likely to be present or absent in a subject. For example, the probability that an individual identified as having a specified condition actually has the condition can be expressed as a “positive predictive value” or “PPV.” Positive predictive value can be calculated as the number of true positives divided by the sum of the true positives and false positives. PPV is determined by the characteristics of the predictive methods of the present methods as well as the prevalence of the condition in the population analyzed. The statistical algorithms can be selected such that the positive predictive value in a population having a condition prevalence is in the range of 70% to 99% and can be, for example, at least 70%, 75%, 76%, 77%, 78%, 79%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 900, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99%.
In other examples, the probability that an individual identified as not having a specified condition or outcome actually does not have that condition can be expressed as a “negative predictive value” or “NPV.” Negative predictive value can be calculated as the number of true negatives divided by the sum of the true negatives and false negatives. Negative predictive value is determined by the characteristics of the diagnostic or prognostic method, system, or code as well as the prevalence of the disease in the population analyzed. The statistical methods and models can be selected such that the negative predictive value in a population having a condition prevalence is in the range of about 70% to about 99% and can be, for example, at least about 70%, 75%, 76%, 77%, 78%, 79%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99%.
In some embodiments, a subject is determined to have a significant probability of having or not having a specified condition or outcome (e.g., cancer). By “significant probability” is meant that the subject has a reasonable probability (0.6, 0.7, 0.8, 0.9 or more) of having, or not having, a specified condition or outcome.
In some embodiments, a detection of cancer can be based not solely on biomarker levels, but can also take into account clinical and/or other data about the subject, e.g., clinical data about the subject's current medical state (e.g., the presence of cancer-related symptoms), the presence of any symptoms characteristic of cancer, the medical history of the subject, the presence of one or more risk factors for HCC (e.g., family history of cancer, genetic factors including mutant alleles relevant to cancer), and/or demographic data about the subject (age, sex, etc.).
Kits and devices useful for performing the provided methods of detecting cancer and/or diagnosing a subject with cancer are described below.
In one aspect, a system, e.g., measurement system is provided. Such systems allow, e.g., the detection of biomarker levels (e.g., citrullinated protein-specific autoantibody biomarkers or citrullinated protein or peptide biomarkers) in a sample and the recording of the data resulting from the detection. The stored data can then be analyzed to determine the cancer status of a subject. Such systems can comprise, e.g., assay systems (e.g., comprising an assay device and detector), which can transmit data to a logic system (such as a computer or other system or device for capturing, transforming, analyzing, or otherwise processing data from the detector). The logic system can have any one or more of multiple functions, including controlling elements of the overall system such as the assay system, sending data or other information to a storage device or external memory, and/or issuing commands to a treatment device.
Also provided is a system for detecting citrullinated protein-specific autoantibody biomarkers or citrullinated protein or peptide biomarkers in a sample, by utilizing a station for analyzing the sample (e.g., by mass spectrometry (Mass Spec or MS), liquid chromatography/mass spectrometry (LC/MS), enzyme-linked immunosorbent assay (ELISA), or immunohistochemistry (IHC)) to detect one or more (e.g., two or more) citrullinated protein-specific autoantibody biomarkers or citrullinated protein or peptide biomarkers, such as, e.g., any of the citrullinated proteins or peptides described herein (e.g., the peptides listed in Table 2, Table 9, and Table 10) or autoantibodies that specifically bind to such proteins or peptides, and the sample is a sample of a bodily fluid obtained from a subject (e.g., a blood sample such as a plasma sample) or a tissue sample obtained from a subject. In some embodiments, the system comprises a station suitable for the assay used to detect the biomarkers. For example, a mass spectrometer and/or a liquid chromatography system for MS and/or LC/MS, a plate reader for ELISA, and/or a microscope for IHC. Optionally, a station for generating a report containing information on results of the analyzing is further included.
Also provided is a method of generating a report containing information on results of the detection of citrullinated protein-specific autoantibody biomarkers or citrullinated protein or peptide biomarkers in a sample, including detecting one or more (e.g., two or more) biomarkers in the sample, and generating the report, wherein the one or more (e.g., two or more) are any of the citrullinated proteins or peptides described herein (e.g., the peptides listed in Table 2, Table 9, and Table 10) or autoantibodies that specifically bind to such proteins or peptides; the sample is a sample of a bodily fluid obtained from a subject (e.g., a blood sample such as a plasma sample) or a tissue sample obtained from the subject, and the report is useful for diagnosing cancer in the subject.
Certain aspects of the methods described herein may be totally or partially performed with a computer system including one or more processors, which can be configured to perform the steps. Thus, embodiments are directed to computer systems configured to perform the steps of methods described herein, potentially with different components performing a respective step or a respective group of steps. The computer systems of the present disclosure can be part of a measuring system as described above, or can be independent of any measuring systems. In some embodiments, the present disclosure provides a computer system that uses inputted biomarker expression (and optionally other) data, and determines the cancer status of a subject.
A computer system can include desktop and laptop computers, tablets, mobile phones and other mobile devices. The system can include various elements such as a printer, keyboard, storage device(s), monitor (e.g., a display screen, such as an LED), peripherals, devices to connect a computer system to a wide area network such as the Internet, a mouse input device, scanner, a storage device(s), computer readable medium, camera, microphone, accelerometer, and the like. Any of the data mentioned herein can be output from one component to another component and can be output to the user.
In one aspect, the present disclosure provides a computer implemented method for determining the presence or absence of cancer in a patient. The computer performs steps comprising, e.g.,: receiving inputted patient data comprising values for the levels of one or more biomarkers in a biological sample from the patient; analyzing the levels of one or more biomarkers and optionally comparing them to respective reference values, optionally comparing the biomarker levels to one or more threshold values to determine cancer status; and displaying information regarding the cancer status or probability in the patient. In certain embodiments, the inputted patient data comprises values for the levels of a plurality of biomarkers in a biological sample from the patient, e.g., biomarkers comprising one or more pairs or three-way combinations of biomarkers comprising any of the citrullinated proteins or peptides described herein (e.g., the peptides listed in Table 2, Table 9, and Table 10) or autoantibodies that specifically bind to citrullinated proteins or peptides as described herein.
In a further aspect, a diagnostic system is included for performing the computer implemented method, as described. A diagnostic system may include a computer containing a processor, a storage component (i.e., memory), a display component, and other components typically present in general purpose computers. The storage component stores information accessible by the processor, including instructions that may be executed by the processor and data that may be retrieved, manipulated or stored by the processor.
The storage component includes instructions for determining the cancer status of the subject. For example, the storage component includes instructions for determining cancer status based on biomarker levels, as described herein. The computer processor is coupled to the storage component and configured to execute the instructions stored in the storage component in order to receive patient data and analyze patient data according to one or more algorithms. The display component displays information regarding the diagnosis of the patient. The storage component may be of any type capable of storing information accessible by the processor, such as a hard-drive, memory card, ROM, RAM, DVD, CD-ROM, USB Flash drive, write-capable, and read-only memories.
The instructions may be any set of instructions to be executed directly (such as machine code) or indirectly (such as scripts) by the processor. In that regard, the terms “instructions,” “steps” and “programs” may be used interchangeably herein. The instructions may be stored in object code form for direct processing by the processor, or in any other computer language including scripts or collections of independent source code modules that are interpreted on demand or compiled in advance.
Data may be retrieved, stored or modified by the processor in accordance with the instructions. For instance, although the diagnostic system is not limited by any particular data structure, the data may be stored in computer registers, in a relational database as a table having a plurality of different fields and records, XML documents, or flat files. The data may also be formatted in any computer-readable format such as, but not limited to, binary values, ASCII or Unicode. Moreover, the data may comprise any information sufficient to identify the relevant information, such as numbers, descriptive text, proprietary codes, pointers, references to data stored in other memories (including other network locations) or information which is used by a function to calculate the relevant data. In certain embodiments, the processor and storage component may comprise multiple processors and storage components that may or may not be stored within the same physical housing. For example, some of the instructions and data may be stored on removable CD-ROM and others within a read-only computer chip. Some or all of the instructions and data may be stored in a location physically remote from, yet still accessible by, the processor. Similarly, the processor may actually comprise a collection of processors which may or may not operate in parallel. In one aspect, computer is a server communicating with one or more client computers. Each client computer may be configured similarly to the server, with a processor, storage component and instructions. Although the client computers and may comprise a full-sized personal computer, many aspects of the system and method are particularly advantageous when used in connection with mobile devices capable of wirelessly exchanging data with a server over a network such as the Internet.
Also provided herein are kits and panels that can be used for performing the methods described herein. In some embodiments, the kits and panels comprise one or more citrullinated peptides or antigenic fragments or variants thereof to which citrullinated protein-specific autoantibodies can specifically bind. As used herein, a “kit” refers to a package containing the listed materials, and instructions of any form that are provided in connection with the materials in a manner such that a clinical professional will clearly recognize that the instructions are to be associated with the materials. As used herein, “instructions” typically involve written text or graphics on or associated with packaging of materials. Instructions also can include any oral or electronic instructions provided in any manner. Written text or graphics may include a website URL or a QR code encoding a website URL, where other instructions or supplemental information may be provided in electronic form.
The peptide used in the kits and panels is preferably designed such that it is immunogenic, particularly that it binds to citrullinated protein-specific autoantibodies from subjects. In some instances, the kits include a panel as provided herein, such as a prognostic or diagnostic panel.
In certain embodiments, a kit as described herein includes one or more solubilizing agents for increasing the solubility of a peptide such as, for example, a buffer solution. The kit may further include reagents that provide a detectable signal when used in conjunction with the citrullinated peptides or fragments or variants thereof and a biological sample. In some embodiments, the kit includes a detectably-labeled secondary antibody that is able to bind to a citrullinated protein-specific autoantibody specifically binding to said one or more citrullinated peptides or fragments or variants thereof. Reagents for the detection of the secondary antibody label can also be included in the kit. The secondary antibody is detected by a method that depends on a labeling group used. Exemplary labels for secondary antibodies are described above in this disclosure.
In addition, a kit can include directions for using the citrullinated peptides or fragments or variants thereof and/or directions for practicing a method described herein; particularly, detecting citrullinated protein-specific autoantibodies in a biological sample. The concentration or amount of citrullinated protein-specific autoantibodies contained in the biological sample is indirectly measured by measuring the amount of the detectable label. The obtained measurement value may be converted to a relative or absolute concentration, amount, activity, etc. using a calibration curve or the like.
In some embodiments, a kit or a panel as provided herein includes a reference sample, such as a normal control sample. In some embodiments, a kit or a panel as provided herein includes one or more control antibody that detects an antigen that is expected to be present in a biological sample such as, for example, a biological sample from a healthy subject, or a biological sample from a subject with cancer. If such a sample is included, the obtained measurement values for such sample are compared with the results of the test sample, so that the presence or absence of cancer in the subject or biological sample can be more objectively determined.
In addition to the one or more citrullinated peptides, fragments, and/or variants, the panel can include additional polypeptides such as, for example, positive or negative controls or other antigens known to bind to autoantibodies of prognostic and/or diagnostic value, particularly those related to cancer and/or autoimmune disease.
In one aspect, provided herein is a prognostic or diagnostic device comprising a panel as described above, the panel including one or more citrullinated peptides or antigenic fragments or variants thereof. In some embodiments, such a prognostic or diagnostic panel device comprises one or more citrullinated peptides, fragments, or variants in a form as described above that allows contacting it with an aqueous solution, more preferably the liquid human sample, in a straightforward manner. In particular, the one or more citrullinated peptides, fragments, or variants may be immobilized on the surface of a carrier (also referred to as a substrate), which carrier comprises, but is not limited to glass plates or slides, biochips, microtiter plates, beads, for example magnetic beads, chromatography columns, membranes or the like. Exemplary devices include line blots, microtiter plates and biochips. In some embodiments, the device can include additional polypeptides such as, for example, positive or negative controls or other antigens known to bind to autoantibodies of prognostic and/or diagnostic value, particularly those related to cancer or autoimmune diseases as discussed above.
In some embodiments, the kits provided herein comprise a substrate; at least one citrullinated protein selected from the group consisting of the proteins encoded by genes listed in Table 1, the proteins encoded by genes listed in Table 2, and the proteins encoded by genes listed in Table 10; and instructions for performing a method, comprising instructions to provide a plasma sample from a patient suffering from a cancer or suspected of suffering from the cancer; incubate the plasma sample with the citrullinated protein(s) under conditions sufficient for any autoantibodies against the citrullinated protein(s) that may be present in the plasma sample to bind to the citrullinated protein(s); incubate the citrullinated protein(s) and any bound autoantibodies against it/them with a detectable label under conditions wherein the detectable label will bind to the bound autoantibodies and will substantially not bind to other molecules; detect the label bound to the bound autoantibodies; classify the patient as suffering from the cancer in response to the detected amount of the label bound to the bound autoantibodies being equal to or greater than a threshold; and classify the patient as not suffering from the cancer in response to the detected amount of the label bound to the bound autoantibodies being less than the threshold.
The kit may contain one or more containers, each of which can contain one or more of the materials. The kits also may contain instructions for mixing, diluting, using, or administering the materials. The kits also can include other containers with one or more solvents, surfactants, preservatives, buffers, washes, and/or diluents (e.g., normal saline (0.9% NaCl), or 5% dextrose) as well as containers for mixing, diluting, incubating, washing, etc.
The materials may be provided in any suitable form, for example, as a liquid solution or as a dried product. When the material provided is a dry product, the material may be reconstituted by the addition of solvent, which may also be provided by the kit. In embodiments where liquid forms of a material are used, the liquid form may be concentrated or ready to use.
The kit, in one embodiment, may comprise a carrier being compartmentalized to receive in close confinement one or more containers such as vials, tubes, and the like
The substrate may be as described hereinabove. The citrullinated protein(s) may also be as described hereinabove.
In addition to the substrate and the citrullinated protein(s), in embodiments, the kit may also comprise the detectable label, and any materials required to allow the label to generate a detectable signal.
The method to which the instructions relate may be as described hereinabove.
In some embodiments, the kits provided herein are useful for diagnostic methods comprising detecting citrullinated peptides and/or proteins (e.g., in a biological sample from a subject). In some embodiments, the kits comprise a cell lysis agent; and instructions for performing a method, comprising instructions to providing a tissue sample from a patient suffering from a cancer or suspected of suffering from the cancer; lyse cells of the tissue sample by exposing the tissue sample to the cell lysis agent; assay the tissue sample for at least one citrullinated protein selected from the group consisting of the proteins encoded by genes listed in Table 1, the proteins encoded by genes listed in Table 2, and the proteins encoded by genes listed in Table 10; classify the patient as suffering from the cancer, in response to the tissue sample containing an amount of the citrullinated protein(s) equal to or greater than a threshold; and classify the patient as not suffering from the cancer, in response to the tissue sample containing an amount of the citrullinated protein(s) less than the threshold.
Any cell lysis agent may be included in the kit. In one embodiment, the cell lysis agent may comprise octyl glucoside, and the instructions may further comprise instructions to prepare the octyl glucoside for use by bringing to 1% wt/vol in phosphate-buffered saline (PBS).
The kit may comprise other materials as described above or routinely included in such kits.
The method for which the kit provides instructions may be as described above.
In some embodiments, the kits provided herein comprise at least one peptide, wherein each peptide comprises at least one citrullinated amino acid sequence selected from the group consisting of sequences and corresponding modifications listed in Table 2, Table 9, and/or Table 10, and sequences having at least 70% identity to the sequences listed in Table 2, Table 9, and/or Table 10 and comprising at least one arginine residue; and instructions for performing a method, comprising instructions to: provide a tumor sample from a patient suffering from a cancer: assay the tumor sample for the citrullinated amino acid sequence(s) contained in each of the peptide(s); and present to the immune system of the patient one or more of the peptides, in response to the tumor sample containing an amount equal to or greater than a threshold of the citrullinated amino acid sequence contained in the peptide(s).
The citrullinated amino acid sequence(s) and the peptide(s) have been described above.
The method for which the instructions are provided can be as described above.
In some embodiments, the kit may further comprise a cell lysis agent, and the instructions may further comprise instructions to assay the tumor sample by lysing cells of the tumor sample, to yield a tumor sample cell lysate, and quantifying the amount of the citrullinated amino acid sequence(s) in the tumor sample cell lysate.
Alternatively or in addition, the instructions may further comprise instructions to administer, to the patient, an additional cancer therapy selected from the group consisting of surgical resection, chemotherapy, monoclonal antibody therapy, checkpoint inhibitor therapy, oncolytic virus therapy, thermal therapy, radiotherapy, and two or more thereof. In a further embodiment, wherein the additional cancer therapy is selected from the group consisting of chemotherapy, monoclonal antibody therapy, checkpoint inhibitor therapy, and oncolytic virus therapy, the kit may further comprise one or more of a chemotherapeutic agent, a monoclonal antibody, a checkpoint inhibitor, or an oncolytic virus.
In some embodiments, the kits provided herein comprise an anti-cancer agent targeting a citrullinated protein encoded by a gene selected from the group consisting of genes listed in Table 1 as having a plasma membrane location; and instructions for performing a method, comprising instructions to provide a tumor sample from a patient suffering from a cancer; assay the tumor sample for the citrullinated protein; and administer to the patient an anti-cancer agent targeting the citrullinated protein, in response to the tumor sample containing an amount of the citrullinated protein equal to or greater than a threshold.
Anti-cancer agents and citrullinated proteins have been described above.
The method for which the instructions are provided can be as described above.
In one embodiment, the kit may further comprise a cell lysis agent, and the instructions may further comprise instructions to assay the tumor sample by lysing cells of the tumor sample, to yield a tumor sample cell lysate, and quantifying the amount of the citrullinated amino acid sequence(s) in the tumor sample cell lysate.
Alternatively or in addition, the instructions may further comprise instructions to administer, to the patient, an additional cancer therapy selected from the group consisting of surgical resection, chemotherapy, monoclonal antibody therapy, checkpoint inhibitor therapy, oncolytic virus therapy, thermal therapy, radiotherapy, and two or more thereof. In a further embodiment, wherein the additional cancer therapy is selected from the group consisting of chemotherapy, monoclonal antibody therapy, checkpoint inhibitor therapy, and oncolytic virus therapy, the kit may further comprise one or more of a chemotherapeutic agent, a monoclonal antibody, a checkpoint inhibitor, or an oncolytic virus.
In embodiments, the present disclosure relates to an isolated peptide comprising at least 70% sequence identity to a peptide selected from the group consisting of peptides listed in Table 2, Table 9, and Table 10.
In some embodiments, the peptide comprises at least 6 contiguous amino acids (e.g., at least 7, at least 8, at least 9, at least 10, at least 11, at least 12, at least 13, at least 14, or at least 15 contiguous amino acids) of a peptide listed in Table 2, Table 9, and/or Table 10.
In some embodiments, the peptide is 15 amino acids or fewer (e.g., 14, 13, 12, 11, 10, 9, 8, 7, 6 amino acids or fewer) in length.
In some embodiments, the peptide is around 9 amino acids in length.
In some embodiments, the peptide is around 15 amino acids in length.
In some embodiments, the peptide is 20-25 amino acids in length.
In some embodiments, the peptide is 22-24 amino acids in length.
In some embodiments, the peptide is immunogenic.
In some embodiments, the peptide is modified, i.e., one or more bonds present in the wild-type peptide is replaced with a bond to an atom or a molecule absent from the wild-type peptide.
In some embodiments, the modification comprises conjugation to a molecule. For example, the molecule may comprise an antibody, a lipid, an adjuvant, or a detection moiety.
In some embodiments, the peptide has at least 90% sequence identity to a peptide listed in Table 2, Table 9, and/or Table 10.
In some embodiments, the peptide has 1, 2 or 3 substitutions relative to a peptide listed in Table 2, Table 9, and/or Table 10.
In some embodiments, the peptide comprises 100% sequence identity to a peptide listed in Table 2, Table 9, and/or Table 10.
The following examples are included to demonstrate preferred embodiments of the disclosure. It should be appreciated by those of skill in the art that the techniques disclosed in the examples which follow represent techniques discovered by the inventor to function well in the practice of the disclosure, and thus can be considered to constitute preferred modes for its practice. However, those of skill in the art should, in light of the present disclosure, appreciate that many changes can be made in the specific embodiments which are disclosed and still obtain a like or similar result without departing from the spirit and scope of the disclosure. Many of the following examples are further described in Katayama et al., 2021, “Protein citrullination as a source of cancer neoantigens,” Journal for ImmunoTherapy of Cancer 9:e002549, which is hereby incorporated by reference in its entirety. Reference is made to this Katayama et al. 2021 publication for illustration of certain experimental data as described in the instant disclosure.
Breast cancer cell line citrullinome analysis. Proteomic analysis was performed as previously described (refs. 19-22). For in-depth citrullinome analysis, 28 Breast cancer cell lines MCF7, MDA-MB-231, SKBR3, HCC1954, HCC1143, BT474, HCC1500, T47D, ZR75-1, HCC1937, HCC1599, HCC202, HCC1806, MDA-MB-468, HCC2218, HCC70, HCC1187, Hs578T, BT549, MCF10A, MCF12A, MDA-MB-361, HCC1395, CAMA1, HCC38, MDA-MB-436, BT20 and MDA-MB-157 were labeled with 13C6 Lys (#CNLM-2247, Cambridge isotope laboratories) in RPMI1640 containing 10% dialyzed FBS and 1% penicillin/streptomycin cocktail (Gibco). Stable Isotope Labeling by Amino acid in Cell culture (SILAC labeling) (ref. 23) was performed to discriminate FBS derived proteins from cell proteins.
For proteomic analysis of whole-cell lysates, −2×107 cells were lysed in 1 mL of phosphate-buffered saline (PBS) containing octyl-glucoside (1% w/v) and protease inhibitors (complete protease inhibitor cocktail, Roche Diagnostics), followed by sonication and centrifugation at 20,000×g with collection of the supernatant and filtration through a 0.22 μm filter. Two milligrams of whole cell extract (WCE) proteins were reduced in dithio-threitol (DTT) and alkylated with acrylamide before fractionation with reversed phase-high performance liquid chromatography (RP-HPLC). A total of 84 fractions were collected at a rate of 3 fractions/min. Mobile phase A consisted of water (H2O):acetonitrile (ACN) (95:5, v/v) with 0.1% of trifluoroacetic acid (TFA). Mobile phase B consisted of ACN:H2O (95:5) with 0.1% of TFA. Collected fractions from HPLC were dried by lyophilization, followed by in-solution digestion with trypsin (Mass Spectrometry Grade, Thermo Fisher).
Based on the chromatogram profile, 84 fractions were pooled into 24 fractions for liquid chromatography-tandem mass spectrometry (LC-MS/MS) analysis per cell line. In total, 2,688 fractions were subjected to reversed phase LC-MS/MS (RPLC-MS/MS) using a nanoflow LC system (EASYnano HPLC system, Thermo Scientific) coupled online with LTQ Orbitrap ELITE mass spectrometer (Thermo Scientific). Separations were performed using 75 μm inner diameter (id)×360 μm outer diameter (od)×25-cm-long fused-silica capillary column (Column Technology) slurry packed with 3 sm, 100 A° pore size C18 silica-bonded stationary phase. Following injection of ˜500 ng of protein digest onto a C18 trap column (Waters, 180 μm idx20 mm), peptides were eluted using a linear gradient of 0.35% mobile phase B (0.1 formic acid in ACN) per minute for 90 min, then to 95% B for an additional 10 min, all at a constant flow rate of 300 nL/min. Eluted peptides were analyzed by LTQ Orbitrap ELITE in data-dependent acquisition mode. Each full MS scan (m/z 400-1800) was followed by 20 MS/MS scans (collision-induced dissociation (CID) normalized collision energy of 35%). Acquisition of each full mass spectrum was followed by the acquisition of MS/MS spectra for the 20 most intense +2, +3 or +4 ions within a duty cycle, dynamic exclusion was enabled to minimize redundant selection of peptides previously selected for MS/MS analysis. Parameters for MS1 were 60,000 for resolution, 1×106 for automatic gain control target, and 150 ms for maximum injection time. MS/MS was done by CID fragmentation with 3×104 for automatic gain control, 10 ms for maximum injection time, 35 for normalized collision energy, 2.0 m/z for isolation width, 0.25 for activation q-value, and 10 ms for activation time.
MS/MS spectra were searched against the Uniprot human proteome database (January 2017) using Sequest HT in Proteome Discoverer V.1.4 pipeline. One fixed modification of propionamide at Cys (71.037114 Da) and three variable modifications, oxidation at Met (15.9949 Da) deamidation at Arg (0.984016 Da) and SILAC 13C6 at Lys (6.0201 Da), were chosen. The mass error allowed was 10 parts per million (ppm) for parent monoisotopic and 0.5 Da for tandem mass (MS2) fragment monoisotopic ions. Full trypsin was specified as protein cleavage site, with possibility of two missed cleavages allowed. The searched result was filtered with false discovery rate (FDR)=0.01, and the peptides with deamidated Arg at C-terminal of the tryptic peptides were considered as false identification and removed as well.
Clinical subjects for Ig-bound analysis. Plasma samples were collected from 156 women with newly diagnosed breast cancer (0-0.8 years) as cases, and 40 age-matched cancer-free women were used as control. For the cases, only patients who had no documented distant metastasis at the time of sample collection were included in this study. Written informed consent was obtained and the study was approved by the IRB at MD Anderson Cancer Center. The timing of blood draw was after the diagnostic biopsy and prior to neoadjuvant chemotherapy, or definitive surgery in patients who did not receive chemotherapy in the neoadjuvant setting (ref. 23) (Table 6 and Table 8). The additional 73 healthy control plasmas were obtained from MD Anderson Cancer Center Gynecologic Tissue Bank following Institutional Review Board approval and informed consent (Table 6).
Plasma Ig-bound work flow. Plasma Ig-bound proteins were prepared and analyzed as described previously (ref. 24). Twenty six pooled plasma samples from 156 women with newly diagnosed breast cancer and 12 pooled plasma samples from 113 cancer-free subjects as controls were analyzed (Table 6). Briefly, 100 μL of pooled plasma for each experimental condition was processed with the immuno-depletion column Hu-14 10×100 mm (#5188-6559; Agilent Technologies, Santa Clara, California, USA) to remove the top 14 high abundance proteins, Albumin, IgG, IgA, Transferrin, Haptoglobin, Fibrinogen, al-Antitrypsin, al-Acid Glycoprotein, Apolipoprotein AI, Apolipoprotein AII, Complement C3, Transthyretin, IgM, and α2-Macroglobulin. The bound fraction was used for IgG-bound protein analysis as previously described (refs. 24 and 25).
LC-high-definition MSE (HDMSE) Data were acquired in resolution mode with SYNAPT G2-S using Waters Masslynx (V.4.1, SCN851). The capillary voltage was set to 2.80 kV, sampling cone voltage to 30 V, source offset to 30 V, and source temperature to 100° C. Mobility used high-purity N2 as the drift gas in the ion-mobility spectrometry (IMS) TriWave cell. Pressures in the helium cell, Trap cell, IMS TriWave cell, and Transfer cell were 4.50 mbar, 2.47e-2 mbar, 2.90 mbar, and 2.53e-3 mbar, respectively. The IMS wave velocity was 600 m/s, the helium cell DC was 50 V, the trap DC bias was 45 V, the IMS TriWave DC bias was 3 V, and the IMS wave delay was 1000 μs. The mass spectrometer was operated in V-mode with a typical resolving power of at least 20,000. All analyses were performed using positive mode electrospray ionization (ESI) using a NanoLockSpray source. The lock mass channel was sampled every 60 s. The mass spectrometer was calibrated with a [Glu1]-fibrinopeptide solution (300 fmol/μL) delivered through the reference sprayer of the NanoLockSpray source. Accurate mass LC-HDMSE data were collected in an alternating, low energy (MS) and high energy (MSE) mode of acquisition with mass scan range from m/z 50 to 1800. The spectral acquisition time in each mode was 1.0 s with a 0.1-s inter-scan delay. In low energy HDMS mode, data were collected at constant collision energy of 2 eV in both Trap cell and Transfer cell. In high energy HDMSE mode, the collision energy was ramped from 25 to 55 eV in the Transfer cell only. The RF applied to the quadrupole mass analyzer was adjusted such that ions from m/z 300 to 2000 were efficiently transmitted, ensuring that any ions observed in the LC-HDMSE data less than m/z 300 were known to arise from dissociations in the Transfer collision cell.
The acquired LC-HDMSE data were processed and searched against the Uniprot human proteome database (January 2017) through ProteinLynx Global Server (PLGS, Waters Company) with False Discovery Rate 4%. The modification search settings and the deamidated Arg, C-terminal miss cleavage assessment were the same with the cell line citrullinome. The plot values are represented as the number of citrullinated proteins relative to the total unique peptides per sample as to adjust for batch effects that occurred during the data acquisition. The Ingenuity Pathway Analysis (IPA) network analysis of plasma IgG-bound was done by calculating the ratio of the spectral counts of the citrullinome in each BC receptor subtype compared with healthy controls with greater than or equal to 1.5.
The recombinant unmodified Vimentin (Cayman Chemical, #11234) was incubated with 100 mM HEPES (pH7.6) containing 100 mM NaCl and 1 mM CaCl2) buffer and stored under room temperature for 1 hour, and the citrullinated form was prepared. Each of the recombinant unmodified Vimentin and citrullinated Vimentin was loaded 0.1, 0.5, 1.0 μg to Criterion XT 12% gel and transferred to PVDF membrane using Trans-Blot Turbo Transfer System (BioRad, #1704150). From the Healthy control (n=8 pooled) and triple negative breast cancer (TNBC) stage II (n=11 pooled), 2 μL of the plasmas was diluted 150-fold with 0.05% casein dissolved in Tris buffered saline containing 0.01% Tween 20 (TBST) and incubated with the recombinant proteins transferred PVDF membrane for 2 hours, respectively. Then after washing the membrane with TBST, the secondary antibody of ECL anti-human IgG horseradish peroxidase-linked whole Ab (GE Healthcare #NA933) was added and stored for 1 hour under room temperature followed by Clarity Western ECL (BioRad, #170-5061) detection. The band intensities were read by ImageJ V.1.46r (imagej.nih.gov/).
Clinical subjects for the plasma ELISA assay. Eleven newly diagnosed stage II TNBC patient plasmas and 31 healthy controls were used for autoantibody assays. Case plasmas were derived from a subset of patients in the breast cohort that were used for proteomic analyses of Ig-bound proteins that had sufficient volume. The independent set of healthy control plasmas was obtained from the MD Anderson Cancer Center Gynecologic Tissue Bank as described above (Table 6 and Table 8).
Plasma autoantibody assay method. Concentrations for anti-Vimentin, anti-citrullinated Vimentin autoantibodies were determined using Luminex bead-based immunoassays on the MAGPIX instrument (Luminex Corporation, Austin Texas). Samples were analyzed in the same batch in random order. MagPlex Microspheres were conjugated with purified recombinant Vimentin (Cayman Chemical, #11234) and recombinant Citrullinated Vimentin (Cayman Chemical, #21942) at 5 μg/million concentration. The samples were tested at the final dilution of 1:16,250. The acid dissociated autoantibodies were prepared by 5 μL plasma diluted 50-fold with 0.1 M Gly-HCl (pH 3.0), stored under room temperature for 30 minutes, and exchanged buffer to Reagent Diluent Concentrate 2 buffer (R&D Systems, #841380) using Zeba spin column (ThermoFisher Scientific, #89890). The samples were incubated with MagPlex beads for 2 hrs at room temperature on a shaker. After the samples were washed with PBST, PE conjugated secondary antibody was used to incubate samples for 30 mins. The measured fluorescent intensity (MFI) of each well was read using MAGPIX instrument. The TNBC stage II plasmas in Table 6 were used for the assay and the anonymous individual subject information is presented in Table 8.
A total of 5×10′ HCC1954 and TNBC MDA-MB-468 cells were used for culture in peptidyl-citrullinome analysis using the LTQ Orbitrap ELITE as described previously (ref. 26). The MS/MS spectra were searched against the Uniprot human proteome database (January 2017) using Sequest HT in Proteome Discoverer V.1.4 pipeline. The search was conducted for amino acid lengths from 8 to 34. Two variable modifications, oxidation at Met (15.9949 Da) and deamidation at Arg (0.984016 Da), were chosen. The mass error allowed was 10 ppm for parent monoisotopic and 0.5 Da for MS2 fragment monoisotopic ions. The searched result was filtered with FDR=0.01. The identified peptides with amino acid lengths putatively from 8 to 11 were considered as MHC-I binding peptides and from 12 to 34 as MHC-IU binding peptides.
We additionally performed in silico binding affinity predictions between peptides and MHC-II molecules using the well stablished prediction tool NetMHC-II pan V.2.3 (refs. 27 and 28). Briefly, identified peptide sequences were loaded to the prediction tool NEetMHC-II pan2.3, then the data were sorted by the binding peptide core affinity prediction (IC50, nM) with MHC-II pocket assessed by the artificial intelligent network SSNAligment, and the percentile rank that is generated by comparing the peptide's score against the scores of one million random 15 mers selected from Swiss-Prot database.
F. PADI2 siRNA Knockdown
Transient knockdown of PADI2 was performed by transfecting cells with 50 nM of siControl (Silencer Select Negative Control No. 1, Thermo Fisher Scientific), siPADI2#1 (s22187, Thermo Fisher Scientific) and siPADI2#2 (s22188, Thermo Fisher Scientific) in Lipofectamine RNAiMAX (Thermo Fisher Scientific). After 72 hours of transfection, total RNA was isolated using RNeasy Mini Kit (Qiagen, Germantown, Maryland). Reverse transcription was performed with 1 ug of total RNA using High-Capacity cDNA Reverse Transcription Kit (Thermo Fisher Scientific) and real time PCR performed using TaqMan Gene Expression Assays (Hs00247108_ml, PADI2 FAM-MGB, Thermo Fisher Scientific). 18S (Hs99999901_s1, VIC-MGB, Thermo Fisher Scientific) was used as an internal control. All samples were assayed in triplicate.
Proteins were extracted using (8M Urea and 50 mM Triethylammonium bicarbonate (TEAB) with protease inhibitor cocktail (complete Protease Inhibitor Cocktail, Millipore Sigma). A total of 100 ug protein was used for tandem mass tag (TMT) labeling per channel. After reduction with TCEP (Tris(2-carboxyethyl)phosphine) and alkylation with acrylamide, proteins were digested with Lys-C (WAKO) overnight at 37° C. Digested peptides were desalted with Monospin C18 column (GL Sciences, Tokyo, Japan) and dried by speedvac. The tryptic peptides were re-suspended in 0.1M TEAB buffer/acetonitrile and reacted with each TMT channel for one hour (TMTsixplex Isobaric Label Reagent Set, Thermo Fisher) and then quenched with hydroxylamine, mixed together and dried by speedvac. Tryptic peptides were subsequently fractionated under high-pH conditions using Monospin L C18 column (GL Sciences) in 0.1% trimethylamine/acetonitrile with step elution of acetonitrile into 10 fractions.
Fractionated peptides were injected into an EASYnano HPLC system (Thermo Fisher Scientific) online coupled with Q Exactive Mass Spectrometer (Thermo Fisher Scientific). The system was equipped with a Waters Symmetry C18 nanoAcquity trap-column (180 μm×20 mm, 5 μm) and a C18 analytical column (75 μm×200 mm, 3 μm; Column Technology). The separation column temperature was set ambient, and the temperature of the tray compartment in the auto-sampler was set at 6° C. Mass spectrometer parameters were spray voltage 2.5 kV, capillary temperature 320° C., Fourier transform (FT) resolution 70,000, AGC target value 3×106, 1 microscan with 30 ms injection time. Mass spectra were acquired in a data-dependent mode in the m/z range of 350-1,800. The step gradient of Normalized Collision Energy (NCE) 20, 25, 35 was applied to induce fragmentation. Acquisition of each full mass spectrum was followed by acquisition of MS/MS spectra for the 10 most intense +2, +3 or +4 ions within a duty cycle. The acquired LC-MS/MS data were processed by the Proteome Discoverer V.1.4 (Thermo Scientific). Sequest HT was used as a search engine with the parameters including fixed modification of Cys alkylated with acrylamide (+71.03714), Lys with TMT (+229.162932, N-terminal and Lys), and variable modification of Met oxidation (+15.99491) and Arg deamidation (+0.984016). Mass error of 10 ppm was allowed for the parent MS1 and 0.02 Da was allowed for the MS2 fragments. Data were searched against the Uniprot human database (2017) and were further filtered with FDR=0.01 and TMT ratios were quantified.
Clinical subjects for breast cancer tissue microarray analysis. Breast cancer tissue microarrays (TMA; BC081120C, BR20810, and BR20811) including 127 Luminal A, 99 Luminal B, 57 HER2-enriched, and 139 TNBC from a total of 422 patient tissues, were used for assessment of PADI2 expression related to clinical parameters and co-expression with peptidyl-citrulline (Table 3).
IHC work flow. Healthy tissue microarray (TMA; BN0001a) and breast cancer tissue sections (HuCAT297, HuCAT298, 2017-16604A TNBC, Fmg0105378 Her2+, and Fmg030209B5 ER+) were purchased from US Biomax (Rockville, Maryland, USA) and healthy mammary gland tissue section was obtained from Zyagen (HP-414). Sections were de-paraffinized in xylene, rehydrated in a descending ethanol series, and then treated with 3% hydrogen peroxide for 10 min. Antigen retrieval was conducted in a pressure cooker in 1× ImmunoDNA Retriever with citrate (Bio SB, Santa Barbara, California, USA) and 0.1% Tween 20 at 121° C. for 15 min. Sections were hybridized with 1:2000-times diluted anti-PADI2 monoclonal antibody (66386-1-1g, Proteintech, Rosemont, Illinois, USA), 1:1000-times diluted anti-citrulline monoclonal antibody (Clone F95, Millipore Sigma, Burlington, Massachusetts, USA), 1:250 times diluted anti-CD20cy monoclonal antibody (Clone L26, Agilent Technologies), 1:50 times diluted anti-CD19 monoclonal antibody (Clone LE-CDI9, Agilent Technologies), and 1:1000 times diluted anti-pan Cytokeratin monoclonal antibody (Clone AE1/AE3+5D3, Abcam, Cambridge, UK) for 16 hrs at 4° C. After washing with TBS for 5 min×3, signal development was performed by Histofine DAB-2V kit (Nichirei Bioscience, Tokyo, Japan). Images were scanned by MD Anderson Cancer Center (MDACC)-North Campus Research Histology Core Laboratory and analyzed using Aperio Imagescope (Leica Biosystems, Buffalo Grove, Illinois, USA). For PADI2 and Citrulline IHC evaluation was independently performed by three operators. Positivity of cancer cells was scored as Low: 0-24%, Low-Moderate: 25-49%, Moderate-High: 50-74%, and High: 75-100/c and p-values were calculated by 2-sided X2-square test for the trend between PADI2 staining positivity and the respective comparative groups. Since BR20810 and BR20811 consisted of duplicate tissues per patient, we derived an average score.
For antigen absorption based IHC, we used a previously developed method (ref. 29) with slight modification. Anti-PADI2 antibody was 2,000× and mixed with various concentrations of recombinant citrullinated fibrinogen (18473, Cayman Chemical) and uncitrullinated fibrinogen (16088, Cayman Chemical) and incubated at 4° C. overnight. After centrifugation at 20,000×g for 30 min, the supernatant was used as primary antibody for IHC. Reactivity of the antibody was confirmed using breast invasive ductal carcinoma tissue sections and the positive staining was observed in the presence of unmodified fibrinogen whereas staining was abrogated in the presence of citrullinated fibrinogen (
The Cancer Genome Atlas (TCGA) gene expression data, HM450 methylation data, and clinical data were downloaded from cBioPortal (ref. 30). Gene expression for the Curtis dataset (ref. 31) was obtained through the Oncomine database (ref. 32).
Immune signatures were derived as previously described (ref. 33). Briefly, specific immune cell infiltration was computationally inferred using RNA-seq data based on gene sets overexpressed in one of 24 immune cell types according to Bindea et al. (ref. 34). Scoring of TCGA cancer samples for each of the immune cell signatures and for expression of Antigen Presentation MHC class I (APM1) genes (HLA-A/B/C, p2M, TAP1/2, TAPBP) or Antigen Presentation MHC class II (APM2) genes (HLA-DR/DQ/DP/DM) is described elsewhere (ref. 35).
Unsupervised hierarchical clustering heatmaps were generated using R statistical software. Figures were generated using R statistical software or GraphPad Prism V.8. Spearman's correlation analyses were performed to assess relationships between continuous variables. Fisher's exact tests were used to assess relationships between categorical variables. All statistical tests were two-sided unless specified otherwise.
Proteomic analysis of PADI family protein expression in whole cell lysates from 196 cancer cell lines were stratified by cancer type of brain, breast, colon, gastric, glioma, leukemia, small cell lung, non-small cell lung, melanoma, ovarian, pancreatic and prostate cancer. The quantitative expression was based on the spectral counts of the peptides in common as well as unique sequences that distinct PADI family members. The PADI2 protein expression was highest among other family members and relatively enriched in breast cancer (data not shown). Using mRNA expression datasets from TCGA, we further interrogated differential expression of PADI family members amongst 9,721 human tumors consisting of 32 different cancer types including ACC: Adrenocortical carcinoma, BLCA: Bladder Urothelial Carcinoma, BRCA: Breast invasive carcinoma, CESC: Cervical squamous cell carcinoma and endocervical adenocarcinoma, CHOL: Cholangiocarcinoma, COAD: Colon adenocarcinoma, DLBC: Lymphoid Neoplasm Diffuse Large B-cell Lymphoma, ESCA: Esophageal carcinoma, GBM: Glioblastoma multiforme, HNSC: Head and Neck squamous cell carcinoma, KICH: Kidney Chromophobe, KIRC: Kidney renal clear cell carcinoma, KIRP: Kidney renal papillary cell carcinoma, LAML: Acute Myeloid Leukemia, LGG: Brain Lower Grade Glioma, LIHC: Liver Hepatocellular carcinoma, LUAD: Lung adenocarcinoma, LUSC: Lung squamous cell carcinoma, MESO: Mesothelioma, OV: Ovarian serous cystadenocarcinoma, PAAD: Pancreatic adenocarcinoma, PCPG: Pheochromocytoma and Paraganglioma, PRAD: Prostate adenocarcinoma, SARC: Sarcoma, SKCM: Skin Cutaneous Melanoma, STAD: Stomach adenocarcinoma, TGCT: Testicular Germ Cell Tumors, THCA: Thyroid carcinoma, THYM: Thymoma, UCEC: Uterine Corpus Endometrial Carcinoma, UCS: Uterine carcinosarcoma, UVM: Uveal Melanom. Concordant with proteomic data, relative to the other PADI family members, PADI2 exhibited the highest mRNA expression levels (RNA Seq V2 RSEM) among the cancer types (data not shown). We determined the differential expression of PADI2 among breast cancer subtypes in 1,725 breast tumors and 144 normal breast tissues from the Curtis cohort (ref. 36). Breast tumors exhibited statistically significantly elevated mRNA expression of PADI2 compared with normal breast tissues with an increasing trend from Luminal A/B, HR-/HER2-enriched to TNBC tumors (Dunn's multiple comparison test, two-sided p<0.001,
We next examined whether the extent of protein citrullination was reflective of differential expression of PADI2 among the breast cancer subtypes. Global analysis of the protein citrullinome in whole cell lysates of 28 breast cancer cell lines (8 Luminal A/B, 5 hormone receptor (HR)-/HER2-enriched, and 15 TNBC) by mass spectrometry with a strict criteria of FDR=0.01 revealed that the total number or citrullinated spectra in the cell lines had positive correlation with PADI2 expression level (
M
LVSNR*VSWWGSTLAM
To confirm the association of PADI2 expression with protein citrullination in breast tumor tissues, we assessed PADI2 protein expression and citrullinome by means of immunohistochemistry (IHC) in 422 breast tumors (Table 3). PADI2 protein expression in the cohort of 422 breast tumors was statistically significantly higher in Grade III vs Grades I and 11 tumors (X2-square test, 2-sided P<0.0001), estrogen receptor (ER)−(HR-/HER2-enriched and TNBC) versus ER+(Luminal AB) (X2-square test, 2-sided P<0.0001) and TNBC versus Non-TNBC (X2-square test, 2 sided P<0.0001) (Table 3). PADI2 expression was not associated with age or stage in a statistically significant manner (Table 3). Staining for peptidyl-citrulline (Low+Low-Moderate vs Moderate-High+High) was significantly positively correlated with PADI2 protein expression (OR 4.45, 95% CI: 2.48-7.78; X2-square test, 2-sided P<0.0001) (Table 3, data not shown). Staining for PADI2 and peptidyl-citrulline was negative in mammary gland, colon, kidney, liver, lung, stomach, rectum and esophagus as well as the tumor's adjacent normal tissue (
We determined whether PADI2 is a major contributor to protein arginine deamination by knocking down PADI2 with siRNA in the TNBC HCC1187 cell line and performing global assessment of protein citrullination by immunoblots as well as by mass spectrometry analysis. Knockdown of PADI2 reduced PADI2 mRNA and protein (
We next evaluated whether elevated PADI2 expression is associated with a distinct tumor immunophenotype. Using TCGA-derived mRNA expression datasets for 974 breast tumors, we first performed Spearman's correlation analyses between tumor PADI2 mRNA expression and gene expression profiles of checkpoint blockade-related genes as well as gene expression signatures reflective of immune cell infiltrates (ref 34). Statistically significant positive correlations were observed between PADI2 mRNA and virtually all checkpoint blockade-related genes as well as gene expression signatures reflective of immune cell infiltrates (
To further confirm the association of PADI2 protein expression with increased tumor protein citrullination and with increased tumor infiltration of B cells, we performed IHC on breast cancer tissue section slides for PADI2, anti-peptidyl-citrulline, Pan-cytokeratin (PanCK) and B cell markers CD19 and CD20 (
To determine the extent to which autoantibody reactivity is directed against citrulline-containing epitopes, we first used a western blot approach and evaluated differential autoantibody reactivity against citrullinated- and non-citrullinated vimentin using plasma pools from patients with newly diagnosed stage II TNBC (n=11 subjects/pool) (Table 8) and healthy controls (n=8 subjects/pool). We chose to focus on vimentin as our antigen of interest as autoantibody reactivity against vimentin has previously been reported in autoimmune disease and cancer (refs. 43-46). The high autoantibody reactivity against citrullinated vimentin was observed in the TNBC stage II patient plasma pool compared with the healthy control pool (
We have experimentally confirmed the occurrence of citrullinated mass spectrometry identified peptide fragments of ENO1 in the cell surfaceome, immunopeptidome of breast cancer cell lines, and circulating plasma IgG bound citrullinated peptides among TNBC cases compared to healthy controls (refs 52, 53). All identified peptides in the immunopeptidome were found to be greater than 12 amino acids in length and thus citrullinated peptides were exclusively class II peptides and suitable for peptide vaccination (refs 54, 55).
In some cases, the experimentally identified peptides described above may not be the best amino acid length for vaccination since protease digestion was used to make the proteins and peptides the proper length for detection via mass spectrometry. To synthesize the peptides for ELISpot assays in best length of 22 to 24 amino acid length used in many of vaccination studies, we additionally performed in silico binding affinity predictions between peptides and MHC-II molecules using the well-established prediction tool NetMHC-IIpanV.2.3. software(refs 56, 57). Briefly, identified citrullinated peptide sequences were considered as unmodified form and loaded to the prediction tool, followed by sorting the binding peptide core affinity prediction (IC50, nM) with MHC-II pocket assessed by the artificial intelligent network SSNAlignment, and the percentile rank that was generated by comparing the peptide's score against the scores of one million random 15 mers selected from Swiss-Prot database.
Using ELISpot assays, we tested the unmodified and citrullinated forms of three pairs of synthesized ENO1 peptides predicted as having high MHC-II binding affinity and assessed B cell-mediated IgG and T cell-mediated IFNγ secretion (Table 9). Two of the three citrullinated peptides (peptide-1 and -2) elicited a potent IgG response; none of the unmodified peptides showed appreciable differences compared to media (
Cancer immunotherapy approaches which target overexpressed proteins on the surface of cancer cells have shown enormous clinical efficacy. However, expression of these proteins is generally not limited to cancer cells (e.g., tumors) and may also occur at a low to moderate level in normal tissues. In these cases, targeting of these proteins in immunotherapy approaches often causes negative side effects. As described above, PADI family enzymes are highly overexpressed in various cancer types and not expressed in normal tissues, which means that citrullinated proteins expressed on the surface of cancer cells may be preferable as immunotherapy targets.
Table 10 shows citrullinated sites on representative proteins that are commonly targeted in cancer immunotherapy, including TACSTD2 (TROP2), EGFR, ERBB2 and various other targets. Cancer cell surface proteins were biotin labeled, purified using streptavidin, digested with trypsin, and analyzed via mass spectrometry. The mass spectra were searched against the Uniprot Human genome database, with arginine citrullination considered as a variable modification. Multiple citrullination sites were identified from those proteins in various cancer types, and the citrullination modifications on those proteins were unreported elsewhere. Targeting citrullinated forms of proteins expressed on the surface of cancer cells (e.g., tumors) is a unique approach which further increases the specificity of cancer cell targeting (i.e., compared to targeting of normal cells, which can cause side effects).
The following U.S. patents and published patent applications are hereby incorporated herein by reference: U.S. Pat. Nos. 6,872,385; 7,547,759; 9,562,070; 9,629,877; 10,239,916; 11,155,599; US 2019/0093085, U.S. Pat. Nos. 10,088,479 and 10,695,438.
All of the methods disclosed and claimed herein can be made and executed without undue experimentation in light of the present disclosure. While the compositions and methods of this disclosure have been described in terms of preferred embodiments, it will be apparent to those of skill in the art that variations may be applied to the methods and in the steps or in the sequence of steps of the method described herein without departing from the concept, spirit and scope of the disclosure. More specifically, it will be apparent that certain agents which are both chemically and physiologically related may be substituted for the agents described herein while the same or similar results would be achieved. All such similar substitutes and modifications apparent to those skilled in the art are deemed to be within the spirit, scope and concept of the disclosure as defined by the appended claims.
Disclosed herein are materials, compositions, and methods that can be used for, can be used in conjunction with or can be used in preparation for the disclosed embodiments. These and other materials are disclosed herein, and it is understood that when combinations, subsets, interactions, groups, etc. of these materials are disclosed that while specific reference of each various individual and collective combinations and permutations of these compositions may not be explicitly disclosed, each is specifically contemplated and described herein. For example, if a method is disclosed and discussed, and a number of modifications that can be made to a number of molecules included in the method are discussed, each and every combination and permutation of the method, and the modifications that are possible are specifically contemplated unless specifically indicated to the contrary. Likewise, any subset or combination of these is also specifically contemplated and disclosed. This concept applies to all aspects of this disclosure including, but not limited to, steps in methods using the disclosed compositions. Thus, if there are various additional steps that can be performed, it is understood that each of these additional steps can be performed with any specific method steps or combination of method steps of the disclosed methods, and that each such combination or subset of combinations is specifically contemplated and should be considered disclosed.
Publications cited herein and the material for which they are cited are hereby specifically incorporated by reference in their entireties. The following description provides further non-limiting examples of the disclosed compositions and methods.
This application claims priority to and the benefit of U.S. Provisional Patent Application Ser. No. 63/168,164, filed Mar. 30, 2021, the contents of which is incorporated herein by this reference as if fully set forth herein.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/US2022/071434 | 3/30/2022 | WO |