The present invention generally relates to combinatorial post-translationally-modified histone peptides and arrays thereof. The invention further relates to methods of using the same.
Protein posttranslational modifications (PTMs), such as phosphorylation, methylation, acetylation, and ubiquitination, regulate many processes, such as protein degradation, protein trafficking, and mediation of protein-protein interactions. Perhaps the best-studied PTMs are those found to be associated with histone proteins.
The enormous number of potential combinations of histone PTMs represents a major obstacle to our understanding of how PTMs regulate chromatin-templated processes, as well as to our ability to develop high-quality diagnostic tools for chromatin and epigenetic studies. A major limitation in exploring the full extent of the histone code has been the lack of a comprehensive library of modified histone peptides that can be used to rapidly and efficiently screen for effector proteins that bind to unique modification patterns.
The present invention addresses previous shortcomings in the art by providing combinatorial post-translationally-modified histone peptide, arrays thereof, and methods of using the same.
A first aspect of the present invention comprises a plurality of synthetic histone peptides, wherein a portion of the synthetic histone peptides comprise at least one post-translational modification, and wherein a portion of the synthetic histone peptides are at least 21 amino acids in length.
A second aspect of the present invention comprises a peptide array comprising: a substrate comprising a surface; and a plurality of synthetic histone peptides immobilized on the substrate surface, wherein a portion of the synthetic histone peptides comprise at least one post-translational modification, and wherein a portion of the synthetic histone peptides are at least 21 amino acids in length.
Another aspect of the present invention comprises a method for determining the binding of a protein to a peptide comprising: providing a peptide array comprising: a substrate comprising a surface; and a plurality of synthetic histone peptides immobilized on the substrate surface, wherein a portion of the synthetic histone peptides comprise at least one post-translational modification, and wherein a portion of the synthetic histone peptides are at least 21 amino acids in length; applying a protein to the peptide array; and detecting binding of the protein to one or more synthetic histone peptides in the peptide array.
A further aspect of the present invention comprises a method for detecting the influence of neighboring post-translational modifications on protein binding comprising: providing a peptide array comprising: a substrate comprising a surface; and a plurality of synthetic histone peptides immobilized on the substrate surface, the plurality of synthetic histone peptides comprising peptides with no post-translational modifications, peptides with one post-translational modification, and peptides with more than one post-translational modification, wherein a portion of the synthetic histone peptides are at least 21 amino acids in length; applying a protein to the peptide array; detecting binding of the protein to one or more synthetic histone peptides in the peptide array; and comparing the sequences of the synthetic histone peptides bound to the protein, thereby detecting the influence of neighboring post-translational modifications on protein binding.
The foregoing and other aspects of the present invention will now be described in more detail with respect to other embodiments described herein. It should be appreciated that the invention can be embodied in different forms and should not be construed as limited to the embodiments set forth herein. Rather, these embodiments are provided so that this disclosure will be thorough and complete, and will fully convey the scope of the invention to those skilled in the art.
The present invention will now be described more fully hereinafter. This invention may, however, be embodied in different forms and should not be construed as limited to the embodiments set forth herein. Rather, these embodiments are provided so that this disclosure will be thorough and complete, and will fully convey the scope of the invention to those skilled in the art.
The terminology used in the description of the invention herein is for the purpose of describing particular embodiments only and is not intended to be limiting of the invention. As used in the description of the invention and the appended claims, the singular forms “a”, “an” and “the” are intended to include the plural forms as well, unless the context clearly indicates otherwise.
Unless otherwise defined, all terms (including technical and scientific terms) used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs. It will be further understood that terms, such as those defined in commonly used dictionaries, should be interpreted as having a meaning that is consistent with their meaning in the context of the present application and relevant art and should not be interpreted in an idealized or overly formal sense unless expressly so defined herein. The terminology used in the description of the invention herein is for the purpose of describing particular embodiments only and is not intended to be limiting of the invention. All publications, patent applications, patents and other references mentioned herein are incorporated by reference in their entirety. In case of a conflict in terminology, the present specification is controlling.
Also as used herein, “and/or” refers to and encompasses any and all possible combinations of one or more of the associated listed items, as well as the lack of combinations when interpreted in the alternative (“or”).
Unless the context indicates otherwise, it is specifically intended that the various features of the invention described herein can be used in any combination. Moreover, the present invention also contemplates that in some embodiments of the invention, any feature or combination of features set forth herein can be excluded or omitted. To illustrate, if the specification states that a complex comprises components A, B and C, it is specifically intended that any of A, B or C, or a combination thereof, can be omitted and disclaimed.
As used herein, the transitional phrase “consisting essentially of” (and grammatical variants) is to be interpreted as encompassing the recited materials or steps “and those that do not materially affect the basic and novel characteristic(s)” of the claimed invention. See, In re Herz, 537 F.2d 549, 551-52, 190 U.S.P.Q. 461, 463 (CCPA 1976); see also MPEP §2111.03. Thus, the term “consisting essentially of” as used herein should not be interpreted as equivalent to “comprising.”
The term “about,” as used herein when referring to a measurable value such as an amount or concentration (e.g., the amount of a peptide) and the like, is meant to encompass variations of ±10%, ±5%, ±1%, ±0.5%, or even ±0.1% of the specified amount. A range provided herein for a measureable value may include any other range and/or individual value therein.
The present invention comprises, consists essentially of, or consists of a synthetic histone peptide. A “synthetic histone peptide” is a peptide that is synthetically produced and comprises an amino acid sequence similar to a naturally occurring histone amino acid sequence. Histones are known in the art and as those of skill in the art will appreciate, the amino acid sequence of a histone can be obtained by known methods. For example, amino acid sequences useful to the present invention can be obtained through publicly available databases, such as the National Center for Biotechnology Information (NCBI) database. Exemplary histones include, but are not limited to, H1, H2A, H2B, H3, H4, H5, or any combination thereof. In some embodiments of the present invention, the amino acid sequence of a synthetic histone peptide can be similar to one or more, such as 2, 3, 4, or more, naturally occurring histone amino acid sequences. Synthetic histone peptides of the present invention can be synthesized using methods known in the art, such as, but not limited to chemical peptide synthesis methods, including using an automated peptide synthesizer. The amino acid sequence of a synthetic histone peptide can be similar to a N-terminal tail of a histone, a C-terminal tail of a histone, an internal region of a histone, or any combination thereof. In some embodiments of the present invention, the amino acid sequence of a synthetic histone peptide is similar to a N-terminal tail of a histone or a C-terminal tail of a histone.
A synthetic histone peptide can comprise from 10 to 40 amino acids in length or any range therein, such as, but not limited to, from 15 to 35 amino acids or 20 to 30 amino acids. In particular embodiments of the present invention, a synthetic histone peptide is 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, or 40 amino acids in length, or any range therein. In certain embodiments of the present invention, a synthetic histone peptide is at least 21 amino acids in length, e.g., at least 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, or 40 amino acids in length, or any range therein.
“Similar” as used herein in reference to the amino acid sequence of a synthetic histone peptide and a naturally occurring histone amino acid sequence refers to a synthetic histone amino acid sequence that is at least about 70%, 75%, 80%, 85%, 90%, 95%, 97%, 98%, 99%, or 100% identical to a naturally occurring histone amino acid sequence. In some embodiments of the present invention, the amino acid sequence of a synthetic histone peptide is about 70%, 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78%, 79%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, 100%, or any range therein, identical to a naturally occurring histone amino acid sequence. According to some embodiments of the present invention, a section or piece of a synthetic histone peptide (e.g., 5 to 25 consecutive amino acids or any range therein) can be similar to one or more naturally occurring histone amino acid sequences.
A synthetic histone peptide of the present invention can be “similar” to a naturally occurring histone amino acid sequence in that during the design and/or preparation of the synthetic histone peptide, a naturally occurring histone amino acid sequence is used as a template and/or model amino acid sequence, but one or more amino acids in the naturally occurring histone amino acid sequence are changed and/or modified to comprise a post-translational modification, a different amino acid, and/or an amino acid derivative. “Amino acid derivative” as used herein, refers to an amino acid substituted with one or more substituents. Exemplary substituents include, but are not limited to, alkyl, lower alkyl, halo, haloalkyl, alkenyl, alkynyl, cycloalkyl, cycloalkylalkyl, heterocyclo, heterocycloalkyl, aryl, arylalkyl, lower alkoxy, thioalkyl, hydroxyl, thio, mercapto, amino, imino, halo, cyano, nitro, nitroso, azido, carboxy, sulfide, sulfone, sulfoxy, phosphoryl, silyl, silylalkyl, silyloxy, boronyl, modified lower alkyl, and any combination thereof. Exemplary amino acid derivatives include, but are not limited to, alanine methyl ester, valine ethyl ester, phenylalainamide, N-acetyl-tyrosine, and O-benzyl-tyrosine. In some embodiments of the present invention, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10 or more amino acids in a naturally occurring histone amino acid sequence are changed and/or modified to produce a synthetic histone peptide of the present invention. In particular embodiments of the present invention, the change and/or modification comprises one or more post-translational modifications.
“Post-translational modification” as used herein refers to a chemical modification to an amino acid. In naturally occurring peptides and/or proteins post-translational modifications occur after the peptide/protein is translated. Thus, as those skilled in the art will recognize, a naturally occurring histone amino acid sequence can comprise one or more post-translational modifications. Post-translational modifications include, but are not limited to, phosphorylation, methylation (e.g., lysine methylation (mono-, di-, or trimethylation) and arginine methylation (mono, asymmetric dimethylation, or symmetric dimethylation)), acetylation, ubiquitination, myristoylation, palmitoylation, isoprenylation, prenylation, acylation, glycosylation, hydroxylation, iodination, oxidation, sulfation, selenoylation, SUMOylation, citrullination, deamidation, carbamylation, ADP-ribosylation, lysine crotonylation, formylation, propionyllysine, butyryllysine, or any combination thereof. One or more post-translational modifications of a synthetic histone peptide of the present invention can comprise changing and/or modifying an amino acid during and/or after the synthesis of the synthetic histone peptide.
In some embodiments of the present invention, a synthetic histone peptide comprises one or more post-translational modifications, such as 2, 3, 4, 5, 6, 7, 8, 9, or more post-translational modifications. When more than one post-translational modification is present in a synthetic histone peptide of the present invention, the post-translational modifications can be the same and/or different. For example, the post-translational modifications can be the same type of post-translation modification (e.g., methylation) on different amino acids, which can be the same (e.g., the two modified amino acids are lysine) or different (e.g., the two modified amino acids are lysine and serine). When the two or more post-translational modifications are different types (e.g., methylation and acetylation), the modifications are on different amino acids, which can be the same or different. In certain embodiments of the present invention, two or more different types of post-translational modifications, such as 2, 3, 4, 5, 6, 7, 8, 9, or more, are present in a synthetic histone peptide of the present invention.
The one or more post-translational modifications in a synthetic histone peptide of the present invention can be the same as and/or different than the post-translational modifications found in a naturally occurring histone amino acid sequence. For example, a post-translational modification can be the same type of post-translation modification (e.g., methylation) on a specific amino acid in both a naturally occurring histone amino acid sequence and synthetic histone peptide sequence. A post-translational modification in a synthetic histone peptide of the present invention can be different compared to a post-translation modification on a specific amino acid in a naturally occurring histone amino acid sequence (e.g., the modification is methylation on a specific lysine in a synthetic histone peptide and acetylation on the corresponding lysine in a naturally occurring histone amino acid sequence). Similarly, a synthetic histone peptide of the present invention can comprise an amino acid that is not post-translationally modified (i.e., an unmodified amino acid) as it may be found in a naturally occurring histone amino acid sequences (i.e., a naturally occurring histone amino acid sequence comprises a post-translational modification on a specific amino acid and a similar synthetic histone peptide does not comprise that modification, but rather is an unmodified amino acid). Exemplary synthetic histone peptides of the present invention include, but are not limited to, those shown in Tables 1 and 2.
According to some embodiments of the present invention, the amino acid sequence of a synthetic histone peptide is modeled after a naturally occurring histone sequence and modified to include different and/or additional post-translational modifications that may exist and/or to provide different combinations of post-translational modifications within the peptide sequence. A synthetic histone peptide sequence can comprise combinations of two or more naturally occurring histone sequences from the same histone and/or a different histone, such as 2, 3, 4, or more histones. For example, a synthetic histone peptide sequence can comprise 5 to 20 amino acids from H3 and 5 to 20 amino acids from H4. Combinations of naturally occurring histone sequences can allow for the testing and/or determination of how modifications on different regions of a histone and/or on different histones can affect protein binding.
A synthetic histone peptide of the present invention can be characterized by one or more chemical and/or biological assays and/or techniques known to those of skill in the art. Characterization of a synthetic histone peptide of the present invention can used to determine and/or ensure quality of a peptide and/or to determine the specific chemical composition of a peptide. In some embodiments of the present invention, a synthetic histone peptide of the present invention is characterized using one or more of the following: high-performance liquid chromatography; mass spectrometry, such as electrospray mass spectrometry and matrix-assisted laser desorption mass spectrometry; nuclear magnetic resonance; or Edman degradation, including automated Edman degradation
A synthetic histone peptide of the present invention can have a purity of at least about 75% or more, such as about 80%, 85%, 90%, 95%, 99%, or more prior to being combined with another peptide and/or compound and/or used in an array of the present invention and/or method of the present invention. In particular embodiments of the present invention, a synthetic histone peptide of the present invention has a purity of about 75%, 76%, 77%, 78%, 79%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100%, or any range therein.
One aspect of the present invention comprises, consists essentially of, or consists of a plurality of synthetic histone peptides of the present invention. A “plurality” as used herein refers to a group of two or more different peptides. In particular embodiments of the present invention, a plurality of synthetic histone peptide can comprise 5, 10, 15, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 85, 90, 95, 100, 125, 150, 175, 200, 250, 300, or more, or any range therein, different synthetic histone peptides. In some embodiments of the present invention, a plurality includes at least 5 synthetic histone peptides from Table 1 and/or Table 2.
Another aspect of the present invention includes one or more peptide arrays that comprise, consist essentially of, or consist of a plurality of synthetic peptides of the present invention. In some embodiments of the present invention, a peptide array comprises a substrate comprising a surface and a plurality of synthetic histone peptides immobilized on the substrate surface. “Peptide array” as used herein, refers to a series of peptides arranged in a two or three dimensional manner. The peptides can be arranged in a pattern and/or ordered manner, such as a spiral or grid pattern, and/or in an irregular manner. In particular embodiments of the present invention, the peptides are arranged in grids and/or rows and columns with areas containing no peptides located between adjacent peptides.
“Substrate” as used herein refers to any material onto which a synthetic histone peptide of the present invention can be arranged and/or immobilized. Exemplary substrates include, but are not limited to, wafers, slides, well plates, and membranes. The substrate can be porous or nonporous and/or rigid or semi-rigid. The substrate can comprise one or more materials such as, but not limited to, polymeric materials (e.g., polystyrene, polyvinyl acetate, polyvinyl chloride, polyvinyl pyrrolidone, polyacrylonitrile, polyacrylamide, polymethyl methacrylate, polytetrafluoroethylene, polyethylene, polypropylene, polyvinylidene fluoride, polycarbonate, and divinylbenzene styrene-based polymers), agarose (e.g., Sepharose™), dextran (e.g., Sephadex™), cellulosic polymers and other polysaccharides, silica and silica-based materials, glass (e.g., controlled pore glass) and functionalized glasses, ceramics, or any combination thereof. In particular embodiments of the present invention the substrate comprises glass, such as, but not limited to, a glass slide.
The substrate comprises a surface. In some embodiments of the present invention, the substrate comprises one or more surface coatings. Exemplary surface coatings include, but are not limited to, polymers such as aminosilane and poly-L-lysine, microporous polymers (e.g., cellulosic polymers such as nitrocellulose), microporous metallic compounds (e.g., microporous aluminum), antibody-binding proteins, bisphenol A polycarbonate, and one half of a binding pair, such as streptavidin. In particular embodiments of the present invention, the substrate surface is coated with one half of a binding pair. “Binding pair” as used herein refers to any molecule that is able to specifically bind to another molecule, such as, but are not limited to, streptavidin to biotin, avidin to biotin, a receptor to a ligand, and an antibody to an antigen. In particular embodiments of the present invention, the substrate surface is coated with streptavidin.
A plurality of synthetic histone peptides of the present invention can be immobilized on the substrate surface of a peptide array of the present invention. “Immobilize” and grammatical variants thereof as used herein refer to a synthetic histone peptide being attached or bound (e.g., covalently or non-covalently) to the substrate surface either directly or indirectly. In particular embodiments of the present invention, a synthetic histone peptide is immobilized onto the substrate surface using a binding pair. This can allow for the synthetic histone peptide of the present invention to comprise a greater freedom of rotation compared to being directly bound and/or immobilized onto the substrate surface. In some embodiments of the present invention, streptavidin is coated on the substrate surface and biotin is attached to a synthetic histone peptide of the present invention.
A plurality of synthetic histone peptides of the present invention can be immobilized onto the substrate surface of a peptide array of the present invention at a high density. “High density” as used herein refers to a peptide array comprising a density of at least about 1,000 peptides per square centimeter of the substrate surface of the array. In particular embodiments of the present invention, a peptide array has a density of at least about 1,500, 2,000, 2,500, 5,000, 10,000, 25,000, 50,000, 75,000, 100,000, or more peptides per square centimeter of the substrate surface of the array. In some embodiments of the present invention, a peptide array has a density of at least about 2,600 peptides per square centimeter of the substrate surface of the array.
Immobilization of a synthetic histone peptide can be accomplished by spotting a peptide onto the substrate surface, “Spotting” and grammatical variants thereof as used herein, refer to contacting, placing, dropping, dripping, and the like, the peptide onto one or more specific locations (i.e., spots of any size or shape) on the substrate surface to immobilize the peptide onto the substrate surface. In some embodiments of the present invention, a synthetic histone peptide of the present invention can be immobilized on a peptide array once (i.e., one spot) or as a series of two or more spots on an array, such as 2, 4, 6, 8, 12, or more spots on an array, or any range therein. When a synthetic histone peptide of the present invention is spotted two or more times on a peptide array, the spots can be sequential (e.g., adjacent to one another) and/or nonsequential (e.g., placed in a different order and/or nonadjacent to one another) on the peptide array. In particular embodiments of the present invention, a synthetic histone peptide of the present invention can be immobilized on a peptide array as a series of six spots, two different times on the peptide array. A spot of a synthetic histone peptide of the present invention on the substrate surface can comprise a synthetic histone peptide having the same or a different sequence. For example, a spot can comprise a single synthetic histone peptide of the present invention (i.e., the spot comprises synthetic histone peptides comprising the same amino acid sequence) or a spot can comprise a combination of synthetic histone peptides of the present invention (i.e., the spot comprises synthetic histone peptides comprising two or more different amino acid sequences).
A spot can be from about 25 mm to about 700 mm in diameter or any range therein, such as from about 50 mm to about 500 mm or about 150 mm to about 300 mm in diameter. In some embodiments of the present invention, a spot is about 200 mm in diameter. The spacing between adjacent spots on the substrate surface of an array can be from 50 mm to 1000 mm or any range therein, such as from about 100 mm to about 800 mm or about 200 mm to about 500 mm. In some embodiments of the present invention, a spot is spaced apart from a next adjacent spot by about 375 mm. The diameter of one or more spots on a peptide array of the present invention and spacing between the one or more spots on a peptide array of the present invention can be substantially constant (i.e., varying by less than 15%, such as less than 10%, 5%, etc.) or the diameter and/or spacing can vary. In some embodiments of the present invention, the diameter of one or more spots and/or the spacing between the one or more spots on a peptide array of the present invention can be manipulated and/or designed to accommodate one or more features desired for a peptide array of the present invention. For example, the diameter of one or more spots and/or spacing between the one or more spots on a peptide array of the present invention can be changed depending on the number of peptides desired to be immobilized on the peptide array.
As described above, the amino acid sequence of a synthetic histone peptide of the present invention can be similar to a N-terminal tail of a histone, a C-terminal tail of a histone, an internal region of a histone, or any combination thereof. A synthetic histone peptide of the present invention can be immobilized on a peptide array at either terminus (i.e., the N-terminus or C-terminus of the synthetic histone peptide) or at any location of the peptide (e.g., the middle of the peptide). In some embodiments of the present invention, the N-terminus or C-terminus of the synthetic histone peptide is immobilized on a peptide array. When a synthetic histone peptide comprises an amino acid sequence similar to a naturally occurring histone amino acid sequence in the N-terminal tail of a histone, then the C-terminus of the synthetic histone peptide is immobilized on the substrate surface. Similarly, when a synthetic histone peptide comprises an amino acid sequence similar to a naturally occurring histone amino acid sequence from the C-terminal tail of a histone, then the N-terminus of the synthetic histone peptide is immobilized on the substrate surface. This can allow for the synthetic histone peptide to better model a naturally occurring histone amino acid sequence.
A plurality of synthetic histone peptides of the present invention can comprise one or more of the features described above for a synthetic histone peptide of the present invention. In some embodiments of the present invention, one or more portions of the synthetic histone peptides on a peptide array comprise one or more features described above for a synthetic histone peptide of the present invention. The various portions of synthetic histone peptides may or may not overlap. A “portion” as used herein can refer to any fraction of the total number of synthetic histone peptides in a plurality or on a peptide array, such as about 1% to about 100% of the total number of synthetic histone peptides on a peptide array or any range therein, such as about 5% to about 95% or about 20% to about 50%. In particular embodiments of the present invention, a portion refers to about 1%, 2%, 3%, 4%, 5%, 6%, 7%, 8%, 9%, 10%, 11%, 12%, 13%, 14%, 15%, 16%, 17%, 18%, 19%, 20%, 21%, 22%, 23%, 24%, 25%, 26%, 27%, 28%, 29%, 30%, 31%, 32%, 33%, 34%, 35%, 36%, 37%, 38%, 39%, 40%, 41%, 42%, 43%, 44%, 45%, 46%, 47%, 48%, 49%, 50%, 51%, 52%, 53%, 54%, 55%, 56%, 57%, 58%, 59%, 60%, 61%, 62%, 63%, 64%, 65%, 66%, 67%, 68%, 69%, 70%, 71%, 72%, 73%, 74%, 75%, 76%, 77%, 78%, 79%, 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, 100%, or any range therein. For example, a portion (e.g., about 50% or more) of the synthetic histone peptides can comprise at least one post-translational modification and/or a portion (e.g., about 50% or more) of the synthetic histone peptides can be at least 21 amino acids in length. These two portions may contain the same and/or different synthetic histone peptides. Thus, as those skilled in the art will appreciate, a plurality of synthetic histone peptides of the present invention can provide a large number of peptides with one or more features that can be the same and/or different from one another.
In some embodiments of the present invention, a peptide array of the present invention can further comprise a positive control. A positive control can aid in determining the quality of the spotting of a synthetic histone peptide of the present invention. A positive control can be bound to the substrate surface using a binding pair, such as, but not limited to, streptavidin and biotin. In particular embodiments of the present invention, a positive control is separate from a synthetic histone peptide of the present invention. Thus, in some embodiments of the present invention, a positive control does not bind, attach, and/or immobilize to the substrate surface using a synthetic histone peptide and/or is not bound and/or attached to the synthetic histone peptide.
A positive control of the present invention can comprise any compound that is detectable, such as, but not limited to, a fluorescent compound. In some embodiments of the present invention, a fluorescent compound is bound to one half of a binding pair, such as, but not limited to, biotin. Exemplary fluorescent compounds include, but are not limited to, fluoresceins, such as TET (Tetramethyl fluorescein), 2′,7′-dimethoxy-4′,5′-dichloro-6-carboxyfluorescein (JOE),6-carboxyfluorescein (HEX) and 5-carboxyfluorescein (5-FAM); phycoerythrins; resorufin dyes; coumarin dyes; rhodamine dyes, such as 6-carboxy-X-rhodamine (ROX); cyanine dyes; BODIPY dyes; quinolines; pyrenes; N,N,N′,N′-tetramethyl-6-carboxyrhodamine (TAMRA); acridine; stilbene; Texas Red; as well as derivatives thereof. In some embodiments of the present invention, the fluorophore is a rhodamine dye or a BODIPY dye and in other embodiments the fluorophore is 6-aminoquinoline. In particular embodiments of the present invention, the positive control is a fluorescein.
The present invention also encompasses methods of using a synthetic histone peptide of the present invention, a plurality of synthetic histone peptides of the present invention and/or a peptide array of the present invention. In certain embodiments of the present invention, a synthetic histone peptide of the present invention, a plurality of synthetic histone peptides of the present invention, and/or a peptide array of the present invention can be utilized in one or more chemical and/or biological assays, such as, but not limited to, protein assays, enzyme assays, antibody assays, cellular assays, or combinations thereof.
In some embodiments of the present invention, a method for determining the binding of a protein to a peptide is provided comprising providing a peptide array of the present invention, applying a protein to the peptide array, and detecting binding of the protein to one or more synthetic histone peptides in the peptide array.
In other embodiments of the present invention, a method for detecting the influence of neighboring post-translational modifications on protein binding is provided comprising providing a peptide array of the present invention, applying a protein to the peptide array, detecting binding of the protein to one or more synthetic histone peptides in the peptide array, and comparing the sequences of the synthetic histone peptides bound to the protein, thereby detecting the influence of neighboring post-translational modifications on protein binding. In some embodiments of the present invention, the method for detecting the influence of neighboring post-translational modifications on protein binding comprises providing a peptide array comprising a plurality of synthetic histone peptides of the present invention, wherein the plurality comprises peptides with a similar sequence (e.g., peptides modeled after a particular sequence from one or more histones). The plurality of synthetic histone peptides with a similar sequence can comprise peptides with no post-translational modifications, peptides with one post-translational modification, and peptides with more than one post-translational modifications. Thus, different combinations of post-translational modifications can be compared.
Binding of a protein to a peptide array of the present invention can be accomplished by methods known in the art. For example, protein binding can be detected by methods including, but not limited to, chemical and/or biological assays, such as, but not limited to, western blot methods, and/or techniques, such as, but not limited to, fluorescence, immunoprecipitation, and chromatography, or any combination thereof.
According to some embodiments of the present invention, a method of the present invention provides for the visual detection of protein binding to one or more synthetic histone peptides in a peptide array of the present invention. Any protein can be used in the methods of the present invention, such as but not limited to, an antibody, an enzyme, a histone-interacting protein, or any combination thereof. Exemplary antibodies include, but are not limited to, the antibodies listed in Table 3, below. Exemplary enzymes include, but are not limited to, peptidases, proteases, lipases, kinases, histone-modifying enzymes (e.g., methyltransferases, deacetylases, acetyltransferases, etc.), or any combination thereof. Exemplary histone-interacting proteins include, but are not limited to, CHD1, RAG2, BTPF, or any combination thereof.
The present invention is explained in greater detail in the following non-limiting Examples.
Protein posttranslational modifications (PTMs), such as phosphorylation, methylation, acetylation, and ubiquitination, regulate many processes, such as protein degradation, protein trafficking, and mediation of protein-protein interactions [1]. Perhaps the best-studied PTMs are those found to be associated with histone proteins. More than 100 histone PTMs have been described, and they largely function by recruiting protein factors to chromatin, which in turn drives processes such as transcription, replication, and DNA repair [2]. Likewise, dozens of chromatin-associating factors have been identified that bind to distinct histone PTMs, and hundreds of modification specific histone antibodies have been developed to understand the in vivo function of these modifications [3, 4].
The enormous number of potential combinations of histone PTMs represents a major obstacle to our understanding of how PTMs regulate chromatin-templated processes, as well as to our ability to develop high-quality diagnostic tools for chromatin and epigenetic studies.
The same obstacle applies to other proteins regulated by combinatorial PTMs: for example, p53, RNA polymerase, and nuclear receptors [5-7]. To that end, we developed a peptide array-based platform to begin to address how both histone-interacting proteins and antibodies recognize combinations of PTMs. We focused primarily on the recognition of PTMs associated with the N-terminal tail of histone H3, but this approach is useful for the study of other histone modifications and combinatorial PTMs found on other nonhistone proteins.
We generated a library of 110 synthetic histone peptides bearing either single or combinatorial PTMs and a biotin moiety for immobilization (
We initially used our arrays to ask two fundamental questions regarding the recognition of histone PTMs: (1) How well do modification-directed antibodies recognize their intended epitope? and (2) What impact, if any, do combinatorial PTMs have on antibody recognition? We tested more than 20 commercially available antibodies raised against individual modifications on histone tails (see Tables 3 and 4) for information regarding antibodies and experimental conditions). Generally, we found that antibodies were reasonably proficient at recognizing their target modification (
To explore methyllysine recognition, we tested the specificity of commercial antibodies raised against the three different methylated forms (mono-, di-, and trimethyl) of H3 at lysines 4 and 79 (H3K4me and H3K79me) (
We also tested a number of antibodies raised against acetyllysine found at position 14 of histone H3 (H3K14ac). Unlike lysine methylation, our arrays detected that several of these antibodies had difficulty in recognizing their target sequence, preferring acetylation at lysine 36 (H3K36ac) instead (
The large number of synthetic peptides containing combinatorial PTMs allowed us to additionally ascertain how PTM recognition is influenced by neighboring modifications. We therefore did further analysis of the H3K4me3 antibodies to determine how adjacent modifications affect substrate recognition. We observed that a monoclonal antibody widely used against H3K4me3 (Abeam; catalog number ab1012) is perturbed mainly by modification at histone H3 arginine 2 (H3R2) (
We also examined the well-characterized PTM “switch” region on histone H3, where H3K9 is modified by either acetylation or methylation and where the neighboring serine 10 (H3S10) is a target for phosphorylation [16]. A polyclonal antibody (Active Motif; catalog number 39253) raised against H3S10 phosphorylation showed a statistically significant reduction in binding to peptides also modified at H3K9 (
Collectively, our analysis of histone PTM-specific antibodies enabled us to uncover recognition of related (but off-target) sequences in addition to adjacent PTM effects. This finding is significant because several major ongoing initiatives aimed at mapping and understanding how histone PTMs regulate biology, such as the National Institutes of Health (NIH) Epigenomic Roadmap and ENCODE, heavily rely on modification-specific antibodies [20]. In addition to being a powerful diagnostic tool for the characterization of PTM-derived antibodies, we used our peptide array technology to measure how PTM codes affect the interaction of chromatin-associated proteins. Accordingly, we measured the binding of several domains known to interact with H3K4me3. We found that the PHD domain from the V(D)J recombination factor RAG2 was specific for H3K4me3 and was blocked by phosphorylation at either H3T3 or H3T6 (
We next examined the tandem bromo-PHD domains of BPTF (subunit of the NURF ATP-dependent remodeling complex [24]). Our studies showed that the tandem domain was specific for H3K4me3 and also showed reduced binding in the presence of either H3T3 or H3T6 phosphorylation (
The chromodomain of human CHD1 is also known to recognize H3K4me3 but has a structurally distinct binding pocket from the PHD domains. We found that CHD1, like RAG2 and BPTF, preferentially bindsH3K4me3 and is also negatively influenced by phosphorylation at H3T3 and H3T6 (
In conclusion, the complex patterns of histone PTMs are critical determinants of chromatin structure and function, but they also represent a significant challenge for future study. Although many protein domains that bind selectively to particular PTMs have been identified, little is known regarding how neighboring modifications inhibit or contribute to these interactions. Of equal importance is our understanding of how patterns of PTMs influence antibody recognition. In this case, detection of biologically important events could be blocked or misrepresented if neighboring modifications interfere with epitope recognition. Thus, our work underscores a need for more rigorous testing and characterization of histone-specific antibodies. Similar antibody concerns have been recently highlighted by other groups [20, 28]. The data sets for the antibodies and proteins described here, plus numerous additional antibodies, are available in
Finally, although several other peptide array approaches have been used to measure binding to histone PTMs [8, 29-31], our arrays and assay approaches offer several advantages. First, our array displays a large number of peptides carrying multiple PTMs that are fully characterized by high performance liquid chromatography (HPLC) and mass spectrometry (MS). Second, we take advantage of a biotin tracer molecule to provide an assessment of printing efficiency. Lastly, the high density of spotting allows us to perform statistical analysis of binding interactions. Although Liu et al. recently reported a similarly semiquantitative approach, their arrays were largely limited to peptides containing single PTMs, and the peptides were labeled via their N terminus, which could potentially occlude proteins and antibodies from recognizing modifications such as H3K4 methylation [30]. Furthermore, cellulose SPOT synthesis technology is limited by the inability to analytically characterize peptides [28]. In addition, a very elegant bead-based approach has been used to generate even larger peptide libraries and successfully characterize the binding of several protein factors to combinatorial histone PTMs [23]. However, our approach offers advantages in that we obtain binding data for each individual peptide and do not require sophisticated MS for the analysis.
All primary antibodies tested are commercially available and are listed in Table 3. Secondary antibodies were Alexa Fluor 647 conjugated goat anti-rabbit IgG (catalog number A21244) and Alexa Fluor 647 conjugated rabbit anti-mouse IgG (catalog number A21239) antibodies from Invitrogen.
All reagents were obtained from commercial suppliers (AnaSpec, EMD, and Apptec). The peptides, biotinylated at their C termini, were synthesized on either NovaPEG Rink amide resin (histone H3 peptides) or Biotin-PEG NovaTag resin (histone H2A, H2B, and H4 peptides) using fluorenylmethyloxycarbonyl (Fmoc) chemistry on a PS-3 automated peptide synthesizer (see Table 2 for the complete list of peptides). All standard amino acids were coupled using HATU and N-methylmorpholine in dimethylformamide (DMF). Fmoc deprotection was performed using 20% piperidine in DMF. Modified amino acid residues were coupled using HATU, HOAt, and N,N,-diisopropyletylamine in NMP, and the coupling of these residues was monitored using ninhydrin test and repeated when needed. Peptides were cleaved from the resins using a 2,5% TIS and 2.5% water in trifluoroacetic acid (TFA). After TFA evaporation and washing with diethyl ether, the peptides were lyophilized from an acetonitrile/water solution and purified via preparative HPLC using water-acetonitrile gradient (0.1% TFA in both solvents) on a Waters SymmetryShield RP-18 5 mm 19 3 150 mm column. All peptides were analyzed using matrix-assisted laser desorption/ionization time-of-flight mass spectrometry and analytical HPLC. The average purity of peptides was over 90% (analytical HPLC). Analytical data for all peptides mentioned in this paper is available on our website.
Biotinylated peptides (25 mM final concentration) in printing buffer (10 mg/ml bovine serum albumin [BSA, Amresco], 0.3% Tween-20, and 10 mM biotinconjugated fluorescein added to 13 ArrayIt protein printing buffer) were arrayed onto SuperStreptavidin-coated slides (ArrayIt) using SMP6 stealth pins (˜200 mm spot diameter) and were arrayed onto OmniGrid100 arrayer (Digilab/Genomic Solutions) at ambient temperature and humidity (50%-60%) using the following printing parameters. To minimize effects from individual pins or localized imperfections in the substrate arrays, we arrayed samples as a series of six spots, two times on each slide at a spacing of 375 mm, as indicated in Table 5, and each peptide was printed by two different pins on each slide. After printing, slides were incubated overnight at 4° C. in a humidified environment to facilitate interaction between the biotinylated peptide and the streptavidin surface. Slides were then blocked for 1 hr at 4° C. with biotin-blocking buffer (Arrayft), washed three times with phosphate-buffered saline (PBS), dried with air, stored at 4° C., and used within 60 days.
(V43-V481
Antibody dilutions were made in PBS containing 1% BSA (˜10 mg/ml) and 0.3% Tween-20; the exact concentration for each array is summarized in Table 4. Antibodies were incubated with printed slides for 90-180 min at 4° C. (with the exception of the H3K4me3 monoclonal antibody from Abcam, which was incubated overnight) and washed three times with cold PBS. Arrays were then probed with the appropriate Alexa Fluor 647 conjugated secondary antibody (Invitrogen) for 30-60 min at 4° C., washed three times with cold PBS, and dried. Arrays were then scanned using a Typhoon TR10+ imager (GE Healthcare) at 10 mm resolution using the 526 nm and 670 nm filter sets for the biotin-fluorescein and secondary antibody, respectively. Interactions were quantified using ImageQuant array software (GE Healthcare).
The chromatin-associating domains from mouse RAG2 (PHD 387-493), human BPTF (Bromo and PHD domain 2583-2751), and CHD1 (chromodomain 251-467) were C-terminally fused to GST in pGEX-4T. Proteins were heterologously expressed in E. coli and purified by glutathione sepharose affinity chromatography in PBS buffer (50 mMphosphate, 150 mMNaC1, pH 7.6) on an AKTA purifier fast protein liquid chromatography system (GE Healthcare).
Prior to binding, arrays were blocked in PBS containing 5% BSA (˜50 mg/mL) and 0.3% Tween-20 for 1 hr at 4° C. to reduce nonspecific binding. Glutathione S-transferase (GST)-tagged protein (w25 mM) in the same buffer was overlaid on each array (200 ml total volume) and incubated in a hybridization chamber at 4° C. overnight. Slides were washed three times with cold PBS. Anti-GST primary antibody was incubated with slides for 90-180 min at 4° C. and washed three times with cold PBS. Arrays were then probed with the Alexa Fluor 647 conjugated anti-rabbit secondary antibody (Invitrogen) for 30-60 min at 4° C., washed three times with cold PBS, and dried. Arrays were then scanned using a Typhoon TR10+ imager (GE Healthcare) at 10 mm resolution using the 526 nm and 670 nm filter sets for the biotin-fluorescein and secondary antibody, respectively. Interactions were quantified using ImageQuant array software (GE Healthcare).
Briefly, printing of individual spots was evaluated based on the intensity of the fluorescein-biotin cospotted with each peptide. Spots with control intensities of less than 5% of the average intensity for all peptides were labeled as “not spotted” and omitted from subsequent analysis. Data were treated as four individual subarrays to account for small changes in intensity across the slide, each subarray containing all 110 peptides spotted six times. Alexa Fluor 647 intensities (corresponding to a positive interaction) were normalized for all spots by dividing the intensity by the sum of all intensities within a subarray. The six spots for each peptide were averaged (outliers were removed using a Grubbs test) and treated as a single value for a given subarray. The normalized intensities for the four subarrays were used to calculate the mean, and the error is reported as the standard error of the mean. For data displayed as heat maps, mean values were normalized to either the highest calculated value across all peptides or against the peptide for which a given antibody was supposed to interact. Heat maps were created using Java Treeview, and all data were plotted on a scale from 0 to 1 (
A microarray platform developed for histone peptides was used to compare the binding properties of human UHRF1 (ubiquitin-like, PHD and RING finger containing 1) tandem Tudor domain (TTD) with other known H3K9 methyl effector proteins, including the chromodomains of the three HP1 isoforms (α, β, γ), the MPP8 chromodomain, and the GLP ankyrin repeats. These peptide microarrays contain a library of 130 unmodified and modified histone peptides representing known single and combinatorial post-translational modifications (PTMs) on the four core histones (H2A, H2B, H3, and H4), including lysine and arginine methylation, lysine acetylation, and serine and threonine phosphorylation (the peptides included peptides listed in Tables 1 and 2). Arrays were spotted 24 times with each histone peptide as described in Rothbart, S. B., et al., Methods in Enzymology 512, 107-135 (2012) and probed with the histidine-tagged (His-tagged) UHRF1 or glutathione S-transferase (GST-tagged) HP1, MPP8, or GLP protein domains. Array analysis revealed that these effector proteins preferentially bound to H3K9 methylated peptides (
Analyzing the influence of neighboring PTMs on the binding of these effector proteins to H3K9 methylated peptides revealed little influence of lysine acetylation, H3K4me3, or H3R8 methylation (mono-, symmetric, or asymmetric di-methylation), with the exception of the HP1γ chromodomain, whose binding to H3K9me2 and H3K9me3 was partially perturbed by H3R8me2a. In contrast, H3T6p perturbed the binding to H3K9me3 by all tested effector proteins (
Materials.
Histone peptides were synthesized, purified, and analyzed as described in Hashimoto, H. et al., Nature 455, 826-9 (2008). Antibodies used in this study: anti-GST (Sigma G7781; 1:1,000), anti-HIS (Santa Cruz sc-8036; 1:200), anti-myc (Millipore 05-419; 1:2,500), anti-Flag (Sigma F1804; 1:5000), anti-streptavidin HRP (Cell Signaling 3999; 1:10,000), anti-UHRF1 (Abeam ab57083; 1:1,000), anti-HP1γ (Cell Signaling 2619; 1:1,000), anti-β-tubulin (Cell Signaling 2146; 1:1,000), anti-H3 (Active Motif 39163; 1:20,000), anti-H3K9me3 (Active Motif 39765; 1:5,000), anti-H3K9me2/S10p (Millipore 05-1354; 1:1,000), anti-H3K9me3/S10p (Millipore 04-809; 1:10,000), anti-H3S10p (Active Motif 39253; 1:5,000), anti-5mC (Diagenode Mab-081; 1:100), anti-cyclin A (Santa Cruz sc-751; 1:2000), anti-cyclin E (Santa Cruz sc-247; 1:1,000). The UHRF1 TTD (human cDNA encoding residues 126-280) was cloned into pET28a-LIC (GenBank accession EF442785) as an N-terminal HIS fusion, expressed in Escherichia coli BL21(DE3) using standard procedures, and purified with Talon resin (ClonTech) according to the manufacturer's protocol. HP 1α (mouse full-length cDNA), HP 1β (mouse full-length cDNA), HP1γ chromodomain (mouse cDNA encoding residues 11-129), and MPP8 chromodomain (human cDNA encoding residues 50-118) were cloned into pGEX-KG (GE Life Sciences). GLP ankyrin repeats (human cDNA encoding residues 734-968) were cloned into pGEX-6P1 (GE Life Sciences). GST fusion proteins were expressed in Escherichia coli BL21(DE3) using standard procedures and purified with GST-bind resin (Novagen) according to the manufacturer's protocol. Full length human UHRF1 was cloned into pCMV-Tag 2 (Agilent) as an N-terminal Flag fusion for mammalian expression. Full length human DNMT1 (a gift from Zhenghe Wang; Case Western) was cloned into pCMV-3Tag (Agilent) as an N-terminal myc fusion for mammalian expression. Point mutations were generated by QuickChange site-directed mutagenesis (Stratagene).
Cell Culture and Manipulation.
HeLa cells (ATCC) were cultured in Minimal Essential Medium (Invitrogen) supplemented with 10% fetal bovine serum (PAA), maintained in a 37° C. incubator with 5% CO2, and passaged every 2-3 days. E14 and NP95−/− mouse ES cells (a gift from Haruhiko Koseki, RIKEN) were cultured on 0.1% gelatin (Sigma) in Glasgow's Minimal Essential Medium (Invitrogen) supplemented with 15% ES-fetal bovine serum (PAA), 50 units/mL Leukemia Inhibitory Factor (Millipore), 2 mM L-glutamine (Invitrogen), 0.1 mM non-essential amino acids (Invitrogen), 55 μM beta-mercaptoethanol (Invitrogen), 1 mM sodium pyruvate (Invitrogen), 1× penicillin-streptomycin solution P (Invitrogen), maintained in a 37° C. incubator with 5% CO2, and passaged every 2 days. HeLa cells were synchronized in mitosis with 0.05 μg mL−1 nocodazole for 16 hours. For double thymidine block, HeLa cells were synchronized by treatment with 2 mM thymidine (Sigma) for 16 hours, followed by release for 8 hours, and re-treatment with 2 mM thymidine for 16 hours. Transient transfections were performed using TurboFect (Fermentas) according to the manufacturer's protocol. shRNAs obtained from The RNAi Consortium (TRC) were used following standard TRC Lentivirus production and infection protocols. The indicted concentrations of MG132 (Cayman Chemicals) or 0.05 μg mL−1 nocodazole (Sigma) in DMSO were added during the last 16 hours prior to harvest.
Histone Peptide Microarrays.
Array fabrication and effector protein analysis was performed as described in Rothbart, S. B., et al., Methods in Enzymology 512, 107-135 (2012) and Fuchs, S. M., Krajewski, et al., Current biology 21, 53-58 (2011). Heat maps were generated using Java TreeView.
In-Solution Peptide Pulldowns.
A 50 μL slurry of streptavidin magnetic beads (NEB) was equilibrated in binding buffer containing 50 mM Tris-HCl, pH 8.0, 300 mM NaCl, and 0.1% NP-40 before being saturated with 1 nmole biotinylated peptide for 1 hour at 4° C. with rotation. Unbound peptide was washed with binding buffer, and 100 pmoles of protein in binding buffer supplemented with 0.5% bovine serum albumin (BSA) (w/v) was incubated for 3 hours at 4° C. with rotation. Unbound protein was washed with binding buffer, and bound protein and peptide were eluted from beads by boiling in 1×SDS loading buffer followed by western blot detection. Proteins were detected with anti-HIS (UHRF1) and anti-GST (MPP8) and peptides were detected with anti-streptavidin-HRP.
Fluorescence Polarization.
Peptides for fluorescence polarization (histone H3, residues 1-20) were synthesized as described in Rothbart, S. B., et al., Methods in Enzymology 512, 107-135 (2012) with the addition of 5-carboxyfluorescin (5-FAM) at the N-terminus. Binding assays were performed in 40 μL volume in black flat-bottom 384-well plates (Costar). Protein was titrated with 50 nM peptide in buffer containing 20 mM Tris-HCl, pH 8.0, 250 mM NaCl, 1 mM DTT, and 0.05% NP-40. Following a 20 minute equilibration period at 25° C., plates were read on a POLARstar Omega (BMG Labtech) using a 480 nm excitation filter and 520/530±10 nm emissions filters. Gain settings in the parallel (∥) and perpendicular (⊥) channels were calibrated to a polarization measurement of 100 milli-polarization units (mP) for the fluorescent peptide in the absence of protein. Polarization (P) was determined from raw intensity values of the parallel and perpendicular channels using the equation P═∥−⊥∥+2(⊥) and converted to anisotropy (A) units using the equation A=2P/3−P. Equilibruim dissociation constants (Kd) were determined by fitting anisotropy curves to a one-site binding model using GraphPad Prism 5.0.
The foregoing is illustrative of the present invention, and is not to be construed as limiting thereof. The invention is defined by the following claims, with equivalents of the claims to be included therein. All publications, patent applications, patents, patent publications, sequences identified by GenBank and/or SNP accession numbers, and other references cited herein are incorporated by reference in their entireties for the teachings relevant to the sentence and/or paragraph in which the reference is presented.
This application claims the benefit of U.S. Provisional Patent Application Ser. No. 61/576,542, filed Dec. 16, 2011, the disclosure of which is incorporated herein by reference in its entirety
This invention was made with government support under National Institutes of Health (NIH) Grant No. GM085394-01. The United States government has certain rights to this invention.
Number | Date | Country | |
---|---|---|---|
61576542 | Dec 2011 | US |