METHOD AND COMPOSITION FOR PEPTIDE CYCLIZATION AND PROTEASE TREATMENT

SEQUENCE LISTING

The instant application contains a Sequence Listing which has been submitted electronically in ASCII format and is hereby incorporated by reference in its entirety. Said ASCII copy, created on Apr. 21, 2016, is named 5727-248051_SL.txt and is 70,660 bytes in size.

TECHNICAL FIELD

This invention relates to peptide microarrays, methods of generating peptide microarrays, and methods of identifying peptide binders using microarrays. More specifically, this invention relates to peptide microarrays, methods of generating peptide microarrays, and methods of identifying peptide binders using microarrays wherein the microarrays comprise cyclic peptides. In some aspects, the invention relates to methods of increasing the proportion of cyclized peptides on a microarray relative to linear peptides by treating the peptides on the microarray with a protease. In additional aspects, the invention relates to methods of generating linear and cyclic peptides subarrays on a microarray.

BACKGROUND

Understanding protein-protein interactions is important for basic research as well as various biomedical and other practical applications. Examples of this kind include binding between peptide fragments or epitopes and antibodies, the interaction between proteins and short fragments of other proteins, as well as binding between peptides referred to as aptamers to their target molecules. Development of simple and reliable methods for identifying peptide binders for proteins would help in understanding the mechanisms of protein-protein interaction and open new opportunities for drug discovery.

With the identification of cellular pathways and targets that play key roles in metabolism and disease progression, the understanding of disease states continues to expand exponentially. Although our understanding of diseases is advanced, our ability to treat them lags behind due to the limitations inherent in existing drug platforms. At present, the available drug platforms are based primarily on small molecules and therapeutic proteins, which address only about 10 to 20 percent of the identified therapeutic targets for treatment of diseases.

Peptides combine the high specificity of biological drugs with the bioavailability of small molecules, and, thus, offer exciting opportunities to address difficult targets for disease treatment. In fact, peptides have proven to be effective when used to target extracellular receptors, but limitations include the inherent instability of peptides within the body and rapid breakdown by circulating proteases. The concept of using peptides to modulate intracellular processes has been investigated for decades, yet these strategies have largely failed because peptides lack the ability to enter cells.

Cyclic peptides, with their conformational rigidity, exhibit superior properties relative to their linear peptide counterparts, such as improved target affinity and specificity. Their higher target specificity and affinity as well as resistance to proteolysis have made them attractive candidates for drug discovery. Cyclic peptides have been isolated from large combinatorial libraries using library screening tools, such as phage display and mRNA display, but improved methods are needed to screen large numbers of cyclic peptides, to mature cyclic peptides in situ, and to identify cyclic peptides of interest. Currently, there is no systematic approach to identifying and maturing cyclic peptides to obtain optimized cyclic peptide binders.

Another powerful method to study peptide-protein interactions is the use of peptide microarrays. Peptide microarrays can be made with peptides synthesized using solid phase peptide synthesis and then immobilized on a solid support or can be directly prepared by in situ synthesis methods. Although peptide microarrays are commercially available, their application is limited by a relatively low density of peptides and high cost of manufacturing. Both of these issues can be addressed by use of maskless light-directed technology, see (Pellois, Zhou et al. (2002) Individually addressable parallel peptide synthesis on microchips.) and U.S. Pat. No. 6,375,903.

Using an instrument for maskless light-directed microarray synthesis, the selection of peptide sequences to be constructed on the peptide microarray is under software control such that it is now possible to create individually-customized arrays based on the particular needs of an investigator. In general, maskless light-directed microarray synthesis technology allows for the parallel synthesis of millions of unique peptide features in a very small area of a standard microscope slide. The peptide microarrays are generally synthesized by using light to direct which peptides are synthesized at specific locations on the microarray.

There exists an unmet need for a more efficient and successful method of identifying therapeutic cyclic peptide candidates for existing and potential new drug targets, in part, because many targets and diseases are presently “undruggable” using existing therapeutic modalities.

SUMMARY

Applicants disclose herein novel peptide microarrays, methods of generating peptide microarrays, and methods of identifying peptide binders using the microarrays described herein, wherein the microarrays comprise cyclic peptides. In one embodiment described herein, the cyclic peptides can be used as therapeutic peptides. Also disclosed herein are methods of increasing the proportion of cyclized peptides on a microarray relative to linear peptides by treating the peptides on the microarray with a protease. Further disclosed herein are methods of generating linear and cyclic peptides subarrays on the same microarray.

Several embodiments of the invention are described by the following enumerated clauses:

1. A peptide microarray comprising at least one cyclic peptide of formula I

embedded image

wherein each R¹, R², R³and R⁴is independently a natural amino acid side chain or a non-natural amino acid side chain;

each R⁵and R⁶is independently hydrogen or an N-terminal capping group;

each R⁷is independently —OH or a C-terminal capping group;

Q is selected from the group consisting of a carbonyl, a natural amino acid side chain, and a non-natural amino acid side chain;

each X and Y is independently selected from the group consisting of a bond, a natural amino acid side chain covalently attached to Z, and a non-natural amino acid side chain covalently attached to Z;

Z is a group comprising a moiety selected from the group consisting of an amide bond, a disulfide bond, an isopeptide bond, a 1,2,3-triazole, and an optionally substituted 1,2-quinone;

L′ and L″ are each independently an optional bivalent linking group or a bond;

m is an integer from 0 to 6;

n is an integer from 0 to 6; p is an integer from 0 to 100;

q is 0 or 1;

r is 0 or 1;

t is an integer from 0 to 100;

u is 0 or 1;

and * is a point of connection connecting the at least one cyclic peptide to a solid support having a reactive surface,

wherein the at least one cyclic peptide is immobilized to the reactive surface, and wherein the at least one cyclic peptide is part of a population of peptides immobilized to the reactive surface wherein the population of peptides comprises independently selected amino acid sequences of interest.

2. The peptide microarray of clause 1, wherein Z comprises a moiety selected from the group consisting of an amide bond,

embedded image

wherein v is an integer from 0 to 6, w is an integer from 0 to 6, and y is an integer from 0 to 6, and ** is a point of connection to the rest of the cyclic peptide.

3. The peptide microarray of clause 1 or 2, wherein Z comprises a peptide bond, Q is a carbonyl, q is 0, r is 1, and u is 0.

4. The peptide microarray of clause 1 or 2, wherein each Q and X is a cysteine side chain, Z is a disulfide bond, q is 1, r is 1, t is 0, and u is 0.

5. The peptide microarray of clause 1 or 2, wherein X and Y are bonds to Z, Z comprises **—S—S—**, q is 1, and u is 1.

6. The peptide microarray of clause 1 or 2, wherein Z comprises

embedded image

and v is 1.

7. The peptide microarray of clause 1 or 2, wherein Z comprises

embedded image

and w is 1.

8. The peptide microarray of clause 1 or 2, wherein Z comprises

embedded image

r is 0, t is 0, u is 0, and y is 1.

9. The peptide microarray of clause 1 or 2, wherein Y is a bond to Z, Z comprises

embedded image

u is 1, and y is 1.

10. The peptide microarray of clause 1 or 2, wherein Y is a bond to Z, Z comprises

embedded image

q is 0, and u is 1.

11. The peptide microarray of clause 1 or 2, wherein X is a bond to Z, Z comprises

embedded image

q is 1, r is 0, t is 0, and u is O.

12. The peptide microarray of clause 1 or 2, wherein X and Y are bonds to Z, Z comprises

embedded image

q is 1, and u is 1.

13. The peptide microarray of any one of clauses 1 to 12, wherein each L′ and L″ is independently of the formula II

embedded image

wherein each R⁸and R^8′ is independently selected from the group consisting of H, D, halogen, C₁-C₆alkyl, C₂-C₆alkenyl, C_2-C₆alkynyl, C_3-C₆cycloalkyl, 3- to 7-membered heterocycloalkyl, C₆-C₁₀aryl, 5- to 7-membered heteroaryl, —OR⁹, —OC(O)R⁹, —NR⁹R^9′, —NR⁹C(O)R¹⁰, —C(O)R⁹, —C(O)OR⁹, and —C(O)NR⁹R^9′, wherein each hydrogen atom in C₁-C₆alkyl, C₂-C₆alkenyl, C_2-C₆alkynyl, C_3-C₆cycloalkyl, 3- to 7-membered heterocycloalkyl, C₆-C₁₀aryl and 5- to 7-membered heteroaryl is independently optionally substituted by halogen, C₁-C₆alkyl, C₂-C₆alkenyl, C_2-C₆alkynyl, —OR¹¹; each R⁹, R^9′, R¹⁰, and R¹¹is independently selected from the group consisting of H, D, hydroxyl, C₁-C₇alkyl, C₂-C₇alkenyl, C_2-C₇alkynyl, C_3—C₆cycloalkyl, 3- to 7-membered heterocycloalkyl, C₆-C₁₀aryl and 5- to 7-membered heteroaryl; and a is an integer from 1 to 10; or the formula III or IV

embedded image

wherein b is an integer from 0 to 30.

14. The peptide microarray of clause 13, wherein each R⁸and R^8′ is hydrogen.

15. The peptide microarray of clause 13 or 14, wherein each L′ and L″ is independently

embedded image

16. The peptide microarray of any one of clauses 1 to 15, wherein m is 0.

17. The peptide microarray of any one of clauses 1 to 16, wherein n is 0.

18. The peptide microarray of any one of clauses 1 to 12 or 16 to 17, wherein each R⁸and R^8′ is hydrogen, m is 0, n is 0, a is 5, L′ is present, and L″ is absent.

19. The peptide microarray of any one of clauses 1 to 13 or 16 to 18, wherein L′ is 6-aminohexanoic acid.

20. The peptide microarray of any one of clauses 1 to 14, 16 to 17, or 19, wherein L″ is CH₂CH₂.

21. The peptide microarray of any one of clauses 1 to 20, wherein t is 0, and p is an integer 1 to 100.

22. The peptide microarray of any one of clauses 1 to 21, wherein p is an integer 1 to 20.

23. The peptide microarray of any one of clauses 1 to 22, wherein the solid support is selected from a group of materials consisting of plastic, glass, and carbon composite.

24. The peptide microarray of any one of clauses 1 to 23, wherein the reactive surface comprises an activated amine.

25. The peptide microarray of any one of clauses 1 to 24, wherein the amino acid sequences of interest of the population of peptides comprise the same number of amino acids.

26. The peptide microarray of any one of clauses 1 to 25, wherein the amino acid sequences of interest of the population of peptides comprise five amino acids.

27. The peptide microarray of any one of clauses 1 to 26, wherein the amino acid sequences of interest of the population of peptides do not contain any of a methionine amino acid, a cysteine amino acid, an amino acid repeat of the same amino acid, or an amino acid motif consisting of a histidine (H)- proline (P)- glutamine (Q) sequence.

28. The peptide microarray of any one of clauses 1 to 27, wherein each cyclic peptide of the population of peptides further comprises at least one of an N-terminal wobble synthesis oligopeptide or a C-terminal wobble synthesis oligopeptide.

29. The peptide microarray of clause 28, wherein the wobble synthesis oligopeptide of each cyclic peptide of the population of peptides comprises an amino acid sequence having the same number of amino acids.

30. The peptide microarray of clause 28 or 29, wherein the wobble synthesis oligopeptide of each cyclic peptide of the population of peptides is derived randomly from an amino acid mixture having each of the twenty amino acids or a subset of the twenty amino acids in approximately equal concentrations.

31. The peptide microarray of clause 28 or 29, wherein the wobble synthesis oligopeptide of each cyclic peptide of the population of peptides is derived randomly from an amino acid mixture having amino acids glycine (G) and serine (S) in approximately a 3 (G) to 1 (S) concentration.

32. The peptide microarray of any one of clauses 28 to 31, wherein there is a C-terminal and an N-terminal wobble synthesis oligopeptide and both the C-terminal and N-terminal wobble synthesis oligopeptides comprise the same number of five or more amino acids.

33. A method of generating a peptide microarray comprising at least one cyclic peptide of formula I

embedded image

wherein each R¹, R², R³and R⁴is independently a natural amino acid side chain or a non-natural amino acid side chain;

each R⁵and R⁶is independently hydrogen or an N-terminal capping group;

each R⁷is independently —OH or a C-terminal capping group;

Q is selected from the group consisting of a carbonyl, a natural amino acid side chain, and a non-natural amino acid side chain;

each X and Y is independently selected from the group consisting of a bond, a natural amino acid side chain covalently attached to Z, and a non-natural amino acid side chain covalently attached to Z;

Z is a group comprising a moiety selected from the group consisting of an amide bond, a disulfide bond, an isopeptide bond, a 1,2,3-triazole, and an optionally substituted 1,2-quinone;

L′ and L″ are each independently an optional bivalent linking group or a bond;

m is an integer from 0 to 6;

n is an integer from 0 to 6;

p is an integer from 0 to 100;

q is 0 or 1;

r is 0 or 1;

t is an integer from 0 to 100;

u is 0 or 1; and * is a point of connection connecting the at least one cyclic peptide to a solid support having a reactive surface;

the method comprising the step of reacting a functionalized peptide of formula II under conditions that cause Z to form

embedded image

wherein R¹, R²R³, R⁴, R⁵, R⁶, Q, L′, L″, m, n, p, q, r, t, u, and * are as defined for formula I;

each R⁷is independently selected from the group consisting of —OH, a C-terminal capping group, and

embedded image

each R⁸is independently a natural amino acid side chain or a non-natural amino acid side chain;

each R⁹is independently —OH or a C-terminal capping group;

each X′ is independently selected from the group consisting of a bond, a natural amino acid side chain covalently attached to Z″, and a non-natural amino acid side chain covalently attached to Z″;

each Y′ is independently selected from the group consisting of a bond, a natural amino acid side chain covalently attached to Z′, and a non-natural amino acid side chain covalently attached to Z′;

Z′ and Z″ are each independently selected from the group consisting of a bond, —OH, hydrogen, a thiol, an amine, a carboxylic acid, an amide, an alkyne, an azide, an optionally substituted aminophenol, a natural amino acid side chain, a non-natural amino acid side chain, an N-terminal protecting group, and a C-terminal protecting group, provided that Z′ and Z″ are complementary groups that combine to form Z;

b is an integer from 0 to 50;

and *** is a point of connection to the rest of the functionalized peptide;

34. The method of clause 33, wherein Z comprises a moiety selected from the group consisting of an amide bond,

embedded image

wherein v is an integer from 0 to 6, w is an integer from 0 to 6, and y is an integer from 0 to 6, and ** is a point of connection to the rest of the cyclic peptide.

35. The method of clause 33 or 34, wherein Z comprises a peptide bond, Z′ comprises a C-terminal protecting group or Z″ comprises an N-terminal protecting group, Q is a carbonyl, q is 0, r is 1 and u is 0.

36. The method of clause 35, further comprising removing Z′ or Z″ from the rest of the functionalized peptide to cause the peptide bond to form.

37. The method of clause 33 or 34, wherein X and Y are bonds to Z, Z comprises **—S—S—**, X′ is a bond to Z″, Y′ is a bond to Z′, Z′ and Z″ comprise cysteine side chains, q is 1, and u is 1.

38. The method of clause 37, further comprising subjecting the functionalized peptide to oxidative conditions to cause **—S—S—** to form.

39. The method of clause 33 or 34, wherein Y is a bond to Z, Z comprises

embedded image

Y′ is a bond to Z′, Z′ comprises

embedded image

Z″ comprises an azide, u is 1, and v is 1.

40. The method of clause 39, further comprising contacting the functionalized peptide with a copper catalyst to cause

embedded image

to form.

41. The method of clause 33 or 34, wherein Y is a bond to Z, Z comprises

embedded image

Y′ is a bond to Z′, Z′ comprises

embedded image

Z″ comprises an azide, u is 1, and v is 1.

42. The method of clause 41, further comprising contacting the functionalized peptide with a copper catalyst to cause

embedded image

to form.

43. The method of clause 33 or 34, wherein Z comprises

embedded image

Y′ is a bond to Z′, Z′ comprises

embedded image

r is 0, u is 1, and y is 1.

44. The method of clause 43, further comprising contacting the functionalized peptide with a potassium ferricyanide to cause

embedded image

to form.

45. The method of clause 33 or 34, wherein R³and R⁸are defined such that the functionalized peptide comprises a butelase 1 recognition sequence, Y is a bond to Z, Z comprises

embedded image

Y′ is a bond to Z′, Z′ is an asparagine or aspartic acid side chain, q is 0, and u is 1.

46. The method of clause 45, further comprising contacting the functionalized peptide with butelase 1 to cause

embedded image

to form.

47. The method of clause 33 or 34, wherein X and Y are bonds to Z, Z comprises

embedded image

X′ is a bond to Z″, Y′ is a bond to Z′, Z′ is a glutamine side chain and Z″ is a lysine side chain or Z′ is a lysine side chain and Z″ is a glutamine side chain, q is 1, and u is 1.

48. The method of clause 47, further comprising contacting the functionalized peptide with a microbial transglutaminase to cause

embedded image

to form.

49. The method of any one of clauses 33 to 48, wherein L′ is 6-aminohexanoic acid.

50. The method of any one of clauses 33 to 49, wherein L″ is CH₂CH₂.

51. The method of any one of clauses 33 to 50, wherein m is 0.

52. The method of any one of clauses 33 to 51, wherein n is 0.

53. The method of any one of clauses 33 to 52, wherein t is 0, and p is an integer 1 to 100.

54. The method of any one of clauses 33 to 53, wherein p is an integer 1 to 20.

55. The method of clause 33 or 34, wherein Q is a carbonyl, Z is an amide bond, r is 1, u is 0, and q is 0.

56. The method of any one of clauses 33 to 55, wherein the solid support is selected from a group of materials consisting of plastic, glass and carbon composite.

57. The method of any one of clauses 33 to 56, wherein the reactive surface comprises an activated amine.

58. The method of any one of clauses 33 to 57, wherein the amino acid sequences of interest of the population of peptides comprise the same number of amino acids.

59. The method of any one of clauses 33 to 58, wherein the amino acid sequences of interest of the population of peptides comprise five amino acids.

60. The method of any one of clauses 33 to 59, wherein the amino acid sequences of interest of the population of peptides do not contain any of a methionine amino acid, a cysteine amino acid, an amino acid repeat of the same amino acid, or an amino acid motif consisting of a histidine (H)- proline (P)- glutamine (Q) sequence.

61. The method of any one of clauses 33 to 60, wherein each cyclic peptide of the population of peptides further comprises at least one of an N-terminal wobble synthesis oligopeptide or a C-terminal wobble synthesis oligopeptide.

62. The method of clause 61, wherein the wobble synthesis oligopeptide of each cyclic peptide of the population of peptides comprises an amino acid sequence having the same number of amino acids.

63. The method of clause 61 or 62, wherein the wobble synthesis oligopeptide of each cyclic peptide of the population of peptides is derived randomly from an amino acid mixture having each of the twenty amino acids or a subset of the twenty amino acids in approximately equal concentrations.

64. The method of clause 61 or 62, wherein the wobble synthesis oligopeptide of each cyclic peptide of the population of peptides is derived randomly from an amino acid mixture having amino acids glycine (G) and serine (S) in approximately a 3 (G) to 1 (S) concentration.

65. The method of any one of clauses 61 to 64, wherein there is a C-terminal and an N-terminal wobble synthesis oligopeptide and both the C-terminal and N-terminal wobble synthesis oligopeptides comprise the same number of five or more amino acids.

66. A method of preparing a peptide microarray comprising:

generating at least one first linear peptide subarray comprising a first plurality of linear peptides covalently attached to a microarray surface;

generating at least one second linear peptide subarray comprising a second plurality of linear peptides covalently attached to the microarray surface, wherein the second plurality of linear peptides has an amino acid sequence that is identical to the first plurality of linear peptides; and

treating the peptide microarray under conditions to cyclize the first plurality of linear peptides to provide at least one cyclized peptide subarray comprising a plurality of cyclized peptides, wherein the second plurality of linear peptides substantially does not cyclize.

67. The method of clause 66, wherein the first plurality of linear peptides is a first plurality of protected linear peptides, wherein the C-terminus of the first plurality of protected linear peptides is protected by a first protecting group; and

the second plurality of linear peptides is a second plurality of protected linear peptides, wherein the second plurality of protected linear peptides has an amino acid sequence that is identical to the first plurality of protected linear peptides, and wherein the C-terminus of the second plurality of protected linear peptides is protected by a second protecting group that is different from the first protecting group.

68. The method of clause 67, further comprising contacting the peptide microarray with a first deprotection reagent to selectively remove the first protecting group to provide at least one first deprotected linear peptide subarray comprising a first plurality of deprotected linear peptides; and

contacting the peptide microarray with a second deprotection reagent to remove the second protecting group to provide at least one second deprotected linear peptide subarray comprising a second plurality of deprotected linear peptides.

69. The method of any one of clauses 66-68, wherein the first plurality of linear peptides and the second plurality of linear peptides are each covalently attached to the microarray surface through an amino acid side chain.

70. The method of clause 69, wherein the amino acid side chain is a carboxylic acid side chain.

71. The method of clause 70, wherein the carboxylic acid side chain is a glutamate or aspartate side chain.

72. The method of any one of clauses 69 to 71, wherein the amino acid side chain is part of the C-terminal amino acid.

73. The method of any one of clauses 66 to 72, wherein at least one molecule of the first plurality of linear peptides fails to cyclize.

74. The method of clause 73, wherein the at least one of the first plurality of linear peptides that fails to cyclize is not removed from the first deprotected linear peptide subarray.

75. The method of any one of clauses 67 to 74, wherein the first protecting group is OAll.

76. The method of any one of clauses 67 to 75, wherein the first deprotection reagent is a palladium catalyst.

77. The method of clause 76, wherein the palladium catalyst is tetrakis(triphenylphosphine)palladium(O).

78. The method of any one of clauses 67 to 77, wherein the second protecting group is OtBu.

79. The method of any one of clauses 67 to 78, wherein the second deprotection reagent is an acid.

80. The method of clause 79, wherein the acid is trifluoroacetic acid. 81. The method of any one of clauses 66 to 80, wherein treating the peptide microarray under conditions to cyclize the first plurality of linear peptides comprises activating the carboxyl group of the C-terminus of the first plurality of linear peptides to react with the amino group of the N-terminus of the first plurality of linear peptides to form an amide bond.

82. The method of any one of clauses 66 to 81, wherein treating the peptide microarray under conditions to cyclize the first plurality of linear peptides comprises contacting the first plurality of linear peptides with HOBt and HBTU.

83. A method of identifying an active cyclic peptide comprising generating a peptide microarray according to the method of any one of clauses 66 to 82, contacting the peptide microarray with a potential binding group, and measuring the presence of the potential binding group on the peptide microarray after the contacting step.

84. The method of clause 83, wherein the measuring step comprises measuring fluorescent activity.

85. A method of generating a peptide microarray comprising at least one cyclic peptide of formula III

embedded image

wherein each R¹, R², R³and R⁴is independently a natural amino acid side chain or a non-natural amino acid side chain;

Q is a carbonyl;

L′ and L″ are each independently an optional bivalent linking group or a bond;

m is an integer from 0 to 6;

n is an integer from 0 to 6;

p is an integer from 0 to 100;

t is an integer from 0 to 100; and

* is a point of connection connecting the at least one cyclic peptide to a solid support having a reactive surface;

the method comprising generating a plurality of first peptides on a cyclic peptide subarray, wherein the first peptide is of formula IV

embedded image

wherein R¹, R², R³, R⁴, Q, L′, L″, m, n, p, t, and * are as defined for formula III;

Z¹is a first carboxyl protecting group; and

Z²is hydrogen;

generating a plurality of second peptides on a linear peptide subarray, wherein the second peptide is of formula V

embedded image

wherein R¹, R², R³, R⁴, Q, L′, L″, Z², m, n, p, t, and * are as defined for formula IV; and

Z³is a second carboxyl protecting group different from the first carboxyl protecting group; and

is hydrogen; and

treating the first peptides to form a first plurality of linear deprotected peptides, wherein the linear deprotected peptide is of formula VI

embedded image

wherein R¹, R², R³, R⁴, Q, L′, L″, Z², m, n, p, t, and * are as defined for formula IV; and

Z¹is —OH; followed by

treating the linear deprotected peptides to form the cyclic peptide; followed by

treating the second peptides to form a second plurality of linear deprotected peptides of formula VI;

wherein the first peptides and the second peptides are immobilized to the reactive surface, and wherein the at least one cyclic peptide is part of a population of peptides immobilized to the reactive surface wherein the population of peptides comprises independently selected amino acid sequences of interest.

86. The method of clause 85, wherein L′ is 6-aminohexanoic acid.

87. The method of clause 85 or 86, wherein L″ is CH₂CH₂.

88. The method of any one of clauses 85 to 87, wherein m is 0.

89. The method of any one of clauses 85 to 88, wherein n is 0.

90. The method of any one of clauses 85 to 89, wherein t is 0, and p is an integer 1 to 100.

91. The method of any one of clauses 85 to 90, wherein p is an integer 1 to 20.

92. The method of any one of clauses 85 to 91, wherein at least one molecule of the linear deprotected peptides on the cyclic peptide subarray fails to cyclize.

93. The method of any one of clauses 85 to 92, wherein the linear deprotected peptides on the cyclic peptide subarray are not removed from the cyclic peptide subarray.

94. The method of any one of clauses 85 to 93, wherein the first carboxyl protecting group is OAll.

95. The method of any one of clauses 85 to 94, wherein treating the first peptides to form the first plurality of linear deprotected peptides comprises contacting the first peptides with palladium.

96. The method of any one of clauses 85 to 95, wherein the second carboxyl protecting group is OtBu.

97. The method of any one of clauses 85 to 96, wherein treating the second peptides to form the second plurality of linear deprotected peptides comprises contacting the second peptides with an acid.

98. The method of clause 97, wherein the acid is trifluoroacetic acid.

99. The method of any one of clauses 85 to 98, wherein treating the first peptides to form the cyclic peptide comprises activating a carboxyl group of the first peptide to react with a free amino group of the first peptide to form Z.

100. The method of any one of clauses 85 to 99, wherein treating the first peptides to form the cyclic peptide comprises contacting the first peptides with HOBt and HBTU.

101. A method of identifying an active cyclic peptide comprising generating a peptide microarray according to the method of any one of clauses 85 to 100, contacting the peptide microarray with a potential binding group, and measuring the presence of the potential binding group on the peptide microarray after the contacting step.

102. The method of clause 101, wherein the measuring step comprises measuring fluorescent activity.

103. A method of identifying a peptide binder comprising the steps of:

a. exposing a target of interest to a peptide microarray comprising a first population of peptide binders comprising a cyclic peptide of formula I

embedded image

wherein each R¹, R², R³and R⁴is independently a natural amino acid side chain or a non-natural amino acid side chain;

each R⁵and R⁶is independently hydrogen or an N-terminal capping group;

each R⁷is independently —OH or a C-terminal capping group;

Q is selected from the group consisting of a carbonyl, a natural amino acid side chain, and a non-natural amino acid side chain;

each X and Y is independently selected from the group consisting of a bond, a natural amino acid side chain covalently attached to Z, and a non-natural amino acid side chain covalently attached to Z;

Z is a group comprising a moiety selected from the group consisting of an amide bond, a disulfide bond, an isopeptide bond, a 1,2,3-triazole, and an optionally substituted 1,2-quinone;

L′ and L″ are each independently an optional bivalent linking group or a bond;

m is an integer from 0 to 6;

n is an integer from 0 to 6;

p is an integer from 0 to 100;

q is 0 or 1;

r is 0 or 1;

t is an integer from 0 to 100;

u is 0 or 1; and

* is a covalent bond immobilizing the cyclic peptide on a first solid support having a first reactive surface, whereby the target of interest binds to the cyclic peptide;

b. identifying overlap in peptide binder sequences of the first population of peptide binders which bind the target of interest, whereby a core binder sequence is determined;

c. performing at least one alteration selected from a single amino acid substitution, a double amino acid substitution, an amino acid deletion, and an amino acid insertion of amino acids to the core binder sequence, whereby a second population of core binder sequences is generated;

d. exposing the second population of core binder sequences to the target of interest, whereby the target of interest binds to at least one peptide sequence of the second population of core binder sequences and wherein the second population of core binder sequences comprises the cyclic peptide of formula I

embedded image

wherein each R¹, R², R³and R⁴is independently a natural amino acid side chain or a non-natural amino acid side chain;

each R⁵and R⁶is independently hydrogen or an N-terminal capping group;

each R⁷is independently —OH or a C-terminal capping group;

Q is selected from the group consisting of a carbonyl, a natural amino acid side chain, and a non-natural amino acid side chain;

each X and Y is independently selected from the group consisting of a bond, a natural amino acid side chain covalently attached to Z, and a non-natural amino acid side chain covalently attached to Z;

Z is a group comprising a moiety selected from the group consisting of an amide bond, a disulfide bond, an isopeptide bond, a 1,2,3-triazole, and an optionally substituted 1,2-quinone;

L′ and L″ are each independently an optional bivalent linking group or a bond;

m is an integer from 0 to 6;

n is an integer from 0 to 6;

p is an integer from 0 to 100;

q is 0 or 1;

r is 0 or 1;

t is an integer from 0 to 100;

u is 0 or 1; and

* is a covalent bond immobilizing the cyclic peptide on a second solid support having a second reactive surface;

e. identifying one or more sequences of the second population of core binder sequences demonstrating strong binding properties to the target of interest, whereby a matured core binder sequence is determined;

f. performing at least one of N-terminal and C-terminal extension of the matured core peptide binder sequence determined in step e, whereby a population of matured, extended peptide binders is generated;

g. exposing the target of interest to a peptide microarray comprising the population of matured, extended peptide binders generated in step f wherein the population of mature, extended peptide binders comprises the cyclic peptide of formula I

embedded image

wherein each R¹, R², R³and R⁴is independently a natural amino acid side chain or a non-natural amino acid side chain;

each R⁵and R⁶is independently hydrogen or an N-terminal capping group;

each R⁷is independently —OH or a C-terminal capping group;

Q is selected from the group consisting of a carbonyl, a natural amino acid side chain, and a non-natural amino acid side chain;

each X and Y is independently selected from the group consisting of a bond, a natural amino acid side chain covalently attached to Z, and a non-natural amino acid side chain covalently attached to Z;

Z is a group comprising a moiety selected from the group consisting of an amide bond, a disulfide bond, an isopeptide bond, a 1,2,3-triazole, and an optionally substituted 1,2-quinone;

L′ and L″ are each independently an optional bivalent linking group or a bond;

m is an integer from 0 to 6;

n is an integer from 0 to 6;

p is an integer from 0 to 100;

q is 0 or 1;

r is 0 or 1;

t is an integer from 0 to 100;

u is 0 or 1; and

* is a covalent bond immobilizing the cyclic peptide on a third solid support having a third reactive surface; and

h. identifying overlap in the N-terminal or C-terminal peptide binder sequences of the peptides comprising the population of mature, extended peptide binders, whereby a mature, extended core peptide binder sequence is determined.

104. The method of clause 103, wherein Z comprises a moiety selected from the group consisting of an amide bond,

embedded image

wherein v is an integer from 0 to 6, w is an integer from 0 to 6, and y is an integer from 0 to 6, and ** is a point of connection to the rest of the cyclic peptide.

105. The method of clause 103 or 104, wherein Z comprises a peptide bond, Q is a carbonyl, q is 0, r is 1 and u is 0.

106. The method of clause 103 or 104, wherein X and Y are bonds to Z, Z comprises **—S—S—**, q is 1, and u is 1.

107. The method of clause 103 or 104, wherein Z comprises

embedded image

and v is 1.

108. The method of clause 103 or 104, wherein Z comprises

embedded image

and w is 1.

109. The method of clause 103 or 104, wherein Y is a bond to Z, Z comprises

embedded image

u is 1, and y is 1.

110. The method of clause 103 or 104, wherein Y is a bond to Z, Z comprises

embedded image

q is 0, and u is 1.

111. The method of clause 103 or 104, wherein X and Y are bonds to Z, Z comprises

embedded image

q is 1, and u is 1.

112. The method of any one of clauses 103 to 111, wherein L′ is 6-aminohexanoic acid.

113. The method of any one of clauses 103 to 112, wherein L″ is CH₂CH₂.

114. The method of any one of clauses 103 to 113, wherein m is 0.

115. The method of any one of clauses 103 to 114, wherein n is 0.

116. The method of any one of clauses 103 to 115, wherein t is 0, and p is an integer 1 to 100.

117. The method of any one of clauses 103 to 116, wherein p is an integer 1 to 20.

118. The method of any one of clauses 103 to 117, wherein at least one of a label-free and affinity analysis of the mature, extended core peptide binder sequence is performed.

119. The method of any one of clauses 103 to 118, wherein the first, second, and/or third solid support comprises at least one of glass, plastic, and carbon composite.

120. The method any one of clauses 103 to 119, wherein the peptide binders of the first population comprise the same number of amino acids.

121. The method of any one of clauses 103 to 120, wherein the peptide binders of the first population do not include the amino acid cysteine or methionine, or histidine-proline-glutamine motifs, or amino acid repeats of 2 or more amino acids.

122. The method of any one of clauses 103 to 121, wherein the cyclic peptide binders of the population of mature, extended peptide binders include at least one of an N-terminal wobble synthesis oligopeptide and a C-terminal wobble synthesis oligopeptide.

123. The method of any one of clauses 103 to 122, wherein the core binder sequence comprises a greater number of amino acids than the number of amino acids for each of the peptides comprising the first population of peptide binders.

124. The method of any one of clauses 103 to 123, wherein steps e. and h. comprise principled clustering analysis.

125. The method of any one of clauses 103 to 124, wherein steps c. to h. are repeated for the mature, extended core peptide binder sequence.

126. The method of any one of clauses 103 to 125 wherein the peptide microarray comprises one or more linear peptides and wherein the method further comprises the step of contacting the one or more linear peptides on the peptide microarray with a protease capable of digesting the one or more linear peptides.

127. The method of clause 126 wherein the protease is an amino protease or a mixture of amino proteases.

128. The method of clause 127 wherein the protease is dipeptidyl peptidase IV, aminopeptidase m, or a combination thereof.

129. The method of clause 45 or 46 wherein the butelase 1 recognition sequence is NHV.

130. The method of clause 47 or 48 wherein the glutamine side chain is part of the sequence [WY][DE][DE][YW]ALQ[GST]YD (SEQ ID NO: 194) and the lysine side chain is part of the sequence RSKLG (SEQ ID NO: 195).

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 is a schematic view of a microarray system for array synthesis by way of a photolithographic technique utilizing photolithographic mask (Prior art).

FIG. 2 is a schematic view of a microarray system for array synthesis by way of a photolithographic technique utilizing maskless photolithography (Prior art).

FIG. 3 is a schematic view illustrating arrays comprising peptide probes thereon in accordance with the present disclosure.

FIG. 4 is a schematic illustration of an embodiment of a process of the present disclosure.

FIG. 5 is a schematic view illustrating another embodiment of an array comprising peptide probes thereon in accordance with the present disclosure.

FIG. 6 is a schematic view depicting an embodiment of the process of FIG. 4.

FIG. 7 is a schematic view depicting a reaction scheme for head-to-tail (amide bond formation) cyclization of peptide libraries on a surface.

FIG. 8A is a slide image of subarrays of peptides each having a glutamate linker amino acid where (bottom) a linear library of peptides is formed from OtBu-protected variants of the glutamate linker amino acid after deprotection and biotin labelling, and (top) a cyclic library of peptides is formed from OAll-protected variants of the glutamate linker amino acid after deprotection and biotin labelling.

FIG. 8B is a schematic view depicting (bottom) deprotection of OtBu-protected variants of glutamate, followed by biotin labelling and (top) deprotection of OAll-protected variants of glutamate, followed by biotin labelling.

FIG. 9A and 9B are schematic views depicting a process for forming subarrays of linear and cyclic peptide libraries where the peptides of the cyclic library that fail to cyclize are the same as those of the linear library.

FIG. 10 is a chart showing cyclic versus linear fluorescent intensity for a peptide library of the format XXXXU bound to streptavidin-Cy5.

FIG. 11 is a chart showing surface plasmon resonance (SPR) binding curves of a head-to-tail cylic NQpWQ (SEQ ID NO: 84) peptide to a streptavidin coated CM5 BIAcore chip.

FIG. 12 is a chart showing surface plasmon resonance (SPR) binding of a head-to-tail cylic NQpWQ (SEQ ID NO: 84) peptide to a streptavidin coated CM5 BIAcore chip versus peptide concentration. The dashed line indicates the binding constant.

FIG. 13 is a chart showing cyclic versus linear fluorescent intensity for a peptide library of the format JXXHPQXXJU (SEQ ID NO: 86) bound to streptavidin-Cy5.

FIG. 14 is a chart showing cyclic fluorescent intensity versus log fold change (logFC) between cyclic and linear fluorescent intensity for a peptide library of the format JXXHPQXXJU (SEQ ID NO: 86) bound to streptavidin-Cy5. The darker points indicate the top 100 JXXHPQXXJU (SEQ ID NO: 86) cyclic peptides.

FIG. 15 is a chart showing cyclic fluorescent intensity versus log fold change (logFC) between cyclic and linear fluorescent intensity for a peptide library of the format JXXHPQXXJU (SEQ ID NO: 86) bound to streptavidin-Cy5, where each XXHPQXX (SEQ ID NO: 187) corresponds to one of the top 100 cyclic peptides of the chart shown in FIG. 14, and J is random. FIG. 15 discloses SEQ ID NOS 230-231, respectively, in order of appearance.

FIG. 16 is a chart showing surface plasmon resonance (SPR) binding curves of a head-to-tail cylic LYDHPQNGGQ (SEQ ID NO: 190) peptide to a streptavidin coated CM5 BIAcore chip at various peptide concentrations.

FIG. 17 is a chart showing surface plasmon resonance (SPR) binding of a head-to-tail cylic LYDHPQNGGQ (SEQ ID NO: 190) peptide to a streptavidin coated CM5 BIAcore chip versus peptide concentration. The dashed line indicates the binding constant.

FIG. 18 is a chart showing surface plasmon resonance (SPR) binding curves of a linear NH₂-LYDHPQNGGQ-COOH (SEQ ID NO: 191) peptide to a streptavidin coated CM5 BIAcore chip at various peptide concentrations.

FIG. 19 is a chart showing surface plasmon resonance (SPR) binding of a linear NH₂-LYDHPQNGGQ-COOH (SEQ ID NO: 191) peptide to a streptavidin coated CM5 BIAcore chip versus peptide concentration. The dashed line indicates the binding constant.

FIG. 20 is a chart showing surface plasmon resonance (SPR) binding curves of a head-to-tail cylic QNDHPQNGGQ (SEQ ID NO: 192) peptide to a streptavidin coated CM5 BIAcore chip at various peptide concentrations.

FIG. 21 is a chart showing surface plasmon resonance (SPR) binding of a head-to-tail cylic QNDHPQNGGQ (SEQ ID NO: 192) peptide to a streptavidin coated CM5 BIAcore chip versus peptide concentration. The dashed line indicates the binding constant.

FIG. 22 is a chart showing surface plasmon resonance (SPR) binding curves of a linear NH₂-QNDHPQNGGQ-COOH (SEQ ID NO: 193) peptide to a streptavidin coated CM5 BIAcore chip at various peptide concentrations.

FIG. 23 is a chart showing surface plasmon resonance (SPR) binding of a linear NH₂-QNDHPQNGGQ-COOH (SEQ ID NO: 193) peptide to a streptavidin coated CM5 BIAcore chip versus peptide concentration. The dashed line indicates the binding constant.

DETAILED DESCRIPTION OF ILLUSTRATIVE EMBODIMENTS

The instant disclosure provides peptide microarrays, methods of generating peptide microarrays, and methods of identifying peptide binders (e.g., cyclic peptides) using microarrays by which novel peptide binders (e.g., cyclic peptides) can be synthesized, optimized and identified. In some embodiments, the final optimization step is cyclization according to the methods described herein after the peptide binders are matured and extended on the peptide microarray.

According to some embodiments, the peptide microarrays disclosed herein identify peptide binders (e.g., cyclic peptides) through identification of overlapping binding of the target of interest to small peptides comprising a comprehensive population of peptides immobilized on a peptide microarray, then performing an exhaustive peptide maturation of the isolated core binder sequence, followed by N-terminal and C-terminal extension procedures and, in one embodiment, followed by cyclization. In some embodiments, the mature, extended core peptide binder sequence may be subjected to further maturation processes and a new series of N-terminal and C-terminal extension processes, and, for example, followed by cyclization.

Several embodiments of the invention are described in the Summary section of this patent application and each of the embodiments described in this Detailed Description section of the application applies to the embodiments described in the Summary, including the embodiments described by the enumerated clauses below.

1. A peptide microarray comprising at least one cyclic peptide of formula I

embedded image

wherein each R¹, R², R³and R⁴is independently a natural amino acid side chain or a non-natural amino acid side chain;

each R⁵and R⁶is independently hydrogen or an N-terminal capping group;

each R⁷is independently —OH or a C-terminal capping group;

Q is selected from the group consisting of a carbonyl, a natural amino acid side chain, and a non-natural amino acid side chain;

each X and Y is independently selected from the group consisting of a bond, a natural amino acid side chain covalently attached to Z, and a non-natural amino acid side chain covalently attached to Z;

Z is a group comprising a moiety selected from the group consisting of an amide bond, a disulfide bond, an isopeptide bond, a 1,2,3-triazole, and an optionally substituted 1,2-quinone;

L′ and L″ are each independently an optional bivalent linking group or a bond;

m is an integer from 0 to 6;

n is an integer from 0 to 6;

p is an integer from 0 to 100;

q is 0 or 1;

r is 0 or 1;

t is an integer from 0 to 100;

u is 0 or 1;

and * is a point of connection connecting the at least one cyclic peptide to a solid support having a reactive surface,

2. The peptide microarray of clause 1, wherein Z comprises a moiety selected from the group consisting of an amide bond,

embedded image

wherein v is an integer from 0 to 6, w is an integer from 0 to 6, and y is an integer from 0 to 6, and ** is a point of connection to the rest of the cyclic peptide.

3. The peptide microarray of clause 1 or 2, wherein Z comprises a peptide bond, Q is a carbonyl, q is 0, r is 1, and u is 0.

4. The peptide microarray of clause 1 or 2, wherein each Q and X is a cysteine side chain, Z is a disulfide bond, q is 1, r is 1, t is 0, and u is 0.

5. The peptide microarray of clause 1 or 2, wherein X and Y are bonds to Z, Z comprises **—S—S—**, q is 1, and u is 1.

6. The peptide microarray of clause 1 or 2, wherein Z comprises

embedded image

and v is 1.

7. The peptide microarray of clause 1 or 2, wherein Z comprises

embedded image

and w is 1.

8. The peptide microarray of clause 1 or 2, wherein Z comprises

embedded image

r is 0, t is 0, u is 0, and y is 1.

9. The peptide microarray of clause 1 or 2, wherein Y is a bond to Z, Z comprises

embedded image

u is 1, and y is 1.

10. The peptide microarray of clause 1 or 2, wherein Y is a bond to Z, Z comprises

embedded image

q is 0, and u is 1.

11. The peptide microarray of clause 1 or 2, wherein X is a bond to Z, Z comprises

embedded image

q is 1, r is 0, t is 0, and u is 0.

12. The peptide microarray of clause 1 or 2, wherein X and Y are bonds to Z, Z comprises

embedded image

q is 1, and u is 1.

13. The peptide microarray of any one of clauses 1 to 12, wherein each L′ and

L″ is independently of the formula II

embedded image

wherein each R⁸and R⁸′ is independently selected from the group consisting of H, D, halogen, C₁-C₆alkyl, C₂-C₆alkenyl, C₂₋C₆alkynyl, C₃₋C₆cycloalkyl, 3- to 7-membered heterocycloalkyl, C₆-C₁₀aryl, 5- to 7-membered heteroaryl, —OR⁹, —OC(O)R⁹, —NR⁹R⁹′, —NR⁹C(O)R¹⁰—C(O)R⁹, —C(O)OR⁹, and —C(O)NR⁹R⁹′, wherein each hydrogen atom in C₁-C₆alkyl, C₂-C₆alkenyl, C₂₋C₆alkynyl, C₃₋C₆cycloalkyl, 3- to 7-membered heterocycloalkyl, C₆-C₁₀aryl and 5- to 7-membered heteroaryl is independently optionally substituted by halogen, C₁-C₆alkyl, C₂-C₆alkenyl, C₂₋C₆alkynyl, —OR¹¹; each R⁹, R⁹′, R¹⁰, and R¹¹is independently selected from the group consisting of H, D, hydroxyl, C₁-C₇alkyl, C₂-C₇alkenyl, C₂₋C₇alkynyl, C₃₋C₆cycloalkyl, 3- to 7-membered heterocycloalkyl, C₆-C₁₀aryl and 5- to 7-membered heteroaryl; and a is an integer from 1 to 10; or the formula III or IV