Claims
- 1. A method for constructing a library of designed proteins, comprising the steps of:
providing an amino acid sequence derived from a lead protein, the amino acid sequence being designated as a lead sequence; comparing the lead sequence with a plurality of tester protein sequences; and selecting from the plurality of tester protein sequences at least two peptide segments that have at least 15% sequence identity with the lead sequence, the selected peptide segments forming a hit library; and forming a library of designed proteins by substituting the lead sequence with the hit library.
- 2. The method of claim 1, wherein the length of the lead sequence is between 5-100 aa.
- 3. The method of claim 1, wherein the length of the lead sequence is between 6-80 aa.
- 4. The method of claim 1, wherein the length of the lead sequence is between 8-50 aa.
- 5. The method of claim 1, wherein the lead protein is a type of protein selected from the group consisting of enzymes receptors, cytokines, tumor suppressors, chemokines, antibodies and growth factors.
- 6. The method of claim 1, wherein the plurality of tester protein sequences comprises human protein sequences.
- 7. The method of claim 1, wherein the plurality of tester protein sequences comprises humanized protein sequences each having at least 70% human sequence.
- 8. The method of claim 1, wherein the plurality of tester protein sequences is retrieved from a protein database in Genbank or Swiss-Prot database.
- 9. The method of claim 1, wherein the step of comparing the lead sequence with the plurality of tester protein sequences is implemented by an algorithm selected from the group consisting of BLAST, PSI-BLAST, profile HMM, and COBLATH.
- 10. The method of claim 1, wherein the sequence identity of the selected peptide segments in the hit library with the lead sequence is at least 25%.
- 11. The method of claim 1, wherein the sequence identity of the selected peptide segments in the hit library with the lead sequence is at least 35%.
- 12. The method of claim 1, wherein the sequence identity of the selected peptide segments in the hit library with the lead sequence is at least 45%.
- 13. The method of claim 1, further comprising the steps of:
selecting proteins with a desired function from the library of designed proteins.
- 14. The method of claim 13, wherein the desired function is an improved biological function of the lead protein.
- 15. The method of claim 14, wherein the improved biological function is selected from the group consisting of enhanced stability, enhanced enzymatic activity, enhanced binding affinity to the cognate ligand of the lead protein, and enhanced expression in a predetermined organism.
- 16. The method of claim 1, further comprising the step of:
constructing a nucleic acid library comprising DNA segments encoding the amino acid sequences of the hit library.
- 17. The method of claim 1, further comprising the steps of:
building an amino acid positional variant profile of the hit library; combining the amino acid variants in the hit library to produce a combination of hit variants which form a hit variant library; and selecting proteins with a desirable function from the hit variant library.
- 18. The method of claim 17, further comprising the steps of:
determining if a member of the hit variant library is structurally compatible with a three-dimensional structure of the lead sequence or the lead protein by using a scoring function; and selecting the members that score equal to or better than the lead sequence or the lead protein.
- 19. The method of claim 18, wherein the three-dimensional structure of the lead sequence or the lead protein is a structure derived from X-crystallography, nuclear magnetic resonance (NMR) spectroscopy or theoretical structural modeling.
- 20. The method of claim 18, wherein the scoring function is an energy scoring function selected from the group consisting of electrostatic interactions, van der Waals interactions, electrostatic solvation energy, solvent-accessible surface solvation energy, and conformational entropy.
- 21. The method of claim 1, wherein the scoring function is a scoring function that incorporates a forcefield selected from the group consisting of the Amber forcefield, Charmm forcefield, the Discover cvff forcefields, the ECEPP forcefields, the GROMOS forcefields, the OPLS forcefields, the MMFF94 forcefield, the Tripos forcefield, the MM3 forcefield, the Dreiding forcefield, and UNRES forcefield.
- 22. The method of claim 17, wherein the step of selecting the members includes selecting the members that have equal to or lower total energy than that of the lead sequence or the lead protein calculated based on a formula of
- 23. The method of claim 17, wherein the step of selecting the members includes selecting the members that have a lower binding free energy than that of the lead sequence or the lead protein calculated as the difference between the bound and unbound states using a refined scoring function
- 24. The method of claim 1, further comprising the steps of:
constructing a nucleic acid library comprising DNA segments encoding the amino acid sequences of the library of the designed proteins; expressing the nucleic acid library to generate a library of recombinant proteins; and selecting proteins with a desired function from the library of recombinant proteins.
- 25. The method of claim 1, further comprising the steps of:
building an amino acid positional variant profile of the hit library; converting amino acid positional variant profile of the hit library into a nucleic acid positional variant profile by back-translating the amino acid positional variants into their corresponding genetic codons; constructing a degenerate nucleic acid library of DNA segments by combinatorially combining the nucleic acid positional variants; expressing the degenerate nucleic acid library to generate a library of recombinant proteins; and selecting proteins with a desired function from the library of recombinant proteins.
- 26. An antibody against human vascular endothelial growth factor (VEGF), wherein the binding affinity of the antibody to VEGF is higher than 106 M−1, and the heavy chain CDR3 of the monoclonal antibody comprises an amino acid sequence selected from group consisting of SEQ ID Nos: 36-48 and 63-125.
- 27. The antibody of claim 26, wherein the heavy chain CDR1 of the antibody comprises an amino acid sequence selected from group consisting of SEQ ID Nos: 19-30.
- 28. The antibody of claim 26, wherein the heavy chain CDR2 of the monoclonal antibody comprises an amino acid sequence selected from group consisting of SEQ ID Nos: 31-35.
- 29. The antibody of claim 26, wherein the antibody is a monoclonal antibody, Fab, Fv, or a single chain antibody.
- 30. An antibody against human vascular endothelial growth factor (VEGF), wherein the binding affinity of the antibody to VEGF is higher than 106 M−1, and the heavy chain variable region (VH) of the antibody comprises an amino acid sequence selected from group consisting of SEQ ID Nos: 126, 128, 129, 130, and 131, and the light chain variable region (VL) of the antibody comprises an amino acid sequence of SEQ ID No: 127.
CROSS REFERENCE TO RELATED APPLICATIONS
[0001] This application is a continuation-in-part of U.S. application Ser. No. 10/153,159, filed May 20, 2002, entitled “Structure-Based Selection And Affinity Maturation of Antibody Library, and is also a continuation-in-part of application Ser. No. 10/153,176, filed May 20, 2002, entitled “Generation Affinity Maturation of Antibody Library in Silico”, both of which are a continuation-in-part of U.S. patent application Ser. No. 10/125,687 entitled “Structure-based construction of human antibody library” filed Apr. 17, 2002, which claims the benefit of U.S. Provisional Application Serial No. 60/284,407 entitled “Structure-based construction of human antibody library” filed Apr. 17, 2001. These applications are incorporated herein by reference.
Provisional Applications (1)
|
Number |
Date |
Country |
|
60284407 |
Apr 2001 |
US |
Continuation in Parts (3)
|
Number |
Date |
Country |
Parent |
10153159 |
May 2002 |
US |
Child |
10443134 |
May 2003 |
US |
Parent |
10153176 |
May 2002 |
US |
Child |
10443134 |
May 2003 |
US |
Parent |
10125687 |
Apr 2002 |
US |
Child |
10153176 |
May 2002 |
US |