Claims
- 1. A method for constructing a library of recombinant antibodies, comprising the steps of:
clustering variable regions of a collection of antibodies having known 3D structures into at least two families of structural ensembles, each family of structural ensemble comprising at least two different antibody sequences but with substantially identical main chain conformations; selecting a representative structural template from each family of structural ensemble; profiling a tester polypeptide sequence onto the representative structural template within each family of structural ensemble; and selecting the tester antibody sequence that is compatible to the structural constraints of the representative structural template.
- 2. The method of claim 1, wherein the collection of antibodies include antibodies or immunoglobulins collected in a protein database.
- 3. The method of claim 2, wherein the protein database is selected from the group consisting of the protein data bank of Brookhaven National Laboratory, genbank at the National Institute of Health, and Swiss-PROT protein sequence database.
- 4. The method of claim 1, wherein the collection of antibodies having known 3D structures include antibodies having resolved X-ray crystal structures, NMR structures or 3D structures based on structural modeling.
- 5. The method of claim 1, wherein the variable regions of the collection of antibodies are the full length heavy chain or light chain variable regions or specific portions of the heavy chain or light chain variable region selected from the group consisting of CDR, FR, and a combination thereof.
- 6. The method of claim 5, wherein the CDR is CDR1, CDR2, or CDR3 of an antibody.
- 7. The method of claim 5, wherein the FR is FR1, FR2, FR3, or FR4 of an antibody.
- 8. The method of claim 1, wherein the clustering step includes clustering the collection of antibodies such that the root mean square difference of the main chain conformations of antibody sequences in each family of the structural ensemble is less than 4 Å.
- 9. The method of claim 1, wherein the clustering step includes clustering the collection of antibodies such that the root mean square difference of the main chain conformations of antibody sequences in each family of the structural ensemble is less than 3 Å.
- 10. The method of claim 1, wherein the clustering step includes clustering the collection of antibodies such that the root mean square difference of the main chain conformations of antibody sequences in each family of the structural ensemble is less than 2 Å.
- 11. The method of claim 1, wherein the clustering step includes clustering the collection of antibodies such that the root mean square difference of the main chain conformations of antibody sequences in each family of the structural ensemble is between about 0.1-4.0 Å.
- 12. The method of claim 1, wherein the clustering step includes clustering the collection of antibodies such that the Z-score of the main chain conformations of antibody sequences in each family of the structural ensemble is more than 2.
- 13. The method of claim 1, wherein the clustering step includes clustering the collection of antibodies such that the Z-score of the main chain conformations of antibody sequences in each family of the structural ensemble is more than 3.
- 14. The method of claim 1, wherein the clustering step includes clustering the collection of antibodies such that the Z-score of the main chain conformations of antibody sequences in each family of the structural ensemble is more than 4.
- 15. The method of claim 1, wherein the clustering step includes clustering the collection of antibodies such that the Z-score of the main chain conformations of antibody sequences in each family of the structural ensemble is between about 2-8.
- 16. The method of claim 1, wherein the clustering step is implemented by an algorithm selected from the group consisting of CE, Monte Carlo and 3D clustering algorithms.
- 17. The method of claim 1, wherein the profiling step includes reverse threading the tester polypeptide sequence onto the representative structural template within each family of structural ensemble.
- 18. The method of claim 1, wherein the profiling step is implemented by a multiple sequence alignment algorithm.
- 19. The method of claim 18, wherein the multiple sequence alignment algorithm is profile HMM algorithm or PSI-BLAST.
- 20. The method of claim 1, wherein the representative structural template is adopted by a CDR region, and the profiling step includes profiling the tester polypeptide sequence that is a variable region of a human or non-human antibody onto the representative structural template within each family of structural ensemble.
- 21. The method of claim 1, wherein the representative structural template is adopted by a FR region, and the profiling step includes profiling the tester polypeptide sequence that is a variable region of a human antibody onto the representative structural template within each family of structural ensemble.
- 22. The method of claim 21, wherein the tester polypeptide sequence is a variable region of human germline antibody sequence.
- 23. The method of claim 1, wherein the tester polypeptide sequence is the sequence or a segment sequence of an expressed protein.
- 24. The method of claim 1, wherein the tester polypeptide sequence is a region of an antibody.
- 25. The method of claim 24, wherein the antibody is a human antibody.
- 26. The method of claim 1, wherein the tester polypeptide sequence is a region of a human germline antibody sequence.
- 27. The method of claim 1, wherein the selecting step includes selecting the tester polypeptide sequence by using an energy scoring function selected from the group consisting of electrostatic interactions, van der Waals interactions, electrostatic solvation energy, solvent-accessible surface solvation energy, and conformational entropy.
- 28. The method of claim 1, wherein the selecting step includes selecting the tester polypeptide sequence by using a scoring function incorporating a forcefield selected from the group consisting of the Amber forcefield, Charmm forcefield, the Discover cvff forcefields, the ECEPP forcefields, the GROMOS forcefields, the OPLS forcefields, the MMFF94 forcefield, the Tripose forcefield, the MM3 forcefield, the Dreiding forcefield, and UNRES forcefield, and other knowledge-based statistical forcefield (mean field) and structure-based thermodynamic potential functions.
- 29. The method of claim 1, further comprising the steps of:
building an amino acid positional variant profile of the selected tester polypeptide sequences; filtering out the variants with occurrence frequency lower than 3; and combining the variants remained to produce a combinatorial library of antibody sequences.
- 30. The method of claim 29, wherein the filtering step includes filtering out the variants with occurrence frequency lower than 5.
- 31. The method of claim 1, further comprising the following:
introducing the DNA segment encoding the selected tester polypeptide into cells of a host organism; expressing the DNA segment in the host cells such that a recombinant antibody containing the selected polypeptide sequence is produced in the cells of the host organism; and selecting the recombinant antibody that binds to a target antigen with affinity higher than 106 M−1.
- 32. The method of claim 31, wherein the recombinant antibody is a fully assembled antibody, a Fab fragment, an Fv fragment, or a single chain antibody.
- 33. The method of claim 31, wherein the host organism is selected from the group consisting of bacteria, yeast, plant, insect, and mammal.
- 34. The method of claim 31, wherein the target antigen is a small molecule, proteins, peptide, nucleic acid or polycarbohydate.
- 35. A method of constructing a library of recombinant antibodies based on a target structural template, comprising the steps of:
providing a target structural template of a variable region of one or more antibodies; profiling a tester polypeptide sequence onto the target structural template; and selecting the tester polypeptide sequence that is structurally compatible with the target structural template.
- 36. The method of claim 35, wherein the target structural template is a 3D structure of a heavy chain or light chain variable region of an antibody.
- 37. The method of claim 36, wherein the heavy chain or light chain variable region of an antibody is a CDR, a FR or a combination thereof.
- 38. The method of claim 35, wherein the target structural template is a 3D structural ensemble of heavy chain or light chain variable regions of at least two different antibodies.
- 39. The method of claim 38, wherein the heavy chain or light chain variable regions are CDRs, FRs or combinations thereof.
- 40. The method of claim 35, wherein the profiling step includes reverse threading the tester polypeptide sequence onto the target structural template.
- 41. The method of claim 35, wherein the profiling step is implemented by a multiple sequence alignment algorithm.
- 42. The method of claim 41, wherein the multiple sequence alignment algorithm is profile HMM algorithm or PSI-BLAST.
- 43. The method of claim 35, wherein the target structural template is adopted by a CDR region, and the profiling step includes profiling the tester polypeptide sequence that is a variable region of a human or non-human antibody onto the representative structural template within each family of structural ensemble.
- 44. The method of claim 35, wherein the target structural template is adopted by a FR region, and the profiling step includes profiling the tester polypeptide sequence that is a variable region of a human antibody onto the representative structural template within each family of structural ensemble.
- 45. The method of claim 44, wherein the tester polypeptide sequence is a variable region of human germline antibody sequence.
- 46. The method of claim 35, wherein the tester polypeptide sequence is the sequence or a segment sequence of an expressed protein.
- 47. The method of claim 35, wherein the tester polypeptide sequence is a region of an antibody.
- 48. The method of claim 35, wherein the antibody is a human antibody.
- 49. The method of claim 35, wherein the tester polypeptide sequence is a region of a human germline antibody sequence.
- 50. The method of claim 35, wherein the selecting step includes selecting the tester polypeptide sequence by using an energy scoring function selected from the group consisting of electrostatic interactions, van der Waals interactions, electrostatic solvation energy, solvent-accessible surface solvation energy, and conformational entropy.
- 51. The method of claim 35, wherein the selecting step includes selecting the tester polypeptide sequence by using a scoring function incorporating a forcefield selected from the group consisting of the Amber forcefield, Charmm forcefield, the Discover cvff forcefields, the ECEPP forcefields, the GROMOS forcefields, the OPLS forcefields, the MMFF94 forcefield, the Tripose forcefield, the MM3 forcefield, the Dreiding forcefield, and UNRES forcefield, and other knowledge-based statistical forcefield (mean field) and structure-based thermodynamic potential functions.
- 52. The method of claim 35, further comprising the steps of:
building an amino acid positional variant profile of the selected tester polypeptide sequences; filtering out the variants with occurrence frequency lower than 3; and combining the variants remained to produce a combinatorial library of antibody sequences.
- 53. The method of claim 52, wherein the filtering step includes filtering out the variants with occurrence frequency lower than 5.
- 54. The method of claim 35, further comprising the following:
introducing the DNA segment encoding the selected tester polypeptide into cells of a host organism; expressing the DNA segment in the host cells such that a recombinant antibody containing the selected polypeptide sequence is produced in the cells of the host organism; and selecting the recombinant antibody that binds to a target antigen with affinity higher than 106 M−1.
- 55. The method of claim 54, wherein the recombinant antibody is a fully assembled antibody, a Fab fragment, an Fv fragment, or a single chain antibody.
- 56. The method of claim 54, wherein the host organism is selected from the group consisting of bacteria, yeast, plant, insect, and mammal.
- 57. The method of claim 54, wherein the target antigen is a small molecule, proteins, peptide, nucleic acid or polycarbohydate.
- 58. A method for constructing a library of recombinant antibodies, comprising the steps of:
providing a target sequence of a heavy chain or light chain variable region of a target antibody; aligning the target sequence with a tester polypeptide sequence; and selecting the tester polypeptide sequence that has at least 15% sequence homology with the target sequence.
- 59. The method of claim 58, wherein the target sequence is the full length heavy chain or light chain variable region of the target antibody or specific portions of the variable regions of the target antibody selected from the group consisting of CDR, FR, and a combination thereof.
- 60. The method of claim 59, wherein the CDR is CDR1, CDR2, or CDR3 of the target antibody.
- 61. The method of claim 59, wherein the FR is FR1, FR2, FR3, or FR4 of the target antibody.
- 62. The method of claim 58, wherein the aligning step includes aligning the target sequence with the polypeptide segment of the tester polypeptide sequence by using a sequence alignment algorithm.
- 63. The method of claim 62, wherein the sequence alignment algorithm is selected from the group consisting of BLAST, PSI-BLAST, profile HMM, and COBLATH.
- 64. The method of claim 58, wherein the target sequence is a CDR region of the target antibody, and the alignment step includes aligning the tester polypeptide sequence that is the sequence or segment sequence of an expressed protein with the target sequence.
- 65. The method of claim 58, wherein the target sequence is a FR region of the target antibody, and the alignment step includes aligning the tester polypeptide sequence that is the sequence or segment sequence of a human antibody protein with the target sequence.
- 66. The method of claim 58, wherein the selecting step includes selecting the tester polypeptide sequence that has at least 25% sequence homology with the target sequence.
- 67. The method of claim 58, wherein the selecting step includes selecting the tester polypeptide sequence that has at least 35% sequence homology with the target sequence.
- 68. The method of claim 58, wherein the selecting step includes selecting the tester polypeptide sequence that has at least 35% sequence homology with the target sequence.
- 69. The method of claim 58, further comprising the steps of:
building an amino acid positional variant profile of the selected tester polypeptide sequences; filtering out the variants with occurrence frequency lower than 3; and combining the variants remained to produce a combinatorial library of antibody sequences.
- 70. The method of claim 58, wherein the filtering step includes filtering out the variants with occurrence frequency lower than 5.
- 71. The method of claim 58, further comprising the following:
introducing the DNA segment encoding the selected tester polypeptide into cells of a host organism; expressing the DNA segment in the host cells such that a recombinant antibody containing the selected polypeptide sequence is produced in the cells of the host organism; and selecting the recombinant antibody that binds to a target antigen with affinity higher than 106 M−1.
- 72. The method of claim 51, wherein the recombinant antibody is a fully assembled antibody, a Fab fragment, an Fv fragment, or a single chain antibody.
CROSS REFERENCE TO RELATED APPLICATION
[0001] This application claims the benefit of priority to U.S. Provisional Application Serial No. 60/284,407 entitled “Structure-based construction of human antibody library” filed Apr. 17, 2001. This application is incorporated herein by reference.
Provisional Applications (1)
|
Number |
Date |
Country |
|
60284407 |
Apr 2001 |
US |