Claims
- 1. A library comprising a plurality of polypeptides, each polypeptide comprising a first and a second zinc finger domain,
wherein (1) the first and second zinc finger domains of each polypeptide are each identical to a zinc finger domain from a naturally occurring protein and either (i) do not occur in the same naturally occurring protein or (ii) occur in the same naturally occurring protein in a different configuration than in the polypeptide, (2) the first zinc finger domain varies among polypeptides of the plurality, and (3) the second zinc finger domain varies among polypeptides of the plurality.
- 2. A library comprising a plurality of polypeptides, each polypeptide comprising a first and a second zinc finger domain,
wherein (1) the first and second zinc finger domains of each polypeptide are each identical to a zinc finger domain from a naturally occurring protein and do not occur in the same naturally occurring protein, (2) the first zinc finger domain varies among polypeptides of the plurality, and (3) the second zinc finger domain varies among polypeptides of the plurality.
- 3. The library of claim 1, wherein each polypeptide of the plurality binds to a target DNA site with a dissociation constant (Kd) of less than 5 nM.
- 4. The library of claim 2, wherein each polypeptide of the plurality binds to a predetermined site.
- 5. The library of claim 2 wherein the plurality contains at least 80% of the members of the library.
- 6. The library of claim 2, wherein the first zinc finger domain of at least one polypeptide of the plurality is selected from the zinc finger domains listed in any of Tables 5, 6, and 7.
- 7. The library of claim 6 wherein the first zinc finger domain of each polypeptide of the plurality is selected from the zinc finger domains listed in any of Tables 5, 6, and 7.
- 8. The library of claim 6, wherein the first and second zinc finger domains of at least one polypeptide of the plurality are selected from the zinc finger domains listed in any of Tables 5, 6, and 7.
- 9. The library of claim 2 wherein the naturally occurring protein is a eukaryotic protein.
- 10. The library of claim 9 wherein the naturally occurring protein is a mammalian protein.
- 11. The library of claim 10 wherein the naturally occurring protein is a human protein.
- 12. The library of claim 2, wherein each polypeptide of the plurality further comprises a third zinc finger domain.
- 13. The library of claim 2, wherein the third zinc finger domain is a domain of a naturally occurring protein.
- 14. The library of claim 12, wherein the third zinc finger domain differs from a domain of a naturally occurring protein by insertion, deletion, or substitution of at least one but no more than six amino acids.
- 15. The library of claim 2, wherein each polypeptide of the plurality further comprises a transcriptional regulatory domain.
- 16. The library of claim 2, wherein the plurality comprises at least 100 different polypeptides.
- 17. A library comprising a plurality of nucleic acids, each nucleic acid of the plurality encoding a polypeptide comprising a first and a second zinc finger domain,
wherein (1) the first and second zinc finger domains of the polypeptide encoded by each nucleic acid of the plurality are each identical to zinc finger domains from naturally occurring proteins and either (i) do not occur in the same naturally occurring protein or (ii) occur in the same naturally occurring protein in a different configuration than in the polypeptide, (2) the first zinc finger domain varies among nucleic acids of the plurality, and (3) the second zinc finger domain varies among nucleic acids of the plurality.
- 18. The library of claim 17, wherein each polynucleotide resides within a cell.
- 19. The library of claim 18, wherein the cell is a eukaryotic cell.
- 20. The library of claim 19, wherein the cell is a yeast cell.
- 21. The library of claim 18, wherein the cell contains a heterologous reporter construct comprising a reporter gene operably linked to a promoter.
- 22. A method of producing a plurality of chimeric nucleic acids, the method comprising:
(a) providing a set of nucleic acids, each nucleic acid of the set comprising a sequence encoding a zinc finger domain from a naturally occurring protein; and (b) joining each nucleic acid of the set to one or more other nucleic acids of the set to form a plurality of chimeric nucleic acids.
- 23. The method of claim 22 wherein the set comprises at least two sequences that encode zinc finger domains from different naturally occurring proteins.
- 24. The method of claim 22 wherein the set also comprises at least two sequences that encode zinc finger domains from the same naturally occurring proteins.
- 25. The method of claim 22 wherein step (a) comprises amplifying a collection of polynucleotides encoding zinc finger domains from genomic DNA, a messenger RNA (mRNA) mixture, or a complementary DNA (cDNA) mixture using an oligonucleotide primer that anneals to sequences that encode a conserved domain boundary.
- 26. The method of claim 22 wherein step (a) comprises:
(i) selecting a plurality of zinc finger domains, each domain having specificity for a sequence within a target site of interest; and (ii) providing a plurality of polynucleotides, each polynucleotide encoding at least one of the selected zinc finger domains, thereby providing the set of polynucleotides.
- 27. The method of claim 26 wherein the plurality of zinc finger domains are selected by querying a database that includes information relating zinc finger domains to their respective binding sites.
- 28. A method of generating an artificial zinc finger polypeptide that specifically binds to a target DNA site, the method comprising:
providing the polypeptide library of claim 1;contacting the target DNA site with the polypeptides of the library; and identifying one or more polypeptides that specifically bind to the target DNA site.
- 29. The method of claim 28, wherein the polypeptides of the library are immobilized on a solid support.
- 30. The method of claim 28, wherein each polypeptide of the library is displayed on the surface of a virus or viral particle.
- 31. A method of profiling a test nucleic acid, the method comprising;
contacting the test nucleic acid with the polypeptides of the library of claim 1;evaluating a binding interaction between the test nucleic acid and each polypeptide of the library; and determining a profile of the test nucleic acid from results of the evaluating, the profile comprising information about the evaluated binding interactions between the test nucleic acid and each polypeptide of the library.
- 32. A method of identifying a nucleic acid encoding a polypeptide that recognizes a target DNA site, the method comprising:
providing a plurality of nucleic acids, each nucleic acid of the plurality encoding a polypeptide comprising a first and a second zinc finger domain, wherein the first and second zinc finger domains of the polypeptide encoded by each nucleic acid of the plurality are identical to zinc finger domains from different naturally occurring mammalian proteins, the first zinc finger domain varies among nucleic acids of the plurality, and the second zinc finger domain varies among nucleic acids of the plurality; providing cells containing a reporter gene operably linked to a target DNA site; expressing the plurality of nucleic acids in the cells; identifying a cell having altered expression of the reporter gene relative to the level of expression in the absence of a polypeptide that recognizes the target DNA site level; and identifying a nucleic acid expressed in the cell, the nucleic acid being a nucleic acid of the plurality, thereby identifying a nucleic acid encoding a polypeptide that recognizes the target DNA site.
- 33. The method of claim 32, wherein the given level is the level of reporter gene expression in a reference cell that includes a reference nucleic acid.
- 34. A method of identifying a nucleic acid encoding a zinc finger polypeptide that specifically recognizes a target DNA site, the method comprising:
providing the library of polynucleotides of claim 17;providing cells containing a reporter gene operably linked to a target DNA site; expressing the plurality of polynucleotides in the cells; identifying a cell having altered expression of the reporter gene relative to the level of expression in the absence of a polypeptide that recognizes the target DNA site level; and identifying a polynucleotide expressed in the cell, the polynucleotide being a polynucleotide of the plurality, thereby identifying a polynucleotide encoding a polypeptide that specifically recognizes the target DNA site.
- 35. The method of claim 34, further comprising the step of modifying the amino acid sequence of the identified zinc finger polypeptide without altering the binding specificity of the zinc finger polypeptide for the target DNA site.
- 36. The method of claim 34, wherein the target site comprises at least six predetermined nucleotides.
- 37. The method of claim 34, wherein the cells are yeast cells.
- 38. The method of claim 34, further comprising the step of introducing the polynucleotides into each of the cells.
- 39. The method of claim 34, further comprising the step of fusing the cells containing the reporter gene to cell that includes the polynucleotides of the library.
- 40. A polypeptide comprising a first and a second zinc finger domain, wherein the first and second zinc finger domains are each from naturally occurring proteins and are selected from the zinc finger domains of Tables 5, 6, and 7.
- 41. A polypeptide of claim 40 that further comprises a third zinc finger domain, wherein the set of three zinc finger domains is listed in a row of Table 10.
- 42. A nucleic acid sequence comprising a polynucleotide that encodes the polypeptide of claim 40.
- 43. A nucleic acid sequence comprising a polynucleotide that encodes the polypeptide of claim 41.
- 44. A purified polypeptide comprising an amino acid sequence selected from the group consisting of:
Xa-X-Cys-X2-5-Cys-X3-Xa-X-His-X-Ser-Ser-Xb-X-Arg-His-X3-5-His (SEQ ID NO: 167), Xa-X-Cys-X2-5-Cys-X3-Xa-X-Ile-X-Ser-Asn-Xb-X-Arg-His-X3-5-His (SEQ ID NO: 168), Xa-X-Cys-X2-5-Cys-X3-Xa-X-Lys-X-Ser-Asn-Xb-X-Arg-His-X3-5-His (SEQ ID NO:169), Xa-X-Cys-X2-5-Cys-X3-Xa-X-Gln-X-Ser-Asn-Xb-X-Lys-His-X3-5-His (SEQ ID NO:170), Xa-X-Cys-X2-5-Cys-X3-Xa-X-Gln-X-Ser-His-Xb-X-Thr-His-X3-5-His (SEQ ID NO:171), Xa-X-Cys-X2-5-Cys-X3-Xa-X-Val-X-Ser-Asn-Xb-X-Val-His-X3-5-His (SEQ ID NO: 172), Xa-X-Cys-X2-5-Cys-X3-Xa-X-Asp-X-Ser-Cys-Xb-X-Arg-His-X3-5-His (SEQ ID NO:193), Xa-X-Cys-X2-5-Cys-X3-Xa-X-Ile-X-Ser-Asn-Xb-X-Val-His-X3-5-His (SEQ ID NO: 194), Xa-X-Cys-X2-5-Cys-X3-Xa-X-Trp-X-Ser-Asn-Xb-X-Arg-His-X3-5-His (SEQ ID NO:195), and Xa-X-Cys-X2-5-Cys-X3-Xa-X-Asp-X-Ser-Ala-Xb-X-Arg-His-X3-5-His (SEQ ID NO:196), wherein Xa is phenylalanine or tyrosine, and Xb is a hydrophobic residue.
- 45. The purified polypeptide of claim 44 wherein the polypeptide comprises an amino acid sequence selected from the group consisting of: SEQ ID NO:173, 175, 177, 179, 181, 183, 185, 187, 189, and 191.
- 46. A purified polypeptide comprising between two and ten segments, each segment having an amino acid sequence selected from the group consisting of:
Xa-X-Cys-X2-5-Cys-X3-Xa-X-His-X-Ser-Ser-Xb-X-Arg-His-X3-5-His (SEQ ID NO:167), Xa-X-Cys-X2-5-Cys-X3-Xa-X-Ile-X-Ser-Asn-Xb-X-Arg-His-X3-5-His (SEQ ID NO: 168), Xa-X-Cys-X2-5-Cys-X3-Xa-X-Lys-X-Ser-Asn-Xb-X-Arg-His-X3-5-His (SEQ ID NO:169), Xa-X-Cys-X2-5-Cys-X3-Xa-X-Gln-X-Ser-Asn-Xb-X-Lys-His-X3-5-His (SEQ ID NO:170), Xa-X-Cys-X2-5-Cys-X3-Xa-X-Gln-X-Ser-His-Xb-X-Thr-His-X3-5-His (SEQ ID NO:171), Xa-X-Cys-X2-5-Cys-X3-Xa-X-Val-X-Ser-Asn-Xb-X-Val-His-X3-5-His (SEQ ID NO: 172), Xa-X-Cys-X2-5-Cys-X3-Xa-X-Asp-X-Ser-Cys-Xb-X-Arg-His-X3-5-His (SEQ ID NO: 193), Xa-X-Cys-X2-5-Cys-X3-Xa-X-Ile-X-Ser-Asn-Xb-X-Val-His-X3-5-His (SEQ ID NO:194), Xa-X-Cys-X2-5-Cys-X3-Xa-X-Trp-X-Ser-Asn-Xb-X-Arg-His-X3-5-His (SEQ ID NO:195), and Xa-X-Cys-X2-5-Cys-X3-Xa-X-Asp-X-Ser-Ala-Xb-X-Arg-His-X3-5-His (SEQ ID NO: 196), wherein Xa is phenylalanine or tyrosine, and Xb is a hydrophobic residue.
- 47. The purified polypeptide of claim 44 further comprising a second amino acid sequence selected from the group consisting of:
Xa-X-Cys-X2-5-Cys-X3-Xa-X-Cys-X-Ser-Asn-Xb-X-Arg-His-X3-5-His (SEQ ID NO:68), Xa-X-Cys-X2-5-Cys-X3-Xa-X-His-X-Ser-Asn-Xb-X-Lys-His-X3-5-His (SEQ ID NO:69), Xa-X-Cys-X2-5-Cys-X3-Xa-X-Ser-X-Ser-Asn-Xb-X-Arg-His-X3-5-His (SEQ ID NO:70), Xa-X-Cys-X2-5-Cys-X3-Xa-X-Gln-X-Ser-Thr-Xb-X-Val-His-X3-5-His (SEQ ID NO:71), Xa-X-Cys-X2-5-Cys-X3-Xa-X-Val-X-Ser-Xc-Xb-X-Arg-His-X3-5-His (SEQ ID NO:72), Xa-X-Cys-X2-5-Cys-X3-Xa-X-Gln-X-Ser-His-Xb-X-Arg-His-X3-5-His (SEQ ID NO:73), Xa-X-Cys-X2-5-Cys-X3-Xa-X-Gln-X-Ser-Asn-Xb-X-Val-His-X3-5-His (SEQ ID NO:74), Xa-X-Cys-X2-5-Cys-X3-Xa-X-Gln-X-Ser-Xc-Xb-X-Arg-His-X3-5-His (SEQ ID NO:75), Xa-X-Cys-X2-5-Cys-X3-Xa-X-Gln-X-Ala-His-Xb-X-Arg-His-X3-5-His (SEQ ID NO:150), Xa-X-Cys-X2-5-Cys-X3-Xa-X-Gln-X-Phe-Asn-Xb-X-Arg-His-X3-5-His (SEQ ID NO:151), Xa-X-Cys-X2-5-Cys-X3-Xa-X-Gln-X-Ser-His-Xb-X-Thr-His-X3-5-His (SEQ ID NO:152), Xa-X-Cys-X2-5-Cys-X3-Xa-X-Gln-X-Ser-His-Xb-X-Val-His-X3-5-His (SEQ ID NO:153), Xa-X-Cys-X2-5-Cys-X3-Xa-X-Gln-X-Ser-Asn-Xb-X-Ile-His-X3-5-His (SEQ ID NO:154), Xa-X-Cys-X2-5-Cys-X3-Xa-X-Gln-X-Ser-Asn-Xb-X-Arg-His-X3-5-His (SEQ ID NO:155), Xa-X-Cys-X2-5-Cys-X3-Xa-X-Gln-X-Thr-His-Xb-X-Gln-His-X3-5-His (SEQ ID NO:156), Cys-X2-5-Cys-X3-Xa-X-Gln-X-Thr-His-Xb-X-Arg-His-X3-5-His (SEQ ID NO:157), Xa-X-Cys-X2-5-Cys-X3-Xa-X-Arg-X-Asp-Lys-Xb-X-Ile-His-X3-5-His (SEQ ID NO:158), Xa-X-Cys-X2-5-Cys-X3-Xa-X-Arg-X-Ser-Asn-Xb-X-Arg-His-X3-5-His (SEQ ID NO:159), Xa-X-Cys-X2-5-Cys-X3-Xa-X-Gln-X-Gly-Asn-Xb-X-Arg-His-X3-5-His (SEQ ID NO:161), Xa-X-Cys-X2-5-Cys-X3-Xa-Arg-X-Asp-Glu-Xb-X-Arg-His-X3-5-His (SEQ ID NO:162), Xa-X-Cys-X2-5-CYS-X3-Xa-X-Arg-X-Asp-His-Xb-X-Arg-His-X3-5-His (SEQ ID NO:163), Xa-X-Cys-X2-5-Cys-X3-Xa-X-Arg-X-Asp-His-Xb-X-Thr-His-X3-5-His (SEQ ID NO:164), Xa-X-Cys-X2-5-Cys-X3-Xa-X-Arg-X-Asp-Lys-Xb-X-Arg-His-X3-5-His (SEQ ID NO:165), Xa-X-Cys-X2-5-Cys-X3-Xa-X-Arg-X-Ser-His-Xb-X-Arg-His-X3-5-His (SEQ ID NO:166), and Xa-X-Cys-X2-5-Cys-X3-Xa-X-Arg-X-Thr-Asn-Xb-X-Arg-His-X3-5-His (SEQ ID NO:160), wherein Xa is phenylalanine or tyrosine, and Xb is a hydrophobic residue.
- 48. The polypeptide of claim 44 wherein the amino acid sequence is a segment of a naturally occurring protein.
- 49. A nucleic acid comprising a sequence encoding the polypeptide of claim 44.
- 50. A nucleic acid comprising a sequence encoding the polypeptide of claim 45.
- 51. A nucleic acid comprising a sequence encoding the polypeptide of claim 46.
- 52. A nucleic acid comprising a sequence encoding the polypeptide of claim 47.
CROSS-REFERENCE TO RELATED APPLICATIONS
[0001] This application claims priority to U.S. Application Serial No. 60/313,402, filed on Aug. 17, 2001, and No. 60/374,355, filed Apr. 22, 2002, the contents of both of which are incorporated herein for all purposes.
Provisional Applications (2)
|
Number |
Date |
Country |
|
60313402 |
Aug 2001 |
US |
|
60374355 |
Apr 2002 |
US |