Claims
- 1. An isolated polynucleotide encoding an acyl-specific C-domain, wherein said isolated polynucleotide encodes a polypeptide which comprises at least 45% sequence identity to at least one sequence selected from SEQ ID NOS: 1 and 2.
- 2. An isolated polynucleotide comprising a sequence selected from the group consisting of:
(a) a sequence selected from the group consisting of SEQ ID NOS: 5, 7, 9, 11, 13, 15, 17 and 19; (b) a sequence that is complementary to (a); (c) a sequence which hybridizes to said sequence of (a) or (b) under conditions of high stringency; and (d) a sequence which has at least 70% or higher homology to said sequence of (a), (b), or (c).
- 3. The isolated polynucleotide of claim 1, wherein said acyl-specific C-domain is involved in lipopeptide acyl-capping.
- 4. The isolated polynucleotide of claim 3, wherein said isolated polynucleotide resides in a gene locus selected from the group consisting of:
(a) the biosynthetic locus for ramoplanin from Actinoplanes sp. ATCC 33076; (b) the biosynthetic locus for A21978C from Streptomyces roseosporus NRRL 11379; (c) the biosynthetic locus for A54145 from Streptomyces fradiae ATCC 18158; (d) the biosynthetic locus for the calcium-dependent antibiotic from Streptomyces coelicolor A3(2); (e) the biosynthetic locus for a lipopeptide natural product from Streptomyces ghanaensis NRRL B-12104; (f) the biosynthetic locus for a lipopeptide natural product from Streptomyces refuineus NRRL 3143; (g) the biosynthetic locus for a lipopeptide natural product from Streptomyces aizunensis NRRL B-11277; (h) the biosynthetic locus for a lipopeptide natural product from Actinoplanes nipponensis FD 24834 ATCC 31145; and (i) the biosynthetic locus for a lipopeptide natural product from a Streptomyces sp. organism.
- 5. Two or more isolated polynucleotides, wherein the first polynucleotide is a polynucleotide of claim 1, and the second polynucleotide encodes a polypeptide selected from the group consisting of:
(j) a polypeptide having at least 55% sequence identity to SEQ ID NO: 3, and (k) a polypeptide having at least 50% sequence identity to SEQ ID NO: 4.
- 6. An isolated polynucleotide comprising a sequence selected from the group consisting of:
(a) a sequence selected from the group consisting of SEQ ID NOs. 23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 43, 45 and 47; (b) a sequence that is complementary to (a); (c) a sequence which hybridizes to said sequence of (a) or (b) under conditions of high stringency; and (d) a sequence which has at least 70% or higher homology to said sequence of (a), (b), or (c).
- 7. The isolated polynucleotide of claim 6, wherein said isolated polynucleotide resides in a biosynthetic locus selected from the group consisting of:
(a) the biosynthetic locus for ramoplanin from Actinoplanes sp. ATCC 33076; (b) the biosynthetic locus for A21978C from Streptomyces roseosporus NRRL 11379; (c) the biosynthetic locus for A54145 from Streptomyces fradiae ATCC 18158; (d) the biosynthetic locus for a lipopeptide natural product from Streptomyces ghanaensis NRRL B-12104; (e) the biosynthetic locus for a lipopeptide natural product from Streptomyces refuineus NRRL 3143; (f) the biosynthetic locus for a lipopeptide natural product from Streptomyces aizunensis NRRL B-11277; (g) the biosynthetic locus for a lipopeptide natural product from Actinoplanes nipponensis FD 24834 ATCC 31145; and (h) the biosynthetic locus for a lipopeptide natural product from a Streptomyces sp. organism.
- 8. An isolated acyl-specific C-domain, encoded by a polynucleotide which comprises a sequence selected from the group consisting of:
(a) a sequence selected from the group consisting of SEQ ID NOs. 5, 7, 9, 11, 13, 15, 17 and 19; (b) a sequence that is complementary to (a); (c) a sequence which hybridizes to said sequence of (a) or (b) under conditions of high stringency; and (d) a sequence which has at least 70% or higher homology to said sequence of (a), (b), or (c).
- 9. An isolated acyl-specific C-domain comprising at least 45% sequence homology to at least one sequence selected from SEQ ID NO. 1 and SEQ ID NO. 2.
- 10. An isolated acyl-specific C-domain comprising a polypeptide sequence selected from the group consisting of:
(a) a sequence selected from the group consisting of SEQ ID NOs. 6, 8, 10, 12, 14, 16, 18, 20 and 22; and (b) a sequence which has at least 70% or higher homology to said sequence of (a).
- 11. Two or more isolated polypeptides, wherein the first isolated polypeptide is an acyl-specific C-domain according to claim 9; and the second isolated polypeptide is selected from the group consisting of:
(a) a polypeptide having at least 55% identity to SEQ ID NO. 3 and (b) a polypeptide having at least 50% identity to SEQ ID NO. 4.
- 12. An isolated polypeptide comprising a polypeptide selected from the group consisting of:
(a) SEQ ID NOs. 24, 26, 28, 30, 32, 34, 36, 38, 40, 42, 44, 46 and 48; and (b) a sequence which has at least 70% or higher homology to said sequence of (a).
- 13. An N-acyl-capping cassette comprising at least one acyl-specific C-domain polypeptide and another polypeptide selected from the group consisting of an adenylating protein and an acyl-carrier protein.
- 14. A computer readable medium, comprising:
(a) a computer program stored on said media containing instructions sufficient to implement a process for effecting the identification, analysis, or modeling of a representation of a polynucleotide or polypeptide sequence; (b) data stored on said media representing a sequence of a polynucleotide selected from the group consisting of:
i) a polynucleotide encoding an acyl-specific C-domain, said polynucleotide encoding a polypeptide having at least 45% sequence identity with either SEQ ID NO: 1 or SEQ ID NO: 2; ii) a polynucleotide encoding a polypeptide having at least 55% sequence identity with SEQ ID NO: 3; and iii) a polynucleotide encoding a polypeptide having at least 50% sequence identity with SEQ ID NO: 4; and (c) a data structure reflecting the underlying organization and structure of said data to facilitate said computer program access to data elements corresponding to logical sub-components of the sequence, said data structure being inherent in said program and in the way in which said computer program organizes and accesses said data.
- 15. A computer readable medium, comprising:
(a) a computer program stored on said media containing instructions sufficient to implement a process for effecting the identification, analysis, or modeling of a representation of a polypeptide sequence; (b) data stored on said media representing a sequence of a polypeptide selected from the group consisting of:
i) a polypeptide representing an acyl-specific C-domain and having at least 45% sequence identity with either SEQ ID NO: 1 or SEQ ID NO: 2; ii) a polypeptide having at least 55% sequence identity with SEQ ID NO: 3; and iii) a polypeptide having at least 50% sequence identity with SEQ ID NO: 4; and (c) a data structure reflecting the underlying organization and structure of said data to facilitate said computer program access to data elements corresponding to logical sub-components of the sequence, said data structure being inherent in said program and in the way in which said computer program organizes and accesses said data.
- 16. A memory for storing data that can be accessed by a computer programmed to implement a process for effecting the identification, analysis, or modeling of a sequence of a polynucleotide or a polypeptide, said memory comprising data representing a polynucleotide selected from the group consisting of:
(a) a polynucleotide encoding an acyl-specific C-domain, said polynucleotide encoding a polypeptide having at least 45% sequence identity with either SEQ ID NO: 1 or SEQ ID NO: 2; (b) a polynucleotide encoding a polypeptide having at least 55% sequence identity with SEQ ID NO: 3; and (c) a polynucleotide encoding a polypeptide having at least 50% sequence identity with SEQ ID NO: 4.
- 17. A memory for storing data that can be accessed by a computer programmed to implement a process for effecting the identification, analysis, or modeling of a sequence of a polypeptide, said memory comprising data representing a polypeptide selected from the group consisting of:
(a) a polypeptide having at least 45% sequence identity with either SEQ ID NO: 1 or SEQ ID NO: 2; (b) a polypeptide having at least 55% sequence identity with SEQ ID NO: 3; and (c) a polypeptide having at least 50% sequence identity with SEQ ID NO: 4.
- 18. A method for detecting a polypeptide involved in lipopeptide biosynthesis or a polynucleotide encoding such a polypeptide comprising the step of identifying:
(a) a polypeptide having at least 45% sequence identity to SEQ ID NO:1 or SEQ ID NO:2, or (b) a polynucleotide encoding a polypeptide having at least 45% sequence identity to SEQ ID NO:1 or SEQ ID NO:2, and wherein said at least 45% sequence identity indicates a polypeptide involved in lipopeptide biosynthesis.
- 19. A method according to claim 18 wherein the identifying step comprises the steps of:
(a) providing a reference polynucleotide or polypeptide sequence selected from the group consisting of polynucleotide or polypeptide sequences representing an acyl-specific domain; (b) comparing said reference sequence to one or more candidate polynucleotide or polypeptide sequences stored on a computer readable medium; (c) determining the level of homology between said reference sequence and said one or more candidate sequences; and (d) identifying a candidate sequence which shares at least 70% homology with said reference sequence.
- 20. The method of claim 19, wherein said reference sequence is a polypeptide of SEQ ID NOS. 6, 8, 10, 12, 14, 16, 18, 20, 22 or a polynucleotide encoding a polypeptide of SEQ ID NOS. 6, 8, 10, 12, 14, 16, 18, 20 or 22.
- 21. The method of claim 19 further comprising determining structural motifs common to said candidate sequence and said reference sequence.
- 22. The method of claim 18 further comprising the step of identifying, in proximity to the polypeptide of a) or the polynucleotide of b), at least
c) one polypeptide having at least 55% sequence identity to SEQ ID NO: 3 or one polynucleotide sequence encoding a polypeptide having at least 55% sequence identity to SEQ ID NO: 3; or d) one polypeptide having at least 50% sequence identity to SEQ ID NO: 4 or one polynucleotide sequence encoding a polypeptide having at least 50% sequence identity to SEQ ID NO: 4.
- 23. The method according to claim 22 wherein
(a) the polypeptide of c) or d) is a polypeptide of SEQ ID NO: 24, 26, 28, 30, 32, 34, 36, 38 or 40, or a polypeptide having at least 70% sequence identity to a polypeptide of SEQ ID NO: 24, 26, 28, 30, 32, 34, 36, 38 or 40; or (b) the nucleotide of c) or d) is a nucleotide encoding a polypeptide of SEQ ID NO: 24, 26, 28, 30, 32, 34, 36, 38 or 40 or a nucleotide encoding a polypeptide having at least 70% sequence identity to a polypeptide of SEQ ID NO: 24, 26, 28, 30, 32, 34, 36, 38 or 40.
- 24. A computer system comprising:
(a) a database of reference sequences, wherein the reference sequences encode proteins involved in lipid biosynthesis, and wherein the reference sequences include one or more of:
(i) a polypeptide sequence representing an acyl-specific C-domain or a polynucleotide encoding an acyl-specific C-domain; and (b) a user interface capable of:
(i) receiving a test sequence for comparing against each of the reference sequences in the database; and (ii) displaying the results of the comparison.
- 25. A computer system of claim 24 wherein the reference sequences further include one or more of:
(iv) a polypeptide sequence representing an adenylating enzyme or a polynucleotide encoding an adenylating enzyme; and (v) a polypeptide sequence representing an acyl carrier protein or a polynucleotide encoding an acyl carrier protein.
- 26. A computer system of claim 25 wherein
(a) the reference sequence of (i) is selected from SEQ ID NOS: 1, 2, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21 and 22; (b) the reference sequence of (iv) is selected from SEQ ID NOS: 3, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33 and 34; and (c) the reference sequence of (v) is selected from SEQ ID NO: 4, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47 and 48.
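The comparison steps recited in claim 19 (provide a reference sequence, compare it to stored candidates, determine the level of homology, and keep candidates at or above a threshold) can be illustrated with a minimal sketch. The claims do not specify an alignment algorithm or scoring scheme, so everything here is an assumption made for illustration: the function names `global_align`, `percent_identity` and `screen_candidates` are hypothetical, the alignment is a plain Needleman-Wunsch global alignment with toy scores (match +1, mismatch -1, gap -1), identity is counted as matching positions over alignment length, and the sequences are made-up strings rather than any of the SEQ ID NOs. A practical screen would more likely rely on an established tool with a substitution matrix.

```python
from typing import Dict, List, Tuple


def global_align(a: str, b: str, match: int = 1,
                 mismatch: int = -1, gap: int = -1) -> Tuple[str, str]:
    """Needleman-Wunsch global alignment with a toy scoring scheme (assumed)."""
    n, m = len(a), len(b)
    score = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        score[i][0] = i * gap
    for j in range(1, m + 1):
        score[0][j] = j * gap
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            sub = match if a[i - 1] == b[j - 1] else mismatch
            score[i][j] = max(score[i - 1][j - 1] + sub,
                              score[i - 1][j] + gap,
                              score[i][j - 1] + gap)

    # Trace back from the bottom-right corner to recover one optimal alignment.
    out_a: List[str] = []
    out_b: List[str] = []
    i, j = n, m
    while i > 0 or j > 0:
        if i > 0 and j > 0:
            sub = match if a[i - 1] == b[j - 1] else mismatch
            if score[i][j] == score[i - 1][j - 1] + sub:
                out_a.append(a[i - 1])
                out_b.append(b[j - 1])
                i -= 1
                j -= 1
                continue
        if i > 0 and score[i][j] == score[i - 1][j] + gap:
            out_a.append(a[i - 1])
            out_b.append("-")
            i -= 1
        else:
            out_a.append("-")
            out_b.append(b[j - 1])
            j -= 1
    return "".join(reversed(out_a)), "".join(reversed(out_b))


def percent_identity(a: str, b: str) -> float:
    """Identical aligned positions divided by alignment length, as a percentage."""
    aligned_a, aligned_b = global_align(a, b)
    if not aligned_a:
        return 0.0
    matches = sum(1 for x, y in zip(aligned_a, aligned_b) if x == y and x != "-")
    return 100.0 * matches / len(aligned_a)


def screen_candidates(reference: str, candidates: Dict[str, str],
                      threshold: float = 70.0) -> List[str]:
    """Return names of candidates whose identity to the reference meets the threshold."""
    return [name for name, seq in candidates.items()
            if percent_identity(reference, seq) >= threshold]


if __name__ == "__main__":
    # Toy sequences for illustration only; not taken from the sequence listing.
    reference = "ACDEFGHIKL"
    candidates = {"same": "ACDEFGHIKL", "near": "ACDEFGHIKV", "far": "WWWWWWWWWW"}
    for name, seq in candidates.items():
        print(name, percent_identity(reference, seq))
    print(screen_candidates(reference, candidates, threshold=70.0))
```

The `threshold` parameter can be set to 45, 50, 55 or 70 to mirror the different identity levels recited in the claims. Note that "identity" and "homology" are computed the same way in this sketch, which is a simplification.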
RELATED APPLICATIONS
[0001] This application claims the benefit of U.S. Provisional Application No. 60/342,133, filed on Dec. 26, 2001, and U.S. Provisional Application No. 60/372,789, filed on Apr. 17, 2002. The present application is a continuation-in-part of U.S. application Ser. No. 09/976,059, filed Oct. 15, 2001, and of U.S. application Ser. No. 10/232,370, filed Sep. 3, 2002, which is a continuation-in-part of U.S. application Ser. No. 09/910,813, filed Jul. 24, 2001. The teachings of the above applications are incorporated herein by reference in their entirety.
Provisional Applications (2)

| Number | Date | Country |
|---|---|---|
| 60342133 | Dec 2001 | US |
| 60372789 | Apr 2002 | US |

Continuation in Parts (3)

| Relation | Number | Date | Country |
|---|---|---|---|
| Parent | 09976059 | Oct 2001 | US |
| Child | 10329027 | Dec 2002 | US |
| Parent | 10232370 | Sep 2002 | US |
| Child | 10329027 | Dec 2002 | US |
| Parent | 09910813 | Jul 2001 | US |
| Child | 10329027 | Dec 2002 | US |