This invention relates to epoxide hydrolases and biocatalytic reactions using said epoxide hydrolases to produce optically active epoxides and vicinal diols.
Optically active epoxides and vicinal diols are versatile fine chemical intermediates for use in the production of pharmaceuticals, agrochemicals, ferro-electric liquid crystals and flavours and fragrances. Epoxides are highly reactive electrophiles because of the strain inherent in the three-membered ring and the electronegativity of the oxygen. Epoxides react readily with various O-, N-, S-, and C-nucleophiles, acids, bases, reducing and oxidizing agents, allowing access to bifunctional molecules. Vicinal diols, employed as their highly reactive cyclic sulfites and sulfates, act like epoxide-like synthons with a broad range of nucleophiles. The possibility of double nucleophilic displacement reactions with amidines and azide, allow access to dihydroimidazole derivatives, aziridines, diamines and diazides. Since enantiopure epoxides and vicinal diols can be stereospecifically interconverted, they can be regarded as synthetic equivalents. Glycidyl ethers are epoxides of general formula (I).
Optically active glycidyl ethers and their corresponding O1-substituted glycerols are biologically active compounds and useful synthons in the production of biologically active compounds. For example, guaifenesin (expectorant), mephenesin (muscle relaxant) and chlorphenesin (antifungal) are aryloxy diols in which the desired biological activity resides in the (S)-enantiomers, (S)-Aryl glycidyl ethers are useful synthons for β-adrenergic receptor blocking agents (β-blockers).
Epoxide hydrolases (EC 3.3.2.3) are hydrolytic enzymes that convert epoxides to vicinal diols by ring-opening of the epoxide with water. Epoxide hydrolases are present in mammals, plants, insects and microorganisms.
The invention is based in part on the surprising discovery by the inventors that certain microorganisms express epoxide hydrolases which act on glycidyl ether substrates with high enantioselectivity. These microorganisms and the associated enantioselective glycidyl ether hydrolase (YEGH) polypeptides of the invention selectively hydrolyse specific enantiomers of a range of different glycidyl ethers (GE). Genomes of the microorganisms therefore encode polypeptides having highly enantioselective glycidyl ether hydrolase activity.
More specifically, the invention provides a process for obtaining an optically active glycidyl ether and/or an optically active vicinal diol, which process includes the steps of: providing an enantiomeric mixture of a glycidyl ether (GE); creating a reaction mixture by adding to the enantiomeric mixture a polypeptide, or a functional fragment thereof, having enantioselective glycidyl ether hydrolase (YEGH) activity, the polypeptide being a polypeptide encoded by a gene of a yeast cell or a gene derived from a yeast cell; incubating the reaction mixture; and recovering from the reaction mixture: at least one of an enantiopure, or a substantially enantiopure vicinal diol (GD), and an enantiopure, or a substantially enantiopure, glycidyl ether (GE).
According to another aspect of the invention there is provided a process for obtaining an optically active glycidyl ether and/or an optically active vicinal diol, which process includes the steps of: providing an enantiomeric mixture of a glycidyl ether (GE); creating a reaction mixture by adding to the enantiomeric mixture a cell comprising a nucleic acid encoding, and capable of expressing, a polypeptide having enantioselective glycidyl ether hydrolase (YEGH) activity, the polypeptide being a polypeptide encoded by a gene of a yeast cell; incubating the reaction mixture; and recovering from the reaction mixture: at least one of an enantiopure, or a substantially enantiopure, vicinal diol (GD), and an enantiopure, or a substantially enantiopure, glycidyl ether (GE).
In both of the above processes, the incubation may result in the selective production of a GD having the chirality of the enantiomer for which the epoxide hydrolase has selective activity and/or the selective enrichment, relative to the total amount of both enantiomers of the GE in the mixture, of the GE enantiomers for which the epoxide hydrolase does not have selective activity.
The following embodiments apply to both of the above processes. The cell can be a yeast cell. The polypeptide can be encoded by an endogenous gene of the cell or the cell can be a recombinant cell, the polypeptide being encoded by a nucleic acid sequence with which the cell is transformed. The nucleic acid sequence can be an exogenous nucleic acid sequence, a heterologous nucleic acid sequence, or a homologous nucleic acid sequence. The polypeptide can be a full-length yeast epoxide hydrolase or a functional fragment of a full length yeast epoxide hydrolase.
Moreover both processes can be carried out at a pH from 5 to 10. They can be carried out at a temperature of 0° C. to 70° C. In the processes, the concentration of the glycidyl ether can be at least equal to the solubility of the GE in water.
In both processes, the glycidyl ether (GE) is a compound of the general formula (I) and the vicinal diol (GD) produced by the process is a compound of the general formula (II),
R represents a variably substituted straight-chain or branched alkyl group, a variably substituted straight-chain or branched alkenyl group, a variably substituted straight-chain or branched alkynyl group, a variably substituted cycloalkyl group as well as cycloalkenyl groups, a variably substituted aryl group, a variably substituted aryl alkyl group, a variably substituted heterocyclic group, a variably substituted alkylthio group, a variably substituted alkoxycarbonyl group, a variably substituted straight chain or branched alkylamino or alkenyl amino group, a variably substituted arylamino or arylalkylamino group, a variably substituted carbamoyl group, or a variably substituted acyl group.
R can also take the form of R′—X, where X is a functional group bonded to any C of R′ except C1.
—OR as a whole can also be replaced by a functional group.
The alkyl group may be a straight chain or branched alkyl group with 1 to 12 carbon atoms but preferably the alkyl group is as straight chain or branched alkyl group with 1 to 8 carbons.
The alkenyl group may be a straight chain or branched alkenyl group having 2-12 carbon atoms but preferably the alkenyl group is a straight chain or branched alkenyl group with 2 to 8 carbons.
The alkynyl group may be a straight chain or branched alkynyl group having 2-12 carbon atoms but preferably the alkynyl group is a straight chain or branched alkenyl group with 2 to 8 carbons.
The cycloalkyl group may include cycloalkyl groups with 3 to 10 carbon atoms. Examples include the cyclopropyl-, cyclobutyl-, cyclopentyl-, cyclohexyl-, cycloheptyl- and cyclooctyl-groups that may be variably substituted at any position(s) around the ring. Preferably the cycloalkyl group is a cycloalkyl group with 5 to 7 carbon atoms.
The cycloalkenyl group may include cycloalkenyl groups with 3 to 10 carbon atoms. Examples include cyclobutenyl-, cyclopentenyl-, cyclohexenyl-, cycloheptenyl- and cyclooctenyl-groups that may be variably substituted at any position(s) around the ring. Preferably the cycloalkenyl group is a cycloalkenyl group with 5 to 7 carbon atoms.
The aryl group may include phenyl, biphenyl, naphtyl, anthracenyl groups and the like. Preferably the aryl group is a phenyl group. The aryl alkyl group may include a group with 7 to 18 carbons, but preferably the aryl alkyl group is an aryl alkyl group with 7 to 12 carbon atoms.
The heterocyclic group may include 5- to 7-membered heterocyclic groups containing nitrogen, oxygen or sulfur. The heterocyclic ring may be fused with a cyclic or aromatic ring having 3 to 7 carbon atoms such as a benzene, cyclopropyl, cyclobutane, cyclopentane and cyclohexane ring systems. A ring with 5 or 6 carbon atoms is preferred.
The alkylamino group may include a straight chain or branched alkylamino group having 2-12 carbon atoms such as methylamino, ethylamino, propylamino, isopropylamino, butylamino, isobutylamino, tert-butylamino, pentylamino, hexylamino, heptylamino or octylamino.
The alkenyl amino group may include a straight chain or branched alkenylamino group having 2-12 carbon atoms but preferably the alkenyl amino group is a straight chain or branched alkenylamino group with 2 to 8 carbons.
The arylamino group may include arylamino groups such as a phenylamino or naphtylamino group which may be optionally substituted with an alkyl or alkenyl or alkoxy group having 1 to 4 carbon atoms, and also halogens, The arylalkylamino group may include benzylamino and 2-phenylethylamino.
The alkylthio group may include alkylthio groups having 1 to 8 carbon atoms such as methylthio, ethylthio, propylthio, butylthio, isobutylthio, pentylthio.
The alkenylthio group may include a straight chain or branched alkenylthio group having 1 to 8 carbon atoms such as ethynylthio-, 1-propynylthio-, 2-propynylthio-, 1-butynylthio-, 2-butynylthio-, 3-butynylthio-, 1-pentynylathio-, 2-pentynylthio-, 3-pentynylthio-, 4-pentynylthio-, 1-hexynylthio-, 2-hexynylthio-, 3-hexynylthio-, 4-hexynylthio-, 5-hexynylthio- and the like.
The arylthio group may include alkenylthio groups having 1 to 8 carbon atoms such as a phenylthio or naphtylthio group which may be optionally substituted with an alkyl or alkenyl or alkoxy group having 1 to 4 carbon atoms, and also halogens, e.g. phenylthio, 2-methylphenylthio, 3-methylphenylthio, 4-methylphenylthio, 2-allylphenylthio, 2-chlorophenylthio, 3-chlorophenylamini, 4-chlorophenylthio, 4-methoxyphenylthio, 2-allyloxyphenylthio, naphtylthio and the like.
The arylalkylthio group may include alkenylthio groups having 1 to 8 carbon atoms such as the benzylthio-group and 2-phenylethylthio-group.
The alkoxycarbonyl group may include methoxycarbonyl, ethoxycarbonyl and the like.
The substituted or unsubstituted carbamoyl group may include carbamoyl, methylcarbamoyl, dimethylcarbamoyl, diethylcarbamoyl and the like.
The acyl group may include acyl groups with 1 to 8 carbon atoms such as formyl, acetyl, propionyl or benzoyl groups and others.
The alkyl, alkenyl, alkynyl, cycloalkyl, aryl, aryl alkyl, heterocyclic, alkylamino, alkenylamino, arylamino, arylalkylamino, alkylthio, alkenylthio, arylthio, arylalkylthio, alkoxycarbonyl, substituted and unsubstituted carbamoyl and acyl groups mentioned above may optionally be substituted. Examples of such substituents include halogens (F, Cl, Br, I), hydroxyl groups, mercapto groups, carboxylates, nitro groups, cyano groups, substituted or unsubstituted amino groups (including amino, methylamino, dimethylamino, ethylamino, diethylamino, and various protected amines such as tert-butoxycarbonyl- and arylsulfonamido groups), alkoxy groups (having 1 to 8 carbon atoms such as methoxy, ethoxy, propyloxy, isopropyloxy, butyloxy, isobutyloxy, tert-butyloxy, pentyloxy, hexyloxy, heptyloxy or octyloxy), alkenyloxy groups (having 2 to 8 carbon atoms such as a vinyloxy, allyloxy, 3-butenyloxy or 5-hexenyloxy), aryloxy groups (such as a phenoxy or naphtyloxy group which may be optionally substituted with an alkyl or alkenyl or alkoxy group having 1 to 4 carbon atoms, and also halogens, e.g. phenoxy, 2-methylphenoxy, 3-methylphenoxy, 4-methylphenoxy, 2-allylphenoxy, 2-chlorophenoxy, 3-chlorophenoxy, 4-chlorophenoxy, 4-methoxyphenoxy, 2-allyloxyphenoxy, naphtyloxy and the like), aryl alkyloxy groups (e.g. benzyloxy and 2-phenylethyloxy), alkylthio groups (having 1 to 8 carbon atoms such as methylthio, ethylthio, propylthio, butylthio, isobutylthio, pentylthio), alkoxycarbonyl groups (e.g. methoxycarbonyl, ethoxycarbonyl and the like), substituted or unsubstituted carbamoyl group (e.g. carbamoyl, methylcarbamoyl, dimethylcarbamoyl, diethylcarbamoyl and the like), acyl groups (with 1 to 8 carbon atoms such as formyl, acetyl, propionyl or benzoyl groups) and others.
The above-mentioned cycloalkyl, cycloalkenyl, aryl, aryl alkyl, heterocyclic, alkoxy, alkenyloxy, aryloxy, aryl alkyloxy, alkylthio, and alkoxycarbonyl groups may also be substituted with alkyl groups having 1 to 5 carbon atoms, alkenyl groups with 2 to 5 carbon atoms, or haloalkyl groups with 1 to 5 carbon atoms in addition to the substituents specified above.
The number of substituents may be one or more than one.
The substituents may be the same or different.
R can also take the form of R′—X, where X is a functional group bonded to any carbon of R′ except C1. The functional group may be for example a halogen (F, Cl, Br, I), hydroxyl group, mercapto group, carboxylate, nitro group, cyano group, substituted or unsubstituted amino group (including amino, methylamino, dimethylamino, ethylamino, diethylamino), and various protected amines such as a tert-butoxycarbonyl- or a arylsulfonamido group
—OR as a whole can also be replaced by a functional group such as a halogen (F, Cl, Br, I), hydroxyl group, mercapto group, carboxylate, nitro group, cyano group, substituted or unsubstituted amino group (including amino, methylamino, dimethylamino, ethylamino, diethylamino), and various protected amines such as a tert-butoxycarbonyl- or a arylsulfonamido group.
Moreover, in the processes, the enantiomeric mixture can be a racemic mixture or a mixture of any ratio of amounts of the enantiomers. The processes can include adding to the reaction mixture water and at least one water-immiscible solvent, including, for example, toluene, 1,1,2-trichlorotrifluoroethane, methyl tert-butyl ether, methyl isobutyl ketone, dibutyl-o-phtalate, aliphatic alcohols containing 6 to 9 carbon atoms or aliphatic hydrocarbons containing 6 to 16 carbon atoms.
Alternatively, or in addition, the processes can include adding to the reaction mixture water and at least one water-miscible organic solvent, for example, acetone, methanol, ethanol, propanol, isopropanol, acetonitrile, dimethylsulfoxide, N,N-dimethylformamide, or N-methylpyrrolidine. In addition, or alternatively, one or more surfactants, one or more cyclodextrins, or one or more phase-transfer catalysts can be added to the reaction mixtures. Both processes can include stopping the reaction when one enantiomer of a GE and/or associated GD is in excess compared to the other enantiomer of the GE and/or GD. Furthermore, the processes can include directly recovering continuously from the reaction mixture during the reaction an optically active GE and/or associated optically active GD produced by the reaction.
In both processes the yeast cell can be of one of the following exemplary genera: Arxula, Brettanomyces, Bullera, Bulleromyces, Candida, Cryptococcus, Debaryomyces, Dekkera, Exophiala, Geotrichum, Hormonema, Issatchenkia, Kluyveromyces, Lipomyces, Mastigomyces, Myxozyma, Pichia, Rhodosporidium, Rhodotorula, Sporidiobolus, Sporobolomyces, Trichosporon, Wingea, and Yarrowia.
Moreover, in the processes, the yeast cell can be of one of the following exemplary species: Arxula adeninivorans, Arxula terrestris, Brettanomyces bruxellensis, Brettanomyces naardenensis, Brettanomyces anomalus, Brettanomyces species (e.g. Unidentified species NCYC 3151), Bullera dendrophila, Bulleromyces albus, Candida albicans, Candida fabianii, Candida glabrata, Candida haemulonii, Candida intermedia, Candida magnoliae, Candida parapsilosis, Candida rugosa, Candida tenuis, Candida tropicalis, Candida famata, Candida kruisei, Candida sp. (new) related to C. sorbophila, Cryptococcus albidus, Cryptococcus amylolentus, Cryptococcus bhutanensis, Cryptococcus curvatus, Cryptococcus gastricus, Cryptococcus humicola, Cryptococcus hungaricus, Cryptococcus laurentii, Cryptococcus luteolus, Cryptococcus macerans, Cryptococcus podzolicus, Cryptococcus terreus, Debaryomyces hansenii, Dekkera anomala, Exophiala dermatitidis, Geotrichum spp. (e.g. Unidentified species UOFS Y-0111), Hormonema spp. (e.g. Unidentified species NCYC 3171), Issatchenkia occidentalis, Kluyveromyces marxianus, Lipomyces spp. (e.g. Unidentified species UOFS Y-2159), Lipomyces tetrasporus, Mastigomyces philipporii, Myxozyma melibiosi, Pichia anomala, Pichia finlandica, Pichia guillermondii, Pichia haplophila, Rhodosporidium lusitaniae, Rhodosporidium paludigenum, Rhodosporidium sphaerocarpum, Rhodosporidium toruloides, Rhodosporidium paludigenum, Rhodotorula araucariae, Rhodotorula glutinis, Rhodotorula minuta, Rhodotorula minuta var. minuta, Rhodotorula mucilaginosa, Rhodotorula philyla, Rhodotorula rubra, Rhodotorula species (e.g. Unidentified species NCYC 3193, UOFS Y-2042, UOFS Y-0448, UOFS Y-0139, UOFS Y-0560), Rhodotorula aurantiaca, Rhodotorula spp. (e.g. Unidentified species NCYC 3224), Rhodotorula sp. “mucilaginosa”, Sporidiobolus salmonicolor, Sporobolomyces holsaticus, Sporobolomyces roseus, Sporobolomyces tsugae, Trichosporon beigelii, Trichosporon cutaneum var. cutaneum, Trichosporon delbrueckii, Trichosporon jirovecii, Trichosporon mucoides, Trichosporon ovoides, Trichosporon pullulans, Trichosporon spp. (e.g. Unidentified species NCYC 3210, NCYC 3211, UOFS Y-0861, UOFS Y-1615, UOFS Y-0451, NCYC 3212, UOFS Y-0449, UOFS Y-2113), Trichosporon moniliiforme, Trichosporon montevideense, Wingea robertsiae, and Yarrowia lipolytica.
The yeast cell can also be of any of the other genera, species, or strains disclosed herein.
Another aspect of the invention is a method for producing a polypeptide, which process includes the steps of: providing a cell comprising a nucleic acid encoding and capable of expressing a polypeptide that has enantioselective glycidyl ether hydrolase (YEGH) activity; culturing the cell; and recovering the polypeptide from the culture. Recovering the polypeptide from the culture includes, for example, recovering it from the medium in which the cells were cultured or recovering it from the cell per se. The cell can be a yeast cell. The polypeptide can be encoded by an endogenous gene of the cell or the cell can be a recombinant cell, the polypeptide being encoded by a nucleic acid sequence with which the cell is transformed. The nucleic acid sequence can be an exogenous nucleic acid sequence, a heterologous nucleic acid sequence, or a homologous nucleic acid sequence. The polypeptide can be a full-length yeast epoxide hydrolase or a functional fragment of a full-length yeast epoxide hydrolase. The cell can be of any of the yeast genera, species, or strains disclosed herein or any recombinant cell disclosed herein.
The invention also features a crude or pure enzyme preparation which includes an isolated polypeptide having YEGH activity. The polypeptide can be one encoded by any of the yeast genera, species, or strains disclosed herein or one encoded by a recombinant cell.
In another aspect, the invention features a substantially pure culture of cells, a substantial number of which comprise a nucleic acid encoding, and are capable of expressing, a polypeptide having YEGH activity. The cells can be recombinant cells or cells of any of the yeast genera, species, or strains disclosed herein.
Another embodiment of the invention is an isolated cell, the cell comprising a nucleic acid encoding a polypeptide having YEGH activity, the cell being capable of expressing the polypeptide. The cell can be any of those disclosed herein.
The invention also features an isolated DNA that includes: (a) a nucleic acid sequence that encodes a polypeptide that has YEGH activity and that hybridizes under highly stringent conditions to the complement of a sequence that can be SEQ ID NO: 10, 11, 12, 13, 14, 15, 16, 17, or 18; or (b) the complement of the nucleic acid sequence. The nucleic acid sequence can encode a polypeptide that includes an amino acid sequence that can be SEQ ID NO: 1, 2, 3, 4, 5, 6, 7, 8, or 9. The nucleic acid sequence can be, for example, one of those with SEQ ID NOs: 10, 11, 12, 13, 14, 15, 16, 17, or 18.
Also provided by the invention is an isolated DNA that includes: (a) a nucleic acid sequence that is at least 55% identical to a sequence that can be SEQ ID NO: 10, 11, 12, 13, 14, 15, 16, 17, or 18; or (b) the complement of the nucleic acid sequence, the nucleic acid sequence encoding a polypeptide that has YEGH activity.
Another aspect of the invention is an isolated DNA that includes: (a) a nucleic acid sequence that encodes a polypeptide consisting of an amino acid sequence that is at least 55% identical to a sequence that can be SEQ ID NOs: 1, 2, 3, 4, 5, 6, 7, 8, or 9; or (b) the complement of the nucleic acid sequence, the polypeptide having YEGH activity. Also included are vectors (e.g., those in which the coding sequence is operably linked to a transcriptional regulatory element) containing any of the above DNAs and cells (e.g., eukaryotic or prokaryotic cells) containing such vectors.
Also provided by the invention is an isolated polypeptide encoded by any of the above DNAs. The polypeptide can include an amino acid sequence that is at least 55% identical to SEQ ID NOs: 1, 2, 3, 4, 5, 6, 7, 8, or 9, the polypeptide having YEGH activity. The polypeptide can also include: (a) a sequence that can be SEQ ID NO: 1, 2, 3, 4, 5, 6, 7, 8, or 9, or a functional fragment of the sequence; or (b) the sequence of (a), but with no more than five conservative substitutions, the polypeptide having YEGH activity.
In another embodiment the invention features an isolated antibody (e.g., a polyclonal or a monoclonal antibody) that binds to any of the above-described polypeptides.
The term “exogenous” as used herein with reference to nucleic acid and a particular host cell refers to any nucleic acid that does not occur in (and cannot be obtained from) that particular cell as found in nature. Thus, a non-naturally-occurring nucleic acid is considered to be exogenous to a host cell once introduced into the host cell. It is important to note that non-naturally-occurring nucleic acids can contain nucleic acid subsequences or fragments of nucleic acid sequences that are found in nature provided the nucleic acid as a whole does not exist in nature. For example, a nucleic acid molecule containing a genomic DNA sequence within an expression vector is non-naturally-occurring nucleic acid, and thus is exogenous to a host cell once introduced into the host cell, since that nucleic acid molecule as a whole (genomic DNA plus vector DNA) does not exist in nature. Thus, any vector, autonomously replicating plasmid, or virus (e.g., retrovirus, adenovirus, or herpes virus) that as a whole does not exist in nature is considered to be non-naturally-occurring nucleic acid. It follows that genomic DNA fragments produced by PCR or restriction endonuclease treatment as well as cDNAs are considered to be non-naturally-occurring nucleic acid since they exist as separate molecules not found in nature. It also follows that any nucleic acid containing a promoter sequence and polypeptide-encoding sequence (e.g., cDNA or genomic DNA) in an arrangement not found in nature is non-naturally-occurring nucleic acid. Nucleic acid that is naturally-occurring can be exogenous to a particular cell. For example, an entire chromosome isolated from a cell of yeast x is an exogenous nucleic acid with respect to a cell of yeast y once that chromosome is introduced into a cell of yeast y.
It will be clear from the above that “exogenous” nucleic acids can be “homologous” or “heterologous” nucleic acids. As used herein, “homologous” nucleic acids are those that are derived from a cell of the same species as the host cell and “heterologous” nucleic acids are those that are derived from a species other than that of the host cell. In contrast, the term “endogenous” as used herein with reference to nucleic acids or genes and a particular cell refers to any nucleic acid or gene that does occur in (and can be obtained from) that particular cell as found in nature.
The glycidyl ether used by the methods of the invention may be a compound of the general formula (I) and the vicinal diol produced by the process may be a compound of the general formula (II),
R represents a variably substituted straight-chain or branched alkyl group, a variably substituted straight-chain or branched alkenyl group, a variably substituted straight-chain or branched alkynyl group, a variably substituted cycloalkyl group as well as cycloalkenyl groups, a variably substituted aryl group, a variably substituted aryl alkyl group, a variably substituted heterocyclic group, a variably substituted alkylthio group, a variably substituted alkoxycarbonyl group, a variably substituted straight chain or branched alkylamino or alkenyl amino group, a variably substituted arylamino or arylalkylamino group, a variably substituted carbamoyl group, or a variably substituted acyl group.
R can also take the form of R′—X, where X is a functional group bonded to any C of R′ except C1.
—OR as a whole can also be replaced by a functional group.
The alkyl group may be a straight chain or branched alkyl group with 1 to 12 carbon atoms. Examples include the methyl-, ethyl-, propyl-, isopropyl-, butyl-, isobutyl-, s-butyl-, t-butyl-, pent-1-yl-, pent-2-yl-, pent-3-yl-, 2-methylbut-1-yl-, 3-methylbut-1-yl-, 2-methylbut-2-yl-, 3-methylbut-2-yl-, hex-1-yl-, hex-2-yl-, hex-3-yl-, 1-methylpent-1-yl-, 2-methylpent-1-yl-, 3-methylpent-1-yl-, 2-methylpent-2-yl-, 3-methylpent-2-yl-, 4-methylpent-2-yl-, 2-methylpent-3-yl-, 3-methylpent-3-yl-, 2-ethylbut-1-yl-, hept-1-yl-, hept-2-yl-, hept-3-yl-, hept-4-yl-, 1-methylhex-1-yl-, 2-methylhex-1-yl-, 3-methylhex-1-yl-, 4-methylhex-1-yl-, 5-methylhex-1-yl-, 2-methylhex-2-yl-, 3-methylhex-2-yl-, 4-methylhex-2-yl-, 5-methylhex-2-yl-, 2-methylhex-3-yl-, 3-methylhex-3-yl-, 4-methylhex-3-yl-, 5-methylhex-3-yl-, 2-methylhex-4-yl-, 1,1-dimethylpent-1-yl-, 1,2-dimethylpent-1-yl-, 1,3-dimethylpent-1-yl-, 1,4-dimethylpent-1-yl-, 2,2-dimethylpent-1-yl-, 2,3-dimethylpent-1-yl-, 2,4-dimethylpent-1-yl-, 2,5-dimethylpent-1-yl-, 3,3-dimethylpent-1-yl-, 3,4-dimethylpent-1-yl-, 3,5-dimethylpent-1-yl-, 4,4-dimethylpent-1-yl-, 4,5-dimethylpent-1-yl-, 5,5-dimethylpent-1-yl-, 2,2-dimethylpent-2-yl-, 2,3-dimethylpent-2-yl-, 2,4-dimethylpent-2-yl-, 3,3-dimethylpent-2-yl-, 3,4-dimethylpent-2-yl-, 2,2-dimethylpent-3-yl-, 2,3-dimethylpent-3-yl-, 2,4-dimethylpent-3-yl-, 2,2-dimethylpent-4-yl-, 2-ethylpent-1-yl-, 3-ethylpent-1-yl-, 1,1,2-trimethylbut-1-yl-, 1,2,2-trimethylbut-1-yl-, 1,2,3-trimethylbut-1-yl-, 2,2,3-trimethylbut-1-yl-, 2,3,3-trimethylbut-1-yl-, 2,3,3-but-2-yl-, 2-isopropylbut-1-yl-, 2-isopropylbut-2-yl-, oct-1-yl-, oct-2-yl-, oct-3-yl-, oct-4-yl-, 2-methylhept-1-yl-, 3-methylhept-1-yl-, 4-methylhept-1-yl-, 5-methylhept-1-yl-, 6-methylhept-1-yl-, 2-methylhept-2-yl-, 3-methylhept-2-yl-, 4-methylhept-2-yl-, 5-methylhept-2-yl-, 6-methylhept-2-yl-, 2-methylhept-3-yl-, 3-methylhept-3-yl-, 4-methylhept-3-yl-, 5-methylhept-3-yl-, 6-methylhept-3-yl-, 2-methylhept-4-yl-, 3-methylhept-4-yl-, 4-methylhept-4-yl-, 2,2-dimethylhex-1-yl-, 2,3-dimethylhex-1-yl-, 2,4-dimethylhex-1-yl-, 2,5-dimethylhex-1-yl-, 3,3-dimethylhex-1-yl-, 3,4-dimethylhex-1-yl-, 3,5-dimethylhex-1-yl-, 4,4-dimethylhex-1-yl-, 4,5-dimethylhex-1-yl-, 5,5-dimethylhex-1-yl-, 2,3-dimethylhex-2-yl-, 2,4-dimethylhex-2-yl-, 2,5-dimethylhex-2-yl-, 3,3-dimethylhex-2-yl-, 3,4-dimethylhex-2-yl-, 3,5-dimethylhex-2-yl-, 4,4-dimethylhex-2-yl-, 4,5-dimethylhex-2-yl-, 5,5-dimethylhex-2-yl-, 2,2-dimethylhex-3-yl-, 2,3-dimethylhex-3-yl-, 2,4-dimethylhex-3-yl-, 2,5-dimethylhex-3-yl-, 3,3-dimethylhex-3-yl-, 3,4-dimethylhex-3-yl-, 3,5-dimethylhex-3-yl-, 4,4-dimethylhex-3-yl-, 4,5-dimethylhex-3-yl-, 5,5-dimethylhex-3-yl-, 2,2,3-trimethylpent-1-yl-, 2,2,4-trimethylpent-1-yl-, 2,3,3-trimethylpent-1-yl-, 2,3,4-trimethylpent-1-yl-, 3,3,4-trimethylpent-1-yl-, 3,4,4-trimethylpent-1-yl-, 2,4,4-trimethylpent-1-yl-, 2,3,3-trimethylpent-2-yl-, 2,3,4-trimethylpent-2-yl-, 3,3,4-trimethylpent-2-yl-, 3,4,4-trimethylpent-2-yl-, 2,4,4-trimethylpent-2-yl-, 2,2,3-trimethylpent-3-yl-, 2-methyl-3-ethylpen-1-yl-, 3-ethyl-3-methylpent-1-yl-, 3-ethyl-4-methylpent-1-yl-, (3-methylhex-3-yl)methyl-, (4-methylhex-3-yl)methyl-, (5-methylhex-3-yl)methyl-, (2-methylhex-2-yl)methyl-, 2-methyl-3-ethylpent-2-yl-, 3-ethyl-3-methylpent-2-yl-, 3-ethyl-4-methylpent-2-yl-, 2-methyl-2-ethylpent-3-yl-, 2-methyl-3-ethylpent-3-yl-, 2,2,3,3-tetramethylbut-1-yl-, 2-ethyl-3,3-dimethylbut-2-ly, 2-isopropyl-3-methylbut-2-yl-, (3-ethyl pent-3-yl)methyl-, (2,3-dimethylpent-3-yl)methyl-, (2,4-dimethylpent-3-yl)methyl-, non-1-yl-, non-2-yl-, non-3-yl-, non-4-yl-, non-5-yl-, 2-methyloct-1-yl, 3-methyloct-1-yl-, 4-methyloct-1-yl-, 5-methyloct-1-yl-, 6-methyloct-1-yl-, 7-methyloct-1-yl-, 2-methyloct-2-yl, 3-methyloct-2-yl-, 4-methyloct-2-yl-, 5-methyloct-2-yl-, 6-methyloct-2-yl-, 7-methyloct-2-yl-, 2-methyloct-3-yl, 3-methyloct-3-yl-, 4-methyloct-3-yl-, 5-methyloct-3-yl-, 6-methyloct-3-yl-, 7-methyloct-3-yl-, 2-methyloct-4-yl, 3-methyloct-4-yl-, 4-methyloct-4-yl-, 5-methyloct-4-yl-, 6-methyloct-4-yl-, 7-methyloct-4-yl-, 2,2-dimethylhept-1-yl-, 2,3-dimethylhept-1-yl-, 2,4-dimethylhept-1-yl-, 2,5-dimethylhept-1-yl-, 2,6-dimethylhept-1-yl-, 3,3-dimethylhept-1-yl-, 3,4-dimethylhept-1-yl-, 3,5-dimethylhept-1-yl-, 3,6-dimethylhept-1-yl-, 4,4-dimethylhept-1-yl-, 4,5-dimethylhept-1-yl-, 4,6-dimethylhept-1-yl-, 5,5-dimethylhept-1-yl-, 5,6-dimethylhept-1-yl-, 6,6-dimethylhept-1-yl-, 2,3-dimethylhept-2-yl-, 2,4-dimethylhept-2-yl-, 2,5-dimethylhept-2-yl-, 2,6-dimethylhept-2-yl-, 3,3-dimethylhept-2-yl-, 3,4-dimethylhept-2-yl-, 3,5-dimethylhept-2-yl-, 3,6-dimethylhept-2-yl-, 4,4-dimethylhept-2-yl-, 4,5-dimethylhept-2-yl-, 4,6-dimethylhept-2-yl-, 5,5-dimethylhept-2-yl-, 5,6-dimethylhept-2-yl-, 6,6-dimethylhept-2-yl-, 2,2-dimethylhept-3-yl-, 2,3-dimethylhept-3-yl-, 2,4-dimethylhept-3-yl-, 2,5-dimethylhept-3-yl-, 2,6-dimethylhept-3-yl-, 3,4-dimethylhept-3-yl-, 3,5-dimethylhept-3-yl-, 3,6-dimethylhept-3-yl-, 4,4-dimethylhept-3-yl-, 4,5-dimethylhept-3-yl-, 4,6-dimethylhept-3-yl-, 5,5-dimethylhept-3-yl-, 5,6-dimethylhept-3-yl-, 6,6-dimethylhept-3-yl-, 3-ethylhept-1-yl-, 3-ethylhept-1-yl-, 4-ethylhept-1-yl-, 3-ethylhept-2-yl-, 4-ethyl hept-2-yl-, 5-ethyl hept-2-yl-, 3-ethyl hept-3-yl-, 4-ethyl hept-3-yl-, 5-ethylhept-3-yl-, 3-ethylhept-4-yl-, 4-ethylhept-4-yl-, 2,2,3-trimethylhex-1-yl-, 2,2,4-trimethylhex-1-yl-, 2,2,5-trimethylhex-1-yl-, 2,3,3-trimethylhex-1-yl-, 2,3,4-trimethylhex-1-yl-, 2,3,5-trimethylhex-1-yl-, 2,4,4-trimethylhex-1-yl-, 2,4,5-trimethylhex-1-yl-, 2,5,5-trimethylhex-1-yl-, 3,3,4-trimethylhex-1-yl-, 3,3,5-trimethylhex-1-yl-, 4,4,5-trimethylhex-1-yl-, 4,5,5-trimethylhex-1-yl-, 2,3,3-trimethylhex-2-yl-, 2,3,4-trimethylhex-2-yl-, 2,3,5-trimethylhex-2-yl-, 2,4,4-trimethylhex-2-yl-, 2,4,5-trimethylhex-2-yl-, 2,5,5-trimethylhex-2-yl-, 3,3,4-trimethylhex-2-yl-, 3,3,5-trimethylhex-2-yl-, 3,4,4-trimethylhex-2-yl-, 3,4,5-trimethylhex-2-yl-, 3,5,5-trimethylhex-2-yl-, 4,4,5-trimethylhex-2-yl-, 4,5,5-trimethylhex-2-yl-, 2,2,3-trimethylhex-3-yl-, 2,2,4-trimethylhex-3-yl-, 2,2,5-trimethylhex-3-yl-, 2,3,4-trimethylhex-3-yl-, 2,3,5-trimethylhex-3-yl-, 2,4,4-trimethylhex-3-yl-, 2,4,5-trimethylhex-3-yl-, 2,5,5-trimethylhex-3-yl, 4,4,5-trimethylhex-3-yl-, 4,5,5-trimethylhex-3-yl-, (2-methylhex-3-yl)methyl-, 3-ethyl-2-methylhex-1-yl-, 3-ethyl-3-methylhex-1-yl-, 3-ethyl-4-methylhex-1-yl-, 3-ethyl-5-methylhex-1-yl-, 4-ethyl-2-methylhex-1-yl-, 4-ethyl-3-methylhex-1-yl-, 4-ethyl-4-methylhex-1-yl-, 4-ethyl-5-methylhex-1-yl-, (2-methylhex-1-yl)methyl-, (3-methylhex-1-yl)methyl-, (4-methylhex-1-yl)methyl-, (5-methylhex-1-yl)methyl-, (6-methylhex-1-yl)methyl-, 3-isopropylhex-1-yl-, 4-ethyl-5-methylhex-1-yl-, 3-ethyl-3-methylhex-2-yl-, 3-ethyl-4-methylhex-2-yl-, 3-ethyl-5-methylhex-2-yl-, 4-ethyl-2-methylhex-2-yl-, 4-ethyl-3-methylhex-2-yl-, 4-ethyl-4-methylhex-2-yl-, 4-ethyl-5-methylhex-2-yl-, 3-isopropylhex-2-yl-, 4-ethyl-5-methylhex-2-yl-, 3-ethyl-2-methylhex-3-yl-, 3-ethyl-4-methylhex-3-yl-, 3-ethyl-5-methylhex-3-yl-, 4-ethyl-2-methylhex-3-yl-, 4-ethyl-3-methylhex-3-yl-, 4-ethyl-4-methylhex-3-yl-, 4-ethyl-5-methylhex-3-yl-, 4-isopropylhex-1-yl-, 2,2,3,3-tetramethylpent-1-yl-, 2,2,3,4-tetramethylpent-1-yl-, 2,2,4,4-tetramethylpent-1-yl-, 2,3,3,4-tetramethylpent-1-yl-, 2,3,4,4-tetramethylpent-1-yl-, 2,3,4,4-tetramethylpent-1-yl-, 3,3,4,4-tetramethylpent-1-yl-, 2,3,3,4-tetramethylpent-2-yl-, 2,3,4,4-tetramethylpent-2-yl-, 2,3,4,4-tetramethylpent-2-yl-, 3,3,4,4-tetramethylpent-2-yl-, 2,2,3,4-tetramethylpent-3-yl-, 2,2,4,4-tetramethylpent-3-yl-, 2,3,4,4-tetramethylpent-3-yl-, 2,3,4,4-tetramethylpent-3-yl-, (3-ethylhex-3-yl)methyl-, (4-ethylhex-3-yl)methyl-, (5-methylhept-3-yl)methyl-, 2,4-dimethyl-3-ethylpent-1-yl-, 3,4-dimethyl-3-ethylpent-1-yl-, 4,4-dimethyl-3-ethylpent-1-yl-, 2-ethyl-2-methylhex-1-yl-, 3-ethyl-2-methylhex-1-yl-, 4-ethyl-2-methylhex-1-yl-, 2-ethyl-3-methylhex-1-yl-, 2-ethyl-4-methylhex-1-yl-, 3-ethyl-3-methylhex-1-yl-, 3-ethyl-4-methylhex-1-yl-, 3-ethyl-5-methylhex-1-yl-, 4-ethyl-3-methylhex-1-yl-, 4-ethyl-4-methylhex-1-yl-, 4-ethyl-5-methylhex-1-yl-, and the like from dec-1-yl-, dec-2-yl-, dec-3-yl-, dec-4-yl-, dec-5-yl-, dec-6-yl-, undec-1-yl-, undec-2-yl-, undec-3-yl-, undec-4-yl-, undec-5-yl-, undec-6-yl-, undec-7-yl-, dodec-1-yl, dodec-2-yl, dodec-3-yl, dodec-4-yl, dodec-5-yl, dodec-6-yl groups.
Preferably the alkyl group is as straight chain or branched alkyl group with 1 to 8 carbons.
The alkenyl group may be a straight chain or branched alkenyl group having 2-12 carbon atoms. Examples include vinyl-, allyl-, α-methallyl-, β-methallyl-, 1-propenyl-, isopropenyl-, 1-butenyl-, 2-butenyl-, 3-butenyl, 1-buten-2-yl-, 1-buten-3-yl-, 1-methyl-1-propenyl-, 2-methyl-1-propenyl-, 1-pentenyl-, 2-pentenyl-, 3-pentenyl-, 4-pentenyl-, 1-penten-2-yl-, 1-penten-3-yl-, 2-methyl-1-butenyl-, 1-hexenyl-, 2-hexenyl-, 3-hexenyl-, 4-hexenyl-, 5-hexenyl-, 1-heptenyl-, 2-heptenyl-, 3-heptenyl-, 4-heptenyl-, 5-heptenyl-, 6-heptenyl-, 1-octenyl-, 2-octenyl-, 3-octenyl-, 4-octenyl-, 5-octenyl-, 6-octenyl-, 7-octenyl-, 1-nonenyl-, 2-nonenyl-, 3-nonenyl-, 4-nonenyl-, 5-nonenyl-, 6-nonenyl-, 7-nonenyl-, 8-nonenyl-, 1-decenyl-, 2-decenyl-, 3-decenyl-, 4-decenyl-, 5-decenyl-, 6-decenyl-, 7-decenyl-, 8-decenyl-, 9-decenyl-, 1-undecenyl, 2-undecenyl, 3-undecenyl, 4-undecenyl, 5-undecenyl, 6-undecenyl, 7-undecenyl, 8-undecenyl, 9-undecenyl, 10-undecenyl, 1-dodecenyl, 2-dodecenyl, 3-dodecenyl, 4-dodecenyl, 5-dodecenyl, 6-dodecenyl, 7-dodecenyl, 8-dodecenyl, 9-dodecenyl, 10-dodecenyl and 11-dodecenyl groups and related branched isomers.
Preferably the alkenyl group is a straight chain or branched alkenyl group with 2 to 8 carbons.
The alkynyl group may be a straight chain or branched alkynyl group having 2-12 carbon atoms. Examples include ethynyl-, 1-propynyl-, 2-propynyl-, 1-butynyl-, 2-butynyl-, 3-butynyl-, 1-pentynyl-, 2-pentynyl-, 3-pentynyl-, 4-pentynyl-, 1-hexynyl-, 2-hexynyl-, 3-hexynyl-, 4-hexynyl-, 5-hexynyl-, 1-heptynyl-, 2-heptynyl-, 3-heptynyl-, 4-heptynyl-, 5-heptynyl-, 6-heptynyl-, 1-octynyl-, 2-octynyl-, 3-octynyl-, 4-octynyl-, 5-octynyl-, 6-octynyl-, 7-octynyl-, 1-nonynyl-, 2-nonynyl-, 3-nonynyl-, 4-nonynyl-, 5-nonynyl-, 6-nonynyl-, 7-nonynyl-, 8-nonynyl-, 1-decynyl-, 2-decynyl-, 3-decynyl-, 4-decynyl-, 5-decynyl-, 6-decynyl-, 7-decynyl-, 8-decynyl-, 9-decynyl-, 1-undecynyl-, 2-undecynyl-, 3-undecynyl-, 4-undecynyl-, 5-undecynyl-, 6-undecynyl-, 7-undecynyl-, 8-undecynyl-, 9-undecynyl-, 10-undecynyl-, 1-dodecynyl-, 2-dodecynyl-, 3-dodecynyl-, 4-dodecynyl-, 5-dodecynyl-, 6-dodecynyl-, 7-dodecynyl-, 8-dodecynyl-, 9-dodecynyl-, 10-dodecynyl- and 11-dodecynyl-groups and related branched isomers.
Preferably the alkynyl group is a straight chain or branched alkenyl group with 2 to 8 carbons.
The cycloalkyl group may include cycloalkyl groups with 3 to 10 carbon atoms. Examples include the cyclopropyl-, cyclobutyl-, cyclopentyl-, cyclohexyl-, cycloheptyl- and cyclooctyl-groups that may be variably substituted at any position(s) around the ring.
Preferably the cycloalkyl group is a cycloalkyl group with 5 to 7 carbon atoms.
The cycloalkenyl group may include cycloalkenyl groups with 3 to 10 carbon atoms. Examples include cyclobutenyl-, cyclopentenyl-, cyclohexenyl-, cycloheptenyl- and cyclooctenyl-groups that may be variably substituted at any position(s) around the ring.
Preferably the cycloalkenyl group is a cycloalkenyl group with 5 to 7 carbon atoms.
The aryl group may include phenyl, biphenyl, naphtyl, anthracenyl groups and the like.
Preferably the aryl group is a phenyl group.
The aryl alkyl group may include a group with 7 to 18 carbons. Examples include benzyl-, 1-methylbenzyl-, 2-phenylethyl-, 3-phenylpropyl-, 4-phenylbutyl-, 5-phenylpentyl-, 6-phenylhexyl-, 1-naphtylmethyl, 2-(1-naphtyl)-ethyl groups and the like.
Preferably the aryl alkyl group is an aryl alkyl group with 7 to 12 carbon atoms.
The heterocyclic group may include 5- to 7-membered heterocyclic groups containing nitrogen, oxygen or sulfur. The heterocyclic ring may be fused with a cyclic or aromatic ring having 3 to 7 carbon atoms such as a benzene, cyclopropyl, cyclobutane, cyclopentane and cyclohexane ring systems. A ring with 5 or 6 carbon atoms is preferred. The heterocyclic ring may be selected from the group consisting of furyl-, dihydrofuranyl-, tetrahydrofuranyl-, dioxolanyl-, oxazolyl-, dihydrooxazolyl-, oxazolidinyl-, isoxazolyl-, dihydroisoxazolyl-, isoxazolidinyl-, oxathiolanyl-, thienyl-, tetrahydrothienyl-, dithiolanyl-, thiazolyl-, dihydrothiazolyl-, thiazolidinyl-, isothiazolyl-, dihydroisothiazolyl-, isothiazolidinyl-, pyrrolyl-, dihydropyrrolyl-, pyrrolidinyl-, pyrazolyl-, dihydropyrazolyl-, pyrazolidinyl-, imidazolyl-, dihydroimidazolyl-, imidazolidinyl-, triazolyl-, dihydrotriazolyl- triazolidinyl-, tetrazolyl-, dihydrotetrazolyl-, tetrazolidinyl-, pyridyl-, dihydropyridyl-, piperidinyl-, morpholinyl-, dioxanyl-, oxathianyl-, trioxanyl-, thiomorpholinyl-, pyridazinyl-, dihydropyridazinyl-, tetrahydropyridazinyl-, hexahydropyridazinyl-, pyrimidinyl-, dihydropyrimadinyl-, tetrahydropyrimadinyl-, hexahydropyrimadinyl-, pyrazinyl-, piperazinyl-, pyranyl-, dihydropyranyl-, tetrahydropyranyl-, thiopyranyl-, dihydrothiopyranyl-, tetrahydrothiopyranyl-, dithianyl-, purinyl-, pyrimidinyl-, pyrrolizinyl-, pyrrolizidinyl, indolyl-, dihydroindolyl-, isoindolyl-, indolizinyl-, indolizidinyl-, quinolyl-, dihydroquinolyl-, tetrahydroquinolyl-, isoquinolyl-, dihydroquinolyl-, tetrahydroquinolyl-, quinolizinyl-, quinolizidinyl-, phenanthrolinyl-, chromenyl-, chromanyl-, isochromenyl-, isochromanyl-, benzofuranyl-, carbazolyl-groups and the like.
The alkylamino group may include a straight chain or branched alkylamino group having 2-12 carbon atoms such as methylamino, ethylamino, propylamino, isopropylamino, butylamino, isobutylamino, tert-butylamino, pentylamino, hexylamino, heptylamino or octylamino.
The alkenyl amino group may include a straight chain or branched alkenylamino group having 2-12 carbon atoms such as ethynylamino-, 1-propynylamino-, 2-propynylamino-, 1-butynylamino-, 2-butynylamino-, 3-butynylamino-, 1-pentynylamino-, 2-pentynylamino-, 3-pentynylamino-, 4-pentynylamino-, 1-hexynylamino-, 2-hexynylamino-, 3-hexynylamino-, 4-hexynylamino-, 5-hexynylamino-, 1-heptynylamino-, 2-heptynylamino-, 3-heptynylamino-, 4-heptynylamino-, 5-heptynylamino-, 6-heptynylamino-, 1-octynylamino-, 2-octynylamino-, 3-octynylamino-, 4-octynylamino-, 5-octynylamino-, 6-octynylamino-, 7-octynylamino-, 1-nonynylamino-, 2-nonynylamino-, 3-nonynyl-amino, 4-nonynylamino-, 5-nonynylamino-, 6-nonynylamino-, 7-nonynylamino-, 8-nonynylamino-, 1-decynylamino-, 2-decynylamino-, 3-decynylamino-, 4-decynylamino-, 5-decynylamino-, 6-decynylamino-, 7-decynylamino-, 8-decynylamino-, 9-decynylamino-, 1-undecynylamino-, 2-undecynylamino-, 3-undecynylamino-, 4-undecynylamino-, 5-undecynylamino-, 6-undecynylamino-, 7-undecynylamino-, 8-undecynylamino-, 9-undecynylamino-, 10-undecynylamino-, 1-dodecynylamino-, 2-dodecynylamino-, 3-dodecynylamino-, 4-dodecynylamino-, 5-dodecynylamino-, 6-dodecynylamino-, 7-dodecynylamino-, 8-dodecynylamino-, 9-dodecynylamino-, 10-dodecynylamino- and 11-dodecynylamino-groups and related branched isomers.
Preferably the alkenyl amino group is a straight chain or branched alkenylamino group with 2 to 8 carbons.
The arylamino group may include arylamino groups such as a phenylamino or naphtylamino group which may be optionally substituted with an alkyl or alkenyl or alkoxy group having 1 to 4 carbon atoms, and also halogens, e.g. phenylamino, 2-methylphenylamino, 3-methylphenylamino, 4-methylphenylamino, 2-allylphenylamino, 2-chlorophenylamino, 3-chlorophenylamini, 4-chlorophenylamino, 4-methoxyphenylamino, 2-allyloxyphenylamino, naphtylamino and the like.
The arylalkylamino group may include benzylamino and 2-phenylethylamino.
The alkylthio group may include alkylthio groups having 1 to 8 carbon atoms such as methylthio, ethylthio, propylthio, butylthio, isobutylthio, pentylthio.
The alkenylthio group may include a straight chain or branched alkenylthio group having 1 to 8 carbon atoms such as ethynylthio-, 1-propynylthio-, 2-propynylthio-, 1-butynylthio-, 2-butynylthio-, 3-butynylthio-, 1-pentynylathio-, 2-pentynylthio-, 3-pentynylthio-, 4-pentynylthio-, 1-hexynylthio-, 2-hexynylthio-, 3-hexynylthio-, 4-hexynylthio-, 5-hexynylthio- and the like.
The arylrthio group may include alkenylthio groups having 1 to 8 carbon atoms such as a phenylthio or naphtylthio group which may be optionally substituted with an alkyl or alkenyl or alkoxy group having 1 to 4 carbon atoms, and also halogens, e.g. phenylthio, 2-methylphenylthio, 3-methylphenylthio, 4-methylphenylthio, 2-allylphenylthio, 2-chlorophenylthio, 3-chlorophenylamini, 4-chlorophenylthio, 4-methoxyphenylthio, 2-allyloxyphenylthio, naphtylthio and the like.
The arylalkylthio group may include alkenylthio groups having 1 to 8 carbon atoms such as the benzylthio-group and 2-phenylethylthio-group.
The alkoxycarbonyl group may include methoxycarbonyl, ethoxycarbonyl and the like,
The substituted or unsubstituted carbamoyl group may include carbamoyl, methylcarbamoyl, dimethylcarbamoyl, diethylcarbamoyl and the like.
The acyl group may include acyl groups with 1 to 8 carbon atoms such as formyl, acetyl, propionyl or benzoyl groups and others.
The alkyl, alkenyl, alkynyl, cycloalkyl, aryl, aryl alkyl, heterocyclic, alkylamino, alkenylamino, arylamino, arylalkylamino, alkylthio, alkenylthio, arylthio, arylalkylthio, alkoxycarbonyl, substituted and unsubstituted carbamoyl and acyl groups mentioned above may optionally be substituted. Examples of such substituents include halogens (F, Cl, Br, I), hydroxyl groups, mercapto groups, carboxylates, nitro groups, cyano groups, substituted or unsubstituted amino groups (including amino, methylamino, dimethylamino, ethylamino, diethylamino, and various protected amines such as tert-butoxycarbonyl- and arylsulfonamido groups), alkoxy groups (having 1 to 8 carbon atoms such as methoxy, ethoxy, propyloxy, isopropyloxy, butyloxy, isobutyloxy, tert-butyloxy, pentyloxy, hexyloxy, heptyloxy or octyloxy), alkenyloxy groups (having 2 to 8 carbon atoms such as a vinyloxy, allyloxy, 3-butenyloxy or 5-hexenyloxy), aryloxy groups (such as a phenoxy or naphtyloxy group which may be optionally substituted with an alkyl or alkenyl or alkoxy group having 1 to 4 carbon atoms, and also halogens, e.g. phenoxy, 2-methylphenoxy, 3-methylphenoxy, 4-methylphenoxy, 2-allylphenoxy, 2-chlorophenoxy, 3-chlorophenoxy, 4-chlorophenoxy, 4-methoxyphenoxy, 2-allyloxyphenoxy, naphtyloxy and the like), aryl alkyloxy groups (e.g. benzyloxy and 2-phenylethyloxy), alkylthio groups (having 1 to 8 carbon atoms such as methylthio, ethylthio, propylthio, butylthio, isobutylthio, pentylthio), alkoxycarbonyl groups (e.g. methoxycarbonyl, ethoxycarbonyl and the like), substituted or unsubstituted carbamoyl group (e.g. carbamoyl, methylcarbamoyl, dimethylcarbamoyl, diethylcarbamoyl and the like), acyl groups (with 1 to 8 carbon atoms such as formyl, acetyl, propionyl or benzoyl groups) and others.
The above-mentioned cycloalkyl, cycloalkenyl, aryl, aryl alkyl, heterocyclic, alkoxy, alkenyloxy, aryloxy, aryl alkyloxy, alkylthio, and alkoxycarbonyl groups may also be substituted with alkyl groups having 1 to 5 carbon atoms, alkenyl groups with 2 to 5 carbon atoms, or haloalkyl groups with 1 to 5 carbon atoms in addition to the substituents specified above.
The number of substituents may be one or more than one.
The substituents may be the same or different.
R can also take the form of R′—X, where X is a functional group bonded to any carbon of R′ except C1. The functional group may be for example a halogen (F, Cl, Br, I), hydroxyl group, mercapto group, carboxylate, nitro group, cyano group, substituted or unsubstituted amino group (including amino, methylamino, dimethylamino, ethylamino, diethylamino), and various protected amines such as a tert-butoxycarbonyl- or a arylsulfonamido group.
—OR as a whole can also be replaced by a functional group such as a halogen (F, Cl, Br, I), hydroxyl group, mercapto group, carboxylate, nitro group, cyano group, substituted or unsubstituted amino group (including amino, methylamino, dimethylamino, ethylamino, diethylamino), and various protected amines such as a tert-butoxycarbonyl- or a arylsulfonamido group.
“Polypeptide” and “protein” are used interchangeably and mean any peptide-linked chain of amino acids, regardless of length or post-translational modification. The invention also features yeast enantioselective glycidyl ether hydrolase (YEGH) polypeptides with conservative substitutions. Conservative substitutions typically include substitutions within the following groups: glycine and alanine; valine, isoleucine, and leucine; aspartic acid and glutamic acid; asparagine, glutamine, serine and threonine; lysine, histidine and arginine; and phenylalanine and tyrosine.
The term “isolated” polypeptide or peptide fragment, as used herein, refers to a polypeptide or a peptide fragment which either has no naturally-occurring counterpart or has been separated or purified from components which naturally accompany it, e.g., microorganism cellular components such as yeast cell cellular components. Typically, the polypeptide or peptide fragment is considered “isolated” when it is at least 70%, by dry weight, free from the proteins and other naturally-occurring organic molecules with which it is naturally associated. Preferably, a preparation of a polypeptide (or peptide fragment thereof) of the invention is at least 80%, more preferably at least 90%, and most preferably at least 99%, by dry weight, the polypeptide (or the peptide fragment thereof), respectively, of the invention. Thus, for example, a preparation of polypeptide x is at least 80%, more preferably at least 90%, and most preferably at least 99%, by dry weight, polypeptide x. Since a polypeptide that is chemically synthesized is, by its nature, separated from the components that naturally accompany it, the synthetic polypeptide is “isolated.”
An isolated polypeptide (or peptide fragment) of the invention can be obtained, for example, by: extraction from a natural source (e.g., from yeast cells); expression of a recombinant nucleic acid encoding the polypeptide; or chemical synthesis. A polypeptide that is produced in a cellular system different from the source from which it naturally originates is “isolated,” because it will necessarily be free of components which naturally accompany it. The degree of isolation or purity can be measured by any appropriate method, e.g., column chromatography, polyacrylamide gel electrophoresis, or HPLC analysis.
An “isolated DNA” is either (1) a DNA that contains sequence not identical to that of any naturally occurring sequence, or (2), in the context of a DNA with a naturally-occurring sequence (e.g., a cDNA or genomic DNA), a DNA free of at least one of the genes that flank the gene containing the DNA of interest in the genome of the organism in which the gene containing the DNA of interest naturally occurs. The term therefore includes a recombinant DNA incorporated into a vector, into an autonomously replicating plasmid or virus, or into the genomic DNA of a prokaryote or eukaryote. The term also includes a separate molecule such as: a cDNA (e.g., SEQ ID NOs: 10, 11, 12, 13, 14, 15, 16, 17, or 18) where the corresponding genomic DNA can include introns and therefore can have a different sequence; a genomic fragment that lacks at least one of the flanking genes; a fragment of cDNA or genomic DNA produced by polymerase chain reaction (PCR) and that lacks at least one of the flanking genes; a restriction fragment that lacks at least one of the flanking genes; a DNA encoding a non-naturally occurring protein such as a fusion protein, mutein, or fragment of a given protein; and a nucleic acid which is a degenerate variant of a cDNA or a naturally occurring nucleic acid. In addition, it includes a recombinant nucleotide sequence that is part of a hybrid gene, i.e., a gene encoding a non-naturally occurring fusion protein. Also included is a recombinant DNA that includes a portion of SEQ ID NOs: 10, 11, 12, 13, 14, 15, 16, 17, or 18. It will be apparent from the foregoing that isolated DNA does not mean a DNA present among hundreds to millions of other DNA molecules within, for example, cDNA or genomic DNA libraries or genomic DNA restriction digests in, for example, a restriction digest reaction mixture or an electrophoretic gel slice.
As used herein, a “functional fragment” of a YEGH polypeptide is a fragment of the polypeptide that is shorter than the full-length polypeptide and has at least 20% (e.g., at least: 30%; 40%; 50%; 60%; 70%; 80%; 90%; 95%; 98%; 99%; 100%, or more) of the ability of the full-length polypeptide to enantioselectively hydrolyse a GE of interest. Fragments of interest can be made by either recombinant, synthetic, or proteolytic digestive methods and tested for their ability to enantioselectively hydrolyse a GE.
As used herein, “operably linked” means incorporated into a genetic construct so that an expression control sequence effectively controls expression of a coding sequence of interest.
Unless otherwise defined, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this invention pertains. In case of conflict, the present document, including definitions, will control. Preferred methods and materials are described below, although methods and materials similar or equivalent to those described herein can be used in the practice or testing of the present invention. All publications, patent applications, patents and other references mentioned herein are incorporated by reference in their entirety. The materials, methods, and examples disclosed herein are illustrative only and not intended to be limiting.
Other features and advantages of the invention, e.g., glycidyl ether (GE) and associated vicinol diol (GD) substantially enriched for one optical enantiomer, will be apparent from the following description, from the drawings and from the claims.
Various aspects of the invention are described below.
The YEGH nucleic acid molecules of the invention can be cDNA, genomic DNA, synthetic DNA, or RNA, and can be double-stranded or single-stranded (i.e., either a sense or an antisense strand). Segments of these molecules are also considered within the scope of the invention, and can be produced by, for example, the polymerase chain reaction (PCR) or generated by treatment with one or more restriction endonucleases. A ribonucleic acid (RNA) molecule can be produced by in vitro transcription. Preferably, the nucleic acid molecules encode polypeptides that, regardless of length, are soluble under normal physiological conditions.
The nucleic acid molecules of the invention can contain naturally occurring sequences, or sequences that differ from those that occur naturally, but, due to the degeneracy of the genetic code, encode the same polypeptide (for example, one of the polypeptides with SEQ ID NOS: 1-9). In addition, these nucleic acid molecules are not limited to coding sequences, e.g., they can include some or all of the non-coding sequences that lie upstream or downstream from a coding sequence.
The nucleic acid molecules of the invention can be synthesized (for example, by phosphoramidite-based synthesis) or obtained from a biological cell, such as the cell of a eukaryote (e.g., a mammal such as human or a mouse or a yeast such as any of the genera, species, and strains of yeast disclosed herein) or a prokaryote (e.g., a bacterium such as Escherichia coli). The nucleic acids can be those of a yeast such as any of the genera, species, and strains of yeast disclosed herein. Combinations or modifications of the nucleotides within these types of nucleic acids are also encompassed.
In addition, the isolated nucleic acid molecules of the invention encompass segments that are not found as such in the natural state. Thus, the invention encompasses recombinant nucleic acid molecules (for example, isolated nucleic acid molecules encoding the polypeptides of SEQ. ID. NOs: 1-9) incorporated into a vector (for example, a plasmid or viral vector) or into the genome of a heterologous cell (or the genome of a homologous cell, at a position other than the natural chromosomal location). Recombinant nucleic acid molecules and uses therefor are discussed further below.
Techniques associated with detection or regulation of genes are well known to skilled artisans. Such techniques can be used, for example, to test for expression of a YEGH gene in a test cell (e.g., a yeast cell) of interest.
A YEGH family gene or protein can be identified based on its similarity to the relevant YEGH gene or protein, respectively. For example, the identification can be based on sequence identity. The invention features isolated nucleic acid molecules which are, or are at least 50% (e.g., at least: 55%; 60%; 65%; 75%; 85%; 95%; 98%; or 99%) identical to: (a) a nucleic acid molecule that encodes the polypeptide of SEQ ID NOs: 1-9; (b) the nucleotide sequence of SEQ ID NOs:10-18; (c) a nucleic acid molecule which includes a segment of at least 15 (e.g., at least: 20; 25; 30; 35; 40; 50; 60; 80; 100; 125; 150; 175; 200; 250; 300; 350; 400; 500; 600; 700; 800; 900; 1,000; 1,100; 1,150; 1,160; 1,170; 1,175; 1,178; 1,180; 1,181; 1,200; 1,220; 1,225; 1,226; 1,228; 1,230; 1,231; or 1,232) nucleotides of SEQ ID NOs:10-18; (d) a nucleic acid molecule encoding any of the polypeptides or fragments thereof disclosed below; and (e) the complement of any of the above nucleic acid molecules. The complements of the above molecules can be full-length complements or segment complements containing a segment of at least 15 (e.g., at least: 20; 25; 30; 35; 40; 50; 60; 80; 100; 125; 150; 175; 200; 250; 300; 350; 400; 500; 600; 700; 800; 900; 1,000; 1,100; 1,200; 1,220; 1,225; 1,228; 1,230; 1,231; or 1,232) consecutive nucleotides complementary to any of the above nucleic acid molecules. Identity can be over the full-length of SEQ ID NOs: 10-18 or over one or more contiguous or non-contiguous segments.
The determination of percent identity between two sequences is accomplished using the mathematical algorithm of Karlin and Altschul, Proc. Natl. Acad. Sci. USA 90, 5873-5877, 1993. Such an algorithm is incorporated into the BLASTN and BLASTP programs of Altschul et al. (1990) J. Mol. Biol. 215, 403-410. BLAST nucleotide searches are performed with the BLASTN program, score=100, wordlength=12, to obtain nucleotide sequences homologous to HIN-1-encoding nucleic acids. BLAST protein searches are performed with the BLASTP program, score=50, wordlength=3, to obtain amino acid sequences homologous to the HIN-1 polypeptide. To obtain gap alignments for comparative purposes, Gap BLAST is utilized as described in Altschul et al. (1997) Nucleic Acids Res. 25, 3389-3402. When utilizing BLAST and Gap BLAST programs, the default parameters of the respective programs (e.g., XBLAST and NBLAST) are used.
Hybridization can also be used as a measure of homology between two nucleic acid sequences. A YEGH-encoding nucleic acid sequence, or a portion thereof, can be used as a hybridization probe according to standard hybridization techniques. The hybridization of a YEGH probe to DNA or RNA from a test source (e.g., a mammalian cell) is an indication of the presence of YEGH DNA or RNA in the test source. Hybridization conditions are known to those skilled in the art and can be found in Current Protocols in Molecular Biology, John Wiley & Sons, N.Y., 6.3.1-6.3.6, 1991. Moderate hybridization conditions are defined as equivalent to hybridization in 2× sodium chloride/sodium citrate (SSC) at 30° C., followed by a wash in 1×SSC, 0.1% SDS at 50° C. Highly stringent conditions are defined as equivalent to hybridization in 6× sodium chloride/sodium citrate (SSC) at 45° C., followed by a wash in 0.2×SSC, 0.1% SDS at 65° C.
The invention also encompasses: (a) vectors (see below) that contain any of the foregoing YEGH coding sequences (including coding sequence segments) and/or their complements (that is, “antisense” sequences); (b) expression vectors that contain any of the foregoing YEGH coding sequences (including coding sequence segments) operably linked to one or more transcriptional and/or translational regulatory elements (TRE; examples of which are given below) necessary to direct expression of the coding sequences; (c) expression vectors encoding, in addition to a YEGH polypeptide (or a fragment thereof), a sequence unrelated to YEGH, such as a reporter, a marker, or a signal peptide fused to YEGH; and (d) genetically engineered host cells (see below) that contain any of the foregoing expression vectors and thereby express the nucleic acid molecules of the invention.
Recombinant nucleic acid molecules can contain a sequence encoding a YEGH polypeptide or a YEGH polypeptide having an heterologous signal sequence. The full length YEGH polypeptide, or a fragment thereof, can be fused to such heterologous signal sequences or to additional polypeptides, as described below. Similarly, the nucleic acid molecules of the invention can encode a YEGH that includes an exogenous polypeptide that facilitates secretion.
The TRE referred to above and further described below include but are not limited to inducible and non-inducible promoters, enhancers, operators and other elements that are known to those skilled in the art and that drive or otherwise regulate gene expression. Such regulatory elements include but are not limited to the cytomegalovirus hCMV immediate early gene, the early or late promoters of SV40 adenovirus, the lac system, the trp system, the TAC system, the TRC system, the major operator and promoter regions of phage A, the control regions of fd coat protein, the promoter for 3-phosphoglycerate kinase, the promoters of acid phosphatase, and the promoters of the yeast-mating factors. Other useful TRE are listed in the examples below.
Similarly, the nucleic acid can form part of a hybrid gene encoding additional polypeptide sequences, for example, a sequence that functions as a marker or reporter. Examples of marker and reporter genes include -lactamase, chloramphenicol acetyltransferase (CAT), adenosine deaminase (ADA), aminoglycoside phosphotransferase (neor, G418r), dihydrofolate reductase (DHFR), hygromycin-B-phosphotransferase (HPH), thymidine kinase (TK), lacZ (encoding -galactosidase), xanthine guanine phosphoribosyltransferase (XGPRT), and green, yellow, or blue fluorescent protein. As with many of the standard procedures associated with the practice of the invention, skilled artisans will be aware of additional useful reagents, for example, additional sequences that can serve the function of a marker or reporter. Generally, the hybrid polypeptide will include a first portion and a second portion; the first portion being a YEGH polypeptide (or any of YEGH fragments described below) and the second portion being, for example, the reporter described above or an Ig heavy chain constant region or part of an Ig heavy chain constant region, e.g., the CH2 and CH3 domains of IgG2a heavy chain. Other hybrids could include an antigenic tag or a poly-His tag to facilitate purification.
The expression systems that can be used for purposes of the invention include, but are not limited to, microorganisms such as yeasts (e.g., any of the genera, species or strains listed herein) or bacteria (for example, E. coli and B. subtilis) transformed with recombinant bacteriophage DNA, plasmid DNA, or cosmid DNA expression vectors containing the nucleic acid molecules of the invention; yeast (for example, Saccharomyces, Kluyveromyces, Hansenula, Pichia, Yarrowia, Arxula and Candida, and other genera, species, and strains listed herein) transformed with recombinant yeast expression vectors containing the nucleic acid molecule of the invention; insect cell systems infected with recombinant virus expression vectors (for example, baculovirus) containing the nucleic acid molecule of the invention; plant cell systems infected with recombinant virus expression vectors (for example, cauliflower mosaic virus (CaMV) or tobacco mosaic virus (TMV)) or transformed with recombinant plasmid expression vectors (for example, Ti plasmid) containing a YEGH nucleotide sequence; or mammalian cell systems (for example, COS, CHO, BHK, 293, VERO, HeLa, MDCK, W138, and NIH 3T3 cells) harboring recombinant expression constructs containing promoters derived from the genome of mammalian cells (for example, the metallothionein promoter) or from mammalian viruses (for example, the adenovirus late promoter and the vaccinia virus 7.5K promoter). Also useful as host cells are primary or secondary cells obtained directly from a mammal and transfected with a plasmid vector or infected with a viral vector.
The invention includes wild-type and recombinant cells including, but not limited to, yeast cells (e.g., any of those disclosed herein) containing any of the above YEGH genes, nucleic acid molecules, and genetic constructs. Other cells that can be used as host cells are listed herein. The cells are preferably isolated cells. As used herein, the term “isolated” as applied to a microorganism (e.g., a yeast cell) refers to a microorganism which either has no naturally-occurring counterpart (e.g., a recombinant microorganism such as a recombinant yeast) or has been extracted and/or purified from an environment in which it naturally occurs. Thus, an “isolated microorganism” does not include one residing in an environment in which it naturally occurs, for example, in the air, outer space, the ground, oceans, lakes, rivers, and streams and the like, ground at the bottom of oceans, lakes, rivers, and streams and the like, snow, ice on top of the ground or in/on oceans lakes, rivers, and streams and the like, man-made structures (e.g., buildings), or in natural hosts (e.g., plant, animal or microbial hosts) of the microorganism, unless the microorganism (or a progenitor of the microorganism) was previously extracted and/or purified from an environment in which it naturally occurs and subsequently returned to such an environment or any other environment in which it can survive. An example of an isolated microorganism is one in a substantially pure culture of the microorganism.
Moreover the invention provides a substantially pure culture of a microorganism (e.g., a microbial cell such as a yeast cell). As used herein, a “substantially pure culture” of a microorganism is a culture of that microorganism in which less than about 40% (i.e., less than about: 35%; 30%; 25%; 20%; 15%; 10%; 5%; 2%; 1%; 0.5%; 0.25%; 0.1%; 0.01%; 0.001%; 0.0001%; or even less) of the total number of viable microbial (e.g., bacterial, fungal (including yeast), mycoplasmal, or protozoan) cells in the culture are viable microbial cells other than the microorganism. The term “about” in this context means that the relevant percentage can be 15% percent of the specified percentage above or below the specified percentage. Thus, for example, about 20% can be 17% to 23%. Such a culture of microorganisms includes the microorganisms and a growth, storage, or transport medium. Media can be liquid, semi-solid (e.g., gelatinous media), or frozen. The culture includes the cells growing in the liquid or in/on the semi-solid medium or being stored or transported in a storage or transport medium, including a frozen storage or transport medium. The cultures are in a culture vessel or storage vessel or substrate (e.g., a culture dish, flask, or tube or a storage vial or tube).
The microbial cells of the invention can be stored, for example, as frozen cell suspensions, e.g., in buffer containing a cryoprotectant such as glycerol or sucrose, as lyophilized cells. Alternatively, they can be stored, for example, as dried cell preparations obtained, e.g., by fluidised bed drying or spray drying, or any other suitable drying method. Similarly the enzyme preparations can be frozen, lyophilised, or immobilized and stored under appropriate conditions to retain activity.
The YEGH polypeptides of the invention include all the YEGH and fragments of YEGH disclosed herein. They can be, for example, the polypeptides with SEQ ID NOs: 1-9 and functional fragments of these polypeptides. The polypeptides embraced by the invention also include fusion proteins that contain either full-length or a functional fragment of it fused to unrelated amino acid sequence. The unrelated sequences can be additional functional domains or signal peptides.
The invention features isolated polypeptides which are, or are at least 50% (e.g., at least: 55%; 60%; 65%; 75%; 85%; 95%; 98%; or 99%) identical to the polypeptides with SEQ ID NOs: 1-9. The identity can be over the full-length of the latter polypeptides or over one or more contiguous or non-contiguous segments.
Fragments of YEGH polypeptide are segments of the full-length YEGH polypeptide that are shorter than full-length YEGH. Fragments of YEGH can contain 5-410 (e.g., 10, 20, 30, 40, 50, 60, 70, 80, 90, 100, 120, 150, 250, 300, 350, 380, 390, 391, 392, 393, 400, 405, 406, 407, 408, 409, or 410) amino acids of SEQ ID NOs: 1-9. Fragments of YEGH can be functional fragments or antigenic fragments.
The polypeptides can be any of those described above but with not more 50 (e.g., not more than 50, 45, 40, 35, 30, 25, 20, 17, 14, 12, 10, nine, eight, seven, six, five, four, three, two, or one) conservative substitution(s). Such substitutions can be made by, for example, site-directed mutagenesis or random mutagenesis of appropriate YEGH coding sequences
“Functional fragments” of a YEGH polypeptide (and, optionally, any of the above-described YEGH polypeptide variants) have at least 20% (e.g., 30%, 40%, 50%, 60%, 70%, 80%, 90%, 95%, 98%, 99%, 100%, or more) of the ability of the full-length, wild-type YEGH polypeptide to enantioselectively hydrolyse a GE of interest. One of skill in the art will be able to predict YEGH functional fragments using his or her own knowledge and information provided herein, e.g., the amino acid alignments in
Fragments of interest can be made either by recombinant, synthetic, or proteolytic digestive methods and tested for their ability to enantioselectively hydrolyse enantiomers of racemic GE.
Antigenic fragments of the polypeptides of the invention are fragments that can bind to an antibody. Methods of testing whether a fragment of interest can bind to an antibody are known in the art.
The polypeptides can be purified from natural sources (e.g., wild-type or recombinant yeast cells such as any of those described herein). Smaller peptides (e.g., those less than about 100 amino acids in length) can also be conveniently synthesized by standard chemical means. In addition, both polypeptides and peptides can be produced by standard in vitro recombinant DNA techniques and in vivo transgenesis, using nucleotide sequences encoding the appropriate polypeptides or peptides. Methods well-known to those skilled in the art can be used to construct expression vectors containing relevant coding sequences and appropriate transcriptional/translational control signals. See, for example, the techniques described in Sambrook et al., Molecular Cloning: A Laboratory Manual (2nd Ed.) [Cold Spring Harbor Laboratory, N.Y., 1989], and Ausubel et al., Current Protocols in Molecular Biology [Green Publishing Associates and Wiley Interscience, N.Y., 1989].
Polypeptides and fragments of the invention also include those described above, but modified by the addition, at the amino- and/or carboxyl-terminal ends, of a blocking agent to facilitate survival of the relevant polypeptide. This can be useful in those situations in which the peptide termini tend to be degraded by proteases. Such blocking agents can include, without limitation, additional related or unrelated peptide sequences that can be attached to the amino and/or carboxyl terminal residues of the peptide to be administered. This can be done either chemically during the synthesis of the peptide or by recombinant DNA technology by methods familiar to artisans of average skill.
Alternatively, blocking agents such as pyroglutamic acid or other molecules known in the art can be attached to the amino and/or carboxyl terminal residues, or the amino group at the amino terminus or carboxyl group at the carboxyl terminus can be replaced with a different moiety. Likewise, the peptides can be covalently or non-covalently coupled to pharmaceutically acceptable “carrier” proteins prior to administration.
Also of interest are peptidomimetic compounds that are designed based upon the amino acid sequences of the functional peptide fragments. Peptidomimetic compounds are synthetic compounds having a three-dimensional conformation (i.e., a “peptide motif”) that is substantially the same as the three-dimensional conformation of a selected peptide. The peptide motif provides the peptidomimetic compound with the ability to enantioselectively hydrolyse a GE of interest in a manner qualitatively identical to that of the YEGH functional fragment from which the peptidomimetic was derived. Peptidomimetic compounds can have additional characteristics that enhance their therapeutic utility, such as increased cell permeability and prolonged biological half-life.
The peptidomimetics typically have a backbone that is partially or completely non-peptide, but with side groups that are identical to the side groups of the amino acid residues that occur in the peptide on which the peptidomimetic is based. Several types of chemical bonds, e.g., ester, thioester, thioamide, retroamide, reduced carbonyl, dimethylene and ketomethylene bonds, are known in the art to be generally useful substitutes for peptide bonds in the construction of protease-resistant peptidomimetics.
The invention also provides compositions and preparations containing one or more (e.g., two, three, four, five, six, seven, eight, nine, ten, 12, 15, 20, 25, or more) of the above-described polypeptides, polypeptide variants, and polypeptide fragments. The composition or preparation can be, for example a crude cell (e.g., yeast cell) extract or culture supernatant, a crude enzyme preparation, a highly purified enzyme preparation. The compositions and preparations can also contain one or more of a variety of carriers or stabilizers known in the art. Carriers and stabilizers are known in the art and include, for example: buffers, such as phosphate, citrate, and other non-organic acids; antioxidants such as ascorbic acid; low molecular weight (less than 10 residues) polypeptides; proteins such as serum albumin, gelatin or immunoglobulins; hydrophilic polymers such polyvinylpyrrolidone; amino acids such as glycine, glutamine, asparagine, arginine, or lysine; monosaccharides, disaccharides, and other carbohydrates including glucose, mannose, or dextrans; chelating agents such as ethylenediaminetetraacetic acid (EDTA); sugar alcohols such as mannitol, or sorbitol; salt-forming counterions such as sodium; and/or nonionic surfactants such as Tween and Pluronics.
The invention provides methods for obtaining enantiopure, or substantially enantiopure, optically active GE and optically active GD. Enantiopure optically active GE or GD preparations are preparations containing one enantiomer of the GE or GD and none of the other enantiomer of the GE or GD. “Substantially enantiopure” optically active GE or GD preparations are preparations containing at least 55% (e.g., at least: 60%; 70%; 80%; 85%; 90%; 95%; 97%; 98%; 99%; 99.5%; 99.8%; or 99.9%), relative to the total amount of both GE or GD enantiomers, of the particular enantiomer of the GE or the GD.
The method involves exposing a GE sample containing a mixture of both enantiomers of the GE to a YEGH polypeptide (e.g., an isolated YEGH polypeptide or one in a microbial cell), which selectively catalyzes the conversion of one of the enantiomers of the GE to a corresponding GD. In this way the desired GD is produced, the selective GE enantiomer substrate for the YEGH is selectively depleted, and the relative proportion (of the total amount of the GE) of the other GE enantiomer is increased. YEGH polypeptides useful for the invention (i.e., those with GE enantioselective activity) will catalyze the conversion of one enantiomer of a GE to its corresponding GD with less than 80% (e.g., less than: 70%, 60%, 50%, 40%, 30%; 20%; 10%; 5%; 2.5%; 1%; 0.5%; 0.01%) of the efficiency that its catalyzes the conversion of the other enantiomer of the GE to its corresponding GD. The starting enantiomeric mixtures can be racemic with respect to the two GE enantiomers or they can contain various proportions of the two GE enantiomers ((e.g., 95:5, 90:10, 80:20, 70:30, 60:40 or 50:50) In addition, optimal concentrations of the GE and conditions of incubation will vary from one YEGH polypeptide to another and from one GE to another. Given the teachings of the working examples contained herein, one skilled in the art will know how to select working conditions for the production of a desired enantiomer of a desired GD and/or GE.
The method can be implemented by, for example, incubating (culturing) an enantiomeric glycidyl ether with a wild-type yeast cell or a recombinant cell (yeast or any other host species listed herein) containing a nucleic acid sequence (e.g., a gene or a recombinant nucleic acid sequence) encoding a YEGH polypeptide, a crude extract from such cells, a semi-purified preparation of a YEGH polypeptide, or an isolated YEGH polypeptide, all of which exhibit epoxide hydrolase activity with chiral preference.
The strain of the yeast cell can be selected from the following genera: Arxula, Brettanomyces, Bullera, Bulleromyces, Candida, Cryptococcus, Debaryomyces, Dekkera, Exophiala, Geotrichum, Hormonema, Issatchenkia, Kluyveromyces, Lipomyces, Mastigomyces, Myxozyma, Pichia, Rhodosporidium, Rhodotorula, Sporidiobolus, Sporobolomyces, Trichosporon, Wingea, and Yarrowia.
Yeast strains innately capable of producing a polypeptide that converts or hydrolyses a range of different types of enantiomeric glycidyl ether to optically active (i.e. enantiopure or substantially enantiopure) equivalents and/or optically active associated diols include the following exemplary genera and species:
Arxula adeninivorans, Arxula terrestris, Brettanomyces bruxellensis, Brettanomyces naardenensis, Brettanomyces anomalus, Brettanomyces species (e.g. Unidentified species NCYC 3151), Bullera dendrophila, Bulleromyces albus, Candida albicans, Candida fabianii, Candida glabrata, Candida haemulonii, Candida intermedia, Candida magnoliae, Candida parapsilosis, Candida rugosa, Candida tenuis, Candida tropicalis, Candida famata, Candida kruisei, Candida sp. (new) related to C. sorbophila, Cryptococcus albidus, Cryptococcus amylolentus, Cryptococcus bhutanensis, Cryptococcus curvatus, Cryptococcus gastricus, Cryptococcus humicola, Cryptococcus hungaricus, Cryptococcus laurentii, Cryptococcus luteolus, Cryptococcus macerans, Cryptococcus podzolicus, Cryptococcus terreus, Debaryomyces hansenii, Dekkera anomala, Exophiala dermatitidis, Geotrichum spp. (e.g. Unidentified species UOFS Y-0111), Hormonema spp. (e.g. Unidentified species NCYC 3171), Issatchenkia occidentalis, Kluyveromyces marxianus, Lipomyces spp. (e.g. Unidentified species UOFS Y-2159), Lipomyces tetrasporus, Mastigomyces philipporii, Myxozyma melibiosi, Pichia anomala, Pichia finlandica, Pichia guillermondii, Pichia haplophila, Rhodosporidium lusitaniae, Rhodosporidium paludigenum, Rhodosporidium sphaerocarpum, Rhodosporidium toruloides, Rhodosporidium paludigenum, Rhodotorula araucariae, Rhodotorula glutinis, Rhodotorula minuta, Rhodotorula minuta var. minuta, Rhodotorula mucilaginosa, Rhodotorula philyla, Rhodotorula rubra, Rhodotorula spp. (e.g. Unidentified species NCYC 3193, UOFS Y-2042, UOFS Y-0448, UOFS Y-0139, UOFS Y-0560), Rhodotorula aurantiaca, Rhodotorula spp. (e.g. Unidentified species NCYC 3224), Rhodotorula sp. “mucilaginosa”, Sporidiobolus salmonicolor, Sporobolomyces holsaticus, Sporobolomyces roseus, Sporobolomyces tsugae, Trichosporon beigelii, Trichosporon cutaneum var. cutaneum, Trichosporon delbrueckii, Trichosporon jirovecii, Trichosporon mucoides, Trichosporon ovoides, Trichosporon pullulans, Trichosporon spp. (e.g. Unidentified species NCYC 3210, NCYC 3211, NCYC 3212, UOFS Y-0861, UOFS Y-1615, UOFS Y-0451, UOFS Y-0449, UOFS Y-2113), Trichosporon moniliiforme, Trichosporon montevideense, Wingea robertsiae, and Yarrowia lipolytica.
The yeast strain may be at least one yeast strain selected from the group consisting of the yeast species listed in Tables 2, 3, 4 and 5.
Cultivation in bioreactors (fermenters) of yeast strains expressing a YEGH polypeptide, or fragment thereof, (with the purpose of preparing yeasts stocks or for the enantioselective preparative methods of the invention) can be carried out under conditions that provide useful biomass and/or enzyme titer yields. Cultivation can be by batch, fed-batch or continuous culture methods. Useful cultivation conditions are dependent on the yeast strain used. General procedures for establishing useful growth conditions of yeasts, fungi and bacteria in bioreactors are known to those skilled in the art. The enantiomeric mixture of GE can be added directly to the culture. The concentration of the GE enantiomeric mixture in the reaction matrix can be at least equal to the soluble concentration of the GE enantiomeric mixture in water. The preferred GE level in the reaction matrix is greater than the solubility limit in the aqueous reaction medium thereby resulting in a two phase reaction system. The starting amount of GE added to the reaction mixture is not critical, provided that the concentration is at least equal to the solubility of the specific GE in the aqueous reaction medium. The GE can be metered out continuously or in batch mode to the reaction mixture. The relative proportions of the (R)- and (S)-glycidyl ether s in the mixture of enantiomers of the GE shown by the general formula (I) is not critical but it is advantageous for commercial purpose to employ a racemic form of the GE shown by the general formula (I). The GE can be added in a racemic form or as a mixture of enantiomers in different ratios.
The amount of the yeast cells, crude yeast cell extract, or partially purified or isolated polypeptide having GE enantioselective activity added to the reaction depends on the kinetic parameters of the specific reaction and the amount of GE that is to be hydrolysed. In the case of product inhibition, it can be advantageous to remove the formed GD from the reaction mixture or to maintain the concentration of the GD at levels that allow reasonable reaction rates. Techniques used to enhance enzyme and biomass yields include the identification of useful (or optimal) carbon sources, nitrogen sources, cultivation time, dilution rates (in the case of continuous culture) and feed rates, carbon starvation, addition of trace elements and growth factors to the culture medium, and addition of inducers (for example substrates or substrate analogs of the epoxide hydrolases) during cultivation. In the case of recombinant hosts, the conditions under which the promoters function workably for transcription of the gene encoding the polypeptide with epoxide hydrolase activity are taken into account. At the end of fermentation (culture), biomass and culture medium can be separated by methods known to one skilled in the art, such as filtration or centrifugation.
The processes are generally performed under mild conditions. For example, the reactions can be carried out at a pH from 5 to 10, preferably from 6.5 to 9, and most preferably from 7 to 8.5. The temperature for hydrolysis can be from 0 to 70° C., preferably from 0 to 50° C., most preferably from 4 to 40° C. It is also known that lowering of the temperature of the reaction can enhance enantioselectivity of an enzyme.
The reaction mixture can contain mixtures of water with at least one water-miscible solvents (e.g., water-miscible organic solvents). Preferably, water-miscible solvents are added to the reaction mixture such that epoxide hydrolase activity remains measurable. Water-miscible solvents are preferably organic solvents and can be, for example, acetone, methanol, ethanol, propanol, isopropanol, acetonitrile, dimethylsulfoxide, N,N-dimethylformamide, N-methylpyrrolidine, and the like.
The reaction mixture can also, or alternatively, contain mixtures of water with at least one water-immiscible organic solvent. Examples of water-immiscible solvents that can be used include, for example, toluene, 1,1,2-trichlorotrifluoroethane, methyl tert-butyl ether, methyl isobutyl ketone, dibutyl-o-phtalate, aliphatic alcohols containing 6 to 9 carbon atoms (for example hexanol, octanol), aliphatic hydrocarbons containing 6 to 16 carbon atoms (for example cyclohexane, n-hexane, n-octane, n-decane, n-dodecane, n-tetradecane and n-hexadecane or mixtures of the aforementioned hydrocarbons), and the like. Thus, the reaction mixture can include water with at least one water-immiscible organic solvent selected from the group consisting of toluene, 1,1,2-trichlorotrifluoroethane, methyl tert-butyl ether, methyl isobutyl ketone, dibutyl-o-phtalate, aliphatic alcohols containing 6 to 9 carbon atoms, and aliphatic hydrocarbons containing 6 to 16 carbon.
The reaction mixture can also contain surfactants (for example, Tween 80), cyclodextrins or any agent that can increase the solubility, selectively or otherwise, of the GE enantiomers in the aqueous reaction phase.
The reaction mixture can also contain a buffer. Buffers are known in the art and include, for example, phosphate buffers, Tris buffer, and HEPES buffers.
The production of the YEGH polypeptides, including functional fragments, can be, for example, as recited above in the section on Polypeptides and Polypeptide Fragments. Thus they can be made by production in a natural host cell, production in a recombinant host cell, or synthetic production. Recombinant production can be carried out in host cells of microbial origin. Preferred yeast host cells are selected from, but are not limited to, the genera Saccharomyces, Kluyveromyces, Hansenula, Pichia, Yarrowia and Candida. Preferred bacterial host cells include Escherichia coli, Agrobacterium species, Bacillus species and Streptomyces species. Preferred filamentous fungal host cells are selected from the group consisting of the genera Aspergillus, Trichoderma, and Fusarium. The production of the polypeptide can be, e.g., intra- or extra-cellular production and can be by, e.g., secretion into the culture medium.
In these fermentation reactions of the invention, the polypeptides (including functional fragments) can be immobilized on a solid support or free in solution. Procedures for immobilization of the yeast or preparation thereof include, but are not limited to, adsorption; covalent attachment; cross-linked enzyme aggregates; cross-linked enzyme crystals; entrapment in hydrogels; and entrapment into reverse micelles.
The progress of the reaction can be monitored by standard procedures known to one skilled in the art, which include, for example, gas chromatography or high-pressure liquid chromatography on columns containing chiral stationary phases. The GD formed can be removed from the reaction mixture at one or more stages of the reaction.
The reaction can be terminated when one enantiomer of the GE and/or GD is found to be in excess compared to the other enantiomer of the GE and/or GD. Preferably, the reaction is terminated when one enantiomer of a GE of general formula (I) and/or GD of general formula (II) is found to be in an enantiomeric excess of at least 90%. In a more preferred embodiment of the invention, the reaction is terminated when one enantiomer of a GE of general formula (I) and/or GD of general formula (II) is found to be in an enantiomeric excess of at least 95%. The reaction can be terminated by the separation (for example centrifugation, membrane filtration and the like) of the yeast, or a preparation thereof, from the reaction mixture or by inactivation (for example by heat treatment or addition of salts and/or organic solvents) of the yeast or polypeptide, or preparation thereof. Thus, the reaction can be stop for by, for example, the separation of the catalytic agent from the reactants and products in the mixture, or by ablation or inhibition of the catalytic activity, by techniques known to one skilled in the art.
The optically active GE and/or GD produced by the reaction can be recovered from the reaction mixture, directly or after removal of the yeast, or preparation thereof. Preferably, the process can include continuously recovering the optically active GE and/or GD produced by the reaction directly from the reaction mixture. Methods of removal of the optically active GE and/or GD produced by the reaction include, for example, extraction with an organic solvent (such as hexane, toluene, diethyl ether, petroleum ether, dichloromethane, chloroform, ethyl acetate and the like), vacuum concentration, crystallisation, distillation, membrane separation, column chromatography and the like.
Thus, the present invention provides an efficient process with economical advantages compared to other chemical and biological methods for the production, in high enantiomeric purity, of optically active GE of the general formula (I) and vicinal diol GD of the general formula (II) in the presence of a yeast strain having YEGH activity or a polypeptide having such activity.
The invention features antibodies that bind to yeast epoxide hydrolase polypeptides or fragments (e.g., antigenic or functional fragments) of such polypeptides. The polypeptides are preferably yeast epoxide polypeptides with enantioselective activity, and in particular those with glycidyl ether enantioselective activity (i.e., YEGH), e.g., those with SEQ ID NOs: 1, 2, 3, 4, 5, 6, 7, 8, or 9. The antibodies preferably bind specifically to yeast epoxide hydrolase polypeptides, i.e., not to epoxide hydrolase polypeptides of species other than yeast species. More preferably, they can bind specifically to yeast epoxide polypeptides with enantioselective activity, and in particular to YEGH polypeptides, e.g., those with SEQ ID NOs: 1, 2, 3, 4, 5, 6, 7, 8, or 9. They can moreover bind specifically to one or more of polypeptides with SEQ ID NOs: 1, 2, 3, 4, 5, 6, 7, 8, or 9.
Antibodies can be polyclonal or monoclonal antibodies; methods for producing both types of antibody are known in the art. The antibodies can be of any class (e.g., IgM, IgG, IgA, IgD, or IgE). They are preferably IgG antibodies. Moreover, polyclonal antibodies and monoclonal antibodies can be generated in, or generated from B cells from, animals any number of vertebrate (e.g., mammalian) species, e.g., humans, non-human primates (e.g., monkeys, baboons, or chimpanzees), horses, goats, camels, sheep, pigs, bovine animals (e.g., cows, bulls, or oxen), dogs, cats, rabbits, gerbils, hamsters, guinea pigs, rats, mice, birds (such as chickens or turkeys), or fish.
Recombinant antibodies specific for YEGH polypeptides, such as chimeric monoclonal antibodies composed of portions derived from different species and humanized monoclonal antibodies comprising both human and non-human portions, are also encompassed by the invention. Such chimeric and humanized monoclonal antibodies can be produced by recombinant DNA techniques known in the art, for example, using methods described in Robinson et al., International Patent Publication PCT/US86/02269; Akira et al., European Patent Application 184,187; Taniguchi, European Patent Application 171,496; Morrison et al., European Patent Application 173,494; Neuberger et al., PCT Application WO 86/01533; Cabilly et al., U.S. Pat. No. 4,816,567; Cabilly et al., European Patent Application 125,023; Better et al. (1988) Science 240, 1041-43; Liu et al. (1987) J. Immunol. 139, 3521-26; Sun et al. (1987) PNAS 84, 214-18; Nishimura et al. (1987) Canc. Res. 47, 999-1005; Wood et al. (1985) Nature 314, 446-49; Shaw et al. (1988) J. Natl. Cancer Inst. 80, 1553-59; Morrison, (1985) Science 229, 1202-07; Oi et al. (1986) BioTechniques 4, 214; Winter, U.S. Pat. No. 5,225,539; Jones et al. (1986) Nature 321, 552-25; Veroeyan et al. (1988) Science 239, 1534; and Beidler et al. (1988) J. Immunol. 141, 4053-60.
Also useful for the invention are antibody fragments and derivatives that contain at least the functional portion of the antigen-binding domain of an antibody that binds to a YEGH polypeptide. Antibody fragments that contain the binding domain of the molecule can be generated by known techniques. Such fragments include, but are not limited to: F(ab′)2 fragments that can be produced by pepsin digestion of antibody molecules; Fab fragments that can be generated by reducing the disulfide bridges of F(ab′)2 fragments; and Fab fragments that can be generated by treating antibody molecules with papain and a reducing agent. See, e.g., National Institutes of Health, 1 Current Protocols In Immunology, Coligan et al., ed. 2.8, 2.10 (Wiley Interscience, 1991). Antibody fragments also include Fv fragments, i.e., antibody products in which there are few or no constant region amino acid residues. A single chain Fv fragment (scFv) is a single polypeptide chain that includes both the heavy and light chain variable regions of the antibody from which the scFv is derived. Such fragments can be produced, for example, as described in U.S. Pat. No. 4,642,334, which is incorporated herein by reference in its entirety. The antibody can be a “humanized” version of a monoclonal antibody originally generated in a different species.
The above-described antibodies can be used for a variety of purposes including, but not limited to, YEGH polypeptide purification, detection, and quantitative measurement.
The following examples serve to illustrate, not limit, the invention.
In the examples, quantitative determinations of the compounds and determination of enantiomeric excesses were carried out by GC and HPLC. Gas chromatography (GLC) was performed on a Hewlett-Packard 6890 gas chromatograph equipped with FID detector and using H2 as carrier gas. HPLC was performed on a Hewlett-Packard 1050 liquid chromatograph equipped with a UV detector. Chiral analysis of glycidyl ethers was done as follows:
The glycidyl ethers that were used to illustrate the use of the different yeast strains to prepare optically active glycidyl ethers (GE) and vicinal diols (GD) from enantiomeric mixtures of glycidyl ethers represented by the general formula (I) is given in Scheme I.
All GE substrates used were commercially available, with the exception of benzyl glycidyl ether and naphtyl glycidyl ether.
Benzyl glycidyl ether was synthesized by addition of epichlorohydrin to benzylalcohol as follows:
A mixture of benzyl alcohol (1.401, 13.53 mol) and epichlorohydrin (1.161, 14.88 mol) was placed in a 10 l mechanically stirred baffled reactor with efficient cooling. The mixture was cooled to ˜5° C., treated with tetra-n-butylammonium iodide (99.94 g, 0.27 mol), followed by dosing of 50% (w/v) aqueous sodium hydroxide solution (7.5 l, 93.75 mol) in four portions over approximately 1 h. No substantial exotherm was noted. The dense emulsion formed on agitation at 600 rpm was left at this temperature for 2 h, then allowed to gradually warm to room temperature over 18 h. The mixture was then extracted with dichloromethane (2×2.5 l) and the solvent removed until about 2 l remained, after which MgSO4 was added to the stirred solution. Filtration and further concentration afforded about 1.6 l of orange-brown oil. Two cycles of distillation (95-97° C./0.4 mmHg) afforded a clear oil, benzyl glycidyl ether (1.13 kg, 51%). δH (200 MHz, CDCl3) 7.41-7.26 (5H, m, aryl H), 4.65 (d, 1H, PhCHaHb, J 12.0), 4.58 (d, 1H, PhCHaHb, J 12.0), 3.79 (dd, 1H, OCHaHb, J 11.6 and 3.2), 3.47 (dd, 1H, OCHaHb, J 11.4 and 5.6), 3.26-3.14 [m, 1H, CH2CH(O)], 2.82 [˜t, 1H, CHCHaHb(O), J 4.6] and 2.64 [dd, 1H, CHCHaHb(O), J 5.2 and 2.8].
Naphtyl glycidyl ether was synthesised as follows:
A mixture of 2-naphthol (5.014 g, 34.780 mmol) in epichlorohydrin (35 cm3) was treated with solid potassium carbonate (10.407 g, 75.298 mmol) in the presence of tetra-n-butylammonium iodide (0.192 g, 0.521 mmol) as phase-transfer catalyst. The mixture was stirred for 18 h at room temperature, after which it was diluted with 50 cm3 each of dichloromethane and water. Extraction with dichloromethane, drying (MgSO4) and concentration afforded an orange oil. This was distilled (180-190° C./3 mmHg) to afford a clear oil that crystallised on standing. The yellow tacky solid was recrystallised with difficulty from ethyl acetate/hexane to afford white needles of 2-naphtyl glycidyl ether (3.525 g, 51%). δH (200 MHz, CDCl3) 7.80-7.59 and 7.58-7.06 (7H, 2×m, aryl H), 4.35 (1H, dd, OCHaHb, J 11.2 and 3.4), 4.09 (1H, dd, OCHaHb, J 11.2 and 5.6), 3.52-3.29 [m, 1H, CH2CH(O)], 2.94 [˜t, 1H, CHCHaHb(O), J 5.0] and 2.81 [dd, 1H, CHCHaHb(O), J 4.8 and 2.8].
Diol standards were prepared by acid hydrolysis of the corresponding glycidyl ethers.
Yeasts were grown at 30° C. in 1 L shake-flask cultures containing 200 ml yeast extract/malt extract (YM) medium (3% yeast extract, 2% malt extract, 1% peptone w/v) supplemented with 1% glucose (w/v). At late stationary phase (48-72 h) the cells were harvested by centrifugation (10 000 g, 10 min, 4° C.), washed with phosphate buffer (50 mM, pH7.5), centrifuged and frozen in phosphate buffer containing glycerol (20%) at −20° C. as 20% (w/v) cell suspensions. The cells were stored for several months without significant loss of activity.
Glycidyl ether (GE) substrate (10 μl of a 1M stock solution in EtOH) was added to a final concentration of 20-50 mM to 100-500 μl cell suspension (20-50% w/v) in phosphate buffer (50 mM, pH 7.5). The reaction mixtures were incubated at 25° C. for 1-5 hours. The reaction mixtures were extracted with EtOAc or hexane (equal volume) and centrifuged. GD formation was evaluated by TLC (silica gel Merck 60 F254). Compounds were visualized by spraying with vanillin/conc. H2SO4 (5 g/l). Reaction mixtures that showed substantial GD formation were evaluated for asymmetric hydrolysis of the GE by chiral GLC or HPLC analysis. Some reactions were repeated over longer or shorter times and with more dilute cell suspensions (10% w/v) in order to analyse the reactions at suitable conversions.
Frozen cells were thawed, washed with phosphate buffer (50 mM, pH 7.5) and resuspended in buffer. Cell suspensions (10 ml, 20% or 50% w/v) were placed in 20 ml glass bottles with screw caps fitted with septa. The substrate (100 or 250 μl of a 2M (v/v) stock solution in ethanol) was added to final concentrations of 20 mM or 50 mM. The mixtures were agitated on a shaking water bath at 30° C. The course of the bioconversions of the GE was followed by withdrawing samples (500 μl) at appropriate time intervals. Samples were extracted with 300 μl EtOAc or hexane. After centrifugation (3000×g, 2 min), the organic layer was dried over anhydrous MgSO4 and the products analyzed by chiral GLC or HPLC.
Absolute configurations were deduced by the elution order of the GE enantiomers on chiral HPLC columns as reported in literature (Xu et. al., 2004).
Yeast strains with “Jen” and numerical screen numbers were obtained from the Yeast Culture Collection of the University of the Free State. Yeast strains with screen numbers donated “AB” or “Car” or “Alf” or “Poh” were isolated from soil from specialised ecological niches that were selected based on our hypothesis that selectivity for specific classes of epoxides in microorganisms may be determined by environmental factors such as terpene-rich environments or highly contaminated soil. “AB” and “Alf” strains were isolated from Cape Mountain fynbos, an ecological environment unique to South Africa, “Car” strains were isolated from soil under pine trees, and “Poh” strains from soil contaminated by high concentrations of cyanide. These new isolated were subsequently deposited at the Yeast Culture Collection of the Free State and assigned UOFS numbers.
Cloning and Overexpression of Wild Type Yeast Epoxide Hydrolases in Yarrowia lipolytica as Production Host Under the Control of Different Promoters
The following features are common to all the E. coli/Y. lipolytica auto-cloning integrative vectors used:
Yarrowia
lipolytica
Rhodotorula mucilaginosa NCYC 3190
Rhodotorula araucariae NCYC 3183
Rhodosporidium toruloides UOFS Y-0471
Rhodosporidium paludigenum NCYC 3179
Cryptococcus neoformans var neoformans
Rhodosporidium toruloides NCYC 3181
Cryptococcus curvatus NCYC 3158
Yarrowia lipolytica Po1h
Yarrowia lipolytica Po1h
Yarrowia lipolytica UOFS Y-1138
Yarrowia lipolytica NCYC 3229
Rhodosporidium toruloides NCYC 3181
Rhodotorula mucilaginosa NCYC 3190
Rhodotorula araucariae NCYC 3183
Rhodosporidium toruloides UOFS Y-0471
Rhodosporidium paludigenum NCYC 3179
Cryptococcus neoformans var neoformans
pINA1291 (
pINA3313 (
To prepare the vectors for ligation with an epoxide hydrolase gene (or other insert to be expressed in Y. lipolytica), DNA was digested with BamHI and AvrII, and dephosphorylated using commercial Calf Intestinal Alkaline Phosphatase.
Total RNA was isolated from selected yeast strain cells and messenger RNA (mRNA) was purified from it. The mRNA was used as a template to synthesise complementary DNA (cDNA) using reverse transcriptase. The cDNA was then used as a template for Polymerase Chain Reaction (PCR) using appropriate primers. PCR primers were selected by repeated experimentation using multiple test primers for each yeast strain, the sequences of which were based on previously described epoxide hydrolase sequences from a variety of species. The nucleotide sequences of the forward and reverse primers used to generate cDNA coding sequences from mRNA from seven different yeast strains with appropriate restriction enzyme recognition sites at their termini are shown below. Restriction enzyme recognition sequences are underlined and the relevant restriction enzymes are shown in parentheses.
R. toruloides NCYC
R. toruloides UOFS
C. curvatus NCYC
R. araucariae NCYC
R. paludigenum NCYC
R. mucilaginosa NCYC
Y. lipolytica NCYC
Debaromyces hansenii
Filobasidium
neoformans
Each PCR reaction contained 200 M dNTPs, 250 nM of each primer, 2 mM of MgCl2, cDNA and 2.5 U of Taq polymerase in a 50 μl reaction volume. The PCR profile used was: 95° C. for 5 minutes, followed by 30 cycles of: 95° C.—1 min, 50° C.—1 min, 72° C.—2 min, then a final extension of 72° C. for 10 minutes. The PCR products were purified and digested with the restriction enzymes whose recognition sites are engineered at the end of the primers. The cDNA fragment was cloned into a vector and sequenced for confirmation.
Coding sequences to be inserted in either pYLHmA or pYLTsA were prepared with BamHI and AvrII at their termini. The above PCR primers were designed with these restriction sites, unless the sites were also present in the gene to be inserted. If this occurred, appropriate compatible restriction enzymes were selected. PCR template DNA was either the insert cloned into a different vector, or cDNA synthesized from the original host organism. PCR reactions consisted of 200 M dNTP's, 250 nM each primer, 1×Taq polymerase buffer, and 2.5 units Taq polymerase per 100 l reaction. The amplification programme used was: 95° C. for 5 minutes, 30 cycles of 95° C. for 1 minute, 50° C. for 1 minute, and 72° C. for 2 minutes, followed by a single duration at 72° C. for 10 minutes.
PCR products were purified and digested with the relevant restriction enzymes. The digested DNA was subsequently repurified and was ready for ligation into the prepared vector.
5. Preparation of pYLHmA or pYLTsA Constructs
Vector and insert were ligated at pmol end ratios of 3:1-10:1 (insert:vector), using commercial T4 DNA Ligase. Ligations were electroporated into any laboratory strain of Escherichia coli, using the Bio-Rad GenePulser, or equivalent electroporator. Transformants were selected on LM media (10 g/l yeast extract, 10 g/l tryptone, 5 g/l NaCl), supplemented with kanamycin (50 g/ml). Transformants were selected based on restriction enzyme digests of purified plasmid DNA.
6. Yarrowia lipolytica Transformation
Digestion of the pINA-series of plasmids with NotI resulted in the release of a bacterial DNA-free expression cassette, containing the ura3d4 (pYLHmA) or the ura3d1 (pYLTsA) marker gene and the promoter-gene-terminator.
Scaled-up quantities of each plasmid were isolated. NotI was used to restrict the plasmid DNA, and the digested DNA was run on an agarose gel. NotI digests resulted in generation of the bacterial fragment of the plasmid as a band at 2210 bp, and the expression cassette as a band of 2760 bp+size of insert (pYLHmA) or 2596 bp+size of insert (pYLTsA). The expression cassette fragments were excised from the gel and purified from the agarose. The purified fragment was used for transformation of Y. lipolytica Po1 h.
Primers YL-Fwd and YL-Rev were used to amplify the expression cassette. PCR reactions consisted of 200 M dNTP's, 250 pmol each primer, 1×Taq polymerase buffer and 2.5 units Taq polymerase per 100 l reaction. The amplification programme used was 95° C. for 5 minutes, 30 cycles of 95° C. for 1 minute, 50° C. for 1 minute, and 72° C. for 3½ minutes, followed by 72° C. for 10 minutes. The PCR product was purified from the PCR reaction mix and used for transformation of Y. lipolytica Po1 h.
DNA from salmon testes was made up as a 10 mg/ml stock in TE (10 mM Tris-HCl, pH 8.0, 1 mM EDTA) and sonicated to produce in fragments that range from approximately 15 kb to 100 bp, with most fragments in a range of 6 to 10 kb. The DNA was denatured by boiling. Aliquots are stored at −20° C.
6.1.4. Transformation of Yarrowia lipolytica with pYLHmA or pYLTsA
An adaptation of the method of Xuan et al (1988) was used for the transformation of Y. lipolytica Po1h. The yeast was inoculated into 50 ml YPD (10 g/l yeast extract, 20 g/l peptone, 20 g/l glucose) The culture was incubated at 30° C., 220 rpm until cell densities of 8×107-2×108 cells/ml were reached. The entire culture was harvested, the pellet resuspended in 10 ml TE and reharvested. 1 ml TE+0.1 M LiOAc was used to resuspend the pellet and the culture was incubated at 28° C. in a ProBlot Jr (Labnet) hybridisation oven, set at 4 rpm (or similar incubator) for 1 hour. Transformation mixes were set up with 0.5-2 g of transforming DNA+5 g of carrier DNA with 100 l of treated cells.
Each mix was set up in a 1.5 ml microfuge tube, and incubated in a 28° C. heating block for 30 minutes. 7 volumes of PEG reagent (40% PEG 4,000, 0.1 M LiOAc, 10 mM Tris, 1 mM EDTA, pH 7.5, filter-sterilised) were added to each, mixed carefully and incubated at 28° C. for a further 1 hour. The tubes were transferred to a 37° C. heating block for 15 minutes, and then pelleted for 1 minute at 13,000 rpm and the pellets carefully resuspended in 100 l dH2O. The transformations were plated on Y. lipolytica selective plates (17 g/l Difco yeast nitrogen base without amino acids and without (NH4)2SO4, 20 g/l glucose, 4 g/l NH4Cl, 2 g/l casamino acids, 300 mg/l leucine) and incubated at 28° C. Colonies appearing on the selective plates after 3-7 days were transferred onto fresh plates and regrown.
6.1.5. Confirmation of Integration of pYLHmA or pYLTsA
Colonies that grow on the newly-streaked selective plates were inoculated into 5 ml of YPD and grown at 30° C., 200 rpm for 24-48 hours. A small-scale genomic DNA isolation was performed.
PCR was performed using this genomic DNA as template, with either pINA-1 and pINA-2 as primers (transformants with pYLHmA), or pINA-3 and pINA-2 (pYLTsA). Each PCR reaction contained 200 M dNTPs, 250 nM of each primer, 2 mM of MgCl2, genomic DNA and 2.5 U/50 l of Taq polymerase. The PCR profile was as described above in 6.1.2. These primer sets should result in products the size of the inserted genes.
Yeasts were cultivated, harvested and frozen as described above. The racemic GE was added and the screening was performed as described above. Strains with the highest activities as judged by TLC from diol formation were subjected to chiral HPLC analysis as described above. The strains with E-values >2 are given as samples 1-55 in Table 2. E-values were calculated using the following formula:
All the yeast strains referred to in this and the following examples are kept and maintained at the University of the Free State (UFS), Department of Microbial, Biochemical and Food Biotechnology, Faculty of Natural and Agricultural Sciences, P.O. Box 339, Bloemfontein 9300, South Africa (Tel +27 51 401 2396, Fax +27 51 444 3219) and are readily identified by the yeast species and culture collection number as indicated. Representative examples of strains belonging to the different species have been deposited under the Budapest Treaty at National Collection of Yeast Cultures (NCYC), Institute of Food Research Norwich Research Park Colney, Norwich NR4 7UA, U.K. (Tel: +44-(0)1603-255274 Fax: +44-(0)1603-458-414 Email: ncyc@bbsrc.ac.uk) and are readily identified by the yeast species and culture collection accession number as indicated. The samples deposited with the NCYC are taken from the same deposit maintained by the South African Council for Scientific and Industrial Research (CSIR) since prior to the filing date of this application. The deposits will be maintained without restriction in the NCYC depository for a period of 30 years, or 5 years after the most recent request, or for the effective life of the patent, whichever is longer, and will be replaced if the deposit becomes non-viable during that period. Samples of the yeast strains not deposited at NCYC will be made available upon request on the same basis and conditions of the Budapest Treaty.
Arxula terrestris
Arxula terrestris
3
752
UOFS Y-1041
180
80.5
82.4
25.5
Candida magnoliae
Candida rugosa
Candida sp. (new) rel to C. sorbophila
Candida tenuis
8
Car 054
NCYC 3158
180
68.5
87.7
31.3
9
Car 014
UOFS Y-2225
180
53.3
84.5
20.2
Cryptococcus humicola
Cryptococcus humicola
12
Car 220a
UOFS Y-2262
180
74.0
80.6
20.5
13
Car 400
UOFS Y-2263
180
80.4
78.5
20.3
Debaryomyces hansenii
Debaryomyces hansenii
Debaryomyces hansenii
Lipomyces sp. (course)
Lipomyces sp. (smooth)
Mastigomyces philipporii
Pichia guillermondii
Pichia guillermondii
Pichia guillermondii
Pichia guillermondii
Pichia haplophila
Rhodosporidium toruloides
26
Car 006
UOFS Y-2223
180
64.6
87.4
29.0
27
Car 020
UOFS Y-2226
180
46.2
88.6
26.1
Rhodosporidium toruloides
29
Car 052
UOFS Y-2230
180
81.8
90.2
49.4
30
Car 059
UOFS Y-2231
180
36.1
89.9
26.8
31
Car 076
UOFS Y-2236
180
61.8
89.7
34.6
32
Car 078
UOFS Y-2240
180
65.5
86.7
27.5
Rhodosporidium toruloides
34
Car 093
UOFS Y-2242
180
86.4
83.3
30.3
Rhodosporidium toruloides
36
Car 100
UOFS Y-2245
180
78.4
86.4
32.7
37
Car 108
UOFS Y-2247
180
46.9
87.8
24.5
38
Car 120
UOFS Y-2249
180
49.8
87.1
23.8
39
Car 121
UOFS Y-2250
180
77.5
87.2
34.2
40
Car 126
UOFS Y-2251
180
42.4
87.2
22.2
41
Car 200
UOFS Y-2256
180
64.0
92.9
52.7
42
EP 230
NCYC 3185
180
27.8
91.8
30.8
43
Car 022
UOFS Y-2227
180
42.3
92.4
38.1
44
Car 060
UOFS Y-2232
180
39.6
87.9
23.0
45
Car 061
UOFS Y-2233
180
67.3
83.7
22.7
Rhodotorula minuta
Rhodotorula sp. nearest
minuta
48
Jen 29
NCYC 3197
180
100.0
72.3
84.6
Trichosporon jirovecii
50
14
NCYC 3205
180
56.2
92.4
44.5
Trichosporon mucoides
52
231
sp.
NCYC 3210
60
88.3
91.5
66.1
53
225
sp.
NCYC 3211
180
7.1
100.0
>200
54
224
sp.
UOFS Y-0449
180
33.5
100.0
>200
Yarrowia lipolytica
aReaction conditions: 50 mM glycidyl ether, 50% cells (w/v) in phosphate buffer (50 mM, pH 7.5)
Samples 56-68 (
Hydrolysis of (±)-phenyl glycidyl ether by yeasts selected from Table 2 was performed as described under general methods and materials at room temperature, unless otherwise stated. Biocatalyst concentrations (% m/v wet weight in the aqueous phase—equivalent to five-fold dry weight) are given on the graphs as are indications of the racemic substrate concentrations in millimolar.
Samples 69-75 (
Hydrolysis of (±)-phenyl glycidyl ether by the recombinant strains was performed as described under general methods and materials at room temperature, unless otherwise stated. Biocatalyst concentrations (% m/v wet weight in the aqueous phase equivalent to five-fold dry weight) are given on the graphs as are indications of the racemic substrate concentrations in millimolar.
Samples 76-176 in Table 3 illustrate examples of wild-type yeasts that were shown to be enantioselective on (±)-benzyl glycidyl ether with different enantioselectivities. Strains producing S-Benzyl glycidyl ether and S-diol have the “same” selectivity as displayed by yeasts for phenyl glycidyl ether i.e. that produce R-phenyl glycidyl ether, and is highlighted. The absolute configuration assignment changes because of a switch of priorities if the substitutents as defined by the Cahn-Ingold-Prelogg rule. Strains producing R-BGE and R-diol have the “opposite” selectivity as that displayed for phenyl glycidyl ether.
Arxula adeninivorans
S
Arxula terrestris
S
Bullera dendrophila
S
Candida famata
S
Candida intermedia
S
Candida magnoliae
S
Candida magnoliae
S
Candida magnoliae
S
Candida rugosa
S
Candida sp. (new) rel to C. sorbophila
S
Cryptococcus albidus
Cryptococcus curvatus
Cryptococcus curvatus
Cryptococcus humicola
Cryptococcus humicola
Cryptococcus humicola
Cryptococcus humicola
Cryptococcus hungaricus
Cryptococcus laurentii
Cryptococcus podzolicus
Cryptococcus podzolicus
Cryptococcus podzolicus
Cryptococcus podzolicus
Cryptococcus podzolicus
Cryptococcus podzolicus
Cryptococcus podzolicus
Cryptococcus podzolicus
Cryptococcus podzolicus
S
Cryptococcus terreus
Debaryomyces hansenii
S
Debaryomyces hansenii
S
Debaryomyces hansenii
S
Debaryomyces hansenii
S
Debaryomyces hansenii
S
Debaryomyces hansenii
S
Lipomyces sp.
S
Pichia guillermondii
S
Pichia guillermondii
S
Pichia guillermondii
S
Pichia guillermondii
S
Pichia guillermondii
S
Pichia guillermondii
S
Pichia guillermondii
S
Pichia haplophila
S
Pichia haplophila
S
Rhodosporidium lusitaniae
Rhodosporidium paludigenum
Rhodosporidium paludigenum
Rhodosporidium toruloides
Rhodosporidium toruloides
Rhodosporidium toruloides
Rhodosporidium toruloides
Rhodosporidium toruloides
Rhodosporidium toruloides
Rhodosporidium toruloides
Rhodosporidium toruloides
Rhodosporidium toruloides
Rhodosporidium toruloides
Rhodosporidium toruloides
Rhodosporidium toruloides
Rhodosporidium toruloides
Rhodosporidium toruloides
Rhodosporidium toruloides
S
Rhodosporidium toruloides
Rhodosporidium toruloides
Rhodosporidium toruloides
Rhodosporidium toruloides
Rhodosporidium toruloides
Rhodosporidium toruloides
Rhodosporidium toruloides
Rhodosporidium toruloides
Rhodosporidium toruloides
Rhodosporidium toruloides
Rhodosporidium toruloides
Rhodosporidium toruloides
Rhodosporidium toruloides
Rhodosporidium toruloides
Rhodotorula araucariae
Rhodotorula aurantiaca
Rhodotorula glutinis
Rhodotorula glutinis
Rhodotorula glutinis
Rhodotorula glutinis
Rhodotorula glutinis
Rhodotorula glutinis
Rhodotorula minuta
Rhodotorula mudilaginosa
Rhodotorula sp. nearest minuta
Rhodotorula sp. Minuta/mucilaginosa
Rhodotorula sp. Minuta/mucilaginosa
Rhodotorula philyla
Rhodotorula sp.
Rhodotorula sp.
Rhodotorula sp.
Sporidiobolus salmonicolor
Sporobolomyces holsaticus
Sporobolomyces roseus
Trichosporon jirovecii
Trichosporon mucoides
Trichosporon sp.
Yarrowia lipolytica
aReaction conditions: 50 mM benzyl glycidyl ether, 50% cells (w/v) in phosphate buffer (50 mM, pH 7.5), Reaction time 3 hours.
These samples illustrate the use of different wild type yeast strains selected from Table 3 to produce optically active glycidyl ethers and vicinal diols from glycidyl ethers. The graphs show the change in concentrations of the glycidyl ether enantiomers with time. Hydrolysis of (±)-benzyl glycidyl ether by yeasts selected from Table 3 was performed as described under general methods and materials at room temperature, unless otherwise stated. Substrate concentrations (mM) and biocatalyst concentrations (% m/v wet weight—equivalent to fivefold % m/v dry weight in the aqueous phase) are given on the graphs.
Samples 177-180 (
Samples 181 and 182 (
Samples 183-187 (
Samples 188-254 in Table 4 illustrate the stereoselective hydrolysis of furfuryl glycidyl ether (FGE) by selected wild-type yeasts.
Arxula adeninivorans
Arxula adeninivorans
Bullera dendrophila
Bullera dendrophila
Candida intermedia
Candida magnoliae
Cryptococcus albidus
Cryptococcus curvatus
Cryptococcus humicola
Cryptococcus humicola
Cryptococcus hungaricus
Cryptococcus laurentii
Cryptococcus macerans
Cryptococcus podzolicus
Cryptococcus podzolicus
Cryptococcus podzolicus
Cryptococcus podzolicus
Cryptococcus podzolicus
Debaryomyces hansenii
Debaryomyces hansenii
Exophiala dermatitidis
Lipomyces sp. (course)
Lipomyces sp. (smooth)
Mastigomyces philipporii
Myxozyma melibiosi
Pichia guillermondii
Pichia guillermondii
Pichia guillermondii
Pichia guillermondii
Pichia guillermondii
Pichia haplophila
Pichia haplophila
Rhodosporidium toruloides
Rhodosporidium toruloides
Rhodosporidium toruloides
Rhodosporidium toruloides
Rhodosporidium toruloides
Rhodosporidium toruloides
Rhodosporidium toruloides
Rhodosporidium toruloides
Rhodosporidium toruloides
Rhodosporidium toruloides
Rhodosporidium toruloides
Rhodosporidium toruloides
Rhodosporidium toruloides
Rhodosporidium toruloides
Rhodosporidium toruloides
Rhodotorula araucariae
Rhodotorula glutinis
Rhodotorula glutinis
Rhodotorula glutinis
Rhodotorula glutinis
Rhodotorula sp.
Rhodotorula sp.
Sporidiobolus salmonicolor
Sporobolomyces holsaticus
Trichosporon beigelii
S
Trichosporon cutaneum var.
cutaneum
Trichosporon jirovecii
Trichosporon moniliiforme
Trichosporon mucoides
Trichosporon mucoides
Trichosporon mucoides
Trichosporon pullulans
S
Trichosporon sp.
Trichosporon sp.
aReaction conditions: 50 mM furfuryl glycidyl ether, 50% cells (w/v) in phosphate buffer (50 mM, pH 7.5), Reaction time 3 hours.
Samples 255-254 (
Samples 260-258 (
Samples 264-295 in Table 5 illustrate the wild type yeasts identified as capable of producing optically active (S)- or (R)-isopropyl glycidyl ether (GIE) and (S)- or (R)-3-isopropoxy-1,2-propanediol from (±)-isopropyl glycidyl ether.
Arxula adeninivorans
Candida magnoliae
Candida magnoliae
Candida magnoliae
Candida magnoliae
Candida magnoliae
Candida parapsilosis
Cryptococcus albidus
Cryptococcus laurentii
Cryptococcus podzolicus
Cryptococcus podzolicus
Cryptococcus podzolicus
Debaryomyces hansenii
Debaryomyces hansenii
Pichia guillermondii
Pichia guillermondii
Pichia guillermondii
Pichia guillermondii
Pichia guillermondii
Pichia haplophila
Pichia haplophila
Pichia haplophila
Pichia guillermondii
Rhodosporidium toruloides
Rhodosporidium toruloides
Rhodotorula araucariae
Rhodotorula glutinis
Rhodotorula glutinis
Rhodotorula sp.
Rhodotorula sp.
Sporobolomyces tsugae
S
Yarrowia lipolytica
aReaction conditions: 50 mM isopropyl glycidyl ether, 50% cells (w/v) in phosphate buffer (50 mM, pH 7.5), Reaction time 3 hours.
Samples 296-297 (
Sample 293-294 (
Sample 300-301 (
Sample 302-305 (
The following references are included herein by reference thereto.
A number of embodiments of the invention have been described. Nevertheless, it will be understood that various modifications can be made without departing from the spirit and scope of the invention. Accordingly, other embodiments are within the scope of the following claims.
Number | Date | Country | Kind |
---|---|---|---|
2005/03085 | Apr 2005 | ZA | national |
PCT/IB2006/050143 | Jan 2006 | IB | international |