The present invention relates to the non-toxic production of cyclic peptides via modifications to a split-intein circular ligation of peptides and proteins (SICLOPPS) methodology for enhanced efficiency within mammalian cells.
The use of cyclic peptides in the early stages of drug discovery has become increasingly prevalent within pharmaceutical research and development. These polypeptides, ranging in length from just two conjoined amino acids to peptides comprising hundreds of such residues, are particularly useful for identifying protein-protein interaction inhibitors, with a further use in serving as key starting points for the design of drug-like small molecules.
Peptides have a particular utility as ligands against otherwise “undruggable” targets. Such undruggable targets may be intracellular molecules, specific protein-protein interactions, and are generally unsuitable to small molecules and biologics. The further cyclisation, or ring closure, of peptides enhances the lifespan of such molecules in vivo with a subsequent marked improvement in their pharmacokinetic dynamics. Whilst the scope of useful cyclic peptides found in nature is somewhat limited, the production of synthetic polypeptides as such in the laboratory opens an avenue of potential for discovering candidate drugs.
As a result, the generation of large scale libraries of synthetic cyclic peptides has become a cornerstone of modern drug discovery with substantial economic and commercial implications. Such genetically-encoded libraries allow for high-throughput screening and rapid deconvolution of hits that bind to target proteins, by sequencing DNA or mRNAtags that may be bound to each peptide.
One increasingly used method for intracellular cyclic peptide library generation is termed split-intein circular ligation of peptides and proteins (SICLOPPS). This readily accessible method can generate libraries of over 100 million members with significant speed and simplicity, with its intracellular nature allowing for integrated functional assays in vivo.
The approach makes use of intein splicing to cyclise each peptide of interest, or “extein”. Inteins are unique autoprocessing protein domains that can undergo a self-excising event from a larger precursor polypeptide through the cleavage of two peptide bonds, whilst ligating the N- and C-termini of the flanking extein sequences with a new peptide bond. A “split-intein”, more specifically, has its polypeptide sequence originating from two genes and can result in the flanking of an extein by two separate N-intein and C-intein domains. Following translation, the two domains non-covalently reassemble into a canonical active intein to carry out protein splicing.
A SICLOPPS construct encodes a C-terminus intein domain followed by the extein polypeptide sequence to be cyclised, and an N-terminus intein domain. Upon transcription and translation, the flanking regions associate to give an active intein that self-excises and cyclises the remaining polypeptide sequence between the C- and N-terminus intein domains as a result of splicing. Peptides of varying length and amino acid composition can be incorporated into the SICLOPPS method, providing that the first amino acid of the target peptide is a nucleophilic cysteine, serine or threonine.
The technique provides a simple method for generating cyclic peptide libraries, requiring just a SICLOPPS plasmid, a degenerate oligonucleotide, and a handful of straightforward molecular biology steps. The degenerate oligonucleotide will have been designed to determine the ring size of the cyclic peptides, the number of randomised amino acids, and any set amino acids to be incorporated. Each oligonucleotide, containing a unique extein sequence of interest, is integrated into a SICLOPPS plasmid via PCR-digest and ligation techniques to create a library. The plasmid library can then be transformed into cells containing a phenotypic assay, for example, and then screened. The identity of the active cyclic peptides is revealed by isolating the SICLOPPS plasmids from cells that show the desired phenotype, followed by DNA sequencing (Tavassoli 2017, Curr Opin Chem Biol 38: 30-35).
The advent of SICLOPPS introduced the benefit of interfacing cyclic peptide libraries with assays in a variety of organisms: ranging from E. coli, yeast, and mammalian cells. Intracellular functional assays can be conducted against a variety of targets, thus not only assessing affinity of each member of the library, but also its function against the given target. SICLOPPS libraries are DNA-encoded, which gives a large amount of control over the makeup of the library and allows a variety of libraries to be easily produced and screened against such targets. Examples of variations in SICLOPPS libraries that are easy to implement include: cyclic peptides of different ring sizes, libraries with different amino acid composition, or inclusion of a given amino acid, or motif in a set position in every member of the library. Thus, the user has absolute control over the makeup of their cyclic peptide library via the degenerate oligonucleotide that encodes it.
SICLOPPS originally used the DnaE split-inteins from Synechocystis sp. PCC6803 termed “Ssp” inteins. Ssp inteins have a relatively slow splice rate and a significant sensitivity to amino acid changes near the splice junctions, meaning that a significant portion of the cyclic peptide library may not actually be cyclic peptides, but rather exist as the partially spliced intein. Such limitations of the technique were, however, overcome with the adaptation of faster splicing and more promiscuous “Npu” inteins engineered from Nostoc punctiforme.
Yet despite the apparent progress from using alternative intein types, the use of inteins themselves still pose an issue with regards to low efficiency levels of the technique in cells. Townend and Tavassoli (2016) investigated the effects of SICLOPPS-generated cyclic peptide libraries on cell viability in E. coli (Townend & Tavassoli 2016, ACS Chem Biol 11(6): 1624-1630). The data demonstrated that, despite their rapid rate of splicing and tolerance to variation in extein sequence, some 42% of an Npu SICLOPPS library was found to be cytotoxic to their E. coli host, significantly reducing their utility. Owing to the study's utilisation of three different E. coli strains (DH5a, BW27786, and BL21), it was deemed unlikely that such observed effects would be strain specific. The assay was further carried out with the less favourable Ssp inteins, with results indicating that around 14% of the library members also affected host viability. While it is likely that a portion of the cyclic peptides encoded by the library are inherently toxic to E. coli, for example through interference with a critical protein or pathway, both sets of inteins encoded the same library in the experiment. The higher level of toxicity observed with the Npu inteins, thus, can only be attributed to the Npu inteins themselves.
In a 2016 study, scientists engineered an SsrA tag (AANDENYALAA; SEQ ID NO: 11) to the C-terminus of a protein of interest, in order to target the spliced inteins for intracellular degradation (Townend & Tavassoli 2016, ACS Chem Biol 11(6): 1624-1630). Addition of the SsrA sequence was shown to direct the tagged protein to the ClpXP machinery native to E. coli for degradation, reducing the half-life of the tagged protein to around 5 minutes.
Whilst there has been an investigation into reducing the cytotoxicity of Npu SICLOPPS inteins in E. coli, there exists no such described alternative approach applicable for use in mammalian cells that differ in cellular functions to prokaryotic cells. With an ever increasing demand for functional assays of cyclic peptides in mammalian cells for drug discovery, a modification of SICLOPPS to improve the efficiency would help elucidate compounds against mammalian-specific targets.
Thus, there exists a need for a modified SICLOPPS approach that generates cyclic peptides in mammalian cells in a more efficient manner.
Kinsella et al 2002 JBC 277: 37512-37518 describes the production of cyclic peptide libraries with up to 160,000 members in mammalian cells using Ssp SICLOPPS inteins with no mention of, or investigation into toxicity associated with the intein.
The toxicity of inteins in mammalian cells was not investigated by Kinsella et al. The inventors have surprisingly found that mammalian cells are also susceptible to toxicity arising from active inteins. Prior to this, the problem of intein associated toxicity in mammalian cells was not recognised. The inventors have devised a degradation tag system suitable for use in mammalian cells and which obviates the intein-associated toxicity allowing the split intein system to be widely used in mammalian cells for the production of cyclic peptides.
This invention is based on the surprising discovery that it is possible to alter a mammalian cell-based SICLOPPS methodology to include intein-attached degradation tags in order to minimise any resultant intein-induced cytotoxicity. The attachment of a degradation tag to either the N-terminus or C-terminus intein domain will allow the canonical active intein, following splicing and cyclisation of the extein of interest, to be directed for degradation via the mammalian cell's degradation pathway. The approach therefore prevents a detrimental accumulation of cytotoxic inteins from being formed during cyclic peptide production within mammalian cells. Such degradation-tagged inteins can therefore be used in a modified SICLOPPS methodology to produce a cyclic peptide library within mammalian cells with greater efficiency. Importantly, the intein is able to splice before it is degraded.
Thus in a first aspect of the invention, there is provided a method for the non-toxic production of a cyclic peptide in a mammalian cell comprising: a) introducing a vector into a mammalian cell, wherein the vector comprises a construct encoding a C-terminus intein domain, a polypeptide sequence to be cyclised, an N-terminus intein domain, and a degradation tag, wherein the degradation tag is attached to at least one intein domain; and b) expressing the construct to produce an intermediate comprising an active intein and the polypeptide sequence, wherein the active intein, once formed, undergoes splicing and cyclises the polypeptide and wherein the degradation tag degrades the active intein.
The invention also provides a mammalian cell produced by the method of the first aspect of the invention. For example, the invention provides a cell expressing a cyclic peptide wherein the mammalian cell is produced by a method comprising:
The invention also provides a library of mammalian cells produced according to the first aspect of the invention, i.e. the library comprises a number of mammalian cells each comprising a different nucleic acid that encodes a different cyclic peptide, where the nucleic acid is a nucleic acid of the invention as described herein. In some embodiments the library of mammalian cells produced by the method according to the first aspect of the invention comprises at least 128,000 at the cyclic peptide level, optionally for example at least 150,000 or at least 200,000 members at the cyclic peptide level. As the skilled person will appreciate, it is possible to produce a library comprising millions of different genetic constructs, but if the protein or peptide (in this case the cyclic peptide) that the gene encodes is toxic to the cells, then those cells will be lost, reducing the number of members in the library at the protein level. For example, if a library of 2 million members is produced at the genetic level, but the active intein is toxic, only for example 1 million members expressing cyclic peptides would be obtained. Since the present invention addresses the toxicity associated with inteins it is possible to produce much larger cyclic peptide libraries, or much larger libraries of mammalian cells produced according to the method of the first aspect of the invention.
In some embodiments the library of mammalian cells produced by the method according to the first aspect of the invention comprises at least 128,000 members, for example at least 130,000, 150,000, 200,000, 300,000, 400,000, 500,000, 600,000, 700,000, 800,000, 900,000, 1 million, 1.5 million, 2 million, 2.5 million, 3 million, 3.2 million, 3.5 million or at least 4 million members at the protein level.
By at the protein level, we include the meaning of the level of the expressed cyclized peptide level.
It will be clear that whilst the use of split inteins to produce a cyclic peptide in a mammalian cell as reported in the art will result in a mammalian cell that comprises active inteins, that may be toxic, the cells of the present invention due to the use of the degradation tag comprise no active inteins, or substantially no active inteins, for example no or substantially no toxic active inteins.
The invention also provides a cell lysate prepared from a mammalian cell of the invention.
In a second aspect of the invention, there is provided a cyclic peptide library produced by the method according to the first aspect of the invention.
In a third aspect of the invention, there is provided a genetic construct comprising a polynucleotide cassette encoding a C-terminus intein domain, a polypeptide sequence to be cyclised, an N-terminus intein domain, and a degradation tag suitable for use in mammalian cells, wherein the degradation tag is attached to at least on intein domain and wherein, once expressed, an active intein is formed.
In a fourth aspect of the invention, there is provided a vector comprising the genetic construct according to the third aspect of the invention.
In a fifth aspect of the invention, there is provided a mammalian cell comprising the vector according to the fourth aspect of the invention or the genetic construct according to the third aspect of the invention. The invention also provides a library of mammalian cells comprising the vector according to the fourth aspect of the invention or the genetic construct according to the third aspect of the invention. In some embodiments the library of mammalian cells comprises at least 200,000 members, for example at least 300,000, 400,000, 500,000, 600,000, 700,000, 800,000, 900,000, 1 million, 1.5 million, 2 million, 2.5 million, 3 million, 3.2 million, 3.5 million or at least 4 million members.
In a sixth aspect of the invention, there is provided a method of producing a cyclic library according to the method of the first aspect of the invention.
The invention is illustrated with reference to the following drawings, in which:
The invention is predicated on the surprising discovery that the addition of a degradation tag to a SICLOPPS-based methodology allows for the production of cyclic peptides within mammalian cells without generating an accumulation of cytotoxic intein by-products. Cyclic peptides can thus be produced in mammalian cells without the low efficiency levels normally associated with such a method.
The term “cyclic peptide” as used herein refers to a polypeptide or protein that has been “cyclised”, in which its constituent atoms form a ring. For example, a linear peptide is cyclised when its free amino (N)-terminus is covalently bonded to its free carboxy (C)-terminus, i.e. in a head to tail format, such that no free C- or N-termini remain in the peptide. As referenced herein, the terms “peptide” and “polypeptide” can be used interchangeably.
The invention provides a method of producing cyclic peptides within mammalian cells by an altered SICLOPPS methodology to incorporate the use of degradation tags.
The term “mammalian cell” refers to a eukaryotic cell with structurally defined intracellular organisation, as opposed to bacteria and archaea. Mammalian cells are often used in cell culture, for example the use of Chinese Hamster Ovary (CHO) cells, the most common mammalian cell line used for mass production of therapeutic proteins (Wurm 2004, Nat Biotech 22(11): 1393-1398).
SICLOPPS or the “split-intein circular ligation of peptides and proteins” takes advantage of intein splicing for the generation of a cyclic peptide.
As used herein, the word “intein” means a naturally-occurring or artificially constructed polypeptide sequence embedded within a precursor protein that can catalyse a splicing reaction during post-translation processing of the protein. An intein can excise itself from the precursor protein and join the remaining portions with a peptide bond in a process named “splicing”. A “split-intein” is an intein that has two or more separate components not fused to one another, encoded by two separate genes. In some cases, the split-intein components will flank a polypeptide sequence therein referred to as an “extein”. When flanking an extein, the split-intein components are referred to as an N-terminus and C-terminus intein domain in respect to the N—and C-termini of the extein.
In some embodiments of all aspects of the invention the intein is an intein that is toxic to mammalian cells. By toxic we include the meaning that the intein negatively affects the growth rate of the mammalian cells, or causes apoptosis or necrosis.
The mammalian cells can be any mammalian cells in which it is desirous to express a cyclic peptide.
More than 350 types of inteins are recognised at present, each or which can have differing rates of catalysing a splicing reaction. The nomenclature of inteins is based on the scientific name of the organism to which it is found. Ssp inteins, for example, were first isolated from Synechocytis spp, whilst faster splicing Npu inteins were first isolated from Nostoc punctiforme. A database comprising a list of some of the known inteins can be found at http://www.biocenter.helsinki.fi/bi/iwai/InBase/tools.neb.com/inbase/list.html.
The intein used in the invention may be any intein that would splice faster than its degradation time by an attached degradation tag. The skilled person can select an appropriate intein for use with a corresponding degradation tag.
In one embodiment the intein may be a Cfa intein. In a preferred embodiment the intein may be a split Cfa intein comprising the amino acid sequences of SEQ ID NO: 5 and SEQ ID NO: 6 for the C-terminus and N-terminus intein domains, respectively.
In another preferred embodiment the intein may be an Ssp intein. In a further preferred embodiment the intein may be a split Ssp intein comprising the amino acid sequences of SEQ ID NO: 1 and SEQ ID NO: 2 for the C-terminus and N-terminus intein domains, respectively.
In another preferred embodiment the intein may be a gp41-1 intein. In a further preferred embodiment the intein may be a split gp41-1 intein comprising the amino acid sequences of SEQ ID NO: 7 and SEQ ID NO: 8 for the C-terminus and N-terminus intein domains, respectively.
In yet another preferred embodiment the intein may be an Npu intein. In a most preferred embodiment the intein may be a split Npu intein comprising the amino acid sequences of SEQ ID NO: 3 and SEQ ID NO: 4 for the C-terminus and N-terminus intein domains, respectively.
The process of SICLOPPS is known in the art, as referenced by Tavassoli 2017, Curr Opin Chem Biol 38: 30-35. A SICLOPPS method makes use of a construct containing a C-terminus intein domain followed by an extein polypeptide sequence to be cyclised, and an N-terminus intein domain. The construct is arranged as such so that, following translation of the mRNA sequence, the N- and C-terminus intein domains flanking the intervening extein are capable of non-covalently associating to form a functional active intein (
As shown in
Such a method thus results in the production of a cyclic peptide and a bi-product comprising the now unneeded intein polypeptide.
Thus in some embodiments of the invention there is an altered SICLOPPS method that uses a polypeptide construct comprising an Npu split intein that may comprise the following sequence:
Wherein X ˜˜˜˜˜ is the extein and cyclic peptide to be produced; X is C, S, or T, and “˜” denotes an amino acid of the cyclic peptide sequence. It is necessary for the functioning of the splicing that the first position may be occupied by an invariant cysteine, serine, or threonine residue. It will be apparent to one skilled in the art that any sequence may be inserted after “X” in the sequence above. The sequence may be one or more amino acids in length.
In a preferred embodiment the sequence may be three or more amino acids in length. In a most preferred embodiment the sequence may be at least six amino acids in length.
The above sequence comprises the following constituents:
An optional hexahistidine (6xHis) tag to assist in purification, or any other such affinity tag, may be included within the construct. Thus in one embodiment the base SICLOPPS construct will further include an affinity tag. In a preferred embodiment, the construct will include a 6xHis tag. In yet another preferred embodiment, the construct will include a 2xStrep tag.
Comprising the C-terminus intein domain.
Comprising the N-terminus intein domain.
In some embodiments the SICLOPPS construct may be further modified to include a tag for antibody recognition. In a preferred embodiment, the tag for antibody recognition may be a FLAG tag.
The invention provides an altered SICLOPPS method wherein the SICLOPPS construct, as exemplified above, is modified with the addition of a degradation tag, suitable for use in mammalian cells, attached to either of the N-terminus or C-terminus intein domains.
The term “degradation tag” is intended to encompass peptide sequences that mark a protein for degradation by a cell's degradation machinery. In mammalian cells, a major pathway of selective protein degradation is the ubiquitin-proteasome pathway. Ubiquitin-dependent protein degradation has a natural role in many biological processes, including signal transduction, cell cycle progression, and transcriptional regulation (Groulx & Lee 2002, Mol Cell Biol 22(15): 5319-5336).
Ubiquitin is a small regulatory protein that can be added to a substrate protein in a process called ubiquitination. The conjugation of ubiquitin is an ATP-dependent process that involves three enzymes: E1 and E2 proteins prepare the ubiquitin for conjugation; E3 ubiquitin ligases recognise the specific protein substrate to catalyse the transfer activated ubiquitin molecule. Once a protein is tagged with a single ubiquitin molecule, other E3 ubiquitin ligases are signalled to attach further ubiquitin molecules, resulting in a polyubiquitin chain attached to the substrate protein.
Proteins tagged for ubiquitination are subsequently targeted to a cellular proteasome complex wherein the ubiquitin chain is recognised by the proteasome, and the bound proteins are degraded into peptides of seven to eight amino acids long. Degradation tags can thus function by engaging the tagged protein with an E3 ubiquitin ligase resulting in the addition of a polyubiquitin chain and subsequent degradation by the proteasome.
In one embodiment of the invention there is provided an altered SICLOPPS method that uses a construct as described above which may comprise a degradation tag attached to either the N-terminus or C-terminus intein domain.
In a preferred embodiment the attached degradation tag may effect degradation at least in part by ubiquitination.
In another preferred embodiment the attached degradation tag may be a hypoxia-inducible factor-1 alpha (HIF-1α) subunit. In a further preferred embodiment the attached degradation tag may comprise the oxygen dependent degradation domain of HIF-1a that engages an ubiquitin ligase complex. In a most preferred embodiment the attached degradation tag may comprise the amino acid sequence according to SEQ ID NO: 10.
In yet another embodiment the attached degradation tag may be a tag or proteolysis targeting chimera that engages an E3 ubiquitin ligase for protein degradation.
Hypoxia-inducible factors (HIFs) are heterodimeric transcription factors comprising a constitutively expressed HIF-1B subunit and a HIF-1a subunit regulated by oxygen. HIF-1a comprises an oxygen dependent degradation (ODD) domain that contains a key proline residue P564 that is hydroxylated in normoxia to target the HIF-1a subunit for proteasomal degradation by engaging an ubiquitin ligase complex. The ubiquitin ligase complex comprises a von Hippel-Lindau tumour suppressor protein (VHL) responsible for recognising the hydroxylated P564 residue of the ODD domain of HIF-1a.
The addition of a P564-comprising sequence from the ODD domain of HIF-1a as a degradation tag to the SICLOPPS construct polypeptide thus induces degradation of the attached protein. In the present invention, upon expression of the SICLOPPS construct, the active intein, following its self-excision and splicing and cyclisation of the extein of interest, is attached from either its N-terminus or C-terminus domain to the P564-comprising polypeptide of the ODD domain. The hydroxylated P564 residue is recognised by the VHL from an ubiquitin ligase complex, and ubiquitinated and degraded by the proteasome along with the attached cytotoxic active intein.
Thus in some embodiments, the polypeptide construct as described above for use in the altered SICLOPPS methodology may further contain amino acids 548-603 of the full length ODD domain of HIF-1α, containing the key P564 for hydroxylation, attached to the N-terminus intein domain, as comprised by the following sequence:
Wherein:
Comprises amino acids 548-603 of the full length ODD domain of HIF-1α.
A proteolysis targeting chimera (PROTAC) can similarly function as a degradation tag. A PROTAC is a small molecule comprising two covalently linked protein-binding domains. One domain is capable of engaging an E3 ubiquitin ligase, whilst the other binds to a target protein meant for degradation. The incorporation of a PROTAC as a degradation tag for either the N-terminus or C-terminus thus results, upon expression of the SICLOPPS construct, in the recruitment of an E3 ubiquitin ligase to the excised active intein to result in its ubiquitination and proteasomal degradation.
It is envisaged that the degradation of active inteins will enable the production of cyclic peptides within mammalian cells without an increase in cytotoxicity as shown to be associated with the active inteins.
The term “cytotoxicity” as used herein refers to a toxic quality of a compound towards cells which may result in such cells undergoing necrosis, wherein they die rapidly due to cell lysis from a loss of membrane integrity, or apoptosis, wherein the cells undergo programmed cell death. Cytotoxic effects would have an impact on the level of efficiency in utilising mammalian cells for the purposes of producing cyclic peptides, for example, as would be understood by the skilled reader.
A degradation tag may induce degradation of the protein or polypeptide it is attached to regardless of whether the attachment is direct in sequence or via a linker. Such linkers, or spacers, are short amino acid sequences that vary between 2 and 31 amino acids, implemented to separate multiple domains of a single protein.
In one embodiment of the invention there is provided an altered SICLOPPS method using a construct as described above wherein the degradation tag is attached to either the N-terminus or C-terminus intein domain either through a direct linkage or via a linker. In a preferred embodiment, the degradation tag is attached via a direct linkage.
It is envisaged that a compatible degradation tag will be incorporated into the SICLOPPS construct relative to the type of inteins used. It would be apparent to the skilled reader that the active intein need splice before it is degraded by the included degradation tag, or in other words, the degradation tag needs to induce degradation slower than the intein splices.
It is common practice within the art to optionally include a fluorescent tag within a peptide-encoding construct for experimental visualisation purposes. It is envisaged that such fluorescent tags may be attached to the degradation tags within the SICLOPPS construct; hence any degradation of the resultant expressed tagged proteins could be visualised using fluorescence microscopy or other such techniques known within the field. A wide range of fluorescent tags, or fluorophores, exist that are suitable for use in microscopy. A comprehensive list of fluorophores is available at https://www.biosyn.com.
In some embodiments of the invention there is provided an altered SICLOPPS method using a construct as described above with the addition of any fluorescent tag. In another embodiment the fluorescent tag is attached to the degradation tag incorporated within the SICLOPPS construct. In a preferred embodiment, the fluorescent tag is a DsRed fluorophore.
It is envisaged in the present method that the modified SICLOPPS construct will be introduced into mammalian cells within any suitable expression vector that can facilitate expression of the polynucleotide.
As used herein, the phrase “expression vector” means a vehicle that facilitates transcription and/or translation of a nucleic acid molecule in a suitable in vitro or in vivo system. An expression vector is “inducible” when adding an exogenous substance to a host system containing the expression vector causes the vector to be expressed, for example causing a nucleic acid molecule within the vector to be transcribed into mRNA.
Such suitable vectors include plasmids (
Expression vectors used within the method can include a stretch of nucleotides that encodes a target polypeptide construct and a stretch of nucleotides that operate as a regulatory domain that modulates or controls expression of nucleotide sequences within the vector. For example, the regulatory domain can be a promoter or an enhancer.
In some embodiments, the expression vectors used within the method are produced with restriction sites both between and within the nucleic acid sequences that encode the split intein portions to enable the cloning of a wide variety of cyclisation targets or splicing intermediates. In some embodiments, an expression vector of the invention can be an inducible expression vector such as an arabinose inducible vector. Such expression vectors can be generated using standard molecular biology techniques as would be known to the skilled reader. Plasmids can be transfected into mammalian cells for transient expression through several well practiced techniques within the art, such as chemical-based or electroporation-based transfection.
It is envisaged that an expression vector used within the present invention will comprise a suitable origin of replication (ORI) for use in mammalian cells. Since there are no “natural” mammalian ORIs, viral-based ORIs are often used for expression vectors intended for mammalian cells, such as viral Epstein-Barr virus (EBC) or SV40 ORIs.
Thus in one embodiment there is provided an altered SICLOPPS method that utilises a plasmid comprising a modified SICLOPPS construct as outlined above suitable for expression within mammalian cells. In a preferred embodiment the plasmid comprises a SV40 origin of replication suitable for expression within mammalian cells.
In a second aspect, the present invention provides a cyclic peptide library produced by the altered SICLOPPS method according to the first aspect of the invention.
The term “cyclic peptide library” refers to a multiple of compartmentalised cyclic peptides, often containing over 100 million peptide members.
Each member of the library may be expressed within a mammalian cell from a unique plasmid as described in the first aspect of the invention. It is envisaged that the library will contain great numbers of randomised cyclic peptides from each expressed plasmid, generated as such so that the polypeptide of interest, or extein, is randomised. The randomised polypeptide is essentially a degenerate oligonucleotide, wherein “degenerate” refers to its sequence containing a number of possible nucleotide bases. The resultant library would in theory contain a great number of possible cyclic peptide structures that may subsequently be used for pharmaceutical assays and other research purposes. The generation of cyclic peptide libraries within cells allows for functional assays to be conducted against a variety of targets.
SICLOPPS-based libraries, as outlined above, are DNA-encoded, which gives a large amount of control over the makeup of the library and allows a variety of libraries to be easily produced and screened against a given target (
The length of the randomised polynucleotide inserted into the vector will be dependent on various factors that may be determined by the skilled person. Of primary consideration is the size of the ultimate polypeptide expressed, and subsequent ring size of the cyclic peptide. In a preferred embodiment, the polypeptide is 6 amino acids in length. A suitable randomised polynucleotide would therefore be 18 nucleic acids in length. For cyclic peptide formation, consideration must be given to whether the length of the polypeptide is sufficient to allow the cyclisation reaction to proceed, i.e. whether the length allows a closed peptide cycle to form. In some embodiments, the peptide is cyclised by a linker of any length. Therefore, cyclic polypeptides may be achieved by encoding just two amino acids, in which case the randomised polynucleotide will be at least 6 nucleic acids in length. Another consideration is the maximum insert size tolerated by the vector and corresponding replication system.
In some embodiments, the randomised sequence may be longer, for example, at least 9, 30, 60, 90, 180, 300, 600, 900, 1,800, 3,000, or more nucleic acids in length. In preferred embodiments, the randomised nucleotide sequence is 6, 9, 12, 15, 18, 21, 24, 27, or 30 nucleotides in length. Although the randomised sequence is intended to encode a polypeptide, its length may not necessarily be a multiple of 3. For example, it may be 7, 8, 10, 11, 13, 14, 16, 17, 19, 20, 22, 23, 25, 26, 28, or 29 nucleotides in length. The randomised polynucleotide sequence may also be referred to herein as the variable sequence. In embodiments, one or more positions of the “random” or “variable” sequence may actually be fixed. For example, in such a SICLOPPS method, the first position may be occupied by an invariant cysteine, serine, or threonine residue, followed by a variable or random amino acid sequence as described within the first aspect of the invention.
In a third aspect of the invention, there is provided a genetic construct comprising a polynucleotide cassette encoding a C-terminus intein domain, a polypeptide sequence to be cyclised, an N-terminus intein domain, and a degradation tag suitable for use in mammalian cells, wherein the degradation tag is attached to at least on intein domain and wherein, once expressed, an active intein is formed.
In some embodiments, the genetic construct may further contain any of the modifications or specifications according to the first aspect of the invention.
In a fourth aspect of the invention, there is provided a vector comprising the genetic construct according to the third aspect of the invention.
In a fifth aspect of the invention, there is provided a mammalian cell comprising the vector according to the fourth aspect of the invention.
In a sixth aspect of the invention, there is provided a method of producing a cyclic library according to the method of the first aspect of the invention
In order that the invention may be more clearly understood, embodiments thereof will now be described by way of example with reference to the accompanying figures.
A SICLOPPS construct was designed with the addition of a degradation domain to the N-terminus intein domain, resulting in the depletion of the spliced intein product to prevent toxicity in mammalian cells. Cfa inteins were used for fast splicing and high promiscuity, containing an ERD to GEP mutation at residues 122-124 for increased amino acid tolerance at the +2 residue. The construct was designed with the following sequence:
The extein, in which eGFP was utilised in this example (+N-terminus 2×Strep tag). Wildtype Cfa splice junctions (CFN & AEY) were incorporated to improve efficiency:
CFNWSHPQFEKGGGSGGGSGGSAWSHPQFEKGGSGGEFMVSKGEELFTG
The oxygen dependent degradation (ODD) domain from HIF-1α, comprising the key residue P564 for hydroxylation, was incorporated C-terminus to N-terminus intein domain:
The fluorescent protein mCherry was further fused C-terminal to ODD domain, followed by a FLAG tag for antibody recognition:
K
Two versions of an eGFP extein plasmid (
1. Wildtype or WT—will splice and degrade.
2. P564G—mutation of proline 564 in the ODD domain prevents degradation.
The two plasmids from example 1 were independently transfected into Hela cells which were subsequently placed in the presence of oxygen without or with 100 UM of DFX treatment, in order to prevent the degradation of the inteins. In the wildtype plasmid, the inteins should only degrade in the presence of oxygen and be stabilised in the absence of oxygen or in the presence of DFX, due to inhibition of the HIF prolyl hydroxylase domain (PHD). In the P564G mutant, in comparison, the inteins should never degrade, either in the presence of oxygen or with DFX treatment. The proline has been mutated into a glycine, hence preventing the proline from being hydroxylated by PHDs and the subsequent degradation of the protein.
Hela cells were imaged using a Zeiss fluorescent microscope to determine whether the extein (GFP) and the inteins (mCherry-tagged) were present depending upon the conditions.
As shown in
The wildtype and P564G plasmids from examples 1 and 2 were transfected into HeLa cells which were then incubated for 24 hours in normoxia, hypoxia, or DFX-treated conditions. Cells were then lysed using RIPA buffer and scrapers. Total protein lysates were analysed by western blot, the results of which are shown in
For the wildtype plasmid, hypoxia and DFX conditions displayed more mCherry, showing no degradation of the inteins. Overall, the results show that there was more GFP (associated with the extein) in the WT than P564G mutant, under all conditions. Actin, used as a control, was comparable overall between the wells.
Cell counts were carried out from trypsinised cells that had been independently transfected with the two plasmids from the previous examples, in which the Hela cells had been grown under normoxia and hypoxia conditions. The trypsinised cells were re-suspended in complete media. Cell suspensions were then analysed using a MOXI cell counter to count live and dead cells. Time points were taken at 0h, 6h, 12h, 24h, 36h and 48h in triplicate or duplicate.
As shown in
Transient Transfection of GFP-Npu-mCherry-ODDD Constructs, Splicing (WT) and Non-Splicing (C1A) into Trex Cells
Procedure: 1.2×106 Trex293 cells were plated into 6 cm dishes. Next day, cells were transfected with 1 μg of plasmid and one well remained non-transfected to use as negative control to set fluorescence gates. Next day, media was changed, 1 ug/mL of doxycycline was added to each dish and in the dishes of treated cells, DFX was added to a final concentration of 100 μM. Cells were analysed by FACS on the following day. Results: We investigated whether the inteins fused to mCherry and the Oxygen-Dependent-Degradation-Domain (ODDD) were degrading in presence of oxygen and if addition of DFX could prevent this oxygen-dependent degradation. When cells were analysed by FACS, the GFP splicing version (
Stable Integration of GFP-Npu-mCherry-ODDD Construct Splicing (WT) into Trex Cells
Procedure: Trex293 cells stably integrated with GFP-WT-Npu-mCherry-ODDD were plated in 6-well plates. Cells were then treated with doxycycline to induce the expression of the inteins. One condition was treated with DFX (100 μM) and the other one remained untreated. On the following day, cells were analysed by FACS Results: We investigated whether the inteins fused to mCherry and the Oxygen-Dependent-Degradation-Domain (ODDD) were degrading in presence of oxygen and if addition of DFX could prevent this oxygen-dependent degradation. When the cells were analysed by FACS, in the absence of DFX (
Viability Assay of Trex293 Cells Integrated with Constructs Encoding CFA Inteins-Peptide Extein—ODDD—mCherry, Splicing (WT) and Non-Splicing (C1A)
Procedure: Trex cells integrated with CFA inteins—peptide extein—ODDD—mCherry (splicing and non-splicing) were plated at a density of 1000 cells per well in 96-well plates. Next day, media was changed and replaced with fresh media containing doxycycline (1 μg/mL) with or without DFX (100 μM final concentration). Next day, cell viability was measured using Cell Titer Glo Assay.
Results: This viability assay confirms that upon treatment with dfx, shown to result in a decrease in the degradation of the inteins, cell viability significantly decreases (
Sequences used throughout the specification and forming part of the description:
Number | Date | Country | Kind |
---|---|---|---|
1918693.1 | Dec 2019 | GB | national |
This application is a national phase application under 35 U.S.C. § 371 of International Application No. PCT/GB2020/053263, filed Dec. 17, 2020, which claims the benefit of Great Britain Patent Application No. 1918693.1, filed Dec. 18, 2019, the entirety of each of which is incorporated herein by reference. A sequence listing is submitted herewith as an ASCII compliant text file named “EPCLP0103US_ST25.txt”, created on Dec. 27, 2022 and having a size of 26,342 bytes. The content of the aforementioned file is hereby incorporated by reference in its entirety.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/GB2020/053263 | 12/17/2020 | WO |