The present disclosure relates to compositions and methods that permit the controlled interaction of polypeptides to which a target protein and binding members are fused to. The compositions and methods make use of a target protein that binds to a small molecule to form a complex and a binding member that specifically binds the complex, wherein the target protein is derived from a non-human protein and the small molecule is an inhibitor of the non-human protein. The non-human protein may be derived from a bacterial, viral, fungal or protozoal protein. The non-human protein may be derived from a viral protease and the small molecule is a viral protease inhibitor. The present disclosure also relates to dimerization-inducible proteins, such as split transcription factors and split chimeric antigen receptors, that contain the target protein and binding member. The methods and compositions described herein find application, for example, in cell and gene therapy methods that involve the controlled expression and/or activation of proteins.
Protein-protein interactions (PPIs) represent a universal regulatory mechanism that controls multiple biological functions. For example, gene transcription, protein folding, protein localisation, protein degradation, and signal transduction all rely on the interaction or proximity of one protein to another, or indeed several others. By temporally controlling protein-protein interactions, researchers can readily monitor the functional consequences of a PPI, enabling the dissection of complex biological mechanisms. Furthermore, the ability to control biological functions are being utilised in cell and gene therapy to control therapeutic activity, enabling safer and more personalised therapies.
A commonly used technique for controlling protein-protein interactions is to use so-called chemical inducers of dimerization (CID), small molecules that bring together two proteins that do not interact in the absence of the CID, to form a tripartite ternary complex (Stanton, Chory, and Crabtree 2018). The most widely used CID is rapamycin (an immunosuppressive drug derived from Streptomyces hygroscopicus) and analogues thereof, that forms a heterodimeric complex with the proteins FKBP12 (12-kDa FK506-binding protein) and FRB (a domain from mTOR (mammalian target of rapamycin)) (Sabers et al. 1995).
An attractive feature of rapamycin, along with other naturally-occurring CIDs, such as the plant hormones S-(+)-abscisic acid (ABA) and gibberellin (GA3-AM), is its co-operative binding mechanism whereby protein 2 can only bind to the protein 1:CID complex ((Banaszynski, Liu, and Wandless 2005). De novo CIDs have also been generated through the chemical linkage of two small molecules that bind the same, or different proteins, with these proteins constituting the dimerization protein pair (Belshaw, Ho, et al. 1996; Belshaw, Spencer, et al. 1996). In these systems however, at high concentrations of the bi-functional CID, non-productive complexes between one protein partner and the CID out-compete the production of tripartite complexes, meaning that a linear dose-response cannot be achieved.
As such, there is a growing urgency for new co-operative binding CID systems that can be used to regulate cellular function and to expand the number of orthogonal systems that can be used in complex genetic circuits. Furthermore, there are very few CIDs that have been approved for chronic human use. Recently, a method to generate de novo CID systems (AbCIDs) using antibody-based phage display selection methods was described (Hill et al. 2018). The CID used in that study was ABT-737, a Bcl-2 and Bcl-xL inhibitor, and Bcl-xL itself was employed as one of the protein partners. The second protein was then selected from a phage display library of single chain Fab (scFab) molecules to be selective for the Bcl-xL:ABT-737 complex over Bcl-xL alone.
The approach described in Hill et al. 2018 and WO 2018/213848 A1 of identifying complex-specific molecules by utilising existing small molecules and their targets is an attractive one, however, the overexpression of certain human proteins (e.g. the anti-apoptotic Bcl-xL protein) and use of small molecules that bind to human targets within the body is not without its risks. For example, overexpression of a functional human protein will have consequences for the cells in which it is expressed, which could impact cell health and viability. Additionally, the use of small molecules whose targets are expressed in the body, can result in an increased dose requirement due to the competition of binding of the small molecule to the endogenous target and the overexpressed target. Moreover, the binding of the small molecule to the endogenous target will affect the function of that protein that may be detrimental to the cells in which the target is expressed.
Disclosed herein is an approach aimed to overcome the limitations of the AbCID system as described by Hill et al. Firstly, the small molecules described herein are those that have already been approved for human use, to facilitate a smoother path to regulatory approval. Secondly, and importantly, rather than identifying small molecules with human targets, the inventors recognised that there were advantages associated with selecting small molecules that bind to non-human proteins, in particular viral proteins. For example, the use of a small molecule that does not have a human target is expected to improve safety when used in humans. It was also reasoned that the use of viral, bacterial, fungal or protozoal target proteins would remove the risk of an endogenous small molecule “sink” when used in a human, where the small molecule binds to endogenous targets in the human in addition to binding to the target protein. Furthermore, the expression of a viral, bacterial, fungal or protozoal protein within human cells is less likely to impact the cellular physiology of the cell than a human protein, that has endogenous function, would.
Antivirals have been approved that bind to and inhibit various viral proteins including viral polymerases, integrases, transcriptases and proteases. The present inventors recognised that target proteins derived from viral proteases in particular would be beneficial as these proteases are cytoplasmically located, are smaller, and consist of discrete domains.
Thus, the present disclosure provides one or more expression vectors comprising:
As demonstrated herein, binding of the binding member to the T-SM complex forms a tripartite complex made up of the binding member, target protein and small molecule and the formation of this tripartite complex can be controlled by the presence of the small molecule. The controlled formation of the tripartite complex is useful as, for example, it permits the controlled interaction of polypeptides to which the target protein and binding member are fused to.
The present disclosure also provides a system comprising:
In some embodiments, the viral protease is an HCV NS3/4A protease or HIV protease. These proteases are known to be targeted by several approved small molecules that are known to be generally well tolerated in humans and suitable for chronic dosing and therefore represent suitable target proteins for use herein.
In some embodiments, the viral protease is an HCV NS3/4A protease such as the protease having the amino acid sequence of SEQ ID NO: 1. The HCV NS3/4A protease is a small, monomeric protein that can be expressed cytoplasmically and has a limited number of endogenous human targets, therefore making it an ideal target protein.
In some embodiments, the small molecule is selected from the group consisting of simeprevir, asunaprevir, vaniprevir, boceprevir, narlaprevir, and telaprevir. All these small molecules are approved for treatment in humans. In some embodiments, the small molecule is selected from the group consisting of simeprevir, boceprevir, and telaprevir. These small molecules are approved for treatment in humans and are generally well tolerated in humans.
In some embodiments, the small molecule is simeprevir. Simeprevir (Olysio®) is a small molecule that is administered orally, is cell-permeable, and has a pharmacokinetics (PK) profile that supports once-daily dosing. It has been used chronically (up to 39 months) to treat HCV infection in combination with ribavirin and pegylated interferon, and is on the WHO essential medicines list, indicative of a well-tolerated and widely administered drug.
The inventors made the realisation that any potential off-target activity caused by overexpression of the viral protease could be mitigated by using target proteins that have attenuated viral activity compared to the viral protease from which it is derived. Thus, in some embodiments the target protein has attenuated viral activity compared to the viral protease from which it is derived.
For example, the target protein may contain one or more amino acid mutations compared to the viral protease from which it is derived. In particular embodiments where the viral protease is an HCV NS3/4A protease, the target protein may have an amino acid mutation at one or more amino acids selected from positions 72, 96, 112, 114, 154, 160 and 164, wherein the amino acid numbering corresponds to SEQ ID NO: 1. For example, the target protein may have an amino acid mutation at position 154, such as a mutation to alanine, wherein the amino acid numbering corresponds to SEQ ID NO: 1. As described below, positions 72, 96, 112, 114, 154, 160 and 164 of SEQ ID NO: 1 correspond to positions 57, 81, 97, 99, 139, 145 and 149, respectively, of the full length NS3 protein set forth in SEQ ID NO: 199. The examples refer to amino acid positions according to the amino acid numbering of the full length NS3 protein. For example, reference to a ‘S139A’ mutation in the examples corresponds to a ‘S154A’ mutation where the amino acid numbering corresponds to SEQ ID NO:1.
In some cases, it may be desirable that a competing small molecule is able to bind the target protein in the T-SM complex such that the competing small molecule is capable of displacing the small molecule in the T-SM complex, where the second small molecule is different to the small molecule in the T-SM complex. In this way, the second small molecule can decrease the half-life of the tripartite complex formed between the binding member, the target protein and the small molecule. This may be desirable, for example, in situations where it is considered useful to use the second small molecule to speed up dissociation of the tripartite complex, e.g. in order to rapidly inhibit activity of a dimerization-inducible protein activated by formation of the tripartite complex.
As demonstrated herein, simeprevir binds the target protein HCV NS3/4A protease (S139A) (SEQ ID NO: 2) with a very high affinity such that other small molecules that bind the target protein are unable to displace simeprevir from the T-SM complex. The inventors determined that certain affinity reducing mutations could be introduced in the target protein that reduce the affinity of simeprevir for the HCV NS3/4A protease and allow other small molecules to “compete” with simeprevir and disrupt the tripartite complex formed. Thus, in some embodiments where the viral protease is an HCV NS3/4A protease and the small molecule is simeprevir, the target protein may comprise an affinity reducing amino acid substitution at one or more amino acids selected from positions 151 and 183, wherein the amino acid numbering corresponds to SEQ ID NO: 1. In some embodiments, the affinity reducing amino acid mutation at position 151 is a mutation to aspartic acid, asparagine or histidine (e.g. aspartic acid or asparagine) and the affinity reducing mutation at position 183 is to glutamic acid, glutamine or alanine (e.g. glutamic acid). The target protein may comprise the affinity reducing amino acid mutation in addition to other mutations described herein, such as the amino acid mutation at one or more amino acids selected from positions 72, 96, 112, 114, 154, 160 and 164.
In some embodiments the binding member is an antibody molecule, such as a single-chain variable fragment (scFv), or an antibody mimetic, such as a Tn3 protein. In particular embodiments, the binding member is a Tn3 protein or an scFv, such as the Tn3 proteins and scFvs defined herein. Compared to the single chain Fabs (scFabs) used in the system described by Hill et al., both Tn3 proteins and scFvs are smaller in size. This may be advantageous, for example where the expression cassettes are being delivered by expression vectors that are limited in coding capacity such as viral vectors. Described herein is the development and use of particular Tn3 proteins and scFvs that bind to a complex between HCV NS3/4A protease and simeprevir, which are demonstrated to function as binding members in the context of the present disclosure. These Tn3 proteins and scFvs are termed HCV NS3/4A PR:simeprevir complex-specific binding (PRSIM) molecules.
It was realised that the approach described herein could be used where the target protein and binding member are individually fused to polypeptides (termed “component polypeptides”). In particular, it was realised that the approach could be implemented to control the activity of proteins that require dimerization or clustering to drive their activity. Such proteins are termed herein as “dimerization-inducible proteins” and include “split proteins”, “dimerization-deficient proteins” and “split complexes”. Split proteins comprise single proteins that can be segregated or split into two or more domains, rendering the component parts non-functional or minimally active; function or activity can be initiated or restored, however, when the separated component polypeptides are brought into close proximity. Examples include split fluorescent proteins (e.g. split GFP), split luciferases (e.g. NanoBiT) and split kinases. A further example describes a split transcription factor, whereby the distinct DNA binding (DBD) and activation domains (AD) are separated such that the individual transcription factor domains alone cannot initiate transcription. Only when the two domains are brought into close proximity are they able to reconstitute the transcriptional activation of relevant genes (i.e. they form a functional “transcription factor”). Dimerization-deficient proteins are proteins that require dimerization for activity, but their endogenous dimerization capacity has been disabled e.g. via mutation or removal of the dimerization domain(s). One such example is the iCasp9 molecule, a caspase 9 protein that has had its dimerization (CARD) domain removed. Split complexes denote either single proteins or 2 or more different proteins that are not optimally functional or function differently, until they are brought into close proximity or “clustered”. Once such example is the split chimeric antigen receptor (CAR). Here, specific intracellular domains of the CAR that are responsible for the activation of cell signalling are physically separated such that full cellular activation is prevented. Once the domains are brought into close proximity, cell signalling is activated (i.e. they form a fully functional CAR).
Thus, in some embodiments the target protein is fused to a first component polypeptide and the binding member is fused to a second component polypeptide. In preferred embodiments, the one or more expression vectors encode a dimerization-inducible protein, such as a split transcription factor or a split CAR.
In one embodiment: (1) the first component polypeptide comprises a DNA binding domain and is fused to the target protein to form a DBD-T (DBD-target protein) fusion protein; and the second component polypeptide comprises a transcriptional regulatory domain and is fused to the binding member to form a TRD-BM (transcriptional regulatory domain-binding molecule) fusion protein, or (2) the first component polypeptide comprises a transcriptional regulatory domain and is fused to the target protein to form a TRD-T fusion protein; and the second component polypeptide comprises a DNA binding domain and is fused to the binding member to form a DBD-BM fusion protein, wherein the first and second component polypeptide form a transcription factor upon dimerization.
In another embodiment, the first component polypeptide comprises a first co-stimulatory domain and is fused to the target protein; and the second component polypeptide comprises an intracellular signalling domain is fused to the binding member. The first component polypeptide may further comprise an antigen-specific recognition domain and a transmembrane domain; and the second component polypeptide further comprises a transmembrane domain and a second co-stimulatory domain, wherein the first and second component polypeptide form a chimeric antigen receptor (CAR) upon dimerization.
Alternatively, the first component polypeptide comprises an intracellular signalling domain and is fused to the target protein and the second component polypeptide comprises a first co-stimulatory domain and is fused to the binding member. The first component polypeptide further comprises a transmembrane domain and a second co-stimulatory domain; and the second component polypeptide further comprises an antigen-specific recognition domain and a transmembrane domain, and wherein the first and second component polypeptide form a chimeric antigen receptor (CAR) upon dimerization.
In another embodiment, the first component polypeptide comprises a first caspase component; and the second component polypeptide comprises a second caspase component, and the first and second component polypeptides form a caspase upon dimerization.
In some embodiments the one or more expression vector is a viral vector, such as an AAV vector.
The present disclosure also provides an in vitro method of making viral particles comprising transfecting host cells with the viral vector(s) defined herein and expressing viral proteins necessary for viral particle formation in the host cells; culturing the transfected cells in a culture medium, such that the cells produce viral particles.
The present disclosure also provides one or more viral particles comprising
The expression cassettes, target protein, small molecule, binding member in the one or more viral particles may be as further described herein. The target protein and binding member may be fused to a first and second component polypeptide, respectively, (e.g. for encoding a dimerization-inducible protein) as further described herein.
The viral particle may be an AAV particle.
In one aspect the present disclosure provides a binding member that specifically binds to a complex between i) a target protein derived from a non-human protein and ii) a small molecule that is an inhibitor of the non-human protein, wherein the binding member binds the complex at a higher affinity than it binds both the target protein alone and the small molecule alone. In one embodiment, the non-human protein is derived from a viral protein and the small molecule is an inhibitor of the viral protein. In one embodiment, the non-human protein is derived from a viral protease and the small molecule is a viral protease inhibitor. In another embodiment, the non-human protein is derived from a bacterial, fungal or protozoal protein. As described herein, such complex-specific binding members are useful as a way of controlling formation of a tripartite complex between the binding member, target protein and small molecule in a manner that overcomes the drawbacks of the binding molecules described by Hill et al.
In another aspect, the present disclosure provides dimerization-inducible proteins comprising the target proteins and binding members, as defined herein. The dimerization-inducible proteins may be a split transcription factor, a split CAR or a split caspase protein, for example.
In one aspect, the present disclosure provides cells, e.g. allogeneic or autologous cells, including stem cells, induced pluripotent stem (iPS) cells or immune cells, comprising one or more of the expression cassettes, expression vectors, binding members, target proteins or dimerization inducible proteins defined herein. The cells may express the binding member, target protein or dimerization-inducible protein described herein. The present disclosure also provides methods of genetically modifying a cell to produce cells expressing the binding member or dimerization inducible protein described herein, the method comprising administering expression vectors to the cell. This method may be carried out in vitro or ex vivo.
It was additionally recognised that the approach described herein where the target protein and binding member are fused to component polypeptides of a split transcription factor could have uses in gene therapy methods that involve regulating the expression of a desired expression product (e.g. a desired polypeptide) in a cell.
Thus, in one aspect the present disclosure provides a method of regulating the expression of a desired expression product in a cell, comprising:
In some embodiments of the method, the DNA binding domain target sequence is located in a promoter that is operably linked to a coding sequence for the desired expression product.
The method may involve delivery of the expression cassettes encoding the dimerization-inducible protein to control expression of a desired expression product that is also delivered exogenously to the cell.
Thus, in some embodiments, the method comprises administering a third expression cassette to a cell, wherein the third expression cassette encodes the desired expression product, and wherein the third expression cassette comprises the target sequence of the DNA binding domain.
Alternatively, the method may involve delivery of the expression cassettes encoding the dimerization-inducible protein to control expression of a desired expression product that is already present as part of the genome of the cell (i.e. an endogenous desired expression product).
Thus, in other embodiments of the method, the target sequence is located in the genome of the cell.
Furthermore, it was recognised that the approach described herein could have use in methods of cellular therapy. Such methods typically involve taking cells from an individual (autologous cells), modifying the cells ex vivo to express a particular protein, e.g. a dimerization-inducible protein, and administered back into the individual.
Thus, in another aspect the present disclosure provides a method of treatment, the method comprising:
In one aspect, the present disclosure provides nucleic acids encoding the binding members, target proteins and dimerization-inducible proteins as defined herein.
In one aspect the present disclosure provides kits, as defined herein.
It was additionally recognised that it would be possible to make use of an additional small molecule (termed herein as a “competing small molecule”) to induce disassembly of a tripartite complex formed between the binding member, target protein and small molecule. This may be useful, for example, where it is desirable to rapidly inactivate a chemical inducer of dimerization (CID) disclosed herein, such as in order to turn off transgene expression or therapeutic activity association with activity of a dimerization-inducible protein.
This, in another aspect the present disclosure provides a method of inducing disassembly of a tripartite complex, the method comprising administering a competing small molecule to a cell comprising the tripartite complex,
wherein the tripartite complex is formed between a binding member and a complex formed of a target protein and a small molecule (T-SM complex), wherein the binding member binds the T-SM complex at a higher affinity than it binds both the target protein alone and the small molecule alone, and
wherein the competing small molecule is capable of binding the target protein in the T-SM complex and displacing the small molecule from the T-SM complex.
Methods of determining whether the competing small molecule is capable of binding to the target protein in the T-SM complex and displacing the small molecule from the T-SM complex include assays where a pre-formed tripartiate complex is generated and the ability of the binding member to bind the T-SM complex is measured (e.g. by a homogeneous time-resolved florescence (HTFR) binding assay) as increasing concentrations of the competing small molecule are added. A competing small molecule may be capable to displaying the small molecule from the T-SM complex if it is capable of inhibiting binding of inhibiting the binding member from binding the T-SM complex by at least 50%, by at least 75%, by at least 80%, by at least 85%, by at least 90%, or by at least 95% when measured using the HTFR binding assay. In some embodiments, the competing small molecule is asunaprevir, paritaprevir, vaniprevir, grazoprevir, danoprevir or glecaprevir. The binding member, target protein and small molecule using in the method may be as further defined herein in relation to other aspects of the disclosure.
In particular embodiments, the target protein may be derived from an HCV NS3/4A protease and the small molecule in the T-SM complex may be simeprevir and, optionally, the binding member may be PRSIM_23. For example, the target protein may have an amino acid sequence having at least 90% identity to SEQ ID NO: 1. As demonstrated herein, simeprevir binds the target protein HCV NS3/4A protease (S139A) (SEQ ID NO: 2) with a very high affinity such that other small molecules that bind the target protein are unable to displace simeprevir from the T-SM complex. As further demonstrated herein, it is possible to introduce mutations in the HCV NS3/4A protease that reduce affinity for simeprevir to the HCV NS3/4A protease and allow for a competing small molecule to disrupt the tripartite complex formed between the HCV NS3/4A protease, simeprevir and the binding member PRSIM_23
Accordingly, in embodiments where target protein is derived from an HCV NS3/4A protease and the small molecule is simeprevir, the target protein may have an affinity reducing amino acid mutation (e.g. substitution) at one or more amino acids selected from positions 151 and 183, wherein the amino acid numbering corresponds to SEQ ID NO: 1. In some embodiments, the affinity reducing amino acid mutation at position 151 is a mutation to aspartic acid, asparagine or histidine, and the affinity reducing mutation at position 183 is to glutamic acid, glutamine or alanine. In some embodiments, the affinity reducing amino acid mutation at position 151 is a mutation to aspartic acid or asparagine and the affinity reducing mutation at position 183 is to glutamic acid. The target protein may comprise the affinity reducing amino acid mutation in addition to another amino acid mutation described herein (e.g. in addition to the amino acid mutation at position 154, such as to an alanine).
The present disclosure includes the combination of the aspects and preferred features described except where such a combination is clearly impermissible or expressly avoided.
Embodiments and experiments illustrating the principles of the present disclosure will now be discussed with reference to the accompanying figures in which:
Aspects and embodiments of the present disclosure will now be discussed with reference to the accompanying figures. Further aspects and embodiments will be apparent to those skilled in the art. All documents mentioned in this text are incorporated herein by reference.
An “expression vector” as used herein is a DNA molecule used for expression of foreign genetic material in a cell. Any suitable vectors known in the art may be used. Suitable vectors include DNA plasmids, binary vectors, viral vectors and artificial chromosomes (e.g. yeast artificial chromosomes). In certain embodiments, the expression vector is a viral vector as described in more detail below. In certain embodiments, the expression vector is a DNA plasmid.
An “expression cassette” as used herein is a polynucleotide sequence that is capable of effecting transcription of an expression product, which may be a protein. A “coding sequence” is intended to mean a portion of a gene's polynucleotide sequence that encodes the expression product. Where the expression product is a protein, this sequence may be referred to as a “protein coding sequence”. The protein coding sequence typically begins at the 5′ end by a start codon and ends at the 3′ end with a stop codon. The expression cassette may be part of an expression vector, or part of a viral genome in a viral particle, as described in more detail below.
Typically, the expression cassette comprises a promoter operably linked to a protein coding sequence. The term “operably linked” includes the situation where a selected coding sequence and promoter are covalently linked in such a way as to place the expression of the protein coding sequence under the influence or control of the promoter. Thus, a promoter is operably linked to the protein coding sequence if the promoter is capable of effecting transcription of the protein coding sequence. Where appropriate, the resulting transcript may then be translated into a desired protein.
Any suitable promoter known in the art may be used in the expression cassette providing it functions in the cell type being used. For example, where the cell is a mammalian cell, the promoter may be a cytomegalovirus (CMV) promoter. Where multiple expression cassettes are used, each coding sequence may be independently operably linked to its own promoter. Alternatively, the coding sequence for one or more of the expression cassettes may be operably linked to the same promoter.
Where multiple expression cassettes are described, e.g. a first and second expression cassette, they may be part of the same or different expression vectors. Thus, in some embodiments, the first and second expression cassettes may be located on the same expression vector. In other embodiments, the first expression cassette is located on a first expression vector and the second expression cassette is located on a second expression vector.
Where multiple expression cassettes are located on the same expression vector, the individual expression cassettes (e.g. first and second expression cassettes) may be separated by an Internal Ribosome Entry Site (IRES) or 2A element. The use of IRES or 2A elements allows multiple expression products to be expressed using the same promoter. In other words, when first and second expression cassettes are separated by an IRES or 2A element, both the first and second expression cassettes can be operably linked to the same promoter.
Aspects and embodiments of the present disclosure are directed to target proteins that are derived from a non-human protein, i.e. a protein that is not endogenous to a human. In one embodiment, the non-human protein is derived from a viral, bacterial, fungal or protozoal protein. In one embodiment, the non-human protein is derived from a viral protein and the small molecule is an inhibitor of the viral protein. In one embodiment, the non-human protein is derived from a bacterial protein and the small molecule is an inhibitor of the bacterial protein. In one embodiment, the non-human protein is derived from a fungal protein and the small molecule is an inhibitor of the fungal protein. In one embodiment, the non-human protein is derived from a protozoal protein and the small molecule is an inhibitor of the protozoal protein. In one embodiment, the non-human protein is derived from a viral protease and the small molecule is an inhibitor of the viral protease.
The term “derived from” in the context of target proteins is intended to mean that the target protein has a similar, but not necessarily identical, amino acid sequence to the protein from which it is derived and the target protein is still capable of binding to the small molecule. A target protein that is derived from a protein may have an amino acid sequence that is at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identical to the protein from which it is derived. A target protein that is derived from a protein may contain less than 50, less than 40, less than 30, less than 20, less than 10, less than 9, less than 8, less than 7, less than 6, less than 5, less than 4, less than 3, or less than 2 sequence alterations compared to the protein from which it is derived. For example, a target protein having the amino acid sequence set forth in SEQ ID NO: 2 is derived from the viral protease having the sequence set forth in SEQ ID NO: 1. Additionally, the target protein may have fewer amino acids (i.e. it is a shorter protein) than the protein from which it is derived.
Viral proteases are enzymes encoded by the genetic material of viral pathogens. The normal function of these enzymes is to catalyse the cleavage of specific peptide bonds in viral polyprotein precursors or in cellular proteins. Examples of viral proteases include those encoded by hepatitis C virus (HCV), human immunodeficiency virus (HIV), herpesvirus, retrovirus and human rhinovirus (HRV) families. Certain viral proteases, along with examples of small molecule inhibitors of these proteases, are described for example in Patick and Potts. 1998.
A small molecule is an organic compound that typically has a molecular weight of 2000 daltons or less. The small molecule may be synthetic or naturally occurring.
The choice of viral protease inhibitor as small molecule is not particularly limited provided it a) is able to bind the target protein and b) has been evaluated for clinical purposes in humans. Viral protease inhibitors that have been evaluated for clinical purposes in humans include those that have been approved by a regulatory agency for clinical use in humans, for example, inhibitors approved for treatment by the Food and Drug Administration (FDA) and/or by the European Medicines Agency (EMA). Viral protease inhibitors that have been evaluated for clinical purposes also include those that are being/have been tested in clinical trials involving humans and have preferably have proceeded past phase I clinical trials. Preferably the viral protease inhibitor is approved for clinical use in humans. Preferably the viral protease inhibitor is suitable for chronic dosing (daily for six months or greater), cell permeable, orally dosed and/or not used as a first line therapy.
The viral protease used may be monomeric or multimeric (e.g. dimeric, trimeric, tetrameric, etc.). The use of a monomeric viral protease may be preferred, for example where a strict 1:1 ratio of the target protein fusion protein and binding member fusion protein elicit the desired functional activity. There may be alternative situations where a multimeric viral protease is preferred, for example when the target protein is fused to a transcriptional regulatory domain in a split transcription factor and the use of a multimeric viral protease could increase the number of transcriptional regulatory domains that are recruited to a target gene.
In some embodiments the viral protease is an HCV NS3/4A protease or a HIV protease. Both these proteases are known to be targeted by several approved small molecule inhibitors that are known to be generally well tolerated in humans and suitable for chronic dosing. Examples of small molecule inhibitors that target HCV NS3/4A protease are described in De Clercq. 2014. Examples of small molecule inhibitors that target HIV protease are described in Lv et al. 2015.
In some embodiments the viral protease is an HCV NS3/4A protease. HCV NS3/4A PR is monomeric, relatively small in size (21 kDa), can be expressed cytoplasmically, and is not found associated with DNA, making it an ideal candidate as a viral protease for use in the disclosure. The HCV NS3/4A protease may have the amino acid sequence of amino acid positions 1030-1206 of the amino acid sequence set forth in UniProt accession number A8DG50-1 (version 2 of the sequence; sequence update 29 Apr. 2008). In some embodiments the HCV NS3/4A protease may have the amino acid sequence set forth in SEQ ID NO: 1. A target protein that is derived from a HCV NS3/4A protease may have an amino acid sequence that is at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identical to the amino acid sequence set forth in SEQ ID NO: 1.
There are several small molecule inhibitors that are known to bind the HCV NS3/4A protease and have been approved for human use. Some of these are set forth in the following table:
The structures of the target proteins in complex with the respective small molecule are provided as PDB accession numbers, which correspond to the crystal structures available from the Protein Data Bank (PDB). The small molecule structures and chemical names are also provided as PDB accession numbers.
The small molecule may be a peptide mimetic. The terms “peptide mimetic”, “peptidomimetic” and “peptide analogue” are used interchangeably and refer to a chemical compound that is not composed of amino acids but has substantially the same characteristics as a peptidic compound that is entirely composed of amino acids.
Other small molecule inhibitors that are being/have been tested in clinical trials involving humans include faldaprevir, sovaprevir, vedroprevir.
In some embodiments, the small molecule is selected from the group consisting of simeprevir, boceprevir, telaprevir, asunaprevir, vaniprevir, voxilaprevir, glecaprevir, paritaprevir, narlaprevir, danoprevir, faldaprevir, grazoprevir, sovaprevir, vedroprevir, or a pharmacologically acceptable analog or derivative thereof. All these small molecules have been approved for human use and/or have been tested in clinical trials involving humans. In some embodiments, the small molecule is selected from the group consisting of simeprevir, boceprevir, telaprevir, asunaprevir, vaniprevir, voxilaprevir, glecaprevir, paritaprevir, grazoprevir, danoprevir and narlaprevir, or a pharmacologically acceptable analog or derivative thereof. These small molecules have been approved for human use.
In particular embodiments, the small molecule is selected from the group consisting of simeprevir, boceprevir and telaprevir, or a pharmacologically acceptable analog or derivative thereof. These small molecules (simeprevir, boceprevir and telaprevir) are well tolerated in humans and have been approved for chronic human use. In particular embodiments, the small molecule may be simeprevir or a pharmacologically acceptable analog or derivative thereof. Simeprevir (Olysio®) is a small molecule that is administered orally, is cell-permeable, and has a pharmacokinetics (PK) profile that supports once-daily dosing. It has been used chronically (up to 39 months) to treat HCV infection in combination with ribavirin and pegylated interferon, and is on the WHO essential medicines list, indicative of a well-tolerated and widely administered drug.
Pharmacologically acceptable analogs and derivatives of the small molecules include compounds that differ from the “parent” small molecule but contain a similar antiviral activity as the parent small molecule and include tautomers, regioisomers, geometric isomers, and where applicable, stereoisomers, including optical isomers (enantiomers) and other steroisomers (diastereomers) thereof, as well as pharmaceutically acceptable salts and derivatives (including prodrug forms) thereof where applicable, in context. For example, analogs of simeprevir include those compounds encompassed by formula (I) defined in WO 2007014926 A1.
Simeprevir may have the following chemical structure:
In some embodiments the viral protease is a HIV protease. HIV protease exists as a 22 kDa homodimer, with each subunit made up of 99 amino acids. The HIV protease may have the amino acid sequence of amino acid positions 501-599 of the amino acid sequence set forth in UniProt accession number P03366-1 (version 3 of the sequence; sequence update 23 Jan. 2007). A target protein that is derived from a HIV protease may have an amino acid sequence that is at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identical to the amino acid sequence of amino acid positions 501-599 of the amino acid sequence set forth in UniProt accession number P03366-1. A target protein that is derived from a HIV protease may be a monomeric protein. For example, the target protein may contain one or more amino acid mutations that reduce the likelihood of the formation of a homodimeric protein.
There are several small molecule inhibitors that are known to bind the HIV protease and have been approved for human use. Some of these are set forth in the following table:
Fosamprenavir is a prodrug form of amprenavir that has better solubility and bioavailability than amprenavir.
In some embodiments, the small molecule is selected from the group consisting of atazanavir, darunavir and fosamprenavir, amprenavir, indinavir, lopinavir/ritonavir, nelfinavir, ritonavir, saquinavir and tipranavir, or a pharmacologically acceptable analog or derivative thereof.
In particular embodiments, the small molecule is selected from the group consisting of atazanavir, darunavir and fosamprenavir, or a pharmacologically acceptable analog or derivative thereof. These small molecules are well tolerated in humans and have good bioavailability. Furthermore, HIV protease inhibitors are typically used in patients for long periods of time and it is expected that these small molecule inhibitors would be tolerated for use over a long period of time.
In some embodiments, the target protein has attenuated viral activity compared to the viral protease from which it is derived. Attenuated viral activity in this context is intended to mean that the target protein has a lower enzymatic activity, e.g. lower protease activity, compared to the viral protease from which it is derived. Enzymatic activity can be tested, for example, using a fluorogenic peptide cleavage assay as described in the examples or described in Sabariegos et al. 2009. Briefly, the fluorgenic peptide cleavage assay involves using incubating the target protein/viral protease with a fluorogenic protease FRET substrate containing a donor-quencher pair such that cleavage of the peptide separates the donor from the quencher, emitting energy that can be detected at a certain wavelength, e.g. 490 nm.
In some embodiments, the target protein is considered to have attenuated viral activity compared to the viral protease from which it is derived if the target protein has an activity that is less than 10% of the activity of the viral protease as measured in an enzymatic activity assay, such as a fluorogenic peptide cleavage assay. In some embodiments, the target protein does not display any detectable viral activity when measured in an enzymatic activity assay, such as a fluorogenic peptide cleavage assay, when the target protein is at a concentration less than 1 nM, less than 10 nM, less than 100 nM, or less than 1 μM.
The target protein may comprise one or more amino acid mutations (e.g. substitutions/insertions/deletions) compared to the viral protease from which it is derived (e.g. compared to SEQ ID NO: 1). The target protein comprising the one or more amino acid mutations should retain its ability to form a tripartite complex with the small molecule and binding member, which can be determined, e.g. using a homogeneous time-resolved fluorescence (HTRF) assay as described in the examples.
In some embodiments, the target protein comprises one or more amino acid mutations compared to the viral protease from which it is derived, wherein the one or more amino acid mutations attenuate the viral activity of the target protein. The one or more amino acid mutations may be in the active site of the viral protease.
For example, the HCV NS3/4A protease contains a catalytic triad involving the amino acid residues H57, D81 and S139 of the HCV NS3/4A protease. See, e.g. Grakoui et al. 1993; Eckart et al. 1993; and Bartenschlager et al. 1993. These amino acid residues correspond to positions H72, D96 and S154 of the amino acid sequence of SEQ ID NO: 1. Thus, the target protein may contain an amino acid mutation at one or more amino acids selected from positions 72, 96 and 154 of the HCV NS3/4A protease, wherein amino acid numbering corresponds to SEQ ID NO: 1. Other residues of the HCV NS3/4A protease that are known to be involved in viral activity include C97, C99, C145 and H149 of the HCV NS3/4A protease (corresponding to positions C112, C114, C160 and H164 of SEQ ID NO: 1). See, e.g. Hikikata et al. 1993; and Stempniak et al. 1997. In some embodiments, the target protein contains an amino acid mutation (e.g. substitution) at one or more amino acids selected from positions 72, 96, 112, 114, 154, 160 and 164 of the HCV NS3/4A protease, wherein amino acid numbering corresponds to SEQ ID NO: 1.
In particular embodiments, the target protein comprises an amino acid mutation at position 154 of the HCV NS3/4A protease, wherein amino acid numbering corresponds to SEQ ID NO: 1, such as a mutation to alanine. In certain embodiments, the target protein has an amino acid sequence of SEQ ID NO: 2.
The full-length sequence of the NS3 protein is provided in SEQ ID NO: 199. The amino acid mutation described here at position 154 of SEQ ID NO: 1 corresponds to the position 139 of SEQ ID NO: 199.
A table identifying the potential amino acid mutations described above numbered according to the full length NS3 protein (SEQ ID NO: 199) and their corresponding positions in the NS3/4A protease amino acid sequence set forth in SEQ ID NO: 1 is set out as follows:
As a further example, the HIV protease contains a catalytic triad involving the amino acid residues D25, T26 and G27, wherein amino acid numbering is according to the HIV protease having the amino acid sequence of amino acid positions 501-599 of the amino acid sequence set forth in UniProt accession number P03366-1 (version 3 of the sequence; sequence update 23 Jan. 2007). Thus, the target protein may contain an amino acid mutation at one or more amino acids selected from positions 25, 26 and 27 of the HIV protease, wherein amino acid numbering is according to the HIV protease having the amino acid sequence of amino acid positions 501-599 of the amino acid sequence set forth in UniProt accession number P03366-1 (version 3 of the sequence; sequence update 23 Jan. 2007).
The target protein and small molecule interact to form a complex between the target protein and small molecule referred to herein as a T-SM complex. The interaction may be a covalent interaction or a non-covalent interaction. In some embodiments the small molecule binds to the target protein with a kD that is lower than 1 mM, preferably lower than 500 nM, more preferably lower than 200 nM, even more preferably lower than 100 nM, or yet more preferably lower than 50 nM, when measured for example using surface plasmon resonance or bio-layer interferometry. In some embodiments, the small molecule binds to the target protein with a kD between 25 nM and 200 nM, between 25 nM and 100 nM, or between 25 and 75 nM, when measured for example using surface plasmon resonance or bio-layer interferometry.
It may be desirable to introduce amino acid mutations (e.g. substitutions) in the target protein in order to reduce the affinity of the small molecule for the target protein and allow a second small molecule to displace the small molecule in the T-SM complex. For example, as demonstrated herein, simeprevir binds the target protein HCV NS3/4A protease (S139A) (SEQ ID NO: 2) with a very high affinity such that other small molecules that bind the target protein are unable to displace simeprevir from the T-SM complex. Reducing the binding affinity of simeprevir to HCV NS3/4A protease by introducing amino acid modification(s) in the target protein allows for the use of different small molecules inhibitors of the HCV NS3/4A protease to disrupt the tripartite complex formed between HCV NS3/4A protease (S139A), simeprevir and PRSIM_23. Thus, in some embodiments the target protein comprises one or more affinity reducing amino acid mutations (e.g. substitutions) compared the viral protease from which it is derived (e.g. SEQ ID NO: 1), such that the small molecule binds the target molecule with a lower affinity than the small molecule binds a parent target protein. The ‘parent target protein’ in this context lacks the one or more affinity reducing amino acid mutations but is otherwise identical to the target protein. The parent target protein may be the viral protease from which the target protein is derived from (e.g. the parent target protein may have the amino acid sequence set forth in SEQ ID NO: 1), or the parent target protein may itself be derived from a viral protease (e.g. the parent target protein may have the amino acid sequence set forth in SEQ ID NO: 2).
The one or more affinity reducing amino acid mutations may result in the small molecule binding the target protein with at least a 1.5-fold lower affinity than the small molecule binds the parent target protein. The one or more affinity reducing amino acid mutations may result in the small molecule binding the target protein with an affinity that is between 1.5-fold and 10-fold lower than the small molecule binds the parent target protein, or between 1.5-fold and 5-fold lower than the small molecule binds the parent target protein. The one or more affinity reducing amino acid mutations may result in the small molecule binding the target protein with a KD between 25 nM and 200 nM, between 25 and 100 nM, or between 25 and 75 nM, optionally where affinity is measured using bio-layer interferometry, such as using an Octet RED384.
As demonstrated herein, amino acid substitutions at positions 151 and 183 of a HCV NS3/4A protease, wherein numbering amino acid numbering corresponds to SEQ ID NO: 1, were found to reduce the affinity of simeprevir to the HCV NS3/4A protease and allow a second small molecule that disrupt the tripartite complex formed between the HCV NS3/4A protease, simeprevir and the binding member PRSIM_23. Further, target proteins comprising these affinity reducing mutations were also demonstrated to retain functionality in dimerization-inducible proteins such as in split transcription factors. Amino acid positions 151 and 183 of SEQ ID NO: 1 correspond to amino acid positions 136 and 168, respectively, of the full length NS3 protein set forth in SEQ ID NO: 99.
Thus, in some embodiments where the target protein is derived from a viral protease that is an HCV NS3/4A protease, the target protein may have an affinity reducing amino acid mutation (e.g. substitution) at one or more amino acids selected from positions 151 and 183, wherein the amino acid numbering corresponds to SEQ ID NO: 1. In some embodiments, the affinity reducing amino acid mutation at position 151 is a mutation to aspartic acid, asparagine or histidine, and the affinity reducing mutation at position 183 is to glutamic acid, glutamine or alanine. In some embodiments, the affinity reducing amino acid mutation at position 151 is a mutation to aspartic acid or asparagine and the affinity reducing mutation at position 183 is to glutamic acid. The target protein may comprise the affinity reducing amino acid mutation in addition to another amino acid mutation described herein (e.g. in addition to the amino acid mutation at position 154, such as to an alanine).
In certain embodiments, the target protein has an amino acid sequence having at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% identity to SEQ ID NO: 1 and comprises alanine at position 154 and aspartic acid, asparagine or histidine (e.g. aspartic acid or asparagine) at position 151, wherein the amino acid numbering corresponds to SEQ ID NO: 1. In certain embodiments, the target protein is derived from a viral protease having the amino acid sequence set forth in SEQ ID NO: 1, wherein the target protein differs from the viral protease in that it comprises alanine at position 154 and aspartic acid, asparagine or histidine (e.g. aspartic acid or asparagine) at position 151, and optionally 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19 or 20 additional sequence alterations (e.g. functionally conservative substitutions), wherein the amino acid numbering corresponds to SEQ ID NO: 1. In certain embodiments, the target protein comprises an amino acid sequence having at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100% sequence identity to any one of the sequences set forth in SEQ ID NOs: 211 and 215.
In certain embodiments, the target protein has an amino acid sequence having at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% identity to SEQ ID NO: 1 and comprises alanine at position 154 and glutamic acid, glutamine or alanine (e.g. glutamic acid) at position 183, wherein the amino acid numbering corresponds to SEQ ID NO: 1. In certain embodiments, the target protein is derived from a viral protease having the amino acid sequence set forth in SEQ ID NO: 1, wherein the target protein differs from the viral protease in that it comprises alanine at position 154 and aspartic acid, asparagine or histidine (e.g. aspartic acid) at position 151, and optionally 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19 or 20 additional sequence alterations (e.g. functionally conservative substitutions), wherein the amino acid numbering corresponds to SEQ ID NO: 1. In certain embodiments, the target protein comprises an amino acid sequence having at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100% sequence identity to the sequences set forth in SEQ ID NOs: 213.
As used herein “binding member” refers to a polypeptide or protein that specifically binds to the T-SM complex. The term “specific” may refer to the situation in which the binding member will not show any significant binding to molecules other than the T-SM complex. Such molecules are referred to as “non-target molecules” and include the target protein alone and the small molecule alone, i.e. the target protein or small molecule when not part of the T-SM complex.
In some embodiments, the binding member is considered to not show any significant binding to a non-target molecule if the extent of binding to a non-target molecule is less than about 10% of the binding of the binding member to the T-SM as measured, e.g., by isothermal calorimetry, ELISA, surface plasmon resonance (SPR), Bio-Layer Interferometry (BLI), homogeneous time-resolved fluorescence (HTRF), MicroScale Thermophoresis (MST), or by a radioimmunoassay (RIA). In some embodiments, the extent of binding to a non-target molecule is less than about 5% or less than about 1% of the binding of the binding member to the T-SM. Methods used to determine the extent of binding involving SPR (Biacore) and HTRF are described in the Examples. In some embodiments, where the extent of binding is measured by HTFR, the binding member described herein binds to the T-SM complex with an affinity that is at least 2-fold greater than the affinity towards another, non-target molecule, e.g. the target protein alone or small molecule alone. In some embodiments, the binding member binds to its target molecule with an affinity that is one of at least 3-, 5-, 10-, 20-fold greater than the affinity towards another, non-target molecule. Alternatively, the binding specificity may be reflected in terms of binding affinity, where the binding member described herein binds to the T-SM complex with an affinity that is at least 10-fold greater than the affinity towards another, non-target molecule, e.g. the target protein alone or small molecule alone. Binding affinity may be measured by surface plasmon resonance, e.g. Biacore. In some embodiments, the binding member binds to its target molecule with an affinity that is one of at least 50-, 100-, 1000-, 10000-fold greater than the affinity towards another, non-target molecule.
Binding affinity is typically measured by Kd (the equilibrium dissociation constant between the binding member and its target). As is well understood, the lower the Kd value, the higher the binding affinity of the binding member. For example, a binding member that binds to the T-SM complex with a Kd of 1 nM would be considered to be binding the T-SM complex with an affinity that is greater than a binding member that binds to a non-target molecule with a Kd of 100 nM.
The binding member may bind to the T-SM complex with an affinity having a Kd equal to or lower than 50 nM, 25 nM, 20 nM, 15 nM or 10 nM. The binding member may bind to the target protein alone or small molecule alone with an affinity having a Kd equal to or higher than 500 nM, 1 μM, 10 μM, 100 μM, or 1 mM. Binding affinity may be measured by SPR, e.g. by Biacore. The binding member may show minimal or no binding to the target protein alone and/or to the small molecule alone when measured by SPR.
In some embodiments, the binding member specifically binds the T-SM complex at an epitope that is only present on the T-SM complex and not on the target protein alone or small molecule alone. For example, the binding member may bind to a site of the T-SM complex comprising at least a portion of the small molecule and a portion of the target protein. Alternatively, the formation of a T-SM complex may induce a conformational change in the target protein that results in the formation of a new epitope that is specifically bound by the binding member. Methods of determining whether the binding member binds to a specific epitope include X-ray crystallography, peptide scanning, site-directed mutagenesis mapping and mass spectrometry.
In embodiments where the T-SM complex comprises a target protein derived from a HCV NS3/4A protease (e.g. SEQ ID NO: 2) and the small molecule simeprevir, the binding member may specifically bind the T-SM by forming interactions with at least one of the following residues of the target protein: Tyr71, Gly75, Thr76, Va193, Asp94, where the amino acid numbering corresponds to SEQ ID NO: 1. The binding member may form interactions with 1, 2, 3, 4, or most preferably all 5 of these residues. The binding member may additionally specifically bind the T-SM complex by forming interactions with the quinoline moiety of simeprevir. At least some of these interactions may by hydrophobic interactions and/or water-mediated interactions. Interactions can be determined using X-ray crystallography, for example as described in the examples.
The binding member may be an antibody molecule, such as a single chain variable fragment, or an antibody mimetic, such as a Tn3 protein.
Antibody Molecules
Aspects and embodiments of the present disclosure are directed to binding members that are antibody molecules, such as single chain variable fragments (scFv).
The term “antibody molecule” describes an immunoglobulin whether natural or partly or wholly synthetically produced. The antibody molecule may be human or humanised. The antibody molecule may be a monoclonal antibody molecule. Examples of antibodies are the immunoglobulin isotypes, such as immunoglobulin G (IgG), and their isotypic subclasses, such as IgG1, IgG2, IgG3 and IgG4, as well as fragments thereof.
An antibody molecule generally comprises six complementarity-determining regions (CDRs); three in the variable heavy (VH) region: HCDR1, HCDR2 and HCDR3, and three in the variable light (VL) region: LCDR1, LCDR2, and LCDR3. The six CDRs together define the paratope of the antibody molecule, which is the part of the antibody molecule which binds to the T-SM complex. The VH region and VL region comprise framework regions (FRs) either side of each CDR, which provide a scaffold for the CDRs. From N-terminus to C-terminus, VH regions comprise the following structure: N term-[HFR1]-[HCDR1]-[HFR2]-[HCDR2]-[HFR3]-[HCDR3]-[HFR4]-C term; and VL regions comprise the following structure: N term-[LFR1]-[LCDR1]-[LFR2]-[LCDR2]-[LFR3]-[LCDR3]-[LFR4]-C term.
There are several different conventions for defining antibody CDRs and FRs, such as those described in Kabat et al., Sequences of Proteins of Immunological Interest, 5th Ed. Public Health Service, National Institutes of Health, Bethesda, Md. (1991), Chothia et al., J. Mol. Biol. 196:901-917 (1987), IMGT numbering as described in LeFranc et al., Nucleic Acids Res. (2015) 43 (Database issue):D413-22, and VBASE2, as described in Retter et al., Nucl. Acids Res. (2005) 33 (suppl 1): D671-D674. The CDRs and FRs of the VH regions and VL regions of the antibody molecules described herein were defined according to Kabat (Kabat, E. A et al (1991).
The term “antibody molecule”, as used herein, includes antibody fragments, provided they display binding to the relevant target molecule(s). Examples of antibody fragments include Fv, scFv, Fab, scFab, F(ab′)2, Fab2, diabodies, triabodies, scFv-Fc, minibodies and single domain antibodies (e.g. VhH), etc.). Unless the context requires otherwise, the term “antibody molecule”, as used herein, is thus equivalent to “antibody molecule or antigen-binding fragment thereof”. In particular exemplified embodiments, the antibody molecule is a single chain variable fragment (scFv).
Antibody molecules and methods for their construction and use are well-known in the art and are described in, for example, Holliger & Hudson, Nature Biotechnology 23(9):1126-1136 (2005). It is possible to take monoclonal and other antibody molecules and use techniques of recombinant DNA technology to produce other antibody or chimeric molecules which retain the specificity of the original antibody. Such techniques may involve introducing CDRs or variable regions of one antibody molecule into a different antibody molecule (EP-A-184187, GB 2188638A and EP-A-239400).
In view of today's techniques in relation to monoclonal antibody technology, antibody molecules can be prepared to most antigens. The antigen-binding domain may be a part of an antibody (for example a Fab fragment) or a synthetic antibody fragment (for example an scFv). Suitable monoclonal antibodies to selected antigens may be prepared by known techniques, for example those disclosed in “Monoclonal Antibodies: A manual of techniques”, H Zola (CRC Press, 1988) and in “Monoclonal Hybridoma Antibodies: Techniques and Applications”, J G R Hurrell (CRC Press, 1982). Chimeric antibodies are discussed by Neuberger et al (1988, 8th International Biotechnology Symposium Part 2, 792-799).
The sequence identifiers (SEQ ID NOs) for HCDR1, HCDR2, HCDR3, LCDR1, LCDR2, LCDR3, variable heavy (VH) chain, variable light (VL) chain and scFv amino acid sequences for PRSIM_57, PRSIM_01, PRSIM_04, PRSIM_67, PRSIM_72 and PRSIM_75 are as set forth in the following table:
In some embodiments, the antibody molecule comprises heavy chain complementarity determining regions (HCDRs) 1 to 3 and/or light chain complementarity determining regions (LCDRs) of:
In some embodiments, the binding member comprises a number of sequence alterations, e.g. one, two, three, four, or five sequence alterations, in any one or more of the CDRs defined above.
In some embodiments, the antibody molecule comprises a variable heavy (VH) chain and/or variable light (VL) chain of:
In particular embodiments, the antibody molecule is a single-chain variable fragment (scFv). Typically, an scFV comprises a VH chain and a VL chain separated by a peptide linker. The peptide linker may be as defined herein. In some embodiments, the peptide linker separating the VH and VL chain may comprise the amino acid sequence of SEQ ID NO: 204.
In some embodiments, the scFv comprises an amino acid sequence having at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% identity with the amino acid sequence of:
In particular embodiments, the scFv comprises an amino acid sequence of:
Antibody Mimetics
The binding member may be an antibody mimetic. Antibody mimetics are organic compounds that are able to specifically bind antigens but are structurally different to antibody molecules. Examples of antibody mimetics include scaffold proteins such as Tn3 proteins, affibodies, affilins, affimers, affitins, alphabodies, anticalins, avimers, DARPins, flynomers, Kunitz domain peptides, monobodies and nanoCLAMPs.
In particular aspects and embodiments, the binding member is a Tn3 protein.
Tn3 proteins are based on the structure of a type III fibronectin module (FnIII) and are derived from the third FnIII domain of human tenascin C. The generation and use of Tn3 proteins is described for example in WO 2009/058379, WO 2011/130324, WO2011130328 and Gilbreth et al. 2014.
The Tn3 proteins and the native FnIII domain from tenascin C are characterized by the same tridimensional structure, namely a beta-sandwich structure with three beta strands (A, B, and E) on one side and four beta strands (C, D, F, and G) on the other side, connected by six loop regions. These loop regions are designated according to the beta-strands connected to the N- and C-terminus of each loop. Accordingly, the AB loop is located between beta strands A and B, the BC loop is located between strands B and C, the CD loop is located between beta strands C and D, the DE loop is located between beta strands D and E, the EF loop is located between beta strands E and F, and the FG loop is located between beta strands F and G. FnIII domains possess solvent exposed loops tolerant of randomization, which facilitates the generation of diverse pools of protein scaffolds capable of binding specific targets with high affinity.
A wild-type Tn3 protein may comprise the sequence SEQ ID NO: 134. In the wild-type Tn3 protein, the BC, DE and FG loops are located at positions 23 to 31, 51 to 56 and 75 to 80, wherein the amino acid numbering corresponds to SEQ ID NO: 134. The Tn3 protein may contain one, preferably two, more preferably three, even more preferably four of the stabilising mutations selected from the list consisting of 132F, D49K, E861 and T89K, wherein the amino acid numbering corresponds to SEQ ID NO: 134. The amino acid sequence of a wild-type Tn3 protein comprising all four stabilising mutations is set forth in SEQ ID NO: 135. The Tn3 protein may additionally contain one or more of the stabilising mutations described in Gilbreth et al. 2014 (see, in particular, Table 1 of Gilbreth et al. 2014).
Tn3 proteins can be subjected to directed evolution designed to randomize one or more of the loops which are analogous to the complementarity-determining regions (CDRs) of an antibody variable region. Such a directed evolution approach results in the production of antibody-like binding members with high affinities for targets of interest, e.g., the T-SM complexes described herein.
Thus, the Tn3 protein that specifically binds to the T-SM complex described herein may comprise the BC, DE and FG loops of PRSIM_23, PRSIM_32, PRSIM_33, PRSIM_36, or PRSIM_47. For example, the Tn3 protein may comprise the sequence of SEQ ID NO: 134 or SEQ ID NO: 135, where the BC, DE and FG loops located at positions 23 to 31, 51 to 56, and 75 to 80, respectively, are substituted for the BC, DE and FG loops of PRSIM_23, PRSIM_32, PRSIM_33, PRSIM_36, or PRSIM_47, wherein the amino acid numbering corresponds to SEQ ID NO: 134.
A person skilled in the art would be readily able to determine the amino acid sequences of the BC, DE and FG loops of the PRSIM clones described herein. For example, the amino acid sequences of the PRSIM clones could be compared to the amino acid sequences of the wild-type Tn3 protein, e.g. those amino acid sequences set forth in SEQ ID NO: 134 or 135.
The Tn3 sequence, amino acid positions and sequences of the BC, DE and FG loops of PRSIM_23, PRSIM_32, PRSIM_33, PRSIM_36, or PRSIM_47 are as set forth in the following table:
In some embodiments, the Tn3 protein comprises the BC, DE and FG loops of:
In some embodiments, the Tn3 protein comprises the BC, DE and FG loops of:
In some embodiments, the Tn3 protein comprises a number of sequence alterations, e.g. one, two, three, four, or five sequence alterations, in any one or more of the BC, DE and EF loops defined above. In some embodiments, the Tn3 protein comprises a number of sequence alterations, e.g. one, two, three, four, or five sequence alterations, outside the BC, DE and EF loops defined above.
In some embodiments, the Tn3 protein comprises an amino acid sequence having at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98% or 99% identity with the amino acid sequence of:
In particular embodiments, the Tn3 protein comprises an amino acid sequence of:
In some embodiments the target protein is fused to a first component polypeptide and the binding member is fused to a second component polypeptide. In particular embodiments the first and second component polypeptides form part of a dimerization-inducible protein.
As used herein “dimerization-inducible protein” refers to a protein or complex comprising a first and second component polypeptide, wherein the first and second polypeptide form a functional protein upon dimerization. The term “dimerization-inducible proteins” includes “split proteins”, “dimerization-deficient proteins” and “split complexes”. The term “component polypeptide” is intended to encompass both single-chain and multi-chain polypeptides. The first and second component polypeptides in the dimerization-inducible protein typically do not have activity or have less activity when separated, but upon dimerization are brought into close proximity and as such become active or have increased activity. As described in the examples, the combination of particular binding members, target proteins and small molecules described herein are able to regulate dimerization of the dimerization-inducible protein such that a significant increase in activity is observed when the binding member is bound to the T-SM complex compared to the separate components of the dimerization-inducible protein alone.
Examples of dimerization-inducible proteins include split chimeric antigen receptor (split CAR; e.g. as described in Wu et al. 2015), split kinases (e.g. as described in Camacho-Soto et al. 2014), split transcription factors (e.g. as described in Taylor et al. 2010), split apoptotic proteins (e.g. split caspases as described in Chelur et al. 2007), split reporter systems (e.g. as described in Dixon et al. 2016).
The dimerization-inducible protein will have increased activity when the binding member is bound to the T-SM complex. Increased activity can be compared to the activity observed when the binding member is not bound to the T-SM complex (e.g. because one or more of the target protein, small molecule or binding member is not present). In some embodiments, the increased activity observed when the binding member is bound to the T-SM complex is at least a 1.5-fold, 2-fold, 3-fold, 5-fold, 10-fold, 15-fold, 20-fold, 25-fold, 30-fold, 35-fold, 40-fold, 45-fold, 50-fold, 55-fold, 60-fold, 65-fold, 70-fold, 75-fold, 80-fold, 85-fold, 90-fold, 95-fold, 100-fold, 105-fold, 110-fold, 115-fold, or 120-fold increase in activity as compared to activity observed when the binding member is not bound to the T-SM complex.
Methods of measuring the activity of the dimerization-inducible protein will depend upon the particular dimerization-inducible protein being studied. Where the first and second component polypeptide form a chimeric antigen receptor (CAR) upon dimerization, CAR activity can be determined by measuring the immune cell activation and/or proliferation. As described in the examples, CAR activity can be measured by interleukin-2 (IL-2) production, e.g. by ELISA, after stimulation of the CAR by an antigen. Where the first and second component polypeptide form a kinase upon dimerization, activity of the kinase can be measured by incorporation of phosphate, e.g. radioactive 32P, into a peptide substrate as described in Camacho-Soto et al. 2014. Where the first and second component polypeptides form a transcription factor upon dimerization, transcriptional activity can be determined by measuring expression of a downstream desired expression cassette modulated by the split transcription factor as described in the examples. Where the first and second component polypeptide form a therapeutic protein upon dimerization, activity can be measured by using suitable assays for determining functional activity of the protein. Where the first and second component polypeptides form a caspase upon dimerization, caspase activity can be measured using a caspase activity assay or by measuring apoptotic cell death. Where the first and second component polypeptides form a reporter system upon dimerization, reporter activity can be determined by measuring expression of the reporter, e.g. a luciferase.
The first component polypeptide may be fused to the C-terminus or the N-terminus of the target protein or binding member. The second component polypeptide may be fused to the C-terminus or the N-terminus of the target protein or binding member. The component polypeptides may be fused to the target protein or binding member via a peptide linker. Suitable peptide linkers include those represented by [G]n, [S]n, [A]n, [GS]n, [GGS]n, [GGGS]n (SEQ ID NO.: 239), [GGGGS)n (SEQ ID NO.: 240), [GGSG]n (SEQ ID NO.: 241), [GSGG]n (SEQ ID NO.: 242), [SGGG]n (SEQ ID NO.: 243), [SSGG]n (SEQ ID NO.: 244), [SSSG]n (SEQ ID NO.: 245), [GG]n, [GGG]n, [SA]n, [TGGGGSGGGGS]n (SEQ ID NO.: 185), and combinations thereof, wherein n is an integer between 1 and 30. For example, n may be 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, or any number up to 30. The component polypeptide may be fused to the target protein or binding member directly, e.g. in the format-first component polypeptide-peptide linker-target protein. Alternatively, the component polypeptide may be fused to the target protein or binding member indirectly with one or more additional polypeptides separating the first component polypeptide from the target protein or binding member, e.g. first component polypeptide-additional polypeptide-peptide linker-target protein.
In some embodiments, the first component polypeptide is fused to more than one target protein or binding member. In some embodiments, the second component polypeptide is fused to more than one target protein or binding member or a combination of both. For example, the first or second component polypeptide may be fused to 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10 binding members. In some embodiments, the first or second component polypeptide is fused to between 2 and 10, or between 2 and 5 binding members. In particular embodiments, the first or second component polypeptide is fused to 3 binding members. For example, the first or second component polypeptide may be fused to 1, 2, 3, 4, 5, 6, 7, 8, 9, or 10 target proteins. In some embodiments, the first or second component polypeptide is fused to between 2 and 10, or between 2 and 5 target proteins. In particular embodiments, the first or second component polypeptide is fused to 3 target proteins. Where multiple binding members or target proteins are present, they may be fused to each other by peptide linkers, e.g. those peptide linkers described above.
Split Transcription Factor
The dimerization-inducible protein may be a split transcription factor. In some embodiments, the first component polypeptide comprises a DNA binding domain; and the second component polypeptide comprises a transcriptional regulatory domain, and wherein the first component polypeptide and second component polypeptide form a transcription factor upon dimerization. By “form a transcription factor” it is meant that the first and second component polypeptides are brought into close enough proximity that they are able to reconstitute the transcriptional regulatory activity of desired expression products. The dimerization-inducible protein will have increased transcriptional regulatory activity when the binding member is bound to the T-SM complex, wherein the transcriptional regulatory activity is increased compared to the transcriptional regulatory activity observed when the binding member is not bound to the T-SM complex.
The transcriptional regulatory domain may be a transcriptional activation domain that is capable of upregulating transcription of a gene that the split transcription factor binds to. Suitable transcriptional activation domains include the p65 subunit of nuclear factor kappa B (Bitko & Barik, J. Virol. 72:5610-5618 (1998) and Doyle & Hunt, Neuroreport 8:2937-2942 (1997)); Liu et al., Cancer Gene Ther. 5:3-28 (1998)); the replication and transcription activator (RTA; Lukac et al., J Virol. 73, 9348-61 (1999)), a the HSV VP16 activation domain (see, e.g., Hagmann et al., J. Virol. 71, 5952-5962 (1997)) nuclear hormone receptors (see, e.g., Torchia et al., Curr. Opin. Cell. Biol. 10:373-383 (1998)); or artificial chimeric functional domains such as VP64 (Beerli et al., (1998) Proc. Natl. Acad. Sci. USA 95:14623-33), and degron (Molinari et al., (1999) EMBO J. 18, 6439-6447). Additional exemplary activation domains include, Oct 1, Oct-2A, Sp1, AP-2, and CTF1 (Seipel et al., EMBO J. 11, 4961-4968 (1992) as well as p300, CBP, PCAF, SRC1 PvALF, AtHD2A and ERF-2. See, for example, Robyr et al. (2000) Mol. Endocrinol. 14:329-347; Collingwood et al. (1999) J. Mol. Endocrinol. 23:255-275; Leo et al. (2000) Gene 245:1-11; Manteuffel-Cymborowska (1999) Acta Biochim. Pol. 46:77-89; McKenna et al. (1999) J. Steroid Biochem. Mol. Biol. 69:3-12; Malik et al. (2000) Trends Biochem. Sci. 25:277-283; and Lemon et al. (1999) Curr. Opin. Genet. Dev. 9:499-504. Additional exemplary activation domains include, but are not limited to, OsGAI, HALF-1, C1, AP1, ARF-5,-6,-7, and -8, CPRF1, CPRF4, MYC-RP/GP, and TRAB1 and a modified Cas9 transactivator protein. See, for example, Ogawa et al. (2000) Gene 245:21-29; Okanami et al. (1996) Genes Cells 1:87-99; Goff et al. (1991) Genes Dev. 5:298-309; Cho et al. (1999) Plant Mol. Biol. 40:419-429; Ulmason et al. (1999) Proc. Natl. Acad. Sci. USA 96:5844-5849; Sprenger-Haussels et al. (2000) Plant J. 22:1-8; Gong et al. (1999) Plant Mol. Biol. 41:33-44; Hobo et al. (1999) Proc. Natl. Acad. Sci. USA 96:15,348-15,353; and Perez-Pinera et al. (2013) Nature Methods 10:973-976). The transcriptional activation domain may comprise any combination of the above exemplary activation domains. In some embodiments multiple transcriptional activation domains may be used, e.g. tandem reports of the same domains or fusions of different domains. In some embodiments the transcriptional activation domain is VPR, a tripartite activate made up of the VP64, p65 and Rta domains. An example of a TRD-T fusion protein comprising VPR is set forth in SEQ ID NO: 225 (NS4A/3 PR S139A-VPR). Generation and use of VPR as a transcriptional activator is described for example in Chavez et al. 2015. In some embodiments the transcriptional activation domain is HSF-1, optionally in combination with p65.
Alternatively, the transcriptional regulatory domain may be a transcriptional repression domain that is capable of downregulating transcription of a gene that the split transcription factor binds to. Transcriptional repression domains include, but are not limited to, KRAB A/B, KOX, TGF-beta-inducible early gene (TIEG), v-erbA, SID, MBD2, MBD3, members of the DNMT family (e.g., DNMT1, DNMT3A, DNMT3B), Rb, and MeCP2. See, for example, Bird et al. (1999) Cell 99:451-454; Tyler et al. (1999) Cell 99:443-446; Knoepfler et al. (1999) Cell 99:447-450; and Robertson et al. (2000) Nature Genet. 25:338-342. Additional exemplary repression domains include, but are not limited to, ROM2 and AtHD2A. See, for example, Chem et al. (1996) Plant Cell 8:305-321; and Wu et al. (2000) Plant J. 22:19-27.
The DNA binding domain may be any protein that binds to a target sequence in a sequence specific manner. For example, the DNA binding domain may be or may contain a transcription factor that binds to a target sequence in a sequence specific manner, or a DNA-binding fragment thereof. It is expected that any transcription factor, or DNA-binding fragment thereof, that is capable of binding to a target sequence in a specific manner can be used with the split transcription factors disclosed herein. The DNA-binding domain may be or comprise a naturally occurring DNA-binding domain such as a binding domain from a human transcription factor. For example, the DNA-binding protein may be any of the human transcription factors described in Vaquerizas et al. (2009) (e.g. any of those listed in Supplementary information S3), or a DNA-binding fragment thereof. For example, the DNA-binding protein may be a member of the C2H2 zinc-finger family, the homeodomain family or the helix-loop-helix family or a DNA-binding fragment thereof. In particular embodiments the DNA binding domain may be zinc finger homeodomain transcription factor 1 (ZFHD1). ZFHD1 contains zinc fingers 1 and 2 from the Zif268 transcription factor and the Oct-1 homeodomain. The design and construction of ZFHD1 is described for example in Pomerantz et al. 1995.
The DNA binding domain may be or comprise a DNA-binding domain such as a zinc finger DNA binding domain, a TALE DNA binding domain, a DNA binding domain from a meganuclease (e.g. based on Iscel) or a DNA binding domain from a CRISPR/Cas system. These binding domains can be engineered to bind a target sequence of choice, e.g. a target sequence in a target gene that is naturally present (endogenous) in a cell or a target sequence that has been provided in trans (e.g. as part of a third expression cassette). The engineering of zinc finger DNA binding domains to bind particular target sequences is described for example in U.S. Pat. No. 6,453,242B1. In one embodiment, the DNA-binding domain is a TALE DNA binding domain. The engineering of TALE DNA binding domain domains to bind particular target sequences is described for example in WO2010079430A1. In one embodiment, the DNA binding domain is an engineered DNA binding domain from a meganuclease. The engineering of meganucleases to bind particular target sequence is described for example in WO2007047859A1. A meganuclease may be engineered such that they no longer cleave DNA. In one embodiment, the DNA binding domain is an engineered DNA binding domain from a CRISPR/Cas system. The engineering of DNA binding domains from CRISPR/Cas systems to bind particular sequences is described for example in WO2013176772A1. CRISPR/Cas systems generally involve an RNA-guided endonuclease (e.g. Cas9) that is directed to a specific DNA sequence through complementarity between the associated guide RNA (gRNA) and its target sequence. Thus, the engineered DNA binding domain from a CRISPR/Cas system typically comprises a complex of a RNA-guided endonuclease (e.g. Cas9 or a variant thereof) and a guide RNA. Variants of Cas9 have been generated that lack the endonucleolytic activity but retain the capacity to interact with DNA. See for example Chavez et al. 2015 which describes the use of nuclease-null (dCas9) variants in a method of transcriptional regulation. Thus, the DNA-binding domain may include a nuclease null Cas9 variant which, upon addition of a particular gRNA specific for a target sequence, binds to the target sequence. An example of a DBD-BM fusion protein comprising dCas9 as a DNA-binding domain is set forth in SEQ ID NO: 227 (spdCas9-PRSIM_23×3). An example of a guide RNA that targets the DBD-BM to human IL-2 is set forth in SEQ ID NO: SEQ ID NO: 229. The use of a dCas9 variant as part of a split transcription factor is described in Hill et al. 2018 and WO 2018/213848 A1.
The binding member may be fused to the transcriptional regulatory domain or to the DNA binding domain.
In some embodiments:
In certain embodiments:
The DBD-T fusion protein may comprise an amino acid sequence having at least 90% identity to the amino acid sequence set forth in SEQ ID NO: 45. In particular embodiments TRD-BM fusion protein defined in (1) above may comprise an amino acid sequence having at least 90% sequence identity to the amino acid sequence set forth in any one of SEQ ID NOs: 57-67.
The TRD-T fusion protein may comprise an amino acid sequence having at least 90% identity to the amino acid sequence set forth in SEQ ID NO: 44. In particular embodiments, the DBD-BM fusion protein defined in (2) above may comprise an amino acid sequence having at least 90% sequence identity to the amino acid sequence set forth in any one of SEQ ID NOs: 46-56.
As described in the examples, some of the exemplified binding members showed a preference for fusion to either the DNA binding domain or the transcriptional regulatory domain, whereby increased transcriptional regulatory activity was observed depending on if the particular binding member was fused to the DNA binding domain or transcriptional regulatory domain. Thus, in some embodiments:
In some embodiments, the binding member or target protein is fused to the C-terminus of the DNA binding domain. In other embodiments, the binding member or target protein is fused to the N-terminus of the transcriptional regulatory domain. The binding member or target protein may be fused to the DNA binding domain or transcriptional regulatory domain via a peptide linker, for example via one or more of the peptide linkers set out above. In particular embodiments the linkers have the amino acid sequence TGGGGSGGGGS (SEQ ID NO: 185) or SA.
As described in the examples, PRSIM_23 was found to provide strong gene expression regulation in both orientations. Thus, in some embodiments:
In particular embodiments:
As also demonstrated in the examples, the PRSIM-based CIDs can also be applied to an activating CRISPR (CRISPRa) system. This can be used, for example, to facilitate endogenous gene regulation.
Thus, in some embodiments the DBD-BM fusion protein comprises an amino acid sequence having at least 90% identity to the amino acid sequence set forth in SEQ ID NO: 227; and the TRD-T fusion protein comprises an amino acid sequence having at least 90% identity to the amino acid sequence set forth in SEQ ID NO: 225. The DBD-BM fusion protein can be guided to a target sequence through the use of particular guide RNAs that are specific for said target sequence.
As demonstrated in the examples, split transcription factors comprising a DNA binding domain fused to multiple copies of the target protein or binding member exhibited increased expression relative to a split transcription factor comprising a DNA binding domain fused to a single copy of the target protein or binding member.
Thus, in some embodiments,
The multiple binding members or multiple target proteins may be separated by a linker, for example by one or more peptide linkers as set out above. In particular exemplified embodiments the DBD-T fusion protein comprises a DNA binding domain fused to three target proteins, or the DBD-BM fusion protein comprises a DNA binding domain fused to three binding members.
The first and/or second component polypeptide may additionally comprise nuclear localization signals (such as, for example, that from the SV40 medium T-antigen).
A split transcription factor may also be provided with a third expression cassette, wherein the third expression cassette encodes a desired expression product, wherein the DNA binding domain of the split transcription factor binds to a target sequence in the third expression cassette such that the transcription factor is capable of regulating expression of the desired expression product. By “capable of regulating expression” it is intended to mean that the DNA binding domain is able to bind the target sequence and upon forming a transcription factor with the transcriptional regulatory domain (i.e. upon dimerization of the dimerization-inducible protein), has transcriptional regulatory activity that regulates (increases or decreases) expression of the desired expression product. The desired expression product can be RNA or peptidic (peptide, polypeptide or protein). Preferably the desired expression product is peptidic. The desired expression product may be a therapeutic protein, i.e. a protein that exerts a therapeutic effect in the subject.
The target sequence may be located in or in close proximity to a promoter that is operably linked to a coding sequence for the desired expression product. By “close proximity” it is meant that the target sequence is within 500 bp, within 250 bp, within 100 bp, within 50 bp, or within 25 bp of the sequence corresponding to the promoter.
Split Chimeric Antigen Receptor
The dimerization-inducible protein may be a split chimeric antigen receptor (split CAR).
CARs combine both antibody-like recognition with T-cell-activating function. They are typically composed of an antigen-specific recognition domain, e.g. derived from an antibody, a transmembrane domain to anchor the CAR to the T cell, a co-stimulatory domain and one or more intracellular signalling domains that induce persistence, trafficking and effector functions in transduced T cells. The design and use of CARs is well known in the art and is described, for example in Sadelain et al. 2013.
Split CARs have been designed that require an exogenous, user-provided signal to activate the CAR, for example as described in Wu et al. 2015. In these split receptors, antigen binding and intracellular signalling components only assemble in the presence of a heterodimerizing small molecule, allowing the user to precisely control the timing, location and dosage of T-cell activity. Such split CARs are expected to mitigate toxicity for example by inducing less off-target effects.
In one embodiment the dimerization-inducible protein comprises:
The first component polypeptide set out above may further comprise an antigen-specific recognition domain and a transmembrane domain and the second component polypeptide further comprises a transmembrane domain and a second co-stimulatory domain, and wherein the first and second component polypeptide form a chimeric antigen receptor (CAR) upon dimerization. By “form a CAR” it is meant that the first and second component polypeptides are brought into close enough proximity that they are able to reconstitute a fully functional CAR.
In another embodiment the dimerization-inducible protein comprises:
The first component polypeptide set out above may further comprise a transmembrane domain and a second co-stimulatory domain and the second component polypeptide further comprises an antigen-specific recognition domain and a transmembrane domain, wherein the first and second component polypeptide form a chimeric antigen receptor (CAR) upon dimerization,
The split CAR will have increased activity when the binding member is bound to the T-SM complex, wherein the activity is increased compared to the activity observed when the binding member is not bound to the T-SM complex.
In one embodiment the first component polypeptide comprises, from N-terminal to C-terminal:
In some embodiments the target protein and binding member are fused at a location that is C-terminal to the respective transmembrane domains in the first and second component polypeptides. For example, the target protein or binding member may be fused to the N-terminus or C-terminus of the respective co-stimulatory domains in the first and second component polypeptides. In a particular embodiment, one of the target protein and binding member is fused to the C-terminus of the first co-stimulatory domain and the other is fused to the C-terminus of the second co-stimulatory domain.
For example, in one embodiment the first component polypeptide comprises from N-terminal to C-terminal:
For example, in another embodiment the first component polypeptide comprises from N-terminal to C-terminal:
The target protein and/or binding member may be fused directed to the respective co-stimulatory domains. More preferably, the target protein and binding member are separated from their respective co-stimulatory domains by peptide linkers. The peptide linkers may be as further defined herein. In some embodiments, the target protein and binding member are separated from their respective co-stimulatory domains by a linker comprising the amino acid sequence set forth in SEQ ID NO: 204. Similarly, peptide linkers may separate the various domains in the first and second component polypeptides. For example, the transmembrane domain may be separated from the second co-stimulatory domain by a peptide linker, e.g. a peptide linker comprising the amino acid sequence GS, and/or the second co-stimulatory domain may be separated from the intracellular signalling domain by a peptide linker, e.g. a peptide linker comprising the amino acid sequence set forth in SEQ ID NO: 204.
Non-limiting examples of suitable co-stimulatory domains include, but are not limited to, activation domains from 4-1BB (CD137), CD28, ICOS, OX-40, BTLA, CD27, CD30, GITR, and HVEM. In one embodiment the first and second co-stimulatory domain is a 4-1 BB activation domain.
Non-limiting examples of suitable intracellular signalling domains include, but are not limited to, cytoplasmic sequences of the T cell receptor (TCR) and co-receptors that act in concert to initiate signal transduction following antigen receptor engagement, as well as any derivative or variant of these sequences and any synthetic sequence that has the same functional capability. Particular intracellular signalling domains are those that include signaling motifs which are known as immunoreceptor tyrosine-based activation motifs or ITAMs. Examples of ITAM containing signaling domains include those derived from TCR zeta, FcR gamma, FcR beta, CD3 gamma, CD3 delta, CD3 epsilon, CD3 zeta, CD5, CD22, CD79a, CD79b, and CD66d. In particular embodiments the intracellular signalling domain is derived from CD3 zeta.
The transmembrane domain may be derived either from a natural or from a synthetic source. Where the source is natural, the domain may be derived from any membrane-bound or transmembrane protein. Transmembrane regions may be derived from (i.e. comprise at least the transmembrane region(s) of) the alpha, beta or zeta chain of the T-cell receptor, CD28, CD3 epsilon, CD45, CD4, CD5, CD8, CD9, CD16, CD22, CD33, CD37, CD64, CD80, CD86, CD134, CD137, CD154, or from an immunoglobulin such as IgG4. Alternatively, the transmembrane domain may be synthetic, in which case it will comprise predominantly hydrophobic residues such as leucine and valine. A triplet of phenylalanine, tryptophan and valine may be found at each end of a synthetic transmembrane domain. Optionally, a short oligo- or polypeptide linker, preferably between 2 and 10 amino acids in length may form the linkage between the transmembrane domain and the intracellular signalling domain of the CAR. A glycine-serine doublet provides a particularly suitable linker. In particular embodiments, the transmembrane domain is derived from CD28.
The first and second polypeptides may additionally include a hinge domain, such as an IgG4 or CD8a hinge domain, N-terminal to the transmembrane domains in the first and/or second polypeptides. Examples of hinge domains are described in, for example, Qin et al. 2017. In particular embodiments, the hinge domain is a human IgG4 hinge domain.
An antigen-specific recognition domain suitable for use in a dimerization-inducible protein of the present disclosure can be any antigen-binding polypeptide, a wide variety of which are known in the art. In some instances, the antigen-binding domain is a single chain Fv (scFv). Other antibody-based recognition domains (cAb VHH (camelid antibody variable domains) and humanized versions, IgNAR VH (shark antibody variable domains) and humanized versions, sdAb VH (single domain antibody variable domains) and “camelized” antibody variable domains are suitable for use. In some instances, T-cell receptor (TCR) based recognition domains such as single chain TCR (scTv, single chain two-domain TCR containing v vβ) are also suitable for use.
In particular embodiments, the antigen-specific recognition domain is a single chain Fv (scFv). As described elsewhere, an scFv typically comprises a VH chain separated from a VL chain by a peptide linker, e.g. a peptide linker comprising the amino acid sequence set forth in SEQ ID NO: 204.
An antigen-specific recognition domain suitable for use in a dimerization-inducible protein of the present disclosure can have a variety of antigen-binding specificities. In some cases, the antigen-binding domain is specific for an epitope present in an antigen that is expressed by (synthesized by) a cancer cell, i.e., a cancer cell associated antigen. The cancer cell associated antigen can be an antigen associated with, e.g., a breast cancer cell, a B cell lymphoma, a Hodgkin lymphoma cell, an ovarian cancer cell, a prostate cancer cell, a mesothelioma, a lung cancer cell (e.g., a small cell lung cancer cell), a non-Hodgkin B-cell lymphoma (B-NHL) cell, an ovarian cancer cell, a prostate cancer cell, a mesothelioma cell, a lung cancer cell (e.g., a small cell lung cancer cell), a melanoma cell, a chronic lymphocytic leukemia cell, an acute lymphocytic leukemia cell, a neuroblastoma cell, a glioma, a glioblastoma, a medulloblastoma, a colorectal cancer cell, etc. A cancer cell associated antigen may also be expressed by a non-cancerous cell.
In particular exemplary embodiments, the target protein used in the split-CAR is derived from an HCV NS3/4A protease, the small molecule is simeprevir and the binding member is based on PRSIM_23 (e.g. comprises the BC, DE and FG loops or Tn3 sequence of PRSIM_23, optionally with the sequence identity and/or alterations described herein).
In some embodiments the first component polypeptide comprises from N-terminal to C-terminal:
In some embodiments, the first component polypeptide comprises a first signal peptide located N-terminal to the antigen-specific recognition domain. The first signal peptide may comprise the amino acid sequence set forth in SEQ ID NO: 201 or SEQ ID NO: 202. In exemplified embodiments, the first signal peptide comprises the amino acid sequence set forth in SEQ ID NO: 201.
In some embodiments, the second component polypeptide comprises a second signal peptide located N-terminal to the transmembrane domain. The second signal peptide may comprise the amino acid sequence set forth in SEQ ID NO: 201 or SEQ ID NO: 202. In exemplified embodiments, the second signal peptide comprises the amino acid sequence set forth in SEQ ID NO: 202. In one embodiment, the second component polypeptide comprises an amino acid sequence having at least 90% identity to the amino acid sequence set forth in SEQ ID NO: 203.
Also provided is an engineered immune cell comprising the split CAR disclosed herein. In one embodiment the immune cell is a T-cell. Also provided is a method of genetically modifying an immune cell to express a split CAR disclosed herein. The method may be carried out ex vivo. The method may comprise administering the one or more expression vectors described herein to the immune cell such that the split CAR is expressed on the surface of the immune cell.
Split Reporter System
The dimerization-inducible protein may be a split reporter system. The split reporter system may be an enzyme or fluorescent protein that provides an observable phenotype when the first and second component polypeptides dimerise. The observable phenotype may be a colorimetric signal, a luminescent signal or a fluorescent signal. Particular examples of split reporter systems are provided in Dixon et al. 2017.
In some embodiments, the first component polypeptide comprises a first reporter component; and the second component polypeptide comprises a second reporter component, and wherein the first component polypeptide and second component polypeptide form a reporter system upon dimerization, optionally wherein the reporter system provides an increased colorimetric, luminescent, or a fluorescent signal when the binding member is bound to the T-SM complex.
Split Apoptotic Protein
The dimerization-inducible protein may be a split apoptotic protein. A split apoptotic protein is any protein that is capable of inducing apoptosis when the first and second component polypeptides of the split apoptotic protein dimerise. An example of a split apoptotic protein is a split caspase (e.g. split caspase 9 or split caspase 3), that is capable to inducing apoptosis upon dimerization and as such can be used to kill specific cells that contain the split apoptotic protein (e.g. diseased cells, or therapeutic cells that have been administered for cell therapy purposes). Examples of split caspases are provided in Chelur et al. 2007. The use of an inducible caspase 9 suicide gene system is described, for example, in Gargett et al. 2014.
In some embodiments, the first component polypeptide comprises a first caspase component; and the second component polypeptide comprises a second caspase component, wherein the first component polypeptide and second component polypeptide form a caspase upon dimerization. The split caspase may be capable of inducing cell death when the binding member is bound to the T-SM complex.
In certain embodiments, the first and second caspase components are identical, for example both caspase components comprise caspase 9 activation domains. An exemplary caspase 9 activation domain is provided as amino acids residues 152-414 of the human caspase 9 amino acid sequence provided as NCBI accession number AAO21133.1 (version 1; last updated 1 Dec. 2009). In cases where the first and second caspase components are identical, the first and second caspase components may be encoded from the same expression cassette. For example, a split apoptotic protein may be encoded from one or more expression cassettes encoding the target protein, the binding member and the caspase 9 activation domain, where both the target protein and the binding member are fused to a caspase 9 activation domain. Upon expression, a plurality of proteins comprising the target protein, binding member and caspase 9 activation domain are produced and dimerization of the caspase 9 activation domains (i.e. at least a first and a second caspase 9 activation domain) can be regulated through the addition of the small molecule.
In certain exemplary embodiments, the split apoptotic protein comprises an amino acid sequence having at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100% sequence identity to SEQ ID NO: 223.
Other Dimerization-Inducible Proteins
Other dimerization proteins contemplated for use with the present disclosure include split therapeutic proteins, split TEV proteases and split Cas9. A split therapeutic protein is any protein that is capable of exerting a therapeutic effect when the first and second component polypeptides of the split therapeutic protein dimerize.
In one embodiment the expression vector is a viral vector. Suitable viral vectors for use include adeno-associated virus vectors, adenovirus vectors, herpes simplex virus vectors, retrovirus vectors, lentivirus vectors, alphavirus vectors, flavivirus vectors, rhabdovirus vectors, measles virus vectors, Newcastle disease virus vectors, poxvirus vectors and picornavirus vectors.
As used herein a viral vector means a DNA expression vector which comprises the first and second expression cassettes such that the expression cassettes are converted into a viral genome that is packaged in the viral particle when expressed in a cell alongside the necessary components for the assembly of the viral particle. Additionally, in one embodiment, the viral vector comprises a third expression cassette encoding a desired expression product.
In a particular embodiment the expression vector is an adeno-associated virus (AAV) vector. AAVs are one of the most actively investigated gene therapy vehicles and are characterized by excellent safety profile and high efficiency of transduction in a broad range of target tissues. The use of AAVs as a vector for gene therapy is described in for example Naso et al. 2017 and Colella et al. 2018.
Various AAV serotypes, including AAV1, AAV3, AAV4, AAV5, AAV6, AAV6.2, AAV6.2FF, AAV8, AAV 8.2, AAV9, and AAV rh10 and pseudotyped AAV such as AAV2/8, AAV2/5 and AAV2/6 can also be used in accordance with the present disclosure. Further examples of serotypes and their isolation are described in Srivastava, 2006.
The AAV particle is a small (25-nm) virus from the Parvoviridae family, and it is composed of a non-enveloped icosahedral capsid (protein shell) that contains a linear single-stranded DNA genome of around 4.8 kb. The AAV genome encodes for several protein products, namely, four non-structural Rep proteins, three capsid proteins (VP1-3), and the assembly-activating protein (AAP). The AAV genes are flanked by two AAV-specific palindromic inverted terminal repeats (ITRs).
Thus, where the expression vector is an AAV vector, this may mean that the first and second expression cassettes are flanked by ITRs (e.g. ITR-first expression cassette-second expression cassette-ITR), such that the expression cassettes are converted into a single-stranded genome that is packaged in an AAV particle when expressed in a cell alongside the necessary components for the assembly of the AAV particle.
The AAV vector may be engineered, for example in order to improve their function. Examples of AAVs that have been engineered for clinical gene therapy are described in Kotterman and Schaffer, 2014.
AAV vectors have a packaging capacity of less than 5 kb, which can limits the size of the genetic material (e.g. expression cassettes) that can be introduced in the viral genome. As demonstrated herein, the use of components that have a relatively small size, such as Tn3 proteins and scFvs as the binding members, allow for the expression cassette(s) encoding the tripartite complex (e.g. as part of a dimerization-inducible protein such as a split transcription factor) to fit within a single AAV vector. As additionally demonstrated herein, the small size of the expression cassette(s) encoding the tripartite complex allowed for a transgene (e.g. as part of a third expression cassette) to be introduced into the same AAV vector as the components of the split transcription factor, allowing the split transcription factor to be delivered “in cis” with the transgene.
The disclosure also includes in vitro methods of making viral particles. In one embodiment, a method of making viral particles involves transfecting host cells such as mammalian cells with a viral vector as described herein and expressing viral proteins necessary for particle formation in the cells and culturing the transfecting cells in a culture medium, such that the cells produce viral particles. The viral particles may be released into the culture medium, or the method may additionally involve lysing and isolating particles from the cell lysates. An example of a suitable mammalian cell is a human embryonic kidney (HEK) 293 cell.
Typically, multiple plasmid expression vectors are utilised to generate the various protein components that generate the viral particles. It is also possible to make use of cell lines that constitutively express components for viral packaging, enabling the use of few plasmids.
For example, construction of an AAV particle requires the Rep and Cap proteins and additional genes from adenovirus to mediate AAV replication. Making AAV particles is described for example in Robert et al. 2017
An exemplary method of producing AAV particles is described in Robert et al. 2017. Briefly, this involves transfection of a mammalian cell line, such as HEK293 cells, with three plasmids. One vector encodes the rep and cap genes of AAV (pRepCap) using their endogenous promoters; one vector (pHelper) encodes three additional adenoviral helper genes (E4, E2A and VA RNAs) not present in HEK293 cells and; one vector (the viral vector) (pAAV-GOI) contains the one or more expression cassettes flanked by two ITRs. See FIG. 2 of Robert et al.
Following release of viral particles, the culture medium comprising the viral particles may be collected and, optionally the viral particles may be separated from the cell lysate. Optionally, the viral particles may be concentrated.
Following production and optional concentration, the viral particles may be stored, for example by freezing at −80° C. ready for use by administering to a cell and/or use in therapy.
The disclosure also provides viral particles, such as AAV particles, for example those produced by the methods described herein. As used herein, a viral particle comprises a viral genome packaged within the viral envelope that is capable of infecting a cell, e.g. a mammalian cell.
Disclosed herein are one or more viral particles comprising a viral genome encoding:
Also disclosed herein are one or more viral particles comprising:
In some embodiments, the first and second expression cassettes form part of the same viral genome of a viral particle. In other embodiments, the first expression cassette is located in a first viral genome of a first viral particle and the second expression cassette is located in a second viral genome of a second viral particle.
The expression cassette, target protein, binding member, small molecule and first and second component polypeptides may be as further defined above. Depending on the viral particle used, the viral genome may be a single stranded or double stranded nucleic acid and may be RNA or DNA. For example, when the viral particle is an AAV particle, the viral genome is a single stranded DNA viral genome. The viral genome may encode the split proteins as defined above.
The agents (i.e. the one or more expression vectors, expression products or viral particles, plus small molecule) may be administered to a patient as part of a method of treatment or a method of prophylaxis of a disease. Following binding of the binding member to the T-SM complex the recipient individual may experience a reduction in symptoms of the disease or disorder being treated. This may have a beneficial effect on the disease condition in the individual.
The term “treatment,” as used herein in the context of treating a condition, pertains generally to treatment and therapy of a human, in which some desired therapeutic effect is achieved, for example, the inhibition of the progress of the condition, and includes a reduction in the rate of progress, a halt in the rate of progress, regression of the condition, amelioration of the condition, and cure of the condition. Treatment as a prophylactic measure (i.e., prophylaxis, prevention) is also included.
“Prophylaxis” in the context of the present specification should not be understood to circumscribe complete success i.e. complete protection or complete prevention. Rather prophylaxis in the present context refers to a measure which is administered in advance of detection of a symptomatic condition with the aim of preserving health by helping to delay, mitigate or avoid that particular condition.
The method of treatment may involve expressing one or more dimerization-inducible proteins as defined further herein in a cell. The dimerization-inducible protein may, for example, comprise a first component polypeptide and a second component polypeptide that form a therapeutic polypeptide upon dimerization. In this way, addition of the small molecule can result in the therapeutic protein having increased activity and can be used, for example, in a method of treatment of a disease where the therapeutic protein is deficient.
Disclosed herein is a method of regulating the expression of a desired expression product in a cell, comprising i) expressing a dimerization-inducible protein described herein in the cell, wherein the first and second component polypeptides form a transcription factor upon dimerization, and wherein the DNA binding domain binds to a target sequence in the cell such that the transcription factor is capable of regulating (i.e. increasing or decreasing) expression of the desired expression product in the cell, and ii) administering the small molecule to the cell in order to regulate expression of the desired expression product.
Additionally disclosed herein is a dimerization-inducible protein for use in a method of regulating the expression of a desired expression product in a cell in a human or animal subject, the method comprising expressing the dimerization-inducible protein described herein in the cell, wherein the first and second component polypeptides form a transcription factor upon dimerization, and administering the small molecule to the cell in order to regulate (e.g. increase or decrease) expression of the desired expression product. Also disclosed herein is a small molecule for use in a method of regulating the expression of a desired expression product in a cell in a human or animal subject, the method comprising expressing the dimerization-inducible protein described herein in the cell, wherein the first and second component polypeptides form a transcription factor upon dimerization, and administering the small molecule to the cell in order to regulate (e.g. increase or decrease) expression of the desired expression product.
The method may comprise administering one or more expression vectors or viral particles as described herein in order to express the dimerization-inducible protein in the cell. In other embodiments the method may comprise administering an expression product produced from the one or more expression vectors, e.g. mRNA encoding the dimerization-inducible protein, to the cell. The particular administration would be at the discretion of the physician who would also select dosages using his/her common general knowledge and dosing regimens known to a skilled practitioner.
The desired expression product can be RNA or a peptidic (peptide, polypeptide or protein). Preferably the desired expression product is peptidic. The desired expression product may be a therapeutic protein, i.e. a protein that exerts a therapeutic effect in the subject.
The desired expression product may be part of an endogenous gene present in the genome of the target cell. For example, where the method is carried out in a human cell, the desired expression product may be part of a human gene. Alternatively, the desired expression product may be part of a transgene delivered to the target cell, e.g. a therapeutic transgene. Regulating expression of the gene may be used in a method of treatment or a method of prophylaxis of a disease. Following expression of the split transcription factor and administration of the small molecule, the recipient individual may exhibit reduction in symptoms of the disease or disorder being treated. This may have a beneficial effect on the disease condition in the individual.
Where the target sequence is part of a transgene delivered to the cell, the method may further comprise administering a third expression cassette to the cell, wherein the third expression cassette encodes the desired expression product and wherein the third expression cassette comprises the target sequence. The transgene may comprise a promoter that is operably linked to a coding sequence for the desired expression product, which may be a therapeutic protein, e.g. a therapeutic antibody. An example of a therapeutic antibody is MEDI8852, having the heavy chain amino acid sequence set forth as SEQ ID NO: 205 and the light chain amino acid sequence set forth as SEQ ID NO: 206. The third expression cassette may be part of the same expression vector or viral particle as one or both of the first and second expression cassettes. In other words, the transgene may be delivered “in cis” with the split transcription factor to the cell, such within the same viral (e.g. AAV) particle. Alternatively, the third expression cassette may be part of a different expression vector or viral particle as one or both of the first and second expression cassettes. In other words, the transgene may be delivered “in trans” with the split transcription factor to the cell, such as within separate viral (e.g. AAV) particles. As demonstrated herein, the split transcription factors of the disclosure are suitable for both “in cis” and “in trans” delivery with the transgene.
The target sequence may be located in or in close proximity to a promoter that is operably linked to a coding sequence for the desired expression product. By “close proximity” it is meant that the target sequence is within 500 bp, within 250 bp, within 100 bp, within 50 bp, or within 25 bp of the sequence corresponding to the promoter.
Administration to the cell may occur by any suitable means. For example, the expression cassettes may be delivered by viral, e.g. as part of a viral particle described herein, or by non-viral means. Non-viral means of delivery include electroporation, lipofection, microinjection, biolistics, virosomes, liposomes, immunoliposomes, polycation or lipid:nucleic acid conjugates, naked DNA, naked RNA, artificial virions, and agent-enhanced uptake of DNA. In one embodiment, the expression cassettes are delivered as mRNA. In one embodiment, the expression cassettes are delivered as DNA plasmids.
In any of the in vivo methods disclosed herein, the small molecule may be orally administered to a human subject, for example in an acceptable dosage form such as a capsule, tablet, aqueous suspension or solution. The amount used will depend on the host treated and the particular mode of administration. The small molecule may be administered as a single dose, multiple doses or over an established period of time.
Where the method involves administering a viral particle to a cell, the unit dose may be calculated in terms of the dose of viral particles being administered. Viral doses include a particular number of virus particles or plaque forming units (pfu) or viral genome copies (vgc). For embodiments involving AAV, particular unit doses include 103, 104, 105, 106, 107, 108, 109, 1010, 1011, 1012, 1013, 1014, 1015, 1016 viral genome copies (vgc) per kg of body weight. Particle doses may be somewhat higher (10 to 100-fold) due to the presence of infection-defective particles.
Without wishing to be bound by theory, infection and transduction of cells by viral particles (e.g. AAV particles) is believed to occur by a series of sequential events as follows: interaction of the viral capsid with receptors on the surface of the target cell, internalization by endocytosis, intracellular trafficking through the endocytic/proteasomal compartment, endosomal escape, nuclear import, virion uncoating, and viral DNA double-strand conversion that leads to the transcription and expression of proteins encoded by the viral genome in the viral particle.
While it is possible for the one or more expression vectors, expression products, viral particles, and small molecules to be used (e.g., administered) alone, it is often preferable to present the individual components as a composition or formulation e.g. with a pharmaceutically acceptable carrier or diluent.
For example, the one or more viral particles may be administered as a pharmaceutical composition comprising the one or more viral particles and a pharmaceutically acceptable carrier or diluent. As another example, the small molecules may be administered as a pharmaceutical composition comprising the small molecule and a pharmaceutically acceptable carrier or diluent.
The term “pharmaceutically acceptable,” as used herein, pertains to compounds, ingredients, materials, compositions, dosage forms, etc., which are, within the scope of sound medical judgment, suitable for use in contact with the tissues of the subject in question (e.g., human) without excessive toxicity, irritation, allergic response, or other problem or complication, commensurate with a reasonable benefit/risk ratio. Each carrier, diluent, excipient, etc. must also be “acceptable” in the sense of being compatible with the other ingredients of the formulation.
The agents (i.e. the one or more expression vectors, DNA plasmids or viral particles, plus small molecule) may be administered simultaneously or sequentially and may be administered in individually varying dose schedules and via different routes. For example, when administered sequentially, the agents can be administered at closely spaced intervals (e.g., over a period of 5-10 minutes) or at longer intervals (e.g., 1, 2, 3, 4 or more hours apart, or even longer periods apart where required), the precise dosage regimen being commensurate with the properties of the agent(s) being administered. In one embodiment, the small molecule is administered after administration of the one or more expression vectors, DNA plasmids or viral particles.
Also provided are methods of cellular therapy. Cellular therapy involves administering cells that have been genetically modified to express an expression product, such as a dimerization-inducible protein, to a patient.
Cells such as stem cells may be used methods of cellular therapy. One potential advantage associated with using stem cells is that they can be differentiated into other cell types in vitro, and can be introduced into a mammal (such as the donor of the cells) where they will engraft in the bone marrow. Suitable stem cells include embryonic stem cells, induced pluripotent stem cells, hematopoietic stem cells, mesenchymal stem cells, neuronal stem cells, cardiac stem cells and mesenchymal stem cells.
For example, the cellular therapy may involve administering the one or more expression vectors described herein to a cell (e.g. a stem cell) in an ex vivo method such that a dimerization-inducible protein is expressed by the cell and administering the cell to a patient. Following administration of the cell expressing the dimerization-inducible protein, a small molecule may be administered to the individual in order to induce dimerization of the first and second component polypeptides in order to reconstitute their function upon dimerization. For example, the first and second component polypeptides may form a transcription factor upon dimerization, or the first and second component polypeptides may form a CAR upon dimerization.
Disclosed herein is a method of treatment comprising administering a cell expressing a dimerization-inducible protein defined herein to a patient, the method comprising:
The dimerization-inducible protein may be for example a split transcription factor, a split CAR, a split apoptotic protein or a split therapeutic protein. The method of treatment may be a method of treating cancer.
Cellular therapy may involve isolating cells from a patient, transfecting the cells with one or more expression vectors ex vivo and the cells are administered to the patient. Various cell types suitable for ex vivo transfection are well known to those of skill in the art (see, e.g., Freshney et al., Culture of Animal Cells, A Manual of Basic Technique (3rd ed. 1994)) and the references cited therein for a discussion of how to isolate and culture cells from patients).
For example, the cellular therapy may involve isolating a cell from a patient, administering the one or more expression vectors described herein to the cell in an ex vivo method such that a dimerization-inducible protein is expressed by the cell, and administering the cell back to the patient. Following administration of the cell expressing the dimerization-inducible protein, a small molecule may be administered to the individual in order to induce dimerization of the first and second component polypeptides as described herein.
In one embodiment, the cell is an immune cell (such as a T-cell) and the dimerization-inducible protein expressed by the cell is a split CAR. Methods of treatment involving CAR T-cell therapy are known in the art and are described for example in Miliotou and Papadopoulou, 2018.
Disclosed herein is a method of treatment comprising administering a cell expressing the dimerization-inducible protein defined herein to a patient thereof, wherein the first and second component polypeptide form a CAR upon dimerization, the method comprising:
The method of treatment may be a method of treating cancer.
The disclosure also provides a nucleic acid molecule or molecules encoding a binding member or dimerization-inducible protein defined herein. The nucleic acid molecule or molecules may be isolated nucleic acid molecule or molecules. The nucleic acids encoding the binding members and dimerization-inducible proteins may have the requisite features and sequence identity as described herein in relation to the expression vectors. The skilled person would have no difficulty in preparing such nucleic acid molecules using methods well-known in the art.
In some embodiments the nucleic acid molecule or molecules encode the VH and/or VL domain(s) of PRSIM_57, PRSIM_01, PRSIM_04, PRSIM_67, PRSIM_72, or PRSIM_75. The amino acid sequences for those VH or VL domains are defined herein.
In some embodiments, the nucleic acid molecule or molecules encode the binding member of PRSIM_23, PRSIM_32, PRSIM_33, PRSIM_36, PRSIM_47, PRSIM_57, PRSIM_01, PRSIM_04, PRSIM_67, PRSIM_72, or PRSIM_75. The amino acid sequences for those binding members are defined herein.
In some embodiments, the nucleic acid molecule or molecules comprise a nucleic acid sequence having at least 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence identity with the exemplary nucleic acid sequences set forth for PRSIM_23, PRSIM_32, PRSIM_33, PRSIM_36, PRSIM_47, PRSIM_57, PRSIM_01, PRSIM_04, PRSIM_67, PRSIM_72, or PRSIM_75. In some embodiments, the nucleic acid molecule or molecules comprise a nucleic acid sequence of PRSIM_23, PRSIM_32, PRSIM_33, PRSIM_36, PRSIM_47, PRSIM_57, PRSIM_01, PRSIM_04, PRSIM_67, PRSIM_72, or PRSIM_75. The nucleic acid sequences for those exemplary binding members are set forth in the following table:
In some embodiments, the nucleic acid molecule or molecules encodes the first component polypeptide and/or second component polypeptides fused to the target protein or binding member as described above. The amino acid sequences for those component polypeptides are defined herein.
In some embodiments, the nucleic acid molecule or molecules encodes one or more of the DBD-T fusion protein, TRD-BM fusion protein, DBD-BM fusion protein, and TRD-T fusion protein as described above. The amino acid sequences for those fusion proteins are defined herein.
In some embodiments, the nucleic acid molecule or molecules encoding a TRD-T fusion protein has a nucleic acid sequence having at least 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence identity with the nucleic acid sequence set forth in SEQ ID NO: 108. In some embodiments, the nucleic acid molecule or molecules encoding a TRD-T fusion protein has the nucleic acid sequence of SEQ ID NO: 108.
In some embodiments, the nucleic acid molecule or molecules encoding a DBD-T fusion protein has a nucleic acid sequence having at least 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence identity with the nucleic acid sequence set forth in SEQ ID NO: 109. In some embodiments, the nucleic acid molecule or molecules encoding a DBD-T fusion protein has the nucleic acid sequence of SEQ ID NO: 109.
In some embodiments, the nucleic acid molecule or molecules encoding a DBD-BM fusion protein has a nucleic acid sequence having at least 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence identity with any one of the nucleic acid sequences set forth in SEQ ID NOs: 110-120. In some embodiments, the nucleic acid molecule or molecules encoding a DBD-BM fusion protein has the nucleic acid sequence of any one of SEQ ID NOs: 110-120.
In some embodiments, the nucleic acid molecule or molecules encoding a TRD-BM fusion protein has a nucleic acid sequence having at least 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence identity with any one of the nucleic acid sequences set forth in SEQ ID NO: 121-131. In some embodiments, the nucleic acid molecule or molecules encoding a TRD-BM fusion protein has the nucleic acid sequence of any one of SEQ ID NOs: 121-131.
In some embodiments the nucleic acid molecule or molecules encode a split CAR as defined herein. In some embodiments the nucleic acid molecule or molecules encoding a split CAR has a nucleic acid sequence having at least 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence identity with the nucleic acid sequence set forth in SEQ ID NO: 133 and a nucleic acid sequence encoding the antigen-specific recognition domain. In some embodiments, the nucleic acid molecule or molecules encoding a split CAR has the nucleic acid sequence of SEQ ID NO: 133 and a nucleic acid sequence encoding the antigen-specific recognition domain. In some embodiments, the nucleic acid molecule or molecules encoding a split CAR comprises a nucleic acid sequence encoding an antigen-specific recognition domain (e.g. an scFv) located between positions 66 and 67, wherein the nucleotide numbering corresponds to SEQ ID NO: 133.
An isolated nucleic acid molecule may be used to express a binding member or dimerization-inducible protein disclosed herein. The nucleic acid will generally be provided in the form of one or more expression vectors, for example having the features of the expression vectors described herein.
The disclosure also provides kits that comprise one or more expression vectors, one or more viral particles, cells, or one or more nucleic acids, all as defined herein, with a small molecule, also as defined herein. In some embodiments, the small molecule is simeprevir. Where the one or more expression vector or nucleic acid encodes a polypeptide containing a DNA binding domain that is from a CRISPR/Cas system, the kit may additionally include a guide RNA specific for the target sequence, or a nucleic acid encoding the guide RNA specific for the target sequence.
Sequence identity is commonly defined with reference to the algorithm GAP (Wisconsin GCG package, Accelerys Inc, San Diego USA). GAP uses the Needleman and Wunsch algorithm to align two complete sequences, maximising the number of matches and minimising the number of gaps. Generally, default parameters are used, with a gap creation penalty equaling 12 and a gap extension penalty equaling 4. Use of GAP may be preferred but other algorithms may be used, e.g. BLAST (which uses the method of Altschul et al. (1990)), FASTA (which uses the method of Pearson and Lipman (1988)), or the Smith-Waterman algorithm (Smith and Waterman (1981)), or the TBLASTN program, of Altschul et al. (1990) supra, generally employing default parameters. In particular, the psi-Blast algorithm may be used.
Where the disclosure makes reference to a particular amino acid sequence having at least 90% sequence identity to a reference amino acid sequence, this includes the amino acid sequence having 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% and 100% sequence identity to the reference amino acid sequence.
The term “sequence alterations” as used herein is intended to encompass the substitution, deletion and/or insertion of an amino acid residue. Thus, a protein containing one or more amino acid sequence alterations compared to a reference sequence contains one or more substitutions, one or more deletions and/or one or more insertions of an amino acid residues as compared to the reference sequence. The term “amino acid mutation” is also herein used interchangeably with “sequence alteration”, unless the context clearly identifies otherwise.
In some embodiments in which one or more amino acids are substituted with another amino acid, the substitutions may be conservative substitutions, for example according to the following Table. In some embodiments, amino acids in the same block in the middle column are substituted, i.e. a non-polar amino acid is substituted for another non-polar amino acid for example. In some embodiments, amino acids in the same line in the rightmost column are substituted, i.e. G is substituted for A or P for example.
In some embodiments, substitution(s) may be functionally conservative. That is, in some embodiments the substitution may not affect (or may not substantially affect) one or more functional properties (e.g. binding affinity) of the protein comprising the substitution as compared to the equivalent unsubstituted protein.
The binding member may also comprise a variant of a BC, DE or FG loop, Tn3, CDR, VH domain, VL domain, and/or scFv sequence as disclosed herein. Suitable variants can be obtained by means of methods of sequence alteration, or mutation, and screening. In a preferred embodiment, a binding member comprising one or more variant sequences retains one or more of the functional characteristics of the parent binding member, such as binding specificity and/or binding affinity for the T-SM complex. For example, a binding member comprising one or more variant sequences preferably binds to T-SM complex with the same affinity as, or a higher affinity than, the (parent) binding member. The parent binding member is a binding member which does not comprise the amino acid substitution(s), deletion(s), and/or insertion(s) which has (have) been incorporated into the variant binding member.
For example, a binding member may comprise a BC, DE or FG loops, Tn3, CDR, VH domain, VL domain, or scFv sequence which has at least 70%, at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, at least 99.1%, at least 99.2%, at least 99.3%, at least 99.4%, at least 99.5%, at least 99.6%, at least 99.7%, at least 99.8%, or at least 99.9% sequence identity to a BC, DE or FG loops, Tn3, CDR, VH domain, VL domain, or scFv sequence disclosed herein.
A binding member may comprise a BC, DE or FG loops, Tn3, CDR, VH domain, VL domain, or scFv sequence which has one or more amino acid sequence alterations (addition, deletion, substitution and/or insertion of an amino acid residue), preferably 20 alterations or fewer, 15 alterations or fewer, 10 alterations or fewer, 5 alterations or fewer, 4 alterations or fewer, 3 alterations or fewer, 2 alterations or fewer, or 1 alteration compared with a BC, DE or FG loops, Tn3, CDR, VH domain, VL domain, or scFv sequence disclosed herein.
The features disclosed in the foregoing description, or in the following claims, or in the accompanying drawings, expressed in their specific forms or in terms of a means for performing the disclosed function, or a method or process for obtaining the disclosed results, as appropriate, may, separately, or in any combination of such features, be utilised for realising the present disclosure in diverse forms thereof.
While the present disclosure has been described in conjunction with the exemplary embodiments described above, many equivalent modifications and variations will be apparent to those skilled in the art when given this disclosure. Accordingly, the exemplary embodiments of the present disclosure set forth above are considered to be illustrative and not limiting. Various changes to the described embodiments may be made without departing from the spirit and scope of the present disclosure.
For the avoidance of any doubt, any theoretical explanations provided herein are provided for the purposes of improving the understanding of a reader. The inventors do not wish to be bound by any of these theoretical explanations.
Any section headings used herein are for organizational purposes only and are not to be construed as limiting the subject matter described.
Throughout this specification, including the claims which follow, unless the context requires otherwise, the word “comprise” and “include”, and variations such as “comprises”, “comprising”, and “including” will be understood to imply the inclusion of a stated integer or step or group of integers or steps but not the exclusion of any other integer or step or group of integers or steps.
It must be noted that, as used in the specification and the appended claims, the singular forms “a,” “an,” and “the” include plural referents unless the context clearly dictates otherwise. Ranges may be expressed herein as from “about” one particular value, and/or to “about” another particular value. When such a range is expressed, another embodiment includes from the one particular value and/or to the other particular value. Similarly, when values are expressed as approximations, by the use of the antecedent “about,” it will be understood that the particular value forms another embodiment. The term “about” in relation to a numerical value is optional and means for example+/−10%.
The Visual Molecular Dynamics (VMD) software (University of Illinois at Urbana-Champaign) built-in measure sasa command was used to calculate the solvent accessible surface area (SASA) of simeprevir from the three-dimensional structure of HCV NS3/4A PR:simeprevir complex available from the Protein Data Bank (PDB; http://www.rcsb.org/); PDB code 3KEE. The—restrict option, and a radius of 1.4 Å was used to calculate the surface of simeprevir not bound to HCV NS3/4A PR, in other words, the solvent accessible surface area.
The sequence used in the design of HCV NS3/4A PR constructs is derived from Uniprot entry A8DG50 (Hepatitis C virus subtype 1a genome polyprotein) and incorporates additional modifications from US patent U.S. Pat. No. 6,800,456. The protease domain corresponds to residues 1030-1206 of the polyprotein. A single chain consisting of an 11-residue peptide derived from the viral NS4A protein fused to the N-terminus of NS3 protease (SEQ ID 1) was used to create a fully folded and activated polypeptide. This sequence with N-terminal hexahistidine (6His) and AviTag (SEQ ID 3) (to enable affinity purification and biotinylation, respectively) was purchased as a linear DNA string (GeneArt). In parallel, a DNA string encoding an equivalent sequence with the active site mutation S139A (SEQ ID 4) was ordered. The DNA strings were cloned into the pET-28a vector (for bacterial expression) using Gibson assembly. A second set of DNA strings were ordered encoding human codon-optimised versions of the His and Avitag tagged WT and S139A protease and these were cloned into a mammalian expression vector with a CMV promoter. The sequences of the final constructs were verified via Sanger sequencing of the entire coding sequences.
For bacterial expression, the pET-28a plasmids were transformed into BL21 (DE3) E. coli cells and selected on plates containing kanamycin (50 μg/ml). For each expression, a single colony was used to inoculate a 5 ml 2×TY+50 μg/ml kanamycin culture that was grown at 37° C. overnight. This culture was used to inoculate 500 ml TB Autoinduction medium (Formedium, supplemented with 10 ml/L glycerol and 100 μg/ml kanamycin) at 1:500 dilution. The culture was grown at 37° C. to an OD600 of 1.3-1.5 and then transferred to 20° C. for 20 hours for expression to be induced. Cells were harvested by centrifugation and the pellets were stored at −80° C.
For mammalian expression, plasmid DNA was prepared with the Qiagen Plasmid Plus Gigaprep kit. Gigaprep DNA was transfected into Expi293F cells (ThermoFisher) cultured in FreeStyle293 medium (ThermoFisher) using PEI-mediated delivery with cells at a density of 2.5×106 cells/ml at the point of transfection. Cells were cultured at 37° C., 5% CO2, 140 rpm, 70% humidity for 6 days. Cells were harvested at 4,000 g and pellets stored at −80° C.
For protein purification, each bacterial pellet from 500 ml culture was thawed and re-suspended in 50 ml lysis buffer (2×DPBS, 200 mM NaCl, pH 7.4). The cells were lysed using a probe sonicator and the lysate was clarified by centrifugation at 50,000 g for 40 min at 4° C. Mammalian cell pellets were lysed via resuspension in lysis buffer containing detergent (2×DPBS, 200 mM NaCl, 1 mM TCEP, cOmplete, EDTA-free Protease Inhibitor and 25 U/ml Turbonuclease, 1% Triton X-100, pH 7.4) and rotation at 10 rpm, 4° C. for 2 hours. The mammalian lysed sample was centrifuged at 50,000 g, 30 min, 4° C. All samples were filtered with 0.22 μm bottle-top filtration devices prior to column chromatography. The filtered supernatant was loaded on a 5 ml HisTrap HP column (GE Healthcare) at 5 ml/min flow rate. The column was washed with 100 ml wash buffer (2×DPBS, 200 mM additional NaCl, 20 mM Imidazole, pH 7.4) and eluted with an imidazole gradient over 5 column volumes from 20-400 mM imidazole. Fractions were analysed by SDS-PAGE and those that were enriched for the correct protein were pooled and buffer exchanged with a HiPrep 26/10 Desalting column (GE Healthcare) into lysis buffer (2×DPBS, 200 mM NaCl, pH 7.4). Desalted protein fractions were pooled, concentrated with a centrifugal concentration device and were purified on a HiLoad Superdex 75 26/600 μg column (GE Healthcare) equilibrated in 2×DPBS, 2 mM DTT, 10 μM ZnCl2. Fractions were analysed by SDS-PAGE and those that were >95% pure were pooled, had their concentration determined via UV absorbance, and were snap frozen in liquid nitrogen prior to storage at −70° C. Final sample purity was verified with RP-HPLC on an XBridge BEH300, C4 (Waters).
The purified protein was biotinylated on its AviTag using an MBP-tagged BirA enzyme incubated with sample for 2.5 hours at 22° C. in the presence of ATP and biotin. Biotinylated protein was purified via size exclusion chromatography on a HiLoad Superdex 75 16/600 μg column (GE Healthcare) in 2×DPBS, 2 mM DTT, 1 μM ZnCl2. Fractions were analysed by SDS-PAGE and those containing the protease were pooled and the extent of biotinylation was confirmed by intact mass spectrometry on a Xevo G2-CS MS (Waters). Biotinylated protein was split into aliquots, snap frozen in liquid nitrogen and stored at −70° C.
For production of His- and Avitag tagged NS3/4A S139A protease with the introduction of additional mutations either to reduce affinity for simeprevir, the pET-28a derived plasmid encoding the protease was used as a template for site-directed mutagenesis with the Quikchange Lightning site-directed mutagenesis kit. Mutant forms of the protease construct were verified via Sanger sequencing of the entire coding sequences prior to expression. Mutant proteins were transformed into a BL21(DE3) E. coli derivative bearing a plasmid for IPTG-inducible overexpression of BirA biotin protein ligase to enable biotinylation during bacterial expression. An overnight culture was used to inoculate 50 ml 2×TY+50 μg/ml kanamycin at a 1:20 dilution. The culture was grown at 37° C. to an OD600 of 0.6, and then supplemented with 50 μM biotin and induced with 1 mM IPTG. The induced culture was transferred to 25° C. for 20 hours for expression. Cells were harvested by centrifugation and the pellets were stored at −20° C. For purification, each pellet was resuspended in 20 ml lysis buffer (50 mM HEPES, 500 mM NaCl, 1 mM TCEP, cOmplete, EDTA-free Protease Inhibitor) and lysed by passage through a cell disruptor (Constant Systems) at 40,000 kpsi. Protein was purified in an automated 2-step procedure of IMAC followed by buffer exchange with a desalting column. Once loaded on an IMAC resin, sample was washed with lysis buffer supplemented with 20 mM imidazole and eluted with buffer containing 400 mM imidazole. Eluate was automatically captured and loaded on a desalting column equilibrated in 50 mM HEPES, 300 mM NaCl, 0.5 mM TCEP, pH 7.5. Final protein samples were split into aliquots, snap frozen in liquid nitrogen and stored at −70° C.
To assess enzyme activity, cleavage of a fluorogenic HCV protease FRET substrate with an EDANS-DABCYL donor-quencher pair by purified HCV NS3/4A PR and the S139A mutant (RET 51, AnaSpec) was measured. When in close proximity (10-100 Å), as would be the case for the intact peptide, EDANS is excited at 340 nm, and the energy emitted from EDANS (at 490 nm) is quenched by DABCYL. Cleavage of the peptide by the HCV NS3/4A PR separates DABCYL from EDANS, allowing detection of fluorescence at 490 nm.
Serial dilutions of HCV NS3/4A PR and the active site mutant S139A in assay buffer (HEPES pH 7.8, 5 mM DTT, 100 mM NaCl, 10% glycerol, 0.01% CHAPS) were incubated with the fluorogenic substrate at room temperature. Fluorescence was measured after 3 hours using a PerkinElmer Envision plate reader (excitation 340 nm, emission 490 nm).
Isothermal calorimetry (ITC) was carried out using the Auto-ITC 200 (Malvern), with a preliminary injection of 0.4 μl followed by 19 injections of 2 μl each, at 120 second intervals. Rotation of the solution was set to 750 rpm and temperature 37° C. Simeprevir (125 μM) was titrated into HCV NS3/4A PR (WT 8 μM and S139A mutant 8.2 μM) or protein buffer (control); the protein buffer was enriched with 2.5% DMSO to equal the amount present in the simeprevir solution. The WT was run once; the S139A mutant was run in duplicate. The data were analysed with the ITC-PEAQ software (Malvern) using a one-site binding model and reference subtraction point-by-point.
scFv and Tn3 sequences were isolated from phage display selections using three phage display libraries as follows (i) Library 1, a Tn3 library developed as an FnIII alternative scaffold based on the third such module in human tenascin C ((Leahy et al. 1992), (Oganesyan et al. 2013), (Gilbreth et al. 2014)), (ii) Library 2, a restricted framework scFv library and (iii) Library 3 a naïve scFv library.
All phage selections were performed according to previously established protocols ((Vaughan et al. 1996), (Swers et al. 2013)). Phage display selections were performed using biotinylated HCV NS3/4A PR (S139A) captured on streptavidin coated magnetic beads (Promega). In total, 4 rounds of phage display selection were performed for each phage library, using decreasing concentrations of biotinylated HCV NS3/4A PR and simeprevir (
The biotinylated HCV NS3/4A PR (S139A) antigen was pre-incubated with a 50-fold molar excess of simeprevir prior to selections commencing, to ensure saturation of the protease. Prior to each selection, the phage pool was incubated with streptavidin beads alone to deplete the library of any binders to the streptavidin beads. For phage display selections rounds 1 and 2, no deselection step on biotinylated HCV NS3/4A PR (S139A) in the absence of simeprevir was performed. However, for rounds 3 and 4, selections were performed in parallel, with one arm having no deselection step on biotinylated HCV NS3/4A PR (S139A), and the other arm having a deselection step where the phage particles were pre-incubated with 250 nM biotinylated HCV NS3/4A PR (S139A) for 15 minutes at room temperature prior to removing the protease using streptavidin coated beads. Following this the resulting phage were then added to the biotinylated HCV NS3/4A PR (S139A) coated on streptavidin beads in the presence of simeprevir for the selection protocol.
Phage display selections were performed using the following concentrations of biotinylated HCV NS3/4A PR (S139A) at each round:
Round 1: 250 nM biotinylated HCV NS3/4A PR (S139A)+12.5 μM simeprevir
Round 2: 100 nM biotinylated HCV NS3/4A PR (S139A)+5 μM simeprevir
Round 3: 25 nM biotinylated HCV NS3/4A PR (S139A)+1.25 μM simeprevir
Round 4: 25 nM biotinylated HCV NS3/4A PR (S139A)+1.25 μM simeprevir
Following incubation with the biotinylated HCV NS3/4A PR (S139A) in the presence of simeprevir, the phage bound to the complex were washed three times with D-PBS (Sigma) followed by elution with trypsin. Eluted phage were used to infect mid-log phage cultures of E. coli TG1 cells and plated on agar plates (containing 100 μg/ml ampicillin and 2% (w/v) glucose).
Individual phage clones from round 3 and round 4 were picked for DNA sequencing and screening for antigen binding by phage ELISA. DNA sequence information is shown in Table 1.
Specific binding to HCV NS3/4A PR (S139A) was assessed by phage ELISA using single phagemid scFv or Tn3 clones induced for expression as described ((Osbourn et al. 1996)). Briefly, individual TG1 colonies encoding phage clones from round 3 and round 4 selection outputs, and negative control clones, were grown in 96 well plates at 3TC shaking at 280 rpm to log phase in media containing 100 μg/ml ampicillin and 2% (w/v) glucose. Helper phage was then added to each well and the plates incubated at 3TC for 1 hour, shaking at 150 rpm. Plates were then centrifuged at 4500 rpm for 10 minutes at room temperature and the media was removed and replaced with media containing 100 μg/ml ampicillin and 50 μg/ml kanamycin. Plates were then incubated overnight at 25° C., shaking at 280 rpm. The following day, phage preparations were blocked by adding an equal volume of 2×PBS containing 6% (w/v) skimmed milk powder (Marvel) to each well of the plate.
Biotinylated HCV NS3/4A PR (S139A) was used to coat 96 well streptavidin-coated plates at 5 μg/ml (1.875 μM) in the presence and absence of a 3-fold excess of simeprevir (5.6 μM). Coated plates were washed with PBS and blocked with PBS containing 3% (w/v) skimmed milk powder (Marvel) for one hour. Following this blocking step, the plate wells were washed three times with PBS, prior to adding the blocked phage preps (produced as described in the phage rescue section). Phage preps were incubated with the antigens for 1 hour at room temperature prior to washing three times with PBS/Tween 20 (0.1% v/v). Phage that bound specifically to the antigen coated plate were detected using an anti-M13 phage-HRP tagged antibody (GE Healthcare), followed by detection using 3,3′, 5,5′-Tetramethylbenzidine (TMB; Sigma). The detection reaction was stopped using 0.5 M H2S04 and plates were read using a fluorescent plate reader at 450 nm. Fluorescent readings determined for each clone binding to biotinylated HCV NS3/4A PR (S139A) in the presence of simeprevir was compared to those binding in the absence of simeprevir, by dividing the signal observed in the presence of simeprevir to the signal observed in the absence of simeprevir. These data were plotted on graphs (
Expression of scFv and Tn3 PRSIM Binding Molecules
scFv and Tn3 PRSIM binding molecules were purified from E. coli using methods previously described (Vaughan et al., 1996), using nickel-chelate chromatography, followed by size exclusion chromatography. To increase the expression level of the most promising Tn3 PRSIM binding molecules, the DNA sequences encoding them were subcloned to the pET16b vector, using the oligonucleotides Tn3_pETFwd2 (5′-CGATCATATGGACTACAAGGACGACGATGACAAGGGCAGCCGTCTGGATGCACCGAGCCAG-3′ (SEQ ID NO: 183)) and Tn3_pETRev2 (5′-ATCGGGATCCCTACAGACCGGTTTTAAGGTAATTTTTGCCGG-3′ (SEQ ID NO: 184)) and expressed cytoplasmically in BL21 (DE3) E. coli (New England Biolabs). Following lysis in BugBuster plus Benzonase (EMD Millipore), Tn3-based PRSIM binding molecules were purified to homogeneity using nickel-chelate chromatography, followed by size exclusion chromatography to provide a monomeric protein in PBS (pH 6.5).
scFv and Tn3 PRSIM binding molecules that were selective for the HCV NS3/4A PR (S139A) were identified in homogeneous time-resolved fluorescence (HTRF®) assays run in parallel to measure binding in the presence and absence of simeprevir. HCV NS3/4A PR (S139A), and serial dilutions of purified PRSIM binding molecules, were prepared in assay buffer (PBS containing 0.4 M potassium fluoride and 0.1% BSA). Streptavidin cryptate (Cisbio) was pre-mixed with either anti-FLAG XL665 (to detect the Tn3 molecules) or anti-c-myc XL665 (to detect the scFv molecules) in assay buffer. For each assay 2.5 μl of sample titration was added to 2.5 μl HCV NS3/4A PR (S139A) and 2.5 μl of pre-mixed detection reagents. Either 2.5 μl simeprevir or 2.5 μl of a DMSO blank were also added to each well. Background was defined using wells with zero sample addition. Assay plates were incubated overnight at 4° C., prior to reading the time resolved fluorescence at 620 nm and 665 nm emission wavelengths using a PerkinElmer Envision plate reader. Data was analysed by calculating % Delta F values for each sample. Delta F was determined according to equation 1.
% Delta F=((sample 665 nm/620 nm ratio value)−(background 665 nm/620 nm ratio value)/(background 665 nm/620 nm ratio value))×100 Equation 1
Selective binding molecules are defined as those scFv and Tn3 PRSIM binding molecules that bind to HCV NS3/4A PR (S139A) in complex with simeprevir and no binding to HCV NS3/4A PR (S139A).
The affinity of the scFv and Tn3 PRSIM binding molecules were measured using the Biacore 8K (GE Healthcare) at 25° C. The scFv and Tn3 PRSIM binding molecules were covalently immobilised to a CM5 chip surface using standard amine coupling techniques at a concentration of 1 μg/ml in 10 mM sodium acetate pH 4.5.
The HCV NS3/4A PR (S139A), or BSA control, was diluted 1:4 (1.25-20 nM)±10 nM simeprevir in 10 mM Hepes pH 7.4, 150 mM NaCl, 0.05% Surfactant P20, 0.01% DMSO, ensuring constant simeprevir and DMSO concentration. The samples were flowed over the chip at 50 μl/min using single cycle kinetics, with 120 sec association and 600 sec dissociation. The chip surface was regenerated with two 20 sec pulses of 10 mM Glycine-HCl pH 3.0. The final sensorgrams were analysed using the Biacore 8K Evaluation Software and the affinity constant KD was determined using a 1:1 binding model. The same method was used for measuring the affinity of the HCV NS3/4A PR mutants for PRSIM_23 with minor deviations. The mutants were diluted 1:4 (2.5-40 nM)±simeprevir in 10 mM Hepes pH 7.4, 150 mM NaCl, 0.05% Surfactant P20, 0.08% DMSO, ensuring constant simeprevir and DMSO concentration. The samples were flowed over the chip at 50 μl/min using single cycle kinetics, with 180 sec association and 600 sec dissociation.
The effect of simeprevir concentration on the formation of the HCV NS3/4A PR (S139A)/PRSIM binding molecule complex was also measured using the Biacore 8K. PRSIM_57 and PRSIM_23 were covalently immobilised on a CM5 chip surface, as before. Simeprevir was diluted 1:2 (0.0152-300 nM) in 10 mM Hepes pH 7.4, 150 mM NaCl, 0.05% Surfactant P20, 0.3% DMSO at a constant 40 nM HCV NS3/4A PR (S139A) concentration. The samples were flowed over the chip at 50 μl/min using multi cycle kinetics, with 240 sec association and 600 sec dissociation. Regeneration conditions were as described above. Titration curves for the induction of HCV NS3/4A PR (S139A)/PRSIM dimerization by simeprevir were generated. The response for each simeprevir concentration at 225 sec (15 sec before the end of the association) was normalized as a percentage of the response for 300 nM Simeprevir at 225 sec and plotted against the Simeprevir concentration. Each data point represents the mean of 3 independent experiments±s.e.m. The EC50 reported was calculated using nonlinear regression curve fit. The same method was used for the mutant HCV NS3/4A proteases, except simeprevir was diluted 1:2 (0.0457-900 or 0.412-8,100 nM) in 10 mM Hepes pH 7.4, 150 mM NaCl, 0.05% Surfactant P20, 0.82% DMSO at a constant 40 nM HCV NS3/4A PR (S139A) concentration and the response for each simeprevir concentration was normalized against the highest simeprevir concentration.
The affinity of simeprevir was measured using the Octet RED384 (ForteBio) at 25° C. The biotinylated HCV NS3/4A PR (S139A), HCV NS3/4A K136D PR, HCV NS3/4A K136N PR and HCV NS3/4A D168E PR were loaded on High Precision Streptavidin (SAX) biosensors at a concentration of 2 μg/ml in 10 mM Hepes pH 7.4, 150 mM NaCl, 0.05% Surfactant P20, 0.3% DMSO. The simeprevir was diluted 1:1 (46.88-3,000 nM) in the same buffer and the loaded biosensors were dipped into the simeprevir samples for 180 sec to measure the association. For the dissociation the biosensors were dipped into the buffer for 600 sec. The traces were analysed using ForteBio Data Analysis software and fit globally using a 1:1 binding model.
The ability of the PRSIM binding molecules to promote dimerization of two proteins to which they are fused was assessed with the NanoBiT system (Promega) that measures the reconstitution of a split Nanoluciferase (NanoLuc) and the resultant luminescence upon supply of a live cell imaging Nano-Glo NanoLuc substrate (
To establish the optimal orientation of the HCV NS3/4A PR (S139A) and PRSIM components, constructs whereby HCV NS3/4A PR (S139A) was fused at either the N- or C-terminus to SmBiT (SEQ ID NOs: 18 and 19, respectively) and a parallel set of constructs for each PRSIM binding module fused to either the N- or C-terminus of LgBiT (SEQ ID NOS: 20-30 and 31-41, respectively). The NanoBiT kit (Promega) supplies a set of vectors enabling these constructs to be generated. DNA strings encoding HCV NS3/4A PR (S139A) and the PRSIM molecules were purchased from GeneArt and amplified via PCR with primers with extensions containing restriction sites compatible with the NanoBiT vectors and were cloned via Gibson assembly. All constructs were verified via Sanger sequencing of the entire coding sequence.
All NanoBiT screens were performed in adherent HEK293 cells cultured in 96-well plates. Cells enzymatically dissociated from a tissue culture flask were counted and plated at 2×104 cells/well in a white, opaque-bottomed 96-well plate (Costar 3917). The plates were incubated overnight at 37° C. with 5% CO2 to allow the cells to adhere. On day 2, plasmids were co-transfected with Lipofectamine LTX (ThermoFisher) at a final concentration of 100 ng/well (50 ng/plasmid, one encoding a SmBiT fusion, the other a LgBiT fusion). On day 3, wells were treated with 100 nM of the appropriate small molecule inducer (rapamycin (FRB:FKBP12) or simeprevir (HCV NS3/4A PR:PRSIM)) or vehicle control, and luminescence was quantified with an Envision plate reader immediately following addition of Nano-Glo Live Cell Substrate (Promega).
The iDimerize regulated transcription system (Takara) was used to test the ability of PRSIM-based CIDs to regulate gene expression. It is based on the reconstitution of a split transcription factor, where the DNA binding domain (DBD) and activation domain (AD) are separated such that transcription does not occur. The DBD and AD are separately fused to the two protein components of a CID such that, only in the presence of the small molecule inducer, the AD is brought into close proximity to the DBD, recruiting the transcription machinery to a promoter harbouring the DBD recognition sites. The iDimerize regulated transcription system (Takara) provides two vectors, pHet-Act1-2 and pZFHD1-Luciferase. The pHet-Act1-2 vector encodes two fusion proteins that represent a positive control: one is a fusion between FRB (T82L mutant; DmrC) and an activation domain (AD) from human p65 (SEQ ID NO: 42); the other is a fusion protein comprised of a DNA binding domain (ZFHD1) (SEQ ID NO: 43) fused to three tandem copies of FKBP12 (DmrA). These sequences are preceded by a CMV promoter and separated by an internal ribosome entry site (IRES). The ZFHD1 vector encodes luciferase preceded by an inducible promoter consisting of 12 copies of the recognition sequence of the ZFHD1 DBD upstream of a minimal IL-2 promoter. Binding of the DBD to its recognition sequence and recruitment of the transcriptional machinery by the AD initiates transcription of the luciferase reporter gene. The DNA sequence encoding HCV NS3/4A PR (S139A) was purchased as a DNA string from GeneArt and cloned into the pHet-Act1-2 vector as either an N-terminal fusion partner to the activation domain (SEQ ID NO: 44) (replacing FRB) or as a C-terminal fusion partner to the DNA binding domain (SEQ ID NO: 45) (replacing FKBP12) with flexible linkers (TGGGGSGGGGS (SEQ ID NO: 185) and SA, respectively) between the fusion partners. Subsequently, sequences encoding one copy of a panel of 12 PRSIM molecules (Table 2) were purchased as DNA strings from GeneArt and were cloned using Gibson assembly into the HCV NS3/4A PR (S139A)-containing pHetAct1-2 constructs described above, as a fusion partner to either the DBD (SEQ ID NOS: 46-56) or AD (SEQ ID NOS: 57-67), respectively. An equivalent construct was generated to replace the three copies of FKBP12 in pHet-Act1-2 with a single copy of FKBP12. The sequence of the constructs encoding both activation domain and DNA-binding domain fusion proteins was confirmed by Sanger sequencing of the entire coding region.
The DNA sequence encoding NanoLuc-PEST (Promega) (SEQ ID NO: 68) was purchased as a DNA string from GeneArt and cloned into the pZFHD1-2 vector (Takara) downstream of the ZFHD1 inducible promoter using Gibson assembly cloning. The nucleotide sequence of the final construct was confirmed by sequencing.
The DNA sequence encoding MED18852 (SEQ ID NO: 237 and SEQ ID NO: 238, separated by an internal ribosome entry site (IRES) sequence) was purchased as a as a DNA string from GeneArt and cloned into the pZFHD1-2 vector (Takara) downstream of the ZFHD1 inducible promoter using Gibson assembly cloning. The nucleotide sequence of the final construct was confirmed by sequencing.
Sequences encoding the three HCV NS3/4A PR (S139A) mutants (Table 6) were purchased as DNA strings from GeneArt and were cloned using Gibson assembly into the pHetAct1-2 HCV NS3/4A PR (S139A)-PRSIM_23 (3 tandem copy) construct described above as a fusion partner to the AD (SEQ ID NOs: 211-216).
All transcriptional regulation assays were performed in adherent HEK293 cells cultured in 384-well plates. Cells enzymatically dissociated from a tissue culture flask were counted and plated at 7.5×103 cells/well in a 384-well plate. The plates were incubated overnight at 37° C. with 5% CO2 to allow the cells to adhere. On day 2, the cells were co-transfected with a pHet-Act1-2 plasmid (containing the FRB:FKBP12 control fusion proteins (Clontech) or the HCV NS3/4A PR (S139A):PRSIM fusion proteins) and a pZFHD1 plasmid (encoding either luciferase (Clontech) or NanoLuc-PEST (as described above)) using Lipofectamine LTX (ThermoFisher). On day 3, wells were treated with different concentrations of either A/C heterodimeriser (for the FRB:FKBP12 control), simeprevir or with vehicle control, and 24 hours later luminescence was quantified with an Envision plate reader immediately following addition of SteadyGlo luciferase substrate (Promega) or Nano-Glo Vivazine luciferase substrate (Promega). Alternatively, reverse transfections were carried out on Day 1, addition of dimeriser on Day 2 and luminescence quantified 24 hours later on Day 3.
Luminescence readings were converted into fold-change by dividing the signal in the presence of simeprevir by that in the absence of simeprevir.
For the quantification of antibody expression (MED18852) utilising the transcriptional regulation assay, the cells were co-transfected with a pHet-Act1-2 plasmid (containing the HCV NS3/4A PR (S139A):PRSIM_23) and a pZFHD1 plasmid (encoding MED18852); 24 hours later wells were treated with different concentrations of simeprevir. Antibody concentration was determined in the supernatants 48 hours post the addition of simeprevir using MSD kit (Singleplex Human/NHP IgG Isotyping Kit (Mesoscale).
A chimeric antigen receptor (CAR), a synthetic, genetically engineered version of a T-cell receptor, can direct the activation of immune cells in response to user-defined targets via target-specific recognition domains, e.g. a single chain variable antibody fragment (scFv). These multi-domain, synthetic proteins are typically constructed by fusion of the target recognition domain to a transmembrane domain, T-cell receptor co-stimulatory domain and a C-terminal CD3 zeta cytoplasmic activation domain. A split-CAR can be generated by expressing the target recognition/transmembrane/co-stimulatory domain and the CD3 zeta activation domain as two separate proteins. Addition of the appropriate heterodimerising switch components, to the respective proteins, will then allow activation of the CAR in the presence of the target protein via chemical-induced heterodimerisation.
Two split CAR-encoding constructs were generated utilising either the FRB:FKBP12 or HCV NS3/4A PR (S139A):PRSIM_23 heterodimerising components. For both split CARs a tricistronic construct was generated. The three fusion proteins encoded were 1) From N-terminus to C-terminus, a signal peptide sequence, an scFv fragment that recognises the target antigen, a hinge domain from human IgG4, a transmembrane domain from CD28, the intracellular domain of co-stimulatory protein 4-1BB activation domain and either FKBP12 or HCV NS3/4A PR (S139A), 2) From N-terminus to C-terminus, a signal peptide sequence, a hinge domain from human IgG4, a transmembrane domain from CD28, the intracellular domain of co-stimulatory protein 4-1 BB activation domain, either FRB or PRSIM_23, followed by the CD3 zeta domain and 3) green fluorescent protein (GFP) which was used as a marker for transfected cells (
Lentiviral particles encoding each split CAR were generated using the pPACKH1 HIV lentivector packaging kit (Systems Bioscience), according to the manufacturer's protocol. Jurkat cells were transduced with lentiviral particles in the presence of 8 μg/ml polybrene for 24 hours, after which time the cells were changed into fresh growth media (RPMI-1640+10% foetal bovine serum) and allowed to grow for 5 days. Split CAR-transduced Jurkat cell pools were FACS-sorted based on GFP fluorescence to achieve equivalent expression levels for both the FKBP12:FRB and HCV NS3/4A PR (S139A):PRSIM_23 CARs before functional testing. Activation of the split-CAR-expressing Jurkat cells can be measured by interleukin-2 (IL-2) production after stimulation of the CAR (Smith-Garvin, Koretzky, and Jordan 2009). A co-culture assay was employed to facilitate CAR activation whereby CAR-expressing Jurkat cells were mixed with either HepG2 (antigen positive) or A375 (antigen negative) cells at a ratio of 1:1. Different concentrations of simeprevir or the vehicle control (DMSO) was added to the cell mixtures and incubated for 24 hours. Following incubation, the cells are pelleted by centrifugation and the supernatant was tested for IL-2 expression via a commercially-available IL-2 ELISA (R&D Systems) as per the manufacturer's protocol.
AAV expression vectors were generated by subcloning specific promoter and transgene elements into an intermediate vector derived from pAAV-CMV (Takara) in which the CMV promoter downstream of the 5′ITR was removed and a WPRE element and SV40 polyA sequence were inserted upstream of the 3′ ITR.
To generate AAV encoding an inducible luciferase transgene, the ZHFD1-luciferase cassette was amplified by PCR from pZFHD1-Luciferase provided in the iDimerize regulated transcription system (Takara) and subcloned into the intermediate AAV vector. To generate AAV encoding constitutively expressed huIL-2, a gene encoding human IL-2 (SEQ ID NO: 210) was subcloned downstream of a CAG promoter in the intermediate AAV vector (
Recombinant AAV (rAAV) was produced by triple-transfection of 40 T-175 cm2 flasks containing HEK293 T-17 cells at 80% confluency using a standard helper-free approach. Briefly, each flask was transfected with 15 μg of a helper plasmid (a plasmid containing adenoviral E2A and E4), 7.5 μg of the AAV ITR-bearing, and transgene-encoding plasmid and 7.5 μg of the AAV capsid plasmid (containing the AAV8 capsid and the corresponding Rep genes) using 90 μg of 40 kD linear polyethylenimine (PEI). Five days after transfection, media was collected from all the flasks, treated with 2000 units of Benzonase nuclease and incubated at 37° C. for 1 hr. The media was then filtered through a 0.22 μm filter and concentrated to a volume of 80 ml using tangential flow filtration (TFF). This volume was further concentrated and buffer exchanged with PBS using an Amicon-15 ml-100 kDa filter before loading onto a stepwise iodixanol gradient (15%/25%/40%/60%) and spinning at 69000 rpm on an ultracentrifuge in a Ti70 rotor for 1.5 hrs at 18° C. Fractions were taken from the ultraclear centrifuge tubes by piercing the tube with a 19 gauge syringe in the 60% layer below the clear band representing the virus and the purity of each fraction was assessed by SDS-PAGE of each fraction and subsequent Sypro Ruby analysis. Pure fractions were combined, buffer exchanged with PBS in an Amicon-15 ml-100 kDa filter and concentrated to a final volume of 150 μl and stored at −80C in aliquots to avoid any repeated freeze/thaws. The viruses were titered using digital-droplet PCR and a TaqMan probe specific to the ITRs. Typical titres ranged from 1-3×1013 genome copies (GC)/ml.
All rAAV transduction assays were performed in adherent HEK293 cells cultured in 96-well plates. Cells enzymatically dissociated from a tissue culture flask were counted and plated at 2.5×104 cells/well in a 96-well plate. The plates were incubated overnight at 37° C. with 5% CO2 to allow the cells to adhere. On Day 2, the cells were transduced with 2.5-5×109GC/ml (corresponding to a multiplicity of infection (MOI) of 1-2×105) of the relevant rAAV. After incubation for 48-72 hours, the cells were treated with different concentrations of simeprevir or with vehicle control and incubated for a further 24 hours. For luminescence assays, SteadyGlo luciferase substrate (Promega) was added and luminescence was quantified with an Envision plate reader. Luminescence readings were converted into fold-change by dividing the signal in the presence of simeprevir by that in the absence of simeprevir. For IL-2 assays, supernatant was harvested and IL-2 quantified using a V-PLEX Human IL-2 Kit (Meso Scale Discovery) following the manufacturer's protocol.
To demonstrate endogenous gene regulation by the PRSIM-based CID, an activating CRISPR (CRISPRa) approach was employed. CRISPRa relies on the use of a dead Cas9 enzyme (dCas9) with no endonuclease activity to bind to a target site within the promoter region of an endogenous gene via a single guide RNA. Upon recruitment of a transcriptional activator, transcription of the endogenous gene is initiated.
For this approach, the dCas9 and the VPR activation domain (AD) are separated such that transcription does not occur. The dCas9 and AD are separately fused to the two protein components of the CID such that, only in the presence of the small molecule inducer, the AD is brought into close proximity to dCas9, allowing recruitment of the transcription machinery to the promoter region of an endogenous gene via a single guide RNA (sgRNA). In this example, an activation plasmid was generated consisting of two functional units; an AD fused to the HCV NS3/4A PR (S139A) (SEQ ID 226) and a dCas9 fused to three tandem copies of PRSIM-23 (SEQ ID 228). The sequences are preceded by a CMV promoter and separated by an internal ribosome entry site (IRES). A gRNA plasmid was generated by golden gate assembly, utilising BsaI. The gRNA plasmid encodes the human U6 promoter, an interleukin-2 (IL-2) target sequence (GTTACATTAGCCCACACTT; SEQ ID NO: 229) and a scaffold RNA sequence to allow Cas9 binding (
Transcriptional regulation assays were performed in adherent HEK293 cells cultured in 96-well plates. Cells enzymatically dissociated from a tissue culture flask were counted and plated at 2.5×104 cells/well. The plates were incubated overnight at 37° C. with 5% CO2 to allow the cells to adhere. On day 2, the cells were co-transfected with the activation and gRNA plasmids using Lipofectamine 3000 (ThermoFisher), using a gRNA:activation plasmid DNA ratio of 2:1. On day 3, wells were incubated with 300 nM simeprevir or with vehicle control. 72 hours post-treatment (day 6), the cell supernatant was harvested and IL-2 quantified using a V-PLEX Human IL-2 Kit (Meso Scale Discovery), as per the manufacturer's protocol.
The co-crystal structure of HCV in complex with Simeprevir was first prepared using Protein Preparation Wizard (Sastry et al., 2013) to add hydrogen atoms, fill in missing side chains, assign the proper ionization state for both the amino acids and Simeprevir at physiological pH. The FEP+ (module) in the Schrödinger 2019-2 (Moraca et al., 2019) release with the OPLS3e force field was then used to predict the relative binding free energies upon mutations of residues H57, K136, S139 and R155 in HCV NS3/4A PR. Mutations that are predicted to reduce the affinity of HCV protease to Simeprevir are listed in Table 4.
Monoclonal cell lines were generated using CRISPR-mediated knockin system for transgene integration at AAVS1 locus (ORIGENE) according to manufacturer's instructions (
AAVS1 safe harbor CRISPR-mediated knockin system employs two plasmids: the CRISPR all-in-one vector, pCAS-Guide-AAVS1 vector and the donor vector (pAAVS1-DNR-Puromycin) with AAVS1 homologous arms (SEQ ID NO: 234, 235). The AAVS1 targeting sequence (SEQ ID NO: 236) was previously cloned into pCAS-Guide plasmid. The donor vector was engineered by addition of SbfI and HpaI restriction enzyme sites via Gibson assembly to enable further sub-cloning of HCV NS3/4A PR (S139A) and mutants:PRSIM_23 heterodimerising components. Subsequently, pHet-Act1-2-HCV NS3/4A PR (S139A)-PRISM23 (3 tandem copies) plasmid was digested with SbfI and HpaI restriction enzymes (New England Biolabs) to obtain HCV NS3/4A PR (S139)-PRISM23 DNA which was further sub-cloned into the donor vector by Gibson Assembly. HCV NS3/4A PR variants including HCV NS3/4 PR (K136D) (SEQ ID NO: 211), HCV NS3/4 PR (D168E) (SEQ ID NO: 213) and HCV NS3/4 PR (K136N) (SEQ ID NO: 215) were sub-cloned from pHet-Act1-2-HCV NS3/4 PR (K136D/D168E or K136N)-PRISM23 into pAAVS1-HCV NS3/4A PR (S139A)-PRISM23-Puromycin plasmid by Gibson assembly using SbfI and AfeI restriction enzyme sites. The nucleotide sequences were confirmed by Sanger sequencing.
Stable cells expressing GFP-PEST under control of inducible promoter alone were co-transfected with pAAVS1-HCV NS3/4A PR (S139A; K136D; D168E; K136N)-PRISM23-Puromycin donor vector and pCAS-Guide-AAVS1 to enable targeted integration into AAVS1 locus. Transfected cells were selected by addition of 1 ug/ml puromycin into growth media (DMEM+10% foetal bovine serum+1% Non-essential amino acids+800 μg/ml Geneticin) 48 hr post-transfection. Following 14 day selection period, polyclonal cell lines were induced with 500 nM simeprevir and FACS sorted based on GFP fluorescence intensity to isolate single cell clones. Final monoclonal cell lines (
Flow Cytometry to Determine the Kinetics of GFP-PEST Expression from the Simeprevir-Inducible Switch
Monoclonal cell lines expressing GFP-PEST under the control of the split transcription factor system were enzymatically dislodged from tissue culture flasks and plated into 96 well collagen-coated plates. The following day, cells were treated with 100 nM Simeprevir. 24 h post-treatment, cells were washed twice in growth medium without Simeprevir, and cells were further maintained in medium without Simeprevir. Cellular GFP-fluorescence at various timepoints after the removal of Simeprevir was determined using flow cytometry on a Fortessa Flow cytometer (BD Biosciences). For analysis, the GFP-fluorescence (in relative fluorescence units=RFU) of untreated cells was subtracted from all experimental values. RFU values were further normalised to timepoint ‘0 h’, taken at the time of Simeprevir removal.
The single chain HCV protease construct—an 11-residue peptide derived from the viral NS4A protein fused to the N-terminus of NS3 protease with S139A mutation—was redesigned with an N-terminal hexahistidine (6His) followed by a tobacco etch virus (TEV) protease cleavage site (to enable affinity purification and removal of the tags, respectively) (SEQ ID NO: 218). A second construct was designed to express the PRSIM_57 scFv with an N-terminal pelB leader to direct periplasmic secretion and C-terminal TEV site and 6His tag (SEQ ID NO: 221). Both sequences were purchased as linear DNA strings (GeneArt) and were cloned into the pET-28a vector (for bacterial expression) using Gibson assembly. The sequences of the final constructs were verified via Sanger sequencing of the entire coding sequences.
For expression, the pET-28a plasmids were transformed into BL21(DE3) E. coli cells and selected on plates containing kanamycin (50 μg/ml). For each expression, a single colony was used to inoculate a 5 ml 2×TY+50 μg/ml kanamycin culture that was grown at 37° C. overnight. This culture was used to inoculate 500 ml TB Autoinduction medium (Formedium, supplemented with 10 ml/L glycerol and 100 μg/ml kanamycin) at 1:500 dilution. The culture was grown at 37° C. to an OD600 of 1.3-1.5 and then transferred to 25° C. (HCV NS3/4A PR (S139A)) or 30° C. (PRSIM_57) for 20 hours for expression to be induced. Cells were harvested by centrifugation and the pellets were stored at −80° C.
For protein purification of HCV NS3/4A PR (S139A), each bacterial pellet from 500 ml culture was thawed and re-suspended in 50 ml lysis buffer (50 mM HEPES, 500 mM NaCl, 1 mM TCEP, pH 8.0). The cells were lysed by passage through a cell disruptor at 30,000 kpsi and the lysate was clarified by centrifugation at 50,000 g for 30 min at 4° C. The clarified supernatant was loaded on a 5 ml HisTrap HP column (GE Healthcare) at 5 ml/min flow rate. The column was washed sequentially with wash buffers (50 mM HEPES, 500 mM NaCl, 1 mM TCEP, 20 mM Imidazole, pH 8.0 and 50 mM HEPES, 500 mM NaCl, 1 mM TCEP, 40 mM Imidazole, pH 8.0) and eluted with an imidazole gradient over 5 column volumes from 40-400 mM imidazole. Fractions were analysed by SDS-PAGE and those that were enriched for the correct protein were pooled and buffer exchanged with a HiPrep 26/10 Desalting column (GE Healthcare) into 50 mM HEPES, 200 mM NaCl, 0.3 mM TCEP, 10 μM ZnCl2, pH 7.5 (storage buffer). Desalted protein fractions were treated with His-tagged TEV protease at 1:100 w/w overnight at 4° C. TEV protease was removed by passing the sample through a HisTrap HP column and the resulting flow-through material was polished by loading on a Superdex 75 26/600 column equilibrated in storage buffer.
The PRSIM-57 His-tagged scFv sample was released from the periplasm via osmotic shock of the cell pellets: cells were first resuspended in 300 ml 50 mM Tris, 1 mM EDTA, 20% sucrose, pH 8.0 and then pelleted and resuspended in water to exert osmotic shock and release periplasm contents. The sample was purified by loading on a HisTrap excel column and washing and eluting with the same buffers as used for the HCV NS3/4A PR (S139A) construct. The eluted protein was buffer exchanged by loading on a HiPrep 26/10 desalting column in 50 mM HEPES, 200 mM NaCl, pH 7.5 and treated with TEV protease at 1:50 w/w ratio overnight at 4° C. TEV-digested material was further purified with IMAC and size exclusion steps as for the protease and stored in 50 mM HEPES, 200 mM NaCl, pH 7.5.
To form the ternary complex of HCV NS3/4A PR (S139A), PRSIM_57 and simeprevir, HCV NS3/4A PR (S139A) at a concentration of 50 μM was mixed with a 1.1-fold excess of PRSIM_57 and to this was added simeprevir to a final concentration of 100 μM with DMSO at 3% in the final solution. The sample was incubated at room temperature for 60 min to allow equilibration and then loaded on Superdex 75 16/600 column at 0.75 ml/min in 20 mM HEPES, 200 mM NaCl, pH 7.5. Fractions containing the complex were pooled, concentrated to 12 mg/ml, split into aliquots and snap frozen in liquid nitrogen prior to storage at −70° C. An aliquot of the complex was thawed and run on an HP-SEC column to verify complex integrity and monodispersity prior to crystallisation.
The ternary complex was crystallised using sitting drop vapour diffusion method. A number of proprietary crystallisation screens were set up at 277K and 293K. Hits from these screens were optimised using sitting drop and hanging drop vapour diffusion experiments as appropriate. Final crystals were obtained at 293K from reservoir solutions comprised of 20-25% (w/v) PEG 8000, 100-300 mM magnesium chloride and HEPES buffer, pH 7.0-8.0. Crystals were exposed to cryoprotectant solution of reservoir supplemented with 20% (v/v) ethylene glycol and then frozen directly in liquid nitrogen.
Data collection was carried out at Diamond Light Source, beamline i04, at cryogenic temperatures. The CCP4 and autoBUSTER software packages were used to solve and refine the structures, and the program Coot was used for manual building of the models. The structure was solved by molecular replacement using models of HCV NS3/4A (S139A) from the Protein Data Bank.
The change in stability of the HCV protein upon mutation was calculated using Schroedinger Residue Scanning tool (Schrödinger Release 2020-2: SiteMap, Schrödinger, LLC, New York, N.Y., 2020).
A Prime MM/GBSA energy function with an implicit solvent term was used for the calculations (Li et al., 2011). A cutoff of 6 Å was used for the protein refinement around the mutation. Negative values of the change in stability are linked to an increased mutant stability.
The sequence encoding a kill switch fusion protein of PRSIM23, HCV NS3/4A PR and ΔCARDCaspase9 with short GGGSG between the three fragments (SEQ ID NO: 223) was purchased as a cloned gene in vector pcDNA3.1 from Geneart (Life Technologies). The fusion protein was sub-cloned into EcoRI/NotI digested lentiviral vector pCDH-EF1α-MCS-(PGK-GFP-T2A-Puro) (Systems Bioscience) using Gibson assembly cloning. To generate the Caspase 9 S196A mutation, a DNA fragment altering the equivalent Ser371 in the kill switch construct to Ala was synthesized by Geneart was cloned into ClaI/NotI cleaved kill switch vector (SEQ ID NO: 230). Gene sequences were confirmed by DNA sequencing.
Lentiviral particles encoding the kill switch fusion protein (SEQ ID NO: 223) or kill switch S196A mutant fusion protein (SEQ ID NO: 230) were generated using pPACKH1 HIV lentiviral packaging kit (Systems Bioscience), according to manufacturer's instructions. HEK293 cells were transduced for 24 h in the presence of 8 μg/ml polybrene after which cells were changed into fresh growth medium (DMEM+10% foetal bovine serum+1% Non-essential amino acids). 24 h later transduced cells were selected by addition of 2 μg/ml puromycin for 5 days. Before functional testing, transduced cell pools were FACS sorted based on GFP fluorescence to isolate high expressing cell line pools and single cell clones.
HCT116 and HT29 transduced cells were generated following the same protocol with exception of using McCoy's 5A medium+10% foetal bovine serum as growth medium, supplemented with 2 μg/ml puromycin for selection of transduced cells.
The hESC line Sa121 (TakaraBio Europe) was also transduced with the lentiviral particles encoding the PRSIM-based kill switch fusion protein described above (SEQ ID 223). Cells (passage 19) were plated at 3.5×105 cells/cm2 in the DEF-CS culture system, and the cells were transduced 30 h later. 24 h after transduction, puromycin selection was initiated and the antibiotic selection was maintained until a stable cell pool was achieved.
A stable induced pluripotency stem cell (iPSC) line (a single clone (B-3/1F1) derived from the fibroblast cells of a healthy human donor from Research Specimen Collection Program of Astrazeneca) stably expressing the simeprevir-inducible kill switch was generated using CRISPR/Cas9 technology, using AAV-encoded DNA as template for targeted integration into the β2 microglobulin (B2M) locus.
The donor construct (
The iPSC cells seeded in Vitronectin-coated 6-well plates at 50-70% confluency (approximately 1.2×106 cells) were used for transfection/transduction. Cells were maintained in 2 mL fresh StemFlex medium containing 1× RevitaCell (Life Technologies). For each well, 200 μL Opti-MEM (Life Technologies) medium containing 220 nM of CRISPR-ribonucleoprotein and 12 μL of RNAiMAX (Life Technologies) was applied. In the meantime, the AAV vectors were applied with the multiplicity of infection (MOI) of 50,000. After 24-hour incubation, the RNP/AAV-containing medium was replace by fresh StemFlex medium.
At 48-hour post transfection, the medium was replaced by fresh StemFlex medium containing 5 μg/mL Blasticidin S HCl (Life Technologies). The medium was replaced with fresh Blasticidin-containing StemFlex medium each day for another 3 to 4 days. Then, the cells were maintained in regular StemFlex medium again.
To identify cells that were B2M-negative, and thus encoding the PRSIM-based kill switch, FACS was performed. Cells were detached from the plates using TrypLE Express (Life Technologies) and resuspended in FACS buffer (HBBS containing 1% PBS and 1× RevitaCell) at a density of 1×107 cells/mL containing 5% of APC-labeled anti-human B2M antibody (BioLegend, Inc.) solution. After 10-minute incubation, cells were washed using 10 times volume of FACS buffer for two times and resuspended in FACS buffer at a density of 2×107 cells/mL. B2M-negative cells were collected by FACS (FACSAria; BD Biosciences) and cultured for further experiment.
Single cell clones were then isolated using single-cell printing. Cells were detached from the plates using TrypLE Express (Life Technologies) and resuspended in SCP buffer (HBBS containing 1× RevitaCell) at a density of 1.6×106 cells/ml. Cell suspension was loaded to a cartridge of the Cytena CloneSelect Single-Cell Printer (Cytena). Cells were seeded at 1 cell per well in the Matrigel or Vitronectin-coated 96-well plates containing 200 μL of mL fresh mTeSR (STEMCELL Technologies) or StemFlex medium containing 1× RevitaCell (Life Technologies). The media was replaced by fresh StemFlex media on the next day of SCP.
Five single cell clones were recovered, expanded from 96-well plates to Vitronectin-coated 24-well plates, and were further expanded and maintained in Vitronectin-coated 6-well plates. For each single-cell clone, approximately 5×105 cells were collected. Genome DNA was isolated using the DNeasy Blood & Tissue Kit (Qiagen). The targeted region of human B2M gene were amplified using the primers below and using the SuperFi DNA polymerase (Life Technologies). The PCR products were loaded in 1.2% agarose gel for electrophoresis. The gel were visualized to identify the gene knock-in status of the single-cell clones by the size of the amplicons (
HEK293, HCT116 or HT29 cells stably expressing the PRSIM-based kill switch fusion protein (SEQ ID NO: 223) or HEK293 cells stably expressing the PRSIM kill switch S196A mutant fusion protein (SEQ ID NO: 230) were plated onto collagen-coated 96-well plates and 24 h later treated with 100 nM simeprevir. Phase contrast images were acquired at various timepoints using 10× or 20× objectives on an Incucyte Zoom (EssenBioscience).
Functional Caspase 9 activates Caspase 3, and this proteolytic activity can be determined using cleavage of non-fluorescent substrate DEVD-AMC into cleavage products DEVD and fluorescent AMC, such that AMC fluorescence signal at 430 nm is proportional to Caspase 3 activity. For the Caspase 3 assay, cells were plated in duplicate onto 6 well tissue-culture treated plates. 24 h later, one of the duplicate wells was treated with 10 nM simeprevir for 3 h. Cells lysates were analysed in triplicate using a Caspase 3 assay from BD Biosciences according to manufacturer's instruction with the modification that total protein input was normalised to 50 μg by BCA assay (LifeTechnologies). Fluorescence was determined on an Envision plate reader (PerkinElmer), Ex: 380 nm, Em: 430 nm. For quantification, RFU (raw fluorescence value) for wells that contained only the assay substrate were subtracted from all RFU derived from assay samples. Results were normalised to non-transduced, simeprevir-treated cells. Analysis was performed in Prism (GraphPad) using a One-Way-Anova followed by multiple comparisons.
To test the induction of the kill switch in Sa121 ES cells, the cells were plated at 3.5×105/cm2 two days before inducing kill switch activity by treating with simeprevir at concentrations from 10 nm to 1 uM. The cells were imaged using the Incucyte S3 (EssenBioscience) at intervals ranging from 10-20 min; kill switch efficiency was quantified by image analysis of confluency.
The cells for each of the single-cell clones described above were plated in the vitronectin-coated 96-well electronic microtiter plates (E-Plate® 96, ACEA Biosciences Inc.) at the density of 40,000 cells per well. The plate was set connected with the xCelligence module and incubated at 37° C. in a humidified incubator with 5% CO2 so that the cell proliferation index can be monitored without interrupting regular cell growth. The cell proliferation index was measured and recorded every 15 minutes for 24 hours. Simeprevir at different concentration was then added and the cell proliferation index measured every 5 minutes for 8 hours and then every 15 minutes for further 40 hours. All experiments were performed in triplicate wells for each clone and each condition. The average cell index were quantified by using the xCELLigence RTCA Software Pro (ACEA Biosciences Inc.)
To generate a de novo chemical inducer of dimerization module, we adopted an approach whereby the small molecule inducer is a clinically approved small molecule and one of the protein components is the target of the small molecule (target protein). The second protein component (binding member) is derived from a library of binding molecules (Tn3 or scFv) and demonstrates exquisite selectivity for the target protein bound to the small molecule over the unbound target protein (
Ideal small molecule criteria:
Ideal target protein criteria:
Small molecule:target protein complex criteria:
An extensive analysis was carried out, and one of the preferred small molecule/target protein pairings identified was simeprevir and its target, the NS3/4A protease from hepatitis C virus (HCV NS3/4A PR). Simeprevir (Olysio®) is a small molecule that is administered orally, is cell-permeable, and has a pharmacokinetics (PK) profile that supports once-daily dosing. It has been used chronically (up to 39 months) to treat HCV infection in combination with ribavirin and pegylated interferon, and is on the WHO essential medicines list, indicative of a well-tolerated and widely administered drug. HCV NS3/4A PR is monomeric, relatively small in size (21 kDa), can be expressed cytoplasmically, and is not found associated with DNA. Furthermore, three-dimensional X-ray crystallography of the complex (PDB code: 3KEE) reveals that simeprevir is bound in the shallow substrate-binding groove of HCV NS3/4A PR with 364 Å of exposed surface area (
The HCV NS3/4A PR is an enzyme that cleaves at four junctions of the HCV polyprotein precursor, and it is known to cleave a limited number of endogenous human targets (Li, Sun, et al. 2005; Li, Foy, et al. 2005). To limit this activity within human cells, we reasoned that identification of a mutant form of the HCV NS3/4A PR that is enzymatically inactive but retains binding to simeprevir would be necessary. An active site mutant of HCV NS3/4A PR (S139A) had previously been shown to demonstrate significantly less activity than its wild-type counterpart (Sabariegos et al. 2009). To confirm this, and to investigate whether the mutant HCV NS3/4A PR would retain binding to simeprevir, recombinant proteins were expressed in E. coli and purified to homogeneity. HCV NS3/4A PR with an N-terminal hexahistidine and AviTag, both WT (SEQ ID NO: 3) and S139A mutant (SEQ ID NO: 4) were expressed separately in 1 litre culture of BL21(DE3) induced via autoinduction. The cultures were harvested and proteins purified using a combination of immobilised metal affinity chromatography and size exclusion chromatography. Final pooled samples were assessed via SDS-PAGE indicating a >99% level of purity (
These recombinant HCV NS3/4A PR WT and S139A proteins were tested for enzymatic activity in a fluorogenic peptide cleavage assay, where the significantly reduced activity of the HCV NS3/4A PR S139A mutant was confirmed. No enzymatic activity could be detected at most concentrations tested, with minimal activity observed only at high nM to μM concentrations (
Isothermal calorimetry was performed to assess the binding affinity of simeprevir to the WT and S139A HCV NS3/4A PR proteins. Both proteins gave very similar results, with the same stoichiometry (˜0.6 Sim/NS3 binding sites) and ΔH values (˜22 kcal/mol) obtained (
Based on these data we chose to proceed with the selection of HCV NS3/4A PR:simeprevir complex-specific binding (PRSIM) molecules based on the S139A mutant protein.
Four rounds of phage display selections were performed on biotinylated HCV NS3/4A PR (S139A) in the presence of simeprevir. From the round 3 and round 4 selection outputs, phage ELISAs were performed on biotinylated HCV NS3/4A PR (S139A) in both the presence and absence of simeprevir, and binding determined by fluorescent signal measured (
PRSIM_23
Tn3
Library 1
4
23.8
1573
PRSIM_32
Tn3
Library 1
4
22.4
1955
PRSIM_33
Tn3
Library 1
3
29.9
1704
PRSIM_36
Tn3
Library 1
3
27.9
1440
PRSIM_38
Tn3
Library 1
3
23.3
1061
PRSIM_39
Tn3
Library 1
4
24.9
1015
PRSIM_47
Tn3
Library 1
3
25.3
1367
PRSIM_48
Tn3
Library 1
3
26.6
1780
PRSIM_04
scFv
Library 2
3
30.3
1055
PRSIM_56
scFv
Library 2
4
16.3
829
PRSIM_57
scFv
Library 2
4
15.7
1076
PRSIM_58
scFv
Library 2
4
25.5
610
PRSIM_63
scFv
Library 2
4
23.5
760
PRSIM_64
scFv
Library 3
4
26.1
1006
PRSIM_66
scFv
Library 3
3
19.6
559
PRSIM_67
scFv
Library 3
3
12.7
1708
PRSIM_72
scFv
Library 3
3
12.5
1107
PRSIM_73
scFv
Library 3
3
25.4
418
PRSIM_75
scFv
Library 3
3
6.9
1030
PRSIM_78
scFv
Library 3
3
20.2
852
The PRSIM binding proteins identified from phage display selections as complex-specific were expressed and purified at larger scale to provide sufficient material for further analysis. A homogeneous time-resolved fluorescence (HTRF) binding screen (
To further characterise the PRSIM binding molecules, 5 scFv molecules (PRSIM_4, PRSIM_57, PRSIM_67, PRSIM_72 and PRSIM_75) and 5 Tn3 molecules (PRSIM_23, PRSIM_32, PRSIM_33, PRSIM_36, PRSIM_47) were selected and the kinetics of HCV NS3/4A PR (S139A) protease binding in the presence or absence of simeprevir were determined using Biacore 8K (Table 2). All the PRSIM binding molecules tested showed selectivity for simeprevir-bound HCV NS3/4A PR (S139A) and only three showed minor non-specific binding to HCV NS3/4A PR (S139A) alone. PRSIM_57 (
Having isolated PRSIM binding molecules that specifically bound simeprevir:HCV NS3/4A PR (S139A) complexes, we reasoned that the system could be used to regulate the reconstitution of a split protein. By providing temporal and spatial regulation of protein dimerization within a cell, the CID could be applied within a post-translational context to control a desired protein-protein interaction or activity. Numerous examples of split proteins that gain activity upon reconstitution exist, one of which is the split Nanoluciferase as provided in the NanoBiT system (Promega) (
Having demonstrated that PRSIM-based CIDs were capable of reconstituting the activity of a split protein via fusion of the HCV NS3/4A PR (S139A) and PRSIM molecules to the separate components of the split NanoLuc enzyme, we reasoned that the same CIDs could regulate expression of transgenes via fusion to the two domains of a split transcription factor. To demonstrate this, we utilised the iDimerize regulated transcription system (Takara) in which two separate vectors are provided; one vector (pHet-Act1-2) encodes FRB fused to the activation domain (AD) p65, and the DNA binding domain (DBD) ZFHD1 fused to 3 copies of FKBP12, separated by an IRES sequence and preceded by the constitutive promoter, CMV; the other vector (pZFHD1_Luciferase) encodes luciferase under the control of an inducible promoter that contains 12 copies of the ZFHD1 recognition sequence upstream of a minimal IL-2 promoter. When both plasmids are transfected into cells, the FRB-AD and DBD-FKBP12 proteins are expressed; the DBD recognises its target site on the inducible promoter, but as there is no AD in close proximity to the promoter, transcription initiation does not occur. Only when the rapalog inducer “A/C heterodimeriser” is added, is the AD recruited to the DBD bound to the promoter upstream of the luciferase gene and expression commences.
We exchanged the FRB and FKBP12 coding sequences for those encoding one copy of the HCV NS3/4A PR (S139A) and one of the 11 PRSIM molecules described below, where the PRSIM molecules were either fused to the N-terminus of the activation domain or the C-terminus of the DNA binding domain (
When the ability of the HCV NS3/4A PR (S139A)-AD and DBD-PRSIM_23 or DBD-PRSIM_57-based constructs to regulate the expression of luciferase in the presence of simeprevir was directly compared to the FRB:FKBP12:rapalog positive control, the PRSIM-based CIDs (100-fold increase) outperformed the FRB:FKBP12-based CID (30-fold increase) (
To assess the impact of copy number of the target protein fused to the DNA binding domain, we generated pHet-Act1-2-based constructs encoding FRB-AD or HCV NS3/4A PR (S139A)-AD and DBD-FKBP12 or DBD-PRSIM_23, whereby the protein fused to the DBD was included either as a single copy or as three tandem copies separated by short peptide linkers (
Furthermore, when the impact of one, two or three tandem copies of PRSIM_23 fused to the DBD was assessed via the same split transcription factor assay, and the induction of firefly luciferase expression was measured, a graded response was observed; one copy of PRSIM_23 resulted in a max fold change of 364.5, whereas two tandem PRSIM_23 molecules resulted in max fold change of 2436 and a further increase to 4862-fold for three tandem PRSIM_23 molecule (
This data suggests that it is possible to improve the regulation of gene expression from the inducible promoter by recruiting more copies of the activation domain, and that this is a common phenomenon, independent of CID used.
Regulation of CAR activity via chemical-induced heterodimerisation was previously shown to be an effective way to modulate CAR function (Wu et al. 2015); (Hill et al. 2018). We hypothesized that the application of the heterodimerising PRSIM components to a CAR would facilitate CAR regulation in a similar manner. The previously described FKBP12:FRB system (Wu et al. 2015) was used as a comparator to regulate CAR function. To test this, we engineered Jurkat T-cells to express PRSIM and FKBP12:FRB-regulated CARs using a lentiviral expression system (
In addition to demonstrating gene regulation of two recombinant intracellular proteins (luciferase (Example 7) and NanoLuc-PEST (Example 8)) using a PRSIM-based CID, the regulation of gene expression of a secreted antibody (MED18852; SEQ ID NO: 205 and SEQ ID NO: 206) was also investigated. pHet-Act1-2-based constructs encoding HCV NS3/4A PR (S139A)-AD and DBD-PRSIM_23 (three tandem copies) and a construct encoding pZFHD1_MED18852) were generated. When cells were transfected with these two constructs, the expression of MED18852 was shown to be dependent on the dose of simeprevir, as measured using the Singleplex Human/NHP IgG Isotyping Kit (Mesoscale) (
Recombinant adeno-associated virus (rAAV) vectors represent a well-studied platform which could be used to deliver the DNA encoding a PRSIM_23/HCV NS3/4A PR (S139A)-based CID to cells to control gene therapy. One such application is the regulation of an exogenous transgene delivered to cells either together with, or in separate AAV particles to the PRSIM_23/HCV NS3/4A PR (S139A)-based split transcription factor components described in Example 7. In the context of the system described here, the packaging capacity of AAV limits the size of the transgenes that can be delivered in the same AAV vector to ˜550 bp, or the size of transgenes that can be delivered in separate AAV particles to ˜3.6 kb.
To demonstrate delivery of the CID-encoding DNA and an inducible transgene “in trans”, two different AAV vectors were generated, one encoding the PRSIM_23/HCV NS3/4A PR (S139A)-based split transcription factor components, with expression driven by the constitutive EF1/HTLV hybrid promoter, and the second encoding the firefly luciferase gene under control of the inducible ZFHD1 promoter (
To demonstrate that the CID and an inducible transgene can be delivered “in cis”, an AAV8 vector encoding both the PRSIM_23/HCV NS3/4A PR (S139A)-based transcription factor components and an inducible IL-2 transgene was generated (
Thus, the ability of the PRSIM-based CID to control gene expression via AAV transduction was demonstrated, using either a single, or dual AAV-based system.
Having demonstrated that PRSIM-based CIDs can regulate the expression of transgenes via fusion to the two domains of a split transcription factor, we reasoned that the PRSIM-based CID could also regulate the expression of endogenous genes. The use of chemical-induced heterodimerisation systems to regulate endogenous gene activity has previously been shown to be an effective way to modulate gene regulation (Foight et al. 2019). We therefore hypothesized that the application of the heterodimerising PRSIM components to an activating CRISPR (CRISPRa) system could facilitate endogenous gene regulation in a similar manner.
To demonstrate this, an inactive form of the Streptococcus pyogenes Cas9 enzyme (dCas9) and an activation domain (AD) consisting of a fusion of three transcriptional activators (VP64, p65 and Rta; VPR) were separately fused to the two protein components of the CID (three copies of PRSIM_23 and HCV NS3/4A PR (S139A), respectively) such that, only in the presence of the small molecule inducer, the AD is brought into close proximity to the dCas9. Co-transfection of the PRSIM-based CID and a guide RNA (gRNA) targeting the promoter of interleukin-2 (IL-2) allows dCas9 to bind to the target site on the promoter of IL-2. Upon administration of the PRSIM dimeriser (simeprevir) the AD and associated transcription machinery is subsequently recruited to the promoter region of the endogenous IL-2 gene, enabling initiation of transcription (
In HEK293 cells transiently expressing the PRSIM regulated split dcas9/AD cassette and an IL-2 targeted gRNA, the addition of simeprevir resulted in secretion of IL-2. (
This data demonstrates that the PRSIM heterodimerising system can be used for simeprevir-mediated regulation of endogenous gene expression.
Having demonstrated that formation of the active switch complex is dependent on the presence of simeprevir, we wanted to test the specificity of this interaction with respect to alternative small molecule inhibitors of HCV protease. There are several small molecule inhibitors that are known to bind the HCV NS3/4A protease and have been approved for human use. A panel of such small molecules were assessed for their ability to induce complex formation between HCV NS3/4A PR (S139A) and PRSIM_23 or PRSIM_57. These were glecaprevir, boceprevir, telaprevir, asunaprevir, paritaprevir, vaniprevir, narlaprevir, grazoprevir, and danoprevir.
A homogeneous time-resolved fluorescence (HTRF) binding assay (
This data suggests that administration of other HCV NS3/4A PR inhibitor small molecules, such as in the case of a HCV-infected individual, would not be able to form an active HCV NS3/4A PR (S139A):PRSIM_23 complex, and that the HCV NS3/4A PR (S139A):PRSIM_23 complex is exquisitely specific for simeprevir.
The affinity of simeprevir for HCV NS3/4A PR is very high (Example 3;
In order to identify mutations on the Hepatitis C virus (HCV) protease protein that reduce simeprevir binding, the co-crystal structure of HCV NS3/NS4A in complex with simeprevir (PDB: 3KEE, Resolution: 2.4 Å) was first analysed. The analysis showed that the HCV NS3/NS4A:simeprevir interface is made up of 25 HCV residues where 6 residues contribute towards hydrogen bond and salt bridge interactions and 12 are surface-exposed (
Having identified a panel of mutants predicted to reduce the affinity of HCV NS3/4A PR for simeprevir, we reasoned that if the mutations affected the affinity of HCV NS3/4A PR for simeprevir as predicted, this would the influence formation of the HCV NS3/4A PR (S139A): simeprevir: PRSIM_23 complex. To assess the impact these mutations have on the formation of the HCV NS3/4A PR (S139A): simeprevir: PRSIM_23 complex we measured the amount of complex formation in a homogeneous time-resolved fluorescence (HTRF) binding assay (
We found that mutations made at positions R155, H57 and S139 were not tolerated and no complex was formed. Mutations made at position K136 resulted in complex-competent HCV PR variants, with the degree of complex formation reaching the same maximum as observed with HCV PR “wt”. Mutations at residue D168 are also tolerated, but with a reduction in the amount of complex formed at the equivalent HCV PR concentration. The EC50 observed for simeprevir is increased with K136N and K136D mutants, and for all mutants at position D168, indicating that different affinities exist within the complex for these mutants.
Having identified that mutations at position K136 and D168 resulted in complex competent HCV PR variants, three mutants (K136D, K136N and D168E) were selected for further characterization. To assess the impact on the affinity of simeprevir, the kinetics of simeprevir binding to HCV NS3/4A PR ‘WT’ (S139A) protease and the three mutants were determined using Octet RED384 (
To further characterise the three mutant proteases, the effect of simeprevir concentration on the formation of the mutant HCV NS3/4A PR (S139A)/PRSIM_23 complex was also assessed using Biacore 8K (FIG. 24A, Table 6). In line with the decreased affinity of simeprevir (Table 5) the EC50 of simeprevir in the HCV NS3/4A PR K136D/simeprevir/PRSIM_23 complex had increased to 131.5 nM, ˜30-fold higher than for the wt complex. The K136N mutation also resulted in a higher EC50 for simeprevir compared to ‘wt’, albeit the effect was less than for the K136D mutation. The 0168E mutation however, had an almost equivalent EC50 compared to the ‘wt’ complex; 3.69 and 4.53 nM, respectively.
The HCV NS3/4A PR mutants binding in the presence or absence of simeprevir to PRSIM_23 were also determined using Biacore 8K (
Having demonstrated that HCV NS3/4A PR (S139A):PRSIM_23 complex formation was specific for simeprevir (Example 13), we went on to investigate whether our panel of small molecule HCV PR inhibitors were able to disrupt the HCV NS3/4A PR (S139A): simeprevir: PRSIM_23 complex, by competing with simeprevir for binding to HCV PR. We found that when the small molecule inhibitors were added in a homogeneous time-resolved fluorescence (HTRF) binding assay concomitantly with simeprevir, a subset of these small molecules were able to inhibit HCV NS3/4A PR (S139A):PRSIM_23 complex formation. However, when simeprevir is pre-incubated with HCV NS3/4A PR (S139A) prior to addition of the small molecule inhibitors, no significant complex inhibition is observed (
To further characterise the mutations made to the HCV NS3/4A PR, we investigated whether the small molecules were able to disrupt pre-formed mutant HCV PR: simeprevir: PRSIM_23 complexes. Where a mutation at position 136 is made, more pronounced inhibition of the mutant HCV PR: simeprevir: PRSIM_23 complex is observed with a subset of the small molecule inhibitors (asunaprevir, paritaprevir, vaniprevir, grazoprevir, danoprevir and glecaprevir), but not with others (narlaprevir, boceprevir and telaprevir) (
The ability of other small molecule inhibitors (asunaprevir, paritaprevir, vaniprevir, grazoprevir, danoprevir and glecaprevir) to “compete” with simeprevir, and disrupt the complex between PRSIM_23 and mutant versions of HCV NS3/4A PR, provides an opportunity to rapidly inactivate any PRSIM-based CID, and turn off transgene expression or therapeutic activity. Furthermore, the inability of other inhibitors (narlaprevir, boceprevir and telaprevir) to compete with simeprevir for HCV NS3/4A PR binding provides an opportunity to develop orthogonal HCV NS3/4A PR-based molecular switches that are induced by these small molecules.
To assess the impact mutations of HCV NS3/4A PR on gene regulation we generated pHet-Act1-2-based constructs encoding HCV NS3/4A PR (S139A)-AD mutants & DBD-PRSIM_23 (three tandem copies). Following transfection of cells with these pHet-Act1-2 (HCV NS3/4A PR (S139A)-AD mutants & DBD-PRSIM_23 (three tandem copies)) constructs or ‘WT’ construct (HCV NS3/4A PR (S139A)-AD & DBD-PRSIM_23 (three tandem copies)) with the reporter construct pZFHD1_Luciferase, we assessed gene expression. The ability to regulate luciferase gene expression in the presence of increasing concentrations of simeprevir was determined. All PRSIM HCV NS3/4A PR (S139A)-AD mutants demonstrated dose-dependent gene expression of luciferase, albeit with a slight reduction of the max fold change and increase in EC50 relative to the ‘WT’ HCV NS3/4A PR (S139A)-AD (
The combined data from examples 14-19 suggests that mutant HCV NS3/4A PR-containing PRSIM-based CIDs could provide alternatives to the HCV NS3/4A (S139A) “wt”-based CID in scenarios where rapid reversal of CID-based activity is required, through the administration of “competing” small molecule HCV PR inhibitors.
To assess whether the decreased affinity/increased dissociation rate of the HCV NS3/4A PR (S139A) mutants (K1360, D168E, K136N) would impact the rate at which gene expression could be switched off upon simeprevir removal, a cell-based assay was performed using a live cell time-course assay. Monoclonal stable cell lines were generated in which the expression of a short-lived green fluorescent protein (GFP-PEST, half-live ˜2h) was placed under the control of a split transcription factor composed of HCV NS3/4A PR (S139A)-AD variants & DBD-PRSIM_23 (three tandem copies). GFP expression was induced by 24h treatment with simeprevir after which simeprevir was removed and GFP fluorescence at timepoints after removal was determined. The ‘WT’S139A retained high GFP-fluorescence over 24 h. This shows that once formed in a simeprevir-dependent fashion, the transcription factor complex containing the HCV NS3/4A PR (S139A) remains stable for a prolonged period of time to drive continued GFP-PEST-expression which does not require the continued presence of excess simeprevir in the culture medium. However, over the same period of time, all three mutants (K136D, K136N, D168E) return to a native, non-expressing state within 15-24h after removal of simeprevir demonstrating the reduced stability of the transcription factor complexes formed using HCV NS3/4A PR (S139A)-AD mutants & DBD-PRSIM_23 (three tandem copies) compared to HCV NS3/4A PR (S139A)-AD ‘WT’ & DBD-PRSIM_23 (three tandem copies).
This data suggests that, by reducing the affinity of simeprevir to mutants of HCV NS3/4A PR, it is possible to alter the kinetics of gene expression, enabling the cessation of gene expression to occur faster than when using the “wt” HCV NS3/4A PR-based CID, in the split transcription format.
Simeprevir induces the formation of a heterodimer of the HCV NS3/4A PR (S139A) and the scFv molecule PRSIM_57 by binding to a pocket on the surface of the protease and generating a new epitope that is specifically recognised by PRSIM_57. In order to understand the molecular mechanisms underlying this heterodimerisation event, a crystal structure of the complex between protease, scFv and simeprevir was determined. To derive the structure, forms of the protease and PRSIM_57 scFv with tobacco etch virus (TEV)-cleavable His-tags were both expressed separately in BL21(DE3) E. coli. The proteins were purified to homogeneity using a combination of immobilised metal affinity chromatography and size exclusion chromatography, and tags removed by treatment with TEV protease. In order to form the ternary complex, the protease was incubated with an excess of PRSIM_57 and simeprevir and the resulting complex was purified from non-complexed material using size exclusion chromatography. The fractions containing pure complex were pooled and concentrated to 12 mg/ml and set up in crystal trials. The complex was crystallised via sitting drop vapour diffusion and X-ray diffraction data were collected from crystals at a synchrotron X-ray source. The structure was solved using molecular replacement with the structure of the apo form of HCV NS3/4A PR (S139A) as the search model.
All three components of the ternary complex are clearly visible in the electron density (
The following interactions can be identified between PRSIM_57 and HCV NS3/4A PR (S139A) (
The ability to “remotely control” therapeutic cells once they have been administered, provides a safety net, in the advent of uncontrolled proliferation or adverse event. One way to control such cells is to endow them with a so called “kill switch” such that they can be removed at will once they have performed their function or pose a safety risk. As such, a PRSIM-based, simeprevir-responsive Caspase 9-based kill switch was generated and tested in vitro. The homo-dimerisation CARD domain of Caspase 9 was replaced with both the PRSIM23 and HCV NS3/4A PR (S139A) domains, separated by short linkers. An active Caspase 9 homodimer can thus only be reconstituted by addition of simeprevir (
To demonstrate that the PRSIM-based kill switch can eliminate therapeutically-relevant cells, stable cell lines were made in both embryonic stem (ES) cells and induced pluripotent stem cells (iPSC). In ES cells, a dose-response to simeprevir can be observed whereby a high dose of simeprevir (1 μM) rapidly and efficiently eliminates up to 95% of cells within 4 hours, as measured by cell confluency, with an onset of ˜15 minutes (
To demonstrate the effectiveness of the PRSIM-based kill switch in iPSC cells, four individual iPSC clones that were biallelic for the PRSIM-based kill switch at the B2M locus were generated. These cells, alongside parental iPSC cells were incubated with 1 nM simeprevir and the cell proliferation index was measured over time using the xCELLigence RTCA Software Pro (ACEA Biosciences Inc.). All cell clones that encoded the PRSIM-based kill switch showed a dramatic reduction in cell proliferation index after 5 hours which was maintained over the course of the experiment (˜60 hours, post-simeprevir addition), whereas the parental cells continued to proliferate.
These data demonstrate that a PRSIM-based kill switch can efficiently eliminate a wide range of cell types in vitro and provides a means for the rapid removal of therapeutic cells in patients.
Caspase 9 can be inactivated by Aid kinase-mediated phosphorylation on Ser96. This poses a risk of “escape” from Caspase 9 mediated apoptosis by cells that have undergone phosphorylation of Ser96 on the Caspase 9 fusion protein. To mitigate this risk, a stable HEK cell line encoding the PRSIM-based kill switch fusion protein containing a Ser196 to Ala substitution was generated. Addition of 100 nM simeprevir to kill switch S196A cells showed rapid cell killing in a timeframe comparable to the wt kill switch (
A number of publications are cited above in order to more fully describe the present disclosure and the state of the art to which the disclosure pertains. Full citations for these references are provided below. The entirety of each of these references is incorporated herein.
For standard molecular biology techniques, see Sambrook, J., Russel, D. W. Molecular Cloning, A Laboratory Manual. 3 ed. 2001, Cold Spring Harbor, New York: Cold Spring Harbor Laboratory Press
This application is a U.S. National Stage application of International Application No. PCT/IB2020/056657, filed on Jul. 15, 2020, said International Application No. PCT/IB2020/056657 claims benefit under 35 U.S.C. § 119(e) of the U.S. Provisional Application No. 62/874,025, filed Jul. 15, 2019. Each of the above listed applications is incorporated by reference herein in its entirety for all purposes.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/IB2020/056657 | 7/15/2020 | WO |
Number | Date | Country | |
---|---|---|---|
62874025 | Jul 2019 | US |