1. Field of the Invention
The present invention relates generally to recombinant protein production, and more specifically to methods for increased protein production using miRNAs and siRNAs as well as compositions utilized in such methods.
2. Background Information
Improving the expression level of recombinant mammalian proteins has not only been pursued by biotechnologist for production of commercial biotherapeutics, but has also been at the heart of numerous biomedical studies in academia, as an adequate supply of correctly folded proteins is a prerequisite for all structure and function studies. A critical area is mammalian integral membrane proteins such as receptors, ion channels and transporters which are encoded by 20-40% of all Open Reading Frames (ORFs) in the mammalian genome and are targets of most of the medicines sold worldwide. Even though more than 100,000 structures have been deposited in Protein Data bank, the overexpression of membrane protein remains difficult and only 898 membrane protein structures are available as of Oct. 2, 2014. Rational attempts to improve membrane protein expression may not lead to expected results as membrane proteins involve particularly complex folding, assembly, and processing pathways, and there is only limited information for the bottlenecks that may reside in the protein production steps, such as transcription, translation, protein folding, secretion and cell viability.
MiRNAs have emerged as powerful tools for engineering cells with desirable properties, such as improved protein production capabilities and enhanced anti-apoptotic properties under stress conditions. MiRNAs are a novel class of small, non-coding RNAs that can simultaneously silence multiple genes by binding to their 3′-untranslated regions (3′-UTR). They exhibit a broad spectrum of regulatory effects in eukaryotic cellular processes including cell growth and apoptosis, cell differentiation and metabolism, cancer development and progression. Their capacity to globally regulate entire gene networks and not introduce an additional translational burden (compared to gene overexpression strategies) makes them particularly advantageous for cell line development.
MicroRNAs (miRNAs) are approximately 21 nucleotide single-stranded small RNAs that regulate posttranscriptional gene expression in metazoans and plants. miRNAs are processed from hairpin precursors and assembled into functional complexes containing Argonaute proteins (termed RNA-induced silencing complex (RISC)), which suppress target mRNA expression. miRNAs are usually generated from noncoding regions of gene transcripts and function to suppress gene expression by translational repression and/or by enhancing mRNA destabilization RNA degradation. Mature microRNAs are short, single-stranded RNA molecules approximately 22 nucleotides in length. MicroRNAs are sometimes encoded by multiple loci, some of which are organized in tandemly co-transcribed clusters.
MicroRNAs usually induce gene silencing by binding to target sites found within the 3′UTR of the targeted mRNA. This interaction prevents protein production by suppressing protein synthesis and/or by initiating mRNA degradation. Since most target sites on the mRNA have only partial base complementarity with their corresponding microRNA, individual microRNAs may target as many as 100 different mRNAs. Moreover, individual mRNAs may contain multiple binding sites for different microRNAs, resulting in a complex regulatory network.
MicroRNAs have been shown to be involved in a wide range of biological processes such as cell cycle control, apoptosis and several developmental and physiological processes including stem cell differentiation, hematopoiesis, hypoxia, cardiac and skeletal muscle development, neurogenesis, insulin secretion, cholesterol metabolism, aging, immune responses and viral replication. In addition, highly tissue-specific expression and distinct temporal expression patterns during embryogenesis suggest that microRNAs play a key role in the differentiation and maintenance of tissue identity.
In recent years, miRNAs have emerged as regulators of numerous activities, including developmental processes, disease pathogenesis, and host-pathogen interactions. miRNA expression and gene regulation is a wide-spread phenomenon, and according to recent miRNA annotation and deep-sequencing data, there are >15,000 microRNA gene loci spanning >140 species and >17,000 distinct mature microRNA sequences. These numbers will surely increase as high-throughput RNA sequencing technologies are applied to discovery of new non-coding RNA.
RNA interference (RNAi), first discovered as a natural biological process of eukaryotic cells for protecting the genome against foreign nucleic acids, has been developed and utilized as a revolutionary tool in deducing gene functions and in combating genetic defects, viral diseases, autoimmune disorders, and cancers. siRNAs are 21-25 nucleotide double-strand RNA fragments with symmetric 2-nucleotides 3′-end overhangs. The guide strand of siRNA can be incorporated into RNA-induced silencing complex (RISC), which brings about sequence-specific degradation of the homologous single stranded mRNAs. In recent years, large-scale genetic screens have been made possible by the availability of genome-wide siRNA libraries, as well as the development of sophisticated new instrumentation and bioinformatics approaches for data analysis. They have been used to investigate the biological functions of specific genes and pathways in various diseases and important biological processes, including signal transduction, cell aging or death, cell or organelle organization, protein localization and responses of host cells to pathogens. However, there has been limited use of a genome-wide siRNA screen for improving heterologous protein production, an important process intensively investigated by the pharmaceutical and biotechnology industry.
The present invention is based on the discovery of miRNAs and siRNAs that can be utilized, either alone or in combination, to enhance protein production.
In one embodiment, the invention provides a method of increasing production of a protein of interest in a cell. The method includes contacting the cell with an miRNA, siRNA or combination thereof under conditions wherein the miRNA or siRNA is incorporated into the cell, wherein an increase in production of the protein greater than that of a control cell not contacted with the miRNA or siRNA is indicative of increased protein production in the cell, thereby increasing production of the protein of interest in the cell. In one aspect, the cell is a mammalian cell, for example, an HEK or CHO cell. In one aspect, the cell transiently expresses the protein and in one aspect, the cell stably expresses the protein. The protein, can be a cytosolic, secreted or a membrane protein, for example.
In another embodiment, the invention provides miRNAs and siRNAs for use in increasing protein production.
In yet another embodiment, the invention provides a vector including the miRNA or siRNA of the invention.
In still another embodiment, the invention provides a cell which includes in the vector of the invention.
In a further embodiment, the invention provides a kit for increasing protein production in a cell. The kit includes a miRNA of the present invention, e.g., a miRNA sequence having a sequence as set forth in SEQ ID NOs:1-26, and a siRNA which inhibits expression of a gene set forth in Table 3.
In yet another embodiment, the invention provides a kit including a reagent for inhibiting or silencing a gene listed in Table 3 for increasing protein production in a cell. In embodiments, the reagent is used to accomplish a genome editing methodology including, but not limited to, a Crispr, zinc finger nuclease, or transcription activator-like effector nuclease (Talen).
In still another embodiment, the invention further provides a method of increasing production of a protein of interest in a cell comprising inhibiting or silencing one or more genes as listed in Table 3. In embodiments, silencing or inhibition is achieved via a genome editing methodology, for example a methodology that includes use of a Crispr, zinc finger nuclease, or transcription activator-like effector nuclease (Talen). In some embodiments, expression of the gene is knocked-out or knocked-down. In some embodiments, silencing or inhibition of gene expression results from deletion or mutation of the gene.
In another embodiment, the invention provides a screening method for obtaining miRNAs for enhancing expression of a protein. The method includes: a) contacting a cell comprising a detectably labeled protein with a plurality of miRNAs; and b) measuring protein production prior to and after contacting with the miRNAs, wherein an increase in expression of the protein after contact is indicative of an miRNA for enhancing expression of the protein. In one aspect, the invention provides for assessing the functionality of the enhanced protein produced.
The present invention is based on the seminal discovery of miRNAs and siRNAs that enhance protein production in a cell. The miRNAs and siRNAs may be used alone or in combination to increase cellular protein production of a protein of interest.
Before the present methods are described, it is to be understood that this invention is not limited to particular methods, and experimental conditions described, as such methods, and conditions may vary. It is also to be understood that the terminology used herein is for purposes of describing particular embodiments only, and is not intended to be limiting, since the scope of the present invention will be limited only in the appended claims.
As used in this specification and the appended claims, the singular forms “a”, “an”, and “the” include plural references unless the context clearly dictates otherwise. Thus, for example, references to “the method” includes one or more methods, and/or steps of the type described herein which will become apparent to those persons skilled in the art upon reading this disclosure and so forth.
Unless defined otherwise, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs. Although any methods and materials similar or equivalent to those described herein can be used in the practice or testing of the invention, the preferred methods and materials are now described.
Obtaining adequate quantities of functional mammalian membrane proteins has been a bottleneck in their structural and functional studies because the expression of these proteins from mammalian cells is relatively low. To explore the possibility of enhancing expression of these proteins using miRNA, a stable T-REx-293 cell line expressing the neurotensin receptor type 1 (NTSR1), a hard-to-express G protein-coupled receptor (GPCR), was constructed. The cell line was then subjected to human miRNA mimic library screening. In parallel, an HEK293 cell line expressing luciferase was also screened with the same human miRNA mimic library. Five microRNA mimics: hsa-miR-22-5p (SEQ ID NO:3), hsa-miR-18a-5p (SEQ ID NO:25), hsa-miR-22-3p (SEQ ID NO:21), hsa-miR-429 (SEQ ID NO:2) and hsa-miR-2110 (SEQ ID NO:20) were identified from both screens. They led to 48% increase in the expression of functional NTSR1 and to 239% increase of luciferase expression. These miRNAs were also effective in enhancing the expression of secreted glypican-3 hFc-fusion protein in HEK293 cell. The results indicate that these molecules may have a wide role in enhancing production of proteins with biomedical interest.
In a related aspect, for the purpose of improving recombinant protein production from mammalian cells, an unbiased, high-throughput whole-genome RNA interference screen was conducted using human embryonic kidney 293 (HEK 293) cells expressing firefly luciferase. 21,585 human genes were individually silenced with three different siRNAs for each gene. 56 genes whose silencing caused the greatest improvement in the luciferase expression were found to be part of several different pathways that are associated with spliceosome formation/mRNA processing, transcription, metabolic process, transport and protein folding. 10 genes whose downregulation significantly enhanced the protein expression were validated by their silencing effect on four different recombinant proteins. Among the validated genes, OAZ1—the gene encoding the ornithine decarboxylase antizyme1—was selected for detailed investigation, since its silencing improved the reporter protein production without affecting cell viability. Silencing OAZ1 caused the increase of the omithine decarboxylase enzyme and the cellular levels of putrescine and spermidine, and indicated that increased cellular polyamines enhanced luciferase expression without affecting its transcription. The study shows that OAZ1 is a novel target for improving expression of recombinant proteins. The genome-scale screening demonstrated in this work can establish the foundation for targeted design of an efficient mammalian cell platform for different biotechnological applications.
The terms “microRNA”, “miRNA”, or “miR” all refer to non-coding RNAs (and also, as the context will indicate, to DNA sequences that encode such RNAs) that are capable of entering the RNAi pathway and regulating gene expression. “Primary miRNA” or “pri-miRNA” represents the non-coding transcript prior to Drosha processing and includes the stem-loop structure(s) as well as flanking 5′ and 3′ sequences. “Precursor miRNAs” or “pre-miRNA” represents the non-coding transcript after Drosha processing of the pri-miRNA. The term “mature miRNA” can refer to the double stranded product resulting from Dicer processing of pre-miRNA or the single stranded product that is introduced into RISC following Dicer processing. In some cases, only a single strand of an miRNA enters the RNAi pathway. In other cases, two strands of a miRNA are capable of entering the RNAi pathway. Illustrative examples of the invention are provided in Attachment A.
As used herein, the term “RNA silencing” refers to a group of sequence-specific regulatory mechanisms (e.g., RNA interference (RNAi), transcriptional gene silencing (TGS), post-transcriptional gene silencing (PTGS), quelling, co-suppression, and translational repression) mediated by RNA molecules which result in the inhibition or “silencing” of the expression of a corresponding protein-coding gene. RNA silencing has been observed in many types of organisms, including plants, animals, and fungi.
The term “discriminatory RNA silencing” refers to the ability of an RNA molecule to substantially inhibit the expression of a “first” or “target” polynucleotide sequence while not substantially inhibiting the expression of a “second” or “non-target” polynucleotide sequence”, e.g. when both polynucleotide sequences are present in the same cell. In certain embodiments, the target polynucleotide sequence corresponds to a target gene, while the non-target polynucleotide sequence corresponds to a non-target gene. In other embodiments, the target polynucleotide sequence corresponds to a target allele, while the non-target polynucleotide sequence corresponds to a non-target allele. In certain embodiments, the target polynucleotide sequence is the DNA sequence encoding the regulatory region (e.g. promoter or enhancer elements) of a target gene. In other embodiments, the target polynucleotide sequence is a target mRNA encoded by a target gene.
As used herein, the term “target gene” is a gene whose expression is to be substantially inhibited or “silenced” as regards siRNA, or a gene or plurality of genes that serve as regulatory genes in which increased expression results in increased protein production as regards miRNA. This silencing can be achieved by RNA silencing, for example by cleaving the mRNA of the target gene or by translational repression of the target gene. Alternatively, inhibition or silencing may be achieved by genome editing tools to inhibit expression of the gene at the genomic level, e.g., gene knock-out via, for example, deletion or mutation of the gene. The term “non-target gene” is a gene whose expression is not to be substantially inhibited. In one embodiment, the polynucleotide sequences of the target and non-target gene (e.g. mRNA encoded by the target and non-target genes) can differ by one or more nucleotides. In another embodiment, the target and non-target genes can differ by one or more polymorphisms. In another embodiment, the target and non-target genes can share less than 100% sequence identity. In another embodiment, the non-target gene may be a homolog (e.g. an ortholog or paralog) of the target gene.
A “target allele” is an allele whose expression is to be selectively inhibited or “silenced.” This silencing can be achieved by RNA silencing, such as, for example, by cleaving the mRNA of the target gene or target allele by an siRNA. Alternatively, inhibition may be achieved by genome editing tools to inhibit expression of the gene at the genomic level, e.g., gene knock-out via, for example, deletion or mutation of the gene. The term “non-target allele” is a allele whose expression is not to be substantially inhibited. In certain embodiments, the target and non-target alleles can correspond to the same target gene. In other embodiments, the target allele corresponds to a target gene, and the non-target allele corresponds to a non-target gene. In one embodiment, the polynucleotide sequences of the target and non-target alleles can differ by one or more nucleotides. In another embodiment, the target and non-target alleles can differ by one or more allelic polymorphisms. In another embodiment, the target and non-target alleles can share less than 100% sequence identity.
As used herein, the term “RNA silencing agent” refers to an RNA which is capable of inhibiting or “silencing” the expression of a target gene. In certain embodiments, the RNA silencing agent is capable of preventing complete processing (e.g, the full translation and/or expression) of a mRNA molecule through a post-transcriptional silencing mechanism. RNA silencing agents include small (<50 b.p.), noncoding RNA molecules, for example RNA duplexes comprising paired strands, as well as precursor RNAs from which such small non-coding RNAs can be generated. Exemplary RNA silencing agents include siRNAs, miRNAs, siRNA-like duplexes, and dual-function oligonucleotides as well as precursors thereof. In a certain embodiment, the RNA silencing agent is capable of silencing miRNA either by an RNA-induced silencing complex (RISC)-like ribonucleoprotein particle (miRNP) which inhibits translations or, depending on the degree of Watson-Crick complementarity, induces degradation of target mRNAs. In another embodiment, the RNA silencing agent is capable of inducing RNA interference (RNAi). In yet another embodiment, the RNA silencing agent is capable of mediating translational repression.
As used herein, the term “microRNA inhibitor” or “anti-microRNA” is synonymous with the term “microRNA antagonist”. Additionally, the term “microRNA mimic” is synonymous with the term “microRNA agonist”.
The term “nucleoside” refers to a molecule having a purine or pyrimidine base. covalently linked to a ribose or deoxyribose sugar. Exemplary nucleosides include adenosine, guanosine, cytidine, uridine and thymidine. Additional exemplary nucleosides include inosine, 1-methyl inosine, pseudouridine, 5,6-dihydrouridine, ribothymidine, 2N-methylguanosine and 2,2N,N-dimethylguanosine (also referred to as “rare” nucleosides). The term “nucleotide” refers to a nucleoside having one or more phosphate groups joined in ester linkages to the sugar moiety. Exemplary nucleotides include nucleoside monophosphates, diphosphates and triphosphates. The terms “polynucleotide” and “nucleic acid molecule” are used interchangeably herein and refer to a polymer of nucleotides joined together by a phosphodiester linkage between 5′ and 3′ carbon atoms.
The term “RNA” or “RNA molecule” or “ribonucleic acid molecule” refers to a polymer of ribonucleotides. The term “DNA” or “DNA molecule” or deoxyribonucleic acid molecule” refers to a polymer of deoxyribonucleotides. DNA and RNA can be synthesized naturally (e.g. by DNA replication or transcription of DNA, respectively). RNA can be post-transcriptionally modified. DNA and RNA can also be chemically synthesized. DNA and RNA can be single-stranded (i.e. ssRNA and ssDNA, respectively) or multi-stranded (e.g. double stranded, i.e. dsRNA and dsDNA, respectively). “mRNA” or “messenger RNA” is single-stranded RNA that specifies the amino acid sequence of one or more polypeptide chains. This information is translated during protein synthesis when ribosomes bind to the mRNA.
As used herein, the term “rare nucleotide” refers to a naturally occurring nucleotide that occurs infrequently, including naturally occurring deoxyribonucleotides or ribonucleotides that occur infrequently, e.g. a naturally occurring ribonucleotide that is not guanosine, adenosine, cytosine, or uridine. Examples of rare nucleotides include, but are not limited to, inosine, 1-methyl inosine, pseudouridine, 5,6-dihydrouridine, ribothymidine, 2N-methylguanosine and 2,2N,N-dimethylguanosine.
The term “nucleotide analog” or “altered nucleotide” or “modified nucleotide” refers to a non-standard nucleotide, including non-naturally occurring ribonucleotides or deoxyribonucleotides. Nucleotide analogs may be modified at any position so as to alter certain chemical properties of the nucleotide yet retain the ability of the nucleotideanalog to perform its intended function. Examples of modified nucleotides include, but are not limited to, 2-amino-guanosine, 2-amino-adenosine, 2,6-diamino-guanosine and 2,6-diamino-adenosine. Examples of positions of the nucleotide which may be derivitized include the 5 position, e.g. 5-(2-amino)propyl uridine, 5-bromo uridine, 5-propyne uridine, 5-propenyl uridine, etc.; the 6 position, e.g. 6-(2-amino)propyl uridine; the 8-position for adenosine and/or guanosines, e.g. 8-bromo guanosine, 8-chloro guanosine, 8-fluoroguanosine, and the like.
Nucleotide analogs also include deaza nucleotides, e.g. 7-deaza-adenosine; O- and N-modified (e.g. alkylated, e.g. N6-methyl adenosine, or as otherwise known in the art) nucleotides; and other heterocyclically modified nucleotide analogs such as those described in Herdewijn, Antisense Nucleic Acid Drug Dev., 2000 Aug. 10(4):297-310.
Nucleotide analogs may also comprise modifications to the sugar portion of the nucleotides. For example the 2′ OH-group may be replaced by a group selected from H, OR, R, F, Cl, Br, I, SH, SR, NH2, NHR, NR2, COOR, or OR, wherein R is substituted or unsubstituted C1-C6 alkyl, alkenyl, alkynyl, aryl, and the like. Other possible modifications include those described in U.S. Pat. Nos. 5,858,988, and 6,291,438.
The phosphate group of the nucleotide may also be modified, e.g. by substituting one or more of the oxygens of the phosphate group with sulfur (e.g. phosphorothioates), or by making other substitutions which allow the nucleotide to perform its intended function such as described in, for example, Eckstein, Antisense Nucleic Acid Drug Dev. 2000 Apr. 10(2): 117-21, Rusckowski et al. Antisense Nucleic Acid Drug Dev. 2000 Oct. 10(5):333-45, Stein, Antisense Nucleic Acid Drug Dev. 2001 Oct. 11(5): 317-25, Vorobjev et al. Antisense Nucleic AcidDrug Dev. 2001 Apr. 11(2):77-85, and U.S. Pat. No. 5,684,143. Certain of the above-referenced modifications (e.g. phosphate group modifications) decrease the rate of hydrolysis of, for example, polynucleotides comprising the analogs in vivo or in vitro.
The term “oligonucleotide” refers to a short polymer of nucleotides and/or nucleotide analogs. The term “RNA analog” refers to a polynucleotide (e.g. a chemically synthesized polynucleotide) having at least one altered or modified nucleotide as compared to a corresponding unaltered or unmodified RNA but retaining the same or similar nature or function as the corresponding unaltered or unmodified RNA. The oligonucleotides may be linked with linkages which result in a lower rate of hydrolysis of the RNA analog as compared to an RNA molecule with phosphodiester linkages. For example, the nucleotides of the analog may comprise methylenediol, ethylene diol, oxymethylthio, oxyethylthio, oxycarbonyloxy, phosphorodiamidate, and/or phosphorothioate linkages. Exemplary RNA analogues include sugar- and/or backbone-modified ribonucleotides and/or deoxyribonucleotides. Such alterations or modifications can further include addition of non-nucleotide material, such as to the end(s) of the RNA or internally (at one or more nucleotides of the RNA). An RNA analog need only be sufficiently similar to natural RNA that it has the ability to mediate (mediates) RNA silencing (e.g. RNA interference). In an exemplary embodiment, oligonucleotides comprise Locked Nucleic Acids (LNAs) or Peptide Nucleic Acids (PNAs).
As used here, the term “melting temperature” or “Tm” refers to the temperature at which half of a population of double-stranded polynucleotide molecules becomes dissociated into single strands.
As used herein, the terms “sufficient complementarity” or “sufficient degree of complementarity” mean that the RNA silencing agent has a sequence (e.g. in the antisense strand, mRNA targeting moiety or miRNA recruiting moiety) which is sufficient to bind the desired target RNA respectively, and to trigger the RNA silencing of the target mRNA.
As used herein, the term “translational repression” refers to a selective inhibition of mRNA translation. Natural translational repression proceeds via miRNAs cleaved from shRNA precursors. Both RNAi and translational repression are mediated by RISC. Both RNAi and translational repression occur naturally or can be initiated by the hand of man, for example, to silence the expression of target genes.
As used herein, the term “small interfering RNA” (“siRNA”) (also referred to in the art as “short interfering RNAs”) refers to an RNA (or RNA analog) comprising between about 5-60 nucleotides (or nucleotide analogs) which is capable of directing or mediating RNA silencing (e.g. RNA interference or translational repression). A siRNA may comprise between about 15-30 nucleotides or nucleotide analogs, between about 16-25 nucleotides (or nucleotide analogs), between about 18-23 nucleotides (or nucleotide analogs), and between about 19-22 nucleotides (or nucleotide analogs) (e.g. 19, 20, 21 or 22 nucleotides or nucleotide analogs). The term “short” siRNA refers to a siRNA comprising 5-23 nucleotides, ˜21 nucleotides (or nucleotide analogs), for example, 19, 20, 21 or 22 nucleotides. The term “long” siRNA refers to a siRNA comprising 24-60 nucleotides, ˜24-25 nucleotides, for example, 23, 24, 25 or 26 nucleotides. Short siRNAs may, in some instances, include fewer than 19 nucleotides, e.g. 16, 17 or 18 nucleotides, or as few as 5 nucleotides, provided that the shorter siRNA retains the ability to mediate RNAi. Likewise, long siRNAs may, in some instances, include more than 26 nucleotides, e.g. 27, 28, 29, 30, 35, 40, 45, 50, 55, or even 60 nucleotides, provided that the longer siRNA retains the ability to mediate RNAi or translational repression absent further processing, e.g. enzymatic processing, to a short siRNA.
As used herein, the term “antisense strand” of an RNA silencing agent, e.g. an siRNA or RNAi agent, refers to a strand that is substantially complementary to a section of about 10-50 nucleotides, e.g. about 15-30, 16-25, 18-23 or 19-22 nucleotides of the mRNA of the gene targeted for silencing. The antisense strand or first strand has sequence sufficiently complementary to the desired target mRNA sequence to direct target-specific silencing, e.g. complementarity sufficient to trigger the destruction of the desired target mRNA by the RNAi machinery or process (RNAi interference) or complementarity sufficient to trigger translational repression of the desired target mRNA.
The term “sense strand” or “second strand” of an RNA silencing agent, e.g. an siRNA or RNAi agent, refers to a strand that is complementary to the antisense strand or first strand. Antisense and sense strands can also be referred to as first or second strands, the first or second strand having complementarity to the target sequence and the respective second or first strand having complementarity to the first or second strand. miRNA duplex intermediates or siRNA-like duplexes include a miRNA strand having sufficient complementarity to a section of about 10-50 nucleotides of the mRNA of the gene targeted for silencing and a miRNA strand having sufficient complementarity to form a duplex with the miRNA strand.
As used herein, the term “guide strand” refers to a strand of an RNAi agent, e.g. an antisense strand of an miRNA duplex or miRNA sequence, that enters into the RISC complex and directs cleavage of the target mRNA.
As used herein, the term “passenger strand” refers to the strand typically not incorporated into risk, present in lower levels in the steady state. It is to be understood, however, that in certain cases, both strands of the duplex, i.e., both the “passenger strand” and the “guide strand” are viable and may be functional miRNA that enters into the RISC complex and directs cleavage of the target mRNA;
The term “engineered,” as in an engineered RNA precursor, or an engineered nucleic acid molecule, indicates that the precursor or molecule is not found in nature, in that all or a portion of the nucleic acid sequence of the precursor or molecule is created or selected by man. Once created or selected, the sequence can be replicated, translated, transcribed, or otherwise processed by mechanisms within a cell. Thus, an RNA precursor produced within a cell from a transgene that includes an engineered nucleic acid molecule is an engineered RNA precursor.
An “isolated nucleic acid molecule or sequence” is a nucleic acid molecule or sequence that is not immediately contiguous with both of the coding sequences with which it is immediately contiguous (one on the 5′ end and one on the 3′ end) in the naturally occurring genome of the organism from which it is derived. The term therefore includes, for example, a recombinant DNA or RNA that is incorporated into a vector; into an autonomously replicating plasmid or virus; or into the genomic DNA of a prokaryote or eukaryote, or which exists as a separate molecule (e.g. a cDNA or a genomic DNA fragment produced by PCR or restriction endonuclease treatment) independent of other sequences. It also includes a recombinant DNA that is part of a hybrid gene encoding an additional polypeptide sequence.
As used herein, the term “isolated RNA” (e.g. “isolated shRNA”, “isolated siRNA”, “isolated siRNA-like duplex”, “isolated miRNA”, “isolated gene silencing agent”, or “isolated RNAi agent”) refers to RNA molecules which are substantially free of other cellular material, or culture medium when produced by recombinant techniques, or substantially free of chemical precursors or other chemicals when chemically synthesized.
As used herein, the term “transgene” refers to any nucleic acid molecule, which is inserted by artifice into a cell, and becomes part of the genome of the organism that develops from the cell. Such a transgene may include a gene that is partly or entirely heterologous (i.e. foreign) to the transgenic organism, or may represent a gene homologous to an endogenous gene of the organism. The term “transgene” also means a nucleic acid molecule that includes one or more selected nucleic acid sequences, e.g. DNAs, that encode one or more engineered RNA precursors, to be expressed in a transgenic organism, e.g. animal, which is partly or entirely heterologous, i.e. foreign, to the transgenic animal, or homologous to an endogenous gene of the transgenic animal, but which is designed to be inserted into the animal's genome at a location which differs from that of the natural gene. A transgene includes one or more promoters and any other DNA, such as introns, necessary for expression of the selected nucleic acid sequence, all operably linked to the selected sequence, and may include an enhancer sequence.
As used herein, “silencing” or “inhibiting” refers to various methods to reduce or eliminate expression of a target gene using siRNA as well as genome editing including CRisprs, zinc fingers, and tale nucleases. Such methods are used to knock-out or knock-down a gene.
A gene “involved” in a disease or disorder includes a gene, the normal or aberrant expression or function of which effects or causes the disease or disorder or at least one symptom of the disease or disorder.
Sequence identity may be determined by sequence comparison and alignment algorithms known in the art. To determine the percent identity of two nucleic acid sequences (or of two amino acid sequences), the sequences are aligned for optimal comparison purposes (e.g. gaps can be introduced in the first sequence or second sequence for optimal alignment). The nucleotides (or amino acid residues) at corresponding nucleotide (or amino acid) positions are then compared. When a position in the first sequence is occupied by the same residue as the corresponding position in the second sequence, the molecules are identical at that position. The percent identity between the two sequences is a function of the number of identical positions shared by the sequences (i.e. % homology=number of identical positions/total number of positions×100), optionally penalizing the score for the number of gaps introduced and/or length of gaps introduced.
The comparison of sequences and determination of percent identity between two sequences can be accomplished using a mathematical algorithm. In one embodiment, the alignment generated over a certain portion of the sequence aligned having sufficient identity but not over portions having low degree of identity (i.e. a local alignment). A non-limiting example of a local alignment algorithm utilized for the comparison of sequences is the algorithm of Karlin and Altschul (1990) Proc. Natl. Acad Sci. USA 87:2264-68, modified as in Karlin and Altschul (1993) Proc. Natl. Acad. Sci. USA 90:5873-77. Such an algorithm is incorporated into the BLAST programs (version 2.0) of Altschul, et al. (1990) J. Mol. Biol. 215:403-10.
In another embodiment, the alignment is optimized by introducing appropriate gaps and percent identity is determined over the length of the aligned sequences (i.e. a gapped alignment). To obtain gapped alignments for comparison purposes, Gapped BLAST can be utilized as described in Altschul et al., (1997) Nucleic Acids Res. 25(17):3389-3402. In another embodiment, the alignment is optimized by introducing appropriate gaps and percent identity is determined over the entire length of the sequences aligned (i.e. a global alignment). A non-limiting example of a mathematical algorithm utilized for the global comparison of sequences is the algorithm of Myers and Miller, CABIOS (1989). Such an algorithm is incorporated into the ALIGN program (version 2.0) which is part of the GCG sequence alignment software package. When utilizing the ALIGN program for comparing amino acid sequences, a PAM120 weight residue table, a gap length penalty of 12, and a gap penalty of 4 can be used.
miRNAs are noncoding RNAs of approximately 22 nucleotides which can regulate gene expression at the post transcriptional or translational level during plant and animal development. One common feature of miRNAs is that they are all excised from an approximately 70 nucleotide precursor RNA stem-loop termed pre-miRNA, probably by Dicer, an RNase III-type enzyme, or a homolog thereof.
The miRNA sequence can be similar or identical to that of any naturally occurring miRNA (see e.g. The miRNA Registry; Griffiths-Jones S, Nuc. Acids Res., 2004). Over one thousand natural miRNAs have been identified to date and together they are thought to comprise ˜1% of all predicted genes in the genome. Many natural miRNAs are clustered together in the introns of pre-mRNAs and can be identified in silico using homology-based searches (Pasquinelli et al., 2000; Lagos-Quintana et al., 2001; Lau et al., 2001; Lee and Ambros, 2001) or computer algorithms (e.g. MiRScan, MiRSeeker) that predict the capability of a candidate miRNA gene to form the stem loop structure of a pri-mRNA (Grad et al., Mol. Cell, 2003; Lim et al., Genes Dev., 2003; Lim et al., Science, 2003; Lai E C et al., Genome Bio.} 2003). An online registry provides a searchable database of all published miRNA sequences (The miRNA Registry at the Sanger Institute website; Griffiths-Jones S, Nuc. Acids Res., 2004). Exemplary, natural miRNAs include lin-4, let-7, miR-10, miRR-15, miR-16, miR-168, miR-175, miR-196 and their homologs, as well as other natural miRNAs from humans and certain model organisms including Drosophila melemogaster, Caenorhabditis elegans, zebrafish, Arabidopsis thalania, mouse, and rat as described in International PCT Publication No. WO 03/029459.
Naturally-occurring miRNAs are expressed by endogenous genes in vivo and are processed from a hairpin or stem-loop precursor (pre-miRNA or pri-miRNAs) by Dicer or other RNAses (Lagos-Quintana et al., Science, 2001; Lau et al., Science, 2001; Lee and Ambros, Science, 2001; Lagos-Quintana et al., Curr. Biol., 2002; Mourelatos et al., Genes Dev., 2002; Reinhart et al., Science, 2002; Ambros et al, Curr. Biol., 2003; Brennecke et al., 2003; Lagos-Quintana et al., RNA, 2003; Lim et al., Genes Dev., 2003; Lim et al., Science, 2003). miRNAs can exist transiently in vivo as a double-stranded duplex but only one strand is taken up by the RISC complex to direct gene silencing. Certain miRNAs, e.g. plant miRNAs, have perfect or near-perfect complementarity to their target mRNAs and, hence, direct cleavage of the target mRNAs. Other miRNAs have less than perfect complementarity to their target mRNAs and, hence, direct translational repression of the target mRNAs. The degree of complementarity between an miRNA and its target mRNA is believed to determine its mechanism of action. For example, perfect or near-perfect complementarity between a miRNA and its target mRNA is predictive of a cleavage mechanism (Yekta et al., Science, 2004), whereas less than perfect complementarity is predictive of a translational repression mechanism. In particular embodiments, the miRNA sequence is that of a naturally-occurring miRNA sequence, the aberrant expression or activity of which is correlated with a miRNA disorder.
Naturally-occurring miRNA precursors (pre-miRNA) have a single strand that forms a duplex stem including two portions that are generally complementary, and a loop, that connects the two portions of the stem. In typical pre-miRNAs, the stem includes one or more bulges, e.g. extra nucleotides that create a single nucleotide “loop” in one portion of the stem, and/or one or more unpaired nucleotides that create a gap in the hybridization of the two portions of the stem to each other. Short hairpin RNAs, or engineered RNA precursors, of the invention are artificial constructs based on these naturally occurring pre-miRNAs, but which are engineered to deliver desired RNAi agents (e.g. siRNAs of the invention). By substituting the stem sequences of the pre-miRNA with sequence complementary to the target mRNA, a shRNA is formed. The shRNA is processed by the entire gene silencing pathway of the cell, thereby efficiently mediating RNAi.
MicroRNAs (miRNAs) are small endogenous non-coding RNAs that post-transcriptionally regulate gene expression by binding with imperfect complementarity in 3′ untranslated regions (3′-UTR) of their target messenger RNAs (mRNAs). MiRNAs are 18-25 nucleotide single-stranded small RNAs associated with a complex of proteins which is called RNA-induced silencing complex (RISC)-like ribonucleoprotein particle (miRNP). This complex inhibits translation or, depending on the degree of Watson-Crick complementarity, induces degradation of target mRNAs. These small RNAs are usually generated from non-coding regions of many gene transcripts and function to suppress gene expression by translational repression. MiRNAs have been shown to play important roles in development, cell growth, and differentiation. Recent studies have highlighted the role of miRNAs in various disease states and in regulating host-pathogen interactions. For example, mRNAs have been implicated in cardiovascular disease, inflammation, viral infections, and cancers. Hence, disease-associated miRNAs could become potential targets for therapeutic intervention.
In embodiments where post-transcriptional gene silencing by translational repression of the target gene is desired, the miRNA sequence has partial complementarity with the target gene sequence. In certain embodiments, the miRNA sequence has partial complementarity with one or more short sequences (complementarity sites) dispersed within the target mRNA (e.g. within the 3′-UTR of the target mRNA) (Hutvagner and Zamore, Science, 2002; Zeng et al., Mol. Cell, 2002; Zeng et al., RNA, 2003; Doench et al., Genes & Dev., 2003). Since the mechanism of translational repression is cooperative, multiple complementarity sites (e.g. 2, 3, 4, 5, or 6) may be targeted in certain embodiments.
In general, the nucleotides comprising a polynucleotide are naturally occurring deoxyribonucleotides, such as adenine, cytosine, guanine or thymine linked to 2′-deoxyribose, or ribonucleotides such as adenine, cytosine, guanine or uracil linked to ribose. Depending on the use, however, a polynucleotide also can contain nucleotide analogs, including non-naturally occurring synthetic nucleotides or modified naturally occurring nucleotides. Nucleotide analogs are well known in the art and commercially available, as are polynucleotides containing such nucleotide analogs. The covalent bond linking the nucleotides of a polynucleotide generally is a phosphodiester bond. However, depending on the purpose for which the polynucleotide is to be used, the covalent bond also can be any of numerous other bonds, including a thiodiester bond, a phosphorothioate bond, a peptide-like bond or any other bond known to those in the art as useful for linking nucleotides to produce synthetic polynucleotides.
A polynucleotide or oligonucleotide comprising naturally occurring nucleotides and phosphodiester bonds can be chemically synthesized or can be produced using recombinant DNA methods, using an appropriate polynucleotide as a template. In comparison, a polynucleotide comprising nucleotide analogs or covalent bonds other than phosphodiester bonds generally will be chemically synthesized, although an enzyme such as T7 polymerase can incorporate certain types of nucleotide analogs into a polynucleotide and, therefore, can be used to produce such a polynucleotide recombinantly from an appropriate template.
As discussed above, in various embodiments antisense oligonucleotides or RNA molecules include oligonucleotides containing modifications. A variety of modifications are known in the art and contemplated for use in the present invention. For example oligonucleotides containing modified backbones or non-natural internucleoside linkages are contemplated. As used herein, oligonucleotides having modified backbones include those that retain a phosphorus atom in the backbone and those that do not have a phosphorus atom in the backbone. For the purposes of this specification, and as sometimes referenced in the art, modified oligonucleotides that do not have a phosphorus atom in their internucleoside backbone can also be considered to be oligonucleosides.
The term “RNA Induced Silencing Complex,” and its acronym “RISC,” refers to the set of proteins that complex with single-stranded polynucleotides such as mature miRNA or siRNA, to target nucleic acid molecules (e.g., mRNA) for cleavage, translation attenuation, methylation, and/or other alterations. Known, non-limiting components of RISC include Dicer, R2D2 and the Argonaute family of proteins, as well as strands of siRNAs and miRNAs.
Methods
In one embodiment, the invention provides a method of increasing production of a protein of interest in a cell. The method includes contacting the cell with miRNA of the present disclosure, siRNA of the present disclosure, or both, to increase protein production.
The protein of interest for use with the invention may be any protein which can be expressed in a cell. For example, the protein may be a cytosolic, secreted or membrane protein. The term “polypeptides/protein” is used broadly to refer to macromolecules comprising linear polymers of amino acids which may act in biological systems, for example, as structural components, enzymes, chemical messengers, receptors, ligands, regulators, hormones, and the like. Such polypeptides/proteins would include functional protein complexes, such as antibodies. The term “antibody” is used broadly herein to refer to a polypeptide or a protein complex that can specifically bind an epitope of a polypeptide or antigen. As used in this invention, the term “epitope” refers to an antigenic determinant on a polypeptide or an antigen, such as a cell surface marker or receptor, to which the paratope of an antibody binds.
Generally, an antibody contains at least one antigen binding domain that is formed by an association of a heavy chain variable region domain and a light chain variable region domain, particularly the hypervariable regions. An antibody can be a naturally occurring antibodies, for example, bivalent antibodies, which contain two antigen binding domains formed by first heavy and light chain variable regions and second heavy and light chain variable regions (e.g., an IgG or IgA isotype) or by a first heavy chain variable region and a second heavy chain variable region (VHH antibodies), or on non-naturally occurring antibodies, including, for example, single chain antibodies, chimeric antibodies, bifunctional antibodies, and humanized antibodies, as well as antigen-binding fragments of an antibody, for example, an Fab fragment, an Fd fragment, an Fv fragment, and the like.
Generally, an antibody contains at least one antigen binding domain that is formed by an association of a heavy chain variable region domain and a light chain variable region domain, particularly the hypervariable regions. Antibodies include polyclonal and monoclonal antibodies, chimeric, single chain, and humanized antibodies, as well as Fab fragments, including the products of an Fab or other immunoglobulin expression library. Antibodies which consists essentially of pooled monoclonal antibodies with different epitopic specificities, as well as distinct monoclonal antibody preparations are provided. Monoclonal antibodies are made by methods well known to those skilled in the art. The term antibody as used in this invention is meant to include intact molecules as well as fragments thereof, such as Fab and F(ab′)2, Fv and SCA fragments which are capable of binding an epitopic determinant on a protein of interest. An Fab fragment consists of a monovalent antigen-binding fragment of an antibody molecule, and can be produced by digestion of a whole antibody molecule with the enzyme papain, to yield a fragment consisting of an intact light chain and a portion of a heavy chain. An Fab′ fragment of an antibody molecule can be obtained by treating a whole antibody molecule with pepsin, followed by reduction, to yield a molecule consisting of an intact light chain and a portion of a heavy chain. Two Fab′ fragments are obtained per antibody molecule treated in this manner. An (Fab′)2 fragment of an antibody can be obtained by treating a whole antibody molecule with the enzyme pepsin, without subsequent reduction. A (Fab′)2 fragment is a dimer of two Fab′ fragments, held together by two disulfide bonds. An Fv fragment is defined as a genetically engineered fragment containing the variable region of a light chain and the variable region of a heavy chain expressed as two chains. A single chain antibody (“SCA”) is a genetically engineered single chain molecule containing the variable region of a light chain and the variable region of a heavy chain, linked by a suitable, flexible polypeptide linker.
Cells for use with the invention generally include eukaryotic cells, such as animal cells. In embodiments, the cells are mammalian cells, such as HEK or CHO cell. However the invention contemplates use of any cell line commonly known in the art for protein production.
miRNAs and siRNAs may be introduced into the cells, according to methods well known in the art. Similarly, the protein of interest may be introduced into the cells, according to methods well known in the art. Typically, a gene encoding the protein is inserted into a plasmid or vector, and the resulting construct is then transfected into appropriate cells, in which the protein is then expressed, and from which the protein is ultimately purified.
In embodiments, a host cell transfected with an expression vector encoding a protein of interest can be cultured under appropriate conditions to allow expression of the protein to occur in the presence of the miRNAs and siRNAs of the invention. The protein may be secreted, by inclusion of a secretion signal sequence, and isolated from a mixture of cells and medium containing the protein. Alternatively, the protein may be retained cytoplasmically and the cells harvested, lysed and the protein isolated. A cell culture includes host cells, media and other byproducts. Suitable media for cell culture are well known in the art. The proteins can be isolated from cell culture medium, host cells, or both using techniques known in the art for purifying proteins, including ion-exchange chromatography, gel filtration chromatography, ultrafiltration, electrophoresis, and immunoaffinity purification with antibodies specific for particular epitopes of the protein.
In embodiments, an increase in production of the protein greater than that of a control cell not contacted with the miRNA or siRNA is indicative of increased protein production in the cell. In various embodiments, the protein production is increased greater than 1.1, 1.2, 1.3. 1.4, 1.5, 1.6, 1.7, 1.8, 1.9, 2.0 times or more as compared to the control cell not contacted with the miRNA or siRNA.
miRNAs and siRNAs
The invention also provides miRNAs and siRNAs for use in increasing protein production, as well as genome editing methodologies to increase protein production.
In embodiments, the miRNA may be one or more miRNAs including a sequence selected from SEQ ID NOs:1-26, and any combination thereof. For example, the miRNA may be one or more miRNAs selected from SEQ ID NOs:1-4, 20, 21 and 25. In one embodiment, the miRNA includes those as set forth in SEQ ID NOs:2, 3, 20, 21 or 25. In some embodiments, the miRNA includes a common sequence motif as set forth in SEQ ID NO:28 or SEQ ID NO:29. For example, the miRNA has a sequence selected from SEQ ID NOs:4, 16 and 22.
In embodiments, the invention provides an isolated nucleic acid sequence including the miRNA sequence of the invention operably linked to a heterologous promoter. The miRNA sequence may have a length of about 6-25 nucleotides and include a sequence as set forth in SEQ ID NOs:1-26. Similarly, the miRNA sequence may have a length of about 6-25 nucleotides and include a sequence as set forth in SEQ ID NO:28 or 29.
In embodiments, the siRNA is one or more siRNA is that inhibits expression of a gene set forth in Table 3, inhibition of which has been determined to increase protein expression. Such siRNAs include those having a sequence set forth in SEQ ID NOs:38-212.
In some embodiments, the siRNA inhibits one or more genes listed in Table 3, such as one or more of INTS1, INTS2, HNRNPC, CASP8AP2, OAZ1, ODC1, AZIN1, PPP2R1A, PRPF19, CHAF1A, CCT2, EEF1B2, or a combination thereof. In an exemplary embodiment, the siRNA inhibits OAZ1 and has a sequence as set forth in SEQ ID NO:155 or SEQ ID NO:156.
In some embodiments, genome editing tools are used to inhibit or silence one or more genes listed in Table 3, such as one or more of INTS1, INTS2, HNRNPC, CASP8AP2, OAZ1, ODC1, AZIN1, PPP2R1A, PRPF19, CHAF1A, CCT2, EEF1B2, or a combination thereof.
The miRNAs and siRNAs of the present invention may include naturally occurring nucleotides as well as non-naturally occurring nucleotide analogs. Such molecules may also include modified backbones or non-natural internucleoside linkages as discussed herein as well as modifications at the 5′, 3′ or both the 5′ and 3′ terminus.
Kits
The invention also provides a kit for increasing protein production in a cell. The kit includes a miRNA of the present invention, for example, a miRNA sequence having a sequence as set forth in SEQ ID NOs:1-26, and a siRNA which inhibits expression of a gene set forth in Table 3, for example, an siRNAs having a sequence set forth in SEQ ID NOs:94-212.
Screening
In another embodiment, the invention provides a screening method for obtaining miRNAs for enhancing expression of a protein. The method includes: a) contacting a cell comprising a detectably labeled protein with a plurality of miRNAs; and b) measuring protein production prior to and after contacting with the miRNAs, wherein an increase in expression of the protein after contact is indicative of an miRNA for enhancing expression of the protein. In one aspect, the invention provides for assessing the functionality of the enhanced protein produced.
In embodiments, the detectable label is used to detect expression of the labeled protein. Such labels are commonly known in the art and include, for example, luciferase (LUC), β-lactamase, chloramphenicol acetyltransferase (CAT), adenosine deaminase (ADA), aminoglycoside phosphotransferase (neo, G418), dihydrofolate reductase (DHFR), hygromycin-β-phosphotransferase (HPH), thymidine kinase (TK), β-galactosidase (β-gal), and xanthine guanine phophoribosyltransferase (XGPRT), affinity or epitope tags, and fluorescent proteins. In one embodiment the detectable label is green fluorescent protein (GFP) or enhanced green fluorescent protein (eGFP).
ZFN/TALEN/CRISPR
Zinc-finger nucleases (ZFNs), transcription activator-like effector nucleases (TALENs) and CRISPR/Cas9 systems all comprise a powerful class of genome editing tools that are redefining the boundaries of biological research.
These chimeric nucleases are composed of programmable, sequence-specific DNA-binding modules linked to a nonspecific DNA cleavage domain. ZFNs and TALENs enable a broad range of genetic modifications by inducing DNA double-strand breaks that stimulate error-prone non-homologous end joining or homology-directed repair at specific genomic locations. There are potential therapeutic applications of ZFNs and TALENs.
CRISPR/Cas-based RNA-guided DNA endonucleases are the newest of the genome editing tools, and very powerful.
The following example is provided to further illustrate the advantages and features of the present invention, but are not intended to limit the scope of the invention. While they are typical of those that might be used, other procedures, methodologies, or techniques known to those skilled in the art may alternatively be used.
This Example sets forth a high-throughput screening strategy for identifying miRNAs that can improve functional expression of the model membrane protein-Neurotensin Receptor type 1 (NTSR1). Belonging to the G protein-coupled receptors (GPCRs) superfamily and interacting with its ligand neurotensin, NTSR1 plays important roles in Parkinson's disease, pathogenesis of schizophrenia, modulation of dopamine neurotransmission, hypothermia, and antinociception and in promoting growth of cancer cells. The structure of a stabilized NTSR1 mutant with T4 lysozyme replacing most of the third intracellular loop was recently determined. Previously, the inventors constructed inducible suspension mammalian HEK293 cells expressing functional NTSR1, that allowed one to obtain 1 mg purified receptor per liter of cell culture with a viable cell density of 1.4 million cells/mL. Here the ability to improve receptor expression by applying the powerful miRNA tool is explored. This study describes the implementation of high-throughput image-based screen with NTSR1-GFP-expressing cells using a human miRNA mimic library comprising 875 miRNA mimics.
Materials and Methods
Construction of Expression Plasmid pJMA-NTSR1-GFP
Truncated wild type NTSR1 (T43-K396) was subcloned into the tetracycline inducible plasmid pJMA111 replacing the serotonin transporter construct using KpnI and NotI restriction sites. Thus NTSR1 was placed downstream of the tetracycline-controlled CMV promoter and had an eGFP-deca-histidine tag fused to its C-terminal (
Construction of Stable NTSR1-GFP-Expressing T-REx-293 Cell Line
The T-REx-293 cell line was maintained as an adherent culture in DMEM containing 10% certified FBS and 5 μg/mL blasticidin (Invitrogen). The cells were transfected with the plasmid pJMA-NTSR1-GFP using Lipofectamine™ 2000 according to the manufacturer's protocol (Life Technologies). One day after transfection, cells were transferred into fresh DMEM medium containing 200 μg/mL zeocin (Invitrogen) and the medium was replaced every three days. Two weeks later, ten cell clones were separately expanded into two T-flasks each. Cells in one T-flask were harvested during the exponential growth phase and frozen in 10% DMSO for storage. Cells in the other T-flask were induced with 1 μg/mL tetracycline for 24 hrs, after reaching 80% confluency. Cells were then detached from the flask and washed with cold PBS. After adjusting the cell density to ˜1×106 cells per mL, protease inhibitors (Roche) were added and the cell suspension was frozen on dry ice in 1 mL aliquots. NTSR1 expression levels were determined by [3H]NT binding and the clone with the highest expression level was selected for further experiments. The selected stable T-REx-293-NTSR1-GFP high expressor was then routinely maintained in DMEM containing 10% certified FBS, 5 μg/mL blasticidin and 200 μg/mL zeocin.
High-Throughput miRNA Screen
T-REx-293-NTSR1-GFP cells were screened with a miRNA mimic library (Qiagen) based on Sanger miRBase™ 13.0 and consisting of 875 miRNAs mimics. For transfection, 0.8 pmol of each mimic was spotted to 384 well plate wells (Corning) and 20 μL of serum-free DMEM containing 0.1 μL of Lipofectamine™ RNAiMax (Life Technologies) was then added to each well. This lipid-miRNA mixture was incubated at ambient temperature for 30 min prior to adding 2000 cells in 20 μL of DMEM containing 20% certified FBS (Gibco). Transfected cells were incubated at 37° C. in 5% CO2 for 72 hours and induced with 1 μg/mL tetracycline for 24 hours for NTSR1-GFP expression. Cells were then fixed with 2% paraformaldehyde (Electron Microscopy Sciences), stained with Hoechst™ 33342 (Life Technologies) for 45 minutes and gently washed with PBS. Plates were imaged with an ImageXpress Micro XL™ (Molecular Devices). Total cell number and per cell green fluorescence intensity were calculated using MetaXpress™ software (Molecular Devices) employing the Multi-Wavelength Cell Scoring™ application module. All screening plates had a full column (16 wells) of SilencerSelect™ Negative Control #2 (Life Technologies) and the median value of each plate's negative control column was used to normalize corresponding sample wells. A full column of positive control siRNA targeting GFP (GFP-22 siRNA, Qiagen) was also used as on-plate reference for transfection efficiency. The median absolute deviation (MAD)-based z-score was calculated for each sample.
Validation Transfection
Validation transfections were performed in 12-well plates with miScript™ miRNA mimics (Qiagen, Cat. No. 219600-S0), SilencerSelect™ Negative Control #2 and lethal control siRNA (Qiagen AllStars Cell Death Control™) served as a control for transfection efficiency. Cells were transfected as described for screening except 0.15 million cells were transfected with 40 nM miRNA using 6.25 ul Lipofectamine™ RNAiMax in a total volume of 1 mL of media. 72 hours after transfection, cells were induced with 1 μg/mL tetracycline. 24 hours later, cells from each well were detached with non-enzymatic cell dissociation buffer (Gibco, Cat. No. 13150-016) and washed twice with cold PBS. Cell densities and viability were determined by trypan blue exclusion using a CEDEX™ cell quantification system (Roche, Mannheim, Germany). Based on the counts, cell densities were adjusted to 0.5 million cells/ml with PBS and then subject to flow cytometry analysis. The remaining cells were pelleted and frozen on dry ice for [3H]NT binding assays.
Flow Cytometry Analysis
Cells harvested from validation transfection step were diluted to 0.2 million cells/ml with cold PBS for flow cytometry analysis. Green fluorescence was measured with Guava Easycyte 5HT and Incyte software (Millipore). The green fluorescence signal and cell gating were adjusted using uninduced T-REx-293-NTSR1-GFP cells, with more than 99% of the cells in low fluorescence range (<100). The setting was kept same for acquisition of all cell samples.
Analytical Solubilization of NTSR1
The detergents n-Dodecyl-β-D-maltoside (LM), 3-[(3-cholamidopyropyl) dimethylammonio]-1-propane sulfonate (CHAPS) and cholesteryl hemisuccinate Tris salt (CHS) were obtained from Anatrace. Cell pellets from 2 mL of suspension culture were suspended in Tris-glycerol-NaCl buffer. Then the detergents LM, CHAPS and CHS were added to give a final buffer composition of 50 mM TrisHCl pH 7.4, 200 mM NaCl, 30% (v/v) glycerol, 1% (w/v) LM, and 0.6% (w/v) CHAPS and 0.12% (w/v) CHS in a total volume of 0.5 mL. The samples were placed on a rotating mixer at 4° C. for 1 hour. Cell debris and non-solubilized material were removed by ultracentrifugation (TL100 rotor, 60 k rpm, 4° C., 30 min in Optima Max™ bench-top ultracentrifuge, Beckman), and the supernatants containing detergent-solubilized NTSR1 were used to determine the total number of expressed receptors by a detergent-based radio-ligand binding assay (see below).
Ligand Binding Assay
Tritiated neurotensin agonist [3H]NT ([3,11-tyrosyl-3,5-3H(N)]-pyroGlu-Leu-Tyr-Glu-Asn-Lys-Pro-Arg-Arg-Pro-Tyr-Ile-Leu; SEQ ID NO:27) was purchased from Perkin Elmer. Ligand-binding assays with detergent-solubilized receptors were carried out in TEBB assay buffer (50 mM Tris pH 7.4, 1 mM EDTA, 40 μg/mL bacitracin, 0.1% BSA) containing 0.1% (w/v) LM, 0.2% (w/v) CHAPS and 0.04% (w/v) CHS. For one-point assays, receptors were incubated with 2 nM [3H]NT on ice for 1 hour in a volume of 150 μL. The concentration of [3H]NT used was at least 5-fold above the apparent dissociation constants for detergent-solubilized NTSR1 to allow high receptor occupancy. Separation of the receptor-ligand complex from free ligand (100 μL) was achieved by centrifugation-assisted gel filtration using Bio-Spin™ 30 Tris columns (BioRad), equilibrated with RDB buffer (50 mM Tris-HCl, pH 7.4, 1 mM EDTA, 0.1% (w/v) LM, 0.2% (w/v) CHAPS, 0.04% (w/v) CHS). Non-specific [3H]NT binding of 220 dpm was subtracted from total binding to calculate the total amount of receptors in T-REx-293-NTSR1-GFP cells. The number of functional NTSR1 was estimated by specific [3H]NT binding assuming one ligand-binding site per receptor molecule. The number of cells in the assay was derived by cell counting at cell harvest. This approach led to the calculation of the parameter “receptors/cell”.
Validation with Luciferase-Expressing Cells
CMV-Luc2-HygroHEK293 cell line constitutively expressing luciferase is purchased from Promega. Validation transfections were performed in 12-well plates with miScript miRNA mimics (Qiagen, Cat. No. 219600-S0), SilencerSelect™ Negative Control #2 and lethal control siRNA (Qiagen AllStars Cell Death Control™) served as a control for transfection efficiency. Cells were transfected in duplicates as described above in the screening method with the following modification: 0.1 million cells were transfected with 40 nM miRNA using 6.25 ul Lipofectamine™ RNAiMax in a total volume of 1 mL of media. 72 hours after transfection, 500 μL of ONE-Glo™ Reagent (Promega) was added to one set of replicates for luciferase activity quantification and 500 μL of CellTiter-Glo™ Reagent (Promega) was added to the second set of replicates for viable cell density measurement. All plates were incubated at room temperature for 20 minutes to stabilize luminescent signal and then measured with SpectraMax i3™ plate reader (Molecular Devices). Per cell luciferase production was calculated from overall luciferase activity and viable cell number.
Validation with GPC3-hFc-Expressing Cells
HEK-GPC3-hFc cell line constitutively secreting glypican-3 hFc-fusion protein (GPC3-hFc) was a gift from the National Cancer Institute of the NIH. Cells were grown in DMEM supplemented with 10% FBS in a humidified incubator set at 37° C. and 5% CO2.
Cells were transfected in 12-well plates as described above in the screening method with the following modification: 0.15 million cells were transfected with 40 nM miRNA using 6.25 ul Lipofectamine™ RNAiMax in a total volume of 1 mL of media. 6 days after transfection, cell culture supernatant was collected and cleared using centrifuge for GPC3-hFc concentration determination with ELISA and cells were detached and counted by trypan blue exclusion using a CEDEX™ cell quantification system (Roche, Mannheim, Germany). Per cell GPC3-hFc production can be calculated from overall GPC3-hFc yield and viable cell number.
ELISA for GPC3-hFc Concentration Determination
AffiniPure™ F(ab′)2 Fragment Goat Anti-Human IgG (min X Bov, Ms, Rb Sr Prot, Cat. 109-006-170, Jackson Immunology) was used to coat a 96-well plate at 5 μg/mL in PBS buffer, 50 μL per well, at 4° C. overnight. After the plate was blocked with 2% BSA in PBS buffer, pre-diluted cell culture supernatant was added, and the plate was incubated at room temperature for one hour to allow binding to occur. After the plate was washed twice with PBS containing 0.05% Tween 20, Peroxidase-conjugated AffiniPure™ Goat-anti-uman IgG (Cat. 109-035-098, Jackson Immunology) was added at 1:4000 dilutions, 50 ul/well. Following incubating at room temperature for one hour, the plate was washed 4 times and detected with Peroxidase Substrate System (KPL).
Figure Legends
Results
Construction of Inducible T-REx-293-NTSR1-GFP Cell Line for Image-Based Screen
A stable cell line expressing functional wild type NTSR1-GFP fusion was constructed using the inducible T-REx system by transfecting T-REx-293 cells with the pJMA-NTSR1-GFP plasmid (
High-Throughput miRNA Screen for Enhanced NTSR1-GFP Expression
To identify miRNAs that improve NTSR1 expression in T-REx-293-NTSR1-GFP cells, the cells were screened with a library comprised of 875 human miRNA mimics. Cells were transiently transfected with mimics in 384-well format for 72 hours followed by tetracycline-induced expression of NTSR1-GFP fusion protein (
Validation of the Selected miRNA Candidates by Flow Cytometry Analysis
The expression level of NTRS1-GFP following transient transfection of the cells with the top 26 microRNA was measured by flow cytometry (
[3H]NT Binding Assay Validation for Improved Functional Expression of NTSR1
The effect of the top 9 miRNAs on the functional expression of NTSR1 was also evaluated by measuring the functional activity of the receptor through the binding of labeled neurotensin ([3H]NT). Although all top 9 miRNAs were shown to improve NTSR1-GFP expression based on GFP fluorescence, only 5 of them (miR-22-5p, miR-18a-5p, miR-22-3p, miR-429 and miR-2110) led to improved functional activity levels of NTSR1 (
MiRNA Screen for Enhanced Luciferase Expression
The human mimic miRNA library was also evaluated for its effects on the expression of luciferase in HEK293 cells constitutively expressing luciferase under control of a cytomegalovirus (CMV) promoter. Screening was performed in duplicate in 384-well format. Seventy two hours post-transfection, one set of plates was assayed for luciferase and the other set was used for viable cell density (
Validation of Common Hits
The top 9 miRNAs identified from the NTRS1 screen were examined for their effects on luciferase activity in a 12-well plate format where seven miRNAs improved luciferase activity. (
Application of Top Common Hits on Secreted Protein
To investigate the impact of top common hits on secreted protein production, the five identified miRNAs (hsa-miR-22-5p, hsa-miR-18a-5p, hsa-miR-22-3p, hsa-miR-429 and hsa-miR-2110) were independently transfected into HEK293 cell line stably expressing secreted hFc-fusion protein: glypican-3 hFc-fusion protein (GPC3-hFc). All five miRNAs enhanced per cell GPC3-hFc secretion (up to 120% improvement,
Discussion
Integral membrane proteins such as mammalian receptors, ion channels and transporters are vital for medical research. However, obtaining large amounts of functional membrane proteins for medical research, especially structural studies, has been difficult and therefore been a barrier for productive research towards better understanding of their mechanisms and potential medical use. So far, a tetracycline-inducible mammalian expression system has been shown to be an effective method for functional expression of membrane proteins. This inducible system together with optimized production conditions led to a yield of 1 milligram per liter of purified NTSR1. Compared with well-developed prokaryotic hosts such as E. coli, the production of membrane proteins from higher eukaryotic hosts is still in the stage of “trial and error” since engineering tools are limited and the membrane protein synthesis, insertion, folding and trafficking are not completely understood.
To improve the production of these proteins, a bottom-up screening approach using human miRNA mimics library was implemented to identify candidates that lead to improved expression of the GPCR from the T-Rex-293 cells. This approach has previously proven effective for apoptosis screening and recombinant secreted protein screening in CHO cells. In this study, An image-based high-throughput screening method was developed to detect per cell green fluorescence signal, which is applied as a proxy for the number of molecules of NTSR1 protein expressed per cell. In addition to its high reproducibility (0.92 correlation between technical replicates), this method is cost-effective for protein with a fluorescent label, as no secondary reagent is needed for protein quantification. It is also high throughput and high-capacity, as cells are fixed and the screening is not time-sensitive compared to live-cell processes such as flow cytometry. This screen methodology can be applied to other membrane or intracellular protein candidates when the targeted protein is fused with GFP. Although GFP fusion has been widely used for membrane protein overexpression screen and purification in a variety of hosts, it is possible that the N-terminal GFP fusion may mask signal sequence essential for protein insertion. This may compromise folding or correct localization of the desired membrane protein. C-terminal GFP fusion, on the other hand, is preferable as it is generally better in maintaining the localization and function of the native protein with exceptions when C-terminal contains an essential functional segment.
Among the 875 human miRNA mimics tested, 40 mimics consistently led to significant improvement in per cell green fluorescence levels, exhibiting an average MAD-based-z-score higher than 2.0. Among the top 40 candidates, miR-892b, miR193b-3p and miR-193a-3p share the same seed sequence (ACUGGC; SEQ ID NO:28), indicating that they may comprise an overlap in target genes. Similarly, miR-129-3p and miR-129-1-3p also share a same seed sequence (AGCCCU; SEQ ID NO:29).
The activity of two thirds of the 40 mimics was confirmed further by flow cytometry and the 9 mimic candidates contributing to the highest per cell green fluorescence signal were further tested in the [3H]NT binding assay. Five out of the nine mimics showed up to 48% improvement in functional expression of NTSR1. From these five, miR-2110 is a novel miRNA that has been identified but not studied. The other four miRNAs (miR-429, miR-18a-5p, miR-22-5p and miR-22-3p) have been associated with cancer research in which they have exhibited contradicting effects on cell proliferation, cell growth, and protein production depending on the cell type and stage of cell development. For example, miR-429, a member of the miR-200 family, was shown to suppress tumor growth in human osteosarcoma, while in non-small cell lung cancer (NSCLC), the same miRNA is suggested as a potential target for NSCLC due to its promotion of cell proliferation. miR-18a-5p is part of the miR-17-92 precursor sequence cluster, which is also named Oncomir-1. This miR-17-92 cluster was studied in depth regarding its effect on recombinant EpoFc protein production in CHO cells. Although over-expression of the entire cluster decreased productivity while having no effect on cell growth, the over-expression of miR-17 and miR-92 were shown to increase production.
Of the nine miRNAs that enhanced the expression of the NTSRI-GFP fusion protein, four (miR-129-5p, miR-221-5p, miR-892b and miR-639) were not associated with enhanced binding activity of the agonist in the [3H]NT assay. This may be an indication that NTSR1 could be misfolded in these cells following the enhanced expression. Another observation is that eight of the nine top hits (except miR-129-5p) caused a decrease in the viable cell number. One possible reason for this behavior is that overexpression of NTSR1-GFP could be toxic to the cells. Another possibility is that the introduction of a specific miRNA to the cells is associated with a growth arrest, leading to improved protein production. Since multiple pathways and genes can be targeted by one miRNA, it will be worthwhile to examine which specific genes are down-regulated in these cells and to investigate the mechanism that improved NTSR1 functional expression.
In parallel to the analysis of the miRNA effect on the NTSR1 expression, an HEK293 cell line constitutively expressing luciferase under the control of CMV promoter was subjected to a screening of the same miRNA mimics library. This screen showed low degrees of correlation (R=0.31) with the NTSR1-GFP screen. The low correlation may be the result of the difference between biogenesis process of integral membrane proteins and intracellular soluble proteins; the difference between constitutive expression elements and the inducible expression system; and clonal differences between the two HEK293 cell line used. Despite the overall low correlation between the screens, seven out of nine top miRNAs (except miR-129-5p and miR-639) identified from NTSR1-GFP screen, improved luciferase activity from 50% to 239%. All the final five miRNAs (miR-429, miR-18a-5p, miR-22-5p and miR-22-3p and miR-2110) capable of improving NTSR1 functional expression were also relevant for improving luciferase expression.
These five miRNAs affecting both model proteins were expected to have wider application for other types of proteins. Therefore, they were tested with HEK293 cell line constitutively secreting a Fc fusion proteins with medical importance. Interestingly, all of the five miRNAs were effective in enhancing per cell Fc fusion protein secretion. However, the overall Fc fusion protein yield varied from 10% decrease to 62% increase, partially depending on the viable cell number after miRNA transfection.
For this example, a genome-wide siRNA screen to identify genes that may influence recombinant protein production, using Photinus pyralis (firefly) luciferase as a reporter protein. With a high-throughput format, 21,585 genes were individually silenced with three different siRNAs, in HEK-CMV-Luc2-Hygro cells constitutively expressing firefly luciferase. The viable cell number and the luciferase activity were measured following the screening and the results were incorporated into genome-wide loss-of-function data. Statistical data analyses were conducted, followed by a validation screen where ten target genes (leading to greatest improvement of luciferase production) were confirmed. Among these selected genes, OAZ1 the gene that encodes antizyme 1, an inhibitor to ornithine decarboxylase, was chosen for more detailed studies, since its silencing caused minimal effect on cell viability.
Materials and Methods
Cell Culture
HEK-CMV-Luc2-Hygro cell line constitutively expressing P. pyralis luciferase (Progema) and HEK-GPC3-hFc cell line constitutively secreting glypican-3 hFc-fusion protein (GPC3-hFc) were maintained in DMEM containing 10% fetal bovine serum (FBS). The inducible T-Rex-SERT-GFP cell line and T-Rex-NTSR1-GFP cell line were maintained as an adherent culture in DMEM containing 10% certified FBS, 5 μg/mL blasticidin and 200 μg/mL zeocin (Invitrogen). All cells were maintained in a humidified incubator set at 37° C. and 5% CO2.
High-Throughput Genome-Wide Screen for Luciferase Expression
The Silencer Select™ Human genome siRNA library (Ambion), which targets 21,585 human genes with 3 siRNAs per gene, was used for screening. Each siRNA is arrayed in an individual well (Corning 3570, 384 well, white, solid bottom plates). The transfection was done in duplicates: 0.8 pmol of each siRNA was spotted to a well of a 384-well plate (Corning) and 20 μL of serum-free DMEM containing 0.07 μL of Lipofectamine™ RNAiMax (Life Technologies) was then added to each well. This lipid-siRNA mixture was incubated at ambient temperature for 30 min prior to addition of 4000 cells in 20 μL of DMEM containing 20% FBS (Gibco). After incubating the transfected cells at 37° C. in 5% CO2 for 72 hours, 20 μL of ONE-Glo™ Reagent (Promega) was added to one set of replicates for ‘overall luciferase yield’ quantification and 20 μL of CellTiter-Glo™ Reagent (Promega) was added to the second set of replicates for ‘viable cell density’ measurement. All plates were incubated at room temperature for 20 minutes to stabilize the luminescent signal and the signal was then measured with PerkinElmer Envision 2104 Multilabel™ plate reader. All plates had a full column (16 wells) of Silencer Select™ Negative Control #2 (Life Technologies) for data normalization and a full column of siPLK1 (Ambion Silencer Select, cat#s448) was also used as on-plate reference for transfection efficiency and both controls were also used in all validation transfections.
The 56 genes which got targeted by at least two independent siRNAs (out of three) resulting in enhanced luciferase production with MAD-based z-score>3 from the primary screen were subjected to validation screen using 3 additional Silencer™ siRNAs (Ambion) with different sequences from those used in the primary screen. Ten gene candidates were selected based on the criteria that 3 out of 6 siRNAs displayed a MAD-based z-score>3. The transfection and assay processes were-the same as the primary genome-wide screen. Data visualization was performed in R computational environment (available on the World Wide Web at R-project.org/) by using ‘hexbin’ and ‘ggplot2’ packages.
Statistical Analysis of Primary Screen Data
The screen generated end-point data for ‘overall luciferase yield’ and ‘viable cell density’ in each well. For each plate, the median value of the negative control wells was set as 100% and was used to normalize corresponding sample wells. The ‘overall luciferase yield’ and ‘viable cell density’ were exported as % of negative control and the median absolute deviation (MAD)-based z-score was calculated for each sample.
Gene Ontology (GO) Analysis
In order to get the maximum coverage of GO annotation data for 119 selected siRNA's targeting 56 genes, PANTHER classification system (available on the World Wide Web at pantherdb.org/) and AmiGO 2 GO™ browser were used. The construction of a heatmap was accomplished using Partek Genomics Suite™ software, version 6.6.
Validation Transfection
Ten targeted genes were selected and tested in four HEK 293 cell lines expressing different reporter proteins, glycan-3 hFc-fision protein (GPC3-hFc), neurotensin receptor type 1-GFP (NTSR1-GFP) and serotonin transporter-GFP (SERT-GFP), using 1 representative siRNA for each gene. Transfection was performed in 12-well plate format. 500 μL of serum free DMEM media containing siRNA and Lipofectamine™ RNAiMax was incubated in each well for 20 min at ambient temperature and 500 μL DMEM containing 20% FBS and cells was then added for transfection. The final siRNA concentration in each well was 40 nM. Lipofectamine™ RNAiMax volume and cell seeding number in each well have been optimized for each cell line (Table 2).
ELISA for Determination of GPC3-hFc Production
5 days after transfection, clarified cell culture supernatant was used for determination of GPC3-hFc concentration by ELISA and cells were detached and counted by trypan blue exclusion using a CEDEX™ cell quantification system (Roche, Mannheim, Germany). AffiniPure™ F(ab′)2 Fragment Goat Anti-Human IgG (min X Bov, Ms, Rb Sr Prot, Cat. 109-006-170, Jackson Immunology, 5 μg/mL in PBS) was used to coat a 96-well plate (50 μL per well) at 4° C. overnight. After blocking the plate with 2% BSA in PBS, 50 μl of pre-diluted cell culture supernatant was added, and the plate was incubated at room temperature for 1 h to allow binding to occur. After the plate was washed twice with PBS containing 0.05% Tween 20, Peroxidase-conjugated AffiniPure™ Goat-anti-human IgG (Cat. 109-035-098, Jackson Immunology) was added at 1:4000 dilution (50 μL/well). Following incubation at room temperature for 1 hr, the plate was washed 4 times and signals were detected with Peroxidase Substrate System (KPL).
Flow Cytometry Analysis for Determination of NTSR1-GFP and SERT-GFP Production
3 days after transfection, cells were induced with 1 μg/mL tetracycline. 24 hours later, cells from each well were detached with non-enzymatic cell dissociation buffer (Gibco, Cat. No. 13150-016) and washed twice with cold PBS. Cell densities were adjusted to 0.5 million cells/ml with PBS and then subjected to flow cytometry analysis. Green fluorescence was measured with Guava Easycyte 5HT™ and Incyte™ software (Millipore). The green fluorescence signal and cell gating were adjusted using uninduced T-REx-293-NTSR1-GFP cells, with more than 99.5% of the cells in low fluorescence range (<100). The setting was kept the same for all cell samples.
OAZ1 Silencing Studies
HEK-CMV-Luc2-Hygro cells in 6 well plates were transfected with Silencer siRNA for oaz1 gene (Catalog number: AM51331, assay ID: 46078). The transfection was done in 6-well plate format: 0.12 nmol of each siRNA and 1.5 mL of serum-free DMEM containing 11.25 μL of Lipofectamine™ RNAiMax (Life Technologies) was then added to each well. This lipid-siRNA mixture was incubated at ambient temperature for 30 min prior to adding 2×105 cells in 1.5 mL of DMEM containing 20% FBS (Gibco). The transfected cells were incubated at 37° C. in 5% CO2 and were harvested at 24, 48, 72 and 96 hr. Luciferase activity was determined using ONE-Glo™ Reagent (Promega) and aliquots of transfected cells.
Isolation of RNA and Real-Time qRT-PCR
Cells were trypsinized from 6-well plates, washed twice with cold PBS and cell pellets were flash frozen on dry ice and stored at −80° C. until extraction. RNA was extracted using the RNeasy™ kit (Qiagen) and then treated with DNase using TURBO DNA-Free™ Kit (Life Technologies). cDNA was generated from the RNA using the Maxima Frist Strand cDNA Synthesis Kit for qRT-PCR (Thermo Scientific). The real-time qPCR was performed using Fast SYBR™ Green Master Mix (Life Technologies) in 7900 HT Fast Real Time PCR System™ (Applied Biosystems). The 2−ΔΔct method was used for relative expression analysis with GAPDH as the reference gene. Cells transfected with negative control siRNA and harvested at 24 hr were set as calibrator. Primers used for each gene are:
Western Blotting
Transfected cells were lysed in buffer containing 50 mM Tris-HCl, pH 7.4, 5 mM EDTA, 150 mM NaCl, 1% Nonidet P-40, and protease inhibitor mixture. Proteins (˜20 μg) were separated by SDS-PAGE (4-12% gel) in MES buffer and transferred to 0.2-μm nitrocellulose membrane for immunodetection using mouse anti-ODC (Sigma, catalog number 01136) and mouse anti-β-actin (BD biosciences, catalogue number 612657) primary antibodies and HRP conjugated anti-mouse secondary antibodies (abCAM, catalog number ab20043). Signals were detected with an ECL Plus chemiluminescence reagent.
Measurement of Cellular Polyamine Concentration
Cells in six-well plates were washed with PBS twice, harvested, and precipitated with 0.1 mL cold 10% trichloroacetic acid (TCA). A total of 50 μL of the TCA supernatant was used for polyamine analysis by an ion exchange chromatographic system (Biochrom). TCA precipitates were dissolved in 0.1 N NaOH and aliquots were used for protein determination by the Bradford method. Polyamine contents were estimated as nmol/mg protein.
Figure Legends
Results
1. Identification of Genes Whose Silencing Leads to Enhanced Luciferase Expression.
A human genome-wide siRNA screen was conducted in HEK-CMV-Luc2-Hygro cells by using siRNA library targeting 21,585 human genes, with 3 independent arrayed siRNAs per gene. The transfection was done in duplicate: one set of plates was used for measuring the overall luciferase yield and the second set was used for the determination of viable cell density, from which the per cell luciferase yield was calculated (
2. Identification of Pathways Affecting Viable Cell Density and Recombinant Protein Productivity
To identify pathways that affect the reporter protein production, functional ontology analyses were carried out using the 119 siRNAs (Table 4) against the 56 genes (Table 3) that significantly improved the specific luciferase yield, using the PANTHER™ (available on the World Wide Web at pantherdb.org) and AmiGO 2 GOTM browser. The heatmap (
3. Selection Often Genes Whose Silencing Leads to Enhanced Luciferase Expression
For selecting gene candidates for further work, three additional siRNAs were tested for each of the 56 target genes identified from the primary screen. From the combined data of the primary and the validation screen of the 56 genes, ten genes were selected, based on the criteria that least 3 out of 6 siRNAs tested displayed a MAD-based z-scores higher than 3.0 (Table 5). The viable cell number was also taken into consideration to remove candidates with significant toxicity. The median value of the overall luciferase yield for each selected gene calculated from the 6 siRNAs was improved by 24% to 72% compared with negative control, and the median of MAD-based z-scores ranged from 2.13 to 4.55.
†Values are normalized to negative control siN.C. transfected cells (set as 100%) .
Four out of the ten target genes, INTS1, INTS2, HNRNPC, and PRPF19, are involved in mRNA splicing process; they encode important proteins for spliceosome formation, such as integrator complex, heterogeneous nuclear ribonucleoprotein and pre-mRNA processing factor 19. The remainder of the identified genes encodes proteins involved in a wide span of biological functions, including cell growth and division, signal transduction, apoptosis, regulation of cellular polyamine concentration and protein translation and folding.
4. Effects of Silencing the Ten Target Genes on Secreted and Membrane Protein Production
To examine the silencing effect of the 10 selected genes on the expression of other recombinant proteins from HEK293 cells, three additional cell lines were tested: 1) HEK-GPC3-hFc cell line, which constitutively secretes glypican −3 hFc-fusion protein (GPC3-hFc) as a representative of antibody secreting cell lines, 2) T-REx-293-NTSR1-GFP cell line constructed previously for the production of functional neurotensin receptor type I (NTSR1), and 3) T-REx-293-SERT-GFP cell line, an inducible cell line for high level expression of serotonin transporter (SERT), a hard-to-express 12 transmembrane domain protein. Both NTRS1 and SERT were fused with GFP at the C-terminus, allowing proximal protein quantification by flow cytometry. As shown in
5. Effect of Silencing OAZ1 on Luciferase Expression.
Among the selected genes, the antizyme 1 (OAZ1) was chosen for follow-up studies since its silencing consistently improved cytosolic, secreted and membrane protein expression and caused minimal growth disadvantage in the four cell lines tested (
As cells transfected with siOAZ1 showed significantly higher luciferase production for an extended period of time (
6. Effect of Silencing OAZ1 on Ornithine Decarboxylase and Cellular Polyamines
OAZ1 is a negative regulator of the ODC, a rate-limiting enzyme in the polyamine biosynthesis (
7. Effects of Exogenous Polyamines on Luciferase Protein Expression
Increased cellular polyamines in OAZ1-silenced cells are most likely responsible for the enhanced cellular production of the reporter proteins. To further verify this, the impacts of exogenously added polyamines on luciferase expression level and viable cell number were determined. As can be seen in
Discussion
Cultivated mammalian cells are the dominant vehicle for production of recombinant proteins for bio-therapeutics and structural studies. As a result, continuous effort has been directed toward improving cellular production capabilities. Previous work demonstrated the ability to improve recombinant protein expression based primarily on previous knowledge of specific genes and pathways, but there is a need for discovering novel genes and pathways for improved production. In order to discover new candidates suitable for improving recombinant protein production from HEK 293 cells, an extensive, high throughput RNA interference (RNAi) screen was performed. Genome-wide RNAi screening has emerged as a powerful tool for probing gene functions and for target discovery in various diseases. However, it has rarely been used for identifying targets for enhanced recombinant protein production. The purpose of the present study was to identify genes that showed improved recombinant protein production following their down regulation.
An HEK293 cell line expressing the luciferase reporter was subjected to interference with 64, 755 siRNAs targeting 21,585 human genes. Approximately 2.6% of the library (1,681 siRNAs) strongly improved the luciferase expression with a MAD-based z-score >3. To eliminate the introduction of ‘false positives’ by off-target effects, gene hits were considered ‘true positive’ only if more than two single siRNAs targeting the gene passed the MAD-based z-score >3. Fifty six genes were selected and were subjected to a validation screen with 3 additional siRNAs for each gene. From the data generated by the six siRNAs for each of the 56 genes, ten genes were selected for further analysis. Only those genes that showed an increase in luciferase yield of 3 MAD-based z-scores by 3 or more siRNAs were chosen. This high statistical significance also corresponds to 40% increase in luciferase activity.
The influences of the siRNAs targeting the ten identified genes on recombinant protein expression from the HEK cells were evaluated further by measuring the expression of three additional recombinant proteins: a secreted protein (GPC3-hFc) and two “hard”-to-express membrane proteins (neurotensin receptor type I and serotonin transporter). Silencing of the INTS1, HNRHPC, OAZ1, and PPP2R1A consistently improved production of all the reporter proteins. Of these four genes, silencing INTS1 or HNRHPC also affected cell viability. From the other two genes that slightly affected the cell viability, OAZ1 was chosen for follow-up studies. The identification of OAZ1 as a gene whose silencing can enhance reporter protein production is an indication that this gene normally suppresses protein synthesis. This is compatible with the known function of the antizyme as a negative regulator of polyamine homeostasis, cell proliferation and transformation. The current findings suggest that increased concentration of cellular putrescine and spermidine increases the biosynthesis of the reporter proteins without increasing their transcription, and, therefore, provide new insights into the primary function of polyamines in the regulation of translation. Consistent with this observation is published information that depleting cellular spermidine and spermine by over expressing spermidine/spermine N1-acetyltransferase 1 (SSAT1) led to suppression of protein biosynthesis without inhibiting DNA and RNA biosynthesis.
Although the invention has been described with reference to the examples herein, it will be understood that modifications and variations are encompassed within the spirit and scope of the invention. Accordingly, the invention is limited only by the following claims.
This application claims the benefit of priority under 35 U.S.C. §119(e) of U.S. Provisional Patent Application Ser. No. 62/108,976, filed Jan. 28, 2015, the entire contents of which is incorporated herein by reference in its entirety.
This invention was also made in part with government support under Grant No. DK075080-04 awarded by the Intramural Research Program of the National Institute of Diabetes and Digestive and Kidney Diseases (NIDDK/NIH), National Center for Advancing Translational Sciences (NCATS/NIH) and the National Institute of Dental and Craniofacial Research (NIDCR/NIH). The United States government has certain rights in this invention.
Number | Date | Country | |
---|---|---|---|
62108976 | Jan 2015 | US |