The disclosure relates to a novel splice variant of FOXP1 and methods and uses thereof. In particular, the disclosure relates to methods of reprogramming somatic cells into pluripotent stem cells and methods of maintaining pluripotent stem cells through the use of the novel splice variant. The disclosure also relates to modulators of alternative splicing for promoting pluripotency and methods and uses thereof.
During the past several years great strides have been made in understanding the regulatory processes that are responsible for maintenance of the pluripotent state of embryonic stem cells (ESCs), and for the reprogramming of somatic cells to generate ESCs and induced pluripotent stem cells (iPSCs). A core set of transcription factors that includes Oct4, Nanog, Sox2 and Tcf3 functions in ESC maintenance. The first three of these factors induce and cross regulate each other's expression, and also activate genes that further stabilize the ESC state (Chen et al., 2008; Kim et al., 2008; Silva et al., 2009). In a landmark discovery it was demonstrated that Oct4 and Sox2, together with Klf4 and c-Myc can reprogram somatic cells to iPSCs (Takahashi and Yamanaka, 2006). These factors remodel the transcriptome through successive reprogramming stages (Samavarchi-Tehrani et al., 2010) that ultimately lead to the reactivation of endogenous Oct4, Nanog and Sox2 and the establishment of the core transcriptional regulatory network required to maintain pluripotency.
In contrast to the major progress in the elucidation of signaling pathways and transcription factor networks in the control of pluripotency and iPSC formation, the role of alternative splicing (AS) in this process is not well understood. Likewise, while transcripts from nearly all human multi-exon genes undergo AS (Pan et al., 2008; Wang et al., 2008), the functional significance of the vast majority of these events is not known. A small number of AS events have been identified that play pivotal roles in the regulation of cell differentiation and early development. Classic examples include AS events in the Drosophila transcription factor genes doublesex and fruitless, which control transcriptional programs required for sex determination and courtship behavior, respectively (Demir and Dickson, 2005; Forch and Valcarcel, 2003). Recent studies comparing splice isoforms between ES and differentiated cell populations have identified AS differences between these two states (Atlasi et al., 2008; Kunarso et al., 2008; Pritsker et al., 2005; Rao et al., 2010b; Salomonis et al., 2010; Wu et al., 2010; Yeo et al., 2007), and two such events have been implicated in changing the activities of Tcf3 and Sall4, transcription factors which function in pluripotency (Rao et al., 2010b; Salomonis et al., 2010). It is therefore emerging that specific AS events may function to modulate transcriptional networks involved in transitions between pluripotency maintenance and cell type specification.
Forkhead box (FOX) proteins belong to a family of metazoan transcription factors that play essential roles in the regulation of genes involved in cell proliferation, differentiation and development, particularly during embryogenesis (Wijchers et al., 2006). The forkhead box, a domain of 80 to 100 amino acids that adopts a winged helix conformation, is a defining feature of FOX proteins and is responsible for binding to DNA (Li et al., 2004). FOXP1 is one of four members of the FOXP subfamily of FOX proteins that contain a C-terminal forkhead motif and N-terminal zinc finger and leucine zipper domains. FOXP1 is widely expressed across human tissues and loss of its expression, or its fusion with other proteins through chromosomal translocations, has been linked to several types of cancer (Koon et al., 2007). Knockout of murine Foxp1 results in early embryonic lethality (Wang et al., 2004) and disruption of Foxp1 expression in adult cells and tissues revealed that it has numerous critical roles in development and the establishment of specific cell types (Dasen et al., 2008; Zhang et al., 2010). Several splice variants of FOXP1 have been identified (Brown et al., 2008), yet the function of these are not well understood.
The present inventors have found that an embryonic stem cell (ESC)-specific isoform (FOXP1-ES) of the forkhead family transcription factor FOXP1 promotes the maintenance of ESC pluripotency and the reprogramming of somatic cells to ESCs and induced pluripotent stem cells. In particular, the present inventors identified a highly conserved alternative splicing event in FOXP1 transcripts that is activated in ESCs and silenced during cell differentiation. This alternative splicing event modifies critical amino acid residues within the FOXP1 forkhead domain and alters its DNA binding specificity. In ESCs this alternative splicing event switches the transcriptional regulatory output of FOXP1, which results in the stimulation of pluripotency genes such as OCT4, NANOG, GDF3 and NR5A2 while repressing cell lineage specification and differentiation genes. Induced expression of the ESC-specific isoform of FOXP1 promotes self-renewal and the maintenance of pluripotency, whereas silencing this isoform inhibits efficient iPSC reprogramming.
Accordingly, one aspect of the present disclosure is related to an isolated nucleic acid molecule comprising:
a. a nucleic acid sequence as shown in SEQ ID NOS: 3, 4, 7 or 8;
b. a nucleic acid sequence that is complementary to a nucleic acid sequence of (a);
c. a nucleic acid sequence that has substantial sequence identity to a nucleic acid sequence of (a) or (b);
d. a nucleic acid sequence that hybridizes to a nucleic acid sequence of (a), (b) or (c) under stringent hybridization conditions; or
e. a nucleic acid sequence differing from any of the nucleic acid sequences of (a) to (d) in codon sequences due to the degeneracy of the genetic code.
The disclosure is also related to an isolated nucleic acid molecule encoding an amino acid sequence as shown in SEQ ID NOS: 11, 12, 15 or 16.
The disclosure is further related to an isolated nucleic acid molecule comprising an antisense oligonucleotide to the nucleic acid sequence as shown in any one of SEQ ID NOS: 36 to 43. Optionally, the antisense nucleotide is 2 to 50, 5 to 40 or 10 to 25 nucleotides in length.
The disclosure also provides a recombinant expression vector comprising any of the isolated nucleic acid molecules described above.
In another aspect, the disclosure relates to an isolated polypeptide comprising the amino acid sequence as shown in SEQ ID NOS: 11, 15 or a fragment thereof. Optionally, the fragment comprises amino acids 511 to 565 of SEQ ID NO: 11 or amino acids 538 to 594 of SEQ ID NO: 15.
The disclosure further relates to a binding protein that binds to an isolated polypeptide comprising the amino acid sequence as shown in SEQ ID NOS: 11, 12, 15 or 16 or a fragment thereof. In one embodiment, the binding protein is an antibody, antibody fragment, peptide aptamer or nucleic-acid derived aptamer.
The disclosure also relates to a host cell comprising any of the isolated nucleic acid molecules, recombinant expression vectors, isolated polypeptides, or binding proteins described above.
In another aspect, the disclosure relates to the use of the isolated nucleic acids, vectors, or isolated polypeptides described above, or an antisense or interfering RNA molecule that increases the expression of FOXP1-ES and/or decreases the expression of FOXP1 to produce pluripotent stem cells, to maintain or enhance a homogeneous population of pluripotent stem cells, suppress stem cell differentiation or reprogram somatic cells into pluripotent stem cells.
The disclosure further relates to the use of a cDNA encoding FOXP1, a FOXP1 protein, an antisense or interfering RNA molecule that decreases the expression of FOXP1-ES and/or increases the expression of FOXP1, or a binding protein as described above to produce a population of differentiated cells.
The disclosure also provides a method of reprogramming somatic cells into pluripotent stem cells comprising:
In another embodiment, the disclosure provide a method of maintaining a homogenous population of pluripotent stem cells comprising:
In another embodiment, the disclosure provides a method of suppressing stem cell differentiation comprising:
In yet another embodiment, the disclosure provides a method of producing a population of differentiated cells comprising:
The inventors have also shown that specific splicing regulators including MBNL1, MBNL2, TIA1 and TIAL1 function to modulate the ESC-specific splicing event in FOXP1 transcripts and expression of FOXP1-ES. The regulators act through conserved binding sites for these factors in the intronic regions proximal to this splicing event.
Accordingly, another aspect of the present disclosure is directed to a method of modulating the expression of FOX1P-ES in a cell comprising administering an exon 18b or exon 16b modulator to the cell.
In one embodiment, the modulator is a stimulator of exon 18b inclusion. Optionally, the stimulator of exon 18b inclusion is selected from the group consisting of: TIA1, TIAL1, a MBNL1 antagonist, a MBNL2 antagonist and antisense RNA or small interfering RNA that decreases expression of FOXP1.
In another embodiment, the modulator is a stimulator of exon 16b inclusion. Optionally, the stimulator of exon 16b inclusion is selected from the group consisting of: Tia1, Tial1, a Mbnl1 antagonist, a Mbnl2 antagonist and antisense RNA or small interfering RNA that decreases expression of Foxp1.
In a further embodiment, the MBNL1 antagonist and/or MBNL2 antagonist is an antibody to MBNL1 and/or MBNL2 or peptide or nucleic-acid derived aptamer to MBNL1 and/or MBNL2, antisense RNA or small interfering RNA that decreases expression of MBNL1 and/or MBNL2, or a compound that inhibits the expression or function of MBNL1 and/or MBNL2.
In another embodiment, the modulator is a repressor of exon 18b inclusion. Optionally, the repressor of exon 18b inclusion is selected from the group consisting of: MBNL1, MBNL2, a TIA1 antagonist, a TIAL1 antagonist, and antisense RNA or small interfering RNA that decreases expression of FOXP1-ES.
In another embodiment, the modulator is a repressor of exon 16b inclusion. Optionally, the repressor of exon 16b inclusion is selected from the group consisting of: Mbnl1, Mbnl2, a Tia1 antagonist, a Tial1 antagonist, and antisense RNA or small interfering RNA that decreases expression of Foxp1-ES.
In a further embodiment, the TIA1 antagonist or TIAL2 antagonist is an antibody or peptide or nucleic-acid derived aptamer to TIA1 and/or TIAL1, antisense RNA or small interfering RNA that decreases expression of TIA1 and/or TIAL1, or compound that inhibits the expression or function of TIA1 and/or TIAL1.
The inventors have also found that MBNL1 and MBNL2 are regulators of ESC-specific alternative splicing and reprogramming that act through a number of target genes, including, but not limited to, FOXP1.
Accordingly, the present disclosure also relates to the use of antagonists to MBNL1 and/or MBNL2 to maintain or enhance pluripotency of a cell.
In one embodiment, the disclosure relates to the use of both a MBNL1 antagonist and a MBNL2 antagonist to maintain or enhance pluripotency of a cell.
In one embodiment, maintaining or enhancing pluripotency comprises producing pluripotent stem cells, maintaining a homogeneous population of pluripotent stem cells, suppressing stem cell differentiation or reprogramming somatic cells into pluripotent stem cells.
In another embodiment, maintaining or enhancing pluripotency comprises producing pluripotent stem cells comprises increasing the efficiency and/or kinetics of: producing pluripotent stem cells, maintaining a homogeneous population of pluripotent stem cells, suppressing stem cell differentiation or reprogramming somatic cells into pluripotent stem cells.
The disclosure further relates to a method of assessing the pluripotency of a cell population comprising detecting the level of expression of FOXP1-ES in a sample of cells from the population, wherein an increase in the level of FOXP1-ES compared to a reference level in the sample of cells indicates the pluripotency of the cell population.
The disclosure also relates to a method of assessing the pluripotency of a cell population over time comprising
The disclosure further relates to a method of assessing the pluripotency of a cell population comprising detecting the level of expression of MBNL1 and/or MBNL2 in a sample of cells from the population, wherein a decrease in the level of MBNL1 and/or MBNL2 compared to a reference level in the sample of cells indicates the pluripotency of the cell population.
The disclosure also relates to a method of assessing the pluripotency of a cell population over time comprising
Other features and advantages of the present disclosure will become apparent from the following detailed description. It should be understood, however, that the detailed description and the specific examples while indicating embodiments of the disclosure are given by way of illustration only, since various changes and modifications within the spirit and scope of the disclosure will become apparent to those skilled in the art from this detailed description.
The disclosure will now be described in relation to the drawings in which:
“FOXP1-ES”) are specifically detected in hESCs. Transcripts simultaneously including exons 18 and 18b (indicated by an asterisk) are detected at low levels in hESCs, and are predicted to be targeted by nonsense mediated mRNA decay.
(A) Endoderm markers: GATA6 and SOX17
(B) Mesoderm markers: BRACHYURY (T) and GOUSCOID (GSC)
(C) Neural markers: SOX1 and SOX3
(D) Pluripotency markers: OCT4, NANOG and SOX2
] exons overlap the 5″-UTR; [
] exons contain the translation initiation site; [▪] exons contain the translation termination site; exons annotated with an asterisk are truncated such that the resulting mRNA encodes only the first 21 amino acids of the forkhead domain; the [
] exon, when spliced in mRNA with exon 18, is predicted to introduce a premature termination codon (PTC) targeting the corresponding mRNA for degradation by the nonsense-mediated mRNA decay pathway.
) dots represent fold changes in transcript levels detected in H9 hESC RNA samples following knockdown using exon 18- and exon 18b-siRNAs pools, respectively, relative to H9 hESCs transfected with a non-targeting control siRNA pool. Comparison of the RNA-Seq and qRT-PCR measurements yields a correlation of 0.941.
), relative to H9 hESCs transfected with a non-targeting control siRNA pool.
The present inventors have demonstrated that an alternatively spliced variant of forkhead family transcription factor FOXP1 promotes the maintenance of embryonic stem cell pluripotency and the reprogramming of somatic cells to induced pluripotent stem cells.
Accordingly, an isolated nucleic acid molecule including exon 18b of the human FOXP1 gene (hereinafter referred to as hFOXP1-ES; SEQ ID NO: 3) is described. In another embodiment, an isolated nucleic acid molecule including exon 16b of the mouse Foxp1 gene (hereinafter referred to as mFOXP1-ES; SEQ ID NO: 7) is described. An isolated nucleic acid molecule comprising exon 18b (SEQ ID NO: 4) and an isolated nucleic acid molecule comprising exon 16b (SEQ ID NO: 8) are also described. The genomic coordinates for human exons 17, 18, 18b and 19 of hFOXP1 are given in Table 1 and the genomic coordinates for mouse exons 15, 16, 16b and 17 of mFoxp1 are given in Table 2.
The term “isolated” refers to a nucleic acid substantially free of cellular material or culture medium when produced by recombinant DNA techniques, or chemical precursors, or other chemicals when chemically synthesized.
The term “nucleic acid molecule” is intended to include unmodified DNA or RNA or modified DNA or RNA. For example, the nucleic acid molecules or polynucleotides of the disclosure can be composed of single- and double stranded DNA, DNA that is a mixture of single- and double-stranded regions, single- and double-stranded RNA, and RNA that is a mixture of single- and double-stranded regions, hybrid molecules comprising DNA and RNA that may be single-stranded or, more typically double-stranded or a mixture of single- and double-stranded regions. In addition, the nucleic acid molecules can be composed of triple-stranded regions comprising RNA or DNA or both RNA and DNA. The nucleic acid molecules of the disclosure may also contain one or more modified bases or DNA or RNA backbones modified for stability or for other reasons. “Modified” bases include, for example, tritiated bases and unusual bases such as inosine. A variety of modifications can be made to DNA and RNA; thus “nucleic acid molecule” embraces chemically, enzymatically, or metabolically modified forms. The term “polynucleotide” shall have a corresponding meaning.
The term “FOXP1” as used herein refers to the forkhead family transcription factor FOXP1 from any species or source, optionally mammalian, such as human or mouse. While the term “FOXP” is often used to here to human FOXP1, in the present disclosure, the term is also used to refer to FOXP1 from other species. “FOXP1-ES” refers to the FOXP1 gene from any species or source that includes an exon 18b or a homolog thereof, such as exon 16b of mouse.
“hFOXP1-ES” refers to the human FOXP1 gene that includes exon 18b and the protein produced by that gene, also described herein as “FOXP1-ES”. “mFoxp1-ES” refers to the mouse Foxp1 gene that includes exon 16b and the protein produced by that gene, also described herein as “Foxp1-ES”. While the term “FOXP1-ES” is often used to here to human FOXP1-ES, in the present disclosure, the term is also used to refer to FOXP1-ES from other species. FOXP1-ES refers to an ESC-specific isoform of FOXP1. FOXP1-ES also refers to an iPSC-specific isoform of FOXP1.
One aspect of the present disclosure is thus an isolated nucleic acid molecule comprising a nucleotide sequence selected from the group consisting of:
As previously stated, the disclosure includes isolated DNA molecules having such sequences of nucleotides and RNA molecules having such sequences. The disclosure thus includes isolated mRNA transcribed from DNA having such a sequence. The disclosure further encompasses nucleic acid molecules that differ from any of the nucleic acid molecules of the disclosure in codon sequences due to the degeneracy of the genetic code.
In another aspect, the present disclosure includes a fragment of the nucleotide sequence encoding hFOXP1-ES, said fragment comprising exon 18b (SEQ ID NO: 4). The present disclosure also includes a fragment of the nucleotide sequence encoding mFoxp1-ES, said fragment comprising exon 16b (SEQ ID NO: 8). Such fragments can find usefulness as probes or depending on the fragments may even have biological activity themselves. The complement of the probe can find utility in, for example, manufacture of the probe or inhibition of any activity of the fragment, as the case may be. In a particular use, the probe can be used to determine the presence of an RNA molecule in a sample which might, or might not, also include an RNA molecule encoding FOXP1-ES. Such a probe would generally be 20 nucleotides long or be at least 20 nucleotides long. The probe could also be 25, 30, 35, 40, 45, 50, 55, 60 or more nucleotides in length and the probe can include the full length of the complement to the sequence to which it is intended to bind.
The disclosure includes the method of determining the presence of a nucleic acid molecule encoding FOXP1-ES in a sample containing RNA isolated from a cell, using such a probe.
In the context of this specification, the term “conserved” describes similarity between sequences. The degree of conservation between two sequences can be determined by optimally aligning the sequences for comparison. Sequences may be aligned using the Omiga software program, Version 1.13 (Oxford Molecular Group, Inc., Campbell, Calif.). The Omiga software uses the Clustal W Alignment algorithms [Higgins et al., 1989; Higgins et al., 1991; Thompson et al. 1994] Default settings used are as follows: Open gap penalty 10.00; Extend gap penalty 0.05; Delay divergent sequence 40 and Scoring matrix—Gonnet Series. Percent identity or homology between two sequences is determined by comparing a position in the first sequence with a corresponding position in the second sequence. When the compared positions are occupied by the same nucleotide or amino acid, as the case may be, the two sequences are conserved at that position. The degree of conservation between two sequences is often expressed, as it is here, as a percentage representing the ratio of the number of matching positions in the two sequences to the total number of positions compared.
In one aspect, the present disclosure is a nucleic acid molecule which encodes a protein that is a conservatively substituted variant of the protein encoded by the nucleotide sequence of SEQ ID NOS: 3, 4, 7 or 8.
Further, it will be appreciated that the disclosure includes nucleic acid molecules comprising nucleic acid sequences having substantial sequence identity with the nucleic acid sequence as shown in SEQ ID NOS: 3, 4, 7 or 8 or fragments thereof. The term “sequences having substantial sequence identity” means those nucleic acid sequences that have slight or inconsequential sequence variations from these sequences, i.e., the sequences function in substantially the same manner to produce functionally equivalent proteins. The variations may be attributable to local mutations or structural modifications.
Nucleic acid sequences having substantial identity include nucleic acid sequences having at least about 50 percent identity with a protein encoded by SEQ ID NOS: 3, 4, 7 or 8, respectively, or the full-length anti-sense sequence thereto. The level of sequence identity, according to various aspects of the disclosure is at least about: 60, 70, 75, 80, 83, 85, 88, 90, 93, 95 or 98 percent. Methods for aligning the sequences to be compared and determining the level of homology between the sequences are described in detail above.
In one aspect, the nucleic acid molecule having substantial sequence identity described above has a sequence identity of at least about: 50, 60, 70, 75, 80, 83, 85, 88, 90, 93, 95 or 98 percent within the exon 18b (nucleic acid 1531 to 1720 of SEQ ID NO: 3 numbering from the first ATG of the human FOXP1-ES gene sequence) or exon 16b (nucleic acid 1615 to 1784 of SEQ ID NO:7 numbering from the first ATG of the mouse Foxp1-ES gene sequence).
The disclosure is not to be restricted by this sequence identity, for instance, nucleic acid sequences having at least a 50% sequence identity with the sequence shown in SEQ ID NOS: 3, 4, 7 or 8 are also encompassed within the scope of the present disclosure where there is at least 50%, at least 60%, at least 70%, at least 80%, at least 90%, at least 95% or at least 98% homology with the sequence from nucleic 1531 to 1720 of SEQ ID NO: 3 numbering from the first ATG of the human FOXP1-ES gene sequence nucleic acid 1615 to 1784 of SEQ ID NO:7 numbering from the first ATG of the mouse Foxp1-ES gene sequence.
Sequence identity can be calculated according to methods known in the art. Sequence identity is most preferably assessed by the algorithm of BLAST version 2.1 advanced search. BLAST is a series of programs that are available, for example, online from the National Institutes of Health. The advanced blast search is set to default parameters. (ie Matrix BLOSUM62; Gap existence cost 11; Per residue gap cost 1; Lambda ratio 0.85 default). References to BLAST searches are: Altschul, S. F., Gish, W., Miller, W., Myers, E. W. & Lipman, D. J. (1990) “Basic local alignment search tool.” J. Mol. Biol. 215:403410; Gish, W. & States, D. J. (1993) “Identification of protein coding regions by database similarity search.” Nature Genet. 3:266272; Madden, T. L., Tatusov, R. L. & Zhang, J. (1996) “Applications of network BLAST server” Meth. Enzymol. 266:131—141; Altschul, S. F., Madden, T. L., Schäffer, A. A., Zhang, J., Zhang, Z., Miller, W. & Lipman, D. J. (1997) “Gapped BLAST and PSI_BLAST: a new generation of protein database search programs.” Nucleic Acids Res. 25:33893402; Zhang, J. & Madden, T. L. (1997) “PowerBLAST: A new network BLAST application for interactive or automated sequence analysis and annotation.” Genome Res. 7:649656.
The term “sequence that hybridizes” means a nucleic acid sequence that can hybridize to a sequence of (a), (b) or (c) under stringent hybridization conditions. Appropriate stringency conditions which promote nucleic acid hybridization are known to those skilled in the art, or may be found in Current Protocols in Molecular Biology, John Wiley & Sons, N.Y. (1989), 6.3.1-6.3.6. The term “stringent hybridization conditions” as used herein means that conditions are selected which promote selective hybridization between two complementary nucleic acid molecules in solution. Hybridization may occur to all or a portion of a nucleic acid sequence molecule. The hybridizing portion is at least 50% the length with respect to one of the polynucleotide sequences encoding a polypeptide. In this regard, the stability of a nucleic acid duplex, or hybrids, is determined by the Tm, which in sodium containing buffers is a function of the sodium ion concentration, G/C content of labeled nucleic acid, length of nucleic acid probe (I), and temperature (Tm=81.5° C.−16.6 (Log 10[Na+])+0.41(% (G+C)−600/I). Accordingly, the parameters in the wash conditions that determine hybrid stability are sodium ion concentration and temperature. In order to identify molecules that are similar, but not identical, to a known nucleic acid molecule a 1% mismatch may be assumed to result in about a 1° C. decrease in Tm, for example if nucleic acid molecules are sought that have a greater than 95% identity, the final wash will be reduced by 5° C. Based on these considerations stringent hybridization conditions shall be defined as: hybridization at 5× sodium chloride/sodium citrate (SSC)/5×Denhardt's solution/1.0% SDS at Tm (based on the above equation)-5° C., followed by a wash of 0.2×SSC/0.1% SDS at 60° C.
Isolated nucleic acid molecules having sequences which differ from the nucleic acid sequence shown in SEQ ID NOS: 3, 4, 7 or 8 due to degeneracy in the genetic code are also within the scope of the disclosure. Such nucleic acids encode functionally equivalent proteins or peptides but differ in sequence from the above mentioned sequences due to degeneracy in the genetic code.
An isolated nucleic acid molecule of the disclosure which comprises DNA can be isolated by preparing a labelled nucleic acid probe based on all or part of the nucleic acid sequences as shown in SEQ ID NOS: 3, 4, 7 or 8 and using this labelled nucleic acid probe to screen an appropriate DNA library (e.g. a cDNA or genomic DNA library).
An isolated nucleic acid molecule of the disclosure which is DNA can also be isolated by selectively amplifying a nucleic acid encoding a novel protein of the disclosure using the polymerase chain reaction (PCR) methods and cDNA or genomic DNA. It is possible to design synthetic oligonucleotide primers from the nucleic acid sequences shown in SEQ ID NOS: 3, 4, 7 or 8 for use in PCR. A nucleic acid can be amplified from cDNA or genomic DNA using these oligonucleotide primers and standard PCR amplification techniques. The nucleic acid so amplified can be cloned into an appropriate vector and characterized by DNA sequence analysis. It will be appreciated that cDNA may be prepared from mRNA, by isolating total cellular mRNA by a variety of techniques, for example, by using the guanidinium-thiocyanate extraction procedure of Chirgwin et al., Biochemistry, 18, 5294 5299 (1979). cDNA is then synthesized from the mRNA using reverse transcriptase (for example, Moloney MLV reverse transcriptase available from Gibco/BRL, Bethesda, Md., or AMV reverse transcriptase available from Seikagaku America, Inc., St. Petersburg, Fla.).
An isolated nucleic acid molecule of the disclosure which is RNA can be isolated by cloning a cDNA encoding a novel protein of the disclosure into an appropriate vector which allows for transcription of the cDNA to produce an RNA molecule which encodes a protein of the disclosure. For example, a cDNA can be cloned downstream of a bacteriophage promoter, (e.g., a T7 promoter) in a vector, cDNA can be transcribed in vitro with T7 polymerase, and the resultant RNA can be isolated by standard techniques.
A nucleic acid molecule of the disclosure may also be chemically synthesized using standard techniques. Various methods of chemically synthesizing polydeoxynucleotides are known, including solid-phase synthesis which, like peptide synthesis, has been fully automated in commercially available DNA synthesizers (See e.g., Itakura et al. U.S. Pat. No. 4,598,049; Caruthers et al. U.S. Pat. No. 4,458,066; and Itakura U.S. Pat. Nos. 4,401,796 and 4,373,071).
The sequence of a nucleic acid molecule of the disclosure may be inverted relative to its normal presentation for transcription to produce an antisense nucleic acid molecule. The term “antisense” nucleic acid molecule is a nucleotide sequence that is complementary to its target. Preferably, an antisense sequence is constructed by inverting a region preceding or targeting the initiation codon or an unconserved region. In another embodiment the antisense sequence targets all or part of the mRNA or cDNA of FOXP1-ES or FOXP1. In particular, the nucleic acid sequences contained in the nucleic acid molecules of the disclosure or a fragment thereof may be inverted relative to its normal presentation for transcription to produce antisense nucleic acid molecules. In one embodiment the antisense molecules can be used to inhibit FOXP1-ES or FOXP1 expression.
The antisense nucleic acid molecules of the disclosure or a fragment thereof, may be chemically synthesized using naturally occurring nucleotides or variously modified nucleotides designed to increase the biological stability of the molecules or to increase the physical stability of the duplex formed with mRNA or the native gene e.g. phosphorothioate derivatives and acridine substituted nucleotides. The antisense sequences may be produced biologically using an expression vector introduced into cells in the form of a recombinant plasmid, phagemid or attenuated virus in which antisense sequences are produced under the control of a high efficiency regulatory region, the activity of which may be determined by the cell type into which the vector is introduced.
In another embodiment, the disclosure provides interfering RNA molecules such as siRNAs, shRNAs and miRNAs that target and silence FOXP1 or FOXP1-ES. Accordingly, in one aspect, the disclosure provides siRNAs for knockdown of human FOXP1 comprising SEQ ID NOs: 25-28, human FOXP1-ES comprising SEQ ID NOs: 29-32, mouse FOXP1 comprising SEQ ID NOs: 17-20 and mouse FOXP1-ES comprising SEQ ID NOs: 21-24. In another aspect, the disclosure provides shRNAs for knockdown of mouse Foxp1 isoforms (SEQ ID NOs: 33-35).
The disclosure also provides an isolated nucleic acid molecule comprising an antisense oligonucleotide to the nucleic acid sequence as shown in any one of SEQ ID NOS: 36-43. Optionally, the antisense oligonucleotide is between 1 and 300 nucleotides in length, optionally 2-50, 2-30 or 5-20 nucleotides in length. In one embodiment, the antisense oligonucleotide is optionally 2 to 50, 5 to 40 or 10 to 25 nucleotides in length.
The disclosure also provides a kit for modulating the expression of FOXP1-ES wherein the kit comprises at least two antisense oligonucleotides to the nucleic acid sequence as shown in any one of SEQ ID NOS: 36-43.
The present application further contemplates an isolated FOXP1-ES polypeptide, human FOXP1-ES polypeptide (SEQ ID NO: 11) or mouse Foxp1-ES polypeptide (SEQ ID NO: 15) or homologs thereof. In an embodiment of the disclosure, an isolated polypeptide is provided which has the amino acid sequence as shown in SEQ ID NO:11 or a fragment, having retained the amino acids encoded by exon 18b, thereof. In another embodiment of the disclosure, an isolated polypeptide is provided which has the amino acid sequence as shown in SEQ ID NO:15 or a fragment, having retained the amino acids encoded by exon 16b, thereof. The present disclosure also encompasses peptides comprising the amino acid sequence shown in SEQ ID NOs: 12 (exon 18b) and 16 (exon 16b).
Within the context of the present application, a polypeptide of the disclosure may in one embodiment include various structural forms of the primary protein. For example, a polypeptide of the disclosure may be in the form of acidic or basic salts or in neutral form. In addition, individual amino acid residues may be modified by oxidation or reduction.
In addition to the full-length amino acid sequences corresponding to human FOXP1-ES and mouse Foxp1-ES (SEQ ID NOs: 11 and 15, respectively), the polypeptides of the present disclosure may also include truncations of the polypeptides, and analogs, and homologs of the proteins and truncations thereof as described herein. Truncated polypeptides may comprise peptides of at least 10 and preferably at least fourteen amino acid residues.
In an embodiment, the disclosure provides a peptide fragment comprising amino acids 511 to 565 of SEQ ID NO: 11 or an analog or homolog thereof. In one embodiment, the disclosure provides a peptide fragment comprising the amino acid sequence shown in SEQ ID NO: 12 or an analog or homolog thereof.
In an embodiment, the disclosure provides a peptide fragment comprising amino acids 538 to 594 of SEQ ID NO: 15 or an analog or homolog thereof. In one embodiment, the disclosure provides a peptide fragment comprising the amino acid sequence shown in SEQ ID NO: 16 or an analog or homolog thereof.
Analogs of the proteins having the amino acid sequences shown in SEQ ID NO: 11 or 15 as described herein, may include, but are not limited to an amino acid sequence containing one or more amino acid substitutions, insertions, and/or deletions. Amino acid substitutions may be of a conserved or non-conserved nature. Conserved amino acid substitutions involve replacing one or more amino acids of the proteins of the disclosure with amino acids of similar charge, size, and/or hydrophobicity characteristics. When only conserved substitutions are made the resulting analog should be functionally equivalent. Non-conserved substitutions involve replacing one or more amino acids of the amino acid sequence with one or more amino acids which possess dissimilar charge, size, and/or hydrophobicity characteristics.
Conservative substitutions are described in the patent literature, as for example, in U.S. Pat. No. 5,264,558. It is thus expected, for example, that interchange among non-polar aliphatic neutral amino acids, glycine, alanine, proline, valine and isoleucine, would be possible. Likewise, substitutions among the polar aliphatic neutral amino acids, serine, threonine, methionine, asparagine and glutamine could possibly be made. Substitutions among the charged acidic amino acids, aspartic acid and glutamic acid, could probably be made, as could substitutions among the charged basic amino acids, lysine and arginine. Substitutions among the aromatic amino acids, including phenylalanine, histidine, tryptophan and tyrosine would also likely be possible. Other substitutions might well be possible.
One or more amino acid insertions may be introduced into the amino acid sequences shown in SEQ ID NO: 11 or 15. Amino acid insertions may consist of single amino acid residues or sequential amino acids ranging from 2 to 15 amino acids in length. This procedure may be used in vivo to inhibit the activity of FOXP1-ES.
Deletions may consist of the removal of one or more amino acids, or discrete portions from the amino acid sequence shown in SEQ ID NO: 11, excluding the region corresponding to exon 18b (511 to 565 of SEQ ID NO: 11) or from the amino acid sequence shown in SEQ ID NO:15, excluding the region corresponding to exon 16b (amino acids 538 to 594 of SEQ ID NO: 15). The deleted amino acids may or may not be contiguous. The lower limit length of the resulting analog with a deletion mutation is about 10 amino acids, preferably 100 amino acids.
Exemplary methods of making the alterations set forth above are disclosed by Sambrook et al (Molecular Cloning: A Laboratory Manual, 2nd Ed., Cold Spring Harbor Laboratory Press, 1989).
The proteins of the disclosure also include homologs of the amino acid sequence shown in SEQ ID NO: 11 or 15 and/or truncations thereof as described herein. Such homologs are proteins whose amino acid sequences are encoded by nucleic acid sequences that hybridize under stringent hybridization conditions (see discussion of stringent hybridization conditions herein) with a probe used to obtain a protein of the disclosure. Homologs of a protein of the disclosure will have the same regions which are characteristic of the protein.
A homologous protein includes a protein with an amino acid sequence having at least 60%, at least 70%, at least 80%, at least 90%, at least 95% or at least 98% identity with the amino acid sequence as shown in SEQ ID NO: 11 or 15.
The present application also includes a protein described above conjugated with a selected protein, or a selectable marker protein (see below) to produce fusion proteins.
The proteins described above (including truncations, analogs, etc.) may be prepared using recombinant DNA methods. These proteins may be purified and/or isolated to various degrees using techniques known in the art. Accordingly, nucleic acid molecules of the present disclosure having a sequence which encodes a protein of the disclosure may be incorporated according to procedures known in the art into an appropriate expression vector which ensures good expression of the protein. Possible expression vectors include but are not limited to cosmids, plasmids, or modified viruses (e.g., replication defective retroviruses, adenoviruses and adeno-associated viruses), so long as the vector is compatible with the host cell used. The expression “vectors suitable for transformation of a host cell”, means that the expression vectors contain a nucleic acid molecule of the disclosure and regulatory sequences, selected on the basis of the host cells to be used for expression, which are operatively linked to the nucleic acid molecule. “Operatively linked” is intended to mean that the nucleic acid is linked to regulatory sequences in a manner which allows expression of the nucleic acid.
The disclosure therefore contemplates a recombinant expression vector of the disclosure containing a nucleic acid molecule of the disclosure, or a fragment thereof, and the necessary regulatory sequences for the transcription and translation of the inserted protein-sequence. The disclosure further provides a recombinant expression vector comprising a DNA nucleic acid molecule of the disclosure cloned into the expression vector in an antisense orientation. That is, the DNA molecule is operatively linked to a regulatory sequence in a manner which allows for expression, by transcription of the DNA molecule, of an RNA molecule which is antisense to a nucleotide sequence of the disclosure optionally comprising the nucleotides as shown in SEQ ID NO: 3, 4, 7 or 8 or fragments thereof. Regulatory sequences operatively linked to the antisense nucleic acid can be chosen which direct the continuous expression of the antisense RNA molecule.
The recombinant expression vectors of the disclosure may also contain a selectable marker gene that facilitates the selection of host cells transformed or transfected with a recombinant molecule of the disclosure. Examples of selectable marker genes are genes encoding a protein which confers resistance to certain drugs, such as G418 and hygromycin.
Recombinant expression vectors can be introduced into host cells to produce a transformed host cell. The term “transformed host cell” is intended to include prokaryotic and eukaryotic cells which have been transformed or transfected with a recombinant expression vector of the disclosure. The terms “transformed with”, “transfected with”, “transformation” and “transfection” are intended to encompass introduction of nucleic acid (e.g. a vector) into a cell by one of many possible techniques known in the art.
Suitable host cells include a wide variety of prokaryotic and eukaryotic host cells. For example, the proteins of the disclosure may be expressed in bacterial cells such as E. coli, insect cells (using baculovirus), yeast cells or mammalian cells, COS1 cells. Other suitable host cells can be found in Goeddel, Gene Expression Technology: Methods in Enzymology 185, Academic Press, San Diego, Calif. (1991).
The disclosure includes a microbial cell that contains and is capable of expressing a heterologous a nucleic acid molecule having a nucleotide sequence as encompassed by the disclosure. The heterologous nucleic acid molecule can be DNA.
Isolated DNA of the disclosure can be contained in a recombinant cloning vector.
The disclosure includes a stably transfected cell line which expresses any one or more proteins as defined by the disclosure.
The disclosure includes a culture of cells transformed with a recombinant DNA molecule having a nucleotide sequence as encompassed by the disclosure.
The application also contemplates a process for producing any protein as defined by the disclosure. The process includes such steps as:
preparing a DNA fragment including a nucleotide sequence which encodes said protein;
incorporating the DNA fragment into an expression vector to obtain a recombinant DNA molecule which includes the DNA fragment and is capable of undergoing replication;
transforming a host cell with said recombinant DNA molecule to produce a transformant which can express said protein;
culturing the transformant to produce said protein; and
recovering said protein from resulting cultured mixture.
More particularly, the application provides a method of preparing a purified protein of the disclosure comprising introducing into a host cell a recombinant nucleic acid encoding the protein, allowing the protein to be expressed in the host cell and isolating and purifying the protein.
Alternatively, the protein or parts thereof can be prepared by chemical synthesis using techniques well known in the chemistry of proteins such as solid phase synthesis or synthesis in homogeneous solution.
The application provides binding proteins that bind to the FOXP1-ES protein, but do not bind to the FOXP1 protein.
The term “binding protein” as used herein refers to proteins that specifically bind to another substance. In an embodiment, the binding proteins bind to the human FOXP1-ES protein, but not to the human FOXP1 protein. Accordingly, in one embodiment, the binding protein binds to the protein having the amino acid sequence shown in SEQ ID NO:11. In another embodiment, the binding protein binds to a protein encoded by the nucleic acid sequence as shown in SEQ ID NO: 3. In another embodiment, the binding proteins bind to the mouse Foxp1-ES protein, but not to the mouse Foxp1 protein. Accordingly, in one embodiment, the binding protein binds to the protein having the amino acid sequence shown in SEQ ID NO: 15. In another embodiment, the binding protein binds to a protein encoded by the nucleic acid sequence as shown in SEQ ID NO:7. In another embodiment, the binding proteins are antibodies or antibody fragments thereof. In a further embodiment, the binding proteins are monoclonal antibodies or fragments thereof. The term “antibody” as used herein is intended to include monoclonal antibodies, polyclonal antibodies, and chimeric antibodies. The antibody may be from recombinant sources and/or produced in transgenic animals. The term “antibody fragment” as used herein is intended to include Fab, Fab′, F(ab′)2, scFv, dsFv, ds-scFv, dimers, minibodies, diabodies, and multimers thereof and bispecific antibody fragments. Antibodies can be fragmented using conventional techniques.
In one embodiment of the disclosure, the binding protein binds to an epitope on the FOXP1-ES protein comprising the portion that is encoded by exon 18b which is normally not found in the wild type FOXP1 protein. For example, in one embodiment of the disclosure the epitope comprises the amino acid sequence as shown in SEQ ID NO:12, or a fragment thereof. Accordingly, in another embodiment, the binding protein binds to an epitope on the mouse Foxp1-ES protein comprising the portion that is encoded by exon 16b which is normally not found in the wild type mouse Foxp1 protein. For example, in one embodiment of the disclosure the epitope comprises the amino acid sequence as shown in SEQ ID NO:16, or a fragment thereof.
The term “epitope” as used herein refers to the part of the protein which contacts the antigen binding site of the binding protein of the disclosure.
The fragments of FOXP1-ES described above are useful as antigens in preparing antibodies that bind to the protein portion encoded by FOXP1-ES but do not bind to the FOXP1. In a particular embodiment, the antigen comprises the unique sequence in human FOXP1-ES, which is encoded by exon 18b as shown in SEQ ID NO. 4. In another embodiment, the antigen comprises the unique sequence in mouse Foxp1-ES which is encoded by exon 16b as shown in SEQ ID NO. 8.
In an embodiment, the disclosure provides a method of making an antibody that binds to FOXP1-ES comprising the following steps:
In an embodiment, the disclosure provides a method of making an antibody that binds to mouse FOXP1-ES comprising the following steps:
Conventional methods are useful to prepare the antibodies. For example, by using a peptide of a protein of the disclosure, polyclonal antisera or monoclonal antibodies can be made using standard methods.
Specific antibodies, or antibody fragments reactive against a protein of the disclosure may also be generated by screening expression libraries encoding immunoglobulin genes, or portions thereof, expressed in bacteria with peptides produced from nucleic acid molecules of the present disclosure. For example, complete Fab fragments, VH regions and FV regions can be expressed in bacteria using phage expression libraries (See for example Ward et al., Nature 341, 544-546: (1989); Huse et al., Science 246, 1275-1281 (1989); and McCafferty et al. Nature 348, 552-554 (1990)).
In another embodiment of the disclosure, the binding proteins are apatmers, optionally nucleic acid (DNA or RNA) derived or peptide derived aptamers.
The present disclosure also relates to alternative splicing modulators to promote, enhance or maintain pluripotency.
In one embodiment, the modulators regulate FOXP1 alternative splicing. In one aspect of the invention, the present disclosure relates to FOXP1 “exon 18b splicing modulators”. In one embodiment, an “exon 18b splicing modulator” is a stimulator of FOXP1 exon 18b inclusion. In another embodiment, an exon 18b splicing modulator is a repressor of FOXP1 exon 18b inclusion.
The term “stimulator of exon 18b inclusion” as used herein includes all substances that can stimulate the inclusion of FOXP1 exon 18b or homologs thereof and includes, without limitation, modified or unmodified nucleic acids, peptides, polypeptides, proteins or fragments thereof, small molecule activators, chemical compounds, antibodies (and fragments thereof), aptamers, interfering RNA molecules such as siRNA, miRNA and shRNA and other substances that can stimulate the inclusion of FOXP1 exon 18b. The term “stimulator of exon 18b inclusion” also includes agents which act as cis-regulatory or trans-regulatory elements of FOXP1 exon 18b and agents which recognize intronic sequences located 300 bp upstream (SEQ ID NO: 36) or 600 bp downstream (SEQ ID NO: 37) of human FOXP1 exon 18 or 300 bp upstream (SEQ ID NO: 38) or 300 bp downstream (SEQ ID NO: 39) of human FOXP1 exon 18b.
In some embodiments of the present disclosure, the splicing modulator is MBNL1 (also known as Muscleblind-like). While the term “MBNL1” is often used to here to human MBNL1, in the present disclosure, the term “MBNL1” refers to a protein encoded by the MBNL1 gene and encompasses MBNL1 from any species or source, optionally mammalian, such as human or mouse and isoforms and homologs thereof.
Optionally, the term MBNL1 relates to human MBNL1. A sequence corresponding to human MBNL1 is provided in GenBank under Accession No. AAH50535.1
Optionally, the term Mbnl1 relates to mouse Mbnl1. A sequence corresponding to mouse Mbnl1 is provided in GenBank under Accession No. AAH96600.1.
In some embodiments of the present disclosure, the splicing modulator is MBNL2 (also known as Muscleblind-like 2). While the term “MBNL2” is often used to here to human MBNL2, in the present disclosure, the term “MBNL2” refers to a protein encoded by the MBNL2 gene and encompasses MBNL2 from any species or source, optionally mammalian, such as human or mouse and isoforms and homologs thereof.
Optionally, the term MBNL2 relates to human MBNL2. A sequence corresponding to human MBNL2 is provided in GenBank under Accession No. AAI04040.1.
Optionally, the term Mbnl2 relates to mouse Mbnl2. A sequence corresponding to mouse Mbnl2 is provided in GenBank under Accession No. AAH75665.1.
In other embodiments of the present disclosure, the splicing modulator is TIA1 (also known as T Cell intracellular antigen-1). While the term “TIA1” is often used to here to human TIA1, in the present disclosure, the he term “TIA1” refers to a protein encoded by the TIA1 gene and encompasses TIA1 from any species or source, optionally mammalian, such as human or mouse and isoforms and homologs thereof.
Optionally, the term TIA1 relates to human TIA1. A sequence corresponding to human TIA1 is provided in GenBank under Accession No. AAH15944.1.
Optionally, the term TIA1 relates to mouse Tia1. A sequence corresponding to mouse Tia1 is provided in GenBank under Accession No. AAH23813.1.
In other embodiments of the present disclosure, the splicing modulator is TIAL1 (also known as TIA1 cytotoxic granule-associated RNA binding protein-like 1). While the term “TIAL1” is often used to here to human TIAL1, in the present disclosure, the he term “TIAL1” refers to a protein encoded by the TIAL1 gene and encompasses TIAL1 from any species or source, optionally mammalian, such as human or mouse and isoforms and homologs thereof.
Optionally, the term TIAL1 relates to human TIAL1. A sequence corresponding to human TIAL1 is provided in GenBank under Accession No. AAH30025.1.
Optionally, the term TIAL1 relates to mouse Tial1. A sequence corresponding to mouse Tial1 is provided in GenBank under Accession No. AAH10496.1.
In an embodiment, a “stimulator of exon 18b inclusion” is TIA1, TIAL1, an MBNL1 and/or MBNL2 antagonist or antisense or siRNA that decreases the expression of FOXP1.
An “MBNL1 and/or MBNL2 antagonist” is any factor that decreases the expression and/or function of MBNL1 and/or MBNL2. Optionally, an “MBNL1 and/or MBNL2 antagonist” is any factor that reduces and/or inhibits the activity of MBNL1 and/or MBNL2. Optionally, the activity of MBNL1 and/or MBNL2 includes the regulation of alternative splicing of FOXP1 and other ESC/iPSC-specific targets. In one embodiment, an “MBNL1 and/or MBNL2 antagonist” promotes inclusion of exon 18b in FOXP1 and/or exon 16b in Foxp1.
Examples of MBNL1 and/or MBNL2 antagonists include antibody or peptide or nucleic-acid derived aptamers to MBNL1 or MBNL2, antisense RNA or small interfering RNA that decrease expression of MBNL1 and/or MBNL2, or compounds that target MBNL1 and/or MBNL2, or the genes encoding MBNL1 and/or MBNL2. MBNL1 and MBNL2 antagonists are optionally used alone or in combination.
The term “repressor of exon 18b inclusion” as used herein includes all substances that can repress the inclusion of FOXP1 exon 18b or homologs thereof and includes, without limitation, modified or unmodified nucleic acids, peptides, polypeptides, proteins or fragments thereof, small molecule activators, chemical compounds, antibodies (and fragments thereof), aptamers, interfering RNA molecules such as siRNA, miRNA and shRNA, and other substances that can repress the inclusion of FOXP1 exon 18b. The term “repressor of exon 18b inclusion” also includes agents which act as cis-regulatory or trans-regulatory elements of FOXP1 exon 18b and agents which recognize intronic sequences located 300 bp upstream (SEQ ID NO: 36) or 300 bp downstream (SEQ ID NO: 37) of human FOXP1 exon 18 or 300 bp upstream (SEQ ID NO: 38) or 300 bp downstream (SEQ ID NO: 39) of human FOXP1 exon 18b.
In an embodiment of the present disclosure, a “repressor of exon 18b inclusion” is MBNL1, MBNL2, a TIA1 or TIAL1 antagonist, or antisense or siRNA that decreases expression of FOXP1-ES.
A “TIA1 or TIAL1 antagonist” is any factor that decreases the expression and/or function of TIA1 or TIAL1. Optionally, an “TIA1 or TIAL1 antagonist” is any factor that reduces and/or inhibits the activity of TIA1 or TIAL1. Optionally, the activity of TIA1 or TIAL1 includes the regulation of alternative splicing of FOXP1 and other ESC/iPSC-specific targets. In one embodiment, a “TIA1 or TIAL1 antagonist” represses inclusion of exon 18b in FOXP1 and/or exon 16b in Foxp1.
Examples of TIA1 or TIAL1 antagonists include antibody or peptide or nucleic-acid derived aptamers to TIA1 or TIAL1, antisense RNA or small interfering RNA that decrease expression of TIA1 or TIAL1, or compounds that target TIA1 or TIAL1 or the genes encoding TIA1 or TIAL1. TIA1 and TIAL1 antagonists are optionally used alone or in combination.
In one embodiment, the “exon 18b splicing modulator” is an antisense oligonucleotide to any one of SEQ ID NOs 36-39. In one aspect of the disclosure, the antisense oligonucleotides act to inhibit the binding of factors that stimulate or repress the inclusion of exon 18b. In another embodiment, antisense oligonucleotides are tethered to factors that stimulate or repress the inclusion of exon 18b.
Also contemplated in the disclosure are “exon 16b splicing modulators”.
The “exon 16b splicing modulator” can be a stimulator of Foxp1 exon 16b inclusion or a repressor of Foxp1 exon 16b inclusion.
The term “stimulator of exon 16b inclusion” or “repressor of exon 16b inclusion” as used herein includes all substances that can stimulate or repress the inclusion of mouse Foxp1 exon 16b and includes, without limitation, modified or unmodified nucleic acids, peptides, polypeptides, proteins or fragments thereof, small molecule activators, chemical compounds, antibodies (and fragments thereof), aptamers, interfering RNA molecules such as siRNA, miRNA and shRNA and other substances that can stimulation the inclusion of mouse Foxp1 exon 16b. The term “stimulator of exon 16b inclusion” includes agents which act as cis-regulatory or trans-regulatory elements of Foxp1 exon 16b and agents which recognize intronic sequences located 300 bp upstream (SEQ ID NO: 40) or 300 bp downstream (SEQ ID NO: 41) of mouse Foxp1 exon 16 or 300 bp upstream (SEQ ID NO: 42) or 300 bp downstream (SEQ ID NO: 43) of mouse Foxp1 exon 16b.
In one embodiment, the “exon 16b splicing modulator” is an antisense oligonucleotide to any one of SEQ ID NOs 40-43. In one aspect of the disclosure, the antisense oligonucleotides act to inhibit the binding of factors that stimulate or repress the inclusion of exon 16b. In another embodiment, antisense oligonucleotides are tethered to factors that stimulate or repress the inclusion of exon 16b.
The above nucleic acids, polypeptides, peptides, exon 18b splicing modulators and exon 16b splicing modulators of the disclosure can be used to promote the maintenance of the self-renewal and pluripotency properties of cells. The above nucleic acids, polypeptides peptides and exon 16b/18b splicing modulators of the disclosure can also be used to reprogram somatic cells into induced pluripotent stem cells.
In one aspect of the methods and uses provided in the present disclosure, the cells are from any species, optionally a mammalian species such as human or mouse.
The above described nucleic acid, polypeptide and peptide molecules allow those skilled in the art to maintain the pluripotent state of stem cells.
The term “stem cell” as used herein refers to a cell that has the ability for self-renewal (the ability to go through numerous cycles of cell division while maintaining the undifferentiated state) and the capacity to differentiate into specialized cell types. The stem cells of the disclosure are optionally embryonic stem cells or induced pluripotent stem cells. In one embodiment, the cells are from any species, optionally a mammalian species such as human or mouse.
In one embodiment, the stem cell is a pluripotent stem cell. The term “pluripotent” as used herein refers to the ability of a cell to differentiate into one of many cell types. Embryonic stem cells and induced pluripotent stem cells are pluripotent stem cells.
The term “differentiation” or “differentiated” as used herein refers to the process by which a less specialized cell, such as a stem cell, becomes a more specialized cell type, such that it is committed to a specific lineage.
The terms “maintain the pluripotent state” or “maintain the undifferentiated state” of stem cells may include maintaining the pluripotency of stem cells, maintaining the self-renewal of stem cells and/or suppressing stem cell differentiation.
As used herein the term “cell culture” refers to one or more cells grown under controlled conditions and optionally includes a cell line. The term “cell line” refers to a plurality of cells that are the product of a single group of parent cells. According to the teaching of the present disclosure, cells may be cultured according to any method known in the art. Optionally, cells may be cultured in culture medium comprising conditioned medium, non-conditioned medium, or embryonic stem cell medium. Examples of suitable conditioned medium include IMDM, DMEM, or aMEM, conditioned with embryonic fibroblast cells (e.g. human embryonic fibroblast cells or mouse embryonic fibroblast cells), or equivalent medium. Examples of suitable non-conditioned medium include Iscove's Modified Delbecco's Medium (IMDM), DMEM, or aMEM, or equivalent medium. The culture medium may comprise serum (e.g. bovine serum, fetal bovine serum, calf bovine serum, horse serum, human serum, or an artificial serum substitute) or it may be serum free.
Accordingly, in one aspect of the disclosure, a method is provided for maintaining a homogeneous population of pluripotent stem cells. Optionally, a homogeneous population of pluripotent stem cells is a population of cells comprising at least: 50%, 60%, 70%, 80%, 90% or 100% pluripotent stem cells. Optionally, the pluripotent stem cells are induced pluripotent stem cells.
In one embodiment, a method of maintaining a homogeneous population of pluripotent stem cells comprises:
In one embodiment of the method, the FOXP1-ES or FOXP1 is human FOXP1-ES or human FOXP1. In another embodiment, the FOXP1-ES or FOXP1 is mouse Foxp1-ES or mouse Foxp1. In a further embodiment, the cells are embryonic stem cells optionally human embryonic stem cells or mouse embryonic stem cells. In yet another embodiment, the cells are induced pluripotent stem cells.
In another embodiment of the disclosure, a method of maintaining a homogenous population of stem cells comprises decreasing the expression or function of FOXP1. Decreasing the expression of FOXP1 may be accomplished by any method known in the art including, but not limited to the use of genetic modifications, including knockdown by antisense RNA, small interfering RNAs, microRNAs, antibodies or binding proteins, aptamers, retroviral agents and small molecule compounds or toxins.
In another aspect of the disclosure, a method is provided for suppressing the differentiation of stem cells. The stem cells may be embryonic stem cells, optionally embryonic stem cells of any species such as human embryonic stem cells, mouse embryonic stem cells or induced pluripotent stem cells. The term “suppressing the differentiation” refers to suppressing the differentiation or maturation of a stem cell to later lineage cell stages.
In one embodiment, a method for suppressing the differentiation of stem cells comprises:
In one embodiment of the method, the FOXP1-ES or FOXP1 is human FOXP1-ES or human FOXP1. In another embodiment, the FOXP1-ES or FOXP1 is mouse Foxp1-ES or mouse Foxp1. In a further embodiment, the cells are embryonic stem cells of any species, optionally human embryonic stem cells or mouse embryonic stem cells.
In another embodiment of the disclosure, a method for suppressing the differentiation of stem cells comprises decreasing the expression or function of FOXP1. Decreasing the expression of FOXP1 may be accomplished by any method known in the art including, but not limited to the use of genetic modifications, including knockdown by antisense RNA, small interfering RNAs, microRNAs, antibodies or binding proteins, retroviral agents and small molecule compounds or toxins.
In another aspect of the disclosure, a method is provided for producing a population of differentiated cells.
In one embodiment, a method for producing a population of differentiated cells comprises:
In one embodiment of the method, the FOXP1-ES or FOXP1 is human FOXP1-ES or human FOXP1. In another embodiment, the FOXP1-ES or FOXP1 is mouse Foxp1-ES or mouse Foxp1. In a further embodiment, the cells are embryonic stem cells of any species, optionally human embryonic stem cells, mouse embryonic stem cells or induced pluripotent stem cells.
In another embodiment of the disclosure, a method for producing a population of differentiated cells comprises decreasing the expression or function of FOXP1-ES. Decreasing the expression of FOXP1-ES may be accomplished by any method known in the art including, but not limited to the use of genetic modifications, including knockdown by antisense RNA, small interfering RNA, microRNA, antibodies or binding proteins, aptamers, retroviral agents and small molecule compounds or toxins.
(ii) Reprogramming Somatic Cells into Induced Pluripotent Stem Cells
The above described nucleic acid, polypeptide and peptide molecules also allow those skilled in the art to reprogram somatic cells into induced pluripotent stem cells. The somatic cells may be cells from any species, optionally a mammalian species such as human or mouse.
The term “induced pluripotent stem cell” refers to a pluripotent stem cell that has been artificially derived from a non-pluripotent cell.
Accordingly, in an embodiment of the disclosure, the inventors provide a method of reprogramming somatic cells into pluripotent stem cells comprising:
In one embodiment, the FOXP1-ES is human FOXP1-ES. In another embodiment, the FOXP1-ES is mouse Foxp1-ES. In a further embodiment, the somatic cells are somatic cells from any species, optionally human somatic cells or mouse somatic cells.
Optionally, the method comprises reprogramming embryonic fibroblasts, optionally fibroblast from any species such as human embryonic fibroblasts or mouse embryonic fibroblasts into induced pluripotent stem cells.
In some embodiments, the induced pluripotent stem cells exhibit morphology traits similar to the embryonic stem cells described herein such as increased expression of SSEA-1.
The present disclosure further provides isolated induced pluripotent stem cells generated by the method described herein and cells differentiated therefrom. In another aspect, the disclosure provides use of the cells described herein as a source of induced pluripotent stem cells or differentiated cells therefrom.
(iii) Modulation of FOXP1-ES Expression
In another embodiment, the disclosure provides a method of modulating the expression of FOXP1-ES. In one embodiment, the disclosure provides a method of modulating the expression of FOXP1-ES in a cell comprising administering an exon 18b splicing modulator to the cell.
Also contemplated in the disclosure are methods of modulating the expression of mouse Foxp1-ES in a cell comprising administering an exon 16b splicing modulator to the cell.
The term “administering to a cell” includes, without limitation transfecting a cell with nucleic acid corresponding to an exon 18b or 16b splicing modulator, overexpressing an exon 18b or 16b splicing modulator in a cell and exposing a cell to an exon 18b or 16b splicing modulator.
The present disclosure also relates to the use of the exon 18b or 16b splicing modulators to maintain stem cells, for example embryonic stem cells, optionally human or mouse embryonic stem cells, in a self-renewing and pluripotent state. The disclosure further relates to the use of exon 18b or 16b splicing modulators for the production of self-renewing, pluripotent stem cells (for example, induced pluripotent cells).
(iii) Markers of Differentiation
The present disclosure also relates to the use of the nucleic acids, polypeptides and peptides described herein as markers of differentiation or the differentiation state of an individual cell or a population of cells. In one embodiment, the nucleic acids, polypeptides and peptides described herein are for use as markers of cells that have been reprogrammed into pluripotent stem cells.
In one embodiment, the expression of higher levels of FOXP1-ES and/or lower levels of FOXP1 is correlated with pluripotency. In one embodiment, the expression of FOXP1-ES and/or suppression of FOXP1 in a cell indicates that the cell is a pluripotent cell such as an embryonic stem cell or an induced pluripotent stem cell.
In another embodiment, the expression of lower levels of MBNL1 and/or MBNL2 is correlated with pluripotency. In one embodiment, the suppression of MBNL1 and/or MBNL2 in a cell indicates that the cell is a pluripotent cell such as an embryonic stem cell or an induced pluripotent stem cell.
In a further embodiment, the expression of higher levels of FOXP1 and/or lower levels of FOXP1-ES is correlated with cell differentiation. In one embodiment, the expression of FOXP1 and/or suppression of FOXP1-ES in a cell indicates that the cell is a differentiated cell. In yet another embodiment, expression of MBNL1 and/or MBNL2 indicates that the cell is a differentiated cell.
Accordingly, the disclosure relates to a method of assessing the pluripotency of a cell population comprising detecting the level of expression of FOXP1-ES in a sample of cells from the population, wherein an increase in the level of FOXP1-ES compared to a reference level in the sample of cells indicates the pluripotency of the cell population.
The disclosure also relates to a method of assessing the pluripotency of a cell population over time comprising
(a) detecting the level of expression of FOXP1-ES in a sample of cells from the population at a first time point,
(b) detecting the level of expression of FOXP1-ES in a sample of cells from the population at a second time point,
wherein an increase in the level of FOXP1-ES in the sample of cells at the second time point compared to the first time point indicates increased pluripotency of the cell population.
Optionally, the reference level is the expression level of FOXP1-ES in a differentiated or partially differentiated cell. Optionally a 5, 10, 15, 25, 50, 75, 100 or 200% increase in the level of FOXP1-ES in a sample of cells compared to a reference level indicates that the cell population comprises pluripotent cells.
The disclosure further relates to a method of assessing the pluripotency of a cell population comprising detecting the level of expression of FOXP1 in a sample of cells from the population, wherein a decrease in the level of FOXP1 compared to a reference level in the sample of cells indicates the pluripotency of the cell population.
The disclosure also relates to a method of assessing the pluripotency of a cell population over time comprising
(a) detecting the level of expression of FOXP1 in a sample of cells from the population at a first time point,
(b) detecting the level of expression of FOXP1 in a sample of cells from the population at a second time point,
wherein a decrease in the level of FOXP1 in the sample of cells at the second time point compared to the first time point indicates increased pluripotency of the cell population.
Optionally, the reference level is the expression level of FOXP1 in a differentiated or partially differentiated cell. Optionally a 5, 10, 15, 25, 50, 75, 100 or 200% decrease in the level of FOXP1 in a sample of cells compared to a reference level indicates that the cell population comprises pluripotent cells.
The disclosure further relates to a method of assessing the pluripotency of a cell population comprising detecting the level of expression of MBNL1 and/or MBNL2 in a sample of cells from the population, wherein a decrease in the level of MBNL1 and/or MBNL2 compared to a reference level in the sample of cells indicates the pluripotency of the cell population.
The disclosure also relates to a method of assessing the pluripotency of a cell population over time comprising
(a) detecting the level of expression of MBNL1 and/or MBNL2 in a sample of cells from the population at a first time point,
(b) detecting the level of expression of MBNL1 and/or MBNL2 in a sample of cells from the population at a second time point,
wherein a decrease in the level of MBNL1 and/or MBNL2 in the sample of cells at the second time point compared to the first time point indicates increased pluripotency of the cell population.
Optionally, the reference level is the expression level of MBNL1 and/or MBNL2 in a differentiated or partially differentiated cell. Optionally a 5, 10, 15, 25, 50, 75, 100 or 200% decrease in the level of MBNL1 and/or MBNL2 in a sample of cells compared to a reference level indicates that the cell population comprises pluripotent cells.
The disclosure relates to a method of assessing the differentiation state of a cell population comprising detecting the level of expression of FOXP1-ES in a sample of cells from the population, wherein a decrease in the level of FOXP1-ES compared to a reference level in the sample of cells indicates increased differentiation of the cell population.
The disclosure also relate to a method of assessing the differentiation state of a cell population over time comprising
(a) detecting the level of expression of FOXP1-ES in a sample of cells from the population at a first time point,
(b) detecting the level of expression of FOXP1-ES in a sample of cells from the population at a second time point,
wherein a decrease in the level of FOXP1-ES in the sample of cells at the second time point compared to the first time point indicates increased differentiation of the cell population.
Optionally, the reference level is the expression level of FOXP1-ES in a pluripotent cell. Optionally a 5, 10, 15, 25, 50, 75, 100 or 200% decrease in the level of FOXP1-ES in a sample of cells compared to a reference level indicates that the cell population comprises differentiated cells.
The disclosure also relates to a method of assessing the differentiation state of a cell population comprising detecting the level of expression of FOXP1 in a sample of cells from the population, wherein an increase in the level of FOXP1 compared to a reference level in the sample of cells indicates increased differentiation of the cell population.
The disclosure also relates to a method of assessing the differentiation state of a cell population over time comprising
(a) detecting the level of expression of FOXP1 in a sample of cells from the population at a first time point,
(b) detecting the level of expression of FOXP1 in a sample of cells from the population at a second time point,
wherein an increase in the level of FOXP1 in the sample of cells at the second time point compared to the first time point indicates increased differentiation of the cell population.
Optionally, the reference level is the expression level of FOXP1 in a pluripotent cell. Optionally a 5, 10, 15, 25, 50, 75, 100 or 200% increase in the level of FOXP1 in a sample of cells compared to a reference level indicates that the cell population comprises differentiated cells.
The disclosure further relates to a method of assessing the differentiation state of a cell population comprising detecting the level of expression of MBNL1 and/or MBNL2 in a sample of cells from the population, wherein an increase in the level of MBNL1 and/or MBNL2 compared to a reference level in the sample of cells increased differentiation of the cell population.
The disclosure also relates to a method of assessing the pluripotency of a cell population over time comprising
(a) detecting the level of expression of MBNL1 and/or MBNL2 in a sample of cells from the population at a first time point,
(b) detecting the level of expression of MBNL1 and/or MBNL2 in a sample of cells from the population at a second time point,
wherein an increase in the level of MBNL1 and/or MBNL2 in the sample of cells at the second time point compared to the first time point indicates increased differentiation of the cell population.
Optionally, the reference level is the expression level of MBNL1 and/or MBNL2 in a pluripotent cell. Optionally a 5, 10, 15, 25, 50, 75, 100 or 200% increase in the level of MBNL1 and/or MBNL2 in a sample of cells compared to a reference level indicates that the cell population comprises differentiated cells.
As used herein, “expression” relates to the expression of gene and/or proteins in a cell. Methods of measuring the expressions levels of genes and proteins in a cell are well known in the art.
The present disclosure includes evidence that alternative splicing modulators such as MBNL1/Mbnl1 and MBNL2/Mbnl2 regulate a large number ESC/iPSC-specific alternative splicing events
As such, the present disclosure relates also to the use of alternative splicing modulators such as MBNL1, MBNL2, TIA1, and TIAL1 and antagonists thereof, to regulate ESC/iPSC-specific alternative splicing events through FOXP1 exon 18b as well as targets other than FOXP1 exon 18b.
Accordingly, the present disclosure also relates to the use of modulators of ESC/iPSc-specific alternative splicing and reprogramming (for example, MBNL1 antagonists, MBNL2 antagonists, TIA1 and/or TIAL1, alone or in combination) to maintain or enhance pluripotency.
In one embodiment, the application relates to the use of a MBNL1 antagonist and a MBNL2 antagonist to maintain or enhance pluripotency. In another embodiment, the application relates to the use of a TIA1, and TIAL1 to maintain or enhance pluripotency. The phrase “maintaining or enhancing pluripotency” includes producing pluripotent stem cells, maintaining a homogeneous population of pluripotent stem cells, suppressing stem cell differentiation or reprogramming somatic cells into pluripotent stem cells.
Optionally, “maintaining or enhancing pluripotency” as used herein includes, without limitation, increasing the efficiency and/or kinetics of producing pluripotent stem cells, maintaining a homogeneous population of pluripotent stem cells, suppressing stem cell differentiation or reprogramming somatic cells into pluripotent stem cells. “Maintaining or enhancing pluripotency” as used herein also includes, without limitation, increasing the homogeneity of a population of pluripotent stem cells.
“Increasing the efficiency and/or kinetics” refers to increasing the efficiency and/or kinetics of a process by at least 5, 10, 25, 75, 100, 150 or 200%.
In other embodiments, modulators of ESC/iPSC-specific alternative splicing and reprogramming (for example, MBNL1 antagonists, MBNL2 antagonists, TIA1, and TIAL1) are used to accelerate or enhance the production of pluripotent stem cells, suppress stem cell differentiation, or reprogram somatic cells into pluripotent stem cells.
The present disclosure also relates to the use of MBNL1 and/or MBNL2, to promote differentiation or produce a population of differentiated cells. The present disclosure also relates to the use of a TIA1 antagonist and/or a TIAL1 antagonist to promote differentiation or produce a population of differentiated cells.
The following non-limiting examples are illustrative of the present disclosure:
An Embryonic Stem Cell-Specific Splice Variant from the FOXP1 Gene
In order to identify AS events that may function in the control of stem cell pluripotency, microarray profiling was used to compare patterns of alternative splicing (AS) in undifferentiated and differentiated H9 human embryonic stem cells (hESCs;
Microarray analysis indicated that human FOXP1 exon 18 has increased inclusion in day 10 neural progenitor-enriched cells (˜96% inclusion) compared to undifferentiated H9 hESCs (˜79% inclusion). RT-PCR assays using primers specific for FOXP1 exons 17 and 19 confirmed this prediction, and also detected two unexpected additional bands that are ˜50 nt and ˜170 nt longer than the transcripts containing exon 18 (
To establish whether exon 18b is specifically included in self-renewing, pluripotent hESCs, RT-PCR assays were performed to measure the inclusion levels of exon 18 and 18b in flow cytometry-sorted H9 cells that express TRA1-81 and SSEA-3, two cell surface pluripotency markers (
The evolutionary conservation of FOXP1 exons 18 and 18b as well as their regulated AS patterns were examined. A multispecies genome alignment encompassing human FOXP1 exons 17 to 19 and the orthologous sequences from 46 vertebrate species reveals that exons 18 and 18b are located within a ˜1000 nt genomic region that is highly conserved (PhastCons Mean 0.959, variance 0.029) across the analyzed species (
To determine whether differential AS of FOXP1 exons 18 and 18b between hESCs and differentiated cells is conserved, RT-PCR assays were used to analyze the regulation of the orthologous exons (exons 16 and 16b; FIG. S2B) in mouse Foxp1 transcripts (
The following table lists the genomic co-ordinates of exons 17, 18, 18b and 19 of human FOX P1.
The following table lists the genomic co-ordinates of exons 15, 16, 16b and 17 of mouse Foxp1.
FOXP1 and FOXP1-ES have Distinct DNA Binding Specificities
The coding sequence of the DNA-binding Forkhead domain of human FOXP1 overlaps exons 16 to 19. It was investigated whether differential AS of exons 18 and 18b affects the DNA binding properties of FOXP1. The Forkhead domains of FOXP family members are highly homologous and bind the same canonical consensus motif GTAAACA as monomers, homo- and/or heterodimers (Koh et al., 2009; Li et al., 2004). A co-crystal structure of FOXP2 bound to DNA (Stroud et al., 2006) reveals that residues forming direct contacts with DNA are conserved in FOXP1. Interestingly, the inclusion of exon 18b instead of exon 18 is predicted to substitute a total of 35 residues that overlap with the Forkhead domain. Four highly conserved amino acid residues (Asn510, His514, Ala531 and Arg543) in FOXP1 shown to directly contact DNA in FOXP2 are among those substituted (with Gly510, Tyr514, Ser 531 and Gly543) by exon 18b splicing. None of the sequence changes resulting from exon 18b splicing are expected to alter the overall secondary structure of the FOXP1 forkhead domain, nor its capacity to multimerize (amino acids involved in dimer formation are indicated by black dots in
To investigate the consequence of exon 18b splicing on DNA binding, protein binding microarray experiments were performed (PBM: Berger et al., 2008; Berger et al., 2006) to define 8-mer sequences that are preferentially bound by FOXP1 and FOXP1-ES forkhead domains fused to glutathione S transferase (GST). GST-FOXP1 and GST-FOXP1-ES preferentially recognize distinct DNA binding motifs (
The PBM-defined binding preferences of the FOXP1 and FOXP1-ES forkhead domains were confirmed by gel mobility shift assays using the GST-forkhead fusion proteins and radiolabeled dsDNA probes (
These results show that hESC-specific inclusion of exon 18b changes the DNA binding specificity of FOXP1. Substitution of residues Asp510 and His514 as a consequence of exon 18b splicing affects the recognition of the A-T base pair at the fourth position of the FOXP1 consensus binding site. FOXP1-ES binds a T-A base pair at this position in a subset of the preferentially bound PBM sequences. Additional amino acid substitutions in FOXP1-ES affecting conserved residues at the DNA binding interface account for the other changes in the DNA binding properties of this splice isoform, including its preferential binding to specific C/A-rich motifs. In addition to a change in sequence preference, the combined results from the PBM experiments and gel mobility shift assays reveal that GST-FOXP1-ES can bind to a broader spectrum of sequences than does GST-FOXP1, although with apparent reduced affinity since at a similar concentration range of protein, GST-FOXP1-ES bound less efficiently to many of its highest scoring PBM sequences than did GST-FOXP1 (
FOXP1 and FOXP1-ES Regulate Distinct Programs of Gene Expression in hESCs
To show that FOXP1 and FOXP1-ES control the expression of different sets of genes, siRNA knockdowns of each isoform in undifferentiated H9 cells were performed followed by RNA-Seq profiling. H9 hESCs were transfected with custom siRNA pools targeting exons 18 or 18b. Relative to a control siRNA pool (
A significantly higher proportion of genes showed increased expression upon knockdown of FOXP1-ES compared to knockdown of FOXP1 (86% vs. 58.2%; p=1.63E-05, Chi-square test). Moreover, of the 76 genes affected in both knockdowns (
A Gene Ontology (GO) enrichment analysis of genes with decreased mRNA expression upon knockdown of FOXP1 or FOXP1-ES revealed significant enrichment of terms associated with early development (p <1.21E-05,
Validation experiments using qRT-PCR confirmed that knockdown of FOXP1-ES in H9 hESCs results in a ˜2-fold or greater decrease in the levels of mRNA expression of the pluripotency genes OCT4, NANOG, NR5A2, GDF3 and TDGF1, and a ˜2-fold or greater increase in the expression of differentiation-associated genes including GAS1, HESX1, SFRP4 and WNT1 (
These results show that the AS-mediated expression of FOXP1-ES in hESCs predominantly functions in the suppression of a large number of genes with important functions in cell differentiation and development. Moreover, FOXP1-ES is also involved in promoting the expression of a specific subset of genes associated with pluripotency.
Chromatin immunoprecipitation was performed followed by high-throughput sequencing (ChIP-Seq) to identify genes that are bound and potentially directly regulated by FOXP1-ES and FOXP1 in H9 ESCs. To avoid introducing isoform-specific signal bias in the ChIP-Seq experiments, an antibody was used to efficiently immunoprecipitate both FOXP1 isoforms. The ChIP-Seq data revealed >3400 significant peaks across the human genome that were specific to immunoprecipitation with the anti-FOXP1 antibody. These peaks correspond to in vivo sites of FOXP1 and FOXP1-ES occupancy (
Scatterplots directly comparing the relative under-peak enrichment and PBM scores for the individual 8-mers are shown in
These results show that FOXP1 and FOXP1-ES can preferentially bind to similar sequences in vivo as they do in vitro. However, the CGATACA consensus and closely related sequences preferentially bound by FOXP1-ES in vitro do not appear to be widely utilized by this factor in vivo. Instead, FOXP1-ES appears to more often bind a subset of C/A-rich motifs, and possibly lower affinity sites that resemble C/A-rich motifs also bound by FOXP1 (
Examples of candidate direct target genes include several with functions in differentiation and early development and are shown in
Further showing a direct role for FOXP1-ES in regulating OCT4, a statistically significant overlap was observed between the set of RNA-Seq profiled genes that are dependent on FOXP1-ES for expression in H9 hESCs (
Foxp1-ES Expression Promotes mESC Self-Renewal and Pluripotency
To show that Foxp1-ES is important for the maintenance of the self-renewal and pluripotency of mESCs, it was first asked whether increasing its expression over the endogenous levels of Foxp1 is capable of suppressing mESC differentiation. Transgenic CGR8 mESC lines were generated that stably express 3×Flag-Foxp1-ES or 3×Flag-Foxp1 isoforms under Doxycycline (Dox)-inducible control. Upon Dox stimulation 3×Flag-Foxp1 and 3×Flag-Foxp1-ES expression was detected at levels comparable to those of the endogenous proteins, thus resulting in an approximately 2-fold increase in the overall expression of each isoform (
To further show that Foxp1-ES expression is required for the maintenance of stem cell identity, the Dox-inducible 3×Flag-Foxp1 or 3×Flag-Foxp1-ES CGR8 cell lines were cultured in the presence of different levels of Leukemia Inhibitory Factor (LIF), a cytokine which is required for pluripotency maintenance of mESCs. The CGR8 3×Flag-Foxp1 and 3×Flag-Foxp1-ES lines were cultured with an excess of LIF to support mESC self-renewal (
To further show that overexpression of Foxp1-ES but not Foxp1 promotes the maintenance of pluripotency, the 3×Flag-Foxp1 and 3×Flag-Foxp1-ES expressing cell lines were cultured in the absence of exogenous LIF, with or without Dox added to the media. As expected, the two cell lines rapidly differentiated in the absence of Dox and the 3×Flag-Foxp1 line could not be maintained in culture beyond 5 or 6 passages even in the presence of Dox.Strikingly, the 3×Flag-Foxp1-ES line continued to grow in the presence of Dox and these cells were kept in culture over 30 passages in the absence of LIF. These cells are referred to as 3×Flag-Foxp1-ESΔLIF. qRT-PCR analysis performed after the 24th passage confirmed that these cells express Oct4, Nanog and Nr5a2 at levels comparable with those of the parental control CGR8 cells, but they display reduced levels of Sox2, Klf4 and LifR (
It was determined that Foxp1-ES expression is important for the formation of induced pluripotent stem cells (iPSCs) from mouse embryonic fibroblasts (MEFs). For this experiment, secondary mouse MEFs (2° MEFs) were employed that were derived from a primary iPSC line (1°-6C iPSC) containing an integrated piggyBac transposon expressing, under Dox-inducible control, the four Yamanaka transcription factors “OKMS” (Oct4, Klf4, c-Myc and Sox2) required for iPSC reprogramming (Takahashi and Yamanaka, 2006; Woltjen et al., 2009). In the presence of Dox, the 2°-6C MEFs expressing all four factors efficiently form secondary iPSCs (2°-6C iPSCs) that are pluripotent (Woltjen et al., 2009). Consistent with a key role for Foxp1-ES in the maintenance of mESC pluripotency, RT-PCR assays showed that exon 16b (homologous to human exon 18b) is almost completely skipped in primary MEFs but is included to ˜32% in Foxp1 transcripts in iPSCs, which is comparable with its inclusion level observed in mESCs (
It was next investigated whether Foxp1 and Foxp1-ES are important for iPSC formation. Each isoform was selectively knocked down using siRNAs specific for either exon 16b or exon 16 sequences. For each knockdown, sets of four siRNAs targeted to each exon were initially used as pools. Short interfering RNAs comprising these pools that produced the most efficient isoform-specific knockdown (
Quantification of SSEA-1/GFP positive cells by flow cytometry at day 5 of reprogramming is shown in
Taken together with the results described earlier, these data show that the AS mediated switch controlling Foxp1-ES expression provides for efficient iPSC reprogramming as well as for the maintenance of ESC self-renewal and pluripotency.
Using the recently reported splicing code (Barash et al. 2010) and manual inspection of intronic sequences located 300 nucleotides upstream and downstream of FOXP1 exon 18b, candidate cis-regulatory elements of this exon were identified. Both approaches identified sequences resembling binding sites of the regulators MBNL1-3 and TIA1/TIAL1 proteins, which recognize YGCUKY containing sequences- and U-rich motifs, respectively (
Using transfection of Dharmacon siRNA pools, significant and reproducible modulation of exon 18b splicing following depletion of MBNL1, MBNL2, TIA1 and TIAL1 proteins was observed, but negligible effects on exon 18b splicing following knockdown of MBNL3, and two other tested splicing regulators, PTBP1 and PTBP2 (
In addition, MBNL1 and 2 affect the regulation of additional stem cell specific events. Without being bound by theory, these proteins may play a widespread role in negatively regulating a stem cell specific alternative splicing program. The suppression of this program by modulators such as MBNL1 and 2 is a factor in reprogramming somatic cells into pluripotent stem cells.
In order to delineate sequences that are critical for regulation of exon 18b splicing a pre-mRNA splicing reporter was generated by subcloning sequences of FOXP1 including the alternative exons 18 and 18b as well as flanking introns (˜1.1 kb) into the Exontrap vector (MoBiTec) (
Total RNA was extracted from 1-2 g of sample of undifferentiated or differentiating H9 hESCs using TRI reagent (Sigma-Aldrich) as per the manufacturer recommendations. PolyA+ mRNA was purified using Nucleotrap Midiprep kits (Clonetech). Complementary DNA (cDNA) synthesized from the polyA+ RNA samples using the WT-Ovation RNA Amplification System (Nugen) was hybridized to custom AS microarrays in dye swap experiments and analyzed as described previously (Pan et al., 2004).
siRNA Knockdown of Splicing Regulators
Cells were transfected with SMART-pool siRNAs (Dharmacon) targeting potential splicing regulators using DharmaFECT reagent following the manufacturer's instructions, and total RNA was extracted 48 or 72 hours after transfection.
H9 hESCs were transfected with a control, non-targeting siRNA pool (Dharmacon) or with siRNA pools targeting FOXP1 exon 18 or exon 18b sequences using DharmaFECT reagent following the manufacturer recommendations. Two days after transfection, H9 hESCs were transfected again and total RNA was isolated after an additional two days of culture. Total RNA from two independent transfections was pooled and submitted to IIlumina Inc. for polyA+ isolation and mRNA sequencing; datasets consisting of ˜50 million×50 mer reads were generated from each sample. RNA-Seq analysis was performed using a reference database comprising all annotated human splice junctions, essentially as previously described (Pan et al., 2008).
Radiolabeled, semi-quantitative RT-PCR reactions contained 10 ng of total RNA were performed using the OneStep kit (Qiagen), essentially as described previously (Calarco et al., 2007). For qRT-PCR analysis, cDNAs were generated from 2 μg of total RNA using SuperScript® III Reverse Transcriptase (Invitrogen) as per manufacturer recommendations, and reactions were performed in a 384 well format using 20 ng of cDNA and FastStart Universal SYBR Green Master (Rox) (Roche Applied Science).
GST-FOXP1 and GST-FOXP1-ES were expressed, purified and incubated on PBMs as previously described (Badis et al., 2009). Each protein was assayed using two independent array designs, and the resulting data was processed as described in (Lam et al., 2011). Individual 8-mers obtaining an E score >0.45 in at least one of the two experimental repeats for FOXP1 and FOXP1-ES were considered significant (Berger et al., 2008) and aligned to generate consensus sequences using enoLOGOS (Workman et al., 2005).
dsDNA probes for electrophoretic mobility shift assays (EMSAs) contained two tandemly repeated copies of representative PBM-derived FOXP1/FOXP1-ES consensus binding sequences, or mutated derivatives of these sequences, separated by two cytosines. EMSAs were performed as described in (Hellman and Fried, 2007).
ChIP-Seq experiments were performed as described previously (Schmidt et al., 2009), using an anti-FOXP1 (Abcam) antibody and 5×107 H9 hESCs per sample. Immunoprecipitated DNA and a fraction of the total input DNA (after fragmentation) were used to generate dsDNA libraries, which were prepared using the Genomic DNA Sample preparation kit (Illumina, FC-102-1001) as per the manufacturer's protocol.
CGR8 cells were fixed in 4% formaldehyde/PBS for 10 min at room temperature, washed with PBS, permeabilized for 3 min at 4° C. with 0.1% Triton X100, and then incubated with blocking solution (0.01% goat serum, 10 mg/ml BSA, 0.2% Tween-20) for one hour at 37° C. Cells were incubated with primary antibodies (polyclonal β-III Tubulin, T2200 Sigma-Aldrich and murine monoclonal Oct4 Pierce) in blocking solution (1% goat serum, 1% BSA, 0.2% Tween20 in PBS) for two hours at 37°, and then with secondary antibodies in blocking solution for two hours at 37° C. Cells were washed in PBS, incubated with Hoechst dye (Sigma-Aldrich, B2883), mounted with Aqueous mounting Medium (Permafluor #0725) and images were acquired by epi-fluorescence imaging as previously described (Samavarchi-Tehrani et al., 2010).
iPSC Reprogramming Assays and Imaging
2°-6C MEF cells were cultured and induced to express OKMS factors as previously described (Woltjen et al., 2009). Single or pooled siRNAs (Dharmacon/ThermoFisher) were transfected 0 or 13 days after induction of OKMS factors, and cells were cultured for another 5 or 3 days prior to harvesting for total RNA, or analysis by Flow-cytometry and immunostaining, which were performed as previously described (Samavarchi-Tehrani et al., 2010).
As discussed above, Muscleblind-like proteins MBNL1 and MBNL2 have been identified as negative regulators of an ESC/iPSC-specific AS switch in the forkhead family transcription factor FOXP1 that controls pluripotency. These proteins are expressed at lower levels in ESCs than in diverse differentiated cells and tissues, and their knockdown promotes ESC/iPSC-specific AS in differentiated cells. It is further shown that among a large network of ESC-specific AS events in genes with diverse functions associated with ESC biology, approximately half are regulated by MBNL proteins. Consistent with a central role for MBNL proteins in the core pluripotency circuitry, their knockdown promotes the expression of key pluripotency genes early on during the reprogramming of somatic cells to induced pluripotent stem cells.
To identify trans-acting regulators of ESC-specific AS events, high-throughput RNA sequencing (RNA-Seq) was used to detect human and mouse splicing factor genes that are differentially expressed between ESCs and a diverse panel of non-ESC lines and differentiated tissues. From the same RNA-Seq datasets, sets of human and mouse cassette alternative exons that display differential regulation between ESCs and non-ESCs and tissues were defined. A splicing code inference method (Barash et al. 2010; Xiong et al. 2011) was employed to predict cis-acting elements required for the regulation of these ESC/iPSC-specific AS events. By integrating results from these data, differentially expressed splicing regulators corresponding to defined binding sequences the code predicts to be important for controlling ESC-specific AS were identified.
The mRNA expression levels of 221 known or putative splicing regulators across 7 human ESC/iPSC lines and 28 diverse other cell line and tissue samples were determined using unique-mapping RNA-Seq reads and the metric “corrected reads per kilobase cDNA per million reads” (cRPKM). Eleven of these 221 genes are significantly differentially-expressed between ESCs and the other cells and tissues (Bonferroni-corrected p<0.05, Wilcox test). MBNL1 and MBNL2 have the lowest overall mRNA expression levels in ESCs and iPSCs compared with almost all of the other analyzed cell lines and tissues. Quantitative RT-PCR assays confirmed these observations (
From the RNA-Seq data 208 human and 104 mouse ESC-regulated cassette AS events were defined, with each set comprising comparable numbers of exons that are on average ≧25% more included or more skipped in ESCs versus the other profiled cell lines and tissues. Alternative splicing of ESC-regulated exons are more conserved between human and mouse than are non-regulated exons (58.7% vs. 23.5% in human, p<2.2e-16, Fisher test). When comparing exons in both species that meet minimal required expression criteria for comparing ESC and non-ESC percent spliced in (PSI) levels, 27 (13-26%) of the human and mouse ESC-specific AS events overlapped. In addition to detecting previously identified ESC-specific AS events such as FOXP1 exon 18b, examples of novel conserved ESC-specific AS events are in MTA1, a component of the NuRD chromatin remodeling complex that has previously been implicated in the control of pluripotency (Liang et al. 2008), and in the protein kinases CASK, MARK2 and MAP2K7. These results thus suggest a more extensive role for regulated AS events in fundamental aspects of ESC biology than previously appreciated.
Using the splicing code, it was inferred the combinations of cis-regulatory elements that are predictive of ESC-specific AS in human and mouse. Consistent with the analysis of differential expression of trans-acting factors, consensus binding sites for MBNL proteins are the most strongly predictive of exons with both higher or lower inclusion levels in ESCs relative to other cells and tissues (
Because MBNL1/Mbnl1 and MBNL2/Mbnl2 proteins are expressed at very low levels in human and mouse ESCs compared to other cell types and tissues, without being bound by theory, it is hypothesized that these factors may function by repressing the inclusion of ESC-specific exons in non-ESCs, and, conversely, that they may also function by activating the inclusion of exons in non-ESCs that are normally skipped in ESCs. Previous studies performed in differentiated cell lines and tissues have shown that MBNL proteins can suppress exon inclusion when they bind upstream flanking intronic sequences, whereas they can promote inclusion when binding to downstream flanking intron sequences. The results from the splicing code are entirely consistent with this proposal: predicted MBNL binding elements are significantly enriched in intronic sequence upstream of exons that are specifically included in ESCs, and they are significantly enriched in intronic sequence downstream of exons that are specifically skipped in ESCs (
To directly test whether MBNL1/Mbnl1 and MBNL2/Mbnl2 proteins regulate ESC-specific AS events, siRNAs were transfected to knockdown these proteins individually and together in differentiated human and mouse cell lines and, for control and comparison purposes, in ESCs. RT-PCR assays were then used to monitor changes in the inclusion levels of ESC-specific AS events. Western blotting indicated that knockdown in all tested differentiated cells was efficient, resulting in less than 10% of the endogenous proteins remaining (
Knockdown of MBNL2 in 293T cells had no apparent effect on exon 18b inclusion. In contrast, knockdown of MBNL1 alone, and together with MBNL2, resulted in shifts from zero to 2.4 and 6.5 exon PSI, respectively (
The extent to which ESC/iPSC-specific AS events are regulated by MBNL proteins was investigated next. MBNL1 and MBNL2 were simultaneously knocked down in HeLa cells, as described above, and RNA-Seq profiling was used to detect knockdown-dependent AS level changes (
Given the important role for MBNL/Mbnl proteins in the regulation of FOXP1/Foxp1 exons 18b/16b, which control ESC pluripotency, it was next asked whether these splicing regulators control the expression of key pluripotency genes during the process of iPSC reprogramming. Secondary mouse embryonic fibroblasts (MEFs) expressing the four Yamanka factors “OKMS” (Oct4, Klf4, c-Myc, and Sox2) under doxycycline (dox)-inducible control were transfected with siRNA pools that knockdown Mbnl1 and Mbnl2, or with a control non-targeting siRNA pool. A siRNA pool targeting Oct4 was transfected as a negative control. The expression of several pluripotency factors and markers, including Oct4, Nanog, Sall4 and Alpl, were measured by qRT-PCR assays three and five days post induction of OKMS (
While the present disclosure has been described with reference to a number of examples, it is to be understood that the disclosure is not limited to the disclosed examples. To the contrary, the disclosure is intended to cover various modifications and equivalent arrangements included within the spirit and scope of the appended claims.
All publications, patents and patent applications are herein incorporated by reference in their entirety to the same extent as if each individual publication, patent or patent application was specifically and individually indicated to be incorporated by reference in its entirety.
This non-provisional application claims priority from U.S. provisional application No. 61/503,206 filed on Jun. 30, 2011, which is incorporated herein by reference in its entirety.
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/CA2012/000616 | 6/29/2012 | WO | 00 | 12/20/2013 |
Number | Date | Country | |
---|---|---|---|
61503206 | Jun 2011 | US |