This invention relates to targeting isolation and identification of a special protein group in biotechnology field. Specifically, this invention relates to the strategy for isolation and identification of organism endogenous transcription factor complexes. Also, this invention relates to the corresponding tandem arrays of concatenated transcription factor response elements (cat TFRE).
About 6% of genes in the human genome encode transcription factors (TF) which are the second largest category of proteins encoded by the human genome. Transcription factors play important roles in the regulation of gene expression and also are key nodes for intracellular signaling network, wherein, various signaling pathways triggered by intracellular and extracellular stimulations are cross linked with each other via transcription factors. Thus, transcription factors and their complexes have been attracting great concern. However, due to their low-abundance expression level (only accounting for 0.01-0.001% of total proteins within cells), it is very difficult to purify and identify transcription factors and complexes at protein level. Purification of transcription factors by a conventional chromatography method usually needs hundreds of liters of cell cultures. However, the yielded protein which has been enriched 10,000-100,000 times is barely enough for chemical and functional analysis. While antibodies are undoubtedly the best affinity reagents for detecting proteins, in fact, the commercial available antibodies with high and specific affinity to certain transcription factors are limited. Furthermore, the generation of usable antibodies for detecting endogenous proteins is a process of trial and error. Thus, applications of antibodies to affinity purification of endogenous transcription factors and their complexes are limited. To date, only less than 5% of transcription factors have been purified and identified. Therefore, a method for the purification and identification of the entire family of endogenous transcription factors is in great need.
Transcription factors regulate gene expression by binding to DNA cis-elements located in the regulatory region of target genes. Transcription factors include general transcription factors (e.g., subunits of general transcription factor II (TF II) complex, TATA-binding protein and etc.) and specific transcription factors (such as Spl, C/EBP, AP1 and etc.). During transcription, specific transcription factors bind to promoters and general transcription factors are recruited to the DNA sequence at 40˜60 base pairs upstream or downstream from the transcriptional initiation site, initiating the synthesis of RNA. Recently, more and more researches have found that structural properties of DNA binding elements will affect the formation of the transcriptional initiating complex. In other words, the nucleic acid composition of the transcription factor binding site will affect the recruitment of its co-regulators, which to a certain extent determine whether the transcription factor will act as an activator or a repressor of target genes. For this reason, it is very important to develop a method for isolation and identification of endogenous transcription factors and their complexes. The applications of such methods will shed light on the understanding of the transcriptional network of target genes.
Endogenous protein levels of transcription factors are usually very low and it is difficult to analyze the expression profile of transcription factors on proteome scale by conventional methods. To profile the endogenous transcription factors in cells or tissues, it is necessary to enrich and isolate the transcription factors by affinity purification strategy using specific reagents (such as antibodies). However, limited types of antibodies and the high cost constrained the affinity purification of endogenous transcription factors by using antibodies. In addition, only a few transcription factors can be analyzed by antibody affinity purification in a single experiment. It is hard to enrich and identify most of transcription factors expressed in certain cells or tissues by this strategy.
The present invention provides a method for enrichment and isolation of endogenous transcription factors and their complexes and also provides corresponding tandem arrays of concatenated concatenated transcription factor response elements (catTFRE).
The tandem arrays of catTFRE provided in this invention are DNA sequences obtained by concatenating mono or multiple copies of respective DNA response elements of one or more transcription factors with 3˜5 bp nucleic acid linkers.
Specifically, the above-mentioned one or more transcription factors may be selected from the group consisting of AP1, AR, BRCA1, CEBPA, CREB1, E2F1, ELK1, ELK4, ESR1, ETS1, EWSR1-FLI1, FEV, FOXA1, FOXC1, FOXD1, FOXF2, FOXI1, FOXL1, FOXO3, Fra-1, GATA2, GATA3, GR, HIF1A::ARNT, HLF, HNF1B, HNF4A, HOXA5, INSM1, IRF1, IRF2, JunB, JunD, MAX, MEF2A, MIZF, MYC::MAX, Myf, MZF1—1-4, MZF1—5-13, NF-kappaB, NFATC2, NFE2L2, NFIC, NFL3, NFKB1, NFYA, NHLH1, NKX3-1, NR1H2::RXRA, NR2F1, NR3C1, NR4A2, Pax6, PBX1, PDX1, PLAG1, PPARG, PR, PXR-1:RXR-alpha, RAR-alpha, RAR-alpha:RXR-gam, RAR-beta:RXR-alpha, REL, RELA, REST, RFX1, RFX2, RFX3, RFX5:RFXAP:RFXANK, RORA—1, RORA—2, RREB1, RXR::RAR_DR5, RXRA::VDR, SOX10, SOX9, SP1, SPI1, SPIB, SRF, SRY, STAT1, STAT5A, T3R-beta1, TAL1::TCF3, TBP, TEAD1, TFAP2A, TLX1::NFIC, TP53, USF1, WT1-del2, WT1-KTS, WT1I, WT1I-del2, WT1I-KTS, XBP-1, YY1 and ZNF354C.
Specifically, the tandem arrays nucleotide sequence of the catTFRE may be obtained by concatenating dual copies of respective DNA response elements of one or more transcription factors selected from AP1, AR, BRCA1, CEBPA, CREB1, E2F1, ELK1, ELK4, ESR1, ETS1, EWSR1-FLI1, FEV, FOXA1, FOXC1, FOXD1, FOXF2, FOXI1, FOXL1, FOXO3, Fra-1, GATA2, GATA3, GR, HIF1A::ARNT, HLF, HNF1B, HNF4A, HOXA5, INSM1, IRF1, IRF2, JunB, JunD, MAX, MEF2A, MIZF, MYC::MAX, Myf, MZF1—1-4, MZF1—5-13, NF-kappaB, NFATC2, NFE2L2, NFIC, NFIL3, NFKB1, NFYA, NHLH1, NKX3-1, NR1H2::RXRA, NR2F1, NR3C1, NR4A2, Pax6, PBX1, PDX1, PLAG1, PPARG, PR, PXR-1:RXR-alpha, RAR-alpha, RAR-alpha:RXR-gam, RAR-beta:RXR-alpha, REL, RELA, REST, RFX1, RFX2, RFX3, RFX5:RFXAP:RFXANK, RORA—1, RORA—2, RREB1, RXR::RAR_DR5, RXRA::VDR, SOX10, SOX9, SP1, SPI1, SPIB, SRF, SRY, STAT1, STAT5A, T3R-beta1, TAL1::TCF3, TBP, TEAD1, TFAP2A, TLX1::NFIC, TP53, USF1, WT1-del2, WT1-KTS, WT1I, WT1I-del2, WT1I-KTS, XBP-1, YY1 and ZNF354C with 3-5 bp nucleic acid linkers.
In one exemplary embodiment, the tandem arrays nucleotide sequence of the catTFRE may be obtained by concatenating dual copies of respective DNA response elements of 100 transcription factors including AP1, AR, BRCA1, CEBPA, CREB1, E2F1, ELK1, ELK4, ESR1, ETS1, EWSR1-FLI1, FEV, FOXA1, FOXC1, FOXD1, FOXF2, FOXI1, FOXL1, FOXO3, Fra-1, GATA2, GATA3, GR, HIF1A::ARNT, HLF, HINF1B, HNF4A, HOXA5, INSM1, IRF1, IRF2, JunB, JunD, MAX, MEF2A, MIZF, MYC::MAX, Myf, MZF1—1-4, MZF1—5-13, NF-kappaB, NFATC2, NFE2L2, NFIC, NFIL3, NFKB1, NFYA, NHLH1, NKX3-1, NR1H2::RXRA, NR2F1, NR3C1, NR4A2, Pax6, PBX1, PDX1, PLAG1, PPARG, PR, PXR-1:RXR-alpha, RAR-alpha, RAR-alpha:RXR-gam, RAR-beta:RXR-alpha, REL, RELA, REST, RFX1, RFX2, RFX3, RFX5:RFXAP:RFXANK, RORA 1, RORA 2, RREB1, RXR::RAR_DR5, RXRA::VDR, SOX10, SOX9, SP1, SPI1, SPIB, SRF, SRY, STAT1, STAT5A, T3R-beta1, TAL1::TCF3, TBP, TEAD1, TFAP2A, TLX1::NFIC, TP53, USF1, WT1-del2, WT1-KTS, WT1I, WT1I-del2, WT1I-KTS, XBP-1, YY1 and ZNF354C with 3-5 bp nucleic acid linkers.
In one specific embodiment, the tandem array nucleotide sequence of the catTFRE is represented by Seq: No: 1 in the sequence list.
Seq: No: 1 in the sequence list is consisted of 2800 base pairs, which contains dual-copies of core response elements of the above-mentioned 100 transcription factors, and each of the dual-copy core response elements is spaced from the adjacent one by three random base pairs.
The second object of the present invention is to provide a method for enrichment and isolation of endogenous transcription factors and their complexes.
The method for enrichment and isolation of endogenous transcription factors and their complexes provided in this invention comprises the steps of
In the above-mentioned method for enrichment and isolation of endogenous transcription factors and their complexes, the target vector in step 1) may be pUC57, pET24a+, pGEX4T-2, pGEX4T-1, pCMV-Myc, pGH, pcDNA-Myc, and etc.
The nucleotide sequence of the forward primer in step 2) is represented by Seq: No: 2 in the sequence list and the nucleotide sequence of the reverse primer is represented by Seq: No: 3 in the sequence list. The PCR reaction system of 100 μl is as follows: 10×ExTaq Buffer, 10 μl; dNTPs (2.5 mM/dNTP), 10 μl; pUC57-sdTF, 1 μl (50 ng); each of forward and reverse primers, 1 μl (1 nmol); ExTaq, 0.5 μl; H2O, 87.5 μl. The reaction conditions for PCR is as follows: 94° C. for 2 min at first; subsequently, 94° C. for 45 s, 60° C. for 45 s, 72° C. for 2 min, 35 cycles in total; then 72° C. for 7 min; 4° C. for 30 min at last.
The method in step 3) for immobilizing the DNA bait to the streptavidin-coated magnetic beads comprises the steps of
The nuclear extracts used in step 4) are extracted by employing homogenization procedure with Dounce homogenizer. Specifically, cells are suspended in a low-salt hypertonic buffer for 10 min and then homogenized to separate nuclear and cytoplasm fractions. Homogenate is spin at 4000×g for 15 min. Nuclear pellet is re-suspended with a low-salt solution and treated by Dounce for 10 times. Then, the salt concentration is adjusted to 300 mM with a high-salt solution. NE is spin down at 60,000 RPM in Ultracentrifuge (Beckman Optima TLA 100 rotor) for 20 min at 4° C. The supernatant is taken and dialyzed with BC150 solution till final salt concentration reached 150 mM. Specific procedures are as follows: cells are harvested by centrifugation at 1000×g under 4° C. for 10 min; cell pellet is washed with 1×PBS and re-suspended with a hypotonic solution at 10 times of the precipitate volume; after stayed on ice for 10 min, cells are harvested by centrifugation at 1000×g tinder 4° C. for 10 min; the cell pellet is re-suspended with a hypotonic solution of ¼ volume of that of the pellet and homogenized 15 times with a Dounce homogenizer; nuclear and cytoplasm fractions are separated by centrifugation at 4000×g under 4° C. for 15 min; the nucleus pellet is re-suspended with a low-salt buffer of ½ volume of that of nucleus and then homogenized 10 times with a Dounce homogenizer at 4° C.; the solution is transferred to a centrifugal tube; a high-salt buffer of ½ volume of that of nucleus pellet is added drop by drop while the solution is gently stirred; the solution mixture is rotated at 4° C. for 30 min and then centrifuged at 25,000×g under 4° C. for 20 min; the supernatant is dialyzed at 4° C. for 30 min in a BC150 buffer; the nuclear extract is aliquoted and quick-frozen with liquid nitrogen, and then reserved at −80° C. for future use. For enrichment and isolation of endogenous transcription factors, 4˜8 mg of nuclear extract is incubated with the magnetic beads obtained in step 3) at 4° C. for 2 hr. The unbound proteins are washed with NETN (50 mM NaCl, 0.25% NP-40) twice and then with PBS for three times, each for 10 s. By now, endogenous transcription factors and their complexes are enriched.
Depending on the purposes of application, the method above may further comprise a step of eluting the endogenous transcription factors and their complexes which bind to DNA bait immobilized on the magnetic beads. Alternatively, the method of the present invention may further contain a step of identification of endogenous transcription factors and their complexes captured by the DNA bait in step 4), which comprises steps of digesting the endogenous transcription factors and their complexes by trypsin, drying the digested peptides and identifying the components of endogenous transcription factors and their complexes by mass spectrometry. The digestion procedure is as follows: 45 μl of 50 mM NH4HCO3 (pH 8.0) is added to the magnetic beads after washing and then 10 μl of trypsin (Promega) solution (100 μg/ml) is added to digest the targets at 37° C. overnight; then 5 μl of trypsin (Promega) solution (100 μg/ml) is added again for digestion at 37° C. for 1 hour; the peptides are extracted from beads with 200 μl of 50% acetonitrile (containing 0.1% of formic acid); the mixture is shaken fiercely for 10 min and the supernatant is transferred to a clean tube; the whole mentioned procedure is repeated once. Thereafter, supernatants are combined and dried. Then, the components of endogenous transcription factors and their complexes are identified by mass spectrometry. The procedure for MS analysis is as follows: Tryptic peptides are dissolved with loading buffer (5% Methanol, 0.1% Formic acid) and then separated on an on-line C18 column (75 μm-inner diameter, 360 μm-outer diameter, 10 cm, 3 μm, C18). Mobile phase A is consisted of 0.1% formic acid in water solution and mobile phase B is consisted of 0.1% formic acid in acetonitrile solution; a linear gradient from 3 to 100% B over a 75 minute period at a flow rate of 350 nL/min is applied. For identification, peptides are fragmented by collision-induced dissociation (CID) and analyzed by the LTQ-Orbitrap Velos (Thermo, Germany). The survey scan is limited to 375-1600 m/z. Proteins are identified using the Proteome Discoverer 1.3 using MASCOT search engine and appropriate reference sequence protein database from NCBI. Threshold score/expectation value for accepting individual spectra is set to ion score 10. The PSM false positive rate is set to 1% strict/5% relaxed cutoff. The mass tolerance is set at 20 ppm for precursors and 0.5 Da for product ions. Dynamic modifications of oxidation (Met), acetylation (protein N-terminus), phosphorylation (ST) and Destreak (C) are chosen. Maximum missed cleavage sites are set to be 2.
Other methods such as Western Blotting, ELISA and etc. may, of course, be adopted to verify specific transcription factors or co-regulatory proteins as well.
The use of the catTFRE in the enrichment and isolation of endogenous transcription factors and their complexes is also contained in the present invention.
A further object of the present invention is to provide a test chip or an ELISA assay kit for detection of endogenous transcription factors or their complexes, which contains the biotinylated catTFRE as affinity reagents as mentioned above.
Hereinabove, we provide a method for enrichment and isolation of endogenous transcription factors and their complexes by employing the binding property of transcription factors to sequence-specific DNA response elements. We have surveyed the response element of various transcription factors and tandemly combined them into a concatenated tandem array of the consensus TF response element sequence. The DNA bait, biotinylated catTFRE with arms of 200 bp, is produced through molecular cloning technology. For enrichment and isolation of endogenous transcription factors and their complexes, the biotinylated DNA baits are immobilized to streptavidin-coated magnetic beads. Then the immobilized catTFRE is incubated with nuclear extract. After unbound proteins are washed away, endogenous transcription factors and their complexes are enriched and isolated. At last, identification by mass spectrometry or functional characterization of certain transcription factors by other methods can be employed according to different application purposes. The present invention further comprises the design of “DNA bait”. Specifically, such a design comprises of steps of producing DNA sequence containing multiple copies of DNA response elements by strategies of de novo synthesis or in vitro ligation, ligating DNA sequence to target vector, designing and synthesizing a pair of biotin-labeled primers annealed to two ends of the multiple cloning site of the vector and then obtaining the biotinylated “DNA bait” by PCR. The 200 bp arms of DNA baits allow the formation of a spatial structure which facilitates the binding of transcription factors when the “DNA bait” is immobilized to magnetic beads. The present invention adopts the DNA binding property of transcription factors to enrich and isolate endogenous transcription factors and their complexes. Since the affinity of transcription factor to its consensus binding sites is several orders of magnitude higher than that to non-specific DNA, using DNA sequences containing consensus binding sites of transcription factors is a relatively direct method for isolation of transcription factors and associated proteins. In addition, it is easier to obtain DNA than antibodies. Furthermore, native conformations of transcription factors can be maintained upon binding to its consensus DNA element. Therefore, using DNA consensus elements to affinity purify transcription factors and their complexes has greater advantages, which provides a powerful tool for characterizing the composition of transcription factor complexes and analyzing their dynamic behaviors.
Hereinafter, the present invention will be further described in detail in conjunction with specific examples.
Examples are conducted on basis of the technical solution of the present invention and detailed embodiments and concrete procedures are provided. However, the protection scope of the present invention is not limited to the following examples.
Methods used in the following examples are all conventional methods unless otherwise indicated.
Transcription factors and their complexes in nuclear extract of mouse liver hepatocytes are isolated and identified using the method of the present invention. Specifically, the method comprises the following steps as shown in
1. Obtaining of catTFRE
In the example, 100 transcription factors and their DNA response elements as shown in Table 1, and tandem arrays of the catTFRE were designed and synthesized to enrich and isolate endogenous transcription factors and their complexes. The tandem arrays sequences of catTFRE were obtained by randomly concatenating dual copies of respective DNA response elements of transcription factors including AP1, AR, BRCA1, CEBPA, CREB1, E2F1, ELK1, ELK4, ESR1, ETS1, EWSR1-FLI1, FEV, FOXA1, FOXC1, FOXD1, FOXF2, FOXI1, FOXL1, FOXO3, Fra-1, GATA2, GATA3, GR, HIF1A::ARNT, HLF, HNF1B, HNF4A, HOXA5, INSM1, IRF1, IRF2, JunB, JunD, MAX, MEF2A, MIZF, MYC::MAX, Myf, MZF1—1-4, MZF1—5-13, NF-kappaB, NFATC2, NFE2L2, NFIC, NFIL3, NFKB1, NFYA, NHLH1, NKX3-1, NR1H2::RXRA, NR2F1, NR3C1, NR4A2, Pax6, PBX1, PDX1, PLAG1, PPARG, PR, PXR-1:RXR-alpha, RAR-alpha, RAR-alpha:RXR-gam, RAR-beta:RXR-alpha, REL, RELA, REST, RFX1, RFX2, RFX3, RFX5:RFXAP:RFXANK, RORA—1, RORA—2, RREB1, RXR::RAR_DR5, RXRA::VDR, SOX10, SOX9, SP1, SPI1, SPIB, SRF, SRY, STAT1, STAT5A, T3R-beta1, TAL1::TCF3, TBP, TEAD1, TFAP2A, TLX1::NFIC, TP53, USF1, WT1-del2, WT1-KTS, WT1I, WT1I-del2, WT1I-KTS, XBP-1, YY1 and ZNF354C with linkers of 3˜5 base pairs. Seq: No: 1 in the sequence list is consisted of 2800 base pairs, showing a tandem array nucleotide sequence of the catTFRE containing dual-copies of respective DNA response elements of the 100 transcription factors, having 3 base pairs between adjacent DNA response elements.
2. Construction of a Recombinant Vector Carrying the catTFRE
The catTFRE obtained in step 1 was inserted into the multiple cloning site of the target vector pUC57 to get a recombinant vector carrying the catTFRE. Specific method was as follows: de novo synthesis was performed to obtain a catTFRE DNA of 2.8 kb length (Seq: No: 1 in the sequence list) and the synthesized catTFRE was inserted to the pUC57 vector by using restrictive enzymes EcoRI and HindIII. The recombinant vector was transformed and amplified in the E. coli DH5a strain, which can be used as the template of PCR for biotinylated catTFRE.
A pair of primers labeled with biotin was designed and synthesized, of which the forward and reverse primers can be annealed to sequences at 200 bps upstream and downstream from the multiple cloning site of target vector.
The nucleotide sequence of the forward primer was: 5′-CATTCAGGCTGCGCAACTGTTG-3′ (Seq: ID: 2 in the sequence list).
The nucleotide sequence of the reverse primer was: 5′-GTGAGTTAGCTCACTCATTAGG-3′ (Seq: ID: 3 in the sequence list).
PCR amplification was performed with biotinylated primers using the recombinant vector carrying the catTFRE obtained in step 2 as the template. PCR reaction system of 100 μl was prepared as follows: 10×ExTaq Buffer, 10 μl; dNTPs (2.5 mM/dNTP), 10 μl; pUC57-sdTF, 1 μl (about 50 ng); each of forward and reverse primers, 1 μl (1 nmol); ExTaq, 0.5 μl; H2O, 87.5 μl. The reaction conditions for PCR was as follows: 94° C. for 2 min at first; subsequently, 94° C. for 45 s, 60° C. for 45 s, 72° C. for 2 min, 35 cycles in total; then 72° C. for 7 min; 4° C. for 30 min at last. The PCR product was purified with Minigel purification kit.
The biotinylated DNA bait obtained in step 3 was immobilized to streptavidin-coated magnetic beads (Dynabeads® M-280 streptavodin (Invitrogen)). Specifically, the following steps were done.
Nuclear extract from mouse liver hepatocytes was extracted as follows: cells were harvested by centrifugation at 1000×g under 4° C. for 10 min; the cell pellet was washed with 1×PBS and re-suspended with a hypotonic solution (10 mM Tris-HCl pH7.3, 1.5 mM MgCl2, 10 mM KCl, adding 10 mM β-ME and 1 mM PMSF before use) at 10 times of the precipitate volume; the mixture was stayed on ice for 10 min and then the cells were harvested by centrifugation at 1000×g undef 4° C. for 10 min; the cell pellet was re-suspended with a hypotonic solution of ¼ volume of that of the pellet and then homogenized 15 times with a Dounce homogenizer; nuclear and cytoplasm fractions were separated by centrifugation at 4000×g under 4° C. for 15 min; the nucleus pellet was re-suspended with a low salt buffer (20 mM Tris-HCl pH7.3, 1.5 mM MgCl2, 20 mM KCl, 0.2 mM EDTA, 25% glycerol, adding 10 mM β-ME and 1 mM PMSF before use) of ½ volume of that of cells and then homogenized 10 times with a Dounce homogenizer at 4° C.; the solution was transferred to a centrifugal tube; a high salt buffer (20 mM Tris.HCl pH7.3, 1.5 mM MgCl2, 1.2 M KCl, 0.2 mM EDTA, 25% glycerol, adding 10 mM β-mercaptoethanol and 0.5× Protein inhibitors before use) of ½ volume of that of the nucleus pellet was added drop by drop while the mixture was gently stirred; the solution mixture was rotated at 4° C. for 30 min and then centrifuged at 25,000×g under 4° C. for 20 min; the supernatant was dialyzed at 4° C. for 30 min in a BC150 buffer (20 mM Tris.HCl pH7.3, 0.15 mM KCl, 0.2 mM EDTA, 20% glycerol, adding 10 mM β-ME and 1 mM PMSF before use). The nuclear extract was aliquoted and quick-frozen with liquid nitrogen, which was then reserved at −80° C. for future use.
For enrichment and isolation of endogenous transcription factors, 200-800 μl of nuclear extract (4˜8 mg) was centrifuged at 100,000×g under 4° C. for 20 min. The supernatant was transferred to a clean Eppendorf tube and 1 mM EDTA, 50 mM NaCl and 0.5 mmol PMSF were added. After determining its concentration by Bradford assay, the supernatant was incubated with the magnetic beads obtained in step 4 at 4° C. for 2 hr. The unbound proteins were washed away with NETN (50 mM NaCl, 0.25% NP-40) twice and then with PBS for three times, each for 10 s. By now, endogenous transcription factors and their complexes were enriched on the beads.
6. Identification of Endogenous Transcription Factors and their Complexes by Mass Spectrometry
In order to evaluate the capacity of the method provided in this invention in enriching and isolating endogenous transcription factors and their complexes, mass spectrometry was used to identify the components of protein mixture captured by DNA bait. The protein mixture was firstly digested by trypsin as follows: 45 μl of 50 mM NH4HCO3 (pH 8.0) was added to the magnetic beads after washing, and then 10 μl of trypsin (Promega) solution (100 μg/ml) was added; digest was performed at 37° C. overnight; then 5 μl of trypsin (Promega) solution (100 μg/ml) was added again and digestion was performed at 37° C. for 1 more hour; peptides were extracted from beads with 200 ul of acetonitrile (contains 0.1% of formic acid); the supernatant was transferred to a clean tube and the extraction was repeated once; the solutions were combined and dried and then the components of protein mixture were identified by mass spectrometry. The procedure for MS analysis was as follows: Tryptic peptides were dissolved with loading buffer (5% Methanol, 0.1% Formic acid) and then separated on an on-line C18 column (75 μm inner diameter, 360 μm outer diameter, 10 cm, 3 μm C18). Mobile phase A was consisted of 0.1% formic acid in water solution and mobile phase B was consisted of 0.1% formic acid in acetonitrile solution; a linear gradient from 3 to 100% B over a 75 minute period at a flow rate of 350 nL/min was applied. For identification, peptides were fragmented by collision-induced dissociation (CID) and analyzed by the LTQ-Orbitrap Velos (Thermo, Germany). The survey scan was limited to 375-1600 m/z. Proteins were identified using the Proteome Discoverer 1.3 using MASCOT search engine and appropriate reference sequence protein database from NCBI. Threshold score/expectation value for accepting individual spectra was set to ion score 10. The PSM false positive rate was set to 1% strict/5% relaxed cutoff. The mass tolerance was set at 20 ppm for precursors and 0.5 Da for product ions. Dynamic modifications of oxidation (Met), acetylation (protein N-terminus), phosphorylation (ST) and Destreak (C) were chosen. Maximum missed cleavage sites were set to be 2.
As a result, up to 391 endogenous transcription factors (shown in Table 2) were identified from the sample in this experiment. It showed that a great amount of endogenous transcription factors were captured by catTFRE from nuclear extract of mouse liver hepatocytes. Therefore, the method of the present invention can be widely applicable for identification of endogenous transcription factors at a large scale, as well as for validation and quantification of specific transcription factors. More importantly, transcription factors, especially the superfamily of nuclear receptors, are attractive targets in current drug development, and some available drugs typically exert their potency by activating/inhibiting transcription factors. On the other hand, drugs are characterized in the complexity of mechanism and diversity of targets. Due to the property of multiple targets, practical effects of drugs are usually different from initial expectation, e.g., they may bring toxic or side effects, or, they may have some “unexpected” effects in treating other diseases. Full scanning on dynamic changes of transcription factors is especially important in research of pharmaceutical mechanism and side effects. The method of the present invention adopts catTFRE to enrich and isolate endogenous transcription factors in a large scale. The enriched transcription factors should be identified and quantified by appropriate methods. Using this approach, it is possible to analyze the dynamic endogenous transcription factors stimulated by certain drugs, which would provide some clues for 1) the targets of the drugs and its pharmacological mechanism; and 2) candidate targets of the drugs and the corresponding potential side effects.
It's important to note that DNA response elements of a certain transcription factor can usually enrich multiple members of a transcription factor superfamily, since members belonging to a transcription factor superfamily usually bind similar DNA sequences. For example, the nuclear receptor superfamily (48 members in human) tends to bind DNA elements containing a consensus half site with the sequence of AGGTCA. The above-mentioned reasons may account for the phenomenon that the number of transcription factors, 391, as detected in the liver in the experiment above is higher than the number of transcription factors to be enriched by catTFRE as designed.
In addition to the application to profile endogenous transcription, factors in biological organisms, tissues or cells, the catTFRE provided in this invention can also be used to develop assay kits or chips for screening endogenous transcription factors. For example, an ELISA assay kit or a test chip for detection of endogenous transcription factors can be developed by coating binding elements to a 96-well plate or on the surf ace of a solid substrate.
Number | Date | Country | Kind |
---|---|---|---|
201110457108.X | Dec 2011 | CN | national |