Assay for the detection of factors that modulate the expression of INGAP

Information

  • Patent Application
  • 20030207301
  • Publication Number
    20030207301
  • Date Filed
    January 09, 2003
    21 years ago
  • Date Published
    November 06, 2003
    20 years ago
Abstract
A reporter construct contains mammalian INGAP 5′-regulatory region or a fragment thereof, a minimal promoter element from mammalian INGAP or a heterologous promoter, and a reporter gene. The reporter construct can be used to screen for agents which alone or in combination up-regulate or down-regulate reporter gene expression. Alternatively, the reporter construct can be used to screen for agents that bind to the hamster INGAP 5′-regulatory region or a fragment thereof.
Description


FIELD OF THE INVENTION

[0002] The invention relates to the field of assays for the detection of factors that modulate gene expression. Specifically, the invention relates to reporter constructs and methods for identifying agents that modulate the expression of the INGAP gene.



BACKGROUND OF THE INVENTION

[0003] Islet neogenesis gene associated protein (INGAP protein) has been identified as a pancreatic acinar cell protein that can induce islet cell neogenesis from progenitor cells resident in the pancreas in a manner that recapitulates islet development during normal embryogenesis. INGAP is unique in its ability to stimulate growth and differentiation of islets of Langerhans from precursor cells associated with pancreas. These islets evolve a mature insulin secretory profile capable of responding to perturbations in blood glucose in a physiologic manner. This potential anti-diabetic therapeutic has been shown to demonstrate homology across several species and to exert a biological response.


[0004] Pancreatic islet cell mass is lost in type 1 diabetes mellitus, a disease in which a progressive autoimmune reaction results in the selective destruction of insulin-producing β-cells. In type 2 diabetes mellitus, so-called adult-onset disease, but also increasingly a condition in young overweight people, the β-cell mass may be reduced by as much as 60% of normal. The number of functioning β-cells in the pancreas is of critical significance for the development, course, and outcome of diabetes. In type I diabetes, there is a reduction of β-cell mass to less than 2% of normal. Even in the face of severe insulin resistance as occurs in type II diabetes, the development of diabetes only occurs if there is inadequate compensatory increase in β-cell mass. Thus, the development of either of the major forms of diabetes can be regarded as a failure of adaptive β-cell growth and a subsequent deficiency in insulin secretion. Stimulating the growth of islets and β-cells from precursor cells, known as islet neogenesis, is an attractive approach to the amelioration of diabetes. There is need in the art for methods to identify agents that can modulate the expression of INGAP, whether in animals or in cultured cells.



BRIEF SUMMARY OF THE INVENTION

[0005] It is an object of the invention to provide a reporter construct containing the 5′-regulatory region from mammalian INGAP gene.


[0006] It is another object of the invention to provide methods for identifying agents which modulate INGAP expression.


[0007] It is another object of the invention to provide a nucleic acid or fragment of INGAP 5′-regulatory region.


[0008] It is another object of the invention to provide methods for increasing INGAP expression.


[0009] It is another object of the invention to provide a kit for modulating INGAP expression.


[0010] These and other objects of the invention are provided by one or more of the embodiments described below.


[0011] In one aspect of the invention a reporter construct is provided. The reporter construct comprises a regulatory region nucleotide sequence and a nucleotide sequence encoding a detectable product. In one aspect of the invention, the reporter construct is provided in a vector. The regulatory region nucleotide sequence is linked to the nucleotide sequence encoding a detectable product. The regulatory region nucleotide sequence may comprise one or more fragments of 5′ regulatory region of the INGAP genomic sequence, SEQ ID NO:23, or it may comprise the entire length of the 5′ regulatory region. In one embodiment of the reporter construct, a promoter element is interposed between the regulatory region nucleotide sequence and the nucleotide sequence encoding a detectable product. The promoter element may be selected from the promoter elements present in the INGAP regulatory sequence. Alternatively, the promoter element present in the vector comprising the reporter construct may be used. The detectable product encoded by the said nucleotide sequence encoding a detectable product could be either a nucleic acid or a protein. The detectable product need not be the INGAP gene nucleic acid or protein.


[0012] In another embodiment of the invention, a method identifying agents that modulate INGAP expression is provided. The method comprises contacting a cell with a test agent, wherein the cell comprises a reporter construct of the present invention. Expression of the detectable nucleic acid or protein product in the cell is determined. A test agent is identified as a modulator of INGAP expression if the test agent modulates expression of the detectable product in the cell.


[0013] In another embodiment of the invention, an isolated nucleic acid comprising the genomic sequence of the hamster INGAP gene (SEQ ID NO: 2), or a fragment thereof is provided.


[0014] According to another embodiment of the invention, an in vitro method for identifying agents that modulate INGAP expression is provided. The method comprises contacting a test agent with a reporter construct of the present invention in a cell-free system that allows for transcription and translation of a nucleotide sequence. Expression of the detectable product is determined. The substance is identified as a modulator of INGAP expression if the test substance modulates expression of the detectable product.


[0015] According to another embodiment of the invention, an in vitro method for identifying an agent that modulate INGAP expression is provided. The method comprises contacting a test agent with a nucleic acid of the invention. Binding of the test agent to the nucleic acid is determined. The test agent is identified as a modulator of INGAP expression if the test agent binds to the nucleic acid.


[0016] According to another embodiment of the invention a method for increasing INGAP expression is provided. An effective amount of a factor that stimulates INGAP expression directly or indirectly, for example cytokines, chemokines, growth factors, or pharmacological agents, is administered to a mammal in need of increased INGAP expression.


[0017] According to another embodiment of the invention a kit for modulating INGAP expression is provided. The kit comprises a modulator of INGAP expression and instructions for using the modulator of INGAP expression to modulate INGAP expression.


[0018] According to another embodiment of the invention a method for modulating INGAP expression in a mammal to treat a disease state related to reduced islet cell function is provided. The method comprises the step of administering to the mammal an effective amount of a modulator of INGAP expression whereby the level of INGAP expression in the mammal is modified.


[0019] All documents cited are, in relevant part, incorporated herein by reference; the citation of any document is not to be construed as an admission that it is prior art with respect to the present invention.







BRIEF DESCRIPTION OF THE DRAWINGS

[0020]
FIG. 1 shows the annotation of the hamster INGAP gene structure. The boundaries of introns 1-5 are listed in Table 1.


[0021]
FIG. 2 shows an overview of the 5′-regulatory region of the hamster INGAP gene (nucleotides 1-3137 of SEQ ID NO: 2) showing many well known and well-characterized transcription factor binding sites. The minimal promoter element contains the regions noted with an underline (CAAT-box, TATA-box, and GC-box).


[0022]
FIG. 3 shows a schematic of many well known and well-characterized transcription factor-binding sites for nucleotides 1-3123 of the 5′-regulatory region (SEQ ID NO: 1) of the hamster INGAP gene. Table 3 further describes these transcription factor-binding sites.


[0023]
FIG. 4 shows the predicted transcription start sites within the 5′-regulatory region (SEQ ID NO: 1) of the hamster INGAP gene (SEQ ID NO: 2). The predicted start site is indicated by a boldface nucleotide. The start and end nucleotide numbers are indicated for the promoter sequence. The numbers refer to nucleotide numbers of the hamster INGAP gene (SEQ ID NO: 2)


[0024]
FIG. 5 shows the adapter primer structure and sequence used in gene walking. Adapter primer 1 (API) and adapter primer 2 (AP2) are shown.


[0025]
FIGS. 6 and 7 show the strategy for reconstructing the hamster INGAP gene. The hamster INGAP gene was reconstructed using the technique of gene walking. Shown are the fragments and the gene specific primers (GSP1 and GSP2) used in PCR amplification for gene walking. Fragments were joined together using unique restriction enzyme sites within each fragment. The nucleotide sequences of the individual primers are listed in Table 2.


[0026]
FIG. 8 shows the fragments of INGAP 5′-regulatory region, which were cloned into pβGal-basic upstream of a P-galactosidase reporter gene. The labels on the left refer to the nucleotide fragments of SEQ ID NO: 23 which were cloned upstream of pβGal-basic.


[0027]
FIG. 9A shows reporter activity in human embryonic kidney cells (293T) transfected with a reporter construct that contains various fragments of the 5′-regulatory region (SEQ ID NO: 23) of hamster INGAP DNA cloned upstream of a β-galactosidase reporter gene (pβGal-basic), or in a reporter construct which contains no INGAP DNA. The cells are stimulated with phorbol myristate acetate. Promoter activity is assessed by determining the level of β-galactosidase present in the cell using a β-galactosidase luminescent assay.


[0028]
FIG. 9B shows reporter activity in human embryonic kidney cells (293T) transfected with a reporter construct that contains nucleotides 2030 to 3137 of the 5′-regulatory region (SEQ ID NO: 23) of hamster INGAP cloned upstream of a β-galactosidase reporter gene, or in a reporter construct which contains no INGAP DNA. The cells are stimulated with leukemia inhibitory factor. Promoter activity is assessed by determining the level of P-galactosidase present in the cell using a β-galactosidase luminescent assay.


[0029]
FIG. 10 shows the reporter activity in human embryonic kidney cells (293T) transfected with a reporter construct that contains different fragments (see FIG. 8) of the 5′-regulatory region of hamster INGAP cloned upstream of a β-galactosidase reporter gene. The cells are stimulated with phorbol myristate acetate. Concentrations of PMA used are 6 ng/ml, 17 ng/ml, 50 ng/ml, 100 ng/ml, or 300 ng/ml. Promoter activity is assessed by determining the level of β-galactosidase present in the cell using a β-galactosidase luminescent assay.


[0030]
FIG. 11 shows reporter activity in human embryonic kidney cells (293T) transfected with a reporter construct that contains different fragments (see FIG. 8) of the 5′-regulatory region of hamster INGAP cloned upstream of a β-galactosidase reporter gene. The cells are stimulated with human leukemia inhibitory factor (hLIF). Concentrations of hLIF used are 1 ng/ml, 10 ng/ml, or 30 ng/ml. Promoter activity was assessed by determining the level of β-galactosidase present in the cell using a galactosidase luminescent assay.


[0031]
FIG. 12 shows RNA analysis for INGAP gene upregulation in rat amphicrine pancreatic cells, AR42J, treated with cytokine IL-6 or untreated. Total RNA is probed by Northern analysis for INGAP gene.







DETAILED DESCRIPTION OF THE INVENTION

[0032] Definitions


[0033] It must be noted that as used herein and in the appended claims, the singular forms “a”, “an”, and “the” include plural references unless the context clearly dictates otherwise.


[0034] The term “promoter” is used to define the region of a gene at which initiation and rate of transcription are controlled. It contains the site at which RNA polymerase binds and also sites for the binding of regulatory proteins, e.g. transcription factors, repressors, etc. In order to differentiate between the transcription initiation site and other sites that modulate rate of transcription, promoter region is generally subdivided into “minimal promoter element” and “regulatory region”. The term “minimal promoter element” or sometimes simply referred to as “promoter” therefore may include TATA box, GC-rich sequence and CAAT box; while “regulatory region” is usually a long stretch of nucleotide sequence where transcription factors and other factors bind. Most eukaryotic genes have long regulatory regions where many different transcription factors bind. The expression or the lack of expression of a given gene in a given cell type, tissue, organ, or an organism is governed by the interactions that take place on its regulatory region.


[0035] The term “transcription factor” is used to describe the proteins that bind short stretches of DNA in the regulatory regions of a gene. Transcription factors may interact with each other as well as RNA polymerase. Thus, transcription factors may bind hormones or second messengers, DNA, RNA, other transcription factors, or other proteins. They may activate or inhibit transcription of a given gene. Transcription factors are also sometimes referred to as “enhancers” or “repressors”. Transcription factor binding sites can be used to identify agents that bind to the 5′-regulatory region of the gene and modulate the gene's expression.


[0036] The term “reporter” is used to describe a coding sequence attached to a heterologous promoter or enhancer elements and whose product, either nucleic acid or protein, is easly detected and is quantifiable. Some common reporter genes include βgalactosidase (lacZ), chloramphenicol acetyltransferase (cat), β-glucuronidase (GUS), and green fluorescent protein (GFP).


[0037] A “reporter construct” is a piece of nucleic acid that includes a promoter element and a reporter gene housed in a suitable vector plasmid DNA. Regulatory region nucleotide sequences may be cloned 5′ of the promoter element to determine if they contain transcription factor binding sites. The reporter construct-containing vector is introduced into a cell that contains many transcription factors. Activation of the reporter gene by transcription factors may be monitored by detection and quantification of the product of the reporter gene.


[0038] The term “agent” is used here to essentially describe any means to modulate INGAP expression. Agent may be a chemical compound, a biological agent, or a physical force, a mechanical contraption, or any combinations thereof.


[0039] INGAP Promoter and Regulatory Region


[0040] It is a discovery of the present inventors that INGAP gene is regulated by a 5′-regulatory region that is susceptible to modulation by many known transcription factors, including PMA and LIF.


[0041] It is a further discovery of the present invention that the 5′-regulatory region nucleotide sequence of the INGAP gene may be used in screening assays to identify agents capable of modulating the INGAP gene expression. These modulating agents have potential as therapeutic agents for treating pathological conditions including, but not limited to, diabetes mellitus, both type 1 and type 2, endocrine and non-endocrine hypoplasia, hypertrophy, adenoma, neoplasia, and nesidioblastosis.


[0042] Mammalian INGAP, like most genes, has a 5′-regulatory region followed by introns and exons. The sequence of a mammalian (Hamster sp.) INGAP gene is provided as SEQ ID NO: 2. FIG. 1 details the relative location of the 5′-regulatory region, the introns and the exons of the hamster INGAP gene. The boundaries of introns 1-5 and the location of the TATA-box and the poly-A signal are listed in Table 1.
1TABLE 1DescriptionPosition In INGAP Gene (SEQ ID NO: 2)TATA-Box3094INTRON 13150-3426INTRON 23508-4442INTRON 34562-4735INTRON 44874-5459INTRON 55587-5843Poly-A Signal6098-6103


[0043] The nucleotide sequence of the 5′-regulatory region including the promoter elements of mammalian INGAP, is shown partially in SEQ ID NO: 1, and completely in SEQ ID NO: 2 and 23 (nucleotides 1-3137 of SEQ ID NO: 2). Nucleotides 1-3120 of SEQ ID NO: 1 are identical to nucleotides 1-3120 of SEQ ID NO: 2 and SEQ ID NO: 23. An overview of the 5′-regulatory region is shown in FIG. 2. Representative transcription enhancer/repressor binding sites are shown also in FIG. 2. Predicted transcription enhancer/repressor binding sites for nucleotides 1-3123 of the 5′-regulatory region are shown in FIG. 3. Table 3 at the end of the specification details these transcription factors and their binding sites, and their locations in the regulatory region. Potential transcription factor binding analysis was done using MatInspector professional™, which is a bioinformatics software that utilizes a library of matrix descriptions for transcription factor binding sites to locate matches in sequences of unlimited length (Quandt, K., Frech, K., Karas, H., Wingender, E., Werner, T. (1995) Nucleic Acids Res. 23, 4878-4884).


[0044] Table 3 lists predicted binding proteins (Further Information) based upon their classification into functionally similar matrix families (Family/matrix). The DNA sequence predicted to bind the protein (Sequence), whether sense or antisense DNA (Str) and location of the sequence in SEQ ID NO: 2, (Position) are listed. Further the similarity to the consecutive highest conserved nucleotides of a matrix (Core sim.) and similarity to all nucleotides in that matrix (Matrix sim.) along with the optimized value (Opt) defined in a way that a minimum number of matches is found in non-regulatory test sequences are also listed. Details to the algorithms used in MatInspector professional™ is referenced:


[0045] OPT: This matrix similarity is the optimized value defined in a way that a minimum number of matches are found in non-regulatory test sequences (i.e. with this matrix similarity the number of false positive matches is minimized). This matrix similarity is used when the user checks “Optimized” as the matrix similarity threshold for MatInspector professional™.


[0046] Family: Each matrix belongs to a so-called matrix family, where functionally similar matrices are grouped together, eliminating redundant matches by MatInspector professional™ professional (if the family option was selected). E.g. the matrix family V$NFKB includes 5 similar matrices for NFkappaB (V$NFKAPPAB.01, V$NFKAPPAB.02, V$NFKAPPAB.03, V$NFKAPPAB50.01, V$NFKAPPAB65.01) as well as 1 matrix for the NFkappaB related factor c-Rel (V$CREL.01).


[0047] Matrix: The MatInspector professional™ matrices have an identifier that indicates one of the following seven groups: vertebrates (V$), insects (I$), plants (P$), fungi (F$), nematodes (N$), bacteria (B$), and other functional elements (O$); followed by an acronym for the factor the matrix refers to, and a consecutive number discriminating between different matrices for the same factor. Thus, V$OCT1.02 indicates the second matrix for vertebral Oct-1 factor.


[0048] Core Sim: The “core sequence” of a matrix is defined as the (usually 4) consecutive highest conserved positions of the matrix. The core similarity is calculated as described here. The maximum core similarity of 1.0 is only reached when the highest conserved bases of a matrix match exactly in the sequence. More important than the core similarity is the matrix similarity which takes into account all bases over the whole matrix length.


[0049] Matrix Sim: The matrix similarity is calculated as described here. A perfect match to the matrix gets a score of 1.00 (each sequence position corresponds to the highest conserved nucleotide at that position in the matrix), a “good” match to the matrix usually has a similarity of >0.80. Mismatches in highly conserved positions of the matrix decrease the matrix similarity more than mismatches in less conserved regions.


[0050] Another aspect of the invention provides for a reporter construct. Reporter constructs contain a 5′ regulatory region nucleotide sequence fragment of SEQ ID NO: 23 (e.g., an enhancer and/or repressor binding site containing region), a promoter element (which may or may not be from INGAP regulatory region nucleotide sequence, SEQ ID NO: 23), and a reporter gene. The 5′-regulatory region nucleotide sequence is positioned upstream of the reporter gene. In order to determine the identity of various transcription factors that bind the 5′ regulatory region nucleotide sequence and to elucidate their binding locations within the 5′ regulatory nucleotide sequence of the INGAP gene, the region may be mapped using deletion analysis. One or more fragments of the regulatory region nucleotide sequence may be initially analyzed for their responses to various transcription factor activators. Once, a region of interest is determined, further fine mapping may be carried out where DNA from different locations within the regulatory region could be combined to make a more robust, and responsive reporter construct. DNA sequences, such as INGAP 5′-regulatory region DNA or a fragment thereof, can be manipulated by methods well known in the art. Examples of such techniques include, but are not limited to, polymerase chain reaction (PCR), restriction enzyme endonuclease digestion, ligation, and gene walking. Cloning fragments of DNA, such as 5′-regulatory regions is well known in the art.


[0051] Another approach to quantify the expression levels of a gene is to measure transcription of the gene. PCR-ELISA may be used to capture transcripts onto a solid phase using biotin or digoxigenin-labelled primers, oligonucleotide probes (oligoprobes) or directly after incorporation of the digoxigenin into the transcripts (Watzinger, F. and Lion, T. (2001) Nucleic Acids Res., 29, e52). Once captured, the transcripts can be detected using an enzyme-labeled avidin or anti-digoxigenin reporter molecule similar to a standard ELISA format. Another approach is to employ real-time PCR to detect the transcript of the reporter gene (Mackay, I. M. and Nitsche, A., Nucleic Acids Res. Mar. 15, 2002; 30(6), 1292-305). In real-time PCR fluorogenic nucleotides are used and progress of the transcript is monitored in real-time as the polymerase transcribes the reporter gene.


[0052] The promoter element in the reporter construct may or may not be from the same gene as the 5′-regulatory region. As an example, the enhancer/repressor region from the INGAP 5′-regulatory region, or a fragment of the enhancer/repressor region from the INGAP 5′-regulatory region, may be cloned upstream of a heterologous minimal promoter element, e.g., the minimal CMV promoter (Boshart et al., 1985) and the promoters for TK (Nordeen, 1988), IL-2, and MMTV.


[0053] Transcription of a gene begins around the minimal promoter. FIG. 4 shows the predicted transcription start sites for mammalian INGAP gene (SEQ ID NO: 2). SEQ ID NO: 2 was analyzed using “Neural Network Promoter Prediction” program designed by Martin Reese to identify eukaryotic promoter recognition elements such as TATA-box, GC-box, CAAT-box, and the transcription start site. These promoter elements are present in various combinations separated by various distances in sequence. The program is available on the Internet and is located at http://www.fruitfly.org/seqtools/promoter.html.


[0054] The reporter construct can be used to identify agents that modulate, either alone or in combination, the expression of INGAP. Some such agents may modulate expression of INGAP by binding to the regulatory region directly while others may regulate expression of transcription factors that bind to the INGAP regulatory region.


[0055] The reporter construct can be transfected into a host cell in vitro, or in vivo through the pancreatic duct, either transiently or stably, and a test agent introduced to the assay system. Examples of test agents include, but are not limited to organic and inorganic chemical agents, carbohydrates, proteins, oligonucleotides, cholecystokinin, mechanically induced pressure, and agents which cause a pancreatic duct obstruction. Expression of the reporter gene product can be determined by an assay appropriate for the reporter gene employed. Examples of such assays include, but are not limited to a luminescent assay for β-galactosidase or luciferase, an enzymatic assay for chloramphenicol acetyl transferase, and fluorescence detection for fluorescent proteins. Such assays are well known in the art, and a skilled artisan will be able to select an appropriate assay for the chosen reporter. A test agent is identified as a modulator of INGAP expression if the test agent modulates expression of the reporter gene product. Preferably the level of increase or decrease is at least 50%, 100%, 200%, 500%, or 1000%, but any statistically significant change can be an indicator of modulatory activity. A skilled artisan may also determine reporter gene product expression in untreated cells, and in treated and untreated cells transfected with a promoter-less reporter gene only. Such determinations can be used to determine background levels of expression.


[0056] Test agents can also be obtained by fractionating pancreatic secretion fluids. A pancreatic duct obstruction can be used as an exemplary method of harvesting pancreatic secretion fluids. The pancreatic secretion fluids can be fractionated by methods well known in the art. Examples include high-pressure liquid chromatography (HPLC), size exclusion chromatography, hydrophobic interacting columns, and density gradient centrifugation. Individual fractions can be tested for agents that modulate reporter gene expression using a method described herein. The individual fractions can be further fractionated to identify agents that modulate reporter gene expression. The identified test agents can be used to modulate the expression of INGAP.


[0057] A host cell can be any cell suitable for transfection and maintenance in a suitable assay system. Examples of suitable cells include, but are not limited to, mammalian cells, human cells, mouse cells, rat cells, monkey cells, dog cells, bovine cells, and porcine cells. Preferably the cells used will be human cells. The cells could be either transformed cells line or primary cells. Whole organ explants may also be used where the regulation may be monitored over many different cell types. Many methods exist in the art for transfecting or infecting cells with reporter construct DNA. Such methods include, but are not limited to, lipofection, electroporation, calcium phosphate precipitation, DEAE dextran, gene guns, and modified viral techniques (e.g., recombinant adenovirus or recombinant retrovirus). The skilled artisan can readily choose a method suitable for use with a given cell type and assay system.


[0058] The reporter construct can also be introduced in vivo directly into cells of the pancreas. Examples of methods to introduce the reporter construct into pancreatic cells in vivo include pancreatic duct retrograde perfusion and in vivo electroporation (Mir, 2001). The reporter construct encodes a reporter gene product that is readily measured in vivo. A test agent can be administered systemically or locally, and expression of the reporter gene in vivo can be determined by an assay appropriate for the particular reporter employed. Examples of such include a fluorescence assay for green fluorescent protein.


[0059] Methods for identifying agents that modulate INGAP expression can also be accomplished in vitro. The reporter construct can be contacted with a test agent in vitro under conditions sufficient for transcription and/or translation of the reporter gene. Components such as rabbit reticulocyte lysates or wheat germ extracts can be utilized for such a method. Subsequently, the expression level of the reporter gene can be determined as described above utilizing an appropriate assay for a given reporter gene. A test agent is identified as a modulator of INGAP expression if the test agent modulates expression of the reporter gene. Threshold levels of change can be set by the practitioner as discussed above.


[0060] A test agent can alternatively be contacted with an isolated and purified INGAP 5′-regulatory DNA molecule and one can determine if the test agent binds to the DNA molecule. Test agents can be a chemical agent, a protein, or a nucleic acid. Appropriate INGAP 5′-regulatory DNA molecules would include nucleotides 1-6586 of SEQ ID NO: 2, the 5′-regulatory region DNA (SEQ ID NO: 1, or SEQ ID NO: 23), or any fragment of the 5′-regulatory region, preferably a fragment which contains one or more enhancer/repressor binding sites. Methods to determine binding of the test agent to the fragment of DNA are well known in the art, e.g., electrophoretic mobility shift assay (EMSA). See for example Sambrook et al., MOLECULAR CLONING: A LABORATORY MANUAL, 2d ed., 1989, at pages 9.50-9.51. Fragments of the 5′-regulatory region can be obtained by methods well known in the art using the disclosed sequence (SEQ ID NO: 2). Examples of such methods include, PCR, restriction enzyme digestion, and chemical synthesis. Any fragment of DNA within the 5′-regulatory region (SEQ ID NO: 1, or 23) can be used. The exact location that an agent binds can be determined for example by utilizing smaller fragments to map precisely the binding site for the test agent. Test agents that bind in the assay can be further tested in other assays that require modulatory activity.


[0061] An agent that causes an increase or decrease in reporter gene expression can be used as a modulator of INGAP expression. The modulator can be administered to a mammal in need of such modulation. Examples of mammals that may need INGAP expression modulation are those with reduced pancreatic function, in particular reduced islet cell function. Such mammals include those who have diabetes mellitus, impaired glucose tolerance, impaired fasting glucose, hyperglycemia, obesity, and pancreatic insufficiency.


[0062] An agent that is identified as a modulator of INGAP expression can be supplied in a kit to treat diseases associated with reduced islet cell function. The kit would comprise in single or divided containers, in single or divided doses a modulator of INGAP expression. Written instructions may be included for using the modulator of INGAP expression. The instructions may simply refer a reader to another location such as a website or other information source.


[0063] Agents that cause an increase in reporter gene expression can be used to increase INGAP expression to treat a disease state related to reduced islet cell function.


[0064] Agents that cause a decrease in reporter gene expression can be used to decrease INGAP expression to treat a disease state related to hyperactivity of islet cells or a disease where reduced INGAP expression is desirable. Examples of such agents include, but are not limited to, PMA, LIF, interleukin-6, Oncostatin M, and ciliary neurotropic factor. Agents can be administered by any number of routes including, but not limited to, oral, intravenous, intramuscular, intra-arterial, intramedullary, intrathecal, intraventricular, transdermal, subcutaneous, intraperitoneal, intranasal, parenteral, topical, sublingual, rectal, or pancreatic duct retrograde perfusion. Agents for oral administration can be formulated using pharmaceutically acceptable carriers well known in the art in dosages suitable for oral administration. Such carriers enable the pharmaceutical compositions to be formulated as tablets, pills, dragees, capsules, liquids, gels, syrups, slurries, suspensions, and the like, for ingestion by the mammal. Agents for intravenous, intramuscular, intra-arterial, transdermal, and subcutaneous injections can be formulated using pharmaceutically acceptable carriers well known in the art in dosages suitable for injection into the mammal. Agents for intranasal, topical, and rectal administration can be formulated using pharmaceutically acceptable carriers well known in the art in dosages suitable for surface administration to the mammal. Mammals in need of an increase in INGAP expression include for example, mammals with diabetes mellitus, impaired glucose tolerance, impaired fasting glucose, hyperglycemia, obesity, and pancreatic insufficiency. Mammals in need of a decrease in INGAP expression include for example, mammals with hypoglycemia.


[0065] The following examples are offered by way of illustration and do not limit the invention disclosed herein.



EXAMPLES


Example 1

[0066] Hamster INGAP Genomic Sequence and Structure


[0067] The hamster INGAP genomic sequence and structure was determined by gene walking (Clontech) and DNA sequencing. Gene walking is a method for walking upstream toward a promoter or downstream in genomic DNA from a known sequence, such as cDNA. This method utilizes four uncloned, adapter-ligated genomic fragment libraries. The manufacturer's recommended protocol is followed with one notable exception; hamster genomic DNA was used to create the uncloned, adapter-ligated genomic fragment libraries.


[0068] To create uncloned, adapter ligated genomic fragment libraries, genomic DNA was purified from hamster cells. Four separate aliquots were thoroughly digested with PvuII, StuI, DraI, or EcoRV. Following digestion, inactivation of the restriction enzymes, and dephosphorylation, each separate pool of DNA fragments was ligated to an adapter, see FIG. 5. The adapter was phosphorylated to provide the requisite phosphate group for a ligation reaction. Also note that the 3-prime side of the short adapter contains an amine group to prevent the adapters from forming concatamers.


[0069] Two gene specific primers (GSP1 and GSP2) were designed for each region of known sequence (i.e., the exons of the INGAP gene). See FIG. 6 for fragment location and GSP1 and GSP2 location. The gene specific primers were designed as reverse PCR primers for all fragments except fragments 12 and 145. The gene specific primers for fragments 12 and 145 were designed as forward primers. Adapter primer 1 (API) and adapter primer 2 (AP2) (FIG. 5) were forward PCR primers for all fragments except fragments 12 and 145, which were reverse PCR primers. The outer gene specific primer (GSP1) was used with adapter primer 1 in a PCR reaction. To increase specificity, a second, nested PCR was set up using the inner gene specific primer (GSP2) and adapter primer 2. A small aliquot of the first reaction served as template for the second reaction. Gene specific PCR primers utilized for gene walking are listed in Table 2 and the strategy used to build the INGAP genomic sequence is shown in FIGS. 6 and 7. The arrowheads in FIG. 6 represent the adapter primers (AP1 and AP2), while the circles represent the gene specific primers (GSP1 and GSP2).
2TABLE 2NAME (LOCATION)SEQUENCEINGEN 21_3 (1464, 1482)5′-ACAAGCAATCTAGAGATGG-3′(SEQ ID NO: 3)INGEN 19_3 (1401, 1423)5′-GTTCAGCTATGTTCATAGCAGGG-3′(SEQ ID NO: 4)INGEN 16_3 (1855, 1876)5′-GTCTGTATGACTGTGTGGGAAG-3′(SEQ ID NO: 5)INGEN 15_3 (1929, 1948)5′-GCACTTGAACTCAATGGCTC-3′(SEQ ID NO: 6)INGEN 14_3 (2147, 2168)5′-GAACCACCTGACATGGGTGATG-3′(SEQ ID NO: 7)INGEN 13_3 (2177, 2200)5′-GGGCATCGTATCATCTGGTTACAG-3′(SEQ ID NO: 8)INGEN 8_3 (2544, 2565)5′-GGTTCAAAAAAGCTGCTTCAAC-3′(SEQ ID NO: 9)INGEN 7_3 (2666, 2689)5′-GGAATAGCTGCAATTTATGCCCAT-3′(SEQ ID NO: 10)INGEN 4_3 (2833, 2858)5′-CTTAGGAACATTCAGGCAGCCTCCTG-3′(SEQ ID NO: 11)INGEN 3_3 (2866, 2891)5′-GTTGCCCTCTGCCACGTGTCAAGTTC-3′(SEQ ID NO: 12)INGEN 2_3 (3444, 3470)5′-CATCCAAGACATCCTACAGAGGGTCAT-3′(SEQ ID NO: 13)INGEN 1_3 (3475, 3501)5′-CCCAAGAAAGGAACATCAGGCAGGAAA-3′(SEQ ID NO: 14)INGEN 2_2 (3330, 3350)5′-CCAAATGAGTGCTTCCCTGAA-3′(SEQ ID NO: 15)INGEN 1_2 (3241, 3266)5′-GCAGCACTCTGAAACTCAGTAGAGTT-3′(SEQ ID NO: 16)INGEN 14_5 (5544, 5563)5′-GCTGCTGACCGTGGTTATTG-3′(SEQ ID NO: 17)INGEN 13_5 (5463 5485)5′-ACACTACCCAACGGAAGTGGATG-3′(SEQ ID NO: 18)INGAP1_1L (3475, 3492)5′-TTTCCTGCCTGATGTTCC-3′(SEQ ID NO: 19)INGAP1_1R (5957, 5976)5′-TCATACTTGCTTCCTTGTCC-3′(SEQ ID NO: 20)INGAP2_1L (4470, 4488)5′-CTTCACGTATAACCTGTCC-3′(SEQ ID NO: 21)INGAP2_1R (5905, 5923)5′-ATTAGAACTGCCCTAGACC-3′(SEQ ID NO: 22)


[0070] The PCR fragments were sequenced to determine the nucleotide sequence of the INGAP 5′-regulatory region; the introns, the intron/exon junctions, and the 3-prime polyadenylation regions. The nucleotide sequence of hamster INGAP genomic DNA is shown in SEQ ID NO: 2.



Example 2

[0071] Cloning Hamster INGAP 5′-Regulatory Region Fragment into a Reporter Construct


[0072] To construct the INGAP 5′-regulatory region, individual PCR fragments were joined together at unique restriction sites located within two adjoining fragments. FIGS. 6 and 7 detail the strategy used to piece the INGAP 5′-regulatory region together. Fragments 83 and 23 were joined at a unique SphI site; 143 and 83 were joined at a unique BbsI site; 163 and 143 were joined at a unique PstI site. The nucleotide sequence of hamster INGAP 5′-regulatory region DNA is shown in SEQ ID NO: 1 and 23 in the sequence listing.


[0073] The hamster INGAP 5′-regulatory region or a fragment of the 5′-regulatory region was cloned into a reporter plasmid, pβGal-Basic (Clontech). The 5′-regulatory region or fragments were cloned utilizing the unique XmaI site from the gene walking adapter primer and a unique BglII site located at the 3-prime side of the regulatory region. FIG. 8 details the fragments cloned into pβGal-Basic. The sizes of the fragments are indicated to the right of the fragments and are expressed as the number of nucleotides of the fragment.



Example 3

[0074] Assay System to Screen for Factors that Modulate the Expression of INGAP


[0075] Promoter analysis of INGAP identified a number of potential promoter-proximal regulatory sites including the consensus transcription factor binding sites; cAMP response element (CRE), AP-1 and STAT. Promoter-fragment reporter-gene constructs were transiently transfected into 293T cells and co-transfection of secretory alkaline phosphatase was used to normalize for transfection efficiency.


[0076] Reporter constructs containing INGAP 5′-regulatory region fragments 23sP (SEQ ID NO: 37), 23dP (SEQ ID NO: 38), 23pP (SEQ ID NO: 36), 143P (SEQ ID NO: 34), 163P (SEQ ID NO: 31), or 193P (SEQ ID NO: 23) were transfected into human cells. The pβGal-Basic plasmid without the hamster INGAP DNA was also transfected into human cells as a control to measure the level of endogenous reporter activity. Two days following transfection, the cells were treated with PMA for 24 hours or were untreated. To determine the level of promoter activity, the amount of β-galactosidase gene product was determined using a luminescent assay for β-galactosidase. FIG. 9A shows that construct 143P activated the INGAP expression the most, followed by 23pP, and 163P.


[0077] Reporter construct containing INGAP 5′-regulatory region DNA nucleotides 2030 to 3120 was transfected into human cells. The pβGal-Basic plasmid without the hamster INGAP DNA was also transfected into human cells as a control to measure the level of endogenous reporter activity. Two days following transfection, the cells were treated with LIF for 24 hours or were untreated. To determine the level of promoter activity, the amount of β-galactosidase gene product was determined using a luminescent assay for β-galactosidase. FIG. 9B shows the results. LIF was determined to increase the activity of the 5′-regulatory region of mammalian INGAP. Forskolin (an activator of cAMP/CREB/CRE) did not modulate gene expression (data not shown).


[0078] It is important to note that when present in human cells, the hamster INGAP 5′-regulatory region is transactivated by the human transcription factors. Thus, linked to a reporter gene, the 5′-regulatory region of hamster INGAP creates a sensitive assay system to screen for factors that modulate the expression of INGAP.



Example 4

[0079] Determination of Approximate Location of PMA and LIF-Mediated Transcription Factor Binding in the 5′-Regulatory Region


[0080] To map the approximate location of PMA-initiated or LIF-initiated transcription factor binding different fragments of the hamster INGAP 5′-regulatory region were cloned into pβGal-Basic. See FIG. 8. The fragments cloned into the reporter construct were 23sP (SEQ ID NO: 37), 23dP (SEQ ID NO: 38), 23pP (SEQ ID NO: 36), 143P (SEQ ID NO: 34), 163P (SEQ ID NO: 31), or 193P (SEQ ID NO: 23). The reporter constructs were transfected into human cells. Two days following transfection, the cells were treated with different concentrations of PMA or LIF for 24 hours. The concentrations of PMA used were 6 ng/ml, 17 ng/ml, 50 ng/ml, 100 ng/ml, or 300 ng/ml. The concentrations of LIF used were 1 ng/ml, 10 ng/ml, or 30 ng/ml. To determine the level of promoter activity, the amount of β-galactosidase gene product was determined using a luminescent assay for β-galactosidase. FIGS. 10 and 11 show the results for PMA and LIF treatment, respectively. Both PMA and LIF activated the cell reporter constructs. The exact location of the DNA contact sites can be narrowed further by cloning smaller fragments of the hamster INGAP 5′-regulatory region and by site directed mutations or deletions.



Example 5

[0081] RNA Analysis of INGAP Gene Upregulation


[0082] To determine if INGAP RNA levels increase after stimulation with a cytokine that signals through STAT, rat amphocrine pancreatic cells, AR42J were treated with IL-6 (1000 U/ml) for 24 hours. Total RNA was extracted from the treated and untreated cells using techniques well known in the art, e.g., using TRIZOL® reagent.


[0083] Equal amounts of total RNA (10 kg) were loaded in 2.5% formaldehyde gel and electrophoresed for 4 hours at 70V with a constant circulation of the buffer using a circulating pump. The gel was photographed and washed with water twice at room temperature and soaked in 20×SSC. The gel was transferred to a nylon membrane (Amersham) in 20×SSC overnight following a standard procedure. The membrane was washed with 20×SSC to remove any agar that might have attached to the membrane and baked for 4 hours at −80° C.


[0084] One hundred nanograms of hamster INGAP cDNA was labeled using Random Prime Labeling kit (Roche-BMB) and alpha-P32 dCTP (ICN). Approximately 20 million counts were used for hybridization in 20 ml hybridization buffer following the standard procedure at 42° C. for overnight. The blot was washed as follows: 2-times at room temperature with 2×SSC for 10 minutes each; 2-times at 42° C. with 2×SSC for 10 minutes each; 2-times at 55° C. with 1×SSC for 10 minutes each. The membrane was exposed to the film (XOMAT-Kodak) and kept at −80° C. overnight before developing.


[0085] Treatment with IL-6 caused an increase in INGAP gene expression (FIG. 12). These data demonstrate that extracellular factors that elevate AP-1-binding transcription factors and STAT-binding transcription factors are involved in the regulation of INGAP gene expression. These studies suggest that it is feasible to enhance INGAP expression as a means of inducing islet neogenesis.


[0086] While particular embodiments of the present invention have been illustrated and described, it would be obvious to those skilled in the art that various other changes and modifications can be made without departing from the spirit and scope of the invention. It is therefore intended to cover in the appended claims all such changes and modifications that are within the scope of this invention.
3TABLE 3PositionCoreMatrixFamily/matrixFurther InformationOpt.from-toanchorStr.sim.sim.SequenceV$LEFF/LEF1.01TCF/LEF-1, involved in the0.8612-2820(+)1.0000.900ggaccatCAAAgtctgtWnt signal transductionpathwayV$MITF/MIT.01MIT (microphthalmia0.8122-4031(+)1.0000.823agtctgtCATGtcatttggtranscription factor) and TFE3V$OCT1/OCT1.05octamer-binding factor 10.9027-4134(+)0.8330.904gTCATgtcatttgggV$TCFF/TCF11.01TCF11/KCR-F1/Nrf11.0032-3835(+)1.0001.000GTCAttthomodimersV$MYOF/Myogenin/nuclear factor 1 or0.7125-5339(+)1.0000.735ctgtcatgtcatTTGGgggagggcctatgMYOGNF1.01related factorsV$ZBPF/ZBP89.01Zinc finger transcription factor0.9336-4842(−)1.0000.982gccctCCCCcaaaZBP-89V$SP1F/GC.01GC box elements0.8838-5245(+)0.8760.898tgggGGAGggcctatV$PERO/PPARA.01PPAR/RXR heterodimers0.7044-6454(−)0.8840.708acagaggagggcATAGgccctV$PAX5/PAX9.01zebrafish PAX9 binding sites0.7843-7157(−)0.8000.811cagataCACAgaggagggcataggccctcV$TBPF/ATATA.01Avian C-type LTR TATA box0.8168-8476(−)1.0000.987tgctattTAAGcccagaV$HMTB/MTBF.01muscle-specific Mt binding0.9076-8480(−)1.0000.932tgctATTTasiteV$OCT1/OCT1.06octamer-binding factor 10.8074-8881(−)0.7500.865ggtatgctATTTaagV$BRNF/BRN2.01POU factor Brn-2 (N-Oct 3)0.91 89-10597(+)1.0000.970tccataggAAATgggctV$HMTB/MTBF.01muscle-specific Mt binding0.90108-116112(−)1.0000.953tggaATTTgsiteV$OCT1/OCT1.05octamer-binding factor 10.90106-120113(−)0.9440.917tATATggaatttgggV$HNF6/HNF6.01Liver enriched Cut -0.82108-122115(+)0.8330.885caaatTCCAtatatgHomeodomain transcriptionfactor HNF6 (ONECUT)V$SRFF/SRF.02serum response factor0.83110-128119(+)1.0000.862aattCCATatatgcactagV$OCTP/OCT1P.01octamer-binding factor 1,0.86114-126120(+)1.0000.903ccatatATGCactPOU-specific domainV$MYOF/Myogenin/nuclear factor 1 or0.71171-199185(+)0.8570.740ctggtcttttagCTGGcacccatccatatMYOGNF1.01related factorsV$NF1F/NF1.02Nuclear factor 1 (CTF1)0.81181-199190(+)1.0000.812agcTGGCacccatccatatV$CLOX/cut-like homeodomain protein0.94187-203195(−)0.9290.940ctgaatatgGATGggtgCDPCR3HD.01V$MYOF/Myogenin/nuclear factor 1 or0.71181-209195(−)0.7850.767aaccctctgaatATGGatgggtgccagctMYOGNF1.01related factorsV$OCTP/OCT1P.01octamer-binding factor 1,0.86192-204198(+)0.9800.907atccatATTCagaPOU-specific domainV$CREB/TAXCREB.02Tax/CREB complex0.71202-222212(−)0.7500.721ttgaacTGAAccaaaccctctV$HOXF/EN1.01Homeobox protein engrailed0.77210-226218(−)0.7820.823aacaTTGAactgaacca(en-1)V$BARB/BARBIE.01barbiturate-inducible element0.88230-244237(−)1.0000.894ttatAAAGctgaggaV$TBPF/TATA.01cellular and viral TATA box0.90230-246238(−)1.0000.910agttaTAAAgctgaggaelementsV$BARB/BARBIE.01barbiturate-inducible element0.88252-266259(−)1.0000.902agtgAAAGcagagagV$MYT1/MYT1.01MyT1 zinc finger transcription0.75272-284278(−)0.7500.756craCAGTtgacctfactor involved in primaryneurogenesisV$SMAD/SMAD4.01Smad4 transcription factor0.94304-312308(+)1.0000.940GTCTtgactinvolved in TGF-beta signalingV$HOXF/CRX.01Cone-rod homeobox-0.94312-328320(−)1.0000.960gagggATTAgaaaaggacontaining transcription factor/otx-like homeobox geneV$ECAT/NFY.01nuclear factor Y (Y-box0.90337-351344(−)1.0000.906ggaatCCAAtygtagbinding factor)V$HOXF/PTX1.01Pituitary Homeobox 1 (Ptx1)0.79337-353345(+)0.7890.802ctacraTTGGattccatV$FKHD/FREAC2.01Fork head RElated ACtivator-20.84362-378370(−)1.0000.897tacagcTAAAcactgagV$MINI/Muscle Initiator Sequence0.86401-419410(−)0.8400.865gagcctTCATccagtagctMUSCLE_INI.02V$MOKF/MOK2.01Ribonucleoprotein associated0.74409-429419(−)1.0000.746tgtcatcttagagCCTTcatczinc finger protein MOK-2(mouse)V$ZFIA/ZID.01zinc finger with interaction0.85414-426420(+)1.0000.861agGCTCtaagatgdomainV$CART/XVENT2.01Xenopus homeodomain factor0.82418-434426(+)0.7500.837tcTAAGatgacaattaaXvent-2; early BMP signalingresponseV$OCT1/OCT1.04octamer-binding factor 10.80421-435428(+)0.8070.840aaGATGacaattaagV$HOMS/S8.01Binding site for S8 type0.97426-434430(+)1.0000.994gacaATTAahomeodomainsV$NKXH/NKX25.02homeo domain factor Nkx-0.88424-436430(−)1.0001.000cctTAATtgtcat2.5/Csx, tinman homolog lowaffinity sitesV$CREB/CREBP1.01cAMP-responsive element0.80425-445435(−)0.7660.808cgacgattACCTtaattgtcabinding protein 1V$COMP/COMP1 .01COMP1, cooperates with0.76434-454444(−)0.7500.768aatgaggATCGacgattacctmyogenic proteins inmulticomponent complexV$HOXF/HOX1-3.01Hox-1.3, vertebrate0.83444-460452(+)1.0000.886cgatcctcATTAtagtghomeobox proteinV$ETSF/GABP.01GABP: GA binding protein0.85454-470462(+)1.0000.868tatagtGGAAgggcttcV$LEFF/LEF1.01TCF/LEF-1, involved in the0.86463-479471(+)1.0000.904agggcttCAAAggcagtWnt signal transductionpathwayV$STAT/STAT6.01STAT6: signal transducer and0.84464-482473(−)0.7580.867gagacTGCCtttgaagcccactivator of transcription 6V$GATA/GATA1.03GATA-binding factor 10.95490-502496(−)1.0000.971ttcaGATAggcagV$SRFF/SRF.01serum response factor0.66487-505496(−)0.7570.672atgttcaGATAggcagtagV$EVI1/EVI1.04Ecotropic viral integration site0.77493-509501(−)0.8000.824gGAAAtgttcagatagg1 encoded factorV$AP4R/TH1E47.01Thing1/E47 heterodimer, TH10.93509-525517(+)1.0000.951cctaatgCCAGatgtctbHLH member specificexpression in a variety ofembryonic tissuesV$AP4R/Tal-1beta/ITF-2 heterodimer0.85512-528520(+)1.0000.852aatgcCAGAtgtctcttTAL1BETAITF2.01V$NEUR/NEUROD1.01DNA binding site for0.83514-526520(−)1.0000.851gagaCATCtggcaNEUROD1 (BETA-2/E47dimer)V$MEF2/MEF2.05MEF20.96518-540529(−)1.0000.984aggataggttTAAAgagacatctV$EVI1/EVI1.04Ecotropic viral integration site0.77523-539531(−)1.0000.774gGATAggtttaaagaga1 encoded factorV$MEF2/AMEF2.01myocyte enhancer factor0.80521-543532(+)1.0000.813tgtctcttTAAAcctatcctggcV$TBPF/MTATA.01Muscle TATA box0.84524-540532(+)1.0000.877ctcttTAAAcctatcctV$HOXF/HOX1-3.01Hox-1.3, vertebrate0.83543-559551(+)1.0000.845ctcccttcATTAaggtahomeobox proteinV$PDX1/ISL1.01Pancreatic and intestinal lim-0.82543-563553(−)1.0000.834gagatacctTAATgaagggaghomeodomain factorV$OCT1/OCT1.05octamer-binding factor 10.90556-570563(+)0.9440.926gGTATctcattttttV$CIZF/NMP4.01NMP4 (nuclear matrix protein0.97562-572567(−)1.0000.972gcAAAAaatga4)/CIZ (Cas-interacting zincfinger protein)V$EVI1/EVI1.01Ecotropic viral integration site0.72569-585577(−)0.7640.720ggaaCAGAggagagcaa1 encoded factorV$AP1F/AP1.01AP1 binding site0.95582-602592(−)0.8810.964aaaactgaATCAgtggnggaaV$PIT1/PIT1.01Pit1, GHF-1 pituitary specific0.86589-599594(+)1.0000.886actgATTCagtpou domain transcriptionfactorV$AP1F/AP1.01AP1 binding site0.95586-606596(+)0.8500.956nccactgaTTCAgtttttctgV$VMYB/VMYB.01v-Myb0.90593-603598(−)0.8760.910aaaAACTgaatV$CIZF/NMP4.01NMP4 (nuclear matrix protein0.97595-605600(−)1.0000.975agAAAAactga4)/CIZ (Cas-interacting zincfinger protein)V$GREF/PRE.01Progesterone receptor binding0.84604-622613(+)1.0000.875ctgatccctctTGTTctccsiteV$GKLF/GKLF.01Gut-enriched Krueppel-like0.91632-646639(−)1.0000.971gaaaaagagaAGGGafactorV$CIZF/NMP4.01NMP4 (nuclear matrix protein0.97637-647642(−)1.0000.987ggAAAAagaga4)/CIZ (Cas-interacting zincfinger protein)V$NFAT/NFAT.01Nuclear factor of activated T-0.97640-650645(−)1.0000.982ggagGAAAaagcellsV$MAZF/MAZ.01Myc associated zinc finger0.90649-661655(−)1.0000.910ggtgGAGGgaaggprotein (MAZ)V$EGRF/WT1.01Wilms Tumor Suppressor0.88658-672665(−)1.0000.932gggggTGGGagggtgV$ZBPF/ZBP89.01Zinc finger transcription factor0.93663-675669(+) 1.0000.972tcccaCCCCcatgZBP-89V$IRFF/IRF2.01interferon regulatory factor 20.80702-716709(−)1.0000.815aggaagggGAAAgggV$BRNF/BRN2.01POU factor Brn-2 (N-Oct 3)0.91746-762754(−)1.0000.911aaaataggAAATaaggaV$ETSF/PU1.01Pu.1 (Pu120) Ets-like0.86746-762754(−)1.0000.883aaaataGGAAataaggatranscription factor identifiedin lymphoid B-cellsV$EVI1/EVI1.04Ecotropic viral integration site0.77750-766758(−)0.7600.792aGAGAaaataggaaata1 encoded factorV$EVI1/EVI1.05Ecotropic viral integration site0.80755-771763(−)0.7630.817cccccagagaaAATAgg1 encoded factorV$ZBPF/ZBP89.01Zinc finger transcription factor0.93764-776770(−)1.0000.934ccacaCCCCcagaZBP-89V$FAST/FAST1.01FAST-1 SMAD interacting0.81769-783776(+)0.9830.894gggtgtgGATTttatproteinV$TBPF/TATA.02Mammalian C-type LTR TATA0.89771-787779(−)1.0000.942caccaTAAAatccacacboxV$PAX5/PAX9.01zebrafish PAX9 binding sites0.78781-809795(−)0.8660.813aacataTGCAcagaagggcttccaccataV$OCT1/OCT.01Octamer binding site0.79793-807800(−)1.0000.790catATGCacagaagg(OCT1/OCT2 consensus)V$OCTP/OCT1P.01octamer-binding factor 1,0.86798-810804(−)1.0000.910caacatATGCacaPOU-specific domainV$SRFF/SRF.01serum response factor0.66797-815806(+)0.7570.666ctgtgcaTATGttgtcttaV$EVI1/EVI1.05Ecotropic viral integration site0.80802-818810(−)0.7500.828caataagacaaCATAtg1 encoded factorV$CLOX/CDP.01cut-like homeodomain protein0.75803-819811(−)1.0000.776ccAATAagacaacatatV$EVI1/EVI1.02Ecotropic viral integration site0.83807-823815(−)1.0000.836tcaaccaatAAGAcaac1 encoded factorV$ECAT/NFY.02nuclear factor Y (Y-box0.91810-824817(−)1.0000.960atcaaCCAAtaagacbinding factor)V$HAML/AML3.01Runt-related transcription0.84811-825818(+)1.0000.844tcttatTGGTtgatafactor 2/CBFA1 (core-binding factor, runt domain,alpha subunit 1)V$PCAT/CAAT.01cellular and viral CCAAT box0.90813-823818(−)1.0000.943tcaaCCAAtaaV$GATA/GATA.01GATA binding site0.95818-830824(+)1.0000.956ggttGATAaataa(consensus)V$HNF1/HNF1.02Hepatic nuclear factor 10.76818-834826(+)0.7570.791gGTTGataaataaagcaV$HOXT/Homeobox protein MEIS10.79823-835829(−)0.7500.797gTGCTttatttatMEIS1_HOXA9.01binding siteV$ECAT/NFY.01nuclear factor Y (Y-box0.90837-851844(+)1.0000.912gttgtCCAAtagggabinding factor)V$FKHD/FREAC2.01Fork head RElated ACtivator-20.84844-860852(+)0.7500.843aataggGAAAcaagataV$EVI1/EVI1.06Ecotropic viral integration site0.83846-862854(+)1.0000.960tagggaaacaAGATagg1 encoded factorV$GATA/GATA1.01GATA-binding factor 10.96853-865859(+)1.0000.970acaaGATAggtggV$PCAT/ACAAT.01Avian C-type LTR CCAAT box0.86856-866861(−)0.7500.867cccaCCTAtctV$XBBF/RFX1.01X-box binding protein RFX10.89909-927918(−)1.0000.929ggatcacatgGCAAccctcV$EBOX/MYCMAX.02c-Myc/Max heterodimer0.92912-928920(−)0.8950.936aggatCACAtggcaaccV$MITF/MIT.01MIT (microphthalmia0.81911-929920(+)1.0000.863gggttgcCATGtgatcctatranscription factor) and TFE3V$ETSF/PU1.01Pu.1 (Pu120) Ets-like0.86927-943935(+)1.0000.950ctaggaGGAAttgacactranscription factor identifiedin lymphoid B-cellsV$OCT1/OCT1.06octamer-binding factor 10.80932-946939(−)1.0000.800catgtgtcAATTcctV$TALE/TGIF.01TG-interacting factor1.00936-942939(−)1.0001.000tGTCAatbelonging to TALE class ofhomeodomain factorsV$MITF/MIT.01MIT (microphthalmia0.81935-953944(−)1.0000.835ccattctCATGtgtcaatttranscription factor) and TFE3V$OCT1/OCT1.04octamer-binding factor 10.80941-955948(+)0.8460.800caCATGagaatggggV$GATA/GATA.01GATA binding site0.95962-974968(+)1.0000.998gaaaGATAagtcc(consensus)V$SRFF/SRF.01serum response factor0.66968-986977(−)1.0000.672atattttTATAaggacttaV$CDXF/CDX2.01Cdx-2 mammalian caudal0.84970-988979(−)1.0000.867atatattTTTAtaaggactrelated intestinal transcr.factorV$FKHD/XFD2.01Xenopus fork head domain0.89972-988980(+)1.0000.894tccttaTAAAaatatatfactor 2V$MEF2/MEF2.01myogenic enhancer factor 20.74970-992981(+)1.0000.740agtccttaTAAAaatatatattaV$TBPF/TATA.01cellular and viral TATA box0.90973-989981(+)1.0000.963ccttaTAAAaatatataelementsV$CART/CART1.01Cart-1 (cartilage0.84978-994986(−)1.0000.870acTAATatatatttttahomeoprotein 1)V$CART/CART1.01Cart-1 (cartilage0.84 985-1001993(−)1.0000.855caTAATtactaatatathomeoprotein 1)V$SATB/SATB1.01Special AT-rich sequence-0.93 985-1001993(−)1.0000.943cataattacTAATatatbinding protein 1,predominantly expressed inthymocytes, binds to matrixattachment regions (MARs)V$BRNF/BRN3.01POU transcription factor Brn-30.78 987-1003995(−)1.0000.816cccATAAttactaatatV$CLOX/CDP.01cut-like homeodomain protein0.75 987-1003995(−)0.7570.765ccCATAattactaatatV$HOMS/S8.01Binding site for S8 type0.97 992-1000996(+)1.0000.989agtaATTAthomeodomainsV$NKXH/DLX1.01DLX-1, -2, and -5 binding0.91 990-1002996(−)1.0000.976ccatAATTactaasitesV$HOXF/HOX1-3.01Hox-1.3, vertebrate0.83 989-1005997(−)1.0000.886aacccataATTActaathomeobox proteinV$PDX1/PDX1.01Pdx1 (IDX1/IPF1) pancreatic0.74 988-1008998(−)1.0000.775attaacccaTAATtactaataand intestinal homeodomainTFV$FKHD/XFD3.01Xenopus fork head domain0.82 998-10141006(+)0.8260.844tatgggttAATAattaafactor 3V$HNF1/HNF1.01hepatic nuclear factor 10.781000-10161008(−)0.7550.857aCTTAattattaacccaV$HNF1/HNF1.01hepatic nuclear factor 10.781002-10181010(+)1.0000.966gGTTAataattaagtcaV$PAX4/PAX4.01Pax-4 paired domain protein,0.971005-10151010(+)1.0000.972taatAATTaagtogether with PAX-6 involvedin pancreatic developmentV$HOMS/S8.01Binding site for S8 type0.971007-10151011(−)1.0000.995cttaATTAthomeodomainsV$HOXF/HOX1-3.01Hox-1.3, vertebrate0.831003-10191011(−)1.0000.873ctgacttaATTAttaachomeobox proteinV$NKXH/DLX1.01DLX-1, -2, and -5 binding0.911005-10171011(+)1.0000.988taatAATTaagtcsitesV$RBIT/BRIGHT.01Bright, B cell regulator of IgH0.921005-10171011(+)1.0000.931taataATTAagtctranscriptionV$TBPF/ATATA.01Avian C-type LTR TATA box0.811005-10211013(+)1.0000.881taataatTAAGtcagagV$CREB/CREBP1.01cAMP-responsive element0.801004-10241014(−)0.7660.819tagctctgACTTaattattaabinding protein 1V$RORA/RORA2.01RAR-related orphan receptor0.821007-10231015(+)0.7500.874ataattaAGTCagagctalpha2V$PCAT/CAAT.01cellular and viral CCAAT box0.901022-10321027(+)0.8560.928ctagCCATtaaV$NKXH/NKX25.02homeo domain factor Nkx-0.881022-10341028(−)1.0000.903tctTAATggctag2.5/Csx. tinman homolog lowaffinity sitesV$CREB/HLF.01hepatic leukemia factor0.841022-10421032(−)0.7700.842ctagtGTTTcttaatggctagV$HOXF/HOX1-3.01Hox-1.3, vertebrate0.831056-10721064(+)1.0000.891gcttcataATTAatatahomeobox proteinV$HOMS/S8.01Binding site for S8 type0.971061-10691065(−)1.0000.995attaATTAthomeodomainsV$NKXH/DLX1.01DLX-1, -2, and -5 binding0.911059-10711065(+)1.0000.988tcatAATTaatatsitesV$RBIT/BRIGHT.01Bright, B cell regulator of IgH0.921059-10711065(+)1.0000.952tcataATTAatattranscriptionV$BRNF/BRN2.01POU factor Brn-2 (N-Oct 3)0.911058-10741066(+)1.0000.945ttcataatTAATatagtV$OCT1/OCT1.06octamer-binding factor 10.801060-10741067(−)1.0000.885actatattAATTatgV$HOXF/HOX1-3.01Hox-1.3, vertebrate0.831061-10771069(−)1.0000.854gatactatATTAattathomeobox proteinV$OCT1/OCT1.06octamer-binding factor 10.801079-10931086(+)0.7500.875tgtatgttCATTtggV$FAST/FAST1.01FAST-1 SMAD interacting0.811080-10941087(+)0.8500.887gtatgttCATTtgggproteinV$RREB/RREB1.01Ras-responsive element0.791081-10951088(−)1.0000.816cCCCAaatgaacatabinding protein 1V$E2FF/E2F.02E2F, involved in cell cycle0.841085-10991092(−)1.0000.849tcagcccCAAAtgaaregulation, interacts with Rbp107 proteinV$CREB/TAXCREB.01Tax/CREB complex0.811091-11111101(+)1.0000.828tggggcTGACacagttctgggV$AP1F/VMAF.01v-Maf0.821092-11121102(+)1.0000.833ggggcTGACacagttctgggaV$MYT1/MYT1.01MyT1 zinc finger transcription0.751123-11351129(+)0.7500.791aggAAGAytacttfactor involved in primaryneurogenesisV$CLOX/CLOX.01Clox0.811136-11521144(−)0.8040.820cctacaATCCatgtaccV$HNF4/HNF4.01Hepatic nuclear factor 40.821156-11721164(−)1.0000.864atagagCAAAggactacV$LEFF/LEF1.01TCF/LEF-1, involved in the0.861157-11731165(−)1.0000.907catagagCAAAggactaWnt signal transductionpathwayV$PERO/PPARA.01PPAR/RXR heterodimers0.701157-11771167(−)1.0000.700tagacatagagcAAAGgactaV$CLOX/CLOX.01Clox0.811173-11891181(+)0.8040.831gtctaaATCCatatatgV$HNF6/HNF6.01Liver enriched Cut-0.821175-11891182(+)0.8330.929ctaaaTCCAtatatgHomeodomain transcriptionfactor HNF6 (ONECUT)V$SRFF/SRF.02serum response factor0.831177-11951186(+)1.0000.851aaatCCATatatgaatgagV$CLOX/CDPCR3.01cut-like homeodomain protein0.751180-11961188(−)1.0000.761actcattcatatATGGaV$PIT1/PIT1.01Pit1, GHF-1 pituitary specific0.861186-11961191(−)1.0000.919actcATTCatapou domain transcriptionfactorV$HMTB/MTBF.01muscle-specific Mt binding0.901196-12041200(−)0.8070.901tggtATGTasiteV$FKHD/HFH8.01HNF-3/Fkh Homolog-80.921200-12161208(−)1.0000.922gaaagayAAACatggtaV$E4FF/E4F.01GLI-Krueppel-related0.821223-12351229(−)0.7890.898gtgAGGTaacccctranscription factor, regulatorof adenovirus E4 promoterV$CREB/HLF.01hepatic leukemia factor0.841221-12411231(+)1.0000.854atgggGTTAcctcactcaggaV$VBPF/VBP.01PAR-type chicken vitellogenin0.861226-12361231(+)1.0000.903gTTACctcactpromoter-binding proteinV$OCT1/OCT.01Octamer binding site0.791259-12731266(−)0.7580.870cgcAGGCaaatgaat(OCT1/OCT2 consensus)V$STAT/STAT6.01STAT6: signal transducer and0.841261-12791270(+)0.7580.850tcattTGCCtgcgaattttactivator of transcription 6V$CDXF/CDX2.01Cdx-2 mammalian caudal0.841270-12881279(+)1.0000.869tgcgaatTTTAagattccarelated intestinal transcr.factorV$SORY/SOX9.01SOX (SRY-related HMG box)0.901280-12961288(−)1.0000.990taaaaCAATggaatcttV$FKHD/HFH2.01HNF-3/Fkh Homolog 20.931285-13011293(−)1.0000.931aggaataaAACAatggaV$CDXF/CDX2.01Cdx-2 mammalian caudal0.841286-13041295(+)1.0000.865ccattgtTTTAttcctctgrelated intestinal transcr.factorV$OCTB/TST1.01POU-factor Tst-1/Oct-60.871288-13021295(−)0.8940.876gaggAATAaaacaatV$PDX1/ISL1.01Pancreatic and intestinal lim-0.821298-13181308(+)1.0000.824tcctctgagTAATactccatthomeodomain factorV$SORY/SOX9.01SOX (SRY-related HMG box)0.901308-13241316(−)1.0000.925ttacaCAATggagtattV$CREB/HLF.01hepatic leukemia factor0.841310-13301320(−)0.9010.920ggtacATTAcacaatggagtaV$VBPF/VBP.01PAR-type chicken vitellogenin0.861315-13251320(−)1.0000.871aTTACacaatgpromoter-binding proteinV$CEBP/CEBPB.01CCAAT/enhancer binding0.941313-13311322(+)0.9290.955tccattgtGTAAtgtaccaprotein betaV$PDX1/ISL1.01Pancreatic and intestinal lim-0.821313-13331323(+)1.0000.859tccattgtgTAATgtaccacahomeodomain factorV$HAML/AML1.01runt-factor AML-11.001323-13371330(−)1.0001.000aaaatgTGGTacattV$GREF/ARE.01Androgene receptor binding0.801323-13411332(+)0.7500.819aatgtaccacaTTTTctccsiteV$TEAF/TEF1.01TEF-1 related muscle factor0.841343-13551349(+)1.0000.896taCATTcttcagtV$CMYB/CMYB.01c-Myb, important in0.991352-13601356(+)1.0000.990caGTTGagghematopoesis, cellularequivalent to avianmyoblastosis virus oncogenev-mybV$AP4R/TH1E47.01Thing1/E47 heterodimer, TH10.931378-13941386(−)1.0000.932gcaatagCCAGaacctgbHLH member specificexpression in a variety ofembryonic tissuesV$CP2F/CP2.01CP20.901384-13941389(−)1.0000.945gcaatagCCAGV$CHOP/CHOP.01heterodimers of CHOP and0.901386-13981392(−)1.0000.951attTGCAatagccC/EBPalphaV$CEBP/CEBP.02C/EBP binding site0.851385-14031394(+)1.0000.853tggctattGCAAataacccV$MEF2/HMEF2.01myocyte enhancer factor0.761384-14061395(+)1.0000.809ctggctattgcAAATaaccctgcV$OCT1/OCT1.03octamer-binding factor 10.851388-14021395(+)1.0000.889ctattgcAAATaaccV$HMTB/MTBF.01muscle-specific Mt binding0.901394-14021398(−)1.0000.900ggttATTTgsiteV$CLOX/CDPCR3.01cut-like homeodomain protein0.751422-14381430(+)0.9750.761acatatgtcattATTGtV$OCT1/OCT1.05octamer-binding factor 10.901423-14371430(+)0.9440.938cATATgtcattattgV$HOXF/HOX1-3.01Hox-1.3, vertebrate0.831423-14391431(+)1.0000.836catatgtcATTAttgtahomeobox proteinV$PDX1/PDX1.01Pdx1 (IDX1/IPF1) pancreatic0.741423-14431433(−)1.0000.889ttcatacaaTAATgacatatgand intestinal homeodomainTFV$SORY/SOX5.01Sox-50.871426-14421434(−)1.0000.870tcataCAATaatgacatV$OCT1/OCT1.05octamer-binding factor 10.901444-14581451(−)0.9440.914aATATgtaaaacagaV$CREB/E4BP4.01E4BP4, bZIP domain,0.801443-14631453(−)1.0000.856tttaaaatatGTAAaacagattranscriptional repressorV$VBPF/VBP.01PAR-type chicken vitellogenin0.861449-14591454(+)1.0000.886tTTACatatttpromoter-binding proteinV$TBPF/MTATA.01Muscle TATA box0.841455-14711463(+)1.0000.841tatttTAAAccatctctV$PBXF/PBX1.01homeo domain factor Pbx-10.781469-14811475(−)1.0000.783caagCAATctagaV$COMP/COMP1.01COMP1, cooperates with0.761467-14871477(+)1.0000.765tctctagATTGcttgtaatatmyogenic proteins inmulticomponent complexV$SORY/SOX5.01Sox-50.871478-14941486(−)1.0000.997tttaaCAATattacaagV$FKHD/FREAC2.01Fork head RElated ACtivator-20.841485-15011493(+)1.0000.885tattgtTAAAcatagagV$PDX1/ISL1.01Pancreatic and intestinal lim-0.821495-15151505(+)1.0000.839catagagagTAATaatgctathomeodomain factorV$HOXF/HOX1-3.01Hox-1.3, vertebrate0.831499-15151507(−)1.0000.872atagcattATTActctchomeobox proteinV$PDX1/PDX1.01Pdx1 (IDX1/IPF1) pancreatic0.741498-15181508(−)0.8260.843tttatagcaTTATtactctctand intestinal homeodomainTFV$CART/XVENT2.01Xenopus homeodomain factor0.821502-15181510(+)1.0000.829agTAATaatgctataaaXvent-2; early BMP signalingresponseV$CDXF/CDX2.01Cdx-2 mammalian caudal0.841507-15251516(−)1.0000.906tttaattTTTAtagcattarelated intestinal transcr.factorV$MEF2/MEF2.05MEF20.961505-15271516(+)1.0000.983aataatgctaTAAAaattaaaaaV$HNF1/HNF1.01hepatic nuclear factor 10.781510-15261518(−)0.7550.805tTTTAatttttatagcaV$OCT1/OCT1.06octamer-binding factor 10.801511-15251518(+)1.0000.832gctataaaAATTaaaV$TBPF/TATA.02Mammalian C-type LTR TATA0.891510-15261518(+)1.0000.991tgctaTAAAaattaaaaboxV$NKXH/MSX.01Homeodomain proteins MSX-10.971514-15261520(−)1.0000.989tttTAATttttatand MSX-2V$RBIT/BRIGHT.01Bright, B cell regulator of IgH0.921515-15271521(+)1.0000.944taaaaATTAaaaatranscriptionV$MEF2/AMEF2.01myocyte enhancer factor0.801514-15361525(+)1.0000.807ataaaaatTAAAaataatgataaV$EVI1/EVI1.02Ecotropic viral integration site0.831526-15421534(+)1.0000.872aataatgatAAGAaaga1 encoded factorV$GATA/GATA1.02GATA-binding factor 10.991528-15401534(+)1.0000.993taatGATAagaaaV$GATA/GATA3.02GATA-binding factor 30.911537-15491543(+)1.0000.931gaaAGATcctataV$GATA/GATA3.02GATA-binding factor 30.911559-15711565(+)1.0000.915tacAGATgaaaatV$OCT1/OCT1.02octamer-binding factor 10.821561-15751568(+)0.7630.867cagATGAaaatttagV$CEBP/CEBPB.01CCAAT/enhancerbinding0.941567-15851576(+)0.9850.964aaaatttaGAAAtactttaprotein betaV$PLZF/PLZF.01Promyelocytic leukemia zink0.861574-15881581(−)0.9580.866agcTAAAgtatttctfinger (TF with nine Krueppel-like zink fingers)V$PAX3/PAX3.01Pax-3 paired domain protein,0.761587-15991593(−)1.0000.763TCGTcagtggtagexpressed in embryogenesis,mutations correlate toWaardenburg SyndromeV$CREB/ATF.01activating transcription factor0.901588-16081598(+)1.0000.923taccacTGACgaaatttgtatV$AP4R/TH1E47.01Thinq1/E47 heterodimer, TH10.931614-16301622(−)1.0000.959tttaattCCAGacattcbHLH member specificexpression in a variety ofembryonic tissuesV$NKXH/MSX.01Homeodomain proteins MSX-10.971619-16311625(−)1.0000.977cttTAATtccagaand MSX-2V$RBIT/BRIGHT.01Bright, B cell regulator of IgH0.921620-16321626(+)1.0000.923ctggaATTAaagatranscriptionV$OCTB/TST1.01POU-factor Tst-1/Oct-60.871620-16341627(+)1.0000.898ctggAATTaaagaaaV$NKXH/DLX3.01Distal-less 3 homeodomain0.911628-16401634(−)1.0000.915cagTAATttcttttranscription factorV$GREF/PRE.01Progesterone receptor binding0.841628-16461637(+)1.0000.922aaagaaattacTGTTctttsiteV$TBPF/TATA.01cellular and viral TATA box0.901636-16521644(−)1.0000.934ttataTAAAgaacagtaelementsV$FKHD/XFD2.01Xenopus fork head domain0.891637-16531645(−)1.0000.890attataTAAAgaacagtfactor 2V$TBPF/TATA.01cellular and viral TATA box0.901638-16541646(−)0.8910.923tattaTATAaagaacagelementsV$CREB/E4BP4.01E4BP4, bZIP domain,0.801638-16581648(−)0.7690.856ctattattatATAAagaacagtranscriptional repressorV$PDX1/ISL1.01Pancreatic and intestinal lim-0.821644-16641654(+)1.0000.836tttatataaTAATagactgtahomeodomain factorV$COMP/COMP1.01COMP1, cooperates with0.761648-16681658(+)0.7910.760tataataATAGactgtaaaatmyogenic proteins inmulticomponent complexV$TBPF/TATA.02Mammalian C-type LTR TATA0.891658-16741666(+)1.0000.912gactgTAAAatggcaacboxV$IRFF/ISRE.01interferon-stimulated0.811662-16761669(+)0.7500.817gtaaaatgGCAActtresponse elementV$XBBF/RFX1.01X-box binding protein RFX10.891660-16781669(+)1.0000.907ctgtaaaatgGCAActtttV$MYT1/MYT1.02MyT1 zinc finger transcription0.881667-16791673(−)1.0000.882taaAAGTtgccatfactor involved in primaryneurogenesisV$OCT1/OCT1.06octamer-binding factor 10.801683-16971690(+)1.0000.878tatttgctAATTcacV$AP1F/TCF11/MafG heterodimers,0.811681-17011691(−)0.7770.865tcctgTGAAttagcaaatattTCF11MAFG.01binding to subclass of AP1sitesV$NKXH/MSX2.01Muscle segment homeo box0.951687-16991693(+)1.0000.969tgCTAAttcacag2, homologue of Drosophila(HOX 8)V$FAST/FAST1.01FAST-1 SMAD interacting0.811687-17011694(−)0.8500.866tcctgtgAATTagcaproteinV$PBXC/Binding site for a Pbx1/Meis10.761686-17021694(+)0.7500788ttgctaatTCACaggatPBX1_MEIS1.03heterodimerV$CIZF/NMP4.01NMP4 (nuclear matrix protein0.971699-17091704(−)1.0000.973agAAAAaatcc4)/CIZ (Cas-interacting zincfinger protein)V$STAT/STAT6.01STAT6: signal transducer and0.841702-17201711(−)1.0000.908agatgTTCCaaagaaaaaaactivator of transcription 6V$AP4R/Tal-1beta/E47 heterodimer0.871710-17261718(−)1.0000.919ttgttCAGAtgttccaaTAL1BETAE47.01V$SORY/HMGIY.01HMGI(Y) high-mobility-group0.921720-17361728(+)1.0000.953tgaacaAATTtcccttaprotein I (Y), architecturaltranscription factor organizingthe framework of a nuclearprotein-DNA transcriptionalcomplexV$MYT1/MYT1.01MyT1 zinc finger transcription0.751723-17351729(+)0.7500.757acaAATTtcccttfactor involved in primaryneurogenesisV$SRFF/SRF.01serum response factor0.661728-17461737(+)1.0000.771tttccctTATAtgaatcacV$HOXF/HOXA9.01Member of the vertebrate0.871731-17471739(−)1.0000.908agtGATTcatataagggHOX - cluster of homeoboxfactorsV$HOXT/Homeobox protein MEIS10.791734-17461740(−)1.0000.797gTGATtcatataaMEIS1_HOXA9.01binding siteV$PIT1/PIT1.01Pit1, GHF-1 pituitary specific0.861737-17471742(−)1.0000.912agtgATTCatapou domain transcriptionfactorV$AP1F/AP1.01AP1 binding site0.951734-17541744(+)0.8810.958ttatatgaATCActtacatttV$VBPF/VBP.01PAR-type chicken vitellogenin0.861746-17561751(+)1.0000.860cTTACatttttpromoter-binding proteinV$FAST/FAST1.01FAST-1 SMAD interacting0.811757-17711764(+)0.8500.829gcctgttCATTtaaaproteinV$HOXF/EN1.01Homeobox protein engrailed0.771759-17751767(−)1.0000.832gtttTTTAaatgaacag(en-1)V$TBPF/MTATA.01Muscle TATA box0.841763-17791771(+)1.0000.853tcattTAAAaaactgcaV$ETSF/ETS2.01c-Ets-2 binding site0.861774-17901782(+)1.0000.866actgcAGGAaagttgtgV$MYT1/MYT1.02MyT1 zinc finger transcription0.881780-17921786(+)1.0000.891ggaAAGTtgtgatfactor involved in primaryneurogenesisV$GFI1/GFI1.01Growth factor independence 10.971782-17961789(−)1.0001.000ataAATCacaactttzinc finger protein acts astranscriptional repressorV$TBPF/TATA.01cellular and viral TATA box0.901784-18001792(−)1.0000.931cattaTAAAtcacaactelementsV$BRNF/BRN2.01POU factor Brn-2 (N-Oct 3)0.911786-18021794(−)1.0000.933tgcattatAAATcacaaV$HOXT/Homeobox protein MEIS10.791788-18001794(+)1.0000.924gTGATttataatgMEIS1_HOXA9.01binding siteV$MEF2/AMEF2.01myocyte enhancer factor0.801783-18051794(−)0.8660.827agttgcatTATAaatcacaacttV$OCTB/TST1.01POU-factor Tst-1/Oct-60.871787-18011794(+)0.8940.898tgtgATTTataatgcV$HOXF/HOXA9.01Member of the vertebrate0.871787-18031795(+)1.0000.971tgtGATTtataatgcaaHOX - cluster of homeoboxfactorsV$BRNF/BRN2.01POU factor Brn-2 (N-Oct 3)0.911788-18041796(+)1.0000.916gtgatttaTAATgcaacV$PARF/DBP.01Albumin D-box binding0.841791-18051798(+)0.8840.891atttaTAATgcaactproteinV$OCT1/OCT1.02octamer-binding factor 10.821795-18091802(+)1.0000.861ataATGCaactgcacV$FKHD/FREAC2.01Fork head RElated ACtivator-20.841816-18321824(+)1.0000.910cagtctTAAAcaatgctV$SORY/SOX5.01Sox-50.871821-18371829(+)1.0000.992ttaaaCAATgctaaccaV$AREB/AREB6.04AREB6 (Atp1a1 regulatory0.981837-18491843(+)1.0000.981actgtGTTTcagcelement binding factor 6)V$MYT1/MYT1.02MyT1 zinc finger transcription0.881848-18601854(−)1.0000.889gggAAGTttatgcfactor involved in primaryneurogenesisV$RBPF/RBPJK.01Mammalian transcriptional0.841851-18651858(−)1.0000.878tgtgTGGGaagtttarepressor RBP-Jkappa/CBF1V$OCT1/OCT1.02octamer-binding factor 10.821875-18891882(+)0.7630.826actATGAaaacacatV$FKHD/FREAC4.01Fork head RElated ACtivator-40.781875-18911883(+)1.0000.786actatgaaAACAcatgcV$EBOX/MYCMAX.02c-Myc/Max heterodimer0.921880-18961888(+)0.8950.920gaaaaCACAtgcttaaaV$PAX6/PAX6.01Pax-6 paired domain protein0.751880-18981889(−)0.7730.791cctttAAGCatgtgttttcV$IRFF/IRF3.01Interferon regulatory factor 30.861891-19051898(+)1.0000.874cttaaaggCAAAtct(IRF-3)V$HNF1/HNF1.02Hepatic nuclear factor 10.761895-19111903(−)0.8580.782aGGTAaagatttgccttV$FKHD/FREAC2.01Fork head RElated ACtivator-20.841898-19141906(−)1.0000.853ctgaggTAAAgatttgcV$E4FF/E4F.01GLI-Krueppel-related0.821902-19141908(−)0.7890.830ctgAGGTaaagattranscription factor, regulatorof adenovirus E4 promoterV$CREB/CREBP1.01cAMP-responsive element0.801900-19201910(+)0.7660.820aaatctttACCTcagttaactbinding protein 1V$VBPF/VBP.01PAR-type chicken vitellogenin0.861905-19151910(+)1.0000.862tTTACctcagtpromoter-binding proteinV$MYT1/MYT1.01MyT1 zinc finger transcription0.751912-19241918(−)0.7500.775gaaTAGTtaactgfactor involved in primaryneurogenesisV$HNF1/HNF1.01hepatic nuclear factor 10.781913-19291921(+)1.0000.811aGTTAactattccatagV$PCAT/CAAT.01cellular and viral CCAAT box0.901928-19381933(+)0.8560.925agagCCATtgaV$HNF6/HNF6.01Liver enriched Cut -0.821929-19431936(−)1.0000.873tgaacTCAAtggctcHomeodomain transcriptionfactor HNF6 (ONECUT)V$PXRF/PXRCAR.01Halfsite of PXR (pregnane X0.981935-19451940(−)1.0000.980ctTGAActcaareceptor)/RXR resp. CAR(constitutive androstanereceptor)/RXR heterodimerbinding siteV$RARF/RTR.01Retinoid receptor-related0.811934-19521943(+)1.0000.854attgagtTCAAgtgcattttestis-associated receptor(GCNF/RTR)V$HOXF/EN1.01Homeobox protein engrailed0.771936-19521944(+)0.7820.813tgagTTCAagtgcattt(en-1)V$NKXH/NKX25.01homeo domain factor Nkx-1.001939-19511945(+)1.0001.000gttcAAGTgcatt2.5/Csx, tinman homolog,high affinity sitesV$GATA/GATA3.02GATA-binding factor 30.911953-19651959(+)1.0000.928agaAGATataatgV$TBPF/TATA.01cellular and viral TATA box0.901968-19841976(−)0.8910.912atataTATAtggccataelementsV$SRFF/SRF.01serum response factor0.661969-19871978(+)1.0000.777atggccaTATAtatatataV$CLOX/CDPCR3.01cut-like homeodomain protein0.751972-19881980(−)1.0000.806atatatatatatATGGcV$PAX1/PAX1.01Pax1 paired domain protein,0.612016-20342025(−)0.7500.675CTGTgctgatatatatataexpressed in the developingvertebral column of mouseembryosV$TBPF/ATATA.01Avian C-type LTR TATA box0.812019-20352027(+)0.7500.827atatataTCAGcacagtV$GFI1/GfI1B.01Growth factor independence 10.822021-20352028(+)1.0000.904ataTATCagcacagtzinc finger protein Gfi-1BV$NRSF/NRSF.01neuron-restrictive silencer0.692025-20452035(+)1.0000.704atcAGCAcagtggaaacagttfactorV$NFAT/NFAT.01Nuclear factor of activated T-0.972033-20432038(+)1.0000.970agtgGAAAcagcellsV$AREB/AREB6.04AREB6 (Atp1a1 regulatory0.982034-20462040(−)1.0000.991taactGTTTccacelement binding factor 6)V$HNF1/HNF1.01hepatic nuclear factor 10.782036-20522044(−)1.0000.798tGTTAttaactgtttccV$FKHD/XFD3.01Xenopus fork head domain0.822038-20542046(+)0.8260.824aaacagttAATAacattfactor 3V$PDX1/PDX1.01Pdx1 (IDX1/IPF1) pancreatic0.742036-20562046(+)1.0000.749ggaaacagtTAATaacattttand intestinal homeodomainTFV$OCT1/OCT1.01octamer-binding factor 10.772050-20642057(−)1.0000.863taTATGctaaaatgtV$TBPF/TATA.01cellular and viral TATA box0.902053-20692061(−)0.8910.908tagtaTATAtgctaaaaelementsV$ETSF/GABP.01GABP: GA binding protein0.852080-20962088(+)1.0000.897gaggctGGAAgggggctV$BEL1/BEL1.01Bel-1 similar region (defined0.782083-21052094(+)1.0000.787gctggaagggggcTCAGcagttain Lentivirus LTRs)V$VMYB/VMYB.01v-Myb0.902097-21072102(−)0.8760.901attAACTgctgV$GREF/ARE.01Androgene receptor binding0.802106-21242115(+)0.7500.840atagcacatacTATTcttcsiteV$PDX1/PDX1.01Pdx1 (IDX1/IPF1) pancreatic0.742137-21572147(+)0.7820.747gtttggtttTCATcacccatgand intestinal homeodomainTFV$MYOD/MYOD.02myoblast determining factor0.982154-21682161(−)1.0000.988gaacCACCtgacatgV$GATA/GATA1.03GATA-binding factor 10.952169-21812175(−)1.0000.958tacaGATAgaaatV$AP4R/Tal-1beta/E47 heterodimer0.872179-21952187(+)1.0000.924gtaacCAGAtgatacgaTAL1BETAE47.01V$OAZF/ROAZ.01Rat C2H2 Zn finger protein0.732204-22202212(−)0.7500.762agGTACccaaggggactinvolved in olfactory neuronaldifferentiationV$GATA/GATA1.01GATA-binding factor 10.962217-22292223(−)1.0000.960aggtGATAgaggtV$MYOD/E47.02TAL1/E47 dimers0.932220-22342227(−)1.0000.939atagCAGGtgatagaV$LTUP/TAACC.01Lentiviral TATA upstream0.712225-22472236(+)0.7590.710cacctgctattctCACCcaaagaelementV$RREB/RREB1.01Ras-responsive element0.792239-22532246(+)1.0000.805aCCCAaagacacacabinding protein 1V$OCT1/OCT1.05octamer-binding factor 10.902251-22652258(−)0.9440.904tGTATgtgagtgtgtV$OCT1/OCT1.02octamer-binding factor 10.822282-22962289(+)1.0000.854tgcATGCacatagttV$COUP/COUP.01COUP antagonizes HNF-4 by0.812284-22982291(−)0.9770.855tGAACtatgtgcatgbinding site competition orsynergizes by directprotein-protein interaction with HNF-4V$MEF2/MEF2.01myogenic enhancer factor 20.742290-23122301(+)0.7500.767catagttcAAAAaataaaattttV$CDXF/CDX2.01Cdx-2 mammalian caudal0.842296-23142305(−)1.0000.896ttaaaatTTTAttttttgarelated intestinal transcr.factorV$MYT1/MYT1.01MyT1 zinc finger transcription0.752301-23132307(−)0.7500.798taaAATTttatttfactor involved in primaryneurogenesisV$NFAT/NFAT.01Nuclear factor of activated T-0.972314-23242319(+)1.0000.991aaagGAAAaaacellsV$CIZF/NMP4.01NMP4 (nuclear matrix protein0.972317-23272322(+)1.0000.977ggAAAAaaagc4)/CIZ (Cas-interacting zincfinger protein)V$GATA/GATA3.02GATA-binding factor 30.912326-23382332(−)1.0000.946aaaAGATttgagcV$HMTB/MTBF.01muscle-specific Mt binding0.902351-23592355(−)1.0000.901aggaATTTtsiteV$NOLF/OLF1.01olfactory neuron-specific0.822350-23722361(+)0.8060.820taaaatTCCTatgagtgtgtgatfactorV$PDX1/PDX1.01Pdx1 (IDX1/IPF1) pancreatic0.742363-23832373(−)0.7820.753tactgacttTGATcacacactand intestinal homeodomainTFV$GATA/GATA3.02GATA-binding factor 30.912395-24072401(−)1.0000.942cacAGATtataccV$NFAT/NFAT.01Nuclear factor of activated T-0.972406-24162411(+)1.0000.971tgtgGAAAacacellsV$OCTP/OCT1P.01octamer-binding factor 1,0.862433-24452439(+)0.9800.879ctcagtATTCacaPOU-specific domainV$MITF/MIT.01MIT (microphthalmia0.812438-24562447(−)1.0000.827ctactttCATGtgtgaatatranscription factor) and TFE3V$PAX8/PAX8.01PAX 2/5/8 binding site0.882441-24532447(−)0.8500.952cttTCATgtgtgaV$TBPF/ATATA.01Avian C-type LTR TATA box0.812451-24672459(+)1.0000.838aagtagcTAAGaataaaV$GATA/GATA3.02GATA-binding factor 30.912462-24742468(−)1.0000.960aatAGATtttattV$CLOX/CLOX.01Clox0.812462-24782470(+)0.8060.819aataaaATCTattcatcV$HNF6/HNF6.01Liver enriched Cut-0.822464-24782471(+)0.7850.846taaaaTCTAttcatcHomeodomain transcriptionfactor HNF6 (ONECUT)V$PIT1/PIT1.01Pit1, GHF-1 pituitary specific0.862468-24782473(+)1.0000.890atctATTCatcpou domain transcriptionfactorV$AP4R/Tal-1beta/ITF-2 heterodimer0.852469-24852477(−)1.0000.881aaaaaCAGAtgaatagaTAL1BETAITF2.01V$CIZF/NMP4.01NMP4 (nuclear matrix protein0.972477-24872482(−)1.0000.981ggAAAAacaga4)/CIZ (Cas-interacting zincfinger protein)V$NFAT/NFAT.01Nuclear factor of activated T-0.972480-24902485(−)1.0000.976taagGAAAaaccellsV$STAT/STAT.01signal transducers and0.872479-24972488(−)1.0000.872aggattttaaGGAAaaacaactivators of transcriptionV$TBPF/TATA.02Mammalian C-type LTR TATA0.892484-25002492(+)1.0000.897ttcctTAAAatcctggcboxV$FKHD/XFD3.01Xenopus fork head domain0.822501-25172509(−)1.0000.880actgagtcAACActgtafactor 3V$AP1F/AP1.01AP1 binding site0.952500-25202510(−)1.0000.984accactgaGTCAacactgtagV$AP1F/AP1.01AP1 binding site0.952504-25242514(+)0.9640.984agtgttgaCTCAgtggttgctV$PCAT/CAAT.01cellular and viral CCAAT box0.902513-25232518(−)0.8260.904gcaaCCACtgaV$CDXF/CDX2.01Cdx-2 mammalian caudal0.842524-25422533(+)1.0000.883tttaaatTTTAtgctcaaarelated intestinal transcr.factorV$MYT1/MYT1.02MyT1 zinc finger transcription0.882539-25512545(+)1.0000.891caaAAGTtgaagcfactor involved in primaryneurogenesisV$ETSF/FLI.01ETS family member FLI0.812560-25762568(+)1.0000.829tgaaCCGGtaattctacV$MYT1/MYT1.01MyT1 zinc finger transcription0.752569-25812575(−)1.0000.757acaAAGTagaattfactor involved in primaryneurogenesisV$TBPF/ATATA.01Avian C-type LTR TATA box0.812576-25922584(−)0.7500.816aagtattTAATacaaagV$SATB/SATB1.01Special AT-rich sequence-0.932578-25942586(−)1.0000.939acaagtattTAATacaabinding protein 1,predominantly expressed inthymocytes, binds to matrixattachment regions (MARs)V$NKXH/NKX31.01prostate-specific0.842584-25962590(−)1.0000.865taacAAGTatttahomeodomain protein NKX3.1V$PARF/DBP.01Albumin D-box binding0.842589-26032596(+)1.0000.882acttgTTATgcatcgproteinV$PAX5/PAX5.02B-cell-specific activating0.752591-26192605(−)1.0000.758aacttgatttgttgAGCGatgcataacaaproteinV$ECAT/NFY.03nuclear factor Y (Y-box0.802604-26182611(+)0.7500.809ctcaaCAAAtcaagtbinding factor)V$GFI1/GFI1.01Growth factor independence 10.972608-26222615(+)1.0000.976acaAATCaagttttazinc finger protein acts astranscriptional repressorV$HNF6/HNF6.01Liver enriched Cut -0.822608-26222615(+)1.0000.830acaaaTCAAgttttaHomeodomain transcriptionfactor HNF6 (ONECUT)V$MYT1/MYT1.01MyT1 zinc finger transcription0.752610-26222616(−)0.7500.756taaAACTtgatttfactor involved in primaryneurogenesisV$PAX8/PAX8.01PAX 2/5/8 binding site0.882610-26222616(+)1.0000.907aaaTCAAgttttaV$TTFF/TTF1.01Thyroid transcription factor-10.922609-26232616(+)1.0000.936caaatCAAGttttaa(TTF1) binding siteV$MYT1/MYT1.02MyT1 zinc finger transcription0.882612-26242618(+)1.0000.887atcAAGTtttaacfactor involved in primaryneurogenesisV$CDXF/CDX2.01Cdx-2 mammalian caudal0.842612-26302621(+)1.0000.883atcaagtTTTAacacaccarelated intestinal transcr.factorV$SORY/HMGIY.01HMGI(Y) high-mobility-group0.922649-26652657(−)1.0000.925ttaaaaAATTtaagataprotein I (Y), architecturaltranscription factor organizingthe framework of a nuclearprotein-DNA transcriptionalcomplexV$HOXF/EN1.01Homeobox protein engrailed0.772657-26732665(+)1.0000.780atttTTTAaatgggcat(en-1)V$OCT1/OCT1.06octamer-binding factor 10.802662-26762669(−)0.7500.818tttatgccCATTtaaV$BCL6/BCL6.01POZ/zinc finger protein,0.762683-26992691(+)1.0000.796ctaTTCCtacagaagtctranscriptional repressor,translocations observed indiffuse large cell lymphomaV$OCTP/OCT1P.01octamer-binding factor 1,0.862715-27272721(+)1.0000.860ctgaaaATGCattPOU-specific domainV$TEAF/TEF1.01TEF-1 related muscle factor0.842722-27342728(+)1.0000.898tgCATTcctgattV$GFI1/GFI1.01Growth factor independence 10.972723-27372730(−)1.0000.981ataAATCaggaatgczinc finger protein acts astranscriptional repressorV$HOXT/Homeobox protein MEIS10.792729-27412735(+)1.0000.929cTGATttatgtaaMEIS1_HOXA9.01binding siteV$HOXF/HOXA9.01Member of the vertebrate0.872728-27442736(+)1.0000.964cctGATTtatgtaaataHOX - cluster of homeoboxfactorsV$PARF/DBP.01Albumin D-box binding0.842729-27432736(+)1.0000.861ctgatTTATgtaaatproteinV$VBPF/VBP.01PAR-type chicken vitellogenin0.862732-27422737(−)1.0000.929tTTACataaatpromoter-binding proteinV$CREB/E4BP4.01E4BP4, bZIP domain,0.802728-27482738(+)1.0000.943cctgatttatGTAAatatatgtranscriptional repressorV$OCT1/OCT1.01octamer-binding factor 10.772733-27472740(+)1.0000.895ttTATGtaaatatatV$FKHD/XFD1.01Xenopus fork head domain0.902733-27492741(+)1.0000.940tttatgTAAAtatatgtfactbr 1V$SRFF/SRF.01serum response factor0.662736-27542745(+)1.0000.691atgtaaaTATAtgtatataV$OCTP/OCT1P.01octamer-binding factor 1,0.862746-27582752(+)0.8490.883atgtatATACataPOU-specific domainV$CLOX/CDPCR3.01cut-like homeodomain protein0.752748-27642756(+)0.8880.755gtatatacatatATAGcV$TBPF/TATA.01cellular and viral TATA box0.902749-27652757(−)0.8910.903ggctaTATAtgtatataelementsV$SRFF/SRF.01serum response factor0.662750-27682759(+)1.0000.709atatacaTATAtagccttaV$TBPF/ATATA.01Avian C-type LTR TATA box0.812759-27752767(−)1.0000.816ttgttttTAAGgctataV$TBPF/TATA.02Mammalian C-type LTR TATA0.892762-27782770(+)1.0000.899agcctTAAAaacaaagaboxV$CABL/CABL.01Multifunctional c-Abl src type0.972769-27792774(+)1.0000.973aaAACAaagattyrosine kinaseV$LEFF/LEF1.01TCF/LEF-1, involved in the0.862766-27822774(+)1.0000.863ttaaaaaCAAAgattgtWnt signal transductionpathwayV$OCT1/OCT1.06octamer-binding factor 10.802775-27892782(+)1.0000.811aagattgtAATTtttV$MEF2/MMEF2.01myocyte enhancer factor0.902776-27982787(−)1.0000.900acaatttaTAAAaattacaatctV$OCT1/OCT1.06octamer-binding factor 10.802780-27942787(−)1.0000.844tttataaaAATTacaV$TBPF/TATA.01cellular and viral TATA box0.902779-27952787(−)1.0000.956atttaTAAAaattacaaelementsV$CART/CART1.01Cart-1 (cartilage0.842780-27962788(+)1.0000.875tgTAATttttataaatthomeoprotein 1)V$FKHD/XFD2.01Xenopus fork head domain0.892780-27962788(−)1.0000.903aatttaTAAAaattacafactor 2V$MEF2/MEF2.05MEF20.962778-28002789(−)1.0000.973tcacaatttaTAAAaattacaatV$BRNF/BRN3.01POU transcription factor Brn-30.782785-28012793(−)0.7500.798atcACAAtttataaaaaV$TBPF/TATA.01cellular and viral TATA box0.902786-28022794(+)1.0000.927ttttaTAAAttgtgattelementsV$GFI1/GFI1.01Growth factor independence 10.972791-28052798(−)1.0000.997aaaAATCacaatttazinc finger protein acts astranscriptional repressorV$HOXT/Homeobox protein MEIS10.792797-28092803(+)1.0000.806gTGATttttaaaaMEIS1_HOXA9.01binding siteV$MEF2/MMEF2.01myocyte enhancer factor0.902792-28142803(−)1.0000.923tattttttTAAAaatcacaatttV$MEF2/MEF2.05MEF20.962795-28172806(+)1.0000.990ttgtgattttTAAAaaaataaacV$MEF2/MMEF2.01myocyte enhancer factor0.902797-28192808(+)1.0000.905gtgattttTAAAaaaataaacctV$HNF1/HNF1.01hepatic nuclear factor 10.782802-28182810(−)0.7550.796gGTTTatttttttaaaaV$MEF2/MEF2.01myogenic enhancer factor 20.742799-28212810(+)0.7500.775gatttttaAAAAaataaacctgcV$HOXF/HOX1-3.01Hox-1.3, vertebrate0.832814-28302822(+)1.0000.848aaacctgcATTAtcttchomeobox proteinV$PARF/DBP.01Albumin D-box binding0.842816-28302823(−)0.8840.851gaagaTAATgcaggtproteinV$PDX1/ISL1.01Pancreatic and intestinal lim-0.822814-28342824(−)1.0000.853tgctgaagaTAATgcaggttthomeodomain factorV$GATA/GATA1.02GATA-binding factor 10.992819-28312825(−)1.0000.993tgaaGATAatgcaV$HEAT/HSF1.01heat shock factor 10.932845-28552850(+)0.8670.951TGAAtgttcctV$MYT1/MYT1.02MyT1 zinc finger transcription0.882853-28652859(+)1.0000.893cctAAGTtttgtafactor involved in primaryneurogenesisV$BCL6/BCL6.02POZ/zinc finger protein,0.772857-28732865(+)1.0000.772agttttgTAGAacttgatranscriptional repressor,translocations observed indiffuse large cell lymphomaV$TTFF/TTF1.01Thyroid transcription factor-10.922863-28772870(−)1.0000.927cgtgtCAAGttctac(TTF1) binding siteV$EBOX/USF.02upstream stimulating factor0.942868-28842876(−)1.0000.997tctgccaCGTGtcaagtV$HOXF/PTX1.01Pituitary Homeobox 1 (Ptx1)0.792892-29082900(+)1.0000.795aggattTTAGtctacacV$MYOD/complex of Lmo2 bound to0.982901-29152908(−)1.0000.981gatgCAGGtgtagacLMO2COM.01Tal-1, E2A proteins, andGATA-1, half-site 1V$REBV/EBVR.01Epstein-Barr virus0.812904-29242914(−)1.0000.832ctgtcctcagatgcaGGTGtatranscription factor RV$ETSF/PU1.01Pu.1 (Pu120) Ets-like0.862932-29482940(+)1.0000.873ctaacaGGAAaggagactranscription factor identifiedin lymphoid B-cellsV$MITF/MIT.01MIT (microphthalmia0.812943-29612952(+)1.0000.829ggagacaCATGtgtggtagtranscription factor) and TFE3V$HAML/AML1.01runt-factor AML-11.002950-29642957(+)1.0001.000catgtgTGGTagttcV$NFKB/CREL.01c-Rel0.912954-29682961(+)1.0000.919tgtggtagTTCCcagV$IKRS/IK3.01Ikaros 3, potential regulator0.842958-29702964(−)1.0000.841aactgGGAActacof lymphocyte differentiationV$RBPF/RBPJK.01Mammalian transcriptional0.842957-29712964(−)1.0000.842aaacTGGGaactaccrepressor RBP-Jkappa/CBF1V$E2FF/E2F.01E2F, involved in cell cycle0.742966-29802973(−)0.7500.784ttcacgtCAAAactgregulation, interacts with Rbp107 proteinV$E4FF/E4F.01GLI-Krueppel-related0.822968-29802974(−)1.0000.830ttcACGTcaaaactranscription factor, regulatorof adenovirus E4 promoterV$CREB/ATF6.02Activating transcription factor0.852966-29862976(+)1.0000.985cagttttGACGtgaaaagtcc6, member of b-zip family,induced by ER stressV$EBOX/ARNT.01AhR nuclear translocator0.892968-29842976(+)1.0000.891gttttgaCGTGaaaagthomodimersV$E4FF/E4F.01GLI-Krueppel-related0.822971-29832977(+)1.0000.909ttgACGTgaaaagtranscription factor, regulatorof adenovirus E4 promoterV$EBOR/XBP1.01X-box-binding protein 10.862970-29842977(+)1.0000.890tttgACGTgaaaagtV$E2FF/E2F.01E2F, involved in cell cycle0.742971-29852978(+)1.0000.837ttgacgtGAAAagtCregulation, interacts with Rbp107 proteinV$STAT/STAT.01signal transducers and0.872989-30072998(+)1.0000.937cattcttactGGAAacctcactivators of transcriptionV$BCL6/BCL6.02POZ/zinc finger protein,0.772991-30072999(+)0.8000.805ttcttacTGGAaacctctranscriptional repressor,translocations observed indiffuse large cell lymphomaV$XSEC/STAF.01Se-Cys tRNA gene0.773003-30253014(+)0.7820.791acctCCCTgaatccatgccaagctranscription activating factorV$NF1F/NF1.01Nuclear factor 10.943007-30253016(−)1.0000.964gctTGGCatggattcagggV$OCT1/OCT1.02octamer-binding factor 10.823014-30283021(+)1.0000.820tccATGCcaagcactV$RCAT/Mammalian C-type LTR0.753019-30433031(+)1.0000.787gCCAAgcactacccatcaccttgacCLTR_CAAT.01CCAAT boxV$SF1F/SF1.01SF1 steroidogenic factor 10.953033-30453039(−)1.0000.954cagtCAAGgtgatV$OCT1/OCT1.01octamer-binding factor 10.773038-30523045(−)1.0000.800ctTATGccagtcaagV$PARF/DBP.01Albumin D-box binding0.843042-30563049(−)1.0000.862agtgcTTATgccagtproteinV$ETSF/ETS1.01c-Ets-1 binding site0.923057-30733065(−)1.0000.920atcaaAGGAaatgagtgV$LEFF/LEF1.01TCF/LEF-1, involved in the0.863062-30783070(−)1.0000.969ggggcatCAAAggaaatWnt signal transductionpathwayV$MAZF/MAZ.01Myc associated zinc finger0.903072-30843078(−)1.0000.912gaggGAGGggcatprotein (MAZ)V$SP1F/GC.01GC box elements0.883071-30853078(−)0.8760.920tgagGGAGgggcatcV$TBPF/TATA.01cellular and viral TATA box0.903091-31073099(+)1.0000.973tattaTAAAagcacagtelementsV$SEF1/SEF1.01SEF1 binding site0.693099-31173108(−)1.0000.700gaaagagacgaCTGTgctt


[0087]


Claims
  • 1. An isolated nucleic acid sequence of SEQ ID NO: 2.
  • 2. An isolated nucleic acid sequence comprising at least about nucleotides 1-3137 of SEQ ID NO: 2 (SEQ ID NO: 23).
  • 3. A reporter construct, comprising: a. a nucleotide sequence which encodes a detectable product; and b. a regulatory region nucleotide sequence linked to the 5′ end of nucleotide sequence that encodes a detectable product, wherein the regulatory region nucleotide sequence comprises of one or more regions chosen from nucleotides 1-3137 of SEQ ID NO: 2.
  • 4. A reporter construct of claim 3, further comprising: a. a promoter element interposed between the regulatory region nucleotide sequence and the nucleotide sequence encoding the detectable product.
  • 5. The reporter construct of claim 3 wherein the regulatory region nucleotide sequence is selected from the group consisting of SEQ ID NO: 1, 2, 23, 32, 35, 37, 28, 24, 25, 26, 27, 29, 30, 31, 33, 34, 38, and 36.
  • 6. The reporter construct of claim 4 wherein the regulatory region nucleotide sequence is selected from the group consisting of SEQ ID NO: 1, 2, 23, 32, 35, 37, 28, 24, 25, 26, 27, 29, 30, 31, 33, 34, 38, and 36.
  • 7. The reporter construct of claim 4 wherein the promoter element is selected from SEQ ID NO: 2.
  • 8. A host cell comprising the reporter construct of claim 3.
  • 9. A host cell comprising the reporter construct of claim 4.
  • 10. A method for identifying agents which modulate INGAP expression, comprising: a. contacting the host cell of claim 8 with a test agent; b. determining expression of the detectable protein or nucleic acid product in the cell; and c. identifying the test agent as a modulator of INGAP expression if the test agent modulates expression of the detectable product in the cell.
  • 11. A method for identifying agents which modulate INGAP expression, comprising: a. contacting the host cell of claim 9 with a test agent; b. determining expression of the detectable protein or nucleic acid product in the cell; and c. identifying the test agent as a modulator of INGAP expression if the test agent modulates expression of the detectable product in the cell.
  • 12. An in vitro method for identifying agents which modulate INGAP expression, comprising: a. contacting a reporter construct according to claim 3 with a test substance under conditions sufficient for transcription and translation of said nucleotide sequence; b. determining expression of the detectable protein or nucleic acid product; and c. identifying the test substance as a modulator of INGAP expression if the test substance modulates expression of the detectable product.
  • 13. An in vitro method for identifying agents which modulate INGAP expression, comprising: a. contacting a reporter construct according to claim 4 with a test substance under conditions sufficient for transcription and translation of said nucleotide sequence; b. determining expression of the detectable protein or nucleic acid product; and c. identifying the test substance as a modulator of INGAP expression if the test substance modulates expression of the detectable product.
  • 14. An in vitro method for identifying agents which modulate INGAP expression, comprising: a. contacting the nucleic acid or fragments thereof of claim 2 with a test agent; b. determining binding of the test agent to the nucleic acid; and c. identifying the test agent as a potential modulator of INGAP expression if the test agent binds to the nucleic acid.
  • 15. A method for modulating INGAP expression in a mammal in need thereof, comprising administering to the mammal an effective amount of a factor that stimulates INGAP expression in the said mammal.
  • 16. The method of claim 15 further comprising: administering to the mammal an effective amount of hLIF.
  • 17. The method of claim 15 further comprising: administering to the mammal an effective amount of PMA.
Parent Case Info

[0001] This application incorporates by reference co-pending provisional applications Serial No. 60/388,315 filed Jun. 14, 2002, Serial No. 60/361,073 filed Mar. 1, 2002, and Serial No. 60/346,898 filed Jan. 11, 2002.

Provisional Applications (3)
Number Date Country
60388315 Jun 2002 US
60361073 Mar 2002 US
60346898 Jan 2002 US