ASSAY FOR THE DETECTION OF FACTORS THAT MODULATE THE EXPRESSION OF INGAP

REFERENCE TO SEQ ID

The Sequence listing in “1247090060.tx” created on Aug. 1, 2011, being 108 KB in size, is incorporated by reference.

FIELD OF THE INVENTION

The invention relates to the field of assays for the detection of factors that modulate gene expression. Specifically, the invention relates to reporter constructs and methods for identifying agents that modulate the expression of the INGAP gene.

BACKGROUND OF THE INVENTION

Islet neogenesis gene associated protein (INGAP protein) has been identified as a pancreatic acinar cell protein that can induce islet cell neogenesis from progenitor cells resident in the pancreas in a manner that recapitulates islet development during normal embryogenesis. INGAP is unique in its ability to stimulate growth and differentiation of islets of Langerhans from precursor cells associated with pancreas. These islets evolve a mature insulin secretory profile capable of responding to perturbations in blood glucose in a physiologic manner. This potential anti-diabetic therapeutic has been shown to demonstrate homology across several species and to exert a biological response.

Pancreatic islet cell mass is lost in type 1 diabetes mellitus, a disease in which a progressive autoimmune reaction results in the selective destruction of insulin-producing β-cells. In type 2 diabetes mellitus, so-called adult-onset disease, but also increasingly a condition in young overweight people, the β-cell mass may be reduced by as much as 60% of normal. The number of functioning β-cells in the pancreas is of critical significance for the development, course, and outcome of diabetes. In type I diabetes, there is a reduction of β-cell mass to less than 2% of normal. Even in the face of severe insulin resistance as occurs in type II diabetes, the development of diabetes only occurs if there is inadequate compensatory increase in β-cell mass. Thus, the development of either of the major forms of diabetes can be regarded as a failure of adaptive β-cell growth and a subsequent deficiency in insulin secretion. Stimulating the growth of islets and β-cells from precursor cells, known as islet neogenesis, is an attractive approach to the amelioration of diabetes. There is need in the art for methods to identify agents that can modulate the expression of INGAP, whether in animals or in cultured cells.

BRIEF SUMMARY OF THE INVENTION

It is an object of the invention to provide a reporter construct containing the 5′-regulatory region from mammalian INGAP gene.

It is another object of the invention to provide methods for identifying agents which modulate INGAP expression.

It is another object of the invention to provide a nucleic acid or fragment of INGAP 5′-regulatory region.

It is another object of the invention to provide methods for increasing INGAP expression.

It is another object of the invention to provide a kit for modulating INGAP expression.

These and other objects of the invention are provided by one or more of the embodiments described below.

In one aspect of the invention a reporter construct is provided. The reporter construct comprises a regulatory region nucleotide sequence and a nucleotide sequence encoding a detectable product. In one aspect of the invention, the reporter construct is provided in a vector. The regulatory region nucleotide sequence is linked to the nucleotide sequence encoding a detectable product. The regulatory region nucleotide sequence may comprise one or more fragments of 5′ regulatory region of the INGAP genomic sequence, SEQ ID NO: 23, or it may comprise the entire length of the 5′ regulatory region. In one embodiment of the reporter construct, a promoter element is interposed between the regulatory region nucleotide sequence and the nucleotide sequence encoding a detectable product. The promoter element may be selected from the promoter elements present in the INGAP regulatory sequence. Alternatively, the promoter element present in the vector comprising the reporter construct may be used. The detectable product encoded by the said nucleotide sequence encoding a detectable product could be either a nucleic acid or a protein. The detectable product need not be the INGAP gene nucleic acid or protein.

In another embodiment of the invention, a method identifying agents that modulate INGAP expression is provided. The method comprises contacting a cell with a test agent, wherein the cell comprises a reporter construct of the present invention. Expression of the detectable nucleic acid or protein product in the cell is determined. A test agent is identified as a modulator of INGAP expression if the test agent modulates expression of the detectable product in the cell.

In another embodiment of the invention, an isolated nucleic acid comprising the genomic sequence of the hamster INGAP gene (SEQ ID NO: 2), or a fragment thereof is provided.

According to another embodiment of the invention, an in vitro method for identifying agents that modulate INGAP expression is provided. The method comprises contacting a test agent with a reporter construct of the present invention in a cell-free system that allows for transcription and translation of a nucleotide sequence. Expression of the detectable product is determined. The substance is identified as a modulator of INGAP expression if the test substance modulates expression of the detectable product.

According to another embodiment of the invention, an in vitro method for identifying an agent that modulate INGAP expression is provided. The method comprises contacting a test agent with a nucleic acid of the invention. Binding of the test agent to the nucleic acid is determined. The test agent is identified as a modulator of INGAP expression if the test agent binds to the nucleic acid.

According to another embodiment of the invention a method for increasing INGAP expression is provided. An effective amount of a factor that stimulates INGAP expression directly or indirectly, for example cytokines, chemokines, growth factors, or pharmacological agents, is administered to a mammal in need of increased INGAP expression.

According to another embodiment of the invention a kit for modulating INGAP expression is provided. The kit comprises a modulator of INGAP expression and instructions for using the modulator of INGAP expression to modulate INGAP expression.

According to another embodiment of the invention a method for modulating INGAP expression in a mammal to treat a disease state related to reduced islet cell function is provided. The method comprises the step of administering to the mammal an effective amount of a modulator of INGAP expression whereby the level of INGAP expression in the mammal is modified.

All documents cited are, in relevant part, incorporated herein by reference; the citation of any document is not to be construed as an admission that it is prior art with respect to the present invention.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 shows the annotation of the hamster INGAP gene structure. The boundaries of introns 1-5 are listed in Table 1.

FIG. 2 shows an overview of the 5′-regulatory region of the hamster INGAP gene (nucleotides 1-3137 of SEQ ID NO: 2) showing many well known and well-characterized transcription factor binding sites. The minimal promoter element contains the regions noted with an underline (CAAT-box, TATA-box, and GC-box).

FIG. 3 shows a schematic of many well known and well-characterized transcription factor-binding sites for nucleotides 1-3123 of the 5′-regulatory region (SEQ ID NO: 1) of the hamster INGAP gene. Table 3 further describes these transcription factor-binding sites.

FIG. 4 shows the predicted transcription start sites within the 5′-regulatory region (SEQ ID NO: 1) of the hamster INGAP gene (SEQ ID NO: 2). The predicted start site is indicated by a boldface nucleotide. The start and end nucleotide numbers are indicated for the promoter sequence. The numbers refer to nucleotide numbers of the hamster INGAP gene (SEQ ID NO: 2)

FIG. 5 shows the adapter primer structure and sequence used in gene walking. Adapter primer 1 (AP1) and adapter primer 2 (AP2) are shown.

FIGS. 6 and 7 show the strategy for reconstructing the hamster INGAP gene. The hamster INGAP gene was reconstructed using the technique of gene walking. Shown are the fragments and the gene specific primers (GSP1 and GSP2) used in PCR amplification for gene walking. Fragments were joined together using unique restriction enzyme sites within each fragment. The nucleotide sequences of the individual primers are listed in Table 2.

FIG. 8 shows the fragments of INGAP 5′-regulatory region, which were cloned into pβGal-basic upstream of a β-galactosidase reporter gene. The labels on the left refer to the nucleotide fragments of SEQ ID NO: 23 which were cloned upstream of pβGal-basic.

FIG. 9A shows reporter activity in human embryonic kidney cells (293T) transfected with a reporter construct that contains various fragments of the 5′-regulatory region (SEQ ID NO: 23) of hamster INGAP DNA cloned upstream of a β-galactosidase reporter gene (pβGal-basic), or in a reporter construct which contains no INGAP DNA. The cells are stimulated with phorbol myristate acetate. Promoter activity is assessed by determining the level of β-galactosidase present in the cell using a β-galactosidase luminescent assay.

FIG. 9B shows reporter activity in human embryonic kidney cells (293T) transfected with a reporter construct that contains nucleotides 2030 to 3137 of the 5′-regulatory region (SEQ ID NO: 23) of hamster INGAP cloned upstream of a β-galactosidase reporter gene, or in a reporter construct which contains no INGAP DNA. The cells are stimulated with leukemia inhibitory factor. Promoter activity is assessed by determining the level of β-galactosidase present in the cell using a β-galactosidase luminescent assay.

FIG. 10 shows the reporter activity in human embryonic kidney cells (293T) transfected with a reporter construct that contains different fragments (see FIG. 8) of the 5′-regulatory region of hamster INGAP cloned upstream of a β-galactosidase reporter gene. The cells are stimulated with phorbol myristate acetate. Concentrations of PMA used are 6 ng/ml, 17 ng/ml, 50 ng/ml, 100 ng/ml, or 300 ng/ml. Promoter activity is assessed by determining the level of β-galactosidase present in the cell using a β-galactosidase luminescent assay.

FIG. 11 shows reporter activity in human embryonic kidney cells (293T) transfected with a reporter construct that contains different fragments (see FIG. 8) of the 5′-regulatory region of hamster INGAP cloned upstream of a β-galactosidase reporter gene. The cells are stimulated with human leukemia inhibitory factor (hLIF). Concentrations of hLIF used are 1 ng/ml, 10 ng/ml, or 30 ng/ml. Promoter activity was assessed by determining the level of β-galactosidase present in the cell using a β-galactosidase luminescent assay.

FIG. 12 shows RNA analysis for INGAP gene upregulation in rat amphicrine pancreatic cells, AR42J, treated with cytokine IL-6 or untreated. Total RNA is probed by Northern analysis for INGAP gene.

DETAILED DESCRIPTION OF THE INVENTION
Definitions

It must be noted that as used herein and in the appended claims, the singular forms “a”, “an”, and “the” include plural references unless the context clearly dictates otherwise.

The term “promoter” is used to define the region of a gene at which initiation and rate of transcription are controlled. It contains the site at which RNA polymerase binds and also sites for the binding of regulatory proteins, e.g. transcription factors, repressors, etc. In order to differentiate between the transcription initiation site and other sites that modulate rate of transcription, promoter region is generally subdivided into “minimal promoter element” and “regulatory region”. The term “minimal promoter element” or sometimes simply referred to as “promoter” therefore may include TATA box, GC-rich sequence and CAAT box; while “regulatory region” is usually a long stretch of nucleotide sequence where transcription factors and other factors bind. Most eukaryotic genes have long regulatory regions where many different transcription factors bind. The expression or the lack of expression of a given gene in a given cell type, tissue, organ, or an organism is governed by the interactions that take place on its regulatory region.

The term “transcription factor” is used to describe the proteins that bind short stretches of DNA in the regulatory regions of a gene. Transcription factors may interact with each other as well as RNA polymerase. Thus, transcription factors may bind hormones or second messengers, DNA, RNA, other transcription factors, or other proteins. They may activate or inhibit transcription of a given gene. Transcription factors are also sometimes referred to as “enhancers” or “repressors”. Transcription factor binding sites can be used to identify agents that bind to the 5′-regulatory region of the gene and modulate the gene's expression.

The term “reporter” is used to describe a coding sequence attached to a heterologous promoter or enhancer elements and whose product, either nucleic acid or protein, is easily detected and is quantifiable. Some common reporter genes include β-galactosidase (lacZ), chloramphenicol acetyltransferase (cat), β-glucuronidase (GUS), and green fluorescent protein (GFP).

A “reporter construct” is a piece of nucleic acid that includes a promoter element and a reporter gene housed in a suitable vector plasmid DNA. Regulatory region nucleotide sequences may be cloned 5′ of the promoter element to determine if they contain transcription factor binding sites. The reporter construct-containing vector is introduced into a cell that contains many transcription factors. Activation of the reporter gene by transcription factors may be monitored by detection and quantification of the product of the reporter gene.

The term “agent” is used here to essentially describe any means to modulate INGAP expression. Agent may be a chemical compound, a biological agent, or a physical force, a mechanical contraption, or any combinations thereof.

INGAP Promoter and Regulatory Region

It is a discovery of the present inventors that INGAP gene is regulated by a 5′-regulatory region that is susceptible to modulation by many known transcription factors, including PMA and LIF.

It is a further discovery of the present invention that the 5′-regulatory region nucleotide sequence of the INGAP gene may be used in screening assays to identify agents capable of modulating the INGAP gene expression. These modulating agents have potential as therapeutic agents for treating pathological conditions including, but not limited to, diabetes mellitus, both type 1 and type 2, endocrine and non-endocrine hypoplasia, hypertrophy, adenoma, neoplasia, and nesidioblastosis.

Mammalian INGAP, like most genes, has a 5′-regulatory region followed by introns and exons. The sequence of a mammalian (Hamster sp.) INGAP gene is provided as SEQ ID NO: 2. FIG. 1 details the relative location of the 5′-regulatory region, the introns and the exons of the hamster INGAP gene. The boundaries of introns 1-5 and the location of the TATA-box and the poly-A signal are listed in Table 1.

TABLE 1

Position In INGAP

Description
Gene (SEQ ID NO: 2)

TATA-Box
3094

INTRON 1
3150-3426

INTRON 2
3508-4442

INTRON 3
4562-4735

INTRON 4
4874-5459

INTRON 5
5587-5843

Poly-A Signal
6098-6103

The nucleotide sequence of the 5′-regulatory region including the promoter elements of mammalian INGAP, is shown partially in SEQ ID NO: 1, and completely in SEQ ID NO: 2 and 23 (nucleotides 1-3137 of SEQ ID NO: 2). Nucleotides 1-3120 of SEQ ID NO: 1 are identical to nucleotides 1-3120 of SEQ ID NO: 2 and SEQ ID NO: 23. An overview of the 5′-regulatory region is shown in FIG. 2. Representative transcription enhancer/repressor binding sites are shown also in FIG. 2. Predicted transcription enhancer/repressor binding sites for nucleotides 1-3123 of the 5′-regulatory region are shown in FIG. 3. Table 3 at the end of the specification details these transcription factors and their binding sites, and their locations in the regulatory region. Potential transcription factor binding analysis was done using MatInspector Professional™, which is a bioinformatics software that utilizes a library of matrix descriptions for transcription factor binding sites to locate matches in sequences of unlimited length (Quandt, K., Frech, K., Karas, H., Wingender, E., Werner, T. (1995) Nucleic Acids Res. 23, 4878-4884).

Table 3 lists predicted binding proteins (Further Information) based upon their classification into functionally similar matrix families (Family/matrix). The DNA sequence predicted to bind the protein (Sequence), whether sense or antisense DNA (Str) and location of the sequence in SEQ ID NO: 2, (Position) are listed. Further the similarity to the consecutive highest conserved nucleotides of a matrix (Core sim.) and similarity to all nucleotides in that matrix (Matrix sim.) along with the optimized value (Opt) defined in a way that a minimum number of matches is found in non-regulatory test sequences are also listed. Details to the algorithms used in MatInspector Professional™ is referenced:

OPT: This matrix similarity is the optimized value defined in a way that a minimum number of matches are found in non-regulatory test sequences (i.e. with this matrix similarity the number of false positive matches is minimized). This matrix similarity is used when the user checks “Optimized” as the matrix similarity threshold for MatInspector Professional™.

Family: Each matrix belongs to a so-called matrix family, where functionally similar matrices are grouped together, eliminating redundant matches by MatInspector Professional™ professional (if the family option was selected). E.g. the matrix family V$NFKB includes 5 similar matrices for NFkappaB (V$NFKAPPAB.01, V$NFKAPPAB 0.02, V$NFKAPPAB 0.03, V$NFKAPPAB50.01, V$NFKAPPAB65.01) as well as 1 matrix for the NFkappaB related factor c-Rel (V$CREL.01).

Matrix: The MatInspector Professional™ matrices have an identifier that indicates one of the following seven groups: vertebrates (V$), insects (I$), plants (P$), fungi (F$), nematodes (N$), bacteria (B$), and other functional elements (O$); followed by an acronym for the factor the matrix refers to, and a consecutive number discriminating between different matrices for the same factor. Thus, V$OCT1.02 indicates the second matrix for vertebral Oct-1 factor.

Core Sim: The “core sequence” of a matrix is defined as the (usually 4) consecutive highest conserved positions of the matrix. The core similarity is calculated as described here. The maximum core similarity of 1.0 is only reached when the highest conserved bases of a matrix match exactly in the sequence. More important than the core similarity is the matrix similarity which takes into account all bases over the whole matrix length.

Matrix Sim: The matrix similarity is calculated as described here. A perfect match to the matrix gets a score of 1.00 (each sequence position corresponds to the highest conserved nucleotide at that position in the matrix), a “good” match to the matrix usually has a similarity of >0.80. Mismatches in highly conserved positions of the matrix decrease the matrix similarity more than mismatches in less conserved regions.

Another aspect of the invention provides for a reporter construct. Reporter constructs contain a 5′ regulatory region nucleotide sequence fragment of SEQ ID NO: 23 (e.g., an enhancer and/or repressor binding site containing region), a promoter element (which may or may not be from INGAP regulatory region nucleotide sequence, SEQ ID NO: 23), and a reporter gene. The 5′-regulatory region nucleotide sequence is positioned upstream of the reporter gene. In order to determine the identity of various transcription factors that bind the 5′ regulatory region nucleotide sequence and to elucidate their binding locations within the 5′ regulatory nucleotide sequence of the INGAP gene, the region may be mapped using deletion analysis. One or more fragments of the regulatory region nucleotide sequence may be initially analyzed for their responses to various transcription factor activators. Once, a region of interest is determined, further fine mapping may be carried out where DNA from different locations within the regulatory region could be combined to make a more robust, and responsive reporter construct. DNA sequences, such as INGAP 5′-regulatory region DNA or a fragment thereof, can be manipulated by methods well known in the art. Examples of such techniques include, but are not limited to, polymerase chain reaction (PCR), restriction enzyme endonuclease digestion, ligation, and gene walking. Cloning fragments of DNA, such as 5′-regulatory regions is well known in the art.

Another approach to quantify the expression levels of a gene is to measure transcription of the gene. PCR-ELISA may be used to capture transcripts onto a solid phase using biotin or digoxigenin-labelled primers, oligonucleotide probes (oligoprobes) or directly after incorporation of the digoxigenin into the transcripts (Watzinger, F. and Lion, T. (2001) Nucleic Acids Res., 29, e52). Once captured, the transcripts can be detected using an enzyme-labeled avidin or anti-digoxigenin reporter molecule similar to a standard ELISA format. Another approach is to employ real-time PCR to detect the transcript of the reporter gene (Mackay, I. M. and Nitsche, A., Nucleic Acids Res. 2002 Mar. 15; 30 (6), 1292-305). In real-time PCR fluorogenic nucleotides are used and progress of the transcript is monitored in real-time as the polymerase transcribes the reporter gene.

The promoter element in the reporter construct may or may not be from the same gene as the 5′-regulatory region. As an example, the enhancer/repressor region from the INGAP 5′-regulatory region, or a fragment of the enhancer/repressor region from the INGAP 5′-regulatory region, may be cloned upstream of a heterologous minimal promoter element, e.g., the minimal CMV promoter (Boshart et al., 1985) and the promoters for TK (Nordeen, 1988), IL-2, and MMTV.

Transcription of a gene begins around the minimal promoter. FIG. 4 shows the predicted transcription start sites for mammalian INGAP gene (SEQ ID NO: 2). SEQ ID NO: 2 was analyzed using “Neural Network Promoter Prediction” program designed by Martin Reese to identify eukaryotic promoter recognition elements such as TATA-box, GC-box, CAAT-box, and the transcription start site. These promoter elements are present in various combinations separated by various distances in sequence. The program is available on the Internet and is located at http://www.fruitfly.org/seq_tools/promoter.html.

The reporter construct can be used to identify agents that modulate, either alone or in combination, the expression of INGAP. Some such agents may modulate expression of INGAP by binding to the regulatory region directly while others may regulate expression of transcription factors that bind to the INGAP regulatory region.

The reporter construct can be transfected into a host cell in vitro, or in vivo through the pancreatic duct, either transiently or stably, and a test agent introduced to the assay system. Examples of test agents include, but are not limited to organic and inorganic chemical agents, carbohydrates, proteins, oligonucleotides, cholecystokinin, mechanically induced pressure, and agents which cause a pancreatic duct obstruction. Expression of the reporter gene product can be determined by an assay appropriate for the reporter gene employed. Examples of such assays include, but are not limited to a luminescent assay for β-galactosidase or luciferase, an enzymatic assay for chloramphenicol acetyl transferase, and fluorescence detection for fluorescent proteins. Such assays are well known in the art, and a skilled artisan will be able to select an appropriate assay for the chosen reporter. A test agent is identified as a modulator of INGAP expression if the test agent modulates expression of the reporter gene product. Preferably the level of increase or decrease is at least 50%, 100%, 200%, 500%, or 1000%, but any statistically significant change can be an indicator of modulatory activity. A skilled artisan may also determine reporter gene product expression in untreated cells, and in treated and untreated cells transfected with a promoter-less reporter gene only. Such determinations can be used to determine background levels of expression.

Test agents can also be obtained by fractionating pancreatic secretion fluids. A pancreatic duct obstruction can be used as an exemplary method of harvesting pancreatic secretion fluids. The pancreatic secretion fluids can be fractionated by methods well known in the art. Examples include high-pressure liquid chromatography (HPLC), size exclusion chromatography, hydrophobic interacting columns, and density gradient centrifugation. Individual fractions can be tested for agents that modulate reporter gene expression using a method described herein. The individual fractions can be further fractionated to identify agents that modulate reporter gene expression. The identified test agents can be used to modulate the expression of INGAP.

A host cell can be any cell suitable for transfection and maintenance in a suitable assay system. Examples of suitable cells include, but are not limited to, mammalian cells, human cells, mouse cells, rat cells, monkey cells, dog cells, bovine cells, and porcine cells. Preferably the cells used will be human cells. The cells could be either transformed cells line or primary cells. Whole organ explants may also be used where the regulation may be monitored over many different cell types. Many methods exist in the art for transfecting or infecting cells with reporter construct DNA. Such methods include, but are not limited to, lipofection, electroporation, calcium phosphate precipitation, DEAE dextran, gene guns, and modified viral techniques (e.g., recombinant adenovirus or recombinant retrovirus). The skilled artisan can readily choose a method suitable for use with a given cell type and assay system.

The reporter construct can also be introduced in vivo directly into cells of the pancreas. Examples of methods to introduce the reporter construct into pancreatic cells in vivo include pancreatic duct retrograde perfusion and in vivo electroporation (Mir, 2001). The reporter construct encodes a reporter gene product that is readily measured in vivo. A test agent can be administered systemically or locally, and expression of the reporter gene in vivo can be determined by an assay appropriate for the particular reporter employed. Examples of such include a fluorescence assay for green fluorescent protein.

Methods for identifying agents that modulate INGAP expression can also be accomplished in vitro. The reporter construct can be contacted with a test agent in vitro under conditions sufficient for transcription and/or translation of the reporter gene. Components such as rabbit reticulocyte lysates or wheat germ extracts can be utilized for such a method. Subsequently, the expression level of the reporter gene can be determined as described above utilizing an appropriate assay for a given reporter gene. A test agent is identified as a modulator of INGAP expression if the test agent modulates expression of the reporter gene. Threshold levels of change can be set by the practitioner as discussed above.

A test agent can alternatively be contacted with an isolated and purified INGAP 5′-regulatory DNA molecule and one can determine if the test agent binds to the DNA molecule. Test agents can be a chemical agent, a protein, or a nucleic acid. Appropriate INGAP 5′-regulatory DNA molecules would include nucleotides 1-6586 of SEQ ID NO: 2, the 5′-regulatory region DNA (SEQ ID NO: 1, or SEQ ID NO: 23), or any fragment of the 5′-regulatory region, preferably a fragment which contains one or more enhancer/repressor binding sites. Methods to determine binding of the test agent to the fragment of DNA are well known in the art, e.g., electrophoretic mobility shift assay (EMSA). See for example Sambrook et al., MOLECULAR CLONING: A LABORATORY MANUAL, 2d ed., 1989, at pages 9.50-9.51. Fragments of the 5′-regulatory region can be obtained by methods well known in the art using the disclosed sequence (SEQ ID NO: 2). Examples of such methods include, PCR, restriction enzyme digestion, and chemical synthesis. Any fragment of DNA within the 5′-regulatory region (SEQ ID NO: 1, or 23) can be used. The exact location that an agent binds can be determined for example by utilizing smaller fragments to map precisely the binding site for the test agent. Test agents that bind in the assay can be further tested in other assays that require modulatory activity.

An agent that causes an increase or decrease in reporter gene expression can be used as a modulator of INGAP expression. The modulator can be administered to a mammal in need of such modulation. Examples of mammals that may need INGAP expression modulation are those with reduced pancreatic function, in particular reduced islet cell function. Such mammals include those who have diabetes mellitus, impaired glucose tolerance, impaired fasting glucose, hyperglycemia, obesity, and pancreatic insufficiency.

An agent that is identified as a modulator of INGAP expression can be supplied in a kit to treat diseases associated with reduced islet cell function. The kit would comprise in single or divided containers, in single or divided doses a modulator of INGAP expression. Written instructions may be included for using the modulator of INGAP expression. The instructions may simply refer a reader to another location such as a website or other information source.

Agents that cause an increase in reporter gene expression can be used to increase INGAP expression to treat a disease state related to reduced islet cell function. Agents that cause a decrease in reporter gene expression can be used to decrease INGAP expression to treat a disease state related to hyperactivity of islet cells or a disease where reduced INGAP expression is desirable. Examples of such agents include, but are not limited to, PMA, LIF, interleukin-6, Oncostatin M, and ciliary neurotropic factor. Agents can be administered by any number of routes including, but not limited to, oral, intravenous, intramuscular, intra-arterial, intramedullary, intrathecal, intraventricular, transdermal, subcutaneous, intraperitoneal, intranasal, parenteral, topical, sublingual, rectal, or pancreatic duct retrograde perfusion. Agents for oral administration can be formulated using pharmaceutically acceptable carriers well known in the art in dosages suitable for oral administration. Such carriers enable the pharmaceutical compositions to be formulated as tablets, pills, dragees, capsules, liquids, gels, syrups, slurries, suspensions, and the like, for ingestion by the mammal.

Agents for intravenous, intramuscular, intra-arterial, transdermal, and subcutaneous injections can be formulated using pharmaceutically acceptable carriers well known in the art in dosages suitable for injection into the mammal. Agents for intranasal, topical, and rectal administration can be formulated using pharmaceutically acceptable carriers well known in the art in dosages suitable for surface administration to the mammal. Mammals in need of an increase in INGAP expression include for example, mammals with diabetes mellitus, impaired glucose tolerance, impaired fasting glucose, hyperglycemia, obesity, and pancreatic insufficiency. Mammals in need of a decrease in INGAP expression include for example, mammals with hypoglycemia.

The following examples are offered by way of illustration and do not limit the invention disclosed herein.

EXAMPLES
Example 1
Hamster INGAP Genomic Sequence and Structure

The hamster INGAP genomic sequence and structure was determined by gene walking (Clontech) and DNA sequencing. Gene walking is a method for walking upstream toward a promoter or downstream in genomic DNA from a known sequence, such as cDNA. This method utilizes four uncloned, adapter-ligated genomic fragment libraries. The manufacturer's recommended protocol is followed with one notable exception; hamster genomic DNA was used to create the uncloned, adapter-ligated genomic fragment libraries.

To create uncloned, adapter ligated genomic fragment libraries, genomic DNA was purified from hamster cells. Four separate aliquots were thoroughly digested with PvuII, StuI, DraI, or EcoRV. Following digestion, inactivation of the restriction enzymes, and dephosphorylation, each separate pool of DNA fragments was ligated to an adapter AP1 SEQ ID NO: 489 or AP2 SEQ ID NO: 490, see FIG. 5. The adapter was phosphorylated to provide the requisite phosphate group for a ligation reaction.

Also note that the 3-prime side of the short adapter contains an amine group to prevent the adapters from forming concatamers.

Two gene specific primers (GSP1 and GSP2) were designed for each region of known sequence (i.e., the exons of the INGAP gene). See FIG. 6 for fragment location and GSP1 and GSP2 location. The gene specific primers were designed as reverse PCR primers for all fragments except fragments 1_2 and 14_5. The gene specific primers for fragments 1_2 and 14_5 were designed as forward primers. Adapter primer 1 (AP1) and adapter primer 2 (AP2) (FIG. 5) were forward PCR primers for all fragments except fragments 1_2 and 14_5, which were reverse PCR primers. The outer gene specific primer (GSP1) was used with adapter primer 1 in a PCR reaction. To increase specificity, a second, nested PCR was set up using the inner gene specific primer (GSP2) and adapter primer 2. A small aliquot of the first reaction served as template for the second reaction. Gene specific PCR primers utilized for gene walking are listed in Table 2 and the strategy used to build the INGAP genomic sequence is shown in FIGS. 6 and 7. The arrowheads in FIG. 6 represent the adapter primers (AP1 and AP2), while the circles represent the gene specific primers (GSP1 and GSP2).

TABLE 2

NAME (LOCATION)
SEQUENCE

INGEN 21_3 (1464, 1482)
5′-ACAAGCAATCTAGAGATGG-3′ (SEQ ID NO: 3)

INGEN 19_3 (1401, 1423)
5′-GTTCAGCTATGTTCATAGCAGGG-3′ (SEQ ID NO: 4)

INGEN 16_3 (1855, 1876)
5′-GTCTGTATGACTGTGTGGGAAG-3′ (SEQ ID NO: 5)

INGEN 15_3 (1929, 1948)
5′-GCACTTGAACTCAATGGCTC-3′ (SEQ ID NO: 6)

INGEN 14_3 (2147, 2168)
5′-GAACCACCTGACATGGGTGATG-3′ (SEQ ID NO: 7)

INGEN 13_3 (2177, 2200)
5′-GGGCATCGTATCATCTGGTTACAG-3′ (SEQ ID NO: 8)

INGEN 8_3 (2544, 2565)
5′-GGTTCAAAAAAGCTGCTTCAAC-3′ (SEQ ID NO: 9)

INGEN 7_3 (2666, 2689)
5′-GGAATAGCTGCAATTTATGCCCAT-3′ (SEQ ID NO: 10)

INGEN 4_3 (2833, 2858)
5′-CTTAGGAACATTCAGGCAGCCTCCTG-3′ (SEQ ID NO: 11)

INGEN 3_3 (2866, 2891)
5′-GTTGCCCTCTGCCACGTGTCAAGTTC-3′ (SEQ ID NO: 12)

INGEN 2_3 (3444, 3470)
5′-CATCCAAGACATCCTACAGAGGGTCAT-3′ (SEQ ID NO: 13)

INGEN 1_3 (3475, 3501)
5′-CCCAAGAAAGGAACATCAGGCAGGAAA-3′ (SEQ ID NO: 14)

INGEN 2_2 (3330, 3350)
5′-CCAAATGAGTGCTTCCCTGAA-3′ (SEQ ID NO: 15)

INGEN 1_2 (3241, 3266)
5′-GCAGCACTCTGAAACTCAGTAGAGTT-3′ (SEQ ID NO: 16)

INGEN 14_5 (5544, 5563)
5′-GCTGCTGACCGTGGTTATTG-3′ (SEQ ID NO: 17)

INGEN 13_5 (5463, 5485)
5′-ACACTACCCAACGGAAGTGGATG-3′ (SEQ ID NO: 18)

INGAP1_1L (3475, 3492)
5′-TTTCCTGCCTGATGTTCC-3′ (SEQ ID NO: 19)

INGAP1_1R (5957, 5976)
5′-TCATACTTGCTTCCTTGTCC-3′ (SEQ ID NO: 20)

INGAP2_1L (4470, 4488)
5′-CTTCACGTATAACCTGTCC-3′ (SEQ ID NO: 21)

INGAP2_1R (5905, 5923)
5′-ATTAGAACTGCCCTAGACC-3′ (SEQ ID NO: 22)

The PCR fragments were sequenced to determine the nucleotide sequence of the INGAP 5′-regulatory region, the introns, the intron/exon junctions, and the 3-prime polyadenylation regions. The nucleotide sequence of hamster INGAP genomic DNA is shown in SEQ ID NO: 2.

Example 2
Cloning Hamster INGAP 5′-Regulatory Region Fragment into a Reporter Construct

To construct the INGAP 5′-regulatory region, individual PCR fragments were joined together at unique restriction sites located within two adjoining fragments. FIGS. 6 and 7 detail the strategy used to piece the INGAP 5′-regulatory region together. Fragments 8_3 and 2_3 were joined at a unique SphI site; 14_3 and 8_3 were joined at a unique BbsI site; 16_3 and 14_3 were joined at a unique PstI site. The nucleotide sequence of hamster INGAP 5′-regulatory region DNA is shown in SEQ ID NO: 1 and 23 in the sequence listing.

The hamster INGAP 5′-regulatory region or a fragment of the 5′-regulatory region was cloned into a reporter plasmid, pβGal-Basic (Clontech). The 5′-regulatory region or fragments were cloned utilizing the unique XmaI site from the gene walking adapter primer and a unique BgIII site located at the 3-prime side of the regulatory region. FIG. 8 details the fragments cloned into pβGal-Basic. The sizes of the fragments are indicated to the right of the fragments and are expressed as the number of nucleotides of the fragment.

Example 3
Assay System to Screen for Factors that Modulate the Expression of INGAP

Promoter analysis of INGAP identified a number of potential promoter-proximal regulatory sites including the consensus transcription factor binding sites; cAMP response element (CRE), AP-1 and STAT. Promoter-fragment reporter-gene constructs were transiently transfected into 293T cells and co-transfection of secretory alkaline phosphatase was used to normalize for transfection efficiency.

Reporter constructs containing INGAP 5′-regulatory region fragments 2_3sP (SEQ ID NO: 37), 2_3dP (SEQ ID NO: 38), 2_3pP (SEQ ID NO: 36), 14_3P (SEQ ID NO: 34), 16_3P (SEQ ID NO: 31), or 19_3P (SEQ ID NO: 23) were transfected into human cells. The pβGal-Basic plasmid without the hamster INGAP DNA was also transfected into human cells as a control to measure the level of endogenous reporter activity. Two days following transfection, the cells were treated with PMA for 24 hours or were untreated. To determine the level of promoter activity, the amount of β-galactosidase gene product was determined using a luminescent assay for β-galactosidase. FIG. 9A shows that construct 14_3P activated the INGAP expression the most, followed by 2_3pP, and 16_3P.

Reporter construct containing INGAP 5′-regulatory region DNA nucleotides 2030 to 3120 was transfected into human cells. The pβGal-Basic plasmid without the hamster INGAP DNA was also transfected into human cells as a control to measure the level of endogenous reporter activity. Two days following transfection, the cells were treated with LIF for 24 hours or were untreated. To determine the level of promoter activity, the amount of β-galactosidase gene product was determined using a luminescent assay for β-galactosidase. FIG. 9B shows the results. LIF was determined to increase the activity of the 5′-regulatory region of mammalian INGAP. Forskolin (an activator of cAMP/CREB/CRE) did not modulate gene expression (data not shown).

It is important to note that when present in human cells, the hamster INGAP 5′-regulatory region is transactivated by the human transcription factors. Thus, linked to a reporter gene, the 5′-regulatory region of hamster INGAP creates a sensitive assay system to screen for factors that modulate the expression of INGAP.

Example 4
Determination of Approximate Location of PMA and LIF-Mediated Transcription Factor Binding in the 5′-Regulatory Region

To map the approximate location of PMA-initiated or LIF-initiated transcription factor binding different fragments of the hamster INGAP 5′-regulatory region were cloned into pβGal-Basic. See FIG. 8. The fragments cloned into the reporter construct were 2_3sP (SEQ ID NO: 37), 2_3dP (SEQ ID NO: 38), 2_3pP (SEQ ID NO: 36), 14_3P (SEQ ID NO: 34), 16_3P (SEQ ID NO: 31), or 19_3P (SEQ ID NO: 23). The reporter constructs were transfected into human cells. Two days following transfection, the cells were treated with different concentrations of PMA or LIF for 24 hours. The concentrations of PMA used were 6 ng/ml, 17 ng/ml, 50 ng/ml, 100 ng/ml, or 300 ng/ml. The concentrations of LIF used were 1 ng/ml, 10 ng/ml, or 30 ng/ml. To determine the level of promoter activity, the amount of β-galactosidase gene product was determined using a luminescent assay for β-galactosidase. FIGS. 10 and 11 show the results for PMA and LIF treatment, respectively. Both PMA and LIF activated the cell reporter constructs. The exact location of the DNA contact sites can be narrowed further by cloning smaller fragments of the hamster INGAP 5′-regulatory region and by site directed mutations or deletions.

Example 5
RNA Analysis of INGAP Gene Upregulation

To determine if INGAP RNA levels increase after stimulation with a cytokine that signals through STAT, rat amphocrine pancreatic cells, AR42J were treated with IL-6 (1000 U/ml) for 24 hours. Total RNA was extracted from the treated and untreated cells using techniques well known in the art, e.g., using TRIZOL® reagent.

Equal amounts of total RNA (10 μg) were loaded in 2.5% formaldehyde gel and electrophoresed for 4 hours at 70V with a constant circulation of the buffer using a circulating pump. The gel was photographed and washed with water twice at room temperature and soaked in 20×SSC. The gel was transferred to a nylon membrane (Amersham) in 20×SSC overnight following a standard procedure. The membrane was washed with 20×SSC to remove any agar that might have attached to the membrane and baked for 4 hours at 80° C.

One hundred nanograms of hamster INGAP cDNA was labeled using Random Prime Labeling kit (Roche-BMB) and alpha-P³²dCTP (ICN). Approximately 20 million counts were used for hybridization in 20 ml hybridization buffer following the standard procedure at 42° C. for overnight. The blot was washed as follows: 2-times at room temperature with 2×SSC for 10 minutes each; 2-times at 42° C. with 2×SSC for 10 minutes each; 2-times at 55° C. with 1×SSC for 10 minutes each. The membrane was exposed to the film (XOMAT-Kodak) and kept at −80° C. overnight before developing.

Treatment with IL-6 caused an increase in INGAP gene expression (FIG. 12). These data demonstrate that extracellular factors that elevate AP-1-binding transcription factors and STAT-binding transcription factors are involved in the regulation of INGAP gene expression. These studies suggest that it is feasible to enhance INGAP expression as a means of inducing islet neogenesis.

While particular embodiments of the present invention have been illustrated and described, it would be obvious to those skilled in the art that various other changes and modifications can be made without departing from the spirit and scope of the invention. It is therefore intended to cover in the appended claims all such changes and modifications that are within the scope of this invention.

TABLE 3

SEQ

ID

Further

Position

NO:
Family/matrix
Information
Opt.
from-to
anchor

SEQ
V$LEFF/LEF1.01
TCF/LEF-1,
0.86
12-28
20

ID

involved in the

NO:

Wnt signal

41

transduction

pathway

SEQ
V$MITF/MIT.01
MIT
0.81
22-40
31

ID

(microphthalmia

NO:

transcription

42

factor) and TFE3

SEQ
V$OCT1/OCT1.05
octamer-binding
0.90
27-41
34

ID

factor 1

NO:

43

SEQ
V$TCFF/TCF11.01
TCF11/KCR-
1.00
32-38
35

ID

F1/Nrf1

NO:

homodimers

44

SEQ
V$MYOF/MYOGNF1.01
Myogenin/
0.71
25-53
39

ID

nuclear factor 1

NO:

or related factors

45

SEQ
V$ZBPF/ZBP89.01
Zinc finger
0.93
36-48
42

ID

transcription

NO:

factor ZBP-89

46

SEQ
V$SP1F/GC.01
GC box elements
0.88
38-52
45

ID

NO:

47

SEQ
V$PERO/PPARA.01
PPAR/RXR
0.70
44-64
54

ID

heterodimers

NO:

48

SEQ
V$PAX5/PAX9.01
zebrafish PAX9
0.78
43-71
57

ID

binding sites

NO:

49

SEQ
V$TBPF/ATATA.01
Avian C-type LTR
0.81
68-84
76

ID

TATA box

NO:

50

SEQ
V$HMTB/MTBF.01
muscle-specific
0.90
76-84
80

ID

Mt binding site

NO:

51

SEQ
V$OCT1/OCT1.06
octamer-binding
0.80
74-88
81

ID

factor 1

NO:

52

NO:

(en-1)

65

SEQ
V$BARB/BARBIE.01
barbiturate-
0.88
230-244
237

ID

inducible element

NO:

66

SEQ
V$TBPF/TATA.01
cellular and viral
0.90
230-246
238

ID

TATA box

NO:

elements

67

SEQ
V$BARB/BARBIE.01
barbiturate-
0.88
252-266
259

ID

inducible element

NO:

68

SEQ
V$MYT1/MYT1.01
MyT1 zinc finger
0.75
272-284
278

ID

transcription

NO:

factor involved in

69

primary

neurogenesis

SEQ
V$SMAD/SMAD4.01
Smad4
0.94
304-312
308

ID

transcription

NO:

factor involved in

70

TGF-beta

signaling

SEQ
V$HOXF/CRX.01
Cone-rod
0.94
312-328
320

ID

homeobox-

NO:

containing

71

transcription

factor/otx-like

homeobox gene

SEQ
V$ECAT/NFY.01
nuclear factor Y
0.90
337-351
344

ID

(Y-box binding

NO:

factor)

72

SEQ
V$HOXF/PTX1.01
Pituitary
0.79
337-353
345

ID

Homeobox 1

NO:

(Ptx1)

73

SEQ
V$FKHD/FREAC2.01
Fork head
0.84
362-378
370

ID

RElated

NO:

ACtivator-2

74

SEQ
V$MINI/MUSCLE_INI.02
Muscle Initiator
0.86
401-419
410

ID

Sequence

NO:

75

SEQ
V$MOKF/MOK2.01
Ribonucleoprotein
0.74
409-429
419

ID

associated zinc

NO:

finger protein

76

MOK-2 (mouse)

SEQ
V$ZFIA/ZID.01
zinc finger with
0.85
414-426
420

ID

interaction

NO:

domain

77

SEQ
V$CART/XVENT2.01

Xenopus

0.82
418-434
426

ID

homeodomain

NO:

factor Xvent-2;

78

early BMP

signaling

response

SEQ
V$OCT1/OCT1.04
octamer-binding
0.80
421-435
428

ID

factor 1

NO:

79

SEQ
V$HOMS/S8.01
Binding site for
0.97
426-434
430

ID

S8 type

NO:

homeodomains

80

SEQ
V$NKXH/NKX25.02
homeo domain
0.88
424-436
430

ID

factor Nkx-

NO:

2.5/Csx, tinman

81

homolog low

affinity sites

SEQ
V$CREB/CREBP1.01
cAMP-responsive
0.80
425-445
435

ID

element binding

NO:

protein 1

82

SEQ
V$COMP/COMP1.01
COMP1,
0.76
434-454
444

ID

cooperates with

NO:

myogenic

83

proteins in

multicomponent

complex

SEQ
V$HOXF/HOX1-3.01
Hox-1.3,
0.83
444-460
452

ID

vertebrate

NO:

homeobox

84

protein

SEQ
V$ETSF/GABP.01
GABP: GA
0.85
454-470
462

ID

binding protein

NO:

85

SEQ
V$LEFF/LEF1.01
TCF/LEF-1,
0.86
463-479
471

ID

involved in the

NO:

Wnt signal

86

transduction

pathway

SEQ
V$STAT/STAT6.01
STAT6: signal
0.84
464-482
473

ID

transducer and

NO:

activator of

87

transcription 6

SEQ
V$GATA/GATA1.03
GATA-binding
0.95
490-502
496

ID

factor 1

NO:

88

SEQ
V$SRFF/SRF.01
serum response
0.66
487-505
496

ID

factor

NO:

89

SEQ
V$EVI1/EVI1.04
Ecotropic viral
0.77
493-509
501

ID

integration site 1

NO:

encoded factor

90

SEQ
V$AP4R/TH1E47.01
Thing1/E47
0.93
509-525
517

ID

heterodimer, TH1

NO:

bHLH member

91

specific

expression in a

variety of

embryonic

tissues

SEQ
V$AP4R/TAL1BETAITF2.01
Tal-1beta/ITF-2
0.85
512-528
520

ID

heterodimer

NO:

92

SEQ
V$NEUR/NEUROD1.01
DNA binding site
0.83
514-526
520

ID

for NEUROD1

NO:

(BETA-2/E47

93

dimer)

SEQ
V$MEF2/MEF2.05
MEF2
0.96
518-540
529

ID

NO:

94

SEQ
V$EVI1/EVI1.04
Ecotropic viral
0.77
523-539
531

ID

integration site 1

NO:

encoded factor

95

SEQ
V$MEF2/AMEF2.01
myocyte
0.80
521-543
532

ID

enhancer factor

NO:

96

SEQ
V$TBPF/MTATA.01
Muscle TATA box
0.84
524-540
532

ID

NO:

97

SEQ
V$HOXF/HOX1-3.01
Hox-1.3,
0.83
543-559
551

ID

vertebrate

NO:

homeobox

98

protein

SEQ
V$PDX1/ISL1.01
Pancreatic and
0.82
543-563
553

ID

intestinal lim-

NO:

homeodomain

99

factor

SEQ
V$OCT1/OCT1.05
octamer-binding
0.90
556-570
563

ID

factor 1

NO:

100

SEQ
V$CIZF/NMP4.01
NMP4 (nuclear
0.97
562-572
567

ID

matrix protein 4)/

NO:

CIZ (Cas-

101

interacting zinc

finger protein)

SEQ
V$EVI1/EVI1.01
Ecotropic viral
0.72
569-585
577

ID

integration site 1

NO:

encoded factor

102

SEQ
V$AP1F/AP1.01
AP1 binding site
0.95
582-602
592

ID

NO:

103

SEQ
V$PIT1/PIT1.01
Pit1, GHF-1
0.86
589-599
594

ID

pituitary specific

NO:

pou domain

104

transcription

factor

SEQ
V$AP1F/AP1.01
AP1 binding site
0.95
586-606
596

ID

NO:

105

SEQ
V$VMYB/VMYB.01
v-Myb
0.90
593-603
598

ID

NO:

106

SEQ
V$CIZF/NMP4.01
NMP4 (nuclear
0.97
595-605
600

ID

matrix protein 4)/

NO:

CIZ (Cas-

107

interacting zinc

finger protein)

SEQ
V$GREF/PRE.01
Progesterone
0.84
604-622
613

ID

receptor binding

NO:

site

108

SEQ
V$GKLF/GKLF.01
Gut-enriched
0.91
632-646
639

ID

Krueppel-like

NO:

factor

109

SEQ
V$CIZF/NMP4.01
NMP4 (nuclear
0.97
637-647
642

ID

matrix protein 4)/

NO:

CIZ (Cas-

110

interacting zinc

finger protein)

SEQ
V$NFAT/NFAT.01
Nuclear factor of
0.97
640-650
645

ID

activated T-cells

NO:

111

SEQ
V$MAZF/MAZ.01
Myc associated
0.90
649-661
655

ID

zinc finger

NO:

protein (MAZ)

112

SEQ
V$EGRF/WT1.01
Wilms Tumor
0.88
658-672
665

ID

Suppressor

NO:

113

SEQ
V$ZBPF/ZBP89.01
Zinc finger
0.93
663-675
669

ID

transcription

NO:

factor ZBP-89

114

SEQ
V$IRFF/IRF2.01
interferon
0.80
702-716
709

ID

regulatory factor 2

NO:

115

SEQ
V$BRNF/BRN2.01
POU factor Brn-2
0.91
746-762
754

ID

(N-Oct 3)

NO:

116

SEQ
V$ETSF/PU1.01
Pu.1 (Pu120) Ets-
0.86
746-762
754

ID

like transcription

NO:

factor identified

117

in lymphoid B-

cells

SEQ
V$EVI1/EVI1.04
Ecotropic viral
0.77
750-766
758

ID

integration site 1

NO:

encoded factor

118

SEQ
V$EVI1/EVI1.05
Ecotropic viral
0.80
755-771
763

ID

integration site 1

NO:

encoded factor

119

SEQ
V$ZBPF/ZBP89.01
Zinc finger
0.93
764-776
770

ID

transcription

NO:

factor ZBP-89

120

SEQ
V$FAST/FAST1.01
FAST-1 SMAD
0.81
769-783
776

ID

interacting

NO:

protein

121

SEQ
V$TBPF/TATA.02
Mammalian C-
0.89
771-787
779

ID

type LTR TATA

NO:

box

122

SEQ
V$PAX5/PAX9.01
zebrafish PAX9
0.78
781-809
795

ID

binding sites

NO:

123

SEQ
V$OCT1/OCT.01
Octamer binding
0.79
793-807
800

ID

site (OCT1/OCT2

NO:

consensus)

124

SEQ
V$OCTP/OCT1P.01
octamer-binding
0.86
798-810
804

ID

factor 1, POU-

NO:

specific domain

125

SEQ
V$SRFF/SRF.01
serum response
0.66
797-815
806

ID

factor

NO:

126

SEQ
V$EVI1/EVI1.05
Ecotropic viral
0.80
802-818
810

ID

integration site 1

NO:

encoded factor

127

SEQ
V$CLOX/CDP.01
cut-like
0.75
803-819
811

ID

homeodomain

NO:

protein

128

SEQ
V$EVI1/EVI1.02
Ecotropic viral
0.83
807-823
815

ID

integration site 1

NO:

encoded factor

129

SEQ
V$ECAT/NFY.02
nuclear factor Y
0.91
810-824
817

ID

(Y-box binding

NO:

factor)

130

SEQ
V$HAML/AML3.01
Runt-related
0.84
811-825
818

ID

transcription

NO:

factor 2/CBFA1

131

(core-binding

factor, runt

domain, alpha

subunit 1)

SEQ
V$PCAT/CAAT.01
cellular and viral
0.90
813-823
818

ID

CCAAT box

NO:

132

SEQ
V$GATA/GATA.01
GATA binding site
0.95
818-830
824

ID

(consensus)

NO:

133

SEQ
V$HNF1/HNF1.02
Hepatic nuclear
0.76
818-834
826

ID

factor 1

NO:

134

SEQ
V$HOXT/MEIS1_HOXA9.01
Homeobox
0.79
823-835
829

ID

protein MEIS1

NO:

binding site

135

SEQ
V$ECAT/NFY.01
nuclear factor Y
0.90
837-851
844

ID

(Y-box binding

NO:

factor)

136

SEQ
V$FKHD/FREAC2.01
Fork head
0.84
844-860
852

ID

RElated

NO:

ACtivator-2

137

SEQ
V$EVI1/EVI1.06
Ecotropic viral
0.83
846-862
854

ID

integration site 1

NO:

encoded factor

138

SEQ
V$GATA/GATA1.01
GATA-binding
0.96
853-865
859

ID

factor 1

NO:

139

SEQ
V$PCAT/ACAAT.01
Avian C-type LTR
0.86
856-866
861

ID

CCAAT box

NO:

140

SEQ
V$XBBF/RFX1.01
X-box binding
0.89
909-927
918

ID

protein RFX1

NO:

141

SEQ
V$EBOX/MYCMAX.02
c-Myc/Max
0.92
912-928
920

ID

heterodimer

NO:

142

SEQ
V$MITF/MIT.01
MIT
0.81
911-929
920

ID

(microphthalmia

NO:

transcription

143

factor) and TFE3

SEQ
V$ETSF/PU1.01
Pu.1 (Pu120) Ets-
0.86
927-943
935

ID

like transcription

NO:

factor identified

144

in lymphoid B-

cells

SEQ
V$OCT1/OCT1.06
octamer-binding
0.80
932-946
939

ID

factor 1

NO:

145

SEQ
V$TALE/TGIF.01
TG-interacting
1.00
936-942
939

ID

factor belonging

NO:

to TALE class of

146

homeodomain

factors

SEQ
V$MITF/MIT.01
MIT
0.81
935-953
944

ID

(microphthalmia

NO:

transcription

147

factor) and TFE3

SEQ
V$OCT1/OCT1.04
octamer-binding
0.80
941-955
948

ID

factor 1

NO:

148

SEQ
V$GATA/GATA.01
GATA binding site
0.95
962-974
968

ID

(consensus)

NO:

149

SEQ
V$SRFF/SRF.01
serum response
0.66
968-986
977

ID

factor

NO:

150

SEQ
V$CDXF/CDX2.01
Cdx-2
0.84
970-988
979

ID

mammalian

NO:

caudal related

151

intestinal transcr.

factor

SEQ
V$FKHD/XFD2.01

Xenopus fork
0.89
972-988
980

ID

head domain

NO:

factor 2

152

SEQ
V$MEF2/MEF2.01
myogenic
0.74
970-992
981

ID

enhancer factor 2

NO:

153

SEQ
V$TBPF/TATA.01
cellular and viral
0.90
973-989
981

ID

TATA box

NO:

elements

154

SEQ
V$CART/CART1.01
Cart-1 (cartilage
0.84
978-994
986

ID

homeoprotein 1)

NO:

155

SEQ
V$CART/CART1.01
Cart-1 (cartilage
0.84
985-1001
993

ID

homeoprotein 1)

NO:

156

SEQ
V$SATB/SATB1.01
Special AT-rich
0.93
985-1001
993

ID

sequence-binding

NO:

protein 1,

157

predominantly

expressed in

thymocytes,

binds to matrix

attachment

regions (MARs)

SEQ
V$BRNF/BRN3.01
POU transcription
0.78
987-1003
995

ID

factor Brn-3

NO:

158

SEQ
V$CLOX/CDP.01
cut-like
0.75
987-1003
995

ID

homeodomain

NO:

protein

159

SEQ
V$HOMS/S8.01
Binding site for
0.97
992-1000
996

ID

S8 type

NO:

homeodomains

160

SEQ
V$NKXH/DLX1.01
DLX-1, -2, and -5
0.91
990-1002
996

ID

binding sites

NO:

161

SEQ
V$HOXF/HOX1-3.01
Hox-1.3,
0.83
989-1005
997

ID

vertebrate

NO:

homeobox

162

protein

SEQ
V$PDX1/PDX1.01
Pdx1 (IDX1/IPF1)
0.74
988-1008
998

ID

pancreatic and

NO:

intestinal

163

homeodomain TF

SEQ
V$FKHD/XFD3.01

Xenopus fork
0.82
998-1014
1006

ID

head domain

NO:

factor 3

164

SEQ
V$HNF1/HNF1.01
hepatic nuclear
0.78
1000-1016
1008

ID

factor 1

NO:

165

SEQ
V$HNF1/HNF1.01
hepatic nuclear
0.78
1002-1018
1010

ID

factor 1

NO:

166

SEQ
V$PAX4/PAX4.01
Pax-4 paired
0.97
1005-1015
1010

ID

domain protein,

NO:

together with

167

PAX-6 involved in

pancreatic

development

SEQ
V$HOMS/S8.01
Binding site for
0.97
1007-1015
1011

ID

S8 type

NO:

homeodomains

168

SEQ
V$HOXF/HOX1-3.01
Hox-1.3,
0.83
1003-1019
1011

ID

vertebrate

NO:

homeobox

169

protein

SEQ
V$NKXH/DLX1.01
DLX-1, -2, and -5
0.91
1005-1017
1011

ID

binding sites

NO:

170

SEQ
V$RBIT/BRIGHT.01
Bright, B cell
0.92
1005-1017
1011

ID

regulator of IgH

NO:

transcription

171

SEQ
V$TBPF/ATATA.01
Avian C-type LTR
0.81
1005-1021
1013

ID

TATA box

NO:

172

SEQ
V$CREB/CREBP1.01
cAMP-responsive
0.80
1004-1024
1014

ID

element binding

NO:

protein 1

173

SEQ
V$RORA/RORA2.01
RAR-related
0.82
1007-1023
1015

ID

orphan receptor

NO:

alpha2

174

SEQ
V$PCAT/CAAT.01
cellular and viral
0.90
1022-1032
1027

ID

CCAAT box

NO:

175

SEQ
V$NKXH/NKX25.02
homeo domain
0.88
1022-1034
1028

ID

factor Nkx-

NO:

2.5/Csx, tinman

176

homolog low

affinity sites

SEQ
V$CREB/HLF.01
hepatic leukemia
0.84
1022-1042
1032

ID

factor

NO:

177

SEQ
V$HOXF/HOX1-3.01
Hox-1.3,
0.83
1056-1072
1064

ID

vertebrate

NO:

homeobox

178

protein

SEQ
V$HOMS/S8.01
Binding site for
0.97
1061-1069
1065

ID

S8 type

NO:

homeodomains

179

SEQ
V$NKXH/DLX1.01
DLX-1, -2, and -5
0.91
1059-1071
1065

ID

binding sites

NO:

180

SEQ
V$RBIT/BRIGHT.01
Bright, B cell
0.92
1059-1071
1065

ID

regulator of IgH

NO:

transcription

181

SEQ
V$BRNF/BRN2.01
POU factor Brn-2
0.91
1058-1074
1066

ID

(N-Oct 3)

NO:

182

SEQ
V$OCT1/OCT1.06
octamer-binding
0.80
1060-1074
1067

ID

factor 1

NO:

183

SEQ
V$HOXF/HOX1-3.01
Hox-1.3,
0.83
1061-1077
1069

ID

vertebrate

NO:

homeobox

184

protein

SEQ
V$OCT1/OCT1.06
octamer-binding
0.80
1079-1093
1086

ID

factor 1

NO:

185

SEQ
V$FAST/FAST1.01
FAST-1 SMAD
0.81
1080-1094
1087

ID

interacting

NO:

protein

186

SEQ
V$RREB/RREB1.01
Ras-responsive
0.79
1081-1095
1088

ID

element binding

NO:

protein 1

187

SEQ
V$E2FF/E2F.02
E2F, involved in
0.84
1085-1099
1092

ID

cell cycle

NO:

regulation,

188

interacts with Rb

p107 protein

SEQ
V$CREB/TAXCREB.01
Tax/CREB
0.81
1091-1111
1101

ID

complex

NO:

189

SEQ
V$AP1F/VMAF.01
v-Maf
0.82
1092-1112
1102

ID

NO:

190

SEQ
V$MYT1/MYT1.01
MyT1 zinc finger
0.75
1123-1135
1129

ID

transcription

NO:

factor involved in

191

primary

neurogenesis

SEQ
V$CLOX/CLOX.01
Clox
0.81
1136-1152
1144

ID

NO:

192

SEQ
V$HNF4/HNF4.01
Hepatic nuclear
0.82
1156-1172
1164

ID

factor 4

NO:

193

SEQ
V$LEFF/LEF1.01
TCF/LEF-1,
0.86
1157-1173
1165

ID

involved in the

NO:

Wnt signal

194

transduction

pathway

SEQ
V$PERO/PPARA.01
PPAR/RXR
0.70
1157-1177
1167

ID

heterodimers

NO:

195

SEQ
V$CLOX/CLOX.01
Clox
0.81
1173-1189
1181

ID

NO:

196

SEQ
V$HNF6/HNF6.01
Liver enriched
0.82
1175-1189
1182

ID

Cut -

NO:

Homeodomain

197

transcription

factor HNF6

(ONECUT)

SEQ
V$SRFF/SRF.02
serum response
0.83
1177-1195
1186

ID

factor

NO:

198

SEQ
V$CLOX/CDPCR3.01
cut-like
0.75
1180-1196
1188

ID

homeodomain

NO:

protein

199

SEQ
V$PIT1/PIT1.01
Pit1, GHF-1
0.86
1186-1196
1191

ID

pituitary specific

NO:

pou domain

200

transcription

factor

SEQ
V$HMTB/MTBF.01
muscle-specific
0.90
1196-1204
1200

ID

Mt binding site

NO:

201

SEQ
V$FKHD/HFH8.01
HNF-3/Fkh
0.92
1200-1216
1208

ID

Homolog-8

NO:

202

SEQ
V$E4FF/E4F.01
GLI-Krueppel-
0.82
1223-1235
1229

ID

related

NO:

transcription

203

factor, regulator

of adenovirus E4

promoter

SEQ
V$CREB/HLF.01
hepatic leukemia
0.84
1221-1241
1231

ID

factor

NO:

204

SEQ
V$VBPF/VBP.01
PAR-type chicken
0.86
1226-1236
1231

ID

vitellogenin

NO:

promoter-binding

205

protein

SEQ
V$OCT1/OCT.01
Octamer binding
0.79
1259-1273
1266

ID

site (OCT1/OCT2

NO:

consensus)

206

SEQ
V$STAT/STAT6.01
STAT6: signal
0.84
1261-1279
1270

ID

transducer and

NO:

activator of

207

transcription 6

SEQ
V$CDXF/CDX2.01
Cdx-2
0.84
1270-1288
1279

ID

mammalian

NO:

caudal related

208

intestinal transcr.

factor

SEQ
V$SORY/SOX9.01
SOX (SRY-related
0.90
1280-1296
1288

ID

HMG box)

NO:

209

SEQ
V$FKHD/HFH2.01
HNF-3/Fkh
0.93
1285-1301
1293

ID

Homolog 2

NO:

210

SEQ
V$CDXF/CDX2.01
Cdx-2
0.84
1286-1304
1295

ID

mammalian

NO:

caudal related

211

intestinal transcr.

factor

SEQ
V$OCTB/TST1.01
POU-factor Tst-
0.87
1288-1302
1295

ID

1/Oct-6

NO:

212

SEQ
V$PDX1/ISL1.01
Pancreatic and
0.82
1298-1318
1308

ID

intestinal lim-

NO:

homeodomain

213

factor

SEQ
V$SORY/SOX9.01
SOX (SRY-related
0.90
1308-1324
1316

ID

HMG box)

NO:

214

SEQ
V$CREB/HLF.01
hepatic leukemia
0.84
1310-1330
1320

ID

factor

NO:

215

SEQ
V$VBPF/VBP.01
PAR-type chicken
0.86
1315-1325
1320

ID

vitellogenin

NO:

promoter-binding

216

protein

SEQ
V$CEBP/CEBPB.01
CCAAT/enhancer
0.94
1313-1331
1322

ID

binding protein

NO:

beta

217

SEQ
V$PDX1/ISL1.01
Pancreatic and
0.82
1313-1333
1323

ID

intestinal lim-

NO:

homeodomain

218

factor

SEQ
V$HAML/AML1.01
runt-factor AML-1
1.00
1323-1337
1330

ID

NO:

219

SEQ
V$GREF/ARE.01
Androgene
0.80
1323-1341
1332

ID

receptor binding

NO:

site

220

SEQ
V$TEAF/TEF1.01
TEF-1 related
0.84
1343-1355
1349

ID

muscle factor

NO:

221

SEQ
V$CMYB/CMYB.01
c-Myb, important
0.99
1352-1360
1356

ID

in hematopoesis,

NO:

cellular

222

equivalent to

avian

myoblastosis

virus oncogene v-

myb

SEQ
V$AP4R/TH1E47.01
Thing1/E47
0.93
1378-1394
1386

ID

heterodimer, TH1

NO:

bHLH member

223

specific

expression in a

variety of

embryonic

tissues

SEQ
V$CP2F/CP2.01
CP2
0.90
1384-1394
1389

ID

NO:

224

SEQ
V$CHOP/CHOP.01
heterodimers of
0.90
1386-1398
1392

ID

CHOP and

NO:

C/EBPalpha

225

SEQ
V$CEBP/CEBP.02
C/EBP binding
0.85
1385-1403
1394

ID

site

NO:

226

SEQ
V$MEF2/HMEF2.01
myocyte
0.76
1384-1406
1395

ID

enhancer factor

NO:

227

SEQ
V$OCT1/OCT1.03
octamer-binding
0.85
1388-1402
1395

ID

factor 1

NO:

228

SEQ
V$HMTB/MTBF.01
muscle-specific
0.90
1394-1402
1398

ID

Mt binding site

NO:

229

SEQ
V$CLOX/CDPCR3.01
cut-like
0.75
1422-1438
1430

ID

homeodomain

NO:

protein

230

SEQ
V$OCT1/OCT1.05
octamer-binding
0.90
1423-1437
1430

ID

factor 1

NO:

231

SEQ
V$HOXF/HOX1-3.01
Hox-1.3,
0.83
1423-1439
1431

ID

vertebrate

NO:

homeobox

232

protein

SEQ
V$PDX1/PDX1.01
Pdx1 (IDX1/IPF1)
0.74
1423-1443
1433

ID

pancreatic and

NO:

intestinal

233

homeodomain TF

SEQ
V$SORY/SOX5.01
Sox-5
0.87
1426-1442
1434

ID

NO:

234

SEQ
V$OCT1/OCT1.05
octamer-binding
0.90
1444-1458
1451

ID

factor 1

NO:

235

SEQ
V$CREB/E4BP4.01
E4BP4, bZIP
0.80
1443-1463
1453

ID

domain,

NO:

transcriptional

236

repressor

SEQ
V$VBPF/VBP.01
PAR-type chicken
0.86
1449-1459
1454

ID

vitellogenin

NO:

promoter-binding

237

protein

SEQ
V$TBPF/MTATA.01
Muscle TATA box
0.84
1455-1471
1463

ID

NO:

238

SEQ
V$PBXF/PBX1.01
homeo domain
0.78
1469-1481
1475

ID

factor Pbx-1

NO:

239

SEQ
V$COMP/COMP1.01
COMP1,
0.76
1467-1487
1477

ID

cooperates with

NO:

myogenic

240

proteins in

multicomponent

complex

SEQ
V$SORY/S0X5.01
Sox-5
0.87
1478-1494
1486

ID

NO:

241

SEQ
V$FKHD/FREAC2.01
Fork head
0.84
1485-1501
1493

ID

RElated

NO:

ACtivator-2

242

SEQ
V$PDX1/ISL1.01
Pancreatic and
0.82
1495-1515
1505

ID

intestinal lim-

NO:

homeodomain

243

factor

SEQ
V$HOXF/HOX1-3.01
Hox-1.3,
0.83
1499-1515
1507

ID

vertebrate

NO:

homeobox

244

protein

SEQ
V$PDX1/PDX1.01
Pdx1 (IDX1/IPF1)
0.74
1498-1518
1508

ID

pancreatic and

NO:

intestinal

245

homeodomain TF

SEQ
V$CART/XVENT2.01

Xenopus

0.82
1502-1518
1510

ID

homeodomain

NO:

factor Xvent-2;

246

early BMP

signaling

response

SEQ
V$CDXF/CDX2.01
Cdx-2
0.84
1507-1525
1516

ID

mammalian

NO:

caudal related

247

intestinal transcr.

factor

SEQ
V$MEF2/MEF2.05
MEF2
0.96
1505-1527
1516

ID

NO:

248

SEQ
V$HNF1/HNF1.01
hepatic nuclear
0.78
1510-1526
1518

ID

factor 1

NO:

249

SEQ
V$OCT1/OCT1.06
octamer-binding
0.80
1511-1525
1518

ID

factor 1

NO:

250

SEQ
V$TBPF/TATA.02
Mammalian C-
0.89
1510-1526
1518

ID

type LTR TATA

NO:

box

251

SEQ
V$NKXH/MSX.01
Homeodomain
0.97
1514-1526
1520

ID

proteins MSX-1

NO:

and MSX-2

252

SEQ
V$RBIT/BRIGHT.01
Bright, B cell
0.92
1515-1527
1521

ID

regulator of IgH

NO:

transcription

253

SEQ
V$MEF2/AMEF2.01
myocyte
0.80
1514-1536
1525

ID

enhancer factor

NO:

254

SEQ
V$EVI1/EVI1.02
Ecotropic viral
0.83
1526-1542
1534

ID

integration site 1

NO:

encoded factor

255

SEQ
V$GATA/GATA1.02
GATA-binding
0.99
1528-1540
1534

ID

factor 1

NO:

256

SEQ
V$GATA/GATA3.02
GATA-binding
0.91
1537-1549
1543

ID

factor 3

NO:

257

SEQ
V$GATA/GATA3.02
GATA-binding
0.91
1559-1571
1565

ID

factor 3

NO:

258

SEQ
V$OCT1/OCT1.02
octamer-binding
0.82
1561-1575
1568

ID

factor 1

NO:

259

SEQ
V$CEBP/CEBPB.01
CCAAT/enhancer
0.94
1567-1585
1576

ID

binding protein

NO:

beta

260

SEQ
V$PLZF/PLZF.01
Promyelocytic
0.86
1574-1588
1581

ID

leukemia zink

NO:

finger (TF with

261

nine Krueppel-

like zink fingers)

SEQ
V$PAX3/PAX3.01
Pax-3 paired
0.76
1587-1599
1593

ID

domain protein,

NO:

expressed in

262

embryogenesis,

mutations

correlate to

Waardenburg

Syndrome

SEQ
V$CREB/ATF.01
activating
0.90
1588-1608
1598

ID

transcription

NO:

factor

263

SEQ
V$AP4R/TH1E47.01
Thing1/E47
0.93
1614-1630
1622

ID

heterodimer, TH1

NO:

bHLH member

264

specific

expression in a

variety of

embryonic

tissues

SEQ
V$NKXH/MSX.01
Homeodomain
0.97
1619-1631
1625

ID

proteins MSX-1

NO:

and MSX-2

265

SEQ
V$RBIT/BRIGHT.01
Bright, B cell
0.92
1620-1632
1626

ID

regulator of IgH

NO:

transcription

266

SEQ
V$OCTB/TST1.01
POU-factor Tst-
0.87
1620-1634
1627

ID

1/Oct-6

NO:

267

SEQ
V$NKXH/DLX3.01
Distal-less 3
0.91
1628-1640
1634

ID

homeodomain

NO:

transcription

268

factor

SEQ
V$GREF/PRE.01
Progesterone
0.84
1628-1646
1637

ID

receptor binding

NO:

site

269

SEQ
V$TBPF/TATA.01
cellular and viral
0.90
1636-1652
1644

ID

TATA box

NO:

elements

270

SEQ
V$FKHD/XFD2.01

Xenopus fork
0.89
1637-1653
1645

ID

head domain

NO:

factor 2

271

SEQ
V$TBPF/TATA.01
cellular and viral
0.90
1638-1654
1646

ID

TATA box

NO:

elements

272

SEQ
V$CREB/E4BP4.01
E4BP4, bZIP
0.80
1638-1658
1648

ID

domain,

NO:

transcriptional

273

repressor

SEQ
V$PDX1/ISL1.01
Pancreatic and
0.82
1644-1664
1654

ID

intestinal lim-

NO:

homeodomain

274

factor

SEQ
V$COMP/COMP1.01
COMP1,
0.76
1648-1668
1658

ID

cooperates with

NO:

myogenic

275

proteins in

multicomponent

complex

SEQ
V$TBPF/TATA.02
Mammalian C-
0.89
1658-1674
1666

ID

type LTR TATA

NO:

box

276

SEQ
V$IRFF/ISRE.01
interferon-
0.81
1662-1676
1669

ID

stimulated

NO:

response element

277

SEQ
V$XBBF/RFX1.01
X-box binding
0.89
1660-1678
1669

ID

protein RFX1

NO:

278

SEQ
V$MYT1/MYT1.02
MyT1 zinc finger
0.88
1667-1679
1673

ID

transcription

NO:

factor involved in

279

primary

neurogenesis

SEQ
V$OCT1/OCT1.06
octamer-binding
0.80
1683-1697
1690

ID

factor 1

NO:

280

SEQ
V$AP1F/TCF11MAFG.01
TCF11/MafG
0.81
1681-1701
1691

ID

heterodimers,

NO:

binding to

281

subclass of AP1

sites

SEQ
V$NKXH/MSX2.01
Muscle segment
0.95
1687-1699
1693

ID

homeo box 2,

NO:

homologue of

282

Drosophila (HOX

8)

SEQ
V$FAST/FAST1.01
FAST-1 SMAD
0.81
1687-1701
1694

ID

interacting

NO:

protein

283

SEQ
V$PBXC/PBX1_MEIS1.03
Binding site for a
0.76
1686-1702
1694

ID

Pbx1/Meis1

NO:

heterodimer

284

SEQ
V$CIZF/NMP4.01
NMP4 (nuclear
0.97
1699-1709
1704

ID

matrix protein 4)/

NO:

CIZ (Cas-

285

interacting zinc

finger protein)

SEQ
V$STAT/STAT6.01
STAT6: signal
0.84
1702-1720
1711

ID

transducer and

NO:

activator of

286

transcription 6

SEQ
V$AP4R/TAL1BETAE47.01
Tal-1beta/E47
0.87
1710-1726
1718

ID

heterodimer

NO:

287

SEQ
V$SORY/HMGIY.01
HMGI(Y) high-
0.92
1720-1736
1728

ID

mobility-group

NO:

protein I (Y),

288

architectural

transcription

factor organizing

the framework of

a nuclear protein-

DNA

transcriptional

complex

SEQ
V$MYT1/MYT1.01
MyT1 zinc finger
0.75
1723-1735
1729

ID

transcription

NO:

factor involved in

289

primary

neurogenesis

SEQ
V$SRFF/SRF.01
serum response
0.66
1728-1746
1737

ID

factor

NO:

290

SEQ
V$HOXF/HOXA9.01
Member of the
0.87
1731-1747
1739

ID

vertebrate HOX -

NO:

cluster of

291

homeobox factors

SEQ
V$HOXT/MEIS1_HOXA9.01
Homeobox
0.79
1734-1746
1740

ID

protein MEIS1

NO:

binding site

292

SEQ
V$PIT1/PIT1.01
Pit1, GHF-1
0.86
1737-1747
1742

ID

pituitary specific

NO:

pou domain

293

transcription

factor

SEQ
V$AP1F/AP1.01
AP1 binding site
0.95
1734-1754
1744

ID

NO:

294

SEQ
V$VBPF/VBP.01
PAR-type chicken
0.86
1746-1756
1751

ID

vitellogenin

NO:

promoter-binding

295

protein

SEQ
V$FAST/FAST1.01
FAST-1 SMAD
0.81
1757-1771
1764

ID

interacting

NO:

protein

296

SEQ
V$HOXF/EN1.01
Homeobox
0.77
1759-1775
1767

ID

protein engrailed

NO:

(en-1)

297

SEQ
V$TBPF/MTATA.01
Muscle TATA box
0.84
1763-1779
1771

ID

NO:

298

SEQ
V$ETSF/ETS2.01
c-Ets-2 binding
0.86
1774-1790
1782

ID

site

NO:

299

SEQ
V$MYT1/MYT1.02
MyT1 zinc finger
0.88
1780-1792
1786

ID

transcription

NO:

factor involved in

300

primary

neurogenesis

SEQ
V$GFI1/GFI1.01
Growth factor
0.97
1782-1796
1789

ID

independence 1

NO:

zinc finger

301

protein acts as

transcriptional

repressor

SEQ
V$TBPF/TATA.01
cellular and viral
0.90
1784-1800
1792

ID

TATA box

NO:

elements

302

SEQ
V$BRNF/BRN2.01
POU factor Brn-2
0.91
1786-1802
1794

ID

(N-Oct 3)

NO:

303

SEQ
V$HOXT/MEIS1_HOXA9.01
Homeobox
0.79
1788-1800
1794

ID

protein MEIS1

NO:

binding site

304

SEQ
V$MEF2/AMEF2.01
myocyte
0.80
1783-1805
1794

ID

enhancer factor

NO:

305

SEQ
V$OCTB/TST1.01
POU-factor Tst-
0.87
1787-1801
1794

ID

1/Oct-6

NO:

306

SEQ
V$HOXF/HOXA9.01
Member of the
0.87
1787-1803
1795

ID

vertebrate HOX-

NO:

cluster of

307

homeobox factors

SEQ
V$BRNF/BRN2.01
POU factor Brn-2
0.91
1788-1804
1796

ID

(N-Oct 3)

NO:

308

SEQ
V$PARF/DBP.01
Albumin D-box
0.84
1791-1805
1798

ID

binding protein

NO:

309

SEQ
V$OCT1/OCT1.02
octamer-binding
0.82
1795-1809
1802

ID

factor 1

NO:

310

SEQ
V$FKHD/FREAC2.01
Fork head
0.84
1816-1832
1824

ID

RElated

NO:

ACtivator-2

311

SEQ
V$SORY/SOX5.01
Sox-5
0.87
1821-1837
1829

ID

NO:

312

SEQ
V$AREB/AREB6.04
AREB6 (Atp1a1
0.98
1837-1849
1843

ID

regulatory

NO:

element binding

313

factor 6)

SEQ
V$MYT1/MYT1.02
MyT1 zinc finger
0.88
1848-1860
1854

ID

transcription

NO:

factor involved in

314

primary

neurogenesis

SEQ
V$RBPF/RBPJK.01
Mammalian
0.84
1851-1865
1858

ID

transcriptional

NO:

repressor RBP-

315

Jkappa/CBF1

SEQ
V$OCT1/OCT1.02
octamer-binding
0.82
1875-1889
1882

ID

factor 1

NO:

316

SEQ
V$FKHD/FREAC4.01
Fork head
0.78
1875-1891
1883

ID

RElated

NO:

ACtivator-4

317

SEQ
V$EBOX/MYCMAX.02
c-Myc/Max
0.92
1880-1896
1888

ID

heterodimer

NO:

318

SEQ
V$PAX6/PAX6.01
Pax-6 paired
0.75
1880-1898
1889

ID

domain protein

NO:

319

SEQ
V$IRFF/IRF3.01
Interferon
0.86
1891-1905
1898

ID

regulatory factor

NO:

3 (IRF-3)

320

SEQ
V$HNF1/HNF1.02
Hepatic nuclear
0.76
1895-1911
1903

ID

factor 1

NO:

321

SEQ
V$FKHD/FREAC2.01
Fork head
0.84
1898-1914
1906

ID

RElated

NO:

ACtivator-2

322

SEQ
V$E4FF/E4F.01
GLI-Krueppel-
0.82
1902-1914
1908

ID

related

NO:

transcription

323

factor, regulator

of adenovirus E4

promoter

SEQ
V$CREB/CREBP1.01
cAMP-responsive
0.80
1900-1920
1910

ID

element binding

NO:

protein 1

324

SEQ
V$VBPF/VBP.01
PAR-type chicken
0.86
1905-1915
1910

ID

vitellogenin

NO:

promoter-binding

325

protein

SEQ
V$MYT1/MYT1.01
MyT1 zinc finger
0.75
1912-1924
1918

ID

transcription

NO:

factor involved in

326

primary

neurogenesis

SEQ
V$HNF1/HNF1.01
hepatic nuclear
0.78
1913-1929
1921

ID

factor 1

NO:

327

SEQ
V$PCAT/CAAT.01
cellular and viral
0.90
1928-1938
1933

ID

CCAAT box

NO:

328

SEQ
V$HNF6/HNF6.01
Liver enriched
0.82
1929-1943
1936

ID

Cut-

NO:

Homeodomain

329

transcription

factor HNF6

(ONECUT)

SEQ
V$PXRF/PXRCAR.01
Halfsite of PXR
0.98
1935-1945
1940

ID

(pregnane X

NO:

receptor)/RXR

330

resp. CAR

(constitutive

androstane

receptor)/RXR

heterodimer

binding site

SEQ
V$RARF/RTR.01
Retinoid
0.81
1934-1952
1943

ID

receptor-related

NO:

testis-associated

331

receptor

(GCNF/RTR)

SEQ
V$HOXF/EN1.01
Homeobox
0.77
1936-1952
1944

ID

protein engrailed

NO:

(en-1)

332

SEQ
V$NKXH/NKX25.01
homeo domain
1.00
1939-1951
1945

ID

factor Nkx-

NO:

2.5/Csx, tinman

333

homolog, high

affinity sites

SEQ
V$GATA/GATA3.02
GATA-binding
0.91
1953-1965
1959

ID

factor 3

NO:

334

SEQ
V$TBPF/TATA.01
cellular and viral
0.90
1968-1984
1976

ID

TATA box

NO:

elements

335

SEQ
V$SRFF/SRF.01
serum response
0.66
1969-1987
1978

ID

factor

NO:

336

SEQ
V$CLOX/CDPCR3.01
cut-like
0.75
1972-1988
1980

ID

homeodomain

NO:

protein

337

SEQ
V$PAX1/PAX1.01
Pax1 paired
0.61
2016-2034
2025

ID

domain protein,

NO:

expressed in the

338

developing

vertebral column

of mouse

embryos

SEQ
V$TBPF/ATATA.01
Avian C-type LTR
0.81
2019-2035
2027

ID

TATA box

NO:

339

SEQ
V$GFI1/GfI1B.01
Growth factor
0.82
2021-2035
2028

ID

independence 1

NO:

zinc finger

340

protein Gfi-1B

SEQ
V$NRSF/NRSF.01
neuron-restrictive
0.69
2025-2045
2035

ID

silencer factor

NO:

341

SEQ
V$NFAT/NFAT.01
Nuclear factor of
0.97
2033-2043
2038

ID

activated T-cells

NO:

342

SEQ
V$AREB/AREB6.04
AREB6 (Atp1a1
0.98
2034-2046
2040

ID

regulatory

NO:

element binding

343

factor 6)

SEQ
V$HNF1/HNF1.01
hepatic nuclear
0.78
2036-2052
2044

ID

factor 1

NO:

344

SEQ
V$FKHD/XFD3.01

Xenopus fork
0.82
2038-2054
2046

ID

head domain

NO:

factor 3

345

SEQ
V$PDX1/PDX1.01
Pdx1 (IDX1/IPF1)
0.74
2036-2056
2046

ID

pancreatic and

NO:

intestinal

346

homeodomain TF

SEQ
V$OCT1/OCT1.01
octamer-binding
0.77
2050-2064
2057

ID

factor 1

NO:

347

SEQ
V$TBPF/TATA.01
cellular and viral
0.90
2053-2069
2061

ID

TATA box

NO:

elements

348

SEQ
V$ETSF/GABP.01
GABP: GA
0.85
2080-2096
2088

ID

binding protein

NO:

349

SEQ
V$BEL1/BEL1.01
Bel-1 similar
0.78
2083-2105
2094

ID

region (defined in

NO:

Lentivirus LTRs)

350

SEQ
V$VMYB/VMYB.01
v-Myb
0.90
2097-2107
2102

ID

NO:

351

SEQ
V$GREF/ARE.01
Androgene
0.80
2106-2124
2115

ID

receptor binding

NO:

site

352

SEQ
V$PDX1/PDX1.01
Pdx1 (IDX1/IPF1)
0.74
2137-2157
2147

ID

pancreatic and

NO:

intestinal

353

homeodomain TF

SEQ
V$MYOD/MYOD.02
myoblast
0.98
2154-2168
2161

ID

determining

NO:

factor

354

SEQ
V$GATA/GATA1.03
GATA-binding
0.95
2169-2181
2175

ID

factor 1

NO:

355

SEQ
V$AP4R/TAL1BETAE47.01
Tal-1beta/E47
0.87
2179-2195
2187

ID

heterodimer

NO:

356

SEQ
V$OAZF/ROAZ.01
Rat C2H2 Zn
0.73
2204-2220
2212

ID

finger protein

NO:

involved in

357

olfactory

neuronal

differentiation

SEQ
V$GATA/GATA1.01
GATA-binding
0.96
2217-2229
2223

ID

factor 1

NO:

358

SEQ
V$MYOD/E47.02
TAL1/E47 dimers
0.93
2220-2234
2227

ID

NO:

359

SEQ
V$LTUP/TAACC.01
Lentiviral TATA
0.71
2225-2247
2236

ID

upstream

NO:

element

360

SEQ
V$RREB/RREB1.01
Ras-responsive
0.79
2239-2253
2246

ID

element binding

NO:

protein 1

361

SEQ
V$OCT1/OCT1.05
octamer-binding
0.90
2251-2265
2258

ID

factor 1

NO:

362

SEQ
V$OCT1/OCT1.02
octamer-binding
0.82
2282-2296
2289

ID

factor 1

NO:

363

SEQ
V$COUP/COUP.01
COUP
0.81
2284-2298
2291

ID

antagonizes HNF-

NO:

4 by binding site

364

competition or

synergizes by

direct protein -

protein

interaction with

HNF-4

SEQ
V$MEF2/MEF2.01
myogenic
0.74
2290-2312
2301

ID

enhancer factor 2

NO:

365

SEQ
V$CDXF/CDX2.01
Cdx-2
0.84
2296-2314
2305

ID

mammalian

NO:

caudal related

366

intestinal transcr.

factor

SEQ
V$MYT1/MYT1.01
MyT1 zinc finger
0.75
2301-2313
2307

ID

transcription

NO:

factor involved in

367

primary

neurogenesis

SEQ
V$NFAT/NFAT.01
Nuclear factor of
0.97
2314-2324
2319

ID

activated T-cells

NO:

368

SEQ
V$CIZF/NMP4.01
NMP4 (nuclear
0.97
2317-2327
2322

ID

matrix protein 4)/

NO:

CIZ (Cas-

369

interacting zinc

finger protein)

SEQ
V$GATA/GATA3.02
GATA-binding
0.91
2326-2338
2332

ID

factor 3

NO:

370

SEQ
V$HMTB/MTBF.01
muscle-specific
0.90
2351-2359
2355

ID

Mt binding site

NO:

371

SEQ
V$NOLF/OLF1.01
olfactory neuron-
0.82
2350-2372
2361

ID

specific factor

NO:

372

SEQ
V$PDX1/PDX1.01
Pdx1 (IDX1/IPF1)
0.74
2363-2383
2373

ID

pancreatic and

NO:

intestinal

373

homeodomain TF

SEQ
V$GATA/GATA3.02
GATA-binding
0.91
2395-2407
2401

ID

factor 3

NO:

374

SEQ
V$NFAT/NFAT.01
Nuclear factor of
0.97
2406-2416
2411

ID

activated T-cells

NO:

375

SEQ
V$OCTP/OCT1P.01
octamer-binding
0.86
2433-2445
2439

ID

factor 1, POU-

NO:

specific domain

376

SEQ
V$MITF/MIT.01
MIT
0.81
2438-2456
2447

ID

(microphthalmia

NO:

transcription

377

factor) and TFE3

SEQ
V$PAX8/PAX8.01
PAX 2/5/8
0.88
2441-2453
2447

ID

binding site

NO:

378

SEQ
V$TBPF/ATATA.01
Avian C-type LTR
0.81
2451-2467
2459

ID

TATA box

NO:

379

SEQ
V$GATA/GATA3.02
GATA-binding
0.91
2462-2474
2468

ID

factor 3

NO:

380

SEQ
V$CLOX/CLOX.01
Clox
0.81
2462-2478
2470

ID

NO:

381

SEQ
V$HNF6/HNF6.01
Liver enriched
0.82
2464-2478
2471

ID

Cut -

NO:

Homeodomain

382

transcription

factor HNF6

(ONECUT)

SEQ
V$PIT1/PIT1.01
Pit1, GHF-1
0.86
2468-2478
2473

ID

pituitary specific

NO:

pou domain

383

transcription

factor

SEQ
V$AP4R/TAL1BETAITF2.01
Tal-1beta/ITF-2
0.85
2469-2485
2477

ID

heterodimer

NO:

384

SEQ
V$CIZF/NMP4.01
NMP4 (nuclear
0.97
2477-2487
2482

ID

matrix protein 4)/

NO:

CIZ (Cas-

385

interacting zinc

finger protein)

SEQ
V$NFAT/NFAT.01
Nuclear factor of
0.97
2480-2490
2485

ID

activated T-cells

NO:

386

SEQ
V$STAT/STAT.01
signal
0.87
2479-2497
2488

ID

transducers and

NO:

activators of

387

transcription

SEQ
V$TBPF/TATA.02
Mammalian C-
0.89
2484-2500
2492

ID

type LTR TATA

NO:

box

388

SEQ
V$FKHD/XFD3.01

Xenopus fork
0.82
2501-2517
2509

ID

head domain

NO:

factor 3

389

SEQ
V$AP1F/AP1.01
AP1 binding site
0.95
2500-2520
2510

ID

NO:

390

SEQ
V$AP1F/AP1.01
AP1 binding site
0.95
2504-2524
2514

ID

NO:

391

SEQ
V$PCAT/CAAT.01
cellular and viral
0.90
2513-2523
2518

ID

CCAAT box

NO:

392

SEQ
V$CDXF/CDX2.01
Cdx-2
0.84
2524-2542
2533

ID

mammalian

NO:

caudal related

393

intestinal transcr.

factor

SEQ
V$MYT1/MYT1.02
MyT1 zinc finger
0.88
2539-2551
2545

ID

transcription

NO:

factor involved in

394

primary

neurogenesis

SEQ
V$ETSF/FLI.01
ETS family
0.81
2560-2576
2568

ID

member FLI

NO:

395

SEQ
V$MYT1/MYT1.01
MyT1 zinc finger
0.75
2569-2581
2575

ID

transcription

NO:

factor involved in

396

primary

neurogenesis

SEQ
V$TBPF/ATATA.01
Avian C-type LTR
0.81
2576-2592
2584

ID

TATA box

NO:

397

SEQ
V$SATB/SATB1.01
Special AT-rich
0.93
2578-2594
2586

ID

sequence-binding

NO:

protein 1,

398

predominantly

expressed in

thymocytes,

binds to matrix

attachment

regions (MARs)

SEQ
V$NKXH/NKX31.01
prostate-specific
0.84
2584-2596
2590

ID

homeodomain

NO:

protein NKX3.1

399

SEQ
V$PARF/DBP.01
Albumin D-box
0.84
2589-2603
2596

ID

binding protein

NO:

400

SEQ
V$PAX5/PAX5.02
B-cell-specific
0.75
2591-2619
2605

ID

activating protein

NO:

401

SEQ
V$ECAT/NFY.03
nuclear factor Y
0.80
2604-2618
2611

ID

(Y-box binding

NO:

factor)

402

SEQ
V$GFI1/GFI1.01
Growth factor
0.97
2608-2622
2615

ID

independence 1

NO:

zinc finger

403

protein acts as

transcriptional

repressor

SEQ
V$HNF6/HNF6.01
Liver enriched
0.82
2608-2622
2615

ID

Cut -

NO:

Homeodomain

404

transcription

factor HNF6

(ONECUT)

SEQ
V$MYT1/MYT1.01
MyT1 zinc finger
0.75
2610-2622
2616

ID

transcription

NO:

factor involved in

405

primary

neurogenesis

SEQ
V$PAX8/PAX8.01
PAX 2/5/8
0.88
2610-2622
2616

ID

binding site

NO:

406

SEQ
V$TTFF/TTF1.01
Thyroid
0.92
2609-2623
2616

ID

transcription

NO:

factor-1 (TTF1)

407

binding site

SEQ
V$MYT1/MYT1.02
MyT1 zinc finger
0.88
2612-2624
2618

ID

transcription

NO:

factor involved in

408

primary

neurogenesis

SEQ
V$CDXF/CDX2.01
Cdx-2
0.84
2612-2630
2621

ID

mammalian

NO:

caudal related

409

intestinal transcr.

factor

SEQ
V$SORY/HMGIY.01
HMGI(Y) high-
0.92
2649-2665
2657

ID

mobility-group

NO:

protein I (Y),

410

architectural

transcription

factor organizing

the framework of

a nuclear protein-

DNA

transcriptional

complex

SEQ
V$HOXF/EN1.01
Homeobox
0.77
2657-2673
2665

ID

protein engrailed

NO:

(en-1)

411

SEQ
V$OCT1/OCT1.06
octamer-binding
0.80
2662-2676
2669

ID

factor 1

NO:

412

SEQ
V$BCL6/BCL6.01
POZ/zinc finger
0.76
2683-2699
2691

ID

protein,

NO:

transcriptional

413

repressor,

translocations

observed in

diffuse large cell

lymphoma

SEQ
V$OCTP/OCT1P.01
octamer-binding
0.86
2715-2727
2721

ID

factor 1, POU-

NO:

specific domain

414

SEQ
V$TEAF/TEF1.01
TEF-1 related
0.84
2722-2734
2728

ID

muscle factor

NO:

415

SEQ
V$GFI1/GFI1.01
Growth factor
0.97
2723-2737
2730

ID

independence 1

NO:

zinc finger

416

protein acts as

transcriptional

repressor

SEQ
V$HOXT/MEIS1_HOXA9.01
Homeobox
0.79
2729-2741
2735

ID

protein MEIS1

NO:

binding site

417

SEQ
V$HOXF/HOXA9.01
Member of the
0.87
2728-2744
2736

ID

vertebrate HOX -

NO:

cluster of

418

homeobox factors

SEQ
V$PARF/DBP.01
Albumin D-box
0.84
2729-2743
2736

ID

binding protein

NO:

419

SEQ
V$VBPF/VBP.01
PAR-type chicken
0.86
2732-2742
2737

ID

vitellogenin

NO:

promoter-binding

420

protein

SEQ
V$CREB/E4BP4.01
E4BP4, bZIP
0.80
2728-2748
2738

ID

domain,

NO:

transcriptional

421

repressor

SEQ
V$OCT1/OCT1.01
octamer-binding
0.77
2733-2747
2740

ID

factor 1

NO:

422

SEQ
V$FKHD/XFD1.01

Xenopus fork
0.90
2733-2749
2741

ID

head domain

NO:

factor 1

423

SEQ
V$SRFF/SRF.01
serum response
0.66
2736-2754
2745

ID

factor

NO:

424

SEQ
V$OCTP/OCT1P.01
octamer-binding
0.86
2746-2758
2752

ID

factor 1, POU-

NO:

specific domain

425

SEQ
V$CLOX/CDPCR3.01
cut-like
0.75
2748-2764
2756

ID

homeodomain

NO:

protein

426

SEQ
V$TBPF/TATA.01
cellular and viral
0.90
2749-2765
2757

ID

TATA box

NO:

elements

427

SEQ
V$SRFF/SRF.01
serum response
0.66
2750-2768
2759

ID

factor

NO:

428

SEQ
V$TBPF/ATATA.01
Avian C-type LTR
0.81
2759-2775
2767

ID

TATA box

NO:

429

SEQ
V$TBPF/TATA.02
Mammalian C-
0.89
2762-2778
2770

ID

type LTR TATA

NO:

box

430

SEQ
V$CABL/CABL.01
Multifunctional c-
0.97
2769-2779
2774

ID

Abl src type

NO:

tyrosine kinase

431

SEQ
V$LEFF/LEF1.01
TCF/LEF-1,
0.86
2766-2782
2774

ID

involved in the

NO:

Wnt signal

432

transduction

pathway

SEQ
V$OCT1/OCT1.06
octamer-binding
0.80
2775-2789
2782

ID

factor 1

NO:

433

SEQ
V$MEF2/MMEF2.01
myocyte
0.90
2776-2798
2787

ID

enhancer factor

NO:

434

SEQ
V$OCT1/OCT1.06
octamer-binding
0.80
2780-2794
2787

ID

factor 1

NO:

435

SEQ
V$TBPF/TATA.01
cellular and viral
0.90
2779-2795
2787

ID

TATA box

NO:

elements

436

SEQ
V$CART/CART1.01
Cart-1 (cartilage
0.84
2780-2796
2788

ID

homeoprotein 1)

NO:

437

SEQ
V$FKHD/XFD2.01

Xenopus fork-
0.89
2780-2796
2788

ID

head domain

NO:

factor 2

438

SEQ
V$MEF2/MEF2.05
MEF2
0.96
2778-2800
2789

ID

NO:

439

SEQ
V$BRNF/BRN3.01
POU transcription
0.78
2785-2801
2793

ID

factor Brn-3

NO:

440

SEQ
V$TBPF/TATA.01
cellular and viral
0.90
2786-2802
2794

ID

TATA box

NO:

elements

441

SEQ
V$GFI1/GFI1.01
Growth factor
0.97
2791-2805
2798

ID

independence 1

NO:

zinc finger

442

protein acts as

transcriptional

repressor

SEQ
V$HOXT/MEIS1_HOXA9.01
Homeobox
0.79
2797-2809
2803

ID

protein MEIS1

NO:

binding site

443

SEQ
V$MEF2/MMEF2.01
myocyte
0.90
2792-2814
2803

ID

enhancer factor

NO:

444

SEQ
V$MEF2/MEF2.05
MEF2
0.96
2795-2817
2806

ID

NO:

445

SEQ
V$MEF2/MMEF2.01
myocyte
0.90
2797-2819
2808

ID

enhancer factor

NO:

446

SEQ
V$HNF1/HNF1.01
hepatic nuclear
0.78
2802-2818
2810

ID

factor 1

NO:

447

SEQ
V$MEF2/MEF2.01
myogenic
0.74
2799-2821
2810

ID

enhancer factor 2

NO:

448

SEQ
V$HOXF/HOX1-3.01
Hox-1.3,
0.83
2814-2830
2822

ID

vertebrate

NO:

homeobox

449

protein

SEQ
V$PARF/DBP.01
Albumin D-box
0.84
2816-2830
2823

ID

binding protein

NO:

450

SEQ
V$PDX1/ISL1.01
Pancreatic and
0.82
2814-2834
2824

ID

intestinal lim-

NO:

homeodomain

451

factor

SEQ
V$GATA/GATA1.02
GATA-binding
0.99
2819-2831
2825

ID

factor 1

NO:

452

SEQ
V$HEAT/HSF1.01
heat shock factor 1
0.93
2845-2855
2850

ID

NO:

453

SEQ
V$MYT1/MYT1.02
MyT1 zinc finger
0.88
2853-2865
2859

ID

transcription

NO:

factor involved in

454

primary

neurogenesis

SEQ
V$BCL6/BCL6.02
POZ/zinc finger
0.77
2857-2873
2865

ID

protein,

NO:

transcriptional

455

repressor,

translocations

observed in

diffuse large cell

lymphoma

SEQ
V$TTFF/TTF1.01
Thyroid
0.92
2863-2877
2870

ID

transcription

NO:

factor-1 (TTF1)

456

binding site

SEQ
V$EBOX/USF.02
upstream
0.94
2868-2884
2876

ID

stimulating factor

NO:

457

SEQ
V$HOXF/PTX1.01
Pituitary
0.79
2892-2908
2900

ID

Homeobox 1

NO:

(Ptx1)

458

SEQ
V$MYOD/LMO2COM.01
complex of Lmo2
0.98
2901-2915
2908

ID

bound to Tal-1,

NO:

E2A proteins, and

459

GATA-1, half-site 1

SEQ
V$REBV/EBVR.01
Epstein-Barr
0.81
2904-2924
2914

ID

virus

NO:

transcription

460

factor R

SEQ
V$ETSF/PU1.01
Pu.1 (Pu120) Ets-
0.86
2932-2948
2940

ID

like transcription

NO:

factor identified

461

in lymphoid B-

cells

SEQ
V$MITF/MIT.01
MIT
0.81
2943-2961
2952

ID

(microphthalmia

NO:

transcription

462

factor) and TFE3

SEQ
V$HAML/AML1.01
runt-factor AML-1
1.00
2950-2964
2957

ID

NO:

463

SEQ
V$NFKB/CREL.01
c-Rel
0.91
2954-2968
2961

ID

NO:

464

SEQ
V$IKRS/IK3.01
Ikaros 3,
0.84
2958-2970
2964

ID

potential

NO:

regulator of

465

lymphocyte

differentiation

SEQ
V$RBPF/RBPJK.01
Mammalian
0.84
2957-2971
2964

ID

transcriptional

NO:

repressor RBP-

466

Jkappa/CBF1

SEQ
V$E2FF/E2F.01
E2F, involved in
0.74
2966-2980
2973

ID

cell cycle

NO:

regulation,

467

interacts with Rb

p107 protein

SEQ
V$E4FF/E4F.01
GLI-Krueppel-
0.82
2968-2980
2974

ID

related

NO:

transcription

468

factor, regulator

of adenovirus E4

promoter

SEQ
V$CREB/ATF6.02
Activating
0.85
2966-2986
2976

ID

transcription

NO:

factor 6, member

469

of b-zip family,

induced by ER

stress

SEQ
V$EBOX/ARNT.01
AhR nuclear
0.89
2968-2984
2976

ID

translocator

NO:

homodimers

470

SEQ
V$E4FF/E4F.01
GLI-Krueppel-
0.82
2971-2983
2977

ID

related

NO:

transcription

471

factor, regulator

of adenovirus E4

promoter

SEQ
V$EBOR/XBP1.01
X-box-binding
0.86
2970-2984
2977

ID

protein 1

NO:

472

SEQ
V$E2FF/E2F.01
E2F, involved in
0.74
2971-2985
2978

ID

cell cycle

NO:

regulation,

473

interacts with Rb

p107 protein

SEQ
V$STAT/STAT.01
signal
0.87
2989-3007
2998

ID

transducers and

NO:

activators of

474

transcription

SEQ
V$BCL6/BCL6.02
POZ/zinc finger
0.77
2991-3007
2999

ID

protein,

NO:

transcriptional

475

repressor,

translocations

observed in

diffuse large cell

lymphoma

SEQ
V$XSEC/STAF.01
Se-Cys tRNA
0.77
3003-3025
3014

ID

gene

NO:

transcription

476

activating factor

SEQ
V$NF1F/NF1.01
Nuclear factor 1
0.94
3007-3025
3016

ID

NO:

477

SEQ
V$OCT1/OCT1.02
octamer-binding
0.82
3014-3028
3021

ID

factor 1

NO:

478

SEQ
V$RCAT/CLTR_CAAT.01
Mammalian C-
0.75
3019-3043
3031

ID

type LTR CCAAT

NO:

box

479

SEQ
V$SF1F/SF1.01
SF1 steroidogenic
0.95
3033-3045
3039

ID

factor 1

NO:

480

SEQ
V$OCT1/OCT1.01
octamer-binding
0.77
3038-3052
3045

ID

factor 1

NO:

481

SEQ
V$PARF/DBP.01
Albumin D-box
0.84
3042-3056
3049

ID

binding protein

NO:

482

SEQ
V$ETSF/ETS1.01
c-Ets-1 binding
0.92
3057-3073
3065

ID

site

NO:

483

SEQ
V$LEFF/LEF1.01
TCF/LEF-1,
0.86
3062-3078
3070

ID

involved in the

NO:

Wnt signal

484

transduction

pathway

SEQ
V$MAZF/MAZ.01
Myc associated
0.90
3072-3084
3078

ID

zinc finger

NO:

protein (MAZ)

485

SEQ
V$SP1F/GC.01
GC box elements
0.88
3071-3085
3078

ID

NO:

486

SEQ
V$TBPF/TATA.01
cellular and viral
0.90
3091-3107
3099

ID

TATA box

NO:

487

elements

SEQ
V$SEF1/SEF1.01
SEF1 binding site
0.69
3099-3117
3108

ID

NO:

488

SEQ

ID

Core
Matrix

NO:
Str.
sim.
sim.
Sequence

SEQ
(+)
1.000
0.900
ggaccatCAAAgtctgt

ID

NO:

41

SEQ
(+)
1.000
0.823
agtctgtCATGtcatttgg

ID

NO:

42

SEQ
(+)
0.833
0.904
gTCATgtcatttggg

ID

NO:

43

SEQ
(+)
1.000
1.000
GTCAttt

ID

NO:

44

SEQ
(+)
1.000
0.735
ctgtcatgtcatTTGGgggagggcctatg

ID

NO:

45

SEQ
(−)
1.000
0.982
gccctCCCCcaaa

ID

NO:

46

SEQ
(+)
0.876
0.898
tgggGGAGggcctat

ID

NO:

47

SEQ
(−)
0.884
0.708
acagaggagggcATAGgccct

ID

NO:

48

SEQ
(−)
0.800
0.811
cagataCACAgaggagggcataggccctc

ID

NO:

49

SEQ
(−)
1.000
0.987
tgctattTAAGcccaga

ID

NO:

50

SEQ
(−)
1.000
0.932
tgctATTTa

ID

NO:

51

SEQ
(−)
0.750
0.865
ggtatgctATTTaag

ID

NO:

52

NO:

65

SEQ
(−)
1.000
0.894
ttatAAAGctgagga

ID

NO:

66

SEQ
(−)
1.000
0.910
agttaTAAAgctgagga

ID

NO:

67

SEQ
(−)
1.000
0.902
agtgAAAGcagagag

ID

NO:

68

SEQ
(−)
0.750
0.756
craCAGTtgacct

ID

NO:

69

SEQ
(+)
1.000
0.940
GTCTtgact

ID

NO:

70

SEQ
(−)
1.000
0.960
gagggATTAgaaaagga

ID

NO:

71

SEQ
(−)
1.000
0.906
ggaatCCAAtygtag

ID

NO:

72

SEQ
(+)
0.789
0.802
ctacraTTGGattccat

ID

NO:

73

SEQ
(−)
1.000
0.897
tacagcTAAAcactgag

ID

NO:

74

SEQ
(−)
0.840
0.865
gagcctTCATccagtagct

ID

NO:

75

SEQ
(−)
1.000
0.746
tgtcatcttagagCCTTcatc

ID

NO:

76

SEQ
(+)
1.000
0.861
agGCTCtaagatg

ID

NO:

77

SEQ
(+)
0.750
0.837
tcTAAGatgacaattaa

ID

NO:

78

SEQ
(+)
0.807
0.840
aaGATGacaattaag

ID

NO:

79

SEQ
(+)
1.000
0.994
gacaATTAa

ID

NO:

80

SEQ
(−)
1.000
1.000
cctTAATtgtcat

ID

NO:

81

SEQ
(−)
0.766
0.808
cgacgattACCTtaattgtca

ID

NO:

82

SEQ
(−)
0.750
0.768
aatgaggATCGacgattacct

ID

NO:

83

SEQ
(+)
1.000
0.886
cgatcctcATTAtagtg

ID

NO:

84

SEQ
(+)
1.000
0.868
tatagtGGAAgggcttc

ID

NO:

85

SEQ
(+)
1.000
0.904
agggcttCAAAggcagt

ID

NO:

86

SEQ
(−)
0.758
0.867
gagacTGCCtttgaagccc

ID

NO:

87

SEQ
(−)
1.000
0.971
ttcaGATAggcag

ID

NO:

88

SEQ
(−)
0.757
0.672
atgttcaGATAggcagtag

ID

NO:

89

SEQ
(−)
0.800
0.824
gGAAAtgttcagatagg

ID

NO:

90

SEQ
(+)
1.000
0.951
cctaatgCCAGatgtct

ID

NO:

91

SEQ
(+)
1.000
0.852
aatgcCAGAtgtctctt

ID

NO:

92

SEQ
(−)
1.000
0.851
gagaCATCtggca

ID

NO:

93

SEQ
(−)
1.000
0.984
aggataggttTAAAgagacatct

ID

NO:

94

SEQ
(−)
1.000
0.774
gGATAggtttaaagaga

ID

NO:

95

SEQ
(+)
1.000
0.813
tgtctcttTAAAcctatcctggc

ID

NO:

96

SEQ
(+)
1.000
0.877
ctcttTAAAcctatcct

ID

NO:

97

SEQ
(+)
1.000
0.845
ctcccttcATTAaggta

ID

NO:

98

SEQ
(−)
1.000
0.834
gagatacctTAATgaagggag

ID

NO:

99

SEQ
(+)
0.944
0.926
gGTATctcatttttt

ID

NO:

100

SEQ
(−)
1.000
0.972
gcAAAAaatga

ID

NO:

101

SEQ
(−)
0.764
0.720
ggaaCAGAggagagcaa

ID

NO:

102

SEQ
(−)
0.881
0.964
aaaactgaATCAgtggnggaa

ID

NO:

103

SEQ
(+)
1.000
0.886
actgATTCagt

ID

NO:

104

SEQ
(+)
0.850
0.956
nccactgaTTCAgtttttctg

ID

NO:

105

SEQ
(−)
0.876
0.910
aaaAACTgaat

ID

NO:

106

SEQ
(−)
1.000
0.975
agAAAAactga

ID

NO:

107

SEQ
(+)
1.000
0.875
ctgatccctctTGTTctcc

ID

NO:

108

SEQ
(−)
1.000
0.971
gaaaaagagaAGGGa

ID

NO:

109

SEQ
(−)
1.000
0.987
ggAAAAagaga

ID

NO:

110

SEQ
(−)
1.000
0.982
ggagGAAAaag

ID

NO:

111

SEQ
(−)
1.000
0.910
ggtgGAGGgaagg

ID

NO:

112

SEQ
(−)
1.000
0.932
gggggTGGGagggtg

ID

NO:

113

SEQ
(+)
1.000
0.972
tcccaCCCCcatg

ID

NO:

114

SEQ
(−)
1.000
0.815
aggaagggGAAAggg

ID

NO:

115

SEQ
(−)
1.000
0.911
aaaataggAAATaagga

ID

NO:

116

SEQ
(−)
1.000
0.883
aaaataGGAAataagga

ID

NO:

117

SEQ
(−)
0.760
0.792
aGAGAaaataggaaata

ID

NO:

118

SEQ
(−)
0.763
0.817
cccccagagaaAATAgg

ID

NO:

119

SEQ
(−)
1.000
0.934
ccacaCCCCcaga

ID

NO:

120

SEQ
(+)
0.983
0.894
gggtgtgGATTttat

ID

NO:

121

SEQ
(−)
1.000
0.942
caccaTAAAatccacac

ID

NO:

122

SEQ
(−)
0.866
0.813
aacataTGCAcagaagggcttccaccata

ID

NO:

123

SEQ
(−)
1.000
0.790
catATGCacagaagg

ID

NO:

124

SEQ
(−)
1.000
0.910
caacatATGCaca

ID

NO:

125

SEQ
(+)
0.757
0.666
ctgtgcaTATGttgtctta

ID

NO:

126

SEQ
(−)
0.750
0.828
caataagacaaCATAtg

ID

NO:

127

SEQ
(−)
1.000
0.776
ccAATAagacaacatat

ID

NO:

128

SEQ
(−)
1.000
0.836
tcaaccaatAAGAcaac

ID

NO:

129

SEQ
(−)
1.000
0.960
atcaaCCAAtaagac

ID

NO:

130

SEQ
(+)
1.000
0.844
tcttatTGGTtgata

ID

NO:

131

SEQ
(−)
1.000
0.943
tcaaCCAAtaa

ID

NO:

132

SEQ
(+)
1.000
0.956
ggttGATAaataa

ID

NO:

133

SEQ
(+)
0.757
0.791
gGTTGataaataaagca

ID

NO:

134

SEQ
(−)
0.750
0.797
gTGCTttatttat

ID

NO:

135

SEQ
(+)
1.000
0.912
gttgtCCAAtaggga

ID

NO:

136

SEQ
(+)
0.750
0.843
aataggGAAAcaagata

ID

NO:

137

SEQ
(+)
1.000
0.960
tagggaaacaAGATagg

ID

NO:

138

SEQ
(+)
1.000
0.970
acaaGATAggtgg

ID

NO:

139

SEQ
(−)
0.750
0.867
cccaCCTAtct

ID

NO:

140

SEQ
(−)
1.000
0.929
ggatcacatgGCAAccctc

ID

NO:

141

SEQ
(−)
0.895
0.936
ggatCACAtggcaacc

ID

NO:

142

SEQ
(+)
1.000
0.863
gggttgcCATGtgatccta

ID

NO:

143

SEQ
(+)
1.000
0.950
ctaggaGGAAttgacac

ID

NO:

144

SEQ
(−)
1.000
0.800
catgtgtcAATTcct

ID

NO:

145

SEQ
(−)
1.000
1.000
tGTCAat

ID

NO:

146

SEQ
(−)
1.000
0.835
ccattctCATGtgtcaatt

ID

NO:

147

SEQ
(+)
0.846
0.800
caCATGagaatgggg

ID

NO:

148

SEQ
(+)
1.000
0.998
gaaaGATAagtcc

ID

NO:

149

SEQ
(−)
1.000
0.672
atattttTATAaggactta

ID

NO:

150

SEQ
(−)
1.000
0.867
atatattTTTAtaaggact

ID

NO:

151

SEQ
(+)
1.000
0.894
tccttaTAAAaatatat

ID

NO:

152

SEQ
(+)
1.000
0.740
agtccttaTAAAaatatatatta

ID

NO:

153

SEQ
(+)
1.000
0.963
ccttaTAAAaatatata

ID

NO:

154

SEQ
(−)
1.000
0.870
acTAATatatattttta

ID

NO:

155

SEQ
(−)
1.000
0.855
caTAATtactaatatat

ID

NO:

156

SEQ
(−)
1.000
0.943
cataattacTAATatat

ID

NO:

157

SEQ
(−)
1.000
0.816
cccATAAttactaatat

ID

NO:

158

SEQ
(−)
0.757
0.765
ccCATAattactaatat

ID

NO:

159

SEQ
(+)
1.000
0.989
agtaATTAt

ID

NO:

160

SEQ
(−)
1.000
0.976
ccatAATTactaa

ID

NO:

161

SEQ
(−)
1.000
0.886
aacccataATTActaat

ID

NO:

162

SEQ
(−)
1.000
0.775
attaacccaTAATtactaata

ID

NO:

163

SEQ
(+)
0.826
0.844
tatgggttAATAattaa

ID

NO:

164

SEQ
(−)
0.755
0.857
aCTTAattattaaccca

ID

NO:

165

SEQ
(+)
1.000
0.966
gGTTAataattaagtca

ID

NO:

166

SEQ
(+)
1.000
0.972
taatAATTaag

ID

NO:

167

SEQ
(−)
1.000
0.995
cttaATTAt

ID

NO:

168

SEQ
(−)
1.000
0.873
ctgacttaATTAttaac

ID

NO:

169

SEQ
(+)
1.000
0.988
taatAATaagtc

ID

NO:

170

SEQ
(+)
1.000
0.931
taataATTAagtc

ID

NO:

171

SEQ
(+)
1.000
0.881
taataatTAAGtcagag

ID

NO:

172

SEQ
(−)
0.766
0.819
tagctctgACTTaattattaa

ID

NO:

173

SEQ
(+)
0.750
0.874
ataattaAGTCagagct

ID

NO:

174

SEQ
(+)
0.856
0.928
ctagCCATtaa

ID

NO:

175

SEQ
(−)
1.000
0.903
tctTAATggctag

ID

NO:

176

SEQ
(−)
0.770
0.842
ctagtGTTTcttaatggctag

ID

NO:

177

SEQ
(+)
1.000
0.891
gcttcataATTAatata

ID

NO:

178

SEQ
(−)
1.000
0.995
attaATTAt

ID

NO:

179

SEQ
(+)
1.000
0.988
tcatAATTaatat

ID

NO:

180

SEQ
(+)
1.000
0.952
tcataATTAatat

ID

NO:

181

SEQ
(+)
1.000
0.945
ttcataatTAATatagt

ID

NO:

182

SEQ
(−)
1.000
0.885
actatattAATTatg

ID

NO:

183

SEQ
(−)
1.000
0.854
gatactatATTAattat

ID

NO:

184

SEQ
(+)
0.750
0.875
tgtatgttCATTtgg

ID

NO:

185

SEQ
(+)
0.850
0.887
gtatgttCATTtggg

ID

NO:

186

SEQ
(−)
1.000
0.816
cCCCAaatgaacata

ID

NO:

187

SEQ
(−)
1.000
0.849
tcagcccCAAAtgaa

ID

NO:

188

SEQ
(+)
1.000
0.828
tggggcTGACacagttctggg

ID

NO:

189

SEQ
(+)
1.000
0.833
ggggcTGACacagttctggga

ID

NO:

190

SEQ
(+)
0.750
0.791
aggAAGAytactt

ID

NO:

191

SEQ
(−)
0.804
0.820
cctacaATCCatgtacc

ID

NO:

192

SEQ
(−)
1.000
0.864
atagagCAAAggactac

ID

NO:

193

SEQ
(−)
1.000
0.907
catagagCAAAggacta

ID

NO:

194

SEQ
(−)
1.000
0.700
tagacatagagcAAAGgacta

ID

NO:

195

SEQ
(+)
0.804
0.831
gtctaaATCCatatatg

ID

NO:

196

SEQ
(+)
0.833
0.929
ctaaaTCCAtatatg

ID

NO:

197

SEQ
(+)
1.000
0.851
aaatCCATatatgaatgag

ID

NO:

198

SEQ
(−)
1.000
0.761
actcattcatatATGGa

ID

NO:

199

SEQ
(−)
1.000
0.919
actcATTCata

ID

NO:

200

SEQ
(−)
0.807
0.901
tggtATGTa

ID

NO:

201

SEQ
(−)
1.000
0.922
gaaagayAAACatggta

ID

NO:

202

SEQ
(−)
0.789
0.898
gtgAGGTaacccc

ID

NO:

203

SEQ
(+)
1.000
0.854
atgggGTTAcctcactcagga

ID

NO:

204

SEQ
(+)
1.000
0.903
gTTACctcact

ID

NO:

205

SEQ
(−)
0.758
0.870
cgcAGGCaaatgaat

ID

NO:

206

SEQ
(+)
0.758
0.850
tcattTGCCtgcgaatttt

ID

NO:

207

SEQ
(+)
1.000
0.869
tgcgaatTTTAagattcca

ID

NO:

208

SEQ
(−)
1.000
0.990
taaaaCAATggaatctt

ID

NO:

209

SEQ
(−)
1.000
0.931
aggaataaAACAatgga

ID

NO:

210

SEQ
(+)
1.000
0.865
ccattgtTTTAttcctctg

ID

NO:

211

SEQ
(−)
0.894
0.876
gaggAATAaaacaat

ID

NO:

212

SEQ
(+)
1.000
0.824
tcctctgagTAATactccatt

ID

NO:

213

SEQ
(−)
1.000
0.925
ttacaCAATggagtatt

ID

NO:

214

SEQ
(−)
0.901
0.920
ggtacATTAcacaatggagta

ID

NO:

215

SEQ
(−)
1.000
0.871
aTTACacaatg

ID

NO:

216

SEQ
(+)
0.929
0.955
tccattgtGTAAtgtacca

ID

NO:

217

SEQ
(+)
1.000
0.859
tccattgtgTAATgtaccaca

ID

NO:

218

SEQ
(−)
1.000
1.000
aaaatgTGGTacatt

ID

NO:

219

SEQ
(+)
0.750
0.819
aatgtaccacaTTTTctcc

ID

NO:

220

SEQ
(+)
1.000
0.896
taCATTcttcagt

ID

NO:

221

SEQ
(+)
1.000
0.990
caGTTGagg

ID

NO:

222

SEQ
(−)
1.000
0.932
gcaatagCCAGaacctg

ID

NO:

223

SEQ
(−)
1.000
0.945
gcaatagCCAG

ID

NO:

224

SEQ
(−)
1.000
0.951
attTGCAatagcc

ID

NO:

225

SEQ
(+)
1.000
0.853
tggctattGCAAataaccc

ID

NO:

226

SEQ
(+)
1.000
0.809
ctggctattgcAAATaaccctgc

ID

NO:

227

SEQ
(+)
1.000
0.889
ctattgcAAATaacc

ID

NO:

228

SEQ
(−)
1.000
0.900
ggttATTTg

ID

NO:

229

SEQ
(+)
0.975
0.761
acatatgtcattATTGt

ID

NO:

230

SEQ
(+)
0.944
0.938
cATATgtcattattg

ID

NO:

231

SEQ
(+)
1.000
0.836
catatgtcATTAttgta

ID

NO:

232

SEQ
(−)
1.000
0.889
ttcatacaaTAATgacatatg

ID

NO:

233

SEQ
(−)
1.000
0.870
tcataCAATaatgacat

ID

NO:

234

SEQ
(−)
0.944
0.914
aATATgtaaaacaga

ID

NO:

235

SEQ
(−)
1.000
0.856
tttaaaatatGTAAaacagat

ID

NO:

236

SEQ
(+)
1.000
0.886
tTTACatattt

ID

NO:

237

SEQ
(+)
1.000
0.841
tatttTAAAccatctct

ID

NO:

238

SEQ
(−)
1.000
0.783
caagCAATctaga

ID

NO:

239

SEQ
(+)
1.000
0.765
tctctagATTGcttgtaatat

ID

NO:

240

SEQ
(−)
1.000
0.997
tttaaCAATattacaag

ID

NO:

241

SEQ
(+)
1.000
0.885
tattgtTAAAcatagag

ID

NO:

242

SEQ
(+)
1.000
0.839
catagagagTAATaatgctat

ID

NO:

243

SEQ
(−)
1.000
0.872
atagcattATTActctc

ID

NO:

244

SEQ
(−)
0.826
0.843
tttatagcaTTATtactctct

ID

NO:

245

SEQ
(+)
1.000
0.829
agTAATaatgctataaa

ID

NO:

246

SEQ
(−)
1.000
0.906
tttaattTTTAtagcatta

ID

NO:

247

SEQ
(+)
1.000
0.983
aataatgctaTAAAaattaaaaa

ID

NO:

248

SEQ
(−)
0.755
0.805
tTTTAatttttatagca

ID

NO:

249

SEQ
(+)
1.000
0.832
gctataaaAATTaaa

ID

NO:

250

SEQ
(+)
1.000
0.991
tgctaTAAAaattaaaa

ID

NO:

251

SEQ
(−)
1.000
0.989
tttTAATttttat

ID

NO:

252

SEQ
(+)
1.000
0.944
taaaaATTAaaaa

ID

NO:

253

SEQ
(+)
1.000
0.807
ataaaaatTAAAaataatgataa

ID

NO:

254

SEQ
(+)
1.000
0.872
aataatgatAAGAaaga

ID

NO:

255

SEQ
(+)
1.000
0.993
taatGATAagaaa

ID

NO:

256

SEQ
(+)
1.000
0.931
gaaAGATcctata

ID

NO:

257

SEQ
(+)
1.000
0.915
tacAGATgaaaat

ID

NO:

258

SEQ
(+)
0.763
0.867
cagATGAaaatttag

ID

NO:

259

SEQ
(+)
0.985
0.964
aaaatttaGAAAtacttta

ID

NO:

260

SEQ
(−)
0.958
0.866
agcTAAAgtatttct

ID

NO:

261

SEQ
(−)
1.000
0.763
TCGTcagtggtag

ID

NO:

262

SEQ
(+)
1.000
0.923
taccacTGACgaaatttgtat

ID

NO:

263

SEQ
(−)
1.000
0.959
tttaattCCAGacattc

ID

NO:

264

SEQ
(−)
1.000
0.977
cttTAATtccaga

ID

NO:

265

SEQ
(+)
1.000
0.923
ctggaATTAaaga

ID

NO:

266

SEQ
(+)
1.000
0.898
ctggAATTaaagaaa

ID

NO:

267

SEQ
(−)
1.000
0.915
cagTAATttcttt

ID

NO:

268

SEQ
(+)
1.000
0.922
aaagaaattacTGTTcttt

ID

NO:

269

SEQ
(−)
1.000
0.934
ttataTAAAgaacagta

ID

NO:

270

SEQ
(−)
1.000
0.890
attataTAAAgaacagt

ID

NO:

271

SEQ
(−)
0.891
0.923
tattaTATAaagaacag

ID

NO:

272

SEQ
(−)
0.769
0.856
ctattattatATAAagaacag

ID

NO:

273

SEQ
(+)
1.000
0.836
tttatataaTAATagactgta

ID

NO:

274

SEQ
(+)
0.791
0.760
tataataATAGactgtaaaat

ID

NO:

275

SEQ
(+)
1.000
0.912
gactgTAAAatggcaac

ID

NO:

276

SEQ
(+)
0.750
0.817
gtaaaatgGCAActt

ID

NO:

277

SEQ
(+)
1.000
0.907
ctgtaaaatgGCAActttt

ID

NO:

278

SEQ
(−)
1.000
0.882
taaAAGTtgccat

ID

NO:

279

SEQ
(+)
1.000
0.878
tatttgctAATTcac

ID

NO:

280

SEQ
(−)
0.777
0.865
tcctgTGAAttagcaaatatt

ID

NO:

281

SEQ
(+)
1.000
0.969
tgCTAAttcacag

ID

NO:

282

SEQ
(−)
0.850
0.866
tcctgtgAATTagca

ID

NO:

283

SEQ
(+)
0.750
0.788
ttgctaatTCACaggat

ID

NO:

284

SEQ
(−)
1.000
0.973
agAAAAaatcc

ID

NO:

285

SEQ
(−)
1.000
0.908
agatgTTCCaaagaaaaaa

ID

NO:

286

SEQ
(−)
1.000
0.919
ttgttCAGAtgttccaa

ID

NO:

287

SEQ
(+)
1.000
0.953
tgaacaAATTtccctta

ID

NO:

288

SEQ
(+)
0.750
0.757
acaAATTtccctt

ID

NO:

289

SEQ
(+)
1.000
0.771
tttccctTATAtgaatcac

ID

NO:

290

SEQ
(−)
1.000
0.908
agtGATTcatataaggg

ID

NO:

291

SEQ
(−)
1.000
0.797
gTGATtcatataa

ID

NO:

292

SEQ
(−)
1.000
0.912
agtgATTCata

ID

NO:

293

SEQ
(+)
0.881
0.958
ttatatgaATCActtacattt

ID

NO:

294

SEQ
(+)
1.000
0.860
cTTACattttt

ID

NO:

295

SEQ
(+)
0.850
0.829
gcctgttCATTtaaa

ID

NO:

296

SEQ
(−)
1.000
0.832
gtttTTTAaatgaacag

ID

NO:

297

SEQ
(+)
1.000
0.853
tcattTAAAaaactgca

ID

NO:

298

SEQ
(+)
1.000
0.866
actgcAGGAaagttgtg

ID

NO:

299

SEQ
(+)
1.000
0.891
ggaAAGTtgtgat

ID

NO:

300

SEQ
(−)
1.000
1.000
ataAATCacaacttt

ID

NO:

301

SEQ
(−)
1.000
0.931
cattaTAAAtcacaact

ID

NO:

302

SEQ
(−)
1.000
0.933
tgcattatAAATcacaa

ID

NO:

303

SEQ
(+)
1.000
0.924
gTGATttataatg

ID

NO:

304

SEQ
(−)
0.866
0.827
agttgcatTATAaatcacaactt

ID

NO:

305

SEQ
(+)
0.894
0.898
tgtgATTTataatgc

ID

NO:

306

SEQ
(+)
1.000
0.971
tgtGATTtataatgcaa

ID

NO:

307

SEQ
(+)
1.000
0.916
gtgatttaTAATgcaac

ID

NO:

308

SEQ
(+)
0.884
0.891
atttaTAATgcaact

ID

NO:

309

SEQ
(+)
1.000
0.861
ataATGCaactgcac

ID

NO:

310

SEQ
(+)
1.000
0.910
cagtctTAAAcaatgct

ID

NO:

311

SEQ
(+)
1.000
0.992
ttaaaCAATgctaacca

ID

NO:

312

SEQ
(+)
1.000
0.981
actgtGTTTcagc

ID

NO:

313

SEQ
(−)
1.000
0.889
gggAAGTttatgc

ID

NO:

314

SEQ
(−)
1.000
0.878
tgtgTGGGaagttta

ID

NO:

315

SEQ
(+)
0.763
0.826
actATGAaaacacat

ID

NO:

316

SEQ
(+)
1.000
0.786
actatgaaAACAcatgc

ID

NO:

317

SEQ
(+)
0.895
0.920
gaaaaCACAtgcttaaa

ID

NO:

318

SEQ
(−)
0.773
0.791
cctttAAGCatgtgttttc

ID

NO:

319

SEQ
(+)
1.000
0.874
cttaaaggCAAAtct

ID

NO:

320

SEQ
(−)
0.858
0.782
aGGTAaagatttgcctt

ID

NO:

321

SEQ
(−)
1.000
0.853
ctgaggTAAAgatttgc

ID

NO:

322

SEQ
(−)
0.789
0.830
ctgAGGTaaagat

ID

NO:

323

SEQ
(+)
0.766
0.820
aaatctttACCTcagttaact

ID

NO:

324

SEQ
(+)
1.000
0.862
tTTACctcagt

ID

NO:

325

SEQ
(−)
0.750
0.775
gaaTAGTtaactg

ID

NO:

326

SEQ
(+)
1.000
0.811
aGTTAactattccatag

ID

NO:

327

SEQ
(+)
0.856
0.925
agagCCATtga

ID

NO:

328

SEQ
(−)
1.000
0.873
tgaacTCAAtggctc

ID

NO:

329

SEQ
(−)
1.000
0.980
ctTGAActcaa

ID

NO:

330

SEQ
(+)
1.000
0.854
attgagtTCAAgtgcattt

ID

NO:

331

SEQ
(+)
0.782
0.813
tgagTTCAagtgcattt

ID

NO:

332

SEQ
(+)
1.000
1.000
gttcAAGTgcatt

ID

NO:

333

SEQ
(+)
1.000
0.928
agaAGATataatg

ID

NO:

334

SEQ
(−)
0.891
0.912
atataTATAtggccata

ID

NO:

335

SEQ
(+)
1.000
0.777
atggccaTATAtatatata

ID

NO:

336

SEQ
(−)
1.000
0.806
atatatatatatATGGc

ID

NO:

337

SEQ
(−)
0.750
0.675
CTGTgctgatatatatata

ID

NO:

338

SEQ
(+)
0.750
0.827
atatataTCAGcacagt

ID

NO:

339

SEQ
(+)
1.000
0.904
ataTATCagcacagt

ID

NO:

340

SEQ
(+)
1.000
0.704
atcAGCAcagtggaaacagtt

ID

NO:

341

SEQ
(+)
1.000
0.970
agtgGAAAcag

ID

NO:

342

SEQ
(−)
1.000
0.991
taactGTTTccac

ID

NO:

343

SEQ
(−)
1.000
0.798
tGTTAttaactgtttcc

ID

NO:

344

SEQ
(+)
0.826
0.824
aaacagttAATAacatt

ID

NO:

345

SEQ
(+)
1.000
0.749
ggaaacagtTAATaacatttt

ID

NO:

346

SEQ
(−)
1.000
0.863
taTATGctaaaatgt

ID

NO:

347

SEQ
(−)
0.891
0.908
tagtaTATAtgctaaaa

ID

NO:

348

SEQ
(+)
1.000
0.897
gaggctGGAAgggggct

ID

NO:

349

SEQ
(+)
1.000
0.787
gctggaagggggcTCAGcagtta

ID

NO:

350

SEQ
(−)
0.876
0.901
attAACTgctg

ID

NO:

351

SEQ
(+)
0.750
0.840
atagcacatacTATTcttc

ID

NO:

352

SEQ
(+)
0.782
0.747
gtttggtttTCATcacccatg

ID

NO:

353

SEQ
(−)
1.000
0.988
gaacCACCtgacatg

ID

NO:

354

SEQ
(−)
1.000
0.958
tacaGATAgaaat

ID

NO:

355

SEQ
(+)
1.000
0.924
gtaacCAGAtgatacga

ID

NO:

356

SEQ
(−)
0.750
0.762
agGTACccaaggggact

ID

NO:

357

SEQ
(−)
1.000
0.960
aggtGATAgaggt

ID

NO:

358

SEQ
(−)
1.000
0.939
atagCAGGtgataga

ID

NO:

359

SEQ
(+)
0.759
0.710
cacctgctattctCACCcaaaga

ID

NO:

360

SEQ
(+)
1.000
0.805
aCCCAaagacacaca

ID

NO:

361

SEQ
(−)
0.944
0.904
tGTATgtgagtgtgt

ID

NO:

362

SEQ
(+)
1.000
0.854
tgcATGCacatagtt

ID

NO:

363

SEQ
(−)
0.977
0.855
tGAACtatgtgcatg

ID

NO:

364

SEQ
(+)
0.750
0.767
catagttcAAAAaataaaatttt

ID

NO:

365

SEQ
(−)
1.000
0.896
ttaaaatTTTAttttttga

ID

NO:

366

SEQ
(−)
0.750
0.798
taaAATTttattt

ID

NO:

367

SEQ
(+)
1.000
0.991
aaagGAAAaaa

ID

NO:

368

SEQ
(+)
1.000
0.977
ggAAAAaaagc

ID

NO:

369

SEQ
(−)
1.000
0.946
aaaAGATttgagc

ID

NO:

370

SEQ
(−)
1.000
0.901
aggaATTTt

ID

NO:

371

SEQ
(+)
0.806
0.820
taaaatTCCTatgagtgtgtgat

ID

NO:

372

SEQ
(−)
0.782
0.753
tactgacttTGATcacacact

ID

NO:

373

SEQ
(−)
1.000
0.942
cacAGATtatacc

ID

NO:

374

SEQ
(+)
1.000
0.971
tgtgGAAAaca

ID

NO:

375

SEQ
(+)
0.980
0.879
ctcagtATTCaca

ID

NO:

376

SEQ
(−)
1.000
0.827
ctactttCATGtgtgaata

ID

NO:

377

SEQ
(−)
0.850
0.952
cttTCATgtgtga

ID

NO:

378

SEQ
(+)
1.000
0.838
aagtagcTAAGaataaa

ID

NO:

379

SEQ
(−)
1.000
0.960
aatAGATtttatt

ID

NO:

380

SEQ
(+)
0.806
0.819
aataaaATCTattcatc

ID

NO:

381

SEQ
(+)
0.785
0.846
taaaaTCTAttcatc

ID

NO:

382

SEQ
(+)
1.000
0.890
atctATTCatc

ID

NO:

383

SEQ
(−)
1.000
0.881
aaaaaCAGAtgaataga

ID

NO:

384

SEQ
(−)
1.000
0.981
ggAAAAacaga

ID

NO:

385

SEQ
(−)
1.000
0.976
taagGAAAaac

ID

NO:

386

SEQ
(−)
1.000
0.872
aggattttaaGGAAaaaca

ID

NO:

387

SEQ
(+)
1.000
0.897
ttcctTAAAatcctggc

ID

NO:

388

SEQ
(−)
1.000
0.880
actgagtcAACActgta

ID

NO:

389

SEQ
(−)
1.000
0.984
accactgaGTCAacactgtag

ID

NO:

390

SEQ
(+)
0.964
0.984
agtgttgaCTCAgtggttgct

ID

NO:

391

SEQ
(−)
0.826
0.904
gcaaCCACtga

ID

NO:

392

SEQ
(+)
1.000
0.883
tttaaatTTTAtgctcaaa

ID

NO:

393

SEQ
(+)
1.000
0.891
caaAAGTtgaagc

ID

NO:

394

SEQ
(+)
1.000
0.829
tgaaCCGGtaattctac

ID

NO:

395

SEQ
(−)
1.000
0.757
acaAAGTagaatt

ID

NO:

396

SEQ
(−)
0.750
0.816
aagtattTAATacaaag

ID

NO:

397

SEQ
(−)
1.000
0.939
acaagtattTAATacaa

ID

NO:

398

SEQ
(−)
1.000
0.865
taacAAGTattta

ID

NO:

399

SEQ
(+)
1.000
0.882
acttgTTATgcatcg

ID

NO:

400

SEQ
(−)
1.000
0.758
aacttgatttgttgAGCGatgcataacaa

ID

NO:

401

SEQ
(+)
0.750
0.809
ctcaaCAAAtcaagt

ID

NO:

402

SEQ
(+)
1.000
0.976
acaAATCaagtttta

ID

NO:

403

SEQ
(+)
1.000
0.830
acaaaTCAAgtttta

ID

NO:

404

SEQ
(−)
0.750
0.756
taaAACTtgattt

ID

NO:

405

SEQ
(+)
1.000
0.907
aaaTCAAgtttta

ID

NO:

406

SEQ
(+)
1.000
0.936
caaatCAAGttttaa

ID

NO:

407

SEQ
(+)
1.000
0.887
atcAAGTtttaac

ID

NO:

408

SEQ
(+)
1.000
0.883
atcaagtTTTAacacacca

ID

NO:

409

SEQ
(−)
1.000
0.925
ttaaaaAATTtaagata

ID

NO:

410

SEQ
(+)
1.000
0.780
atttTTTAaatgggcat

ID

NO:

411

SEQ
(−)
0.750
0.818
tttatgccCATTtaa

ID

NO:

412

SEQ
(+)
1.000
0.796
ctaTTCCtacagaagtc

ID

NO:

413

SEQ
(+)
1.000
0.860
ctgaaaATGCatt

ID

NO:

414

SEQ
(+)
1.000
0.898
tgCATTcctgatt

ID

NO:

415

SEQ
(−)
1.000
0.981
ataAATCaggaatgc

ID

NO:

416

SEQ
(+)
1.000
0.929
cTGATttatgtaa

ID

NO:

417

SEQ
(+)
1.000
0.964
cctGATTtatgtaaata

ID

NO:

418

SEQ
(+)
1.000
0.861
ctgatTTATgtaaat

ID

NO:

419

SEQ
(−)
1.000
0.929
tTTACataaat

ID

NO:

420

SEQ
(+)
1.000
0.943
cctgatttatGTAAatatatg

ID

NO:

421

SEQ
(+)
1.000
0.895
ttTATGtaaatatat

ID

NO:

422

SEQ
(+)
1.000
0.940
tttatgTAAAtatatgt

ID

NO:

423

SEQ
(+)
1.000
0.691
atgtaaaTATAtgtatata

ID

NO:

424

SEQ
(+)
0.849
0.883
atgtatATACata

ID

NO:

425

SEQ
(+)
0.888
0.755
gtatatacatatATAGc

ID

NO:

426

SEQ
(−)
0.891
0.903
ggctaTATAtgtatata

ID

NO:

427

SEQ
(+)
1.000
0.709
atatacaTATAtagcctta

ID

NO:

428

SEQ
(−)
1.000
0.816
ttgttttTAAGgctata

ID

NO:

429

SEQ
(+)
1.000
0.899
agcctTAAAaacaaaga

ID

NO:

430

SEQ
(+)
1.000
0.973
aaAACAaagat

ID

NO:

431

SEQ
(+)
1.000
0.863
ttaaaaaCAAAgattgt

ID

NO:

432

SEQ
(+)
1.000
0.811
aagattgtAATTttt

ID

NO:

433

SEQ
(−)
1.000
0.900
acaatttaTAAAaattacaatct

ID

NO:

434

SEQ
(−)
1.000
0.844
tttataaaAATTaca

ID

NO:

435

SEQ
(−)
1.000
0.956
atttaTAAAaattacaa

ID

NO:

436

SEQ
(+)
1.000
0.875
tgTAATttttataaatt

ID

NO:

437

SEQ
(−)
1.000
0.903
aatttaTAAAaattaca

ID

NO:

438

SEQ
(−)
1.000
0.973
tcacaatttaTAAAaattacaat

ID

NO:

439

SEQ
(−)
0.750
0.798
atcACAAtttataaaaa

ID

NO:

440

SEQ
(+)
1.000
0.927
ttttaTAAAttgtgatt

ID

NO:

441

SEQ
(−)
1.000
0.997
aaaAATCacaattta

ID

NO:

442

SEQ
(+)
1.000
0.806
gTGATttttaaaa

ID

NO:

443

SEQ
(−)
1.000
0.923
tattttttTAAAaatcacaattt

ID

NO:

444

SEQ
(+)
1.000
0.990
ttgtgattttTAAAaaaataaac

ID

NO:

445

SEQ
(+)
1.000
0.905
gtgattttTAAAaaaataaacct

ID

NO:

446

SEQ
(−)
0.755
0.796
gGTTTatttttttaaaa

ID

NO:

447

SEQ
(+)
0.750
0.775
gatttttaAAAAaataaacctgc

ID

NO:

448

SEQ
(+)
1.000
0.848
aaacctgcATTAtcttc

ID

NO:

449

SEQ
(−)
0.884
0.851
gaagaTAATgcaggt

ID

NO:

450

SEQ
(−)
1.000
0.853
tgctgaagaTAATgcaggttt

ID

NO:

451

SEQ
(−)
1.000
0.993
tgaaGATAatgca

ID

NO:

452

SEQ
(+)
0.867
0.951
TGAAtgttcct

ID

NO:

453

SEQ
(+)
1.000
0.893
cctAAGTtttgta

ID

NO:

454

SEQ
(+)
1.000
0.772
agttttgTAGAacttga

ID

NO:

455

SEQ
(−)
1.000
0.927
cgtgtCAAGttctac

ID

NO:

456

SEQ
(−)
1.000
0.997
tctgccaCGTGtcaagt

ID

NO:

457

SEQ
(+)
1.000
0.795
aggattTTAGtctacac

ID

NO:

458

SEQ
(−)
1.000
0.981
gatgCAGGtgtagac

ID

NO:

459

SEQ
(−)
1.000
0.832
ctgtcctcagatgcaGGTGta

ID

NO:

460

SEQ
(+)
1.000
0.873
ctaacaGGAAaggagac

ID

NO:

461

SEQ
(+)
1.000
0.829
ggagacaCATGtgtggtag

ID

NO:

462

SEQ
(+)
1.000
1.000
catgtgTGGTagttc

ID

NO:

463

SEQ
(+)
1.000
0.919
tgtggtagTTCCcag

ID

NO:

464

SEQ
(−)
1.000
0.841
aactgGGAActac

ID

NO:

465

SEQ
(−)
1.000
0.842
aaacTGGGaactacc

ID

NO:

466

SEQ
(−)
0.750
0.784
ttcacgtCAAAactg

ID

NO:

467

SEQ
(−)
1.000
0.830
ttcACGTcaaaac

ID

NO:

468

SEQ
(+)
1.000
0.985
cagttttGACGtgaaaagtcc

ID

NO:

469

SEQ
(+)
1.000
0.891
gttttgaCGTGaaaagt

ID

NO:

470

SEQ
(+)
1.000
0.909
ttgACGTgaaaag

ID

NO:

471

SEQ
(+)
1.000
0.890
tttgACGTgaaaagt

ID

NO:

472

SEQ
(+)
1.000
0.837
ttgacgtGAAAagtc

ID

NO:

473

SEQ
(+)
1.000
0.937
cattcttactGGAAacctc

ID

NO:

474

SEQ
(+)
0.800
0.805
ttcttacTGGAaacctc

ID

NO:

475

SEQ
(+)
0.782
0.791
acctCCCTgaatccatgccaagc

ID

NO:

476

SEQ
(−)
1.000
0.964
gctTGGCatggattcaggg

ID

NO:

477

SEQ
(+)
1.000
0.820
tccATGCcaagcact

ID

NO:

478

SEQ
(+)
1.000
0.787
gCCAAgcactacccatcaccttgac

ID

NO:

479

SEQ
(−)
1.000
0.954
cagtCAAGgtgat

ID

NO:

480

SEQ
(−)
1.000
0.800
ctTATGccagtcaag

ID

NO:

481

SEQ
(−)
1.000
0.862
agtgcTTATgccagt

ID

NO:

482

SEQ
(−)
1.000
0.920
atcaaAGGAaatgagtg

ID

NO:

483

SEQ
(−)
1.000
0.969
ggggcatCAAAggaaat

ID

NO:

484

SEQ
(−)
1.000
0.912
gaggGAGGggcat

ID

NO:

485

SEQ
(−)
0.876
0.920
tgagGGAGgggcatc

ID

NO:

486

SEQ
(+)
1.000
0.973
tattaTAAAagcacagt

ID

NO:

487

SEQ
(−)
1.000
0.700
gaaagagacgaCTGTgctt

ID

NO:

488

Number	Date	Country
60346898	Jan 2002	US
60361073	Mar 2002	US
60388315	Jun 2002	US

	Number	Date	Country
Parent	12062740	Apr 2008	US
Child	13195596		US
Parent	10339767	Jan 2003	US
Child	12062740		US

ASSAY FOR THE DETECTION OF FACTORS THAT MODULATE THE EXPRESSION OF INGAP

Information

Publication Number

Date Filed

Date Published

Inventors

Original Assignees

CPC

US Classifications

International Classifications

Abstract

Description

Claims

Parent Case Info

Provisional Applications (3)

Continuations (2)