The present invention relates to novel bio-active peptide hormone, process for the production of the same, and use of the same. The present invention identified novel bioactive peptide precursor and salts thereof which can be used as drugs, for example therapeutic polypeptides, ligands to discover relevant targets (e.g. GPCRs) or targets for drug intervention.
The present invention relates to novel bio-active peptide hormone, a process for the production of the same, and use of the same. More particularly, the present invention relates to a method for the identification of bioactive peptide hormones derived from precursor proteins that can be used as therapeutic polypeptides, targets for drug intervention, ligands to discover relevant targets (e.g. GPCR deorphaning) or biomarkers to monitor diseases.
Many biologically active peptides exhibit profound effects both in health and disease, either by growth stimulation, growth inhibition, or the regulation of critical metabolic pathways.
Peptide hormones are produced as precursors in different cell types and organs like glands, neurons, intestine, brain; etc. Peptide hormones are initially synthesized as larger precursors, or prohormones, and may acquire a number of post-translational modifications during transportation through the ER and Golgi stacks. They are processed and transported to their final destination to act as active substances (first messengers) to trigger a cellular response by binding to a cell surface receptor. They play a key role in physiological processes that are relevant to many areas of biomedical research such as Diabetes (Insulin), blood pressure regulation (Angiotensin), Anemia (Erythropoietin-α), Multiple Sclerosis (Interferon-β) and others.
The recent sequencing of the human genome has revealed around 30,000 genes, far fewer than would be predicted from the complexity of human biological processes. Alternative splicing of these genes prior to translation would likely generate up to 200,000 primary transcripts. It is now widely accepted that post translational modifications to the protein products coded by these genes represents the additional level of complexity required, to explain the diversity of function.
Gene finding typically refers to the area of computational biology that is concerned with algorithmically identifying stretches of sequence, usually genomic DNA, that are biologically functional. This especially includes protein-coding genes, but may also include other functional elements such as RNA genes and regulatory regions. Gene finding is one of the first and most important steps in understanding the genome of a species once it has been sequenced.
In extrinsic gene finding systems, the target genome is searched for sequences that are similar to extrinsic evidence in the form of the known sequence of a messenger RNA (mRNA) or protein product. Given an mRNA sequence, it is critical to derive a unique genomic DNA sequence from which it had to have been transcribed. Given a protein sequence, a family of possible coding DNA sequences can be derived by reverse translation of the genetic code. Once candidate DNA sequences have been determined, it is an algorithmic problem to efficiently search a target genome for matches, complete or partial, and exact or inexact. BLAST is a widely used system designed for this purpose.
A high degree of similarity to a known messenger RNA or protein product is strong evidence in many cases that a region of a target genome is a protein-coding gene. However, to apply this approach systemically requires extensive sequencing of mRNA and protein products. Not only is this expensive, but in complex organisms, only a subset of all genes in the organism's genome are expressed at any given time, meaning that extrinsic evidence for many genes is not readily accessible in any single cell culture. Thus, in order to collect extrinsic evidence for most or all of the genes in a complex organism, many hundreds or thousands of different cell types must be studied, which itself presents further difficulties. For example, some human genes may be expressed only during development as an embryo or fetus. Despite these difficulties, extensive transcript and protein sequence databases have been generated for human as well as other important model organisms in biology, such as mice and yeast. For example, the RefSeq database contains transcript and protein sequence from many different species and the Ensembl system comprehensively maps this evidence to human and several other genomes.
Because of the inherent expense and difficulty in obtaining extrinsic evidence for many genes, it is also necessary to resort to ab initio gene finding, in which genomic DNA sequence alone is systematically searched for certain telltale signs of protein-coding genes. These signs can be broadly categorized as either signals, specific sequences that indicate the presence of a gene nearby, or content, statistical properties of protein-coding sequence itself. Ab initio gene finding might be more accurately characterized as gene prediction, since extrinsic evidence is generally required to conclusively establish that a putative gene is functional.
Ab initio gene finding in eukaryotes, especially complex organisms like humans, is considerably more challenging for several reasons. First, the promoter and other regulatory signals in these genomes are more complex and less well-understood than in prokaryotes, making them more difficult to reliably recognize. Two classic examples of signals identified by eukaryotic gene finders are CpG islands and binding sites for a poly (A) tail.
Second, splicing mechanisms employed by eukaryotic cells mean that a particular protein-coding sequence in the genome is divided into several parts (exons), separated by non-coding sequences (introns). Splice sites are themselves another signal that eukaryotic gene finders are often designed to identify. A typical protein-coding gene in human might be divided into a dozen exons, each less than two hundred base pairs in length, and some as short as twenty to thirty. It is therefore much more difficult to detect periodicities and other known content properties of protein-coding DNA in eukaryotes.
Advanced gene finders for both prokaryotic and eukaryotic genomes typically use complex probabilistic models, such as hidden Markov Models, in order to combine information from a variety of different signal and content measurements. The Glimmer system is a widely used and highly accurate gene finder for prokaryotes. Eukaryotic ab initio gene finders, by comparison, have achieved only limited success; a notable example is the GENSCAN program.
The present invention identified novel bioactive peptide hormone precursors by means of identification of novel single exon genes that encode for peptide hormone precursor sequences. To find novel single exon genes, the human genome (NCBI 33 assembly, 1 Jul. 2003) and the mouse genome (NCBI 30 assembly, 1 Jul. 2003), both were translated into all six reading frames using the standard genetic code. Only sequence fragments starting with the amino acid Methionine and with a length between 50 and 200 amino acids were selected. The human and the mouse set were compared to each other using the program BLAST in order to find closely related sequences in both organisms. Only sequences that appear in both organisms (human and mouse) were selected. To filter for secreted proteins, potential signal sequences were predicted with the program signalP and the absence of potential membrane spanning regions was confirmed by the program TMHMM. In addition, an InterPro search was performed on the selected sequences in order to rule out the presence of readily described protein domains (e.g. kinase domain etc). The novelty of the remaining sequences was verified by sequence comparison to publicly available databases such as UNIPROT. These in silico analysis suggested the discovery of novel secreted proteins that lack any previously described protein domains.
It is generally understood, that peptide hormones are characterized by their high specificity as well as their effectiveness in very low concentrations. Another characteristic of peptide hormones is that their corresponding mRNA is expressed in a small number of distinct tissues. A ubiquitous expression pattern of peptide hormones is rarely observed in mammalian systems.
To determine the tissues transcribing the 8 novel genes in the human body, commonly used in vitro transcription assays were performed on a panel of human tissues (see
Bioactive peptides hormones have a tremendous use in biomedical research and are therefore of interest to the pharmaceutical industry. Various peptide hormones are used for treatment of diseases.
Whereas WO2004039956 (Title: “compositions and methods for treatment of immune related diseases”) disclose several bioactive polypeptide sequences, the method of identification and the use of such sequences had been made however no reference to.
The present invention identified novel genes encoding for bioactive peptide hormone precursors. Peptide hormones are characterized by their high specificity as well as their effectiveness in very low concentrations. Bioactive peptides hormones have a tremendous use in biomedical research. Various peptide hormones are used for treatment of diseases or to monitor a disease states. Such polypeptide sequences can be used in therapeutically effective amounts as medicines and medicaments for immune related diseases.
The problem as laid out by availability of the closest prior art (WO2004039956) could be defined as to identify novel hormone polypeptide sequences which are useful for the treatment of human diseases. The present invention solves that problem by providing 8 novel peptide hormone precursors and fragments thereof which could be used to interfere with physiological factors that increase the risk of arteriosclerosis, inflammation or uncontrolled cell division. According to the invention it will provide a new reservoir of hormonal polypeptides that could be used for treatment of human disease in the field of biomedical research.
The present invention therefore refers to polypeptide chain consisting of an amino acid sequence according Seq id no 1-8 or an amino acid sequence derived there from by deletion, substitution or insertion of at least one amino acid residue, its amide or ester or a salt of said peptide.
An embodiment of present invention refers to a polypeptide chain consisting of the following amino acid sequence:
Another embodiment of present invention relates to a polypeptide consisting of following amino acid sequence:
Another embodiment of present invention relates to a polypeptide consisting of the following amino acid sequence:
Another embodiment of present invention relates to a polypeptide consisting of the following amino acid sequence:
Another embodiment of present invention relates to a polypeptide consisting of the following amino acid sequence:
Another embodiment of present invention relates to a polypeptide consisting of the following amino acid sequence:
Another embodiment of present invention relates to a polypeptide consisting of the following amino acid sequence:
Another embodiment of present invention relates to a polypeptide consisting of the following amino acid sequence:
An embodiment of the invention also provides a DNA comprising of a nucleotide base sequence according to Seq Id no 9 to 16 encoding a polypeptide chain comprising an amino acid sequences represented by Seq Id no 1-8 or its amide, ester or a salts thereof.
This invention relates to a DNA that consist of a nucleotide base sequence according to Seq Id no 9, encoding a polypeptide chain that consist of an amino acid sequence according to Seq Id no 1 or its amide, ester or a salts thereof.
The Present invention relates to a DNA that consists of a nucleotide base sequence according to Seq Id no 10, encoding polypeptide chain described in paragraph 1 that consists of an amino acid sequence according to Seq Id no 2 or its amide, ester or a salts thereof.
Another embodiment of present invention relates to a DNA that consist of a nucleotide base sequence according to Seq Id no 11, encoding polypeptide chain that consist of an amino acid sequence according to Seq Id no 3 or its amide, ester or a salts thereof.
Another embodiment of present invention relates to a DNA that consist of a nucleotide base sequence according to Seq Id no 12, encoding polypeptide chain that consist of an amino acid sequence according to Seq id no 4 or its amide, ester or a salts thereof.
Another embodiment of present invention relates to a DNA that consist of a nucleotide base sequence according to Seq Id no 13, encoding polypeptide chain that consist of an amino acid sequence according to Seq Id no 5 or its amide, ester or a salts thereof.
Another embodiment of present invention relates to a DNA that consist of a nucleotide base sequence according to Seq Id no 14, encoding polypeptide chain that consist of an amino acid sequence according to Seq Id no 6 or its amide, ester or a salts thereof.
Another embodiment of present invention relates to a DNA that consist of a nucleotide base sequence according to Seq Id no 15, encoding polypeptide chain that consist of an amino acid sequence according to Seq Id no 7 or its amide, ester or a salts thereof.
Another embodiment of present invention relates to a DNA that consist of a nucleotide base sequence according to Seq Id no 16, encoding polypeptide chain that consist of an amino acid sequence according to Seq Id no 8 or its amide, ester or a salts thereof.
Present invention refers to a method for manufacturing of a polypeptide wherein the said method comprises the step of providing the amino acid, synthesizing the amino acid by solid phase or liquid phase synthesis, extraction of polypeptide, purification of the polypeptide
Present invention also provides a method of producing the peptide, precursor or salt thereof which comprises subjecting an amino terminus-constituting amino acid or peptide and a carboxyl terminus-constituting amino acid or peptide to condensation, optionally followed by intramolecular disulfide bond formation.
An embodiment of the invention provides a pharmaceutical composition comprising as the active agent a polypeptide chain or precursor, or pharmaceutically acceptable amide, ester, or a salt thereof wherein said polypeptide chain consists of an amino acid sequence according Seq Id no 1-8.
Present invention also refers to the use of a pharmaceutical composition, comprising as the active agent a polypeptide chain or precursor, or pharmaceutically acceptable amide, ester, or a salt thereof wherein said polypeptide chain consists of an amino acid sequence according Seq Id no 1-8, and can be used as therapeutic polypeptides, targets for drug intervention, ligands to discover relevant targets, biomarkers to monitor diseases.
The invention also provides the use of a peptide, precursor or salt of the invention in the production of an agent for the treatment or prevention of cardio vascular disease, hormone-producing tumors, a hormone secretion inhibitor, a tumor growth inhibitor, which comprises at least one of the amino acid sequence defined under Seq. Id 1 to 8.
An embodiment of the invention also provides an antibody to a polypeptide chain according to paragraph 1 that comprises an amino acid sequences according to Seq Id no 1-8, or its amide, ester or salts thereof. An antibody of the invention can be used also for detecting an inventive polypeptide present in a sample such as a body fluid or a tissue. It may be used also to produce an antibody column for purifying an inventive polypeptide, to detect an inventive polypeptide in each fraction during purification, or to analyze the behavior of an inventive polypeptide in a test cell.
The term “polypeptide” as used herein shall be taken to refer to any polymer consisting of amino acids linked by covalent bonds and this term includes within its scope parts or fragments of full length proteins, such as, for example, peptides, oligopeptides and shorter peptide sequences consisting of at least 2 amino acids, more particularly at least about 5 amino acid residues or more.
The term “polypeptide” includes all moieties containing one or more amino acids linked by a peptide bond. In addition, this term includes within its ambit polymers of modified amino acids, including amino acids which have been post-translationally modified, for example by chemical modification including but not restricted to glycosylation, phosphorylation, acetylation and/or sulphation reactions that effectively alter the basic peptide backbone. Accordingly, a polypeptide may be derived from a naturally-occurring protein, and in particular may be derived from a full-length protein by chemical or enzymatic cleavage, using reagents such as CNBr, or proteases such as trypsin or chymotrypsin, amongst others. Alternatively, such polypeptides may be derived by chemical synthesis using well known peptide synthetic methods. Also included within the scope of the definition of a “polypeptide” are amino acid sequence variants (referred to herein as polypeptide variants). These may contain one or more preferably conservative, amino acid substitutions, deletions, or insertions, in a naturally-occurring amino acid sequence which do not alter at least one essential property of said polypeptide, such as, for example, its biological activity. Such polypeptides may be synthesised by chemical polypeptide synthesis. Conservative amino acid substitutions are well-known in the art. For example, one or more amino acid residues of a native protein can be substituted conservatively with an amino acid residue of similar charge, size or polarity, with the resulting polypeptide retaining functional ability as described herein. Rules for making such substitutions are well known. More specifically, conservative amino acid substitutions are those that generally take place within a family of amino acids that are related in their side chains. Genetically-encoded amino acids are generally divided into four groups: (1) acidic=aspartate, glutamate; (2) basic=lysine, arginine, and histidine; (3) non-polar=alanine, valine, leucine, isoleucine, proline, phenylalanine, methionine, and tryptophan; and (4) unchargedpolar=glycine, asparagine, glutamine, cysteine, serine, threonin, and tyrosine. Phenylalanine, tyrosine and tryptophan are also jointly classified as aromatic amino acids. One or more replacements within any particular group such as, for example, the substitution of leucine for isoleucine or valine are alternatively, the substitution of aspartate for glutamate or threonin for serine, or of any other amino acid residue with a structurally-related amino acid residue will generally have an insignificant effect on the function of the resulting polypeptide.
Included within the scope of the definition of a “polypeptide” are amino acid sequence variants that have undergone unnatural modifications such as but not limited to protection, carboxylation, and derivatization by amide and non-amide bonds as well as covalent and non-covalent modification.
Included in the scope of the definition of the term “polypeptide” is a peptide whose biological activity is predictable as a result of its amino acid sequence corresponding to a functional domain. Also encompassed by the term “polypeptide” is a peptide whose biological activity could not have been predicted by the analysis of its amino acid sequence.
An amino acid is any molecule that contains both amine and carboxylic acid functional groups. Amino acid residue is what is left of an amino acid once a molecule of water has been lost (an H+ from the nitrogenous side and an OH− from the carboxylic side) in the formation of a peptide bond, the chemical bond that links the amino acid monomers in a protein chain. Each protein has its own unique amino acid sequence that is known as its primary structure. Just as the letters of the alphabet can be combined in different ways to form an endless variety of words, amino acids can be linked together in varying sequences to form a huge variety of proteins. The unique shape of each protein determines its function in the body.
A precursor is a substance from which another, usually more active or mature substance is formed. A protein precursor is an inactive protein (or peptide) that can be turned into an active form by posttranslational modification. The name of the precursor for a protein is often prefixed by pro or prepro. Precursors are often used by an organism when the subsequent protein is potentially harmful, but needs to be available on short notice and/or in large quantities.
The Polypeptide, precursors or salts thereof, of the present invention have hormonal activity. Therefore, the polypeptides, precursors and salts of the invention are useful as drugs, for example therapeutic polypeptide, ligand to discover relevant targets (e.g. GPCRs), targets for drug intervention (eg targets for monoclonal antibodies or therelike, receptor fragments), biomarkers to monitor diseases (in combination with tool antibodies to detect peptide fragments in body fluids), protein kinase inhibitors and substrates, T-cell epitopes, Peptide mimotopes of receptor binding sites, Biomarker to measure the expression level.
The DNAs coding for the peptide or precursor of the invention are useful, for example, as agents for the gene therapy or treatment or prevention of cardio vascular disease, hormone-producing tumors, diabetes, gastric ulcer and the like, hormone secretion inhibitors, tumor growth inhibitors, neural activity and so for. Furthermore, the DNAs of the invention are useful as agents for the gene diagnosis of diseases such as cardio vascular disease, hormone-producing tumours, diabetes, gastric ulcer and the like.
A vector is a vehicle for delivering genetic material such as DNA to a cell. DNA by itself may be regarded as a vector, for example in particular when it is used for cell transformation. A vector in this sense is a DNA construct, such as a plasmid or a bacterial artificial chromosome that contains an origin of replication. An appropriate replication origin causes a cell to copy the construct along with the cell's chromosomes and to pass it along to its progeny. A single cell that has been transformed with a vector will grow into an entire culture of cells, which all contain the vector, as well as any gene attached to it within the construct. Because the constructs can be extracted from the cells by purification techniques, transformation with a vector is a way of making a small number of DNA molecules in to a much larger one. A vector can be a E. coli-derived plasmid (e.g., pBR322, pBR325, pUC12, pUC13), a Bacillus subtilis-derived plasmid (e.g., pUB110, pTP5, pC194), a yeast-derived plasmid (e.g., pSH19, pSH15), a bacteriophage such as [lambda] phage, as well as an animal virus such as retrovirus, vaccinia virus, vaculovirus and the like.
The antibodies against the peptide, precursor or salt of the invention can specifically recognize the peptide, precursor or salt of the invention. It may be used also to produce an antibody column for purifying an inventive polypeptide, to detect an inventive polypeptide in each fraction during purification, or to analyze the behavior of a novel polypeptide in a test cell. Hence it can be used for assaying the peptide or equivalent of the invention in test solutions.
Objective: This program was used to detect potential signal sequences, it was used with a cut off score of 0.98. Signal P version 2.0 predicts the presence and location of signal peptide cleavage sites in amino acid sequences from different organisms: The method incorporates a prediction of cleavage sites and a signal peptide/non-signal peptide prediction based on a combination of several artificial neural networks and hidden Markov models.
Objective: This program was used to define possible membrane spanning regions in protein sequences. TMHMM version 2.0 is used for prediction of transmembrane helices in proteins. Predicted TM segments in the n-terminal region sometime turn out to be signal peptides.
Objective: This program was used to detect potential cleavage sites in protein sequences. It was used with a score of 0.09. This program predicts arginine and lysine propeptide cleavage sites in eukaryotic protein sequences using an ensemble of neural networks. Furin-specific prediction is the default. It is also possible to perform a general proprotein convertase (PC) prediction. This program is integrated with the SignalP program predicting the presence and location of signal peptide cleavage sites.
1.4 InterPro Version 12 in Combination with InterProScan
InterPro is a database of protein families, domains and functional sites in which identifiable features found in known proteins can be applied to unknown protein sequences. 1.5 InterProScan is the program used to compare amino acid sequences to the InterPro database.
The Basic Local Alignment Search Tool (BLAST) finds regions of local similarity between sequences. The program compares nucleotide or protein sequences to sequence databases and calculates the statistical significance of matches. BLAST can be used to infer functional and evolutionary relationships between sequences as well as help identify members of gene families. BLAST uses Karlin-Altschul Statistics to determine the statistical significance of the alignments it produces. The basic algorithm can be implemented in a number of ways and applied in a variety of contexts including straightforward DNA and protein sequence database searches, motif searches, gene identification searches, and in the analysis of multiple regions of similarity in long DNA sequences. In addition to its flexibility and tractability to mathematical analysis, BLAST is an order of magnitude faster than existing sequence comparison tools of comparable sensitivity.
All these programs as mentioned before are accessible via the public domain internet
The biological role of the hormone peptides of the invention have been examined by means of expression profile studies (
The tissue expression analysis was performed by means of TaqMan gene expression analyses
PCR (Polymerase Chain Reaction) is a standard technique in medical and biological research labs for a variety of tasks, such as the detection of hereditary diseases, the identification of genetic fingerprints, the diagnosis of infectious diseases, the cloning of genes, paternity testing, and DNA computing.
Quantitative PCR is used to rapidly measure the quantity of PCR product (preferably real-time), thus is an indirect method for quantitatively measuring starting amounts of DNA, cDNA or RNA. This is commonly used for the purpose of determining whether a sequence is present or not, and if it is present the number of copies in the sample. Description of protocol used for gene expression studies in present invention:
RNA samples from the tissues tested were purchased from different vendors.
All products were purchased from Ambion. DNA free DNase treatment and removal reagents (1906). DNase digest was performed using 50 μg RNA and according to the manufacturer's protocol.
cDNA Synthesis:
Products for cDNA were purchased from Applera Reverse Transcriptase Kit (N8080234), RNase Inhibitor (N8080119). cDNA Synthesis from 10 μg RNA was performed.
Products for the PCR were purchased from Applera. TaqMan Universal PCR Master Mix (4305719) was used in each reaction. Target forward Primer, Target reverse Primer, Target probe labeled with FAM for each gene, respectively (see Table with primer sequences). PCR was performed using the ABI Prism 7900 (Applera) under the following PCR conditions: 2 minutes at 50° C., 10 minutes at 95° C., 40 cycles with 95° C. for 15 s and 1 minute at 60° C. PCR was set up as a Multiplex PCR using B2M as endogenous control for normalisation.
2.3) The following expression profiles could be gained for a gene encoding a peptide sequence according to Seq ID Nr 1 (
2.4) The following expression profiles could be gained for a gene encoding a peptide sequence according to Seq ID Nr 2 (
2.5) The following expression profiles could be gained for a gene encoding a peptide sequence according to Seq ID Nr 3 (
2.6) The following expression profiles could be gained for a gene encoding a peptide sequence according to Seq ID Nr 4 (
2.7) The following expression profiles could be gained for a gene encoding a peptide sequence according to Seq ID Nr 5 (
2.8) The following expression profiles could be gained for a gene encoding a peptide sequence according to Seq ID Nr 6 (
Number | Date | Country | Kind |
---|---|---|---|
10 2006 059 825.3 | Dec 2006 | DE | national |
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/EP2007/010842 | 12/12/2007 | WO | 00 | 2/24/2010 |