The present invention relates to novel marker sequences for systemic lupus erythematosus and to the use thereof in diagnosis, together with a method for screening potential active ingredients for systemic lupus erythematosus diseases by way of these marker sequences. The invention further relates to a diagnostic device comprising such marker sequences for systemic lupus erythematosus, in particular a protein biochip, and to the use thereof.
Protein biochips are gaining increasing industrial importance for analytical and diagnostic purposes as well as in pharmaceutical development. Protein biochips have also become established as screening tools.
To this end, the fast and highly parallel detection of a plurality of specifically binding analysis molecules during a single experiment is made possible. Producing protein biochips requires the availability of the necessary proteins. For this purpose, in particular protein expression libraries have become established. High throughput cloning of defined open reading frames is one option (Heyman, J. A., Cornthwaite, J., Foncerrada, L., Gilmore, J. R., Gontang, E., Hartman, K. J., Hernandez, C. L., Hood, R., Hull, H. M., Lee, W. Y., Marcil, R., Marsh, E. J., Mudd, K. M., Patino, M. J., Purcell, T. J., Rowland, J. J., Sindici, M. L. and Hoeffler, J. P. (1999) Genome-scale cloning and expression of individual open reading frames using topoisomerase mediated ligation. Genome Res, 9, 383-392; Kersten, B., Feilner, T., Kramer, A., Wehrmeyer, S., Possling, A., Witt, I., Zanor, M. I., Stracke, R., Lueking, A., Kreutzberger, J., Lehrach, H. and Cahill, D. J. (2003) Generation of Arabidopsis protein chip for antibody and serum screening. Plant Molecular Biology, 52, 999-1010; Reboul, J., Vaglio, P., Rual, J. F., Lamesch, P., Martinez, M., Armstrong, C. M., Li, S., Jacotot, L., Bertin, N., Janky, R., Moore, T., Hudson, J. R., Jr., Hartley, J. L., Brasch, M. A., Vandenhaute, J., Boulton, S., Endress, G. A., Jenna, S., Chevet, E., Papasotiropoulos, V., Tolias, P. P., Ptacek, J., Snyder, M., Huang, R., Chance, M. R., Lee, H., Doucette-Stamm, L., Hill, D. E. and Vidal, M. (2003) C. elegans ORFeome version 1.1: experimental verification of the genome annotation and resource for proteome-scale protein expression. Nat Genet, 34, 35-41.; Walhout, A. J., Temple, G. F., Brasch, M. A., Hartley, J. L., Lorson, M. A., van den Heuvel, S, and Vidal, M. (2000) GATEWAY recombinational cloning: application to the cloning of large numbers of open reading frames or ORFeomes. Methods Enzymol, 328, 575-592). However, such an approach is highly dependent on the progress of genome sequencing projects and the annotation of these gene sequences. Moreover, the determination of the expressed sequence can be ambiguous due to differential splicing processes. This problem can be circumvented by the use of cDNA expression libraries (Büssow, K., Cahill, D., Nietfeld, W., Bancroft, D., Scherzinger, E., Lehrach, H. and Walter, G. (1998) A method for global protein expression and antibody screening on high-density filters of an arrayed cDNA library. Nucleic Acids Research, 26, 5007-5008; Büssow, Nordhoff, E., Lübbert, C., Lehrach, H. and Walter, G. (2000) A human cDNA library for high-throughput protein expression screening. Genomics, 65, 1-8; Holz, C., Lueking, A., Bovekamp, L., Gutjahr, C., Bolotina, N., Lehrach, H. and Cahill, D. J. (2001) A human cDNA expression library in yeast enriched for open reading frames. Genome Res, 11, 1730-1735; Lueking, A., Holz, C., Gotthold, C., Lehrach, H. and Cahill, D. (2000) A system for dual protein expression in Pichia pastoris and Escherichia coli, Protein Expr, Purif., 20, 372-378). To this end, the cDNA of a particular tissue is cloned into a bacterial or eukaryotic expression vector, such as yeast. The vectors used for expression are generally characterized in that these carry inducible promoters, by way of which the time of protein expression can be controlled. In addition, expression vectors comprise sequences for so-called affinity epitopes or affinity proteins, which permit the specific detection of recombinant fusion proteins by way of an antibody that is directed against the affinity epitope and additionally enable specific purification by way of affinity chromatography (IMAC).
For example, the gene products of a cDNA expression library from human fetal brain tissue in the bacterial expression system Escherichia coli were arranged in a high-density format on a membrane and able to be successfully screened with various antibodies. It was shown that the proportion of full-length proteins was at least 66%. It was further possible to express the recombinant proteins from expression libraries in high throughput and purify them (Braun P., Hu, Y., Shen, B., Halleck, A., Koundinya, M., Harlow, E. and LaBaer, J. (2002) Proteome-scale purification of human proteins from bacteria. Proc Natl Acad Sci USA, 99, 2654-2659; Buessow (2000) supra; Lueking, A., Horn, M., Eickhoff, H., Buessow, K., Lehrach, H. and Walter, G. (1999) Protein microarrays for gene expression and antibody screening. Analytical Biochemistry, 270, 103-111). Such cDNA expression library-based protein biochips are the subject matter in particular of WO 99/57311 and WO 99/57312. In addition to antigen-presenting protein biochips, antibody-presenting arrangements are described (Lal et al (2002) Antibody arrays: An embryonic but rapidly growing technology, DDT, 7, 143-149; Kusnezow et al. (2003), Antibody microarrays: An evaluation of production parameters, Proteomics, 3, 254-264).
However, there is a high need to make indication-specific diagnostic devices, such as a protein biochip, available.
The object of the present invention is to provide improved marker sequences and the diagnostic use thereof for treating systemic lupus erythematosus.
The provision of specific marker sequences allows a reliable diagnosis and stratification of patients with systemic lupus erythematosus, in particular by way of a protein biochip.
The invention therefore relates to the use of marker sequences for diagnosing systemic lupus erythematosus, wherein at least one marker sequence of a cDNA selected from the group SEQ 1-716, or a respective protein coding therefor, or a respective partial sequence or fragment thereof (hereinafter: marker sequences according to the invention) is determined on or from a patient to be examined.
It was possible to identify the marker sequences according to the invention by way of differential screening of samples, specifically from healthy participants, with samples from patients with systemic lupus erythematosus.
For the first time, these marker sequences according to the invention were identified by way of protein chips (see examples).
The term “systemic lupus erythematosus (SLE)” relates a systemtic autoimmune disease from the collagenosis group. A particular characteristic of SLE (systemic lupus erythematosus) is the so-called butterfly rash.
Criteria for diagnosis are:
1. Butterfly rash
2. Discoid skin changes
3. Sensitivity to light
4. Mucous membrane ulcers (generally painless)
5. Arthritis in at least two joints
6. Serositis (pleurisy or pericarditis)
7. Kidney involvement (proteinuria <0.5 g/d or cylinder)
8. CNS involvement (seizures or psychosis)
9. Hematological findings (hemolytic anemia, leucopenia or thrombopenia)
10. Immunological findings: anti-dsDNA antibodies, anti-Sm antibodies, anticardiolipin antibodies;
11. Antinuclear antibodies without taking lupus erythematosus-triggering medication;
Evaluation: With four (three) positive findings, the diagnosis is considered reliable (likely) (definition e.g. according to Pschyrembel, de Gruyter, 261st edition (2007), Berlin).
It is essential for the invention that the samples are not taken from conventional blood banks, but were carefully selected from SLE patients who are HIV and HCV negative, for example, and were tested in particular for infectious diseases. The complex sample selection procedure allows, for example, sufficient advantageous differentiation from diseases with symptoms similar to SME.False positive results are thus excluded, also because of the strict bioinformational evaluation (see examples).
The protein biochips are additionally produced by normalizing at least 1,000, preferably 2,000 different, or more, autoantigens of humans, which are not indication-specific of systemic lupus erythematosus. For example, such autoantigens can be obtained from other bodily fluids of patients with other illnesses such as pancreatic cancer, rheumatoid arthritis, prostate and the like.
The invention therefore also relates to such indication-specific protein biochips according to the invention for diagnosing systemic lupus erythematosus, wherein in a further verification step, the proteins represented on the protein biochip are normalized with autoantibodies from non-systemic lupus erythematosus patients and false positive proteins can be eliminated in this way.
Hereafter, a protein biochip thus produced is referred to as a “normalized protein biochip”. Such a normalized protein biochip has been found to be particularly advantageous for identifying specific marker sequences according to the invention because other similar autoimmune-relevant marker sequences can be excluded. This is contrary to the teaching of WO2003/090694.
This procedure likewise allows autoantibodies with a positive response to E. coli to be excluded. This is a further qualitative improvement, for example because autoantibodies that are directed to E. coli enterobacteria in humans can be excluded. As a result, new marker sequences can advantageously be identified with an improved signal-to-noise ratio.
In a further preferred embodiment, at least 2 to 5 or 10, preferably 30 to 50 marker sequences, or 50 to 100 or more marker sequences are thus determined on or from a patient to be examined.
In a further embodiment of the invention, the marker sequences according to the invention can also be combined, supplemented or expanded with known biomarkers for this indication.
In a preferred embodiment, the marker sequences are determined outside the human body and the determination is carried out in an ex vivo/in vitro diagnosis.
In a further embodiment of the invention, the invention relates to the use of marker sequences as diagnostic products, wherein at least one marker sequence of a cDNA is selected from the group SEQ 1-716, or a respective protein coding therefor, or a respective partial sequence or fragment thereof.
The invention further relates to a method for diagnosing systemic lupus erythematosus, wherein a.) at least one marker sequence of a cDNA selected from the group SEQ 1-716, or a protein coding therefor, or a respective partial sequence or fragment thereof, is applied to a solid support, and b.) brought in contact with body fluid or tissue extract and c.) the detection of an interaction of the body fluid or tissue extract with the marker sequences from a.) is carried out.
The invention therefore also relates to diagnostic products for diagnosing systemic lupus erythematosus, each selected from the group SEQ 1-716, or a respective protein coding therefor, or a respective partial sequence or fragment thereof.
Such an interaction can be detected by a probe, in particular by an antibody, for example.
The invention therefore also relates to the problem of providing a diagnostic device or an array, in particular a protein biochip, which allows a diagnosis of or examination for systemic lupus erythematosus.
The invention further relates to a method for the stratification, in particular for the risk stratification, and/or treatment management of a patient with systemic lupus erythematosus, wherein at least one marker sequence of a cDNA selected from the group SEQ 1-716, or a respective protein coding therefor, is determined on a patient to be examined.
The invention further encompasses the stratification of patients with systemic lupus erythematosus into new or established sub-groups of systemic lupus erythematosus as well as the expedient selection of patient groups for the clinical development of new therapeutic agents. The term treatment management also includes dividing the patients into responders and non-responders with respect to a treatment or the treatment course thereof.
The term “diagnosis” within the meaning of the present invention denotes the positive identification of systemic lupus erythematosus by way of the marker sequences according to the invention and the association of patients with the systemic lupus erythematosus disease. The term ‘diagnosis’ comprises medical diagnostics and examinations in this regard, in particular in vitro diagnostics and laboratory diagnostics, as well as proteomics and nucleic acid blotting. Additional examinations may be required for validation and to exclude other illnesses. The term ‘diagnosis’ therefore likewise encompasses the differential diagnosis of systemic lupus erythematosus by way of the marker sequences according to the invention and the prognosis of systemic lupus erythematosus.
“Stratifying (also: stratification) or treatment management” within the meaning of the present invention shall mean that the method according to the invention allows decisions regarding the treatment and therapy of the patient, be it hospitalization of the patent, use, effect and/or dosage of one or more pharmaceuticals, a therapeutic measure or monitoring the progression of an illness or treatment, or etiology or classification of a disease, for example into a new or existing sub-type, or the differentiation of illnesses and the patients thereof.
In a further embodiment of the invention, the term “stratification” comprises in particular risk stratification with the prognosis of an outcome of a disadvantageous health event.
Within the scope of the present invention, the term “patient” is considered to mean any participant—human or mammal—with the proviso that the participant is examined for systemic lupus erythematosus.
The term “marker sequences” within the meaning of the present invention shall mean that the cDNA, or the respective polypeptide or protein obtainable therefrom, is significant for systemic lupus erythematosus. For example, the cDNA, or the respective polypeptide or protein obtainable therefrom, can exhibit an interaction with substances from the body fluid or tissue extract of a patient with systemic lupus erythematosus (for example antigen (epitope)/antibody (paratope) interaction). Within the meaning of the invention, “wherein at least one marker sequence of a cDNA selected from the group SEQ 1-716, or a respective protein coding therefor, or a respective partial sequence or fragment thereof, is determined on or from a patient to be examined” shall mean that an interaction between the body fluid or tissue extract of a patient and the marker sequences according to the invention is detected. Such an interaction includes, for example, a bond, in particular a binding substance on at least one marker sequence according to the invention or, in the case of cDNA, hybridization with a suitable substance under select conditions, in particular stringent conditions (for example as customarily defined in J. Sambrook, E. F. Fritsch, T. Maniatis (1989), Molecular cloning: A laboratory manual, 2nd Edition, Cold Spring Habor Laboratory Press, Cold Spring Habor, USA or Ausubel, “Current Protocols in Molecular Biology”, Green Publishing Associates and Wiley Interscience, N.Y. (1989)). One example of stringent hybridization conditions is: hybridization in 4×SSC at 65° C. (alternatively in 50% formamide and 4×SSC at 42° C.), followed by several washing steps in 0.1×SSC at 65° C. for a total of approximately one hour. One example of less stringent hybridization conditions is hybridization in 4s SCC at 37° C., followed by several washing steps in 1×SCC at room temperature.
According to the invention, such substances are constituents of a body fluid, in particular blood, whole blood, blood plasma, serum, patient serum, urine, cerebrospinal fluid, synovial fluid or a tissue extract of the patient.
In a further embodiment of the invention, however, the marker sequences according to the invention can be present in a significantly higher or lower expression rate or concentration, indicating systemic lupus erythematosus. To this end, the relative sick/healthy expression rates of the marker sequences for systemic lupus erythematosus according to the invention can be determined by way of proteomics or nucleic acid blotting.
In a further embodiment of the invention, the marker sequences comprise a detection signal that is addressed to the substance to be bound (for example antibody, nucleic acid). According to the invention, the detection signal is preferably an epitope and/or paratope and/or haptene for a protein, and it is a hybridization region or binding region for a cDNA.
The marker sequences according to the invention are likewise listed in Table A and can be unambiguously identified by the respective cited database entry (also via the Internet: http://www.ncbi.nlm.nih.gov/) (see in Table A: Accession No. there), see also the associated sequence protocol.
The invention thus likewise relates to full-length sequences of the markers according to the invention, more particularly as defined in Table 1 by way of the known database entry, hereafter referred to as SEQ 1a-716a (cDNA).
The invention therefore also comprises embodiments of SEQ 1a-716a that are analogous to the marker sequences SEQ 1-716, as described in the claims for example, because the SEQ 1-716 according to the invention again represent partial sequences, at least with high homology. However, the specific marker sequences SEQ 1-716 are preferred according to the invention.
According to the invention, the marker sequences also comprise modifications of the cDNA sequence, and of the corresponding amino acid sequence, such as chemical modification, for example citrullination, acetylation, phosphorylation, glycosylation or polyA tail and other relevant modifications known to a person skilled in the art.
Another embodiment of the invention also encompasses partial sequences or fragments of the marker sequences according to the invention. These are in particular partial sequences that are 95%, 90%, notably 80% or 70% identical to the marker sequences according to the invention.
Partial sequences also include sequences that comprise 50 to 100 nucleotides, 70 to 120 nucleotides of a sequence of SEQ 1-716, or peptides obtainable therefrom.
“Partial sequences or fragments” of the marker sequences according to the invention are functionally defined and comprise sequences that have the same diagnostic function according to the invention.
In a further embodiment, the respective marker sequence may be represented in differing quantities in one or more regions on a solid support. This allows the sensitivity to be varied. The regions can each comprise a collectivity of marker sequences, which is to say a sufficient number of different marker sequences, in particular 2 to 5, or 10 or more, and optionally additional nucleic acids and/or proteins, in particular biomarkers. However, at least 96 to 25,000 (numerical) or more different or identical marker sequences and additional nucleic acids and/or proteins, in particular biomarkers, are preferred. Further preferred are more than 2,500, particularly preferred are 10,000 or more different or identical marker sequences and optionally additional nucleic acids and/or proteins, in particular biomarkers.
Another object of the invention is an arrangement of marker sequences comprising at least one marker sequence of a cDNA selected from the group SEQ 1-716, or a respective protein coding therefor. The arrangement preferably comprises at least 2 to 5 or 10, preferably 30 to 50 marker sequences, or 50 to 100 or more marker sequences.
Within the scope of the present invention, “arrangement” shall be synonymous with “array”, and provided that this “array” is used to identify substances on marker sequences, this shall be understood to mean an “assay” or a diagnostic device. In a preferred embodiment, the arrangement is designed so that the marker sequences represented on the arrangement are present in the form of a grid on a solid support. Moreover, arrangements that allow a high-density arrangement of protein binders and where the marker sequences are spotted are preferred. Such high-density spotted arrangements are disclosed in WO 99/57311 and WO 99/57312, for example, and can advantageously be employed in a robot-assisted automated high-throughput method.
However, within the scope of the present invention the term “assay” or diagnostic device also comprises embodiments of a device, such as ELISA, bead-based assay, line assay, western blot, immunochromatographic methods (for example so-called lateral flow immunoassays) or similar immunological single or multiplex detection methods. A protein biochip within the meaning of the present invention is a systematic arrangement of proteins on a solid support.
The marker sequences of the arrangement are fixed on a solid support, however preferably they are spotted or immobilized, even printed on, which is to say they are applied reproducibly. One or more marker sequences can be present multiple times in the collectivity of all marker sequences and be present in differing quantities relative to a spot. Furthermore, the marker sequences can be standardized on the solid support (for example by way of human globulin serial dilution series as internal calibrators for data normalization and quantitative evaluation).
As a result, the invention also relates to an assay or a protein biochip comprising an arrangement containing marker sequences according to the invention.
In a further embodiment, the marker sequences are present in the form of clones. For example, such clones can be obtained by way of a cDNA expression library according to the invention (Büssow et al. 1998 (supra)). In a preferred embodiment, such expression libraries containing clones are obtained by way of expression vectors from an expressing cDNA library comprising the cDNA marker sequences. These expression vectors preferably comprise inducible promoters. The expression can, for example, be induced by way of an inducer such as IPTG. Suitable expression vectors are described in Terpe et al. (Terpe T Appl Microbiol Biotechnol. 2003 January; 60(5):523-33).
Expression libraries are known to a person skilled in the art and can be produced according to standard reference books such as Sambrook et al, “Molecular Cloning, A laboratory handbook, 2nd edition (1989), CSH press, Cold Spring Harbor, N.Y. Also preferred are expression libraries that are tissue-specific (for example human tissue, in particular human organs). According to the invention, expression libraries that can be obtained by way of exon trapping are also covered. The term ‘expression bank’ can be employed synonymously for the term expression library.
Also preferred are protein biochips or corresponding expression libraries that have no redundancy (so-called: Uniclone® library) and that can be produced according to the teachings of WO 99/57311 and WO 99/57312, for example. These preferred Uniclone libraries have a high proportion of non-defective fully expressed proteins of a cDNA expression library.
Within the scope of the present invention, the clones can also be, but are not limited to, transformed bacteria, recombinant phages or transformed cells from mammals, insects, fungi, yeast or plants.
The clones are fixed, spotted or immobilized on a solid support.
The invention thus relates to an arrangement, wherein the marker sequences are present in the form of clones.
The marker sequences can also be present in the form of a fusion protein comprising at least one affinity epitope or “tag”, for example. The tag may be one such as c-myc, his tag, arg tag, FLAG, alkaline phosphatase, V5 tag, T7 tag or strep tag, HAT tag, NusA, S tag, SBP tag, thioredoxin, DsbA, including a fusion protein, preferably a cellulose-binding domain, green fluorescent protein, maltose-binding protein, calmodulin-binding protein, glutathione S-transferase or lacZ.
In all embodiments, the term “solid support” encompasses designs such as a filter, a membrane, a magnetic or fluorophore-labeled bead, a silicon wafer, glass, metal, plastic material, a chip, a mass spectrometry target or a matrix. However, a filter is preferred according to the invention.
Moreover, PVDF, nitrocellulose or nylon are preferred filters (for example Immobilon P Millipore, Protran Whatman, Hybond N+Amersham).
In a further preferred embodiment of the arrangement according to the invention, this arrangement corresponds to a grid having the size of a microtiter plate (8-12 wells, 96 wells, 384 wells or more), a silicon wafer, a chip, a mass spectrometry target or a matrix.
In a further embodiment, the invention relates to an assay or protein biochip for identifying and characterizing a substance for systemic lupus erythematosus, characterized in that an arrangement or assay according to the invention a.) is brought in contact with at least one substance to be analyzed and b.) successful binding is detected.
The invention further relates to a method for identifying and characterizing a substance for systemic lupus erythematosus, characterized in that an arrangement or assay according to the invention a.) is brought in contact with at least one substance to be analyzed and b.) successful binding is detected.
The substance to be analyzed can be any arbitrary native or non-native biomolecule, a synthetic chemical molecule, a mixture, or a substance library.
After the substance to be analyzed has come in contact with a marker sequence, the successful binding process is evaluated, which takes place, for example, using commercially available image analysis software (GenePix Pro (Axon Laboratores), Aida (Raytest), ScanArray (Packard Bioscience)).
The protein-protein interactions according to the invention (for example protein on marker sequence, such as antigen/antibody) or corresponding “means for detecting successful binding” can be visualized, for example, in the customary manner by way of fluorescent labeling, biotinylation, radioisotope labeling or colloidal gold or latex particle labeling. Bound antibodies are detected with the aid of secondary antibodies labeled with commercially available reporter molecules (for example Cy, Alexa, Dyomics, FITC or similar fluorescent dyes, colloidal gold or latex particles), or with reporter enzymes, such as alkaline phosphatase, horseradish peroxidase, or the like, and the corresponding colorimetric, fluorescent or chemiluminescent substrates. Readout is carried out, for example, by way of a microarray laser scanner, a CCD camera or visually.
In a further embodiment, the invention relates to a pharmaceutical/active ingredient or prodrug developed for systemic lupus erythematosus and obtainable through the use of the assay or protein biochip according to the invention.
The invention therefore likewise relates to the use of an arrangement according to the invention or an assay for screening active ingredients for systemic lupus erythematosus.
In a further embodiment, the invention therefore likewise relates to a target for the treatment and therapy of systemic lupus erythematosus, selected in each case from the group SEQ 1-716 or a protein coding therefor.
In a further embodiment, the invention likewise relates to the use of the marker sequences according to the invention, preferably in the form of an arrangement as an affinity material for carrying out apheresis or, in the broader sense, dialysis, wherein substances from body fluids of a patient with systemic lupus erythematosus, such as blood or plasma, bind to the marker sequences according to the invention and consequently can be selectively withdrawn from the body fluid.
Ten or more patient samples were individually screened against a cDNA expression library. The systemic lupus erythematosus-specific expression clones were determined by way of comparison to ten or more healthy samples. The identity of the marker sequences was determined by way of DNA sequencing.
Within the scope of the biomarker identification, various bioinformatics analyses are carried out. Reactivities against approximately 2000 different antigens are measured for each serum using microarrays. This data is used to rank the spotted antigens with respect to the differentiation capability thereof between healthy and diseased sera. This evaluation is carried out by way of the non-parametric Mann-Whitney tests using normalized intensity data. An internal standard, which is also spotted on each chip, is used for normalization purposes. Because a p-value is calculated for each antigen, methods for correcting multiple testing are employed. A very conservative approach that is taken is to carry out a Bonferroni correction, and additionally the less restrictive false discovery rate (FDR) according to Benjamini & Hochberg is calculated.
Additionally, the data is utilized to classify the sera. To this end, different multivariate methods are employed. These are methods selected from statistical learning methods such as support vector machines (SVM), neuronal networks or classification trees, as well as a threshold value method, which is suitable both for classifying and for visually representing the data.
So as to avoid overfitting, tenfold cross validation of the data is carried out.
sapiens]
sapiens]
sapiens]
sapiens]
sapiens]
sapiens]
Number | Date | Country | Kind |
---|---|---|---|
10187316.4 | Oct 2010 | EP | regional |
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/EP11/67842 | 10/12/2011 | WO | 00 | 7/31/2013 |