The invention relates generally to clinical diagnostics and prognostics for infection.
“Break-bone fever”, or dengue fever (DF), was first spread worldwide in the tropics during the 18th and 19th century following the expansion of the commerce and shipping industry. The Aedes aegypti, main mosquito vector, was introduced, along with the dengue virus (DENV), in the new regions chartered by the industry. During the last decade, dengue was able to spread due to an increase in air travel, unprecedented population growth, unplanned and uncontrolled urbanization, and the lack of mosquito control among other things (Rigau-Perez, J., et al., 1998, Lancet 352:971-977). Today it is estimated that 2.5 billion people are at risk of DENV infection in more than 100 countries in the Americas, Southeast Asia, western Pacific, Africa and the eastern Mediterranean. There is an estimated 50 million cases of dengue infection each year with 500,000 cases of dengue hemorrhagic fever (the more severe case of the disease) and at least 12,000 deaths, mostly in children (DengueNet, 2002, Weekly Epidemiological Record 77:300-304).
Dengue virus belongs to the Flavivirus genus that also includes yellow fever, West Nile, tick-borne encephalitis (TBEV), and Japanese encephalitis viruses. There are 4 primary serotypes that exist which can cause different degrees of disease severity ranging from the mildest form of dengue fever (DF), to dengue hemorrhagic fever (DHF), and the most severe form of dengue shock syndrome (DSS). DENV possesses an icosahedral core of 40-50 nm in diameter, containing one of the 3 structural proteins, the C protein. It encapsulates the 10,700 nucleotide plus-sense RNA genome. Surrounding the core is a smooth lipid bilayer composed of the other 2 structural proteins, the membrane (M) protein, and the envelope glycoprotein (E) (Kuhn, R. J., et al., 2002, Cell 108:717-725). The main biological properties of the virus come from the E protein where it allows for receptor binding, haemagglutination of erythrocytes, neutralizing antibody induction, and protective immune response (Chang, G. J. 1997, p. 175-198. In D. J. Gubler and G. Kuno (ed.), CAB International, New York). It also possesses 7 non-structural proteins (NS1, NS2a, NS2b, NS3, NS4a, NS4b, NS5), of which two, NS1 and NS3, are believed to be the most important ones involved in the pathogenesis. Upon primary infection with DENV, antibodies against the surface E, NS1, and NS3 proteins are generated (Green, S. and A. Rothman, 2006, Current Opinion in Infectious Diseases 19:429-436.). Therefore serotypes can be distinguished by virus-neutralizing antibodies, but non-neutralizing antibodies against the E protein and non-structural proteins NS1 and NS3 are cross-reactive. A life-long immunity against the infective serotype ensues, but protection against others is only for a short period of time. During a second infection by a different serotype, the presence of neutralizing antibodies can reduce the severity of the disease. However, if the levels of these antibodies drop under the neutralizing amount, the heterotypic IgG antibodies form complexes with dengue viruses that can bind to the FcyR resulting in an augmentation of the virus infection. This model is called the antibody dependent enhancement (ADE) (Green, S. and A. Rothman, 2006, Current Opinion in Infectious Diseases 19:429-436; Guzman, M. G. and G. Kouri. 2002, The Lancet Infectious Diseases 2:33-42; Kliks, S. C., et al., 1989, American Journal of Tropical Medicine & Hygiene 40:444-451; Oishi, K., et al., 2003, Journal of Medical Virology 71:259-264; and Stephenson, J. R., 2005, Bulletin of the World Health Organization 83:308-314). To further support this model, it has been observed that the incidence of DHF/DSS in children occurs at two distinct peaks in their lives. The first occurs when the child is 6-9 months old. This is the age at which the maternal antibodies are still present in the circulation. If the child gets infected by a different heterotypic DENV than the mother, DHF/DSS ensues since the levels of maternal antibodies have fallen below the protective levels (Simmons, C. P., et al., Journal of Infectious Diseases 196:416-424). The other peak occurs in young children infected for a second time. ADE supports the fact that DHF/DSS is 15-80 times more likely in secondary infections. However, this can not explain the whole pathogenesis of dengue virus and many other factors still to be studied might play a role such as the strain's virulence and the serotype, and the host susceptibility and the specific role of T cells (Chaturvedi, U., et al., 2006, FEMS Immunology & Medical Microbiology 47:155-166, Fink, J., et al., 2006, Reviews in Medical Virology 16:263-275). All these factors need to be considered in the design of a vaccine (Stephenson, J. R., 2005, Bulletin of the World Health Organization 83:308-314).
Once one is bitten by an infected mosquito, there is an incubation period of up to 2 weeks. Most infections are asymptomatic, especially in children under 15 years of age, but can cause a range of symptoms and even lead to death. Population-based studies have shown that the severity of the disease increases with the patient's age (Burke, D. S., 1988, American Journal of Tropical Medicine & Hygiene 38:172-80, Cobra, C., et al., 1995, American Journal of Epidemiology 142:1204-1211, Dietz, V., et al., 1996. Puerto Rico Health Sciences Journal 15:201-210; and Kuberski, T., et al., 1977, American Journal of Tropical Medicine & Hygiene 26:775-783). DF is an acute febrile disease often characterized by frontal headache, retroocular pain, muscle and joint pain, nausea, vomiting, and rash (Kalayanarooj, S., et al., 1997, Journal of Infectious Diseases 176:313-321). The febrile period usually terminates between 5-7 days after the onset of symptoms, often correlating with the disappearance of the virus from the circulation. In Southeast Asia, DHF is mostly seen in children, but it is seen in all age groups in the tropical Americas. This suggests the involvement of race or strain virulence as risk factors. DHF is an acute febrile illness, typically with bleeding, thrombocytopenia, elevated haematocrit, pleural effusions, and hypoproteinaemia. It begins as DF with a sudden onset of fever, and then develops into DHF around 3-7 days of illness (around the time of defervescence for DF) and continues for about 2-7 days. The main pathophysiological difference between DF and DHF is plasma leakage. Dengue shock syndrome (DSS) is the most severe form of the disease characterized by circulatory failure and a narrowing pulse range. Once shock begins, the fatality rate can be as high as 44% if the proper precautions are not taken (Oishi, K., et al., 2003, Journal of Medical Virology 71:259-264). There are no antiviral drugs administered nor are any drugs known to be useful in limiting the plasma leakage. Dengue treatment is only supportive where analgesics and antipyretics (but not aspirin) are given and fluid management is applied. Only when the molecular biology of DHF is understood will we able to treat it (Lei, H. Y., et al., 2001, Journal of Biomedical Science 8:377-388; and Rigau-Perez, J., et al., 1998, Lancet 352:971-977). This is why the diagnostic of a dengue infection needs to be given early in the disease progression so to maximize the patient's chance of survival. However, clinical findings alone are often not very helpful in distinguishing DF from other febrile illnesses (OFIs) such as the chikungunya, measles, leptospirosis, yellow fever, influenza, West Nile, Japanese, and St Louis encephalitis (Rigau-Perez, J., et al., 1998, Lancet 352:971-977; Senanayake, S., 2006, Australian Family Physician 35:609-612; and Teles, F. R., D. M. Prazeres, and J. L. Lima-Filho, 2005, Reviews in Medical Virology 15:287-302).
During a primary infection, IgM antibodies are developed after 5-6 days and are present in the circulation for up to 2-3 months after infection, while IgG antibodies become present after only 7-10 days. On the other hand, a secondary infection occurs when an individual has been previously infected or immunized with a flavivirus. IgM levels are lower if not absent but IgG levels are very high, even during the acute phase of the infection. Therefore, IgM is a sign of an early infection while high levels of IgG reveal a secondary infection (Guzman, M. G. and G. Kouri, 2002, The Lancet Infectious Diseases 2:33-42). Viable DENV particles are detectable in the circulation for up to 5 days after the symptoms but then rapidly disappear upon the appearance of DENV-specific antibodies (Kao, C. L., et al., 2005, Journal of Microbiology, Immunology & Infection 38:5-16).
Enzyme immunoassay (EIA) is used to detect IgM and IgG antibodies to dengue. This method can distinguish a primary infection from a secondary infection by determining the IgM/IgG ratio; if the ratio in convalescent sera exceeds 1.5, it reveals a primary infection. The World Health Organization (WHO) recommends the use of the dengue monoclonal antibody (IgM)-capture EIA (MAC-EIA) which is inexpensive, simple, fast, and only requires one blood sample. However, IgM antibodies can only be detected at least 5 days after infection since this is the time needed for the body to produce anti-dengue antibodies. Moreover, some false-positives can occur due to the persistence of IgM in the blood even after a few months (Teles, F. R., D. M. Prazeres, and J. L. Lima-Filho, 2005, Reviews in Medical Virology 15:287-302).
The haemagglutination-inhibition (HI) is slightly more sensitive than the EIA test. On the other hand, chemical treatment of the samples is needed to remove non-specific inhibitor of heamagglutination as well as non-specific agglutinins Moreover, this test does not differentiate between closely related flavivirus infections or different DENV serotypes. Paired sera are needed and so the results can take weeks.
There exists also the neutralization test which is more sensitive than the HI-test but employs live virus and so Biosafety Level 3 Laboratories are needed. It also encounters the same difficulties as the HI-test in terms of specificity in addition to the extra cost, time, and technical difficulty associated with the neutralizing test (Teles, F. R., D. M. Prazeres, and J. L. Lima-Filho, 2005, Reviews in Medical Virology 15:287-302).
The complement fixation (CF) test is a good marker of recent infection compared to the detection of IgM dengue specific antibodies due to their short persistence in the blood. However, the CF antibody appears only 7-14 days after the onset of symptoms. Also, it is the least sensitive of the serological tests.
Due to some cross-reactivity in flaviviruses, any serologic test must include as controls the four dengue serotypes, another serotype, a non-flavivirus and an uninfected control for it to be a confirmatory diagnosis (Teles, F. R., D. M. Prazeres, and J. L. Lima-Filho, 2005, Reviews in Medical Virology 15:287-302). Also, the high rate of IgG positive results for people in the tropics indicate that paired acute and convalescent serum samples are often critical for the significance of the tests (Rigau-Perez, J., et al., 1998, Lancet 352:971-977).
Inoculation of clinical specimens into mosquito cells, larvae or adult mosquitoes is the most sensitive approach. Specific detection and identification of the virus by immunofluorescence assays with serotype-specific anti-dengue monoclonal antibodies makes this technique able to determine the serotype of DENV. This test is convenient since the samples are relatively suitable for 2 weeks and the test does not require special facilities or special training (Teles, F. R., D. M. Prazeres, and J. L. Lima-Filho, 2005, Reviews in Medical Virology 15:287-302). However, days to weeks are necessary for virus isolation and the cost of equipment and laboratory maintenance is high (Kao, C. L., et al., 2005, Journal of Microbiology, Immunology & Infection 38:5-16).
RNA viral genome can be detected by PCR-based techniques, e.g., RT-PCR. It is a technique that is just as expensive as the virus culture technique with higher contamination risks associated with sample manipulation, but only takes a few hours to perform and is much more sensitive. By using 4 serotype-specific oligonucleotide primers, it is also possible to detect the serotype of the given DENV (Teles, F. R., D. M. Prazeres, and J. L. Lima-Filho, 2005, Reviews in Medical Virology 15:287-302).
Thus a need exists for the identification of biomarkers that could simplify the diagnosis and/or prognosis of dengue and its symptoms at, e.g., reduced costs. The present invention provides for these and other advantages, as described below.
The present invention provides, inter alia, biomarkers that are differentially present in subjects with dengue. In addition, the present invention provides methods of using the biomarkers to qualify dengue in a subject or in a biological sample taken from a subject, including a sample of serum, blood, or other donated tissue. As such, the invention provides biomarkers that represent full length proteins or fragments of proteins expressed in infected individuals by a member of the Flaviviridae family, the pathogen responsible for dengue.
The biomarkers can be used, inter alia, to qualify dengue status, determine the course of dengue, monitor the response to treatment by a drug used to treat dengue, and/or determine a treatment regimen for dengue. The dengue can be caused by members of the Flaviviridae family.
In one aspect, the present invention provides a method for qualifying dengue status in a subject, the method including: (a) measuring at least one biomarker in a biological sample from the subject, wherein the at least one biomarker is selected from the group consisting of the biomarkers of Tables 1-5, 17, 21, and 24; and (b) correlating the measurement with dengue status. In one aspect, the biological sample is a serum sample.
The at least one biomarker can be selected from the group consisting of biomarkers of molecular masses of about 2.5, 2.6, 2.7, 2.8, 3.0, 3.2, 3.4, 3.5, 3.8, 3.9, 4.0, 4.1, 4.2, 4.3, 4.4, 4.5, 4.6, 5.0, 5.1, 5.2, 5.3, 5.5, 5.6, 5.7, 6.0, 6.5, 6.6, 6.7, 6.8, 6.9, 7.0, 7.4, 7.5, 7.6, 8.2, 8.8, 9.0, 9.3, 9.5, 9.6, 10.0, 10.1, 10.2, 10.3, 10.4, 10.5, 10.6, 10.7, 10.8, 10.9, 11.0, 11.1, 11.3, 11.5, 11.7, 11.8, 11.9, 12.1, 12.2, 12.4, 12.5, 12.6, 12.7, 12.9, 13.0, 13.1, 13.2, 13.3, 13.4, 13.5, 13.8, 14.0, 14.1, 14.2, 14.3, 14.4, 15.3, 17.4, 17.7, 17.8, 18.0, 21.5, 22.4, 23.1, 23.3, 23.6, 23.7, 23.8, 25.4, 25.6, 26.5, 26.7, 28.2, 28.4, 29.0, 30.3, 30.8, 31.2, 32.3, 33.5, 33.6, 34.0, 34.2, 34.5, 34.7, 36.5, 36.6, 38.7, 39.5, 39.9, 42.3, 43.5, 44.0, 44.6, 44.7, 45.0, 45.4, 45.5, 45.6, 46.2, 46.6, 46.7, 49.7, 50.5, 51.7, 52.4, 52.6, 53.4, 53.6, 54.3, 54.4, 54.6, 54.8, 55.1, 55.3, 55.8, 56.6, 59.4, 59.5, 61.4, 63.1, 66.6, 66.7, 66.8, 67.1, 69.0, 70.9, 71.3, 71.5, 75.1, 75.2, 75.3, 77.1, 79.1, 79.3, 88.3, 95.7, 109.0, 111.3, 117.2, 123.0, 125.4, 133.4, 133.7, 150.1, 164.6, 188.6, 194.2, and 198.3 kDa and any combination thereof.
The at least one biomarker can be selected from the group consisting of biomarkers of molecular masses of about 2.5, 2.6, 2.7, 2.8, 3.0, 3.2, 3.4, 3.5, 3.8, 3.9, 4.0, 4.1, 4.2, 4.3, 4.4, 4.5, 4.6, 5.0, 5.1, 5.2, 5.3, 5.5, 5.6, 5.7, 6.0, 6.5, 6.6, 6.7, 6.8, 6.9, 7.0, 7.4, 7.5, 7.6, 8.2, 8.8, 9.0, 9.3, 9.5, 9.6, 10.0, 10.1, 10.2, 10.3, 10.4, 10.5, 10.6, 10.7, 10.8, 10.9, 11.0, 11.1, 11.3, 11.5, 11.7, 11.8, 11.9, 12.1, 12.2, 12.4, 12.5, 12.6, 12.7, 12.9, 13.0, 13.1, 13.2, 13.3, 13.4, 13.5, 13.8, 14.0, 14.1, 14.2, 14.3, 14.4, 15.3, 17.4, 17.7, 17.8, 18.0, 21.5, 22.4, 23.1, 23.3, 23.6, 23.7, 23.8, and 25.4 kDa and any combination thereof. The at least one biomarker can be selected from the group consisting of biomarkers of molecular masses of about 25.6, 26.5, 26.7, 28.2, 28.4, 29.0, 30.3, 30.8, 31.2, 32.3, 33.5, 33.6, 34.0, 34.2, 34.5, 34.7, 36.5, 36.6, 38.7, 39.5, 39.9, 42.3, 43.5, 44.0, 44.6, 44.7, 45.0, 45.4, 45.5, 45.6, 46.2, 46.6, 46.7, 49.7, 50.5, 51.7, 52.4, 52.6, 53.4, 53.6, 54.3, 54.4, 54.6, 54.8, 55.1, 55.3, 55.8, 56.6, 59.4, 59.5, 61.4, 63.1, 66.6, 66.7, 66.8, 67.1, 69.0, 70.9, 71.3, 71.5, 75.1, 75.2, 75.3, 77.1, 79.1, 79.3, 88.3, 95.7, 109.0, 111.3, 117.2, 123.0, 125.4, 133.4, 133.7, 150.1, 164.6, 188.6, 194.2, and 198.3 kDa and any combination thereof.
The at least one biomarker can be selected from the group consisting of biomarkers of molecular masses of about 3.4, 3.8, 3.9, 4.0, 4.6, 6.6, 6.7, 7.0, 7.6, 10.6, 11.1, 11.7, 12.5, 12.7, 12.9, 13.1, 13.2, 13.3, 13.4, 14.4, 23.1, 23.3, 23.6, 23.8, 25.4, 34.2, 44.7, 45.6, 46.2, 46.4, 56.6, 117.2, 133.4, 133.7, 198.3 kDa and any combination thereof. The at least one biomarker can be selected from the group consisting of biomarkers of molecular masses of about 3.4, 3.8, 3.9, 4.0, 4.6, 6.6, 6.7, 7.0, 7.6, 10.6, 11.1, 11.7, and 12.5 kDa and any combination thereof. The at least one biomarker can be selected from the group consisting of biomarkers of molecular masses of about 12.7, 12.9, 13.1, 13.2, 13.3, 13.4, 14.4, 23.1, and 23.3 kDa. The at least one biomarker can be selected from the group consisting of biomarkers of molecular masses of about 23.6, 23.8, 25.4, 34.2, and 44.7 kDa and any combination thereof. The at least one biomarker can be selected from the group consisting of biomarkers of molecular masses of about 45.6, 46.2, 46.4, 56.6, 117.2, 133.4, 133.7, and 198.3 kDa and any combination thereof. The at least one biomarker can be selected from the group consisting of biomarkers of molecular masses of about 3.4, 3.8, 3.9, 4.0, 4.6, 6.6, 6.7, 133.4, 133.7, and 198.3 kDa and any combination thereof. The at least one biomarker can be selected from the group consisting of biomarkers of molecular masses of about 3.4, 3.8, 3.9, 25.4, 34.2, 44.7, 45.6, 46.2, and 46.4 kDa and any combination thereof. The at least one biomarker can be selected from the group consisting of biomarkers of molecular masses of about 4.6, 25.4, 34.2, and 44.7 kDa and any combination thereof. The at least one biomarker can be selected from the group consisting of biomarkers of molecular masses of about 7.0, 7.6, 10.6, 11.1, 11.7, 12.5, 12.7, 12.9, 13.1, 13.2, 13.3, 13.4, 14.4, 23.8, 25.4, 34.2, 44.7, and 45.6 kDa and any combination thereof. The at least one biomarker can be selected from the group consisting of biomarkers of molecular masses of about 3.4, 3.8, 3.9, 4.0, 4.6, 6.6, 6.7, 117.2, 133.4, 133.7, and 198.3 kDa and any combination thereof. The at least one biomarker can be selected from the group consisting of biomarkers of molecular masses of about 12.7, 12.9, 13.1, 13.2, and 13.3 kDa and any combination thereof.
The at least one biomarker can be selected from the group consisting of biomarkers of molecular masses of about 2.6, 2.7, 2.8, 3.5, 4.2, 5.1, 5.2, 5.3, 5.5, 5.6, 5.7, 6.7, 6.8, 6.9, 7.0, 7.4, 8.8, 9.0, 9.3, 9.5, 9.6, 10.3, 10.9, 11.3, 11.5, 11.7, 11.8, 11.9, 12.4, 12.6, 12.9, 13.5, 13.8, 25.6, 32.3, 39.9, 42.3, 44.0, 44.6, 45.0, 46.6, 46.7, 49.7, 53.6, 54.4, 55.8, 63.1, 67.1, 75.3, 88.3, 111.3, and 150.1 kDa and any combination thereof. The at least one biomarker can be selected from the group consisting of biomarkers of molecular masses of about 2.6, 2.7, 2.8, 3.5, 4.2, 5.1, 5.2, 5.3, 5.5, 5.6, 5.7, 6.7, and 6.8 kDa and any combination thereof. The at least one biomarker can be selected from the group consisting of biomarkers of molecular masses of about 6.9, 7.0, 7.4, 8.8, 9.0, 9.3, 9.5, 9.6, 10.3, 10.9, 11.3, 11.5, 11.7, 11.8, 11.9, and 12.4 kDa and any combination thereof. It will be understood that any combination of the biomarkers described herein can be measured using the methods described herein.
In some aspects, the at least one biomarker is selected from the group consisting of biomarkers of molecular masses of about 12.6, 12.9, 13.5, 13.8, 25.6, 32.3, 39.9, 42.3, 44.0, 44.6, and 45.0 kDa and any combination thereof. In some aspects, the at least one biomarker is selected from the group consisting of biomarkers of molecular masses of about 46.6, 46.7, 49.7, 53.6, 54.4, 55.8, 63.1, 67.1, 75.3, 88.3, 111.3, and 150.1 kDa and any combination thereof. In some aspects, the at least one biomarker is selected from the group consisting of biomarkers of molecular masses of about 2.6, 2.7, 2.8, 3.5, 4.2, 5.1, 5.2, 5.3, 67.1, 75.3, 88.3, 111.3, and 150.1 kDa and any combination thereof. In some aspects, each of the biomarkers having a molecular mass of about 75.3, 88.3, 111.3, and 150.1 kDa is measured.
In some aspects, the at least one biomarker is selected from the group consisting of biomarkers of molecular masses of about 2.6, 2.7, 11.7, 11.8, 11.9, 12.4, 67.1, 75.3, 88.3, 111.3, and 150.1 kDa and any combination thereof. In some aspects, the at least one biomarker is selected from the group consisting of biomarkers of molecular masses of about 7.4, 8.8, 9.0, 9.3, 9.5, 9.6, 10.3, 10.9, 12.4, 12.6, 12.9, 13.5, 13.8, 25.6, and 32.3 kDa and any combination thereof. In some aspects, the at least one biomarker is selected from the group consisting of biomarkers of molecular masses of about 11.5, 25.6, and 32.3 kDa and any combination thereof.
In some aspects, the at least one biomarker is a protein or fragment thereof as provided in Table 5. In certain aspects, the at least one biomarker is represented by at least one of the accession numbers provided in Table 5.
In one aspect, the at least one biomarker is measured by capturing the biomarker on an adsorbent of a SELDI probe and detecting the captured biomarkers by laser desorption-ionization mass spectrometry. In certain aspects, the adsorbent is a cation exchange adsorbent, whereas in other aspects, the adsorbent is a metal chelation adsorbent. In another aspect, the at least one biomarker is measured by immunoassay.
In another aspect, the correlating is performed by a software classification algorithm. In a further aspect, dengue status is selected from chronically infected versus uninfected. In yet other aspects, dengue status is selected from chronically infected status versus acutely infected disease status, chronically infected asymptomatic status versus chronically affected with symptoms, or acutely infected status versus healthy uninfected status. In still another aspect, dengue status is selected from dengue versus healthy. In yet other aspects, dengue status is selected from dengue fever (DF), dengue hemorrhagic fever (DHF), and dengue shock syndrome (DSS). In other aspects, the biomarkers of the present invention can be used to predict the effectiveness of a dengue vaccine. In other aspects, dengue status is selected from primary infection and secondary infection.
In yet another aspect, the method further comprises managing subject treatment based on the status. If the measurement correlates with dengue, then managing subject treatment comprises administering to a patient drugs selected from a group consisting of, but not necessarily limited to, drugs such as paracetamol, antipyretics, and combinations thereof.
In a further aspect, the method further comprises measuring the at least one biomarker after subject management.
In another aspect, the present invention provides a method comprising measuring at least one biomarker in a sample from a subject, wherein the at least one biomarker is selected from the group consisting of the biomarkers set forth in Tables 1-5, 17, 21, and 24. In one aspect, the sample is a serum sample.
In still another aspect, the present invention provides a kit comprising: (a) a solid support comprising at least one capture reagent attached thereto, wherein the capture reagent binds at least one biomarker from a first group consisting of the biomarkers set forth in Table 1, Table 2, Table 3, Table 4, Table 5, Table 17, Table 21, and Table 24; and (b) instructions for using the solid support to detect the at least one biomarker set forth in Table 1, Table 2, Table 3, Table 4, Table 5, Table 17, Table 21, and Table 24.
In other aspects, the kit additionally comprises (c) a container containing at least one of the biomarkers of Table 1, Table 2, Table 3, Table 4, Table 5, Table 17, Table 21, and Table 24.
In yet a further aspect, the present invention provides a software product, the software product comprising: (a) code that accesses data attributed to a sample, the data comprising measurement of at least one biomarker in the sample, the biomarker selected from the group consisting of the biomarkers of Table 1, Table 2, Table 3, Table 4, Table 5, Table 17, Table 21, and Table 24; and (b) code that executes a classification algorithm that classifies dengue status of the sample as a function of the measurement.
In one aspect, the classification algorithm classifies dengue status of the sample as a function of the measurement of a biomarker selected from the biomarkers of Tables 1-5, 17, 21, and 24.
In other aspects, the present invention provides purified biomolecules selected from the biomarkers set forth in Table 1, Table 2, Table 3, Table 4, Table 5, Table 17, Table 21, and Table 24 and, additionally, methods comprising detecting a biomarker set forth in Table 1, Table 2, Table 3, Table 4, Table 5, Table 17, Table 21, and Table 24 by mass spectrometry or immunoassay.
In yet another aspect, the method further comprises testing and qualifying stocks of blood based on the status of blood which has been tested according to the methods described herein. If the measurements taken from blood samples correlate with dengue, then the management of blood stocks comprises decontamination of the infected blood by treatment of the infected blood with purification agents available to one skilled in the art. Alternatively, the infected blood can be discarded or destroyed and only stocks of blood which have not tested positively for dengue are retained.
In one aspect, the present invention provides a method for qualifying dengue status in a subject in comparison to the status of a different viral infection, the method comprising: (a) measuring at least one biomarker in a biological sample from the subject, wherein the at least one biomarker specifically indicates the presence of dengue and does not indicate the presence of a different infection; and (b) correlating the measurement with dengue status in comparison to the status of a different infection. In one aspect, the biological sample is a serum sample. In a preferred aspect of this method, the at least one biomarker is selected from the group of biomarkers of Tables 1-5, 17, 21, and 24. In still another preferred aspect, the infection includes, but is not limited to other febrile illnesses (OFIs).
In another aspect, the present invention provides a method for monitoring the course of progression of dengue in a patient comprising: (a) measuring at least one biomarker in a first biological sample from the patient, wherein the at least one biomarker specifically indicates the presence of dengue; (b) measuring the at least one biomarker in a second biological sample from the subject, wherein the second biological sample was obtained from the subject after the first biological sample; and (c) correlating the measurements with the progression or regression of dengue in the subject. In one aspect, the at least one biomarker is selected from the group consisting of the biomarkers of Tables 1-5, 17, 21, and 24.
Other features, objects and advantages of the invention and its preferred aspects will become apparent from the detailed description, examples and claims that follow.
These and other features, aspects, and advantages of the present invention will become better understood with regard to the following description, and accompanying drawings, where:
A biomarker is an organic biomolecule which is differentially present in a sample taken from a subject of one phenotypic status (e.g., having a disease) as compared with another phenotypic status (e.g., not having the disease). A biomarker is differentially present between different phenotypic statuses if the expression level of the biomarker (e.g., as indicated by the mean, median, or other measure) in the different groups is calculated to be statistically significant. Common tests for statistical significance include, among others, t-test, ANOVA, Kruskal-Wallis, Wilcoxon, Mann-Whitney and odds ratio. Biomarkers, alone or in combination, provide measures of relative risk that a subject belongs to one phenotypic status or another. Therefore, they are useful as markers for disease (diagnostics), therapeutic effectiveness of a drug (theranostics), drug toxicity, and the like.
It is to be understood that this invention is not limited to particular methods, reagents, compounds, compositions, or biological systems, which can, of course, vary. It is also to be understood that the terminology used herein is for the purpose of describing particular aspects only, and is not intended to be limiting. As used in this specification and the appended claims, the singular forms “a”, “an”, and “the” include plural referents unless the content clearly dictates otherwise. Thus, for example, reference to “a biomarker” includes a combination of two or more biomarkers, and the like.
“About” as used herein when referring to a measurable value such as an amount, a temporal duration, and the like, is meant to encompass variations of ±20% or ±10%, more preferably ±5%, even more preferably ±1%, and still more preferably ±0.1% from the specified value, as such variations are appropriate to perform the disclosed methods.
Unless defined otherwise, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which the invention pertains. Although any methods and materials similar or equivalent to those described herein can be used in the practice for testing of the present invention, the preferred materials and methods are described herein. In describing and claiming the present invention, the following terminology will be used.
The term “in situ” refers to processes that occur in a living cell growing separate from a living organism, e.g., growing in tissue culture.
The term “in vivo” refers to processes that occur in a living organism.
The term “mammal” as used herein includes both humans and non-humans and include but is not limited to humans, non-human primates, canines, felines, murines, bovines, equines, and porcines.
As used herein, the term “residue” refers to amino acids or analogs thereof.
As used herein, the term “peptide” refers to peptides, proteins, fragments of proteins, peptidomimetics, and the like that are comprised of more than one amino acid residue or similar molecule.
The term percent “identity,” in the context of two or more nucleic acid or polypeptide sequences, refer to two or more sequences or subsequences that have a specified percentage of nucleotides or amino acid residues that are the same, when compared and aligned for maximum correspondence, as measured using one of the sequence comparison algorithms described below (e.g., BLASTP and BLASTN or other algorithms available to persons of skill) or by visual inspection. Depending on the application, the percent “identity” can exist over a region of the sequence being compared, e.g., over a functional domain, or, alternatively, exist over the full length of the two sequences to be compared.
For sequence comparison, typically one sequence acts as a reference sequence to which test sequences are compared. When using a sequence comparison algorithm, test and reference sequences are input into a computer, subsequence coordinates are designated, if necessary, and sequence algorithm program parameters are designated. The sequence comparison algorithm then calculates the percent sequence identity for the test sequence(s) relative to the reference sequence, based on the designated program parameters.
Optimal alignment of sequences for comparison can be conducted, e.g., by the local homology algorithm of Smith & Waterman, 1981, Adv. Appl. Math. 2:482, by the homology alignment algorithm of Needleman & Wunsch, 1970, J. Mol. Biol. 48:443, by the search for similarity method of Pearson & Lipman, 1988, Proc. Nat'l. Acad. Sci. USA 85:2444, by computerized implementations of these algorithms (GAP, BESTFIT, FASTA, and TFASTA in the Wisconsin Genetics Software Package, Genetics Computer Group, 575 Science Dr., Madison, Wis.), or by visual inspection (see generally Ausubel et al., infra).
One example of an algorithm that is suitable for determining percent sequence identity and sequence similarity is the BLAST algorithm, which is described in Altschul et al., 1990, J. Mol. Biol. 215:403-410. Software for performing BLAST analyses is publicly available through the National Center for Biotechnology Information (www.ncbi.nlm.nih.gov/).
The term “sufficient amount” means an amount sufficient to produce a desired effect, e.g., an amount sufficient to modulate protein aggregation in a cell.
The term “therapeutically effective amount” is an amount that is effective to ameliorate a symptom of a disease. A therapeutically effective amount can be a “prophylactically effective amount” as prophylaxis can be considered therapy.
A biomarker is an organic biomolecule which is differentially present in a sample taken from a subject of one phenotypic status (e.g., having a disease) as compared with another phenotypic status (e.g., not having the disease). A biomarker is differentially present between different phenotypic statuses if the mean or median expression level of the biomarker in the different groups is calculated to be statistically significant. Common tests for statistical significance include, among others, t-test, ANOVA, Kruskal-Wallis, Wilcoxon, Mann-Whitney and odds ratio. Biomarkers, alone or in combination, provide measures of relative risk that a subject belongs to one phenotypic status or another. Therefore, they are useful as markers for disease (diagnostics), therapeutic effectiveness of a drug (theranostics), prognostics, and drug toxicity.
The term “chronic” refers to a disease or condition that is long-lasting or recurrent. The term chronic describes the course of the disease, or its rate of onset and development. A chronic course is distinguished from a recurrent course; recurrent diseases or conditions relapse repeatedly, with periods of remission in between.
The term “acute” means an exacerbated event or attack, of short course, followed by a period of remission.
Biomarkers for Dengue
This invention provides, inter alia, polypeptide-based biomarkers that are differentially present in subjects having dengue, in particular, and particularly that are differentially expressed in subjects infected with dengue versus non uninfected individuals (e.g., control, healthy, benign condition or other disease state). The biomarkers are characterized by mass-to-charge ratio as determined by mass spectrometry, by the shape of their spectral peak in time-of-flight mass spectrometry and by their binding characteristics to adsorbent surfaces. These characteristics provide one method to determine whether a particular detected biomolecule is a biomarker of this invention. These characteristics represent inherent characteristics of the biomolecules and not process limitations in the manner in which the biomolecules are discriminated. In one aspect, this invention provides these biomarkers in isolated form.
The biomarkers of Tables 3-4 were discovered using SELDI technology employing ProteinChip® arrays from Ciphergen Biosystems, Inc. (Fremont, Calif.) (“Ciphergen”). Serum samples were collected from subjects diagnosed with dengue and subjects diagnosed as healthy as well as subjects diagnosed with other febrile illnesses (OFIs). “Other febrile illnesses” are defined as cases with no evidence of dengue infection and no obvious bacterial, rickettsial or protozoan etiology, including, without limitation, chikungunya, measles, leptospirosis, yellow fever, influenza, West Nile, Japanese, and St Louis encephalitis. The samples were fractionated by anion exchange chromatography. Fractionated samples were applied to SELDI biochips and spectra of polypeptides in the samples were generated by time-of-flight mass spectrometry on a Ciphergen PBS IIc mass spectrometer. The spectra thus obtained were analyzed by Ciphergen Express™ Data Manager Software with Biomarker Wizard and Biomarker Pattern Software from Ciphergen Biosystems, Inc. The mass spectra for each group were subjected to scatter plot analysis. A Mann-Whitney test analysis was employed to compare dengue and control groups for each protein cluster in the scatter plot, and proteins were selected that differed significantly (p<0.05) between the two groups. This method is described in more gel electrophoresis followed by protein identification by matrix-assisted laser desorption/ionization mass spectrometry (DIGE and MALDI-TOFMS). This method is described in more detail in the Examples.
The biomarkers thus discovered are presented in Tables 1-4 (the protocol for the data obtained is further described below in the Examples).
Streptococcus
mutans
Streptococcus
mutans
± 19.421
± 22.022
)
&
chain
&
chain
&
chain
indicates data missing or illegible when filed
The biomarkers are characterized by their mass-to-charge ratio as determined by mass spectrometry. The mass-to-charge ratios were determined from mass spectra generated on a Ciphergen Biosystems, Inc. PBS IIc mass spectrometer. This instrument has a mass accuracy of about +/−0.15 percent. Additionally, the instrument has a mass resolution of about 400 to 1000 m/dm, where m is mass and dm is the mass spectral peak width at 0.5 peak height. The mass-to-charge ratio of the biomarkers was determined using Biomarker Wizard™ software (Ciphergen Biosystems, Inc.). Biomarker Wizard assigns a mass-to-charge ratio to a biomarker by clustering the mass-to-charge ratios of the same peaks from all the spectra analyzed, as determined by the PBSIIc, taking the maximum and minimum mass-to-charge-ratio in the cluster, and dividing by two. Accordingly, the masses provided reflect these specifications.
The identity of certain of the biomarkers of Tables 1-4 of this invention has been determined and is indicated in Tables 1-4 and/or Table 5. Table 5 shows the accession numbers for the biomarkers as determined on the NCBI web-site on Oct. 10, 2008. Thus, one of ordinary skill in the art could ascertain the nucleotide and amino acid sequences of the biomarkers based on this information without undue experimentation.
Tables 17-24 (below) show biomarkers of the invention. Table 17 shows the exemplary biomarkers for detecting primary DENV infection as detected by Biomarker Pattern Software (BPS). Tables B-D show all biomarkers detected by SELDI for primary DENV infection that had a p-value smaller than or equal to 0.05. Table 21 shows the exemplary biomarkers for detecting secondary DENV infection as detected by BPS. Tables F and G show the biomarkers for detecting secondary DENV infection. Table 24 shows the biomarkers that can be used to differentiate between primary and secondary DENV infection as detected by BPS.
For biomarkers whose identify has been determined, the presence of the biomarker can be determined by methods known in the art other than mass spectrometry.
Streptococcus mutans
The biomarkers of this invention can be further characterized by the shape of their spectral peak in time-of-flight mass spectrometry.
The biomarkers of this invention can be further characterized by their binding properties on chromatographic surfaces.
Because the biomarkers are characterized by mass-to-charge ratio and binding properties, they can be detected by mass spectrometry without knowing their specific identity. The identity of certain of the biomarkers of Tables 1-4, and 17-24 is known and, if known, is shown in Tables 1-4 and/or Table 5. If desired, biomarkers whose identity is not determined can be identified by, for example, determining the amino acid sequence of the polypeptides. For example, a biomarker can be peptide-mapped with a number of enzymes, such as trypsin or V8 protease, and the molecular weights of the digestion fragments can be used to search databases for sequences that match the molecular weights of the digestion fragments generated by the various enzymes. Alternatively, protein biomarkers can be sequenced using tandem MS technology. In this method, the protein is isolated by, for example, gel electrophoresis. A band containing the biomarker is cut out and the protein is subject to protease digestion. Individual protein fragments are separated by a first mass spectrometer. The fragment is then subjected to collision-induced cooling, which fragments the peptide and produces a polypeptide ladder. A polypeptide ladder is then analyzed by the second mass spectrometer of the tandem MS. The difference in masses of the members of the polypeptide ladder identifies the amino acids in the sequence. An entire protein can be sequenced this way, or a sequence fragment can be subjected to database mining to find identity candidates.
The preferred biological source for detection of the biomarkers is serum. However, in other aspects, the biomarkers are detected in urine and other biological samples.
The biomarkers of this invention are biomolecules. Accordingly, this invention provides these biomolecules in isolated form. The biomarkers can be isolated from biological fluids, such as serum. They can be isolated by any method known in the art, based on both their mass and their binding characteristics. For example, a sample comprising the biomolecules can be subject to chromatographic fractionation, as described herein, and subject to further separation by, e.g., acrylamide gel electrophoresis. Knowledge of the identity of the biomarker also allows their isolation by immunoaffinity chromatography.
Biomarkers and Modified Forms of a Protein
Proteins frequently exist in a sample in a plurality of different forms. These forms can result from either, or both, of pre- and post-translational modification. Pre-translational modified forms include allelic variants, slice variants and RNA editing forms. Post-translationally modified forms include forms resulting from proteolytic cleavage (e.g., fragments of a parent protein), glycosylation, phosphorylation, lipidation, oxidation, methylation, cysteinylation, sulphonation and acetylation. When detecting or measuring a protein in a sample, the ability to differentiate between different forms of a protein depends upon the nature of the difference and the method used to detect or measure. For example, immunological methods of detection typically cannot distinguish between different forms of a protein that contain the same epitope or epitopes to which the antibody or antibodies are directed. In diagnostic assays, the inability to distinguish different forms of a protein has little impact when the forms detected by the particular method used are equally good biomarkers as any particular form. However, when a particular form (or a subset of particular forms) of a protein is a better biomarker than the collection of modified forms detected together by a particular method, the power of the assay can suffer. In this case, it is useful to employ an assay method that distinguishes between forms of a protein and that specifically detects and measures a desired modified form or forms of the protein. Distinguishing different forms of an analyte or specifically detecting a particular form of an analyte is referred to as “resolving” the analyte.
The collection of analytes detected in an assay and the ability to resolve modified forms of a protein of course depends on the methodology used. For example, an immunoassay using a monoclonal antibody will detect all forms of a protein containing the eptiope and will not distinguish between them. However, a sandwich immunoassay that uses two antibodies directed against different epitopes on a protein will detect all forms of the protein that contain both epitope and will not detect those forms that contain only one of the epitopes. Accordingly this method can be useful when the modified forms differ in a terminal amino acid and one of the antibodies is directed to the terminus of one of these forms.
Preferably, the biospecific capture reagent is bound to a solid phase, such as a bead, a plate, a membrane or a chip. Methods of coupling biomolecules, such as antibodies, to a solid phase are well known in the art. They can employ, for example, bifunctional linking agents, or the solid phase can be derivatized with a reactive group, such as an epoxide or an imidizole, that will bind the molecule on contact. Biospecific capture reagents against different target proteins can be mixed in the same place, or they can be attached to solid phases in different physical or addressable locations. For example, one can load multiple columns with derivatized beads, each column able to capture a single protein cluster. Alternatively, one can pack a single column with different beads derivatized with capture reagents against a variety of protein clusters, thereby capturing all the analytes in a single place. Accordingly, antibody-derivatized bead-based technologies, such as xMAP technology of Luminex (Austin, Tex.) can be used to detect the protein clusters. However, the biospecific capture reagents must be specifically directed toward the members of a cluster in order to differentiate them.
Mass spectrometry is a particularly powerful resolving methodology because different forms of a protein typically have different masses and can be differentiated by mass spectrometry. One useful methodology combines mass spectrometry with immunoassay. First, a biospecific capture reagent (e.g., an antibody, aptamer or Affibody that recognizes the biomarker and modified forms of it) is used to capture the biomarker of interest. Preferably, the biospecific capture reagent is bound to a solid phase, such as a bead, a plate, a membrane or a chip. After unbound materials are washed away, the captured analytes are detected and/or measured by mass spectrometry. (This method also will also result in the capture of protein interactors that are bound to the proteins or that are otherwise recognized by antibodies and that, themselves, can be biomarkers.) Then, the captured proteins can be detected by SELDI mass spectrometry or by eluting the proteins from the capture reagent and detecting the eluted proteins by traditional MALDI, SELDI or any other ionization method for mass spectrometry (e.g., electrospray).
Thus, when reference is made herein to detecting a particular protein or to measuring the amount of a particular protein, it means detecting and measuring the protein with or without resolving modified forms of protein. For example, the step of “measuring Apolipoprotein A-IV precursor” includes measuring Apolipoprotein A-IV precursor by means that do not differentiate between various forms of the protein (e.g., certain immunoassays) as well as by means that differentiate some forms from other forms or that measure a specific form of the protein. In contrast, when it is desired to measure a particular form or forms of a protein, the particular form (or forms) is specified. For example, “measuring M7.065159” or a biomarker of 7.065159 kDa means measuring it in a way that distinguishes it from forms of the protein that do not have the characteristic properties identified in Tables 1-5.
Detection of Biomarkers for Dengue
The biomarkers of this invention can be detected by any suitable method. Detection paradigms that can be employed to this end include optical methods, electrochemical methods (voltametry and amperometry techniques), atomic force microscopy, and radio frequency methods, e.g., multipolar resonance spectroscopy. Illustrative of optical methods, in addition to microscopy, both confocal and non-confocal, are detection of fluorescence, luminescence, chemiluminescence, absorbance, reflectance, transmittance, and birefringence or refractive index (e.g., surface plasmon resonance, ellipsometry, a resonant mirror method, a grating coupler waveguide method or interferometry).
In one aspect, a sample is analyzed by means of a biochip. Biochips generally comprise solid substrates and have a generally planar surface, to which a capture reagent (also called an adsorbent or affinity reagent) is attached. Frequently, the surface of a biochip comprises a plurality of addressable locations, each of which has the capture reagent bound there.
Protein biochips are biochips adapted for the capture of polypeptides. Many protein biochips are described in the art. These include, for example, protein biochips produced by Ciphergen Biosystems, Inc. (Fremont, Calif.), Zyomyx (Hayward, Calif.), Invitrogen (Carlsbad, Calif.), Biacore (Uppsala, Sweden) and Procognia (Berkshire, UK). Examples of such protein biochips are described in the following patents or published patent applications: U.S. Pat. No. 6,225,047 (Hutchens & Yip); U.S. Pat. No. 6,537,749 (Kuimelis and Wagner); U.S. Pat. No. 6,329,209 (Wagner et al.); PCT International Publication No. WO 00/56934 (Englert et al.); PCT International Publication No. WO 03/048768 (Boutell et al.); and U.S. Pat. No. 5,242,828 (Bergstrom et al.).
Detection by Mass Spectrometry
In a preferred aspect, the biomarkers of this invention are detected by mass spectrometry, a method that employs a mass spectrometer to detect gas phase ions. Examples of mass spectrometers are time-of-flight, magnetic sector, quadrupole filter, ion trap, ion cyclotron resonance, electrostatic sector analyzer and hybrids of these.
In a further preferred method, the mass spectrometer is a laser desorption/ionization mass spectrometer. In laser desorption/ionization mass spectrometry, the analytes are placed on the surface of a mass spectrometry probe, a device adapted to engage a probe interface of the mass spectrometer and to present an analyte to ionizing energy for ionization and introduction into a mass spectrometer. A laser desorption mass spectrometer employs laser energy, typically from an ultraviolet laser, but also from an infrared laser, to desorb analytes from a surface, to volatilize and ionize them and make them available to the ion optics of the mass spectrometer.
SELDI
A preferred mass spectrometric technique for use in the invention is “Surface Enhanced Laser Desorption and Ionization” or “SELDI,” as described, for example, in U.S. Pat. No. 5,719,060 and No. 6,225,047, both to Hutchens and Yip. This refers to a method of desorption/ionization gas phase ion spectrometry (e.g., mass spectrometry) in which an analyte (here, one or more of the biomarkers) is captured on the surface of a SELDI mass spectrometry probe. There are several versions of SELDI.
One version of SELDI is called “affinity capture mass spectrometry.” It also is called “Surface-Enhanced Affinity Capture” or “SEAC”. This version involves the use of probes that have a material on the probe surface that captures analytes through a non-covalent affinity interaction (adsorption) between the material and the analyte. The material is variously called an “adsorbent,” a “capture reagent,” an “affinity reagent” or a “binding moiety.” Such probes can be referred to as “affinity capture probes” and as having an “adsorbent surface.” The capture reagent can be any material capable of binding an analyte. The capture reagent is attached to the probe surface by physisorption or chemisorption. In certain aspects the probes have the capture reagent already attached to the surface. In other aspects, the probes are pre-activated and include a reactive moiety that is capable of binding the capture reagent, e.g., through a reaction forming a covalent or coordinate covalent bond. Epoxide and acyl-imidizole are useful reactive moieties to covalently bind polypeptide capture reagents such as antibodies or cellular receptors. Nitrilotriacetic acid and iminodiacetic acid are useful reactive moieties that function as chelating agents to bind metal ions that interact non-covalently with histidine containing peptides. Adsorbents are generally classified as chromatographic adsorbents and biospecific adsorbents.
“Chromatographic adsorbent” refers to an adsorbent material typically used in chromatography. Chromatographic adsorbents include, for example, ion exchange materials, metal chelators (e.g., nitrilotriacetic acid or iminodiacetic acid), immobilized metal chelates, hydrophobic interaction adsorbents, hydrophilic interaction adsorbents, dyes, simple biomolecules (e.g., nucleotides, amino acids, simple sugars and fatty acids) and mixed mode adsorbents (e.g., hydrophobic attraction/electrostatic repulsion adsorbents).
“Biospecific adsorbent” refers to an adsorbent comprising a biomolecule, e.g., a nucleic acid molecule (e.g., an aptamer), a polypeptide, a polysaccharide, a lipid, a steroid or a conjugate of these (e.g., a glycoprotein, a lipoprotein, a glycolipid, a nucleic acid (e.g., DNA)-protein conjugate). In certain instances, the biospecific adsorbent can be a macromolecular structure such as a multiprotein complex, a biological membrane or a virus. Examples of biospecific adsorbents are antibodies, receptor proteins and nucleic acids. Biospecific adsorbents typically have higher specificity for a target analyte than chromatographic adsorbents. Further examples of adsorbents for use in SELDI can be found in U.S. Pat. No. 6,225,047. A “bioselective adsorbent” refers to an adsorbent that binds to an analyte with an affinity of at least 10−8 M.
Protein biochips produced by Ciphergen Biosystems, Inc. comprise surfaces having chromatographic or biospecific adsorbents attached thereto at addressable locations. Ciphergen ProteinChip® arrays include NP20 (hydrophilic); H4 and HSO (hydrophobic); SAX-2, Q-10 and LSAX-30 (anion exchange); WCX-2, CM-10 and LWCX-30 (cation exchange); IMAC-3, IMAC-30 and IMAC 40 (metal chelate); and PS-10, PS-20 (reactive surface with acyl-imidizole, epoxide) and PG-20 (protein G coupled through acyl-imidizole). Hydrophobic ProteinChip arrays have isopropyl or nonylphenoxy-poly(ethylene glycol)methacrylate functionalities. Anion exchange ProteinChip arrays have quaternary ammonium functionalities. Cation exchange ProteinChip arrays have carboxylate functionalities. Immobilized metal chelate ProteinChip arrays have nitrilotriacetic acid functionalities that adsorb transition metal ions, such as copper, nickel, zinc, and gallium, by chelation. Preactivated ProteinChip arrays have acyl-imidizole or epoxide functional groups that can react with groups on proteins for covalent binding.
Such biochips are further described in: U.S. Pat. No. 6,579,719 (Hutchens and Yip, “Retentate Chromatography,” Jun. 17, 2003); U.S. Pat. No. 6,897,072 (Rich et al., “Probes for a Gas Phase Ion Spectrometer,” Can 24, 2005); U.S. Pat. No. 6,555,813 (Beecher et al., “Sample Holder with Hydrophobic Coating for Gas Phase Mass Spectrometer,” Apr. 29, 2003); U.S. Patent Application No. U.S. 2003 0032043 A1 (Pohl and Papanu, “Latex Based Adsorbent Chip,” Jul. 16, 2002); and PCT International Publication No. WO 03/040700 (Um et al., “Hydrophobic Surface Chip,” Can 15, 2003); U.S. Patent Application No. US 2003/0218130 A1 (Boschetti et al., “Biochips With Surfaces Coated With Polysaccharide-Based Hydrogels,” Apr. 14, 2003) and U.S. Patent Application No. 60/448,467, entitled “Photocrosslinked Hydrogel Surface Coatings” (Huang et al., filed Feb. 21, 2003).
In general, a probe with an adsorbent surface is contacted with the sample for a period of time sufficient to allow the biomarker or biomarkers that can be present in the sample to bind to the adsorbent. After an incubation period, the substrate is washed to remove unbound material. Any suitable washing solutions can be used; preferably, aqueous solutions are employed. The extent to which molecules remain bound can be manipulated by adjusting the stringency of the wash. The elution characteristics of a wash solution can depend, for example, on pH, ionic strength, hydrophobicity, degree of chaotropism, detergent strength, and temperature. Unless the probe has both SEAC and SEND properties (as described herein), an energy absorbing molecule then is applied to the substrate with the bound biomarkers.
The biomarkers bound to the substrates are detected in a gas phase ion spectrometer such as a time-of-flight mass spectrometer. The biomarkers are ionized by an ionization source such as a laser, the generated ions are collected by an ion optic assembly, and then a mass analyzer disperses and analyzes the passing ions. The detector then translates information of the detected ions into mass-to-charge ratios. Detection of a biomarker typically will involve detection of signal intensity. Thus, both the quantity and mass of the biomarker can be determined.
Another version of SELDI is Surface-Enhanced Neat Desorption (SEND), which involves the use of probes comprising energy absorbing molecules that are chemically bound to the probe surface (“SEND probe”). The phrase “energy absorbing molecules” (EAM) denotes molecules that are capable of absorbing energy from a laser desorption/ionization source and, thereafter, contribute to desorption and ionization of analyte molecules in contact therewith. The EAM category includes molecules used in MALDI, frequently referred to as “matrix,” and is exemplified by cinnamic acid derivatives, sinapinic acid (SPA), cyano-hydroxy-cinnamic acid (CHCA) and dihydroxybenzoic acid, ferulic acid, and hydroxyaceto-phenone derivatives. In certain aspects, the energy absorbing molecule is incorporated into a linear or cross-linked polymer, e.g., a polymethacrylate. For example, the composition can be a co-polymer of a-cyano-4-methacryloyloxycinnamic acid and acrylate. In another aspect, the composition is a co-polymer of a-cyano-4-methacryloyloxycinnamic acid, acrylate and 3-(tri-ethoxy)silyl propyl methacrylate. In another aspect, the composition is a co-polymer of a-cyano-4-methacryloyloxycinnamic acid and octadecylmethacrylate (“C18 SEND”). SEND is further described in U.S. Pat. No. 6,124,137 and PCT International Publication No. WO 03/64594 (Kitagawa, “Monomers And Polymers Having Energy Absorbing Moieties Of Use In Desorption/Ionization Of Analytes,” Aug. 7, 2003).
SEAC/SEND is a version of SELDI in which both a capture reagent and an energy absorbing molecule are attached to the sample presenting surface. SEAC/SEND probes therefore allow the capture of analytes through affinity capture and ionization/desorption without the need to apply external matrix. The C18 SEND biochip is a version of SEAC/SEND, comprising a C18 moiety which functions as a capture reagent, and a CHCA moiety which functions as an energy absorbing moiety.
Another version of SELDI, called Surface-Enhanced Photolabile Attachment and Release (SEPAR), involves the use of probes having moieties attached to the surface that can covalently bind an analyte, and then release the analyte through breaking a photolabile bond in the moiety after exposure to light, e.g., to laser light (see, U.S. Pat. No. 5,719,060). SEPAR and other forms of SELDI are readily adapted to detecting a biomarker or biomarker profile, pursuant to the present invention.
Other Mass Spectrometry Methods
In another mass spectrometry method, the biomarkers are first captured on a chromatographic resin having chromatographic properties that bind the biomarkers. In the present example, this could include a variety of methods. For example, one could capture the biomarkers on a cation exchange resin, such as CM Ceramic HyperD F resin, wash the resin, elute the biomarkers and detect by MALDI. Alternatively, this method could be preceded by fractionating the sample on an anion exchange resin before application to the cation exchange resin. In another alternative, one could fractionate on an anion exchange resin and detect by MALDI directly. In yet another method, one could capture the biomarkers on an immuno-chromatographic resin that comprises antibodies that bind the biomarkers, wash the resin to remove unbound material, elute the biomarkers from the resin and detect the eluted biomarkers by MALDI or by SELDI. In yet another method, one could isolate the biomarkers using gel elecrophoresis and detect the biomarkers by MALDI OR SELDI.
Data Analysis
Analysis of analytes by time-of-flight mass spectrometry generates a time-of-flight spectrum. The time-of-flight spectrum ultimately analyzed typically does not represent the signal from a single pulse of ionizing energy against a sample, but rather the sum of signals from a number of pulses. This reduces noise and increases dynamic range. This time-of-flight data is then subject to data processing. In Ciphergen's ProteinChip® software, data processing typically includes TOF-to-M/Z transformation to generate a mass spectrum, baseline subtraction to eliminate instrument offsets and high frequency noise filtering to reduce high frequency noise.
Data generated by desorption and detection of biomarkers can be analyzed with the use of a programmable digital computer. The computer program analyzes the data to indicate the number of biomarkers detected, and optionally the strength of the signal and the determined molecular mass for each biomarker detected. Data analysis can include steps of determining signal strength of a biomarker and removing data deviating from a predetermined statistical distribution. For example, the observed peaks can be normalized, by calculating the height of each peak relative to some reference.
The computer can transform the resulting data into various formats for display. The standard spectrum can be displayed, but in one useful format only the peak height and mass information are retained from the spectrum view, yielding a cleaner image and enabling biomarkers with nearly identical molecular weights to be more easily seen. In another useful format, two or more spectra are compared, conveniently highlighting unique biomarkers and biomarkers that are up- or down-regulated between samples. Using any of these formats, one can readily determine whether a particular biomarker is present in a sample.
Analysis generally involves the identification of peaks in the spectrum that represent signal from an analyte. Peak selection can be done visually, but software is available, as part of Ciphergen's ProteinChip® software package, that can automate the detection of peaks. In general, this software functions by identifying signals having a signal-to-noise ratio above a selected threshold and labeling the mass of the peak at the centroid of the peak signal. In one useful application, many spectra are compared to identify identical peaks present in some selected percentage of the mass spectra. One version of this software clusters all peaks appearing in the various spectra within a defined mass range, and assigns a mass (M/Z) to all the peaks that are near the mid-point of the mass (M/Z) cluster.
Software used to analyze the data can include code that applies an algorithm to the analysis of the signal to determine whether the signal represents a peak in a signal that corresponds to a biomarker according to the present invention. The software also can subject the data regarding observed biomarker peaks to classification tree or ANN analysis, to determine whether a biomarker peak or combination of biomarker peaks is present that indicates the status of the particular clinical parameter under examination. Analysis of the data can be “keyed” to a variety of parameters that are obtained, either directly or indirectly, from the mass spectrometric analysis of the sample. These parameters include, but are not limited to, the presence or absence of one or more peaks, the shape of a peak or group of peaks, the height of one or more peaks, the log of the height of one or more peaks, and other arithmetic manipulations of peak height data.
General Protocol for SELDI Detection of Biomarkers for Dengue
A preferred protocol for the detection of the biomarkers of this invention is as follows. The biological sample to be tested, e.g., serum, preferably is subject to pre-fractionation before SELDI analysis. This simplifies the sample and improves sensitivity. A preferred method of pre-fractionation involves contacting the sample with an anion exchange chromatographic material, such as Q HyperD (BioSepra, SA). The bound materials are then subject to stepwise pH elution using buffers at pH 9, pH 7, pH 5 and pH 4. (The fractions in which the biomarkers are eluted also are indicated in Tables 1-2, and 4) Various fractions containing the biomarker are collected.
The sample to be tested (preferably pre-fractionated) is then contacted with an affinity capture probe comprising an cation exchange adsorbent (preferably a WCX ProteinChip array (Ciphergen Biosystems, Inc.)) or an IMAC adsorbent (preferably an IMAC3 ProteinChip array (Ciphergen Biosystems, Inc.)). The probe is washed with a buffer that will retain the biomarker while washing away unbound molecules. The biomarkers are detected by laser desorption/ionization mass spectrometry.
Alternatively, if antibodies that recognize the biomarker are available, these can be attached to the surface of a probe, such as a pre-activated PS10 or PS20 ProteinChip array (Ciphergen Biosystems, Inc.). These antibodies can capture the biomarkers from a sample onto the probe surface. Then the biomarkers can be detected by, e.g., laser desorption/ionization mass spectrometry.
Detection by Immunoassay
In another aspect of the invention, the biomarkers of the invention are measured by a method other than mass spectrometry or other than methods that rely on a measurement of the mass of the biomarker. In one such aspect that does not rely on mass, the biomarkers of this invention are measured by immunoassay. Immunoassay requires biospecific capture reagents, such as antibodies, to capture the biomarkers. Antibodies can be produced by methods well known in the art, e.g., by immunizing animals with the biomarkers. Biomarkers can be isolated from samples based on their binding characteristics. Alternatively, if the amino acid sequence of a polypeptide biomarker is known, the polypeptide can be synthesized and used to generate antibodies by methods well known in the art.
This invention contemplates traditional immunoassays including, for example, sandwich immunoassays including ELISA or fluorescence-based immunoassays, as well as other enzyme immunoassays. Nephelometry is an assay done in liquid phase, in which antibodies are in solution. Binding of the antigen to the antibody results in changes in absorbance, which is measured. In the SELDI-based immunoassay, a biospecific capture reagent for the biomarker is attached to the surface of an MS probe, such as a pre-activated ProteinChip array. The biomarker is then specifically captured on the biochip through this reagent, and the captured biomarker is detected by mass spectrometry.
Determination of Subject Dengue Status
Single Markers
The biomarkers of the invention can be used in diagnostic tests to assess dengue status in a subject, e.g., to diagnose Dengue. The phrase “Dengue status” includes any distinguishable manifestation of the disease, including non-disease. For example, disease status includes, without limitation, the presence or absence of disease (e.g., dengue v. non dengue or Dengue v. other disease (e.g., OFIs), the risk of developing disease, the stage of the disease, the progress of disease (e.g., progress of disease or remission of disease over time) and the effectiveness or response to treatment of disease. The status of the subject can inform the practitioner about what status set is being distinguished. For example, a subject that presents with signs of a disease could be classed into Dengue v. non-Dengue disease, while a person exposed to a situation in which Dengue infection is possible and who is presenting with signs of Dengue infection could be classified into Dengue v. non-Dengue. Based on this status, further procedures can be indicated, including additional diagnostic tests or therapeutic procedures or regimens.
The power of a diagnostic test to correctly predict status is commonly measured as the sensitivity of the assay, the specificity of the assay or the area under a receiver operated characteristic (“ROC”) curve. Sensitivity is the percentage of true positives that are predicted by a test to be positive, while specificity is the percentage of true negatives that are predicted by a test to be negative. An ROC curve provides the sensitivity of a test as a function of 1-specificity. The greater the area under the ROC curve, the more powerful the predictive value of the test. Other useful measures of the utility of a test are positive predictive value and negative predictive value. Positive predictive value is the percentage of people who test positive that are actually positive. Negative predictive value is the percentage of people who test negative that are actually negative.
The biomarkers of this invention show a statistical difference in different dengue statuses of at least p≦0.05, p≦10−2, p≦10−3, p≦10−4 or p≦10−5. Diagnostic tests that use these biomarkers alone or in combination show a sensitivity and specificity of at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98% and about 100%.
Each biomarker listed in Tables 1-5 and 17-24 is differentially present in dengue, and, therefore, each is individually useful in aiding in the determination of dengue status. The method involves, first, measuring the selected biomarker in a subject sample using the methods described herein, e.g., capture on a SELDI biochip followed by detection by mass spectrometry and, second, comparing the measurement with a diagnostic amount or cut-off that distinguishes a positive dengue status from a negative dengue status. The diagnostic amount represents a measured amount of a biomarker above which or below which a subject is classified as having a particular dengue status, e.g. DF, DHF, DSS. For example, if the biomarker is up-regulated compared to normal during dengue, then a measured amount above the diagnostic cutoff provides a diagnosis of dengue status. Alternatively, if the biomarker is down-regulated during dengue, then a measured amount below the diagnostic cutoff provides a diagnosis of dengue status. As is well understood in the art, by adjusting the particular diagnostic cut-off used in an assay, one can increase sensitivity or specificity of the diagnostic assay depending on the preference of the diagnostician. The particular diagnostic cut-off can be determined, for example, by measuring the amount of the biomarker in a statistically significant number of samples from subjects with the different dengue statuses, as was done here, and drawing the cut-off to suit the diagnostician's desired levels of specificity and sensitivity.
Combinations of Markers
While individual biomarkers are useful diagnostic biomarkers, it has been found that a combination of biomarkers can provide greater predictive value of a particular status than single biomarkers alone. Specifically, the detection of a plurality of biomarkers in a sample can increase the sensitivity and/or specificity of the test. A combination of at least two biomarkers is sometimes referred to as a “biomarker profile” or “biomarker fingerprint.”
Presence of Dengue
In one aspect, this invention provides methods for determining the presence or absence of dengue in a subject (status: dengue v. non-dengue). The presence or absence of dengue is determined by measuring the relevant biomarker or biomarkers and then either submitting them to a classification algorithm or comparing them with a reference amount and/or pattern of biomarkers that is associated with the particular risk level.
Determining Risk of Developing Disease
In one aspect, this invention provides methods for determining the risk of developing disease in a subject. Biomarker amounts or patterns are characteristic of various risk states, e.g., high, medium or low. The risk of developing a disease is determined by measuring the relevant biomarker or biomarkers and then either submitting them to a classification algorithm or comparing them with a reference amount and/or pattern of biomarkers that is associated with the particular risk level.
Determining Stage of Disease
In one aspect, this invention provides methods for determining the stage of disease in a subject. Each stage of the disease has a characteristic amount of a biomarker or relative amounts of a set of biomarkers (a pattern). The stage of a disease is determined by measuring the relevant biomarker or biomarkers and then either submitting them to a classification algorithm or comparing them with a reference amount and/or pattern of biomarkers that is associated with the particular stage.
Determining Course (Progression/Remission) of Disease
In one aspect, this invention provides methods for determining the course of disease in a subject. Disease course refers to changes in disease status over time, including disease progression (worsening) and disease regression (improvement). Over time, the amounts or relative amounts (e.g., the pattern) of the biomarkers changes. Therefore, the trend of these markers, either increased or decreased over time toward diseased or non-diseased indicates the course of the disease. Accordingly, this method involves measuring one or more biomarkers in a subject at least two different time points, e.g., a first time and a second time, and comparing the change in amounts, if any. The course of disease is determined based on these comparisons.
Subject Management
In certain aspects of the methods of qualifying dengue status, the methods further comprise managing subject treatment based on the status. Such management includes the actions of the physician or clinician subsequent to determining dengue status. For example, if a physician makes a diagnosis of dengue, then a certain regime of treatment, such as prescription or administration of paracetamol, antipyretics or a combination thereof, might follow. Alternatively, a diagnosis of non-dengue might be followed with further testing to determine a specific disease that might the patient might be suffering from. Also, if the diagnostic test gives an inconclusive result on dengue status, further tests can be called for.
The methods described herein can be used in combination with and other tests and/or methods that are used to qualify dengue status in a subject. For example, in certain aspects, the methods described herein are used to determine whether or not a subject has an increased likelihood of having dengue. These methods can be used in combination with other tests that are useful for either diagnosing dengue in a subject or ruling out other diagnoses.
Additional aspects of the invention relate to the communication of assay results or diagnoses or both to technicians, physicians or patients, for example. In certain aspects, computers will be used to communicate assay results or diagnoses or both to interested parties, e.g., physicians and their patients. In some aspects, the assays will be performed or the assay results analyzed in a country or jurisdiction which differs from the country or jurisdiction to which the results or diagnoses are communicated.
In a preferred aspect of the invention, a diagnosis based on the presence or absence in a test subject of any the biomarkers of Table 1-5, and 17-24 is communicated to the subject as soon as possible after the diagnosis is obtained. The diagnosis can be communicated to the subject by the subject's treating physician. Alternatively, the diagnosis can be sent to a test subject by email or communicated to the subject by phone. A computer can be used to communicate the diagnosis by email or phone. In certain aspects, the message containing results of a diagnostic test can be generated and delivered automatically to the subject using a combination of computer hardware and software which will be familiar to artisans skilled in telecommunications. One example of a healthcare-oriented communications system is described in U.S. Pat. No. 6,283,761; however, the present invention is not limited to methods which utilize this particular communications system. In certain aspects of the methods of the invention, all or some of the method steps, including the assaying of samples, diagnosing of diseases, and communicating of assay results or diagnoses, can be carried out in diverse (e.g., foreign) jurisdictions.
Determining Therapeutic Efficacy of Pharmaceutical Drug
In another aspect, this invention provides methods for determining the therapeutic efficacy of a pharmaceutical drug. These methods are useful in performing clinical trials of the drug, as well as monitoring the progress of a patient on the drug. Therapy or clinical trials involve administering the drug in a particular regimen. The regimen can involve a single dose of the drug or multiple doses of the drug over time. The doctor or clinical researcher monitors the effect of the drug on the patient or subject over the course of administration. If the drug has a pharmacological impact on the condition, the amounts or relative amounts (e.g., the pattern or profile) of the biomarkers of this invention changes toward a non-disease profile. One can follow the course of the amounts of these biomarkers in the subject during the course of treatment. Accordingly, this method involves measuring one or more biomarkers in a subject receiving drug therapy, and correlating the amounts of the biomarkers with the disease status of the subject. One aspect of this method involves determining the levels of the biomarkers at least two different time points during a course of drug therapy, e.g., a first time and a second time, and comparing the change in amounts of the biomarkers, if any. For example, the biomarkers can be measured before and after drug administration or at two different time points during drug administration. The effect of therapy is determined based on these comparisons. If a treatment is effective, then the biomarkers will trend toward normal, while if treatment is ineffective, the biomarkers will trend toward disease indications. If a treatment is effective, then the biomarkers will trend toward normal, while if treatment is ineffective, the biomarkers will trend toward disease indications.
Generation of Classification Algorithms for Qualifying Dengue Status
In some aspects, data derived from the spectra (e.g., mass spectra or time-of-flight spectra) that are generated using samples such as “known samples” can then be used to “train” a classification model. A “known sample” is a sample that has been pre-classified. The data that are derived from the spectra and are used to form the classification model can be referred to as a “training data set.” Once trained, the classification model can recognize patterns in data derived from spectra generated using unknown samples. The classification model can then be used to classify the unknown samples into classes. This can be useful, for example, in predicting whether or not a particular biological sample is associated with a certain biological condition (e.g., diseased versus non-diseased).
The training data set that is used to form the classification model can comprise raw data or pre-processed data. In some aspects, raw data can be obtained directly from time-of-flight spectra or mass spectra, and then can be optionally “pre-processed” as described above.
Classification models can be formed using any suitable statistical classification (or “learning”) method that attempts to segregate bodies of data into classes based on objective parameters present in the data. Classification methods can be either supervised or unsupervised. Examples of supervised and unsupervised classification processes are described in Jain, “Statistical Pattern Recognition: A Review”, IEEE Transactions on Pattern Analysis and Machine Intelligence, Vol. 22, No. 1, January 2000, the teachings of which are incorporated by reference.
In supervised classification, training data containing examples of known categories are presented to a learning mechanism, which learns one or more sets of relationships that define each of the known classes. New data can then be applied to the learning mechanism, which then classifies the new data using the learned relationships. Examples of supervised classification processes include linear regression processes (e.g., multiple linear regression (MLR), partial least squares (PLS) regression and principal components regression (PCR)), binary decision trees (e.g., recursive partitioning processes such as CART—classification and regression trees), artificial neural networks such as back propagation networks, discriminant analyses (e.g., Bayesian classifier or Fischer analysis), logistic classifiers, and support vector classifiers (support vector machines).
A preferred supervised classification method is a recursive partitioning process. Recursive partitioning processes use recursive partitioning trees to classify spectra derived from unknown samples. Further details about recursive partitioning processes are provided in U.S. Patent Application No. 2002 0138208 A1 to Paulse et al., “Method for analyzing mass spectra.”
In other aspects, the classification models that are created can be formed using unsupervised learning methods. Unsupervised classification attempts to learn classifications based on similarities in the training data set, without pre-classifying the spectra from which the training data set was derived. Unsupervised learning methods include cluster analyses. A cluster analysis attempts to divide the data into “clusters” or groups that ideally should have members that are very similar to each other, and very dissimilar to members of other clusters. Similarity is then measured using some distance metric, which measures the distance between data items, and clusters together data items that are closer to each other. Clustering techniques include the MacQueen's K-means algorithm and the Kohonen's Self-Organizing Map algorithm.
Learning algorithms asserted for use in classifying biological information are described, for example, in PCT International Publication No. WO 01/31580 (Barnhill et al., “Methods and devices for identifying patterns in biological systems and methods of use thereof”), U.S. Patent Application No. 2002 0193950 A1 (Gavin et al., “Method or analyzing mass spectra”), U.S. Patent Application No. 2003 0004402 A1 (Hitt et al., “Process for discriminating between biological states based on hidden patterns from biological data”), and U.S. Patent Application No. 2003 0055615 A1 (Zhang and Zhang, “Systems and methods for processing biological expression data”).
The classification models can be formed on and used on any suitable digital computer. Suitable digital computers include micro, mini, or large computers using any standard or specialized operating system, such as a Unix, Windows™ or Linux™ based operating system. The digital computer that is used can be physically separate from the mass spectrometer that is used to create the spectra of interest, or it can be coupled to the mass spectrometer.
The training data set and the classification models according to aspects of the invention can be embodied by computer code that is executed or used by a digital computer. The computer code can be stored on any suitable computer readable media including optical or magnetic disks, sticks, tapes, and the like, and can be written in any suitable computer programming language including C, C++, visual basic, and the like
The learning algorithms described above are useful both for developing classification algorithms for the biomarkers already discovered, or for finding new biomarkers for dengue. The classification algorithms, in turn, form the base for diagnostic tests by providing diagnostic values (e.g., cut-off points) for biomarkers used singly or in combination.
Compositions of Matter
In another aspect, this invention provides compositions of matter based on the biomarkers of this invention.
In one aspect, this invention provides biomarkers of this invention in purified form. Purified biomarkers have utility as antigens to raise antibodies. Purified biomarkers also have utility as standards in assay procedures. As used herein, a “purified biomarker” is a biomarker that has been isolated from other proteins and peptdies, and/or other material from the biological sample in which the biomarker is found. Biomarkers can be purified using any method known in the art, including, but not limited to, mechanical separation (e.g., centrifugation), ammonium sulphate precipitation, dialysis (including size-exclusion dialysis), size-exclusion chromatography, affinity chromatography, anion-exchange chromatography, cation-exchange chromatography, and methal-chelate chromatography. Such methods can be performed at any appropriate scale, for example, in a chromatography column, or on a biochip.
In another aspect, this invention provides a biospecific capture reagent, optionally in purified form, that specifically binds a biomarker of this invention. In one aspect, the biospecific capture reagent is an antibody. Such compositions are useful for detecting the biomarker in a detection assay, e.g., for diagnostics.
In another aspect, this invention provides an article comprising a biospecific capture reagent that binds a biomarker of this invention, wherein the reagent is bound to a solid phase. For example, this invention contemplates a device comprising bead, chip, membrane, monolith or microtiter plate derivatized with the biospecific capture reagent. Such articles are useful in biomarker detection assays.
In another aspect this invention provides a composition comprising a biospecific capture reagent, such as an antibody, bound to a biomarker of this invention, the composition optionally being in purified form. Such compositions are useful for purifying the biomarker or in assays for detecting the biomarker.
In another aspect, this invention provides an article comprising a solid substrate to which is attached an adsorbent, e.g., a chromatographic adsorbent or a biospecific capture reagent, to which is further bound a biomarker of this invention. In one aspect, the article is a biochip or a probe for mass spectrometry, e.g., a SELDI probe. Such articles are useful for purifying the biomarker or detecting the biomarker.
Kits for Detection of Biomarkers for Dengue
In another aspect, the present invention provides kits for qualifying dengue status, which kits are used to detect biomarkers according to the invention. In one aspect, the kit comprises a solid support, such as a chip, a microtiter plate or a bead or resin having a capture reagent attached thereon, wherein the capture reagent binds a biomarker of the invention. Thus, for example, the kits of the present invention can comprise mass spectrometry probes for SELDI, such as ProteinChip® arrays. In the case of biospecfic capture reagents, the kit can comprise a solid support with a reactive surface, and a container comprising the biospecific capture reagent.
The kit can also comprise a washing solution or instructions for making a washing solution, in which the combination of the capture reagent and the washing solution allows capture of the biomarker or biomarkers on the solid support for subsequent detection by, e.g., mass spectrometry. The kit can include more than type of adsorbent, each present on a different solid support.
In a further aspect, such a kit can comprise instructions for suitable operational parameters in the form of a label or separate insert. For example, the instructions can inform a consumer about how to collect the sample, how to wash the probe or the particular biomarkers to be detected.
In yet another aspect, the kit can comprise one or more containers with biomarker samples, to be used as standard(s) for calibration.
Use of Biomarkers for Dengue in Screening Assays and Methods of Treating Dengue
The methods of the present invention have other applications as well. For example, the biomarkers can be used to screen for compounds that modulate the expression of the biomarkers in vitro or in vivo, which compounds in turn can be useful in treating or preventing dengue in patients. In another example, the biomarkers can be used to monitor the response to treatments for dengue. In yet another example, the biomarkers can be used in heredity studies to determine if the subject is at risk for developing dengue.
Thus, for example, the kits of this invention could include a solid substrate having a hydrophobic function, such as a protein biochip (e.g., a Ciphergen H50 ProteinChip array, e.g., ProteinChip array) and a sodium acetate buffer for washing the substrate, as well as instructions providing a protocol to measure the biomarkers of this invention on the chip and to use these measurements to diagnose dengue.
Compounds suitable for therapeutic testing can be screened initially by identifying compounds which interact with one or more biomarkers listed in Table 1, 2, 3, 4, 5, 17, 18, 19, 20, 21, 22, 23, or 24. By way of example, screening might include recombinantly expressing a biomarker listed in Table 1, 2, 3, 4, 5, 17, 18, 19, 20, 21, 22, 23, or 24, purifying the biomarker, and affixing the biomarker to a substrate. Test compounds would then be contacted with the substrate, typically in aqueous conditions, and interactions between the test compound and the biomarker are measured, for example, by measuring elution rates as a function of salt concentration. Certain proteins can recognize and cleave one or more biomarkers of Table 1, 2, 3, 4, 5, 17, 18, 19, 20, 21, 22, 23, or 24, in which case the proteins can be detected by monitoring the digestion of one or more biomarkers in a standard assay, e.g., by gel electrophoresis of the proteins.
In a related aspect, the ability of a test compound to inhibit the activity of one or more of the biomarkers of Table 1, 2, 3, 4, 5, 17, 18, 19, 20, 21, 22, 23, or 24, can be measured. One of skill in the art will recognize that the techniques used to measure the activity of a particular biomarker will vary depending on the function and properties of the biomarker. For example, an enzymatic activity of a biomarker can be assayed provided that an appropriate substrate is available and provided that the concentration of the substrate or the appearance of the reaction product is readily measurable. The ability of potentially therapeutic test compounds to inhibit or enhance the activity of a given biomarker can be determined by measuring the rates of catalysis in the presence or absence of the test compounds. The ability of a test compound to interfere with a non-enzymatic (e.g., structural) function or activity of one of the biomarkers of Table 1, 2, 3, 4, 5, 17, 18, 19, 20, 21, 22, 23, or 24, can also be measured. For example, the self-assembly of a multi-protein complex which includes one of the biomarkers of Table 1, 2, 3, 4, 5, 17, 18, 19, 20, 21, 22, 23, or 24, can be monitored by spectroscopy in the presence or absence of a test compound. Alternatively, if the biomarker is a non-enzymatic enhancer of transcription, test compounds which interfere with the ability of the biomarker to enhance transcription can be identified by measuring the levels of biomarker-dependent transcription in vivo or in vitro in the presence and absence of the test compound.
Test compounds capable of modulating the activity of any of the biomarkers of Table 1, 2, 3, 4, 5, 17, 18, 19, 20, 21, 22, 23, or 24, can be administered to patients who are suffering from or are at risk of developing dengue. For example, the administration of a test compound which increases the activity of a particular biomarker can decrease the risk of dengue in a patient if the activity of the particular biomarker in vivo prevents the accumulation of proteins for dengue. Conversely, the administration of a test compound which decreases the activity of a particular biomarker can decrease the risk of dengue in a patient if the increased activity of the biomarker is responsible, at least in part, for the onset of dengue.
In an additional aspect, the invention provides a method for identifying compounds useful for the treatment of disorders such as dengue which are associated with increased levels of modified forms of the biomarkers in Table 1, 2, 3, 4, 5, 17, 18, 19, 20, 21, 22, 23, or 24. For example, in one aspect, cell extracts or expression libraries can be screened for compounds which catalyze the cleavage of a full-length biomarker to form truncated forms of the biomarker. In one aspect of such a screening assay, cleavage of the biomarker can be detected by attaching a fluorophore to the biomarker which remains quenched when the biomarker is uncleaved but which fluoresces when the protein is cleaved. Alternatively, a version of full-length biomarker modified so as to render the amide bond between amino acids x and y uncleavable can be used to selectively bind or “trap” the cellular protesase which cleaves full-length biomarker at that site in vivo. Methods for screening and identifying proteases and their targets are well-documented in the scientific literature, e.g., in Lopez-Ottin et al. (Nature Reviews, 2002, 3:509-519).
In yet another aspect, the invention provides a method for treating or reducing the progression or likelihood of a disease, e.g., dengue, which is associated with the increased levels of a truncated biomarker. For example, after one or more proteins have been identified which cleave the full-length biomarker, combinatorial libraries can be screened for compounds which inhibit the cleavage activity of the identified proteins. Methods of screening chemical libraries for such compounds are well-known in art. See, e.g., Lopez-Otin et al. (2002). Alternatively, inhibitory compounds can be intelligently designed based on the structure of the biomarker.
At the clinical level, screening a test compound includes obtaining samples from test subjects before and after the subjects have been exposed to a test compound. The levels in the samples of one or more of the biomarkers listed in Table 1, 2, 3, 4, 5, 17, 18, 19, 20, 21, 22, 23, or 24, can be measured and analyzed to determine whether the levels of the biomarkers change after exposure to a test compound. The samples can be analyzed by mass spectrometry, as described herein, or the samples can be analyzed by any appropriate means known to one of skill in the art. For example, the levels of one or more of the biomarkers listed in Table 1, 2, 3, 4, 5, 17, 18, 19, 20, 21, 22, 23, or 24, can be measured directly by Western blot using radio- or fluorescently-labeled antibodies which specifically bind to the biomarkers. Alternatively, changes in the levels of mRNA encoding the one or more biomarkers can be measured and correlated with the administration of a given test compound to a subject. In a further aspect, the changes in the level of expression of one or more of the biomarkers can be measured using in vitro methods and materials. For example, human tissue cultured cells which express, or are capable of expressing; one or more of the biomarkers of Table 1, 2, 3, 4, 5, 17, 18, 19, 20, 21, 22, 23, or 24, can be contacted with test compounds. Subjects who have been treated with test compounds will be routinely examined for any physiological effects which can result from the treatment. In particular, the test compounds will be evaluated for their ability to decrease disease likelihood in a subject. Alternatively, if the test compounds are administered to subjects who have previously been diagnosed with dengue, test compounds will be screened for their ability to slow or stop the progression of the disease.
The invention will be further described with reference to the following exemplary aspects; however, it is to be understood that the invention is not limited to such exemplary aspects.
Below are examples of specific aspects for carrying out the present invention. The examples are offered for illustrative purposes only, and are not intended to limit the scope of the present invention in any way. Efforts have been made to ensure accuracy with respect to numbers used (e.g., amounts, temperatures, and the like), but some experimental error and deviation should, of course, be allowed for.
The practice of the present invention will employ, unless otherwise indicated, conventional methods of protein chemistry, biochemistry, recombinant DNA techniques and pharmacology, within the skill of the art. Such techniques are explained fully in the literature. See, e.g., T. E. Creighton, Proteins: Structures and Molecular Properties (W.H. Freeman and Company, 1993); A. L. Lehninger, Biochemistry (Worth Publishers, Inc., current addition); Sambrook, et al., Molecular Cloning: A Laboratory Manual (2nd Edition, 1989); Methods In Enzymology (S. Colowick and N. Kaplan eds., Academic Press, Inc.); Remington's Pharmaceutical Sciences, 18th Edition (Easton, Pa.: Mack Publishing Company, 1990); Carey and Sundberg Advanced Organic Chemistry 3rd Ed. (Plenum Press) Vols A and B(1992).
Two complimentary approaches to identifying potential biomarkers for the diagnosis or prognosis of dengue have been taken: 1) SELDI-based and 2) gel-based. Based on estimated molecular weight, there is an overlap of biomarkers identified by both approaches (Tables 1-5). Similar methods for the discovery of biomarkers for babesia were used in U.S. provisional application Ser. No. 60/749,449 filed on Dec. 12, 2005 and U.S. provisional application Ser. No. 60/752,285 filed on Dec. 20, 2005, both of which are herein incorporated by reference for all purposes. The discovered biomarkers are shown in Tables 1-5, and Tables 17-24.
Sample Collection
Sample collection was previously peformed by Takol.
Plasma samples from pediatric That patients were obtained. For each dengue infected patient, 3 blood samples were taken at 3 different time points: t1 (1st day of admission), t2 (fever decreased to normal), t3 (convalescence stage 30 days after admission). Each probable dengue diagnosis was confirmed and the serotype as well as the type (primary or secondary) of the infection recorded. Samples of patients with other febrile illnesses (OFIs) were also collected to be used as controls. The samples were stored at −80° C. (Table 6). Table 6 shows a list of specimens collected in Thailand from pediatric patients. The list of 15 controls is not included.
Preparation and Fractionation of Serum Samples
Preparation and fractionation of serum samples was previously performed by Takol.
Fractionation of serum samples was performed with the use of the Biomek 2000 Laboratory Automation Workstation (Beckman Coulter, USA) using software protocols provided by Ciphergen (Ciphergen Biosystems, Fremont, Calif., USA). An Expression Difference Mapping Kit (Ciphergen Biosystems, Fremont, Calif., USA) was also used according to the manufacturer's instructions. Six fractions obtained through Fractionation of serum samples was performed with the use of the Biomek 2000 Laboratory Automation Workstation (Beckman Coulter, USA) using software protocols provided by Ciphergen (Ciphergen Biosystems, Fremont, Calif., USA). An Expression Difference Mapping Kit (Ciphergen Biosystems, Fremont, Calif., USA) was also used according to the manufacturer's instructions. Six fractions obtained through isoelectric point separation were obtained and collected using different buffers: F1 (pH 9), F2 (pH 7), F3 (pH5), F4 (pH 4), F5 (pH 3), F6 (organic solvent). The fractions were stored at −80° C.
SELDI Analysis
Protein Binding Using ProteinChip Arrays
Protein binding using ProteinChip Arrays was previously performed by Takol.
The following chip binding protocol was followed and the samples were processed using an IMAC-3 ProteinChip Array according to the protocol below:
Chip Binding Protocol
Weak Cation Exchange (WCX2) ProteinChip Array
Materials:
Bioprocessor
WCX-2 chip
Vortex
CM low stringency buffer
Deionized water
EAM solution
Processing Samples Using an IMAC-3 ProteinChip Array
Material:
Bioprocessors
IMAC Chips
Pap Pen
Votex (VWR VX-2500 Multitube Vortexer)
IMAC3 Chip Buffer:
A) Binding Buffer: 100 mM Sodium Phosphate+0.5M NaCl pH 7.0+0.1% Triton X
B) Charging Buffer (Copper): 100 mM CuSO4+0.1% Triton X 20
C) Neutralizing Buffer:100 mM NaAcetate pH 4.0+0.1% Triton X 20
Protein binding to ProteinChip Arrays was performed using the Biomek 2000 Laboratory Automation Workstation (Beckman Coulter) and protein binding software protocols provided by Ciphergen Biosystems. Immobilized affinity capture (IMAC3), weak cation-exchange (CM10) and hydrophobic (H50) ProteinChip Array types (eight spot format) were used (Ciphergen Biosystems). ProteinChip arrays were analyzed in the ProteinChip Biology System reader (model PBS IIc, Ciphergen Biosystems).
To initially compare data between different diseases tested, arrays were read at low (intensity=175, sensitivity=7, optimization range=2000-20,000 Da, high range=50,000 Da) and high (intensity=175, sensitivity=8, optimization range=20,000-50,000 Da, high range=150,000 Da) laser settings. The data was analyzed using ProteinChip software (version 3.2.1) and Ciphergen Express Data Manager (version 2.1) (Ciphergen Biosystems).
All data were imported into Ciphergen Express (CE) and grouped according to each condition (e.g., DHF fraction 1 bound to a WCX2 array, read at low laser intensity). Each data set was calibrated using an equation generated from a spectrum of protein standards, which were collected at the same laser intensity as the collected sample data.
The Baseline for all data was set at 15, and Noise set at 2000 Da (for arrays read at low laser energy) or 10,000 Da (high laser energy). Sample spectra for each group were normalized using a specific set of conditions. Arrays read at low laser intensity were normalized between 2000-100,000 Da, and 10,000-200,000 Da for high laser intensity. An external normalization coefficient of 0.2 was applied for both conditions. As a quality control measure for the comparison of spectra processed on different days, the average normalization factor was first calculated for all spectra within the condition. Any spectra that did not fall within twice the overall average normalization factor were discarded from the analysis.
Peak and Cluster detection (EDM) was then performed for both low and high laser intensities for each sample condition. A distinct set of variables were set for each of the samples collected depending on if they were obtained using low or high laser intensity.
The first set of comparisons was carried between control1 and 1DF1 and 1DHF1, control2 and 1DF2 and 1DHF2, 1DF1 and 1DHF1, 1DF2 and 1DHF2, 1DF3 and 1DHF3 plasma samples. After the first-pass analysis, all clusters found to have a p-value ≦0.05 were visually inspected for peak quality. High quality protein peaks were manually relabelled. A second-pass analysis was carried out; the EDM was run again using only user-detected peaks. Using Biomarker Pattern Software (BPS), a decision analysis software, combination of these candidate biomarkers was determined as well as their specificity and sensitivity using pooled data from 1DF1, 1DF2, 1DHF1 and 1DHF2 versus pooled data from control 1 and 2. These candidate biomarkers represent potential diagnostic biomarkers.
A second set of comparisons was carried out between secondary 2DF1 and 2DF1, 2DF2 and 2DHF2, 2DF3 and 2DHF3. The same first- and second-pass analysis protocol was followed with the same p-value limit.
Since the samples from primary and secondary infections were carried on 2 separate bioprocessors on 2 different days, the quality method described above was applied before the following analyses were carried out. A third set of comparisons was carried out between primary and secondary DF at each 3 time point as well as between primary and secondary DHF at each 3 time points. A comparison between control1 and 2DF1 and 2DHF1 as well as between control2 and 2DF2 and 2DHF2 was also carried. The same first- and second-pass analysis protocol was followed but only clusters found to have a p-value ≦0.005 were kept. BPS analysis was also carried using the same comparisons above.
ZOOM Fractionation and SDS PAGE
Control 1 and 2 samples were pooled together and 1DF1,2 samples were pooled with 1DHF1,2. The plasma samples were prepared following Invitrogen's recommendations. 650 μl of the prepared samples were dispensed in 5 of the ZOOM® IEF Fractionator chambers. The ZOOM was run under standard conditions (100V for 20 min, 200V for 80 min, and 600V for 80 min). Once completed, the fractions from each chamber were kept at −20° C.
40 μl of for each fraction was desalted. Each aliquot was run on a Denaturing 4-12% Bis-Tris NuPAGE Gel Electrophoresis using Mark12 MW Marker 1× (Invitrogen) as the molecular weight ladder. The gel was run at 200V for 45 min with an expected current of 100-125 mA at the beginning and 60-80 mA towards the end. The gel was stained using a Coomassie stain for 2 days. It was destained with MiliQ water until band visualization was satisfying. The gel was kept in acetic acid. The candidate biomarkers were cut and kept in 2% acetic acid tubes and were sent for sequencing using mass spectrometry. Tables 1-2 and
While the invention has been particularly shown and described with reference to a preferred aspect and various alternate aspects, it will be understood by persons skilled in the relevant art that various changes in form and details can be made therein without departing from the spirit and scope of the invention.
All references, issued patents and patent applications cited within the body of the instant specification are hereby incorporated by reference in their entirety, for all purposes.
Each recited range includes all combinations and sub-combinations of ranges, as well as specific numerals contained therein.
All publications and patent applications cited in this specification are herein incorporated by reference in their entirety for all purposes as if each individual publication or patent application were specifically and individually indicated to be incorporated by reference for all purposes.
Although the foregoing invention has been described in detail by way of example for purposes of clarity of understanding, it will be apparent to the artisan that certain changes and modifications are comprehended by the disclosure and can be practiced without undue experimentation within the scope of the appended claims, which are presented by way of illustration not limitation.
Filing Document | Filing Date | Country | Kind | 371c Date |
---|---|---|---|---|
PCT/IB2009/007358 | 10/14/2009 | WO | 00 | 10/11/2011 |
Number | Date | Country | |
---|---|---|---|
61105381 | Oct 2008 | US |