The invention relates to detecting disease in animals and humans; and particularly the invention provides a method and system for constructing a diagnosis for one or more particular health affections using a plurality of biomarkers, and further using the diagnosis to differentiate between the several underlying health affections regardless of the similarity in the apparent symptoms, and detect the propensity of an animal or human to develop a health affection.
The level of biomarkers in body fluids is used in the process of detecting numerous health affections. Measuring the level of one or more specific biomarkers in the blood is typically a fast and relatively inexpensive means for diagnosing a disease or leading to the diagnosis thereof before prescribing other (more expensive and/or time consuming) tests such as radiological, cytological, histological and immunological tests etc.
However, many challenges face a medical (or veterinary) practitioner in selecting a panel of biomarkers to be tested for any specific case, and then in interpreting the results of the measured level of each biomarker in view of the symptoms the patient is exhibiting. The symptoms are generally only broad indicators of any particular disease, since some diseases, such as infection-related diseases, trigger symptoms in a patient at the onset of the disease or shortly thereafter, while other diseases, such as many types of cancer, trigger symptoms a considerable time after a tumor starts to develop. Diagnosing the underlying disease often requires testing for a panel of biomarkers, where some tests may be conducted to confirm a suspicion of a disease, while other tests may be conducted to rule out one or more other diseases.
Along with the challenge of choosing the panel of biomarkers for testing, there is the challenge of interpreting the test results. Some biomarkers are proven to be tightly correlated with the presence of a certain disease, while other biomarkers provide only a probability level a patient has a particular disease. For example, in humans sustained hyperglycemia is a good indicator of diabetes, while, for example, in dogs an increased level of thymidine kinase above a given threshold is an indicator of Hemangiosarcoma, which should be confirmed through histology.
Moreover, screening, diagnosing and/or monitoring a disease may involve any number of tests. According to current methods and systems, when using biomarkers the broader are the symptoms, the more tests are carried out. A practitioner uses his/her own experience to interpret the test results when using biomarkers to detect a disease, to follow the progression of a disease and/or to monitor the result of a treatment. The latter introduces a level of subjectivity in diagnosing test results, which may cause discrepancies between interpretations by the same person over time, among practitioners and even among entire health institutions.
Therefore, there is a need for a method and system for selecting a set of biomarkers and developing a method of use for detecting one or more target diseases and differentiating between the diseases to help a practitioner interpret the test results and potentially reveal the underlying affection or the propensity of a patient to develop a given disease.
The invention provides a method and system for constructing a medical (or veterinary) diagnosis using a plurality of biomarkers to reveal one or more particular health affections, and further to differentiate a particular underlying health affection regardless of the similarity in the apparent symptoms with other health affections. The diagnosis may also be used to reveal the propensity of an animal or human to develop a health affection at a later stage in life. The invention may be practiced in humans or animals such as domestic animal species.
Constructing a diagnosis method, according to the invention, starts by defining a target set of health affections in humans or an animal specie, selecting a set of biomarkers and measuring their levels in a group of subjects. The invention teaches how to compute a numerical value, i.e. an index, using the biomarker levels, then define ranges of the index on a scale, where each range may be matched with a subgroup of subjects segregated on the basis of their health status. Provided the latter method of segregation of subjects by health status, a practitioner may subsequently measure the level of biomarkers of a particular diagnosis in a patient, compute the index value for that particular patient and determine the status (or the type) of the underlying health affection of the patient by comparing the index value of the subject to the predefined ranges of the index.
A system according to the invention may be implemented as a computer program configured to receive input data (e.g., biomarker data and health status data etc.), and determine ranges for a particular diagnosis. The computer system may also receive the input for a particular patient, compute the index value and output the result of the diagnosis. The system may stand alone or be embedded in any diagnosis machine.
Currently, practitioners are faced with the difficulty of interpreting the results of biomarker data, particularly when comparing the progress of a disease, such as while monitoring a subject with a particular propensity of having a disease, or monitoring the health progress of patients following a treatment. The invention provides the latter practitioners tools for diagnosing an underlying health affection and monitoring the progress of a disease using numerical indicators for any particular situation.
The invention provides a method and system for detecting disease in animals and humans. The invention provides a method and system for developing and using a diagnosis targeting a particular one or more health conditions using a plurality of biomarkers, and further using the diagnosis to differentiate between several affections that may trigger similar health symptoms in a patient. Furthermore, the method and system according to the invention may be used to determine the propensity for an individual to develop a disease (e.g., one or more types of cancer), providing a practitioner the means for detecting a disease before the symptoms are visible and/or monitoring a disease post treatment.
In the following description, numerous specific details are set forth to provide a more thorough description of the invention. It will be apparent, however, to one skilled in the pertinent art, that the invention may be practiced without these specific details. In other instances, well known features have not been described in detail so as not to obscure the invention. The claims following this description are what define the metes and bounds of the invention.
Portions of the methods and systems disclosed in the present disclosure have been disclosed in co-pending patent application number (to be determined), co-pending patent application number (to be determined) and co-pending patent application number (to be determined).
Throughout the description, the terms individual, subject or patient may refer to an animal subject or a person whose biological data are used to develop and/or use an implementation of the invention. The subject may be normal (or disease-free) or showing any level of symptoms.
The term biomarker refers to any indicator in any body part (e.g., bodily fluid or tissue) that may be collected and the presence of which measured through any of its manifestations such as enzymatic activity, mass, concentration, cell count, cell shrinkage/shape, deoxyribonucleic acid (DNA) and/or ribonucleic acid (RNA) genetic level of expression or any aspect of the biochemical or the physiological markers that may be related to one or more health conditions. Moreover, for the purpose of designing health status indices (see below) a biomarker data may be any related data that may be considered for diagnosing a disease (or the probability of occurrence thereof) such as age, sex, any biometric data, genetic history (e.g., parent's health status or presence of any affection in the family) or any other data that may contribute to the diagnosis of a disease.
The term “index” is used throughout the disclosure to refer to a dependent variable that is calculated using two or more data inputs such as the level of a biomarker in the blood stream. A “neoplasia index” refers to an index that is computed with the goal of classifying subjects into groups based on cancer status. For example, a subject that may be apparently healthy (e.g., showing no signs of cancer), diagnosed with a malignant or a benign cancer or in any health status with regard to cancer, would have an neoplasia index value that reflects the health status, in accordance with embodiments of the invention.
The term “user” may be used to refer to a person, machine or a computer program acting as or on behalf of a person.
The main concept of the invention is that by selecting a set of biomarkers and measuring their levels in an animal or a human subject, it is possible to compute a numerical value, i.e. an index, using those levels, and to compare the index value to a predefined scale that characterizes the health status of the subject using index ranges. The scale may define two or more ranges of the index values, wherein each range indicates a level of one or more diseases. For example, in a subject showing general symptoms of inflammation, a predefined scale may define two or more ranges that may indicate the presence of cancer, an infection, both cancer and infection or other diseases.
The invention teaches developing any particular diagnosis by selecting a set of biomarkers, then measuring the level of each biomarker from each individual of a sample group of subjects. In addition, other diagnoses (e.g., cytological, histological and physiological tests, physical examination etc.) are carried out on the subjects to accurately establish the health status of each subject.
According to embodiments of the invention, the biomarkers data serve to compute the index values, while the health status data serve to define two or more health status categories (e.g., healthy, cancer, benign tumor, infection etc.). Ranges of index values are then defined providing efficient segregation of subjects into the two or more health status categories.
Subsequently, to provide a diagnosis to a patient, a set of biomarkers according to a particular diagnosis is collected and measured, then an index value is computed using the test data, and the index value is compared to the predefined index scale to match a health status category, which reveals the patient's health status.
Step 120 represents selecting a set of biomarkers for use in the diagnosis. Selecting a set of biomarkers may be based on previous knowledge of a correlation (be it positive or negative) between the level of a given biomarkers and the presence (or absence) of the one or more target diseases. For example, thymidine kinase may be used as a biomarker to detect any type of cancer since thymidine kinase is typically present in the cells undergoing cell division, which is the case of cancerous cells.
Step 130 represents collecting data from a group of subjects. The group of subjects may be a sample of subjects comprising normal subjects (i.e. healthy) showing none of the symptoms defined in Step 110, and affected subjects showing any level of severity of those symptoms. Bodily fluids, tissue or any other body sample may be appropriately collected in order to measure the level of each biomarker of the set of biomarkers defined at step 120.
In addition, the subjects undergo a plurality of tests, such as histological, radiological tests or any other test designed to establish the presence or absence of the target disease(s). Other tests may be conducted on each subject to either further confirm the disease or rule out other diseases that may share common symptoms.
Moreover, other non-disease related data may also be considered. The latter data comprise age, sex, any biometric data, genetic history (e.g., parent's health status or presence of any affection in the family) or any other data that may contribute to the diagnosis of a disease.
The outcome of step 130 is a set of data points that characterizes each subject individual data and its level of each biomarker in the set of data, and a health status that establishes whether each subject is a non-carrier or a carrier of one or more diseases and eventually the stage (or severity) of each disease. For example, when considering cancer, a subject may be classified as non-carrier of cancer, having a benign tumor, an early cancer stage or advanced cancer stage, and any given type of cancer. In the latter example, the set of biomarkers may comprise thymidine kinase, C-reactive protein and/or any other biomarker selected at step 110 to include in the development of the diagnosis method. The level of each biomarker may be expressed in any unit that characterizes the presence of the biomarker in the body. Thus, an enzyme may be characterized by the level of its enzymatic activity, a protein, a hormone or any other biomarker may be expressed by a concentration level such as its mass or moles per volume of tissue or bodily fluid.
Step 140 represents the process of finding range values for each biomarker. For example, when considering thymidine kinase as a biomarker for cancer, a first range of zero units per liter (0 U/l) through five units per liter (5 U/l), a second range of five units per liter (5 U/l) through eight units per liter (8 U/l), and a third range of eight units per liter (8 U/l) and above may be defined as ranges for that specific biomarker.
Step 140 also involves discretizing the data, which comprises attributing a score number to each previously defined range of a biomarker level. Using the example of the three (3) thymidine kinase ranges above, the first range may be attributed the value zero (0), the second range may be attributed the value one (1) and the third range may be attributed the value two (2).
The discretization may be carried on other non-disease related data such as age. In the latter example, age may be selected for the diagnosis as a factor in the increase of the probability of having a target affection. Thus, age may be discretized such that a person of 0 to 20 years of age is attributed a value of “0”, a person of 20 to 40 years of age may be attributed a value “1” and a person over 40 years of age may be attributed a value of “2”. Sex may be discretized as “1” and “0” for female and male, respectively.
Step 150 represents computing an index value for each subject as follows:
where the index value “I” for each subject may be the sum of the product of the score level “L” (e.g., computed at step 140) and a coefficient “C” associated with the “ith” data input for a number “N” of data inputs (e.g., biomarker level, age, biometric data etc.). The coefficient “C” may be determined empirically as shown below at steps 160 and 170.
Step 160 represents applying one or more methods for segregating subjects using the health status data and the computed index values. For example, the method of segregation may be the Receiver Operating Characteristic (ROC) curve analysis. ROC curve analysis is a well known method in the medical field for determining whether a correlation between the level of a biomarker may serve as an indicator of the presence of a health condition. The latter is possible for example when there is a strong correlation between the amount of a substance in the body (e.g., high cholesterol) and a health condition (e.g., sclerosis of blood vessels).
Using the ROC curve analysis on the index values of all subjects in the group, it is possible to determine whether there is a cutoff value capable of classifying individuals into groups matching their health status. For example, if subjects carrying a disease are labeled as positive and the non-carriers are labeled as negative, the ROC curve analysis may yield a threshold that classifies the subjects into an above and a below-threshold groups matching the health statuses carrier and non-carrier of the disease, respectively. There may be false positives and false negatives for each chosen cutoff value in the range of possible values. The rate of success in determining true positive cases is called “Sensitivity”, whereas the rate of success in determining true negative cases is called “Specificity”. Sensitivity and specificity for a plurality of cutoff values are computed. Sensitivity and Specificity are rates, and thus may be expressed in the range of zero (0) to one (1), or as a percentage from zero (0) to one hundred percent (100%). The results are plotted as Sensitivity values versus one (1) (or 100% depending on the unit of choice) minus the corresponding specificity. The area under the curve (AUC) reveals whether ROC analysis may be a valid classifier of the data: the closer the AUC is to 100%, the better classifier is the ROC analysis. On the contrary, the ROC analysis may not be considered for classification purposes if the AUC is closer to 50%, which is considered close to a random process. In general, the ROC method of analysis may be considered valid, if the AUC is at least 0.8.
Moreover, each threshold value yields a “Sensitivity” and “Specificity”. In populations where ROC analysis appears adequate, the “Sensitivity” curve decreases as the “Specificity” increases. At a particular threshold, the apex, the total of Sensitivity and Specificity is at a maximum. The apex is typically chosen as the threshold of classification if it yields a Sensitivity and Specificity each above 0.85, otherwise a threshold for Specificity and a threshold for Sensitivity may be respectively selected to yield a success rate of at least 0.85.
ROC analysis is one of any existing methods that may be utilized in embodiments of the invention to detect clusters in the data that define the clustering boundaries capable of segregating subjects into groups matching health status categories. For example, k-means clustering, hierarchical clustering, neural networks or any other clustering clustering method may be utilized in one or more embodiments of the invention. Furthermore, an embodiment of the invention may conduct the steps of
Step 170 represents computing success scores of the method of segregating of subjects in the test group. If the success level of the segregation into health categories is not satisfactory (e.g., no statistical difference compared to a population drawn from a random process), the parameters for computing the index values are revised and the analysis is repeated at step 140. The process of searching for optimal parameters may be repeated until the result of classification of subjects reaches (or exceeds) an acceptable success rate. Otherwise, if no optimal parameters may be found, the result may indicate that the chosen set of biomarkers is unsuitable for segregating the subjects, based on the index method in question, into the proposed health status categories.
The search for optimal parameters may involve changing one or more boundary values for discretizing biomarker values, and/or the weight coefficients associated with each biomarker in computing the index value for each subject. The search method may be manual i.e. an expert practitioner may set the initial parameters and adjust them, through multiple iterations of computation, while considering the outcome of the success rate of classification of subjects into health status categories. Implementations of the invention may also use numerical methods for automatic search to optimize parameters. Such methods comprise brute force search, where a large number of values of parameters and combinations thereof are tested. The numerical methods for determining optimal values may use gradient descent search, random walk search or any other mathematical method for searching for optimal parameters in order to achieve the goal of maximizing the success rate of the classification of subjects into correct corresponding health status categories.
Computer programs for conducting a search, in accordance with an implementation of the invention, require ordinary skills in the art of computer programming. Moreover, existing computer programs may be adapted (through a programming scripting language) to carry out a search process in an implementation of the invention. Computer programs include such programs as Mathematica™, Matlab™, Medcalc™, or any other available computer program may be used.
Step 180 represent the final step of determining the final parameters (or range thereof) that may be used in a diagnosis of the target disease(s). The optimal parameters include the coefficient associated with each biomarker, the number of ranges and the boundary values that define the ranges for each biomarker. Step 180 also includes determining the index range boundaries that define the categories as defined by the health status of subjects. The latter parameters may be used in systems for diagnosing whether a subject is a carrier of the a disease, as will detailed below in the method of use.
The invention provides a means for facilitating the display and read out of the results by defining the boundaries between ranges as discrete values for ease of use. For example, a scale comprising two health statuses, such as “disease present” and “disease not present”, may be defined has having a discrete boundary, such as one “1”, where the scale range lower than “1” may be mapped to “disease not present” status, while the scale range greater than “1” is mapped to “disease present” status.
Defining range boundaries as discrete values may be carried out during the search for the optimal parameters (as described above). The discrete range boundary values may also be provided computationally (e.g., using multipliers and offsets) subsequent to determining the optimal parameters.
Step 220 represents computing an index value for the patient. Provided the discretization boundary values for each biomarker, the level of each biomarker is converted into a score value, and provided the coefficient associated with each biomarker, the index value for the patient may be computed using equation (1).
Step 230 represents determining a patient's health status group. The patient's computed index value is compared to that of the established boundary values for health status categories. As described above, the established mapping between index values allows for ascertaining the health condition of a patient using its own index value.
The method steps as described in
An embodiment of the invention may be implemented in a way where the biomarkers data considered for developing a target diagnosis are collected in (healthy) subjects showing no symptoms of the target affection. By monitoring the subjects over time and determining which subjects develop an ailment, the invention allows for building a diagnosis (or a predictor index) for revealing the propensity of a subject to develop a target affection in a future time based on current biomarker data.
The benefits of developing an index-based scoring system, in accordance with one or more implementations of the invention, are numerous. The teachings of the invention allow a practitioner to compare results obtained from different individuals using a plurality of data combined in an index. For assessing progress in an individual (e.g., monitoring health condition during or post-treatment), a practitioner may conduct the tests using several biomarkers and follow the variations of the index values. For assessing the risk factors for an individual to develop a given disease, a practitioner may determine a range of index values and/or a variation thereof over time that may be indicative of the development of the disease. For example, some dog breeds are more susceptible than others to developing certain types of cancer. The index values provided by a diagnosis, in accordance with an implementation of the invention, may be utilized to spot those individuals that may be in the process of developing a cancer at an early stage.
Moreover, since a plurality of implementations of the invention may be developed for the diagnosis/detection of various aliments, a patient may be subjected only once to a test of a superset of biomarkers that would include biomarkers from several target diagnoses. By measuring the level of various biomarkers, more than one index may be computed at any time. The result is that each patient may be represented in a multidimensional space of indices that characterizes the state of the patient. Thus, a practitioner is provided a means to assess the probability for the patient to have one ailment versus another ailment when both present common symptoms.
Cancer Detection in Dogs using TK and CRP
An embodiment of the invention targets the diagnosis of cancer in a typical patient i.e. showing general symptoms of inflammation while failing to reveal the underlying affection.
There has been a long standing and studied relationship between cancer and inflammation. The inflammatory response orchestrates host defenses to infection, trauma, toxins, or other tissue damaging events and mediates tissue repair and regeneration. Epidemiological evidence points to a connection between inflammation and a predisposition for the development of cancer, i.e. long-term inflammation leads to the development of dysplasia. Thus, while acute inflammation is normally tightly controlled and part of the healing process, chronic inflammation may be associated with a number of diseases including cancer.
In cancer, there is evidence that inflammation plays an essential role at each stage of the disease (initiation and proliferation), and both tumor and inflammatory cells are able to directly or indirectly either inhibit or stimulate tumor growth. The effectiveness of tumor development has been demonstrated to correlate directly with the degree of the inflammatory reactions, and it seems that there are interactions between the cytokines produced in response to inflammatory reactions and tumor growth and even indications that inflammatory cytokines favor tumor promotion. Furthermore, with the assistance of inflammation, tumor cells infiltrate neighboring tissues, enter into the bloodstream, migrate, and establish remote colonies i.e. metastases.
With the inflammatory process initiating the acute-phase reaction the generation of acute-phase proteins (APP) occurs. C-reactive protein (CRP) is a major APP and has been shown to be an effective measure of general inflammation. The concentration of CRP or any serum APP level correlates to both the severity and duration of the inflammatory stimuli.
Thymidine kinase type 1 (TK) is a cytosolic enzyme involved in DNA synthesis through the so called “salvage pathway” for thymidine biosynthesis, in which deoxythymidine is converted to deoxythymidine monophosphate, leading to its eventual incorporation into DNA. Cellular TK activity is closely correlated with the DNA synthesis phase of the cell cycle. As such, its expression is restricted to proliferating cells, and thus is often more highly expressed in malignant cells, which are characterized by dysregulated proliferation.
A method according to the invention may use the measurement of the biomarkers thymidine kinase (TK) and c-reactive protein (CRP) to diagnose whether a patient is affected by cancer. In accordance with the teachings of the invention, it should be implicit that CRP is used as a biomarker that represents the family of biomarkers involved in acute-phase reaction.
TK and CRP levels may be assessed through any available method for measuring their levels, whether directly or indirectly.
In accordance with the method of the invention described in
Out of the 356 dogs recruited for the study, 378 dogs were enrolled and 22 disqualified due to prior history of cancer or inadequate specimen to complete analysis. The details of the cohorts breed and recruitment is shown in Table 1, and the cohort's sex distribution is shown in Table 2. In the latter study the biomarkers chosen were thymidine kinase (TK) and a canine-specific c-reactive protein (c-CRP). After the initial measurement of the biomarkers, the subjects were monitored and tested for cancer over a period of several months.
Table 3 shows the types of cancer as confirmed through histological or cytological tests, or by observation when the subjects
The measurement of Thymidine kinase enzymatic activity in samples of blood plasma is described in U.S. Pat. No. 8,097,432 B2, which is included herewith in its entirety by reference. In brief, a blood plasma is separated from a sample of blood to be tested for TK activity. The plasma is introduced into a solution containing an analog of deoxythymidine nucleotide and a phosphate donor. The product of the enzymatic activity is then measured using an immunoassay.
The measurement of CRP may be carried out using any available method for extracting and measuring protein concentration in a bodily fluid or tissue. The latter methods comprise using centrifugal force, electrophoresis, chromatography, immuno-binding assays and any available method for measuring the concentration of a protein.
Using TK and CRP in the latter study, and in accordance with the invention as described by formula (1) and
N
2
=a
2
·dTK+b
2
·dCRP (2)
Where “N2” denotes the Neoplasia index in a two (2) parameter model using TK and CRP, and where “a2” and “b2” denote the coefficients associated with TK and CRP, respectively. “dTK” and “dCRP” denote the discrete score value matched with a range of TK level and CRP level, respectively. As described above, TK level may be represented by the level of its enzymatic activity, whereas CRP may be represented by its mass (or moles) per volume of blood plasma.
As described above, the level of the biomarkers was measured and discretized. A discrete score assigned to ranges of the levels for each biomarker. Table 4.0 shows the detail of the discretization scores corresponding to ranges for each of TK and CRP level as used for a diagnosing method; and Table 4.1 show the details of the discretization scores used in a method for screening for cancer in canine subjects.
In the latter two-biomarker method, the coefficients “a2” and “b2” of formula (2) are given the values 2.1 and 1.6, respectively, when using the method for screening. When the method is used in a group of patients showing signs cancer (see below), the coefficients “a2” and “b2” of formula (2) are given the values 1.76 and 2.32, respectively.
Using the latter coefficients in formula (2), an index value lower than “1” would indicate low probability a patient has cancer versus a value greater than “1”, which would indicate that the patient has cancer.
In the latter study, the subjects were followed over time, and tests were carried out at four (4) months, and six (6) months from the initial test. Other (histological) tests were also carried out to confirm the presence of cancer.
For comparison of the performance of the method provided by the invention versus previous methods that relied solely on a single biomarker (e.g., TK), Receiver Operating Characteristic (ROC) analysis has been carried out using TK alone or the two-biomarker implementation of the invention. The ROC curves representing Sensitivity vs. one hundred (100) minus Specificity is plotted in
Block 610 represents the data acquisition layer of any system that implements an embodiment of the invention. The system may be any system required for the acquisition of the biological data that may be associated with any particular target health affection for diagnosis or the development of a diagnosis thereof. For example, the biological data may require the measurement of the level of a particular substance in the blood (or in any other bodily fluid) and/or in an organ tissue. The substance may be a protein, a peptide, any type of hormone or any other molecule or ion the measurement of which may be relevant to the diagnosis of a particular health affection or the development of diagnosis thereof. In the case where a biomarker is a biological substance, Block 610 represents the necessary laboratory equipment for collecting biological samples and processing the samples in order to obtain the biological data required for a particular embodiment of the invention. For example, measuring a protein level in the blood requires many steps comprising collecting blood from subjects, separating the portion of blood that contains the protein (e.g., using a centrifuge), purifying the protein, submitting the a purified solution of the protein to a calibrated assay (e.g., using marked antibodies), or any other step that may yield the concentration of the protein in the blood sample. In other instances, the protein may be an enzyme, in which case it may be desirable to measure the concentration of the protein through its enzymatic activity level.
One with ordinary skills in the art of medical or veterinary diagnosis is able to recognize the laboratory method steps, and laboratory equipment represented by block 610 for collecting biological sample, extracting the pertinent biomarker, and measuring its level.
As described above in the definition of term “biomarker”, the biological data may be any type of data that be involved in diagnosing a specific health affection whether the biological data may be assessed using a biological sample from the subject's body or through observational assessment of symptoms. For example, biological data comprise body temperature (e.g., in fever cases) heart beat rate, the number of siblings or parents having an affection (e.g., in cases of inherited affections), the elapsed time since the first signs of a disease started to show symptoms or any other non-substance related measurement that may be obtained from a patient and that may be considered in a diagnosis. Block 610 represents the tools and equipments necessary for collecting the biological data, which is accessible to one with ordinary skills in the pertinent art.
Block 620 represents a computer system for implementing and executing computer program instructions following the teachings of the methods developed in an embodiment of the invention. The computer system is any analog and/or digital computer capable of being configured to take input data, execute some or all of the method steps of the invention and provide the result of such execution to a user.
The computer system 620 may be a digital computer having a digital processor, a memory, a data transfer bus, a storage medium and any electronic communication means that allows the computer to receive and send data through display and communication to and from users and/or other machines. In embodiments of the invention, the computer may be embedded (as symbolized by block 605) in a device for carrying out medical (or veterinary) tests. Thus, a mobile and/or portable system comprises a device configured to collect the data such as determining the level or one or more biomarker level in a blood sample, and a computer system for carrying the steps of the invention. The device may be enabled with display means such that the results are communicated to a user.
In other embodiments of the invention, the computer 620 may stand alone, such as a computer system that is detached from any particular a device, while being capable of receiving data through direct communication (e.g., user interface), and/or remote communication means (e.g., networked data transfer).
The input data may be any of the biological and non-biological data, described above, that may be entered to the computer automatically through one or more links 632, or through a user interface 634 provided to a user 600. For example, a practitioner 600 may obtain the biological data from the data collection system through one or more communication (or interface links) 630, then enter the data into the computer system through the computer interface 634.
The practitioner may also enter in the computer system further configuration data, such as range boundaries for data discretization, optimization method or any other configuration data to conduct a search for optimal parameters. The computer system comprises program instructions to conduct a search for optimal parameters. Computer system 620 contains program instructions the execution of which allows to discretize new data (e.g., from a patient) compute one or more indices, then compare the new data to previously generated (or stored data) in order to provide a diagnosis.
In embodiments of the invention, a computer program as well as the data for any particular diagnosis method, in computer system 620, may be replicated from one machine to another, thus, allowing the diagnosis programs to be replicated to any number of other machines. For example, in portable blood test devices, a computer program may be configured to process the data and provide fast diagnosis using pre-stored diagnosis parameters.
However, the data acquisition system may be separated and remotely located and may serve the practitioner remotely. For example, the computer program may be implemented on a central unit that may collected biological data from a plurality of data acquisition systems (e.g., using computer networks), and serve client machines with diagnoses as they may be entered remotely. Furthermore, the collection of data from a plurality of client data acquisition system may serve to further refine the diagnosis program as more and more data become available.
Thus, a method and system for selecting a set of biomarkers and developing a method of use for detecting one or more target diseases and differentiating between the diseases to help a practitioner interpret the test results and potentially reveal the underlying affection or the propensity of a patient to develop a given disease.