The invention relates to the field of medical diagnosis, and in particular to the diagnosis of lung cancer by screening a biomarker by utilizing serum metabonomics, especially the differential diagnosis between benign pulmonary nodules and lung cancer.
Lung cancer is one of the most common malignant tumors in the world, and one of the cancers with the highest mortality. According to the latest data released by China in 2018, there are 2.1 million new cases of lung cancer in China, ranking first among malignant tumors, accounting for 18.4% (ranking first) of all tumor deaths, and the number of deaths is 1.8 million (ranking first), accounting for more than a quarter of malignant tumor deaths. Early diagnosis is of great significance to improve the treatment prognosis and survival rate of tumor patients. Currently, the confirmed diagnosis of lung cancer mainly depends on pathological examination of tissues or cells obtained through invasive paracentesis and bronchoscopy. CT imageological examination is a main auxiliary diagnostic means, but there are still some challenges in the differential diagnosis of benign or malignant pulmonary nodules. Serological examination of lung cancer, such as a carcinoembryonic antigen, a keratin fragment, a squamous cell carcinoma antigen, etc., can be used as an auxiliary diagnosis or follow-up monitoring of lung cancer, but its current sensitivity and specificity still need to be improved.
In recent years, with the rapid development of mass spectrometry, the study on application of Metabolomics in disease diagnosis has gradually attracted widespread attention. Metabonomics is a new discipline to qualitatively and quantitatively analyze a small-molecule metabolite with a relative molecular weight less than 1,000 in an organism. Metabolome refers to all of the low-molecular-weight metabolites of a certain organism or cell in a specific physiological period, and many life activities in the cell take place at the level of metabolites. Therefore, the detection and identification of metabolome can judge the pathophysiological state of the organism, and it is possible to find out a marker related to its pathogenesis. Therefore, the metabonomics has a wide application prospect in the field of clinical medicine. Metabolites in serum are stable and quantifiable, which provides a possibility of non-invasive diagnosis for clinical application.
Currently, there are no metabolic markers that can be used for diagnosis of lung cancer or for differential diagnosis between lung tumors and benign nodules. However, it is of great clinical significance to differentially diagnose whether a patient with lung nodules is a patient with lung cancer.
Meanwhile, it is worth paying attention to that male and female patients have their own characteristics in the pathogenesis, etiology, diagnosis, pathology, molecular biology, treatment and prognosis of some tumors (including lung cancer), while the prior art does not distinguish the serum metabolic markers of tumors (including lung cancer) according to gender.
It is necessary to improve the traditional technology, hoping to have a method and reagent for diagnosing or predicting lung cancer.
In the invention, serum samples of healthy people, a patient with benign pulmonary nodules and a patient with lung cancer (lung malignant tumor) are collected, and subjected to metabonomics analysis and typing of profiling by utilizing liquid chromatography-high resolution mass spectrometry (LC-HRMS), so as to screen out a biomarker among healthy people, the patient with benign pulmonary nodules and the patient with lung cancer, and the biomarker is further distinguished according to gender to find out a biomarker among healthy people, the patient with benign pulmonary nodules and the patient with lung cancer of the same gender.
An objective of the present invention is to find a metabolic biomarker between healthy people and a patient with lung cancer, and between a patient with benign pulmonary nodules and the patient with lung cancer, so as to be used for the diagnosis of lung cancer, especially for the early differential diagnosis of whether a patient with nodules has lung cancer. Moreover, considering the effect of gender differences, the invention differentiates according to gender, looking for a biomarker for diagnosis of lung cancer in a man or woman.
In an aspect, the invention provides a method for screening a lung cancer biomarker based on serum metabolomics, comprising the specific steps of:
(1) collecting serum samples of a patient with lung cancer, a patient with benign pulmonary nodules and healthy people;
(2) extracting serum metabolites;
(3) conducting detection of the extracted serum metabolites by liquid chromatography-mass spectrometry (LC-MS) and preprocessing the data;
(4) grouping the samples by partial least squares discriminant analysis, so as to screen out differential metabolites or differential biomarkers in different groups in connection with significance analysis; and
(5) mining biomarkers of lung cancer and applications thereof according to the screened differential metabolites, for example how to use these markers to diagnose or predict a patient with lung cancer, or to differentially diagnose a patient with lung cancer from healthy people or a patient with nodules.
In some embodiments, the specific implementation of the step (1) is: the serum samples are derived from patients with lung cancer, patients with benign pulmonary nodules and healthy people of different genders and age groups. Here, the so-called population with lung cancer, population with benign pulmonary nodules and healthy population have been diagnosed and confirmed, such as the population with lung cancer, population with (benign) pulmonary nodules or healthy population (without nodules) confirmed by histology or later symptoms.
In some embodiments, the specific implementation of the step (2) is: the serum metabolites are extracted by employing a three-phase extraction method of methyl tert-butyl ether:methanol:water (10:3:2.5, v/v/v), wherein methanol and methyl tert-butyl ether are sequentially added into 50 μL of serum, the mixture is incubated with shaking on ice for 1 hour, then added with water, subjected to vortex shaking, and subsequently centrifuged, the subnatant is taken and spin-dried in a low-temperature vacuum dryer, and the obtained dry extract of the serum metabolites is stored in a refrigerator at −80° C.
Considering the batch effect of sample pretreatment, in this study, processing of each batch of experimental samples is conducted simultaneously with the processing of one Reference serum for subsequent data correction.
The specific implementation of the step (3) is: the dry extract of the serum metabolites is reconstituted and centrifuged, then the supernatant is taken to make samples to be tested, and all of samples are detected by liquid chromatography-high resolution mass spectrometry (LC-HRMS). The m/z ion, retention time and peak area are extracted from the original data, subjected to data normalization, and finally searched in a database for identification, so as to obtain a data matrix for subsequent analysis.
Further, the specific implementation of the step (4) is: data filtering is performed on the data matrix of liquid chromatography-high resolution mass spectrometry, and the remaining data is subjected to partial least squares discriminant analysis for sample grouping, and thus three groups, a lung cancer group, a benign pulmonary nodules group and a healthy group, can be obviously clustered.
In some embodiments, the specific implementation of the step (5) is: a compound with an FDR value less than 0.05 and meanwhile with a VIP greater than 1 is screened out as a differential metabolite, and the fold change is calculated. Furthermore, combined with biological significance, the differential metabolic markers of the patient with lung cancer, the patient with benign pulmonary nodules and healthy people are mined, and metabolic pathways are analyzed.
In some embodiments, the differential metabolic markers for the patient with lung cancer, the patient with benign pulmonary nodules and healthy people of the same gender are screened according to steps (4) and (5) according to gender.
In a second aspect of the present invention, provided is a method for detecting whether an individual has lung cancer, which includes the steps of: detecting a marker in a blood or serum sample of the individual, wherein the biomarker is selected from one or more of: 1-Methylnicotinamide, 2-Ketobutyric acid, 2-Octenoylcarnitine, 2−Pyrrolidone, 2-trans,4-cis-Decadienoylcarnitine, 3b,16a-Dihydroxyandrostenone sulfate, 3-Chlorotyrosine, 3-hydroxybutyryl carnitine, 3-hydroxydecanoyl carnitine, 3-hydroxydodecanoyl carnitine, 3-hydroxyoctanoyl carnitine, 4-oxo-Retinoic acid, 7-Methylguanine, Acetophenone, Acetylcarnitine, Alanine, alpha-Eleostearic acid, Aminoadipic acid, Arabinosylhypoxanthine, Asparagine, Bilirubin, Carnitine, Choline Sulfate, cis-5-Tetradecenoylcarnitine, Citrulline, Creatinine, Cyclohexaneacetic acid, Diethylamine, Dihydrothymine, Dihydroxybenzoic acid, Docosahexaenoic acid, Ecgonine, Ergothioneine, Ethyl 3-oxohexanoate, Glutamine, Hexanoylcarnitine, Hippuric acid, Homo-L-arginine, Hydroxybutyric acid, Hypoxanthine, Inosine, Isoleucine, Kynurenine, Lactic acid, Leucine, Linoleyl carnitine, Lysine, Methylacetoacetic acid, N6,N6,N6-Trimethylysine, N-Acetyl-L-alanine, Nicotine, Octanoylcarnitine, 5-Oxoproline, Phenylalanine, Pilocarpine, Propionylcarnitine, Pyruvic acid, Serotonin, Succinic acid semialdehyde, Trimethylamine N-oxide, Tyrosine, Uridine, Urocanic acid, Xanthine, 4-Hydroxyphenylacetic acid, Dehydroepiandrosterone sulfate, Androsterone sulfate, Dihydrotestosterone sulfate, Epiandrosterone sulfate, Citric acid, Uric acid, Pantothenic acid, Indole-3-acetic acid, gamma-Butyrobetaine, Calcitriol, all-trans-retinal, 3,4-dihydroxyphenylacetic acid, Caprylic acid, Arachidic acid, Hydrocortisone Valerate, Dopamine, Tryptophan, 3-Hydroxybutyric acid, Arachidonic acid.
In some embodiments, the biomarker for diagnosing lung cancer is one of or a combination of several ones of: alpha-Eleostearic acid, 2-Ketobutyric acid, 2-Octenoylcarnitine, 2-trans,4-cis-Decadienoylcarnitine, 3-Chlorotyrosine, 3-hydroxydecanoyl carnitine, 3-hydroxydodecanoyl carnitine, 3-hydroxyoctanoyl carnitine, Acetophenone, Arabinosylhypoxanthine, Cyclohexaneacetic acid, Dihydroxybenzoic acid, Docosahexaenoic acid, Ecgonine, Ethyl 3-oxohexanoate, Hexanoylcarnitine, Hippuric acid, Homo-L-arginine, Hypoxanthine, Lactic acid, N-Acetyl-L-alanine, Octanoylcarnitine, 5-Oxoproline, Pyruvic acid, Serotonin, Succinic acid semialdehyde, Xanthine. The aforementioned biomarkers have significant differences both between the patient with lung cancer and healthy people, and between the patient with lung cancer and the patient with benign pulmonary nodules, indicating that they are closely related to lung cancer and are not affected by whether there are benign pulmonary nodules. Therefore, they can be used for differential diagnosis of lung cancer and benign pulmonary nodules, and can also be used for differential diagnosis of the patient with lung cancer and healthy people (without nodules). In some embodiments, when 2-Ketobutyric acid, Hypoxanthine, Lactic acid, N-Acetyl-L-alanine, 5-Oxoproline, Pyruvic acid, Xanthine, Succinic acid semialdehyde among the aforementioned markers in the serum of an individual (including those with and without pulmonary nodules) are increased, it indicates that the individual has high possibility of suffering from lung cancer. In some embodiments, similarly, if other biomarkers are decreased at the same time, it further indicates a large possibility of suffering from lung cancer.
In some embodiments, during a process of comparing differential metabolites between the patient with lung cancer and healthy people, between the patient with lung cancer and the patient with benign pulmonary nodules, between the patient with lung cancer and healthy people in men and women, and between the patient with lung cancer and the patient with benign pulmonary nodules in men and women, it is found that: 3-hydroxydecanoyl carnitine, 3-hydroxyoctanoyl carnitine, Arabinosylhypoxanthine, Cyclohexaneacetic acid, Ecgonine, Ethyl 3-oxohexanoate, Hippuric acid, Homo-L-arginine, Hypoxanthine, Octanoylcarnitine, 5-Oxoproline have significant differences between the patient with lung cancer and healthy people or the patient with benign pulmonary nodules (including if they are distinguished according to gender), which indicates that these differential metabolites are more closely related to lung cancer, and are not affected by benign pulmonary nodules and gender. Therefore, they can be used for differential diagnosis between the patient with lung cancer and healthy people and between the patient with lung cancer and the patient with benign pulmonary nodules, and can also be used for differential diagnosis between the patient with lung cancer and healthy people and between the patient with lung cancer and the patient with benign pulmonary nodules in men or women.
When in the serum of an individual (including men and women, with and without nodules), Hypoxanthine and 5-Oxoproline are increased, while 3-hydroxydecanoyl carnitine, 3-hydroxyoctanoyl carnitine, Arabinosylhypoxanthine, Cyclohexaneacetic acid, Ecgonine, Ethyl 3-oxohexanoate, Hippuric acid, Homo-L-arginine, and Octanoylcarnitine are decreased, it indicates that the individual has high possibility of suffering from lung cancer. In some embodiments, similarly, if other biomarkers are decreased at the same time, it further indicates a large possibility of suffering from lung cancer.
In some embodiments, the biomarker is selected from one or more of the following Table 2 when used for diagnosing whether an individual without pulmonary nodules has lung cancer. The increase of one or more of Hypoxanthine, Lactic acid, Xanthine, N-Acetyl-L-alanine, Succinic acid semialdehyde, Pyruvic acid, 2-Ketobutyric acid, Methylacetoacetic acid, and 5-Oxoproline, or the decrease of other markers, indicates a large possibility of suffering from lung cancer. In some embodiments, similarly, if other biomarkers are decreased at the same time, it further indicates a large possibility of suffering from lung cancer.
In some embodiments, when it is clinically known that there is a mass or nodules in the lung of a patient, the biomarkers are selected from one or more of those in Table 3 when used for differential diagnosis between lung cancer and benign lung nodules. In some embodiments, the increase of one or more of the following markers: Hypoxanthine, Lactic acid, Xanthine, Dihydrothymine, N-Acetyl-L-alanine, 5-Oxoproline, 2−Pyrrolidone, Hydroxybutyric acid, Succinic acid semialdehyde, Pyruvic acid, 2-Ketobutyric acid, or alternatively the decrease of other markers, indicates a high possibility of suffering from lung cancer.
In some embodiments, the biomarker used for differential diagnosis between lung cancer and benign pulmonary nodules is one of or a combination of several ones of: 1-Methylnicotinamide, 2−Pyrrolidone, 4-oxo-Retinoic acid, 7-Methylguanine, Acetylcarnitine, Bilirubin, Choline Sulfate, cis-5-Tetradecenoylcarnitine, Citrulline, Creatinine, Diethylamine, Dihydrothymine, Glutamine, Hydroxybutyric acid, Inosine, Kynurenine, Linoleyl carnitine, Lysine, Trimethylamine N-oxide. These biomarkers have significant differences between the patient with lung cancer and the patient with benign pulmonary nodules, but there is no significant difference between the patient with lung cancer and healthy people, which indicates that these biomarkers are preferred and specific biomarkers for distinguishing the patient with lung cancer from the patient with benign pulmonary nodules, and cannot distinguish the patient with lung cancer from healthy population (without nodules). These biomarkers have more practical significance. The possibility of further detection of canceration is only possible when nodules are generally found in the process of physical examination or diagnosis. At this time, apart from routine puncture biopsy, an effective way of preliminary screening is to conduct preliminary screening by detecting whether there are changes or abnormalities, such as significant changes, in one or more of the aforementioned markers in a blood sample.
In some embodiments, the method for differential diagnosis of lung cancer and benign pulmonary nodules through a blood sample includes: detecting a biomarker in a blood sample, wherein the biomarker is selected from one of or a combination of several ones of: 1-Methylnicotinamide, 2-Octenoylcarnitine, 3-hydroxydecanoyl carnitine, 3-hydroxyoctanoyl carnitine, 4-oxo-Retinoic acid, 7-Methylguanine, Arabinosylhypoxanthine, Cyclohexaneacetic acid, Ecgonine, Ethyl 3-oxohexanoate, Hippuric acid, Homo-L-arginine, Hypoxanthine, Inosine, Lactic acid, Octanoylcarnitine, 5-Oxoproline, Trimethylamine N-oxide. These biomarkers have significant differences between the patient with lung cancer and the patient with benign pulmonary nodules (including men and women), and have significant differences between the patient with lung cancer and the patient with benign pulmonary nodules in men or women, which indicates that these biomarkers are not affected by gender and can effectively distinguish lung cancer from benign pulmonary nodules.
In some embodiments, the biomarker is selected from one or more of those in Table 4 when used for determining whether a man without pulmonary nodules has lung cancer. The increase of one or more of Hypoxanthine, N-Acetyl-L-alanine, Pyruvic acid, and 5-Oxoproline, or the decrease of one or more of other biomarkers, indicates that the man has a high possibility of suffering from lung cancer.
In some embodiments, when it is clinically known that there is a mass or nodules in the lung of the male patient, the biomarker is selected from one or more of those in Table 5 when used for determine whether the mass or nodules is lung cancer or benign pulmonary nodules. The increase of one or more of Hypoxanthine, N-Acetyl-L-alanine, Pyruvic acid, 5-Oxoproline, Lactic acid, Dihydrothymine, Aminoadipic acid, and N6,N6,N6-Trimethylysine, or the decrease of one or more of other markers, indicates that the man has a large possibility of suffering from lung cancer.
In some embodiments, the biomarker used for determining lung cancer and benign pulmonary nodules in men is one of or a combination of several ones of: 1-Methylnicotinamide, 2-trans,4-cis-Decadienoylcarnitine, 3-hydroxydecanoyl carnitine, 3-hydroxydodecanoyl carnitine, 3-hydroxyoctanoyl carnitine, 4-oxo-Retinoic acid, 7-Methylguanine, Acetylcarnitine, alpha-Eleostearic acid, Arabinosylhypoxanthine, Cyclohexaneacetic acid, Diethylamine, Docosahexaenoic acid, Ecgonine, Ethyl 3-oxohexanoate, Glutamine, Hippuric acid, Homo-L-arginine, Hypoxanthine, Inosine, Linoleyl carnitine, N-Acetyl-L-alanine, Octanoylcarnitine, 5-Oxoproline, Pyruvic acid, and Trimethylamine N-oxide. These biomarkers have significant differences both between the patient with lung cancer and the patient with benign pulmonary nodules in men and between the patient with lung cancer and healthy people in men, which indicates that these biomarkers are closely related to lung cancer in men, and they can be used for not only distinguishing the patient with lung cancer from the patient with benign pulmonary nodules in men, but also distinguishing the patient with lung cancer from healthy people (without nodules) in men.
In some embodiments, the biomarker used for determining lung cancer and benign pulmonary nodules in men is one of or a combination of several ones of: 2-Octenoylcarnitine, 3-hydroxybutyryl carnitine, Aminoadipic acid, Bilirubin, Dihydrothymine, Ergothioneine, Lactic acid, N6,N6,N6-Trimethylysine, Nicotine. These biomarkers have significant differences both between the patient with lung cancer and the patient with benign pulmonary nodules in men and between the patient with lung cancer and healthy people in men, which indicates that these biomarkers can be used for distinguishing the patient with lung cancer from the patient with benign pulmonary nodules in men, but cannot be used for distinguishing the patient with lung cancer from healthy people (without nodules) in men.
In some embodiments, the biomarker used for determining lung cancer and benign pulmonary nodules in men is one of or a combination of several ones of: alpha-Eleostearic acid, 2-trans,4-cis-Decadienoylcarnitine, 3-hydroxydodecanoyl carnitine, Acetylcarnitine, Bilirubin, Diethylamine, Dihydrothymine, Docosahexaenoic acid, Glutamine, Linoleyl carnitine, N-Acetyl-L-alanine, Pyruvic acid, 3-hydroxybutyryl carnitine, Aminoadipic acid, Ergothioneine, N6,N6,N6-Trimethylysine, Nicotine. These biomarkers have significant differences between the patient with lung cancer and the patient with benign pulmonary nodules in men, but have no significant difference the patient with lung cancer and the patient with benign pulmonary nodules in women, which indicates that these biomarkers are related to gender and can be used for distinguishing lung cancer from benign pulmonary nodules in men, but not in women.
In some embodiments, the biomarker used for determining lung cancer and benign pulmonary nodules in men is one of or a combination of several ones of: 3-hydroxybutyryl carnitine, Aminoadipic acid, Ergothioneine, Nicotine. These biomarkers only have significant differences between the patient with lung cancer and the patient with benign pulmonary nodules in men, but have no significant differences between the patient with lung cancer and healthy people (including men and women), between the patient with lung cancer and the patient with pulmonary nodules (including men and women), between the patient with lung cancer and healthy people in men, between the patient with lung cancer and healthy people in women, and between the patient with lung cancer and the patient with pulmonary nodules in women, which indicates that these compounds are specific biomarkers for lung cancer and benign pulmonary nodules in men, and can only be used for distinguishing lung cancer from pulmonary nodules in men, but not for distinguishing lung cancer from pulmonary nodules or distinguishing lung cancer from healthy people (without nodules) in women.
In some embodiments, the biomarker is selected from one or more of those in Table 6 when used for determining whether a woman with pulmonary nodules has lung cancer. The increase of one or more of Alanine, Linoleyl carnitine, Pyruvic acid, Methylacetoacetic acid, Hypoxanthine, Lactic acid, Xanthine, 2−Pyrrolidone, Succinic acid semialdehyde, 2-Ketobutyric acid, and 5-Oxoproline, or the decrease of one or more of other markers, indicates that the woman has a high possibility of suffering from lung cancer.
In some embodiments, the biomarker used for determining lung cancer and benign pulmonary nodules in women is one of or a combination of several ones of: 1-Methylnicotinamide, 2-Ketobutyric acid, 2−Pyrrolidone, 3-Chlorotyrosine, 3-hydroxydecanoyl carnitine, 3-hydroxyoctanoyl carnitine, 4-oxo-Retinoic acid, 7-Methylguanine, Acetophenone, Arabinosylhypoxanthine, Choline Sulfate, Citrulline, Creatinine, Cyclohexaneacetic acid, Ecgonine, Ethyl 3-oxohexanoate, Hexanoylcarnitine, Hippuric acid, Homo-L-arginine, Hypoxanthine, Inosine, Lactic acid, Lysine, Octanoylcarnitine, 5-Oxoproline, Serotonin, Succinic acid semialdehyde, Trimethylamine N-oxide, Xanthine. These biomarkers have significant differences both between the patient with lung cancer and the patient with benign pulmonary nodules in women and between the patient with lung cancer and healthy people in women, which indicates that these biomarkers are closely related to lung cancer in women, and they can be used for not only distinguishing the patient with lung cancer from the patient with benign pulmonary nodules in women, but also distinguishing the patient with lung cancer from healthy people (without nodules) in women.
In some embodiments, the biomarker used for determining lung cancer and benign pulmonary nodules in women is one of or a combination of several ones of: 2-Octenoylcarnitine, cis-5-Tetradecenoylcarnitine, Kynurenine, Phenylalanine. These biomarkers have significant differences both between the patient with lung cancer and the patient with benign pulmonary nodules in women and between the patient with lung cancer and healthy people in women, which indicates that these biomarkers can be used for distinguishing the patient with lung cancer from the patient with benign pulmonary nodules in women, but cannot be used for distinguishing the patient with lung cancer from healthy people (without nodules) in women.
In some embodiments, the biomarker used for determining lung cancer and benign pulmonary nodules in women is one of or a combination of several ones of: 2-Ketobutyric acid, 2−Pyrrolidone, 3-Chlorotyrosine, Acetophenone, Choline Sulfate, cis-5-Tetradecenoylcarnitine, Citrulline, Creatinine, Hexanoylcarnitine, Kynurenine, Lysine, Serotonin, Succinic acid semialdehyde, Xanthine, Phenylalanine. These biomarkers have significant differences between the patient with lung cancer and the patient with benign pulmonary nodules in women, but have no significant difference the patient with lung cancer and the patient with benign pulmonary nodules in men, which indicates that these biomarkers are related to gender and can be used for distinguishing lung cancer from benign pulmonary nodules in women, but not in men.
In some embodiments, the biomarker used for determining lung cancer and benign pulmonary nodules in women is Phenylalanine. This biomarker only has significant differences between the patient with lung cancer and the patient with benign pulmonary nodules in women, but have no significant differences between the patient with lung cancer and healthy people (including men and women), between the patient with lung cancer and the patient with pulmonary nodules (including men and women), between the patient with lung cancer and healthy people in women, between the patient with lung cancer and healthy people in men, and between the patient with lung cancer and the patient with pulmonary nodules in men, which indicates that this compound is a specific biomarker for lung cancer and benign pulmonary nodules in women, and can only be used for distinguishing lung cancer from pulmonary nodules in women, but not for distinguishing lung cancer from benign pulmonary nodules or distinguishing lung cancer from healthy people (without nodules) in men.
In a third aspect of the invention, a model for identifying lung cancer and benign pulmonary nodules (including men and women) by a combination of various differential metabolites is established. The model parameters are the optimum model parameters. The model has an AUC of 0.955, sensitivity and specificity of 0.913 and 0.876 respectively as obtained by ROC analysis, which indicates that the model has high diagnostic accuracy.
In some embodiments, these models can be input into a computer system in advance, and when a biomarker is obtained, a diagnosis result is obtained through automatic calculation by the computer system. Therefore, the invention can provide a diagnosis system for lung cancer, which includes an operation module, wherein the operation or calculation module includes the following model equations. In some embodiments, the computer system further includes an output module for outputting the calculation result. In some embodiments, the computer system further includes an input module for inputting one or more detection results of the aforementioned biomarkers, and such detection results can be quantitative detection results or qualitative results. The established model here is not a finite model recited in the invention, as long as the biomarkers within the scope of the invention are applied to establish a model for lung cancer diagnosis, the model is within the scope of the invention. In some embodiments, the computer system further includes a negative control or reference data module.
In some embodiments, The model equation can be: Logit(P)=In[P/((1−P)]=5.553×VO4+2.92×V05+2.713×V06−0.332×V07−1.798×V10−7.922×V13−0.593×V14+0.643×V17−2.187×V19−0.992×V20−2.352×V33−1.441×V38+7.214×V39−1.22×V40−1.235×V42+1.61, wherein V04, V05, V06, V07, V10, V13, V14, V17, V19, V20, V33, V38, V39, V40, V42 are respectively the 5-Oxoproline, N-Acetyl-L-alanine, Hypoxanthine, Cyclohexaneacetic acid, Ethyl 3-oxohexanoate, Arabinosylhypoxanthine, Docosahexaenoic acid, Hydroxybutyric acid, Serotonin, Ecgonine, Lysine, Kynurenine, Inosine, 4-oxo-Retinoic acid, Linoleylcarnitine. In some embodiments, a cut-off value of P is 0.424, and when P>0.424, the possibility of being diagnosed with or suffering from lung cancer is very high.
In some embodiments, a model for identifying benign pulmonary nodules and lung cancer in men by a combination of multiple differential metabolites is established. As obtained by ROC analysis, the model has an AUC of 0.968, and sensitivity and specificity of 0.870 and 0.988 respectively, indicating that the model had high diagnostic accuracy. The model equation of the model is: Logit(P)=In[P/((1−P)]=6.283×MV02−0.646×MV10−2.758×MV13+1.864×MV15−1.126×MV19−1.145×MV27−3.918×MV30+1.494, wherein MV02, MV10, MV13, MV15, MV19, MV27, MV30 are respectively the 5-Oxoproline, Nicotine, Ecgonine, N6,N6,N6-Trimethylysine, Arabinosylhypoxanthine, Docosahexaenoic acid, Linoleyl carnitine. In some embodiments, a cut-off value of P is 0.701, and when P>0.701, the possibility of being diagnosed with or suffering from lung cancer is very high.
Moreover, a model for identifying benign pulmonary nodules and lung cancer in women by a combination of multiple differential metabolites is established. The model parameters are the optimum model parameters. As obtained by ROC analysis, the model has an AUC of 0.969, and sensitivity and specificity of 0.870 and 0.953 respectively, indicating that the model had high diagnostic accuracy. The model equation of the model is: Logit(P)=In[P/((1−P)]=10.742×FV05−1.031×FV08−7.442×FV09+11.839×FV13−2.617×FV15−3.030×FV20−1.413×FV23−2.278×FV29−6.905, wherein FV05, FV08, FV09, FV13, FV15, FV20, FV23, FV29 are respectively the 5-Oxoproline, Cyclohexaneacetic acid, Lysine, Phenylalanine, Serotonin, Kynurenine, Arabinosylhypoxanthine, 3-hydroxydecanoyl carnitine. In some embodiments, a cut-off value of P is 0.629, and when P>0.629, the possibility of being diagnosed with or suffering from lung cancer is very high.
In some embodiments, an ROC curve is established for each metabolic compound, and those compounds with large areas under the curve can be found, so that a batch of compounds can be chosen to establish a diagnostic model or obtain more reliable diagnostic results. It is generally understood that the reliability of the established model may be higher when more biomarkers are selected. For example, the sensitivity is higher when the accuracy is higher and the specificity is stronger. However, a single compound or several important compounds can also be selected for diagnosis, or for preliminary screening and detection. This detection method can be of a variety, for example, by utilizing the liquid-phase mass spectrometry combined detection of the invention, or by detecting one or more biomarkers of the invention at one time in a high-throughput manner, and of course, the detection of a small amount of several biomarkers is not excluded. Of course, an immune method can also be adopted to detect a small amount of several important compounds, such as the joint detection of 1, 2, 3, 4 or 5 biomarkers, which can also explain some problems.
Therefore, in some schemes, the biomarker used for determining whether a patient with pulmonary nodules (including men and women) has lung cancer is one of or a combination of two or three of 5-Oxoproline, Arabinosylhypoxanthine and Inosine. A model for identifying benign pulmonary nodules and lung cancer by a single differential metabolite is established, and it is found by establishing an ROC curve of each differential metabolite that, the AUCs (areas under curve) of 5-Oxoproline, Arabinosylhypoxanthine and Inosine are 0.736, 0.784 and 0.747, respectively, which are higher than those of other differential metabolites, indicating that the three differential metabolites have higher differential diagnosis values.
Moreover, when the model for identifying benign pulmonary nodules and lung cancer by a combination of various differential metabolites is established, it is found that 5-Oxoproline, Arabinosylhypoxanthine, and Inosine have the largest absolute values of model coefficients in the model, and the ORs (odds ratios) of 5-Oxoproline and Inosine are much higher than those of other differential metabolites, while the OR of Arabinosylhypoxanthine is much smaller than those of other differential metabolites, which indicates that 5-Oxoproline, Arabinosylhypoxanthine, and Inosine account for a higher proportion in the model, and thus their values in differential diagnosis of lung cancer and benign pulmonary nodules are also higher, and this finding is consistent with the results obtained in the established model for identifying benign pulmonary nodules and malignant tumors through a single differential metabolite.
In some examples, the biomarker used for determining whether a male patient with pulmonary nodules has lung cancer is Linoleyl carnitine. Similarly, when a model for identifying benign pulmonary nodules and malignant tumors in men through a single differential metabolite is established, it is found that the AUC value of Linoleyl carnitine is 0.867, which is much higher than those of other differential metabolites; and when a model for identifying benign pulmonary nodules and lung cancer in men through a combination of multiple differential metabolites is established, it is found that the absolute value of the model coefficient of Linoleyl carnitine is larger, and its OR is much lower than those of other differential metabolites, which indicates that Linoleyl carnitine has a higher diagnostic value.
In some examples, the biomarker used for determining whether a female patient with pulmonary nodules has lung cancer is one or a combination of both of 5-Oxoproline and Phenylalanine. Similarly, when a model for identifying benign pulmonary nodules and malignant tumors in women through a single differential metabolite is established, it is found that the AUC values of 5-Oxoproline and Phenylalanine are 0.823 and 0.702, which are relatively larger; and when a model for identifying benign pulmonary nodules and lung cancer in men through a combination of multiple differential metabolites is established, it is found that the model coefficients and OR values of 5-Oxoproline and Phenylalanine are much higher than those of other differential metabolites, which indicates that 5-Oxoproline and Phenylalanine have higher diagnostic values.
The invention provides a detection method for diagnosing lung cancer, which includes detecting a biomarker in a blood or serum sample, wherein the biomarker is selected from one or more of: 2-trans,4-cis-Decadienoylcarnitine, Octanoylcarnitine, Decanoylcarnitine, 2-Octenoylcarnitine, Hexanoylcarnitine, 3-hydroxydodecanoylcarnitine, 3-hydroxydecanoylcarnitine, 3-hydroxyoctanoylcarnitine, Ecgonine, Trimethylamine N-oxide, 1-Methylnicotinamide, 3-Chlorotyrosine, Homo-L-arginine, Serotonin, Alanine, alpha-Eleostearic acid, Ethyl 3-oxohexanoate, Inosine, Arabinosylhypoxanthine, Hippuric acid, Cyclohexaneacetic acid, Lactic acid, 2-Ketobutyric acid, Pyruvic acid, Hypoxanthine, Xanthine, N6,N6,N6-Trimethylysine, Kynurenine, cis-5-Tetradecenoylcarnitine, Docosahexaenoic acid, Choline Sulfate, Dihydrothymine, 17-Hydroxypregnenolone sulfate, Pregnenolone sulfate, Tiglylcarnitine, Propionylcarnitine, 3-hydroxybutyrylcarnitine, Oxindole, Nicotine, Ergothioneine, Phenylacetylglutamine, Citrulline, Lysine, Aminocaproic acid, Methylimidazoleacetic acid.
In some embodiments, the biomarker is selected from one or more of: Decanoylcarnitine, 17-Hydroxypregnenolone sulfate, Pregnenolone sulfate, Tiglylcarnitine, Oxindole, Phenylacetylglutamine, Aminocaproic acid, Methylimidazoleacetic acid.
In some embodiments, the biomarker is selected from one or more of: 2-trans,4-cis-Decadienoylcarnitine, Octanoylcarnitine, Decanoylcarnitine, 2-Octenoylcarnitine, Hexanoylcarnitine, 3-hydroxydodecanoylcarnitine, 3-hydroxydecanoylcarnitine, 3-hydroxyoctanoylcarnitine, Ecgonine, Trimethylamine N-oxide, 1-Methylnicotinamide, 3-Chlorotyrosine, Homo-L-arginine, Serotonin, Alanine, alpha-Eleostearic acid, Ethyl 3-oxohexanoate, Inosine, Arabinosylhypoxanthine, Hippuric acid, Cyclohexaneacetic acid, Lactic acid, 2-Ketobutyric acid, Pyruvic acid, Hypoxanthine, Xanthine. These biomarkers have significant differences between the patient with lung cancer and healthy people and between the patient with lung cancer and the patient with benign pulmonary nodules, which indicates that they are closely related to lung cancer and are not affected by the presence of benign pulmonary nodules. Therefore, they can be used for differential diagnosis of or distinguishing between lung cancer and benign pulmonary nodules, and can also be used for differential diagnosis of or distinguishing between individuals with lung cancer and healthy individuals (without nodules).
In some embodiments, when it is diagnosed whether an individual without pulmonary nodules has lung cancer, the biomarker is selected from one or more of: 2-trans,4-cis-Decadienoylcarnitine, Octanoylcarnitine, Decanoylcarnitine, 2-Octenoylcarnitine, Hexanoylcarnitine, 3-hydroxydodecanoylcarnitine, 3-hydroxydecanoylcarnitine, 3-hydroxyoctanoylcarnitine, Ecgonine, Trimethylamine N-oxide, 1-Methylnicotinamide, 3-Chlorotyrosine, Homo-L-arginine, Serotonin, Alanine, alpha-Eleostearic acid, Ethyl 3-oxohexanoate, Inosine, Arabinosylhypoxanthine, Hippuric acid, Cyclohexaneacetic acid, Lactic acid, 2-Ketobutyric acid, Pyruvic acid, Hypoxanthine, Xanthine, N6,N6,N6-Trimethylysine, Tiglylcarnitine, Propionylcarnitine, 3-hydroxybutyrylcarnitine, Oxindole, Nicotine, Ergothioneine, Phenylacetylglutamine, Citrulline, Lysine, Aminocaproic acid, Methylimidazoleacetic acid, wherein the biomarkers N6,N6,N6-Trimethylysine, Tiglylcarnitine, Propionylcarnitine, 3-hydroxybutyrylcarnitine, Oxindole, Nicotine, Ergothioneine, Phenylacetylglutamine, Citrulline, Lysine, Aminocaproic acid, Methylimidazoleacetic acid have significant differences between the patient with lung cancer and healthy people (without nodules), but have no significant difference between the patient with lung cancer and the patient with benign pulmonary nodules, which indicates that these biomarkers are preferred and specific biomarkers to distinguish the patient with lung cancer from healthy people (without nodules).
In some embodiments, when it is diagnosed whether an individual with pulmonary nodules has lung cancer, the biomarker is selected from one or more of: 2-trans,4-cis-Decadienoylcarnitine, Octanoylcarnitine, Decanoylcarnitine, 2-Octenoylcarnitine, Hexanoylcarnitine, 3-hydroxydodecanoylcarnitine, 3-hydroxydecanoylcarnitine, 3-hydroxyoctanoylcarnitine, Ecgonine, Trimethylamine N-oxide, 1-Methylnicotinamide, 3-Chlorotyrosine, Homo-L-arginine, Serotonin, Alanine, alpha-Eleostearic acid, Ethyl 3-oxohexanoate, Inosine, Arabinosylhypoxanthine, Hippuric acid, Cyclohexaneacetic acid, Lactic acid, 2-Ketobutyric acid, Pyruvic acid, Hypoxanthine, Xanthine, Kynurenine, cis-5-Tetradecenoylcarnitine, Docosahexaenoic acid, Choline Sulfate, Dihydrothymine, 17-Hydroxypregnenolone sulfate, Pregnenolone sulfate, wherein the biomarkers Kynurenine, cis-5-Tetradecenoylcarnitine, Docosahexaenoic acid, Choline Sulfate, Dihydrothymine, 17-Hydroxypregnenolone sulfate, Pregnenolone sulfate.
These biomarkers have significant differences between lung cancer and benign pulmonary nodules, but have no significant difference between the patient with lung cancer and healthy people (without nodules), which indicates that these biomarkers are preferred and specific biomarkers to distinguish lung cancer from benign pulmonary nodules. These biomarkers have more practical significance. The possibility of further detection of canceration is only possible when nodules are generally found in the process of physical examination or diagnosis. At this time, apart from routine puncture biopsy, an effective way of preliminary screening is to conduct preliminary screening by detecting whether there are changes or abnormalities, such as significant changes, in one or more of the aforementioned markers in a blood sample.
The invention establishes a model for identifying between the patient with lung cancer and healthy (without nodules) people or between the patient with lung cancer and the patient with benign pulmonary nodules by a combination of multiple biomarkers.
In some embodiments, when the model is used for distinguishing or identifying the patient with lung cancer from the patient with benign pulmonary nodules, or for identifying whether a patient with pulmonary nodules has lung cancer, the model includes one or more of the following:
model A: In[P/(1−P)]=2.29×M1+1.02×M2+0.64×M3+0.62×M4+0.47×M5+0.42×M6+0.38×M7+0.26×M8+0.05×M9+0.03×M10−0.05×M11−0.12×M12−0.16×M13−0.17×M14−0.36×M15−0.4×M16−0.45×M17−0.46×M18−0.47×M19−0.53×M20−0.55×M21−0.79×M22−0.95×M23−1.02×M24−1.19×M25−1.52×M26−1.88×M27+4.01, wherein M1-M27 are respectively relative abundances of Hypoxanthine, Alanine, 2-Ketobutyric acid, 2-trans,4-cis-Decadienoylcarnitine, Xanthine, 17-Hydroxypregnenolone sulfate, Dihydrothymine, Octanoylcarnitine, Lactic acid, Pregnenolone sulfate, 3-Chlorotyrosine, Cyclohexaneacetic acid, Choline Sulfate, Trimethylamine N-oxide, 2-Octenoylcarnitine, 1-Methylnicotinamide, Serotonin, Docosahexaenoic acid, Decanoylcarnitine, alpha-Eleostearic acid, Homo-L-arginine, Pyruvic acid, 3-hydroxydecanoylcarnitine, Ecgonine, Kynurenine, Ethyl 3-oxohexanoate, Arabinosylhypoxanthine;
model B: In[P/(1−P)]=1.11×M1+0.25×M2+0.13×M3+0.09×M4+0.05×M5−0.01×M6−0.02×M7−0.02×M8−0.04×M9−0.11×M10−0.12×M11−0.19×M12−0.3×M13−0.34×M14−0.45×M15−0.46×M16−0.64×M17−0.68×M18−0.95×M19+2.17, wherein M1-M19 are respectively relative abundances of Hypoxanthine, Alanine, 2-Ketobutyric acid, 17-Hydroxypregnenolone sulfate, Dihydrothymine, Trimethylamine N-oxide, Serotonin, 3-Chlorotyrosine, Hippuric acid, Docosahexaenoic acid, 2-Octenoylcarnitine, 1-Methylnicotinamide, Homo-L-arginine, alpha-Eleostearic acid, Kynurenine, 3-hydroxydecanoyl carnitine, Ecgonine, Ethyl 3-oxohexanoate, Arabinosylhypoxanthine;
model C: In[P/(1−P)]=1.73×M1+0.72×M2+0.31×M3+0.29×M4+0.23×M5+0.15×M6−0.05×M7−0.07×M8−0.07×M9−0.09×M10−0.09×M11−0.16×M12−0.26×M13−0.26×M14−0.27×M15−0.29×M16−0.43×M17−0.45×M18−0.56×M19−0.75×M20−0.87×M21−1.15×M22−1.41×M23+3.07, wherein M1-M23 are respectively relative abundances of Hypoxanthine, Alanine, 2-Ketobutyric acid, 17-Hydroxypregnenolone sulfate, Dihydrothymine, Xanthine, 3-Chlorotyrosine, Cyclohexaneacetic acid, Choline Sulfate, Decanoylcarnitine, Trimethylamine N-oxide, 2-Octenoylcarnitine, PyruMic acid, Docosahexaenoic acid, 1-Methylnicotinamide, Serotonin, Homo-L-arginine, alpha-Eleostearic acid, 3-hydroxydecanoyl carnitine, Ecgonine, Kynurenine, Ethyl 3-oxohexanoate, Arabinosylhypoxanthine;
model D: In[P/(1−P)]=2.35×M1+1.03×M2+0.71×M3+0.68×M4+0.48×M5+0.45×M6+0.42×M7+0.39×M8+0.16×M9+0.03×M10−0.05×M11−0.12×M12−0.17×M13−0.17×M14−0.22×M15−0.39×M16−0.43×M17−0.46×M18−0.49×M19−0.54×M20−0.54×M21−0.57×M22−0.89×M23−0.97×M24−1.07×M25−1.23×M26−1.56×M27−1.93×M28+4.16, wherein M1-M28 are respectively relative abundances of Hypoxanthine, Alanine, 2-Ketobutyric acid, 2-trans,4-cis-Decadienoylcarnitine, Xanthine, Octanoylcarnitine, 17-Hydroxypregnenolone sulfate, Dihydrothymine, Lactic acid, Pregnenolone sulfate, 3-Chlorotyrosine, Cyclohexaneacetic acid, Choline Sulfate, Trimethylamine N-oxide, Hexanoylcarnitine, 2-Octenoylcarnitine, 1-Methylnicotinamide, Serotonin, Docosahexaenoic acid, Decanoylcarnitine, alpha-Eleostearic acid, Homo-L-arginine, Pyruvic acid, 3-hydroxydecanoyl carnitine, Ecgonine, Kynurenine, Ethyl 3-oxohexanoate, Arabinosylhypoxanthine;
wherein, the cut-off values of models A to D are 0.455, 0.511, 0.452 and 0.458, respectively. The measured relative abundance of each biomarker in the blood sample is substituted into the model equation to calculate a P value. When the P value is greater than the cut-off value, it means that the possibility of the blood sample coming from a patient with lung cancer is high; and when the P value is less than the cut-off value, it means that the possibility of the blood sample from an individual with benign pulmonary nodules is high.
In some embodiments, when the model is used for distinguishing or identifying the patient with lung cancer from the healthy (without nodules) individual, the model includes one or more of the following:
model E: In[P/(1−P)]=1.41×V1+0.26×V2+0.04×V3−0.01×V4−0.05×V5−0.09×V6−0.19×V7−0.2×V8−0.32×V9−0.34×V10−0.4×V11−0.48×V12−0.55×V13−0.64×V14−1.07×V15−1.58×V16+3.44, V1-V16 are respectively relative abundances of Hypoxanthine, Alanine, 2-Ketobutyric acid, 3-hydroxybutyrylcarnitine, Nicotine, Hippuric acid, Citrulline, Trimethylamine N-oxide, alpha-Eleostearic acid, 1-Methylnicotinamide, 3-hydroxydecanoylcarnitine, Ecgonine, Ethyl 3-oxohexanoate, 2-trans,4-cis-Decadienoylcarnitine, Arabinosylhypoxanthine, Lysine;
model F: In[P/(1−P)]=1.51×V1+0.29×V2+0.06×V3−0.03×V4−0.03×V5−0.03×V6−0.07×V7−0.1×V8−0.21×V9−0.22×V10−0.33×V11−0.35×V12−0.39×V13−0.51×V14−0.59×V15−0.63×V16−1.12×V17−1.69×V18+3.65, wherein V1-V18 are respectively relative abundances of
Hypoxanthine, Alanine, 2-Ketobutyric acid, 3-hydroxybutyryl carnitine, Decanoylcarnitine, Ergothioneine, Nicotine, Hippuric acid, Trimethylamine N-oxide, Citrulline, alpha-Eleostearic acid, 1-Methylnicotinamide, 3-hydroxydecanoyl carnitine, Ecgonine, Ethyl 3-oxohexanoate, 2-trans,4-cis-Decadienoylcarnitine, Arabinosylhypoxanthine, Lysine;
model G: In[P/(1−P)]=1.67×V1+0.34×V2+0.1×V3+0.01×V4−0.08×V5−0.08×V6−0.09×V7−0.1×V8−0.12×V9−0.23×V10−0.27×V11−0.36×V12−0.36×V13−0.38×V14−0.56×V15−0.61×V16−0.66×V17−1.2×V18−1.89×V19+4, wherein V1-V19 are respectively relative abundances of
Hypoxanthine, Alanine, 2-Ketobutyric acid, Aminocaproic acid, 3-hydroxybutyryl carnitine, Ergothioneine, Decanoylcarnitine, Nicotine, Hippuric acid, Trimethylamine N-oxide, Citrulline, 3-hydroxydecanoyl carnitine, alpha-Eleostearic acid, 1-Methylnicotinamide, Ecgonine, 2-trans,4-cis-Decadienoylcarnitine, Ethyl 3-oxohexanoate, Arabinosylhypoxanthine, Lysine;
model H: In[P/(1−P)]=2.03×V1+0.47×V2+0.15×V3+0.09×V4+0.04×V5+0.03×V6+0.01×V7−0.12×V8−0.12×V9−0.13×V10−0.14×V11−0.14×V12−0.27×V13−0.36×V14−0.37×V15−0.37×V16−0.4×V17−0.43×V18−0.59×V19−0.63×V20−0.75×V21−1.37×V22−2.18×V23+4.56, wherein V1-V22 are respectively relative abundances of Hypoxanthine, Alanine, 2-Ketobutyric acid, Tiglylcarnitine, N6,N6,N6-Trimethylysine, Aminocaproic acid, Oxindole, Decanoylcarnitine, Nicotine, Ergothioneine, 3-hydroxybutyryl carnitine, Hippuric acid, Trimethylamine N-oxide, Lactic acid, Citrulline, alpha-Eleostearic acid, 3-hydroxydecanoyl carnitine, 1-Methylnicotinamide, 2-trans,4-cis-Decadienoylcarnitine, Ecgonine, Ethyl 3-oxohexanoate, Arabinosylhypoxanthine, Lysine; or
model I: In[P/(1−P)]=3.08×V1+1.26×V2+0.7×V3+0.64×V4+0.41×V5+0.4×V6+0.38×V7+0.31×V8+0.31×V9+0.1×V10+0.09×V11+0.09×V12+0.04×V13−0.04×V14−0.04×V15−0.07×V16−0.12×V17−0.17×V18−0.24×V19−0.24×V20−0.26×V21−0.31×V22−0.32×V23−0.44×V24−0.44×V25−0.49×V26−0.53×V27−0.63×V28−0.73×V29−0.79×V30−0.81×V31−0.81×V32−0.85×V33−1.2×V34−1.62×V35−1.72×V36−3.68×V37+7.11, wherein V1-V37 are respectively relative abundances of Hypoxanthine, Octanoylcarnitine, Alanine, 3-hydroxydodecanoyl carnitine, Xanthine, 2-Ketobutyric acid, Oxindole, Tiglylcarnitine, N6,N6,N6-Trimethylysine, Cyclohexaneacetic acid, Aminocaproic acid, Methylimidazoleacetic acid, Homo-L-arginine, Pyruvic acid, 2-Octenoylcarnitine, Propionylcarnitine, Nicotine, Serotonin, Phenylacetylglutamine, Hippuric acid, Ergothioneine, 3-hydroxybutyryl carnitine, alpha-Eleostearic acid, Inosine, Citrulline, Trimethylamine N-oxide, 3-hydroxyoctanoyl carnitine, 1-Methylnicotinamide, 3-hydroxydecanoyl carnitine, Hexanoylcarnitine, Decanoylcarnitine, Ecgonine, 2-trans,4-cis-Decadienoylcarnitine, Ethyl 3-oxohexanoate, Lactic acid, Arabinosylhypoxanthine, Lysine.
The cut-off values of models E to I are 0.520, 0.527, 0.450, 0.466 and 0.456, respectively. The measured relative abundance of each biomarker in the blood sample is substituted into the model equation to calculate a P value. When the P value is greater than the cut-off value, it means that the possibility of the blood sample coming from a patient with lung cancer is high; and when the P value is less than the cut-off value, it means that the possibility of the blood sample from a healthy individual without nodules is high.
Preferably, the relative abundance of the biomarker in the aforementioned model is the measured relative abundance of the biomarker in serum, such as the relative abundance values of these biomarkers in serum as detected by precision analytical instruments including LC-UV, LC-MS, GC-MS, etc; and the relative abundance of these biomarkers in serum as detected by an immune kit, etc.
In some embodiments, these models can be input into the computer system in advance, and when a detection value of a biomarker is obtained, a diagnosis result is obtained through automatic calculation by the computer system. Therefore, the invention can provide a system for distinguishing the patient with lung cancer from the patient with benign pulmonary nodules or healthy people without nodules, which includes an operation module, wherein the operation or calculation module includes a model or model equation established by one or a combination of several ones of the biomarkers.
In some embodiments, the system further includes an output module for outputting the calculation result.
In some embodiments, the system further includes an input module for inputting one or more detection results of the aforementioned biomarkers, and such detection results can be quantitative detection results or qualitative results. The established model here is not a finite model recited in the invention, as long as the biomarkers within the scope of the invention are applied to establish a model for lung cancer diagnosis, the model is within the scope of the invention. In some embodiments, the system further includes a negative control or reference data module.
In some embodiments, an ROC curve is established for each biomarker, and those biomarkers with large areas under the curve (AUC values) can be found, so that one or more biomarkers are chosen to establish a diagnostic model, so as to obtain more reliable diagnostic results. It is generally understood that the reliability of the established model may be higher when more biomarkers are selected. For example, the sensitivity is higher when the accuracy is higher and the specificity is stronger. However, a single biomarker or several important biomarkers can also be selected for diagnosis, or for preliminary screening and detection, which can also provide a reference for diagnosis to a certain extent. In particular, the diagnostic value of a single biomarker may be low, but the diagnostic value of multiple biomarkers newly discovered in the present application or the model established by a combination of the novel biomarkers and known biomarkers may be extremely high. At the same time, it is not that when the diagnostic value of a single biomarker is lower, if it is used for establishing a model of a combination of multiple biomarkers, its weight in the model is lower. For example, in identifying of the patient with lung cancer and healthy people, the AUC value of a single alanine is 0.591 and that of a single 2-trans,4-cis-Decadienoylcarnitine is 0.746, indicating that the diagnostic value of the single 2-trans,4-cis-Decadienoylcarnitine is higher than that of the single alanine. However, in the model A established from multiple biomarkers, the weight coefficients of alanine and 2-trans,4-cis-Decadienoylcarnitine are 1.02 (OR of 2.77) and 0.62 (OR of 1.86), respectively, which means that the value of alanine in the model A is greater than that of 2-trans, 4-cis-Decadienoylcarnitine. That is, when the patient with lung cancer and healthy people are distinguished by the model A, the effect of alanine on the results is greater than that of 2-trans, 4-cis-Decadienoylcarnitine. Similarly, the weight coefficients (or OR) of single biomarkers with small AUC values such as serotonin and 2-butanoic acid in the model established from multiple biomarkers may be higher than those of single biomarkers with large AUC values, which will not be listed one by one here. This further shows that: the diagnostic value of even a biomarker with a very small AUC value cannot be denied, especially that it can be used in combination with other biomarkers and may play a major role in the combined model.
A fourth aspect of the invention provides a kit for screening lung cancer. The kit contains a reagent capable of detecting one or more of the aforementioned biomarkers, which can be a blood treatment reagent, such as a reagent for filtering and extracting the aforementioned biomarkers, and also includes a reagent directly used for directly detecting the presence or present amount of the biomarkers, such as antibodies, antigens or labeling substances.
The invention is advantageous in that:
The method for screening a biomarker for lung cancer is disclosed, a biomarker for distinguishing or identifying the patient with lung cancer from healthy (without nodules) people or the patient with benign pulmonary nodules is provided, and a model for identifying or distinguishing of lung cancer is established. Particularly, the biomarker and model provided by the invention have important clinical diagnostic significance for the differential diagnosis of whether a patient with nodules in the lung has lung cancer.
The invention is advantageous in that: in the invention, small molecular differential metabolites are screened out by utilizing the method of serum metabonomics, and used as biomarkers for the differential diagnosis of lung cancer, which can be used for distinguishing a population with lung cancer from a healthy population and distinguishing a patient with lung cancer from a patient with benign pulmonary nodules, and biomarkers suitable for diagnosis of lung cancer in different genders are further selected and used according to gender. Furthermore, the invention further provides a model for accurate differential diagnosis between lung cancer and benign pulmonary nodules.
Diagnostic Method
A fourth aspect of the invention provides a method for diagnosing lung cancer, which includes detecting the presence or quantity of the aforementioned biomarker in a blood sample, so as to determine the presence of lung cancer or the possibility of suffering from lung cancer.
In some embodiments, the present number is a result obtained by comparing with that in the negative blood sample. In some embodiments, the blood sample is a serum sample.
In some embodiments, the method of diagnosing lung cancer includes a method of screening patients with lung cancer from a healthy population; or alternatively, screening patients with lung cancer from patients with pulmonary nodules; or alternatively screening patients with lung cancer from male healthy people without pulmonary nodules, or alternatively screening patients with lung cancer from male patients with pulmonary nodules; or alternatively screening patients with lung cancer from healthy female people without pulmonary nodules, or alternatively screening patients with lung cancer from female patients with pulmonary nodules. One or more of the biomarkers targeted by these different methods can be selected from the aforementioned markers of the invention.
Such specific diagnosis or detection methods all can adopt existing conventional methods, e.g., a liquid phase detection method, a mass spectrometry method, a gas phase or liquid phase combined with mass spectrometry method, or an immunization method. The immune method includes an enzyme-linked immunosorbent assay, a dry chemical method, a dry test strip method, or an electrochemical method.
Diagnostic Device or Kit
In another aspect of the present invention, provided is a kit for detecting lung cancer, which includes a reagent capable of detecting one or more of the aforementioned biomarkers, which may be a blood treatment reagent, such as a reagent for filtering and extracting the aforementioned biomarkers; and further includes a reagent directly used for detecting the presence or present quantity of biomarkers, e.g., an antibody, antigen or labeling substance.
Metabolic Pathway
The metabolic pathways in which the aforementioned metabolites involve include glycolysis, and cycles of fatty acids, carnitine, amino acids, purine, nicotine, heme, sex hormone, vitamins and tricarboxylic acids.
Therefore, on the other hand, the invention provides use of a biomarker in preparation of a reagent for diagnosing lung cancer, wherein the described biomarkers are derived from one or more of the following metabolic pathways including: glycolysis, and cycles of fatty acids, carnitine, amino acids, purines, nicotine, heme, sex hormones, vitamins and tricarboxylic acid. it is shown through the invention that, the changes of the substances involved in the aforementioned metabolic pathways are closely related to the occurrence of lung cancer, and the degree of this correlation shows a significant difference. The changes of the metabolic pathway substances may be an increase relative to normal, or a decrease relative to normal. Here, the normal refers to a healthy population without nodules or people with benign nodules. Although the changes of some specific compounds found in the invention are presented with relevance to the occurrence of lung cancer, it does not mean that other specific compounds produced by abnormalities in these metabolic pathways are not presented with relevance to the occurrence of lung cancer. In other words, when it is necessary to diagnose or prevent lung cancer, we can first start with a metabolic pathway, and then look for specific compounds or substances involved in the metabolic pathway to find novel compounds, which thus can be used for diagnosing whether lung cancer occurs or for prevention and treatment.
In some embodiments, one of or a combination of several biomarkers mentioned above can be used for diagnosis of lung cancer, and when two or more of the biomarkers are used, the diagnostic effect will be better than that of a single serum marker.
(1) Diagnosis or Detection
Here diagnosis or detection refers to detecting or assaying a biomarker in a sample, or detecting the content of a target biomarker, e.g., absolute content or relative content, and then explaining whether an individual from which the sample is provided has a certain disease or the possibility of suffering from a certain disease through the presence or quantity of the target biomarker.
Here the meanings of diagnosis and detection can be used interchangeably. The result of this detection or diagnosis cannot be directly used as the direct result of suffering from the disease, but an intermediate result. If it is wanted to obtain a direct result, a certain disease can only be confirmed further by means of other auxiliary means such as pathology or anatomy. For example, the invention provides a variety of novel biomarkers with relevance to lung cancer, and changes in the contents of these biomarkers are directly related to whether suffering from lung cancer.
(2) Relationship Between Marker and Lung Cancer
Here the relationship means that the appearance of a certain biomarker in a sample or the change of its content is directly related to a specific disease. For example the relative increase or decrease of its content indicates that the possibility of suffering from this disease is higher than that of healthy people.
If multiple different markers are found in a sample at the same time or their contents changes relatively, it indicates that the possibility of suffering from this disease is higher than that of healthy people. That is, among the species of markers, some markers have a stronger correlation with suffering from the disease, while some markers have a weaker correlation with suffering from the disease, or even some markers have no correlation with a specific disease. One or more of those markers with strong correlation can be used as biomarkers for diagnosing the disease, and those with weak correlation can be combined with strong markers to diagnose certain diseases, thereby increasing the accuracy of detection results.
For the plurality of biomarkers in the serum found by the invention, these biomarkers all can be used for distinguishing the population with lung cancer from healthy people or population with pulmonary nodules. Here, the marker can be used as a single marker for direct detection or diagnosis. Selection of such a marker indicates that the relative change in the content of the marker has a strong correlation with lung cancer. Of course, it can be understood that one or more markers with strong correlation with lung cancer can be selected for simultaneous detection. The normal understanding is that in some embodiments, the selection of highly correlated biomarkers for detection or diagnosis can achieve certain standard accuracy, such as the accuracy of 60%, 65%, 70%, 80%, 85%, 90% or 95%, then it can show that these biomarkers can obtain an intermediate value for diagnosis of a certain disease, but it does not mean that it can be directly confirmed to have the disease. For example, in the invention, among the biomarkers in Tables 2-9, the biomarker with higher VIP value can be selected as a marker for diagnosing whether suffering from lung cancer, or as a marker for screening out the population with lung cancer from healthy population or the population with pulmonary nodules, and here the population includes both people without gender differences and people with gender differences.
Of course, the one with the higher ROC value can also be selected as a diagnostic marker. The so-called strong and weak are generally confirmed by some algorithms, such as the contribution rate of the marker to lung cancer or weight analysis thereof. Such calculation methods can be significance analysis (p value or FDR value) and Fold change. Multivariate statistical analysis mainly includes principal component analysis (PCA), partial least squares discriminant analysis (PLS-DA) and orthogonal partial least squares discriminant analysis (OPLS-DA). Of course, it also includes other methods, e.g., ROC analysis and so on. Of course, other model prediction methods are also possible. During specific selection of a biomarker, the marker disclosed by the invention can be selected, or other well-known markers can be selected or used in combination.
In order to describe the invention more specifically, the technical solution of the invention will be described in detail below with reference to the accompanying drawings and specific examples. These explanations only show how the invention is implemented, but do not define the specific scope of the invention. The scope of the invention is defined in the claims.
Serum samples were collected from patients of different genders and ages and healthy people. In this study, samples of men and women aged between 38-78 were collected, including three groups of serum samples from patients with lung cancer (138 cases), patients with benign pulmonary nodules (170 cases) and healthy people (174 cases), which were matched according to gender and age.
Serum metabolites were extracted by a three-phase extraction method of methyl tert-butyl ether:methanol:water (10:3:2.5, v/v/v). The specific operation was as follows: (1) the serum sample was placed on ice and completely thawed, 50 uL of the sample was taken into a 1.5 mL EP tube, added with 225 μL of frozen methanol, and subjected to vortex for 30 seconds; (2) it was added with 750 μL of frozen MTBE, subjected to vortex for 30 seconds, and shaken on ice at 400 rpm for 1 hour; (3) it was then added with 188 μL of pure water and subjected to vortex for 1 minute; (4) it was centrifuged at 15,000 rcf for 10 minutes at 4° C.; and (5) upon centrifugation, 125 μL of the subnatant was taken into an EP tube and spin-dried with a vacuum freeze dryer, and all the dry samples of serum metabolites were stored in a refrigerator at −80° C. until testing.
Considering that there might be batch errors in sample pretreatment, in this study, processing of each batch of experimental samples was conducted simultaneously with the processing of one Reference serum for subsequent data correction. The Reference serum sample was prepared by mixing sera from 100 healthy people (healthy people referred to the people whose blood pressure, blood sugar and blood routine were all normal and that had no hepatitis B virus, and of which the physical examination results showed no obvious diseases, so that they did not need to see a doctor for treatment currently). The men and women from which the sera of 100 healthy people were derived were of the equal number, and were aged between 40-55. The subjects needed to fast overnight and forbid taking drugs 72 hours before blood collection, and individuals with past disease history and body mass indexes (BM's) outside the 95th percentile were excluded. The mixed serum was sub-packaged in 50 μL per portion and stored in a refrigerator at −80° C.
(1) Reconstitution of serum metabolites: the dry extract of serum metabolites was added with 120 μL of a reconstitution solvent (acetonitrile:water=4:1), subjected to vortex for 5 minutes, and then centrifuges at 4° C. for 15,000×g for 10 minutes, and 100 μL of the supernatant was taken into a liner tube to prepare a sample to be tested.
(2) QC sample: each 10 μL of the serum samples to be tested from patients with lung cancer, patients with benign pulmonary nodules and healthy people was taken, subjected to vortex, and mixed evenly with shaking to prepare a QC sample.
(3) Sample detection method: detection was conducted with liquid chromatography-high resolution mass spectrometry (LC-HRMS).
I. Liquid Chromatography Conditions
Chromatographic Column: BEH Amide (100×2.1 mm, 1.7 μm).
Mobile phase: in positive mode, phase A was acetonitrile; water=95:5 (10 mM ammonium acetate, 0.1% formic acid), and phase B was acetonitrile:water=50:50 (10 mM ammonium acetate, 0.1% formic acid); and in negative mode, phase A was acetonitrile:water=95:5 (10 mM ammonium acetate, pH=9.0, adjusted by aqueous ammonia), and phase B was acetonitrile: water=50:50 (10 mM ammonium acetate, pH=9.0, adjusted by aqueous ammonia).
The elution gradient was shown in Table 1 below:
II. Mass Spectrometry Conditions
The model of a mass spectrometer was Q Exactive (Thermo Fisher Scientific Company, USA), and qualitative analysis was carried out by employing an electrospray ion source (ESI), a positive and negative Fullscan mode (Full Scan) and a data dependent scan mode (ddMS2). The spray voltage was+3,800/-3,200 V; the atomization temperature was 350° C.; high-purity nitrogen was used as sheath gas and auxiliary gas, and the parameters were set to 40 arb and 10 arb; respectively; the temperature of ion transfer tube was 320° C.; the mass scanning range was 70-1,050 m/z; the primary scan resolution was 70,000 FWHM, and the secondary scan resolution was 35,000 FWHM.
III. Injecting Method
Before each detection, six syringe volumes of the QC sample were injected to stabilize the detection system. The serum sample was injected in a random manner, in which testing of one syringe volume of the QC sample was inserted every injection of 10 syringe volumes of the serum samples. The first syringe and last syringe in the detection sequence were both the QC sample. Finally, the QC sample was subjected to full scanning and segmented scanning by ddMS2 for compound identification.
(4) Data Preprocessing
I. Raw Data Matrix
The raw data of each sample included total ion current data and mass spectrum data (as shown in
II. Excision and Interpolation of Data Missing Values
There were often data missing values in the original data matrix of metabonomics, which were mainly related to the detection of background noise, peak extraction and peak alignment methods of mass spectrometry, etc. Too many zero or missing values would bring difficulties to downstream analysis. Therefore, characteristic ions with missing values greater than 50% in all samples were generally excised, and the missing values of other compounds were interpolated. In this study, MetaboAnalyst 5.0 analysis software was used for processing the missing values, and a K-Nearest Neighbours (KNN) manner was selected for interpolation.
III. Data Correction and Filtering
A large amount of sample pretreatment was inevitably limited by the throughput of experimental treatment, so it was necessary to carry out sample pretreatment in batches. However, due to the multifarious types of metabolites, large differences in physical and chemical properties, and expensive isotope internal standards, it was difficult to choose an appropriate isotope internal standard that could meet the full coverage. Aiming at this problem, this study selected a reference serum that is processed simultaneously with the batch processing as a natural “like internal standard” to correct batch errors caused by pretreatment. That was, the original data of the experimental samples of each pretreatment batch was normalized based on the data of the Reference serum of the corresponding batch to obtain the relative abundance of each characteristic ion, and the characteristic ion with RSD>30% in the QC sample was deleted to obtain the final analysis data matrix.
Metabonomics generally adopted the combination of univariate analysis and multivariate statistical analysis to screen differential metabolites, in which the univariate analysis mainly included significance analysis (p value or FDR value) and Fold change of characteristic ions in different groups, while the multivariate statistical analysis mainly included principal component analysis (PCA), partial least squares discriminant analysis (PLS-DA) and orthogonal partial least squares discriminant analysis (OPLS-DA).
Before statistical analysis, the data should be properly normalized, transformed and scaled. In this study, MetaboAnalyst 5.0 analysis software was used for statistical analysis, and data normalization by the sum, Log transformation and Auto scaling were carried out. Partial least squares discriminant analysis (PLS-DA) was performed on the three groups of lung cancer, benign pulmonary nodules and healthy people (as shown in
Further, PLS-DA analysis between two groups of patients with lung cancer and healthy people, patients with lung cancer and patients with benign pulmonary nodules was carried out (as shown in
According to the screening criteria of the differential metabolite: (1) VIP>1; (2) when FDR<0.05, i.e. VIP>1 and FDR<0.05, it was judged that there was a significant difference in the metabolite between the two groups, and the metabolite was the differential metabolite between the two groups. Moreover, in the process of screening the differential metabolites, it was found that the differential metabolites for different genders were different, so the differential metabolites were further differentiated according to gender.
The main significant differential metabolites found by the invention were:
1. differential metabolites between the group of patients with lung cancer and the group of healthy people were shown in Table 2 below.
2. Different metabolites between group of patients with lung cancer and the group of patients with benign pulmonary nodules were shown in Table 3 below.
3. differential metabolites between the group of patients with lung cancer and the group of healthy people in men were shown in Table 4 below.
4. Different metabolites between group of patients with lung cancer and the group of patients with benign pulmonary nodules in men were shown in Table 5 below.
5. differential metabolites between patients with lung cancer and healthy people in women were shown in Table 6 below.
6. Different metabolites between group of patients with lung cancer and the group of patients with benign pulmonary nodules in women were shown in Table 7 below.
It could be seen from comparison of Tables 2 and 3 that:
(1) The following metabolites had significant differences between patients with lung cancer and patients with benign pulmonary nodules and between patients with lung cancer and healthy people: alpha-Eleostearic acid, 2-Ketobutyric acid, 2-Octenoylcarnitine, 2-trans, 4-cis-Decadienoylcarnitine, 3-Chlorotyrosine, 3-hydroxydecanoyl carnitine, 3-hydroxydodecanoyl carnitine, 3-hydroxyoctanoyl carnitine, Acetophenone, Arabinosylhypoxanthine, Cyclohexaneacetic acid, Dihydroxybenzoic acid, Docosahexaenoic acid, Ecgonine, Ethyl 3-oxohexanoate, Hexanoylcarnitine, Hippuric acid, Homo-L-arginine, Hypoxanthine, Lactic acid, N-Acetyl-L-alanine, Octanoylcarnitine, 5-Oxoproline, Pyruvic acid, Serotonin, Succinic acid semialdehyde, Xanthine;
(2) The following metabolites had significant differences between patients with lung cancer and patients with benign pulmonary nodules, but not between patients with lung cancer and healthy people: 1-Methylnicotinamide, 2-Pyrrolidone, 4-oxo-Retinoic acid, 7-Methylguanine, Acetylcarnitine, Bilirubin, Choline Sulfate, cis-5-Tetradecenoylcarnitine, Citrulline, Creatinine, Diethylamine, Dihydrothymine, Glutamine, Hydroxybutyric acid, Inosine, Kynurenine, Linoleyl carnitine, Lysine, Trimethylamine N-oxide.
It could be seen from comparison of Tables 4 and 5 that:
(1) The following metabolites had significant differences both between patients with lung cancer and patients with benign pulmonary nodules in men and between patients with lung cancer and healthy people in men: 1-Methylnicotinamide, 2-trans,4-cis-Decadienoylcarnitine, 3-hydroxydecanoyl carnitine, 3-hydroxydodecanoyl carnitine, 3-hydroxyoctanoyl carnitine, 4-oxo-Retinoic acid, 7-Methylguanine, Acetylcarnitine, alpha-Eleostearic acid, Arabinosylhypoxanthine, Cyclohexaneacetic acid, Diethylamine, Docosahexaenoic acid, Ecgonine, Ethyl 3-oxohexanoate, Glutamine, Hippuric acid, Homo-L-arginine, Hypoxanthine, Inosine, Linoleyl carnitine, N-Acetyl-L-alanine, Octanoylcarnitine, 5-Oxoproline, Pyruvic acid, Trimethylamine N-oxide;
(2) The following metabolites had significant differences between patients with lung cancer and patients with benign pulmonary nodules in men, but not between patients with lung cancer and healthy people in men: 2-Octenoylcarnitine, 3-hydroxybutyryl carnitine, Aminoadipic acid, Bilirubin, Dihydrothymine, Ergothioneine, Lactic acid, N6,N6,N6-Trimethylysine, Nicotine.
It could be seen from comparison of Tables 6 and 7 that:
(1) The following metabolites had significant differences both between patients with lung cancer and patients with benign pulmonary nodules in women and between patients with lung cancer and healthy people in women: 1-Methylnicotinamide, 2-Ketobutyric acid, 2-Pyrrolidone, 3-Chlorotyrosine, 3-hydroxydecanoyl carnitine, 3-hydroxyoctanoyl carnitine, 4-oxo-Retinoic acid, 7-Methylguanine, Acetophenone, Arabinosylhypoxanthine, Choline Sulfate, Citrulline, Creatinine, Cyclohexaneacetic acid, Ecgonine, Ethyl 3-oxohexanoate, Hexanoylcarnitine, Hippuric acid, Homo-L-arginine, Hypoxanthine, Inosine, Lactic acid, Lysine, Octanoylcarnitine, 5-Oxoproline, Serotonin, Succinic acid semialdehyde, Trimethylamine N-oxide, Xanthine;
(2) The following metabolites had significant differences between patients with lung cancer and patients with benign pulmonary nodules in women, but not between patients with lung cancer and healthy people in women: 2-Octenoylcarnitine, cis-5-Tetradecenoylcarnitine, Kynurenine, Phenylalanine.
It could be seen from comparison of Tables 3, 5 and 7 that:
there were both common ones and different ones of the differential metabolites between patients with lung cancer and patients with benign pulmonary nodules in men and women. The differential metabolite that was common in the patients with lung cancer and the patients with benign pulmonary nodules in men and women, and differential metabolites specific to the patients with lung cancer and the patients with benign pulmonary nodules in men or women were shown in
(1) The metabolite that was common between patients with lung cancer and patients with benign pulmonary nodules in men and women included: 1-Methylnicotinamide, 2-Octenoylcarnitine, 3-hydroxydecanoyl carnitine, 3-hydroxyoctanoyl carnitine, 4-oxo-Retinoic acid, 7-Methylguanine, Arabinosylhypoxanthine, Cyclohexaneacetic acid, Ecgonine, Ethyl 3-oxohexanoate, Hippuric acid, Homo-L-arginine, Hypoxanthine, Inosine, Lactic acid, Octanoylcarnitine, 5-Oxoproline, Trimethylamine N-oxide;
(2) The metabolite that had significant differences between patients with lung cancer and patients with benign pulmonary nodules in men, but not in women, included: alpha-Eleostearic acid, 2-trans,4-cis-Decadienoylcarnitine, 3-hydroxydodecanoyl carnitine, Acetylcarnitine, Bilirubin, Diethylamine, Dihydrothymine, Docosahexaenoic acid, Glutamine, Linoleyl carnitine, N-Acetyl-L-alanine, Pyruvic acid, 3-hydroxybutyryl carnitine, Aminoadipic acid, Ergothioneine, N6,N6,N6-Trimethylysine, Nicotine;
(3) The metabolite that had significant differences between patients with lung cancer and patients with benign pulmonary nodules in women, but not in men, included: 2-Ketobutyric acid, 2-Pyrrolidone, 3-Chlorotyrosine, Acetophenone, Choline Sulfate, cis-5-Tetradecenoylcarnitine, Citrulline, Creatinine, Hexanoylcarnitine, Kynurenine, Lysine, Serotonin, Succinic acid semialdehyde, Xanthine, Phenylalanine.
It could be seen from comparison of Tables 2 to 7 that:
(1) the specific metabolite between patients with lung cancer and healthy people in men included: 3b,16a-Dihydroxyandrostenone sulfate, Isoleucine, Leucine, Tyrosine;
(2) the specific metabolite between patients with lung cancer and patients with benign pulmonary nodules in men included: 3-hydroxybutyryl carnitine, Aminoadipic acid, Ergothioneine, Nicotine;
(3) the specific metabolite between patients with lung cancer and healthy people in women included: Alanine, Asparagine, Propionylcarnitine, Urocanic acid;
(4) the specific metabolite between patients with lung cancer and patients with benign pulmonary nodules in women included: Phenylalanine. Here, the specific differential metabolite referred to that these differential metabolites only had significant differences between two specific groups, but not between other groups.
1. The model for differential diagnosis of lung cancer and benign pulmonary nodules by a single differential metabolite and establishment thereof
An ROC curve of each metabolite was established, and the quality of the experimental results was judged by the area under the curve (AUC). The AUC of 0.5 indicated that the single metabolite had no diagnostic value; the AUC greater than 0.5 indicated that the single metabolite had a diagnostic value; and the diagnostic value of the single metabolite was higher when the AUC was larger.
The respective metabolites in Tables 3, 5 and 7 were analyzed by ROC curve, and the ROC values and related information of each metabolite were shown in Tables 8, 9 and 10, respectively:
2. The model for differential diagnosis of lung cancer and benign pulmonary nodules by a combination of multiple differential metabolites and establishment thereof
Based on the relative abundance of different metabolites between lung cancer and pulmonary nodules in Table 3, a model for differential diagnosis of lung cancer and benign pulmonary nodules was established by using binary logistic regression (SPSS software), and the forward maximum likelihood method (LR) was adopted to screen the optimum model parameters (SPSS software) for differential diagnosis of lung cancer and pulmonary nodules. As a result, a prediction model A (applicable to both men and women) was obtained.
The odds ratio (OR) referred to the ratio of occurrence and non-occurrence of lung cancer, which was an indicator of the correlation strength between lung cancer and a predictive variable. OR>1 indicated that with the increase of this variable, the probability of occurrence of lung cancer was increased, which was a “positive” correlation; OR<1 indicated that with the increase of this variable, the probability of occurrence of lung cancer was decreased, which was a “negative” correlation; and OR=1 indicated that there was no correlation between the disease and exposure. In logistic regression, the coefficient obtained by us was the logarithm of the OR value. p<0.05 in the table showed that this variable played a significant role in the model.
The variables and parameters of model A were listed in Table 11 below:
Finally, the resultant equation of model A was: Logit(P)=In[P/((1−P)]=5.553×V04+2.92×V05+2.713×V06−0.332×V07−1.798×V10−7.922×V13−0.593×V14+0.643×V17−2.187×V19−0.992×V20−2.352×V33−1.441×V38+7.214×V39−1.22×V40−1.235×V42+1.61, and the cut-off value of P was 0.424 (that is, when P>0.424, lung cancer was diagnosed). As shown in
Further, considering the gender factor, a model B for differential diagnosis of lung cancer and pulmonary nodules in men and a model C for differential diagnosis of lung cancer and benign pulmonary nodules in women were established according to Tables 5 and 7, respectively.
The variables and parameters of model B were listed in Table 12 below:
The equation of model B was: Logit(P)=In[P/((1−P)]=6.283×MV02−0.646×MV10−2.758×MV13+1.864×MV15−1.126×MV19−1.145×MV27−3.918×MV30+1.494, wherein the cut-off value of P was 0.701, and when P>0.701, it indicated that a man with nodules was a patient with lung cancer. As shown in
The variables and parameters of model C were listed in Table 13 below:
The equation of model C was: Logit(P)=In[P/((1−P)]=10.742×FV05−1.031×FV08−7.442×FV09+11.839×FV13−2.617×FV15−3.030×FV20−1.413×FV23−2.278×FV29−6.905, wherein the cut-off value of P was 0.629, and when P>0.629, it indicated that a woman with nodules was a patient with lung cancer. As shown in
3. The model for differential diagnosis of lung cancer and benign pulmonary nodules by a combination of all differential metabolites and establishment thereof
Based on the relative abundance of the differential metabolite in lung cancer and pulmonary nodules from Table 3, a model D for differential diagnosis with all differential metabolites between lung cancer and benign pulmonary nodules was established by using binary logistic regression (MetaboAnalyst software), and 10-fold Cross-Validation was adopted. The variables and parameters of model D were listed in Table 14 below:
The equation of the model D was: Logit(P)=In[PA(1−P)]=−17.026×V01+17.418×V02+0.2×V03+6.45×V04+1.479×V05+3.762×V06−0.337×V07−0.096×V08−0.681×V09−2.144×V10+0.654×V11−0.833×V12−10.388×V13−1.051×V14−1.526×V15+1.505×V16+1.806×V17+0.519×V18−2.051×V19−0.86×V20−0.552×V21−3.683×V22+0.091×V23−0.721×V24+1.43×V25+0.572×V26+1.466×V27−1.097×V28+0.272×V29−0.315×V30−1.12×V31−2.83×V32−2.85×V33+0.993×V34+2.321×V35−0.71×V36−0.616×V37−1.711×V38+9.051×V39−1.52×V40+0.302×V41−1.688×V42−0.739×V43−0.152×V44−0.282×V45−0.085×V46+7.81, wherein the cut-off value of P was 0.21, ROC analysis was conducted and as shown in
We utilized Model A of Example 5 to predict 30 cases of lung cancer and benign pulmonary nodules that were randomly selected inside and outside the hospital and not participating in establishment of the model. As shown in
The results here were only preliminary prediction results. If the sample size was increased, the prediction results might be more accurate, but this did not deny that these markers found in the invention were biomarkers that could be used for diagnosing whether suffering from lung cancer.
Screening and Detection of Additional Novel Markers
Serum samples were collected from patients of different genders and ages and healthy people. In this study, samples of people aged between 38-78 were collected, including three groups of serum samples from patients with lung cancer (136 cases), patients with benign pulmonary nodules (170 cases) and healthy people (174 cases).
Serum metabolites were extracted by a three-phase extraction method of methyl tert-butyl ether:methanol:water (10:3:2.5, v/v/v). The specific operation was as follows: (1) the serum sample was placed on ice and completely thawed, 50 uL of the sample was taken into a 1.5 mL EP tube, added with 225 μL of frozen methanol, and subjected to vortex for 30 seconds; (2) it was added with 750 μL of frozen MTBE, subjected to vortex for 30 seconds, and shaken on ice at 400 rpm for 1 hour; (3) it was then added with 188 μL of pure water and subjected to vortex for 1 minute; (4) it was centrifuged at 15,000 rcf for 10 minutes at 4° C.; and (5) upon centrifugation, 125 μL of the subnatant was taken into an EP tube and spin-dried with a vacuum freeze dryer, and all the dry samples of serum metabolites were stored in a refrigerator at −80° C. until testing.
Considering that there might be batch errors in sample pretreatment, in this study, processing of each batch of experimental samples was conducted simultaneously with the processing of one Reference serum for subsequent data correction. The Reference serum sample was prepared by mixing sera from 100 healthy people (healthy people referred to the people whose blood pressure, blood sugar and blood routine were all normal and that had no hepatitis B virus, and of which the physical examination results showed no obvious diseases, so that they did not need to see a doctor for treatment currently). The men and women from which the sera of 100 healthy people were derived were of the equal number, and were aged between 40-55. The subjects needed to fast overnight and forbid taking drugs 72 hours before blood collection, and individuals with past disease history and body mass indexes (BM's) outside the 95th percentile were excluded. The mixed serum was sub-packaged in 50 μL per portion and stored in a refrigerator at −80° C.
(1) Reconstitution of serum metabolites: the dry extract of serum metabolites was added with 120 μL of a reconstitution solvent (acetonitrile: water=4:1), subjected to vortex for 5 minutes, and then centrifuges at 4° C. for 15,000×g for 10 minutes, and 100 μL of the supernatant was taken into a liner tube to prepare a sample to be tested.
(2) QC sample: each 10 μL of the serum samples to be tested from patients with lung cancer, patients with benign pulmonary nodules and healthy people was taken, subjected to vortex, and mixed evenly with shaking to prepare a QC sample.
(3) Sample detection method: detection was conducted with liquid chromatography-high resolution mass spectrometry (LC-HRMS).
I. Liquid Chromatography Conditions
Chromatographic Column: BEH Amide (100×2.1 mm, 1.7 μm).
Mobile phase: in positive mode, phase A was acetonitrile; water=95:5 (10 mM ammonium acetate, 0.1% formic acid), and phase B was acetonitrile: water=50:50 (10 mM ammonium acetate, 0.1% formic acid); and in negative mode, phase A was acetonitrile: water=95:5 (10 mM ammonium acetate, pH =9.0, adjusted by aqueous ammonia), and phase B was acetonitrile: water=50:50 (10 mM ammonium acetate, pH=9.0, adjusted by aqueous ammonia).
The elution gradient was shown in the table below:
II. Mass Spectrometry Conditions
The model of a mass spectrometer was Q Exactive (Thermo Fisher Scientific Company, USA), and qualitative analysis was carried out by employing an electrospray ion source (ESI), a positive and negative Fullscan mode (Full Scan) and a data dependent scan mode (ddMS2). The spray voltage was +3,800/−3,200 V; the atomization temperature was 350° C.; high-purity nitrogen was used as sheath gas and auxiliary gas, and the parameters were set to 40 arb and 10 arb; respectively; the temperature of ion transfer tube was 320° C.; the mass scanning range was 70-1,050 m/z; the primary scan resolution was 70,000 FWHM, and the secondary scan resolution was 35,000 FWHM.
III. Injecting Method
Before each detection, six syringe volumes of the QC sample were injected to stabilize the detection system. The serum sample was injected in a random manner, in which testing of one syringe volume of the QC sample was inserted every injection of 10 syringe volumes of the serum samples. The first syringe and last syringe in the detection sequence were both the QC sample. Finally, the QC sample was subjected to full scanning and segmented scanning by ddMS2 for compound identification.
(4) Data Preprocessing
I. Raw Data Matrix
The raw data of each sample included total ion current data and mass spectrum data (as shown in
II. Excision and Interpolation of Data Missing Values
There were often data missing values in the original data matrix of metabonomics, which were mainly related to the detection of background noise, peak extraction and peak alignment methods of mass spectrometry, etc. Too many zero or missing values would bring difficulties to downstream analysis. Therefore, characteristic ions with missing values greater than 50% in all samples were generally excised, and the missing values of other compounds were interpolated. In this study, MetaboAnalyst 5.0 analysis software was used for processing the missing values, and 1/5 of the minimum value was selected for interpolation.
III. Data Correction and Filtering
A large amount of sample pretreatment was inevitably limited by the throughput of experimental treatment, so it was necessary to carry out sample pretreatment in batches. However, due to the multifarious types of metabolites, large differences in physical and chemical properties, and expensive isotope internal standards, it was difficult to choose an appropriate isotope internal standard that could meet the full coverage. Aiming at this problem, this study selected a reference serum that is processed simultaneously with the batch processing as a natural “like internal standard” to correct batch errors caused by pretreatment. That was, the original data of the experimental samples of each pretreatment batch was normalized based on the data of the Reference serum of the corresponding batch to obtain the relative abundance of each characteristic ion, and the characteristic ion with RSD>30% in the QC sample was deleted to obtain the final analysis data matrix.
Metabonomics generally adopted the combination of univariate analysis and multivariate statistical analysis to screen differential metabolites, in which the univariate analysis mainly included significance analysis (p value or FDR value) and Fold change of characteristic ions in different groups, while the multivariate statistical analysis mainly included principal component analysis (PCA), partial least squares discriminant analysis (PLS-DA) and orthogonal partial least squares discriminant analysis (OPLS-DA). Before statistical analysis, the data should be properly normalized, transformed and scaled. In this study, MetaboAnalyst 5.0 analysis software was used for statistical analysis, and data normalization by the sum, Log transformation and Auto scaling were carried out. Partial least squares discriminant analysis (PLS-DA) was performed on the three groups of lung cancer, benign pulmonary nodules and healthy people (as shown in
According to the screening criteria of the differential metabolite: (1) Fold change>1.2 or <0.83; (2) when FDR<0.05, i.e. Fold change>1.2 or <0.83 and FDR<0.05, it was judged that there was a significant difference in the metabolite between the two groups, and the metabolite was the differential metabolite between the two groups.
It should be noted that when using the method of serum metabolomics to screen differential metabolites, the screened differential metabolites would be affected by many factors, including: the sample size, such as differences in sample sizes and sources of the patients with lung cancer, the patients with benign pulmonary nodules and healthy people in this application, and the like, which each could affect the final results; sample treatment methods, wherein for example different substances would be obtained by using different extraction solvents, which would also lead to different detection results; liquid chromatography and mass spectrometry conditions, wherein the compounds detected under different liquid chromatography and/or mass spectrometry conditions were different; and data analysis methods, wherein differential metabolites obtained by employing different statistical analysis methods would also be different. Furthermore, the effect of the combination of these influencing factors would be more complicated, so it was impossible to predict the results of the final screened differential metabolites. Particularly, in the method of screening lung cancer biomarkers based on serum metabonomics in the present application, the missing value was processed by using MetaboAnalyst 5.0 analysis software, and 1/5 of the minimum value was selected for interpolation. Under the aforementioned screening criteria, even if the number of samples was further increased on the basis of the existing sample size, the obtained differential metabolites hardly changed, which indicated that the screening method in the present application was relatively stable and the obtained differential metabolites had high representativeness. The main differential metabolites found in the present application were shown in the table below.
It could be seen from the data in the table that:
(1) The metabolites that had significant differences both between the lung cancer group and the healthy (without nodules) group and between the lung cancer group and the group with benign pulmonary nodules included: 2-trans,4-cis-Decadienoylcarnitine, Octanoylcarnitine, Decanoylcarnitine, 2-Octenoylcarnitine, Hexanoylcarnitine, 3-hydroxydodecanoylcarnitine, 3-hydroxydecanoylcarnitine, 3-hydroxyoctanoylcarnitine, Ecgonine, Trimethylamine N-oxide, 1-Methylnicotinamide, 3-Chlorotyrosine, Homo-L-arginine, Serotonin, Alanine, alpha-Eleostearic acid, Ethyl 3-oxohexanoate, Inosine, Arabinosylhypoxanthine, Hippuric acid, Cyclohexaneacetic acid, Lactic acid, 2-Ketobutyric acid, Pyruvic acid, Hypoxanthine, Xanthine.
(2) The metabolites that had significant differences only between the lung cancer group and the healthy (without nodules) group, but not between the lung cancer group and the group with benign pulmonary nodules included: N6,N6,N6-Trimethylysine, Tiglylcarnitine, Propionylcarnitine, 3-hydroxybutyrylcarnitine, Oxindole, Nicotine, Ergothioneine, Phenylacetylglutamine, Citrulline, Lysine, Aminocaproic acid, Methylimidazoleacetic acid.
(3) The metabolites that had significant differences only between the lung cancer group and the group with benign pulmonary nodules, but not between the lung cancer group and the healthy (without nodules) group included: Kynurenine, cis-5-Tetradecenoylcarnitine, Docosahexaenoic acid, Choline Sulfate, Dihydrothymine, 17-Hydroxypregnenolone sulfate, Pregnenolone sulfate.
(4) The differential metabolites in the table were classified into types of acyl carnitine, amines, alkaloids, fatty acids, amino acids, etc., which indicated that these types of serum metabolites were highly likely to be related to lung cancer. Moreover, the related metabolic pathways of the differential metabolites in the table and other metabolites in the pathways were also highly likely to be related to lung cancer, so they could be given priority in consideration during the process of finding biomarkers of lung cancer.
1. The model for differential diagnosis between lung cancer and benign pulmonary nodules or between patients with lung cancer and healthy people by a single differential metabolite and establishment thereof
An ROC curve of each differential metabolite was established, and the quality of the experimental results was judged by the area under the curve (AUC). The AUC less than or equal to 0.5 indicated that the single differential metabolite had no diagnostic value; the AUC greater than 0.5 indicated that the single differential metabolite had a diagnostic value; and the diagnostic value of the single differential metabolite was higher when the AUC was larger.
The respective differential metabolites in Table 16 were analyzed by ROC curve, and the ROC values and related information of them were shown in Tables 17 and 18, respectively:
The ROC values of the differential metabolites between lung cancer samples and healthy (without nodules) samples in Table 3 were all equal to and greater than 0.5, indicating that these differential metabolites had a certain value for distinguishing the lung cancer samples from the healthy (without nodules) samples, and could be used as biomarkers for the auxiliary diagnosis of lung cancer. The higher AUC value in the table indicated the higher diagnostic value of the single biomarker, and the greater value or reference significance of it that might be obtained when it was used for diagnosis alone or in combination with multiple biomarkers. Meanwhile, for biomarkers with relatively small AUC values, their diagnostic or identifying significance cannot be denied, and use of single ones of them also provided a possibility or reference to a certain extent. Moreover, when multiple biomarkers with relatively small AUC values were combined together, the value of combined diagnosis was often higher, even reaching the accuracy of 80%, 90% and above. Therefore, any biomarker in Table 17 had more or less significance in distinguishing the lung cancer samples from the health (without nodules) samples or auxiliary diagnosis of lung cancer. Similarly, the biomarkers in Table 18 had the same value or significance for the diagnosis or identification of benign pulmonary nodules and lung cancer.
2. The model for identifying lung cancer samples and healthy (without nodules) samples or samples with benign pulmonary nodules by a combination of multiple differential metabolites and establishment thereof
Based on the serum detection values of differential metabolites in the lung cancer samples and the samples with pulmonary nodules in Table 15, a model for differential diagnosis between the lung cancer samples and the healthy (without nodules) samples or between the lung cancer samples and the samples with benign pulmonary nodules was established by utilizing binary logistic regression (LASSO algorithm, R language). The models for distinguishing lung cancer from benign pulmonary nodules included model A, model B, model C and model D. The models for distinguishing individuals with lung cancer from healthy individuals included: model E, model F, model G, model H and model I.
I. The variables and parameters of model A were listed in the table below:
The equation of model A was: In[P/(1−P)]=2.29×M1+1.02×M2+0.64×M3+0.62×M4+0.47×M5+0.42×M6+0.38×M7+0.26×M8+0.05×M9+0.03×M10−0.05×M11−0.12×M12−0.16×M13−0.17×M14−0.36×M15−0.4×M16−0.45×M17−0.46×M18−0.47×M19−0.53×M20−0.55×M21−0.79×M22−0.95×M23−1.02×M24−1.19×M25−1.52 ×M26−1.88×M27+4.01.
The cut-off value of P was 0.455. That was, the serum detection values (relative abundances) of the aforementioned markers was substituted into the equation of model A for calculation. When P>0.455, it was identified as lung cancer; and when P≤0.455, it was identified as benign pulmonary nodules.
As shown in
II. The variables and parameters of model B were listed in the table below:
The equation of model B was: In[P/(1−P)]=1.11×M1+0.25×M2+0.13×M3+0.09×M4 +0.05×M5−0.01×M6−0.02×M7−0.02×M8−0.04×M9−0.11×M10−0.12×M11−0.19×M12−0.3×M13−0.34×M14−0.45×M15−0.46×M16−0.64×M17−0.68×M18−0.95×M19+2.17.
The cut-off value of P was 0.511. That was, the serum detection values (relative abundances) of the aforementioned markers was substituted into the equation of model B for calculation. When P>0.511, it was identified as lung cancer; and when P≤0.511, it was identified as benign pulmonary nodules. ROC analysis was conducted (as shown in
III. The variables and parameters of model C were listed in the table below:
The equation of model C was: In[P/(1−P)]=1.73×M1+0.72×M2+0.31×M3+0.29×M4+0.23×M5+0.15×M6−0.05×M7−0.07×M8−0.07×M9−0.09×M10−0.09×M11−0.16×M12−0.26×M13−0.26×M14−0.27×M15−0.29×M16−0.43×M17−0.45×M18−0.56×M19−0.75×M20−0.87×M21−1.15×M22−1.41×M23+3.07.
The cut-off value of P was 0.452. That was, the serum detection values (relative abundances) of the aforementioned markers was substituted into the equation of model C for calculation. When P>0.452, it was identified as lung cancer; and when P≤0.452, it was identified as benign pulmonary nodules. ROC analysis was conducted (as shown in
IV. The variables and parameters of model D were listed in the table below:
The equation of model D was: In[P/(1−P)]=2.35×M1+1.03×M2+0.71×M3+0.68×M4+0.48×M5+0.45×M6+0.42×M7+0.39×M8+0.16×M9+0.03×M10−0.05×M11−0.12×M12−0.17×M13−0.17×M14−0.22×M15−0.39×M16−0.43×M17−0.46×M18−0.49×M19−0.54×M20−0.54×M21−0.57×M22−0.89×M23−0.97×M24−1.07×M25−1.23×M26−1.56×M27−1.93×M28+4.16.
The cut-off value of P was 0.458. That was, the serum detection values (relative abundances) of the aforementioned markers was substituted into the equation of model D for calculation. When P>0.458, it was identified as lung cancer; and when P≤0.458, it was identified as benign pulmonary nodules. ROC analysis was conducted (as shown in
V. The variables and parameters of model E were listed in the table below:
The equation of model E was: In[P/(1−P)]=In[P/(1−P)]=1.41×V1+0.26×V2+0.04×V3−0.01×V4−0.05×V5−0.09×V6−0.19×V7−0.2×V8−0.32×V9−0.34×V10−0.4×V11−0.48×V12−0.55×V13−0.64×V14−1.07×V15−1.58×V16+3.44.
The cut-off value of P was 0.520. That was, the serum detection values (relative abundances) of the aforementioned markers was substituted into the equation of model E for calculation. When P>0.520, it was identified as lung cancer; and when P≤0.520, it was identified as healthy people.
As shown in
VI. The variables and parameters of model F were listed in the table below:
The equation of model F was: In[P/(1−P)]=1.51×V1+0.29×V2+0.06×V3−0.03×V4−0.03×V5−0.03×V6−0.07×V7−0.1×V8−0.21×V9−0.22×V10−0.33×V11−0.35×V12−0.39×V13−0.51×V14−0.59×V15−0.63×V16−1.12×V17−1.69×V18+3.65.
The cut-off value of P was 0.527. That was, the serum detection values (relative abundances) of the aforementioned markers was substituted into the equation of model F for calculation. When P>0.527, it was identified as lung cancer; and when P≤0.527, it was identified as healthy people. ROC analysis was conducted (as shown in
VII. The variables and parameters of model G were listed in the table below:
The equation of model G was: In[P/(1−P)]=1.67×V1+0.34×V2+0.1×V3+0.01×V4−0.08×V5−0.08×V6−0.09×V7−0.1×V8−0.12×V9−0.23×V10−0.27×V11−0.36×V12−0.36×V13−0.38×V14−0.56×V15−0.61×V16−0.66×V17−1.2×V18−1.89×V19+4.
The cut-off value of P was 0.450. That was, the serum detection values (relative abundances) of the aforementioned markers was substituted into the equation of model G for calculation. When P>0.450, it was identified as lung cancer; and when P≤0.450, it was identified as healthy people. ROC analysis was conducted (as shown in
IX. The variables and parameters of model H were listed in the table below:
The equation of model H was: In[P/(1−P)]=2.03×V1+0.47×V2+0.15×V3+0.09×V4+0.04×V5+0.03×V6+0.01×V7−0.12×V8−0.12×V9−0.13×V10−0.14×V11−0.14×V12−0.27×V13−0.36×V14−0.37×V15−0.37×V16−0.4×V17−0.43×V18−0.59×V19−0.63×V20−0.75×V21−1.37×V22−2.18×V23+4.56.
The cut-off value of P was 0.466. That was, the serum detection values (relative abundances) of the aforementioned markers was substituted into the equation of model H for calculation. When P>0.466, it was identified as lung cancer; and when P≤0.466, it was identified as healthy people. ROC analysis was conducted (as shown in
X. The variables and parameters of model I were listed in the table below:
The equation of model I was: In[P/(1−P)]=3.08×V1+1.26×V2+0.7×V3+0.64×V4+0.41×V5+0.4×V6+0.38×V7+0.31×V8+0.31×V9+0.1×V10+0.09×V11+0.09×V12+0.04×V13−0.04×V14−0.04×V15−0.07×V16−0.12×V17−0.17×V18−0.24×V19−0.24×V20−0.26×V21−0.31×V22−0.32×V23−0.44×V24−0.44×V25−0.49×V26−0.53×V27−0.63×V28−0.73×V29−0.79×V30−0.81×V31−0.81×V32−0.85×V33−1.2×V34−1.62×V35−1.72×V36−3.68×V37+7.11.
The cut-off value of P was 0.456. That was, the serum detection values (relative abundances) of the aforementioned markers was substituted into the equation of model I for calculation. When P>0.456, it was identified as lung cancer; and when P≤0.456, it was identified as healthy people. ROC analysis was conducted (as shown in
The models A to I described above were just illustrative examples for showing models for auxiliary diagnosis of lung cancer as established by a combination of multiple biomarkers. These models are of great value for the differential diagnosis of lung cancer. It should be noted that the aforementioned models are not exhaustive, and the models established by selecting and combining multiple biomarkers from Table 3 or multiple biomarkers from Table 4 should fall within the scope of the present application, and also had diagnostic values. Meanwhile, it was not limited to the biomarkers in Table 3 or 4, and the biomarkers in Table 3 or 4 could also be selected to establish a model for auxiliary diagnosis of lung cancer together with known biomarkers.
Number | Date | Country | Kind |
---|---|---|---|
202011077018.3 | Oct 2020 | CN | national |
The present application claims the priority of the prior Chinese application No. 2020110770183 filed on Oct. 10, 2020, all contents of which are incorporated as a part of the present application, and which is a partial continuation of PCT application No. PCT/CN2021/122812 filed on Oct. 9, 2021.
Number | Date | Country | |
---|---|---|---|
Parent | PCT/CN2021/122812 | Oct 2021 | US |
Child | 18073905 | US |