PREDICTION SYSTEM FOR IDENTIFYING KEY HETEROGENEOUS MOLECULES DRIVING TUMOR METASTASIS

Information

  • Patent Application
  • 20240221867
  • Publication Number
    20240221867
  • Date Filed
    March 18, 2024
    10 months ago
  • Date Published
    July 04, 2024
    6 months ago
Abstract
A prediction system for identifying key heterogeneous molecules that drive tumor metastasis including an input module, an analysis module, and an output module. The input module is configured to input a first quantitative analysis result of proteins and a second quantitative analysis result of proteins; the first quantitative analysis result of proteins is a collection of quantitative analysis results of various protein expression levels in tumor metastases of a target patient before drug intervention, and the second quantitative analysis result of proteins is a collection of quantitative analysis results of various protein expression levels in residual tumor metastases of the target patient after drug intervention. The analysis module includes a primary analysis submodule, a secondary analysis submodule, and a calculation and sorting submodule. The primary analysis submodule is used for preliminary screening analysis. The output module is used to output a sorted list of heterogeneous molecules.
Description
BACKGROUND

The disclosure relates to the field of pharmaceutical biotechnology, and more particularly to a prediction system for identifying key heterogeneous molecules driving tumor metastasis.


Malignant tumors are a major public health problem having a detrimental effect on human health. Tumor drug resistance and metastasis are one of the underlying causes of low survival rates and death in tumor patients. The biggest challenge facing tumor therapy is how to overcome tumor drug resistance and metastasis resulting from tumor heterogeneity. Although new approaches including biotherapies and immunotherapies have brought hope to tumor treatment, the treatment targets a single site, and tumor heterogeneity leads to difficulty in completely conquering the tumor. Even if there is some efficacy in the early stages of treatment, tumor cell drug resistance or metastasis often occurs after a period of time, rendering the treatment ineffective. Since heterogeneous tumor cell subpopulations are independent of each other, even if an effective intervention can be taken against an oncogene or cancer suppressor gene, other heterogeneous molecules will not be affected at the same time; these heterogeneous tumor cell subpopulations will play a dominant role in adapting to the host environment when the conditions are suitable, and further drive the occurrence of tumor metastasis.


That tumor drug resistance and metastasis driven by heterogeneous molecules lead to tumor progression is a consensus in the field of cancer research. However, most studies are still searching for differentially expressed molecules and proving their exact upstream and downstream regulatory networks and mechanisms. There is currently no method to accurately identify heterogeneous molecules of tumors in individual patients. It is urgent to develop technical means that can accurately identify heterogeneous molecules of tumors in individual patients.


There are certain prediction systems in the related art, for example, a system based on lncRNA molecular model to predict the efficacy and prognosis of neoadjuvant chemoradiotherapy for esophageal squamous cell carcinoma patients. The system is directed to detect the expression levels of three lncRNAs: SCAT1, PRKAG2-AS1, and FLG-AS1, to predict the efficacy of neoadjuvant chemoradiotherapy for esophageal squamous cell carcinoma patients, and also to predict the prognosis of esophageal squamous cell carcinoma patients, such as overall prognostic survival and relapse-free survival. However, there has not yet been a technical solution aimed at discovering and guiding tumor treatment interventions with technical effects.


SUMMARY

One objective of the disclosure is to overcome the problems in the prior art, and provide a prediction system for identifying key heterogeneous molecules driving tumor metastasis. Based on the quantitative analysis results of proteins of metastatic tumor tissues before and after tumor treatment, combined with bioinformatics and computational biology methods, a list of tumor heterogeneous molecules was screened, sorted, and determined. The system can effectively identify the heterogeneous molecules driving tumor metastasis in different patients, providing precise guidance for individualized treatment plans for tumor metastasis.


The disclosure provides a prediction system for identifying key heterogeneous molecules driving tumor metastasis, the system comprising: an input module, an analysis module, and an output module.


The input module is configured to input a first quantitative analysis result of proteins and a second quantitative analysis result of proteins;

    • the first quantitative analysis result of proteins is a collection of quantitative analysis results of various protein expression levels in tumor metastases of a target patient before drug intervention, and the second quantitative analysis result of proteins is a collection of quantitative analysis results of various protein expression levels in residual tumor metastases of the target patient after drug intervention;
    • the analysis module comprises a primary analysis submodule, a secondary analysis submodule, and a calculation and sorting submodule;
    • the primary analysis submodule is used for preliminary screening analysis;
    • the preliminary screening analysis comprises comparing the first quantitative analysis result of proteins and the second quantitative analysis result of proteins, and including proteins with expression changes within a predetermined range in a candidate protein set;
    • the secondary analysis submodule is used for secondary screening analysis;
    • the secondary screening analysis comprises constructing, using a protein interaction network analysis tool, experimentally validated protein interaction networks based on the candidate protein set, supplementing the protein interaction networks by utilizing signal pathways involved in the candidate protein set in a signal pathway database; including most important proteins in each protein interaction network as independent heterogeneous molecules in an independent molecular set; constructing, using a protein interaction network prediction tool and based on the candidate protein set, a protein interaction prediction network according to a protein interaction law; and including proteins in the candidate protein set that do not participate in any protein interaction prediction network as independent heterogeneous molecules in the independent molecular set;
    • the calculation and sorting submodule is used for calculating and sorting; the calculating and sorting comprises calculating a hazard ratio (HR) value of each heterogeneous molecule in the independent molecular set, sorting the heterogeneous molecule based on the HR value, to form a sorting list of each heterogeneous molecule; and
    • the output module is used to output a sorted list of heterogeneous molecules.


Practical researches show that accurate identification of new heterogeneous targets driving tumor metastasis and timely intervention can overcome tumor drug resistance and metastasis driven by heterogeneous molecules, effectively control tumor progression, and prolong patient life before tumor cells fully adapt to a current treatment plan. The key to driving tumor metastasis or recurrence lies in the heterogeneous molecules that are not controlled by current drugs but closely related to tumor progression. If precise intervention can be implemented against the heterogeneous molecules in residual tumor foci, perhaps the problem of tumor drug resistance and metastasis recurrence driven by heterogeneous molecules can be solved.


Through the prediction system of the disclosure, relevant proteins in tumor metastases before and after intervention are first screened, and then further screened through bioinformatics and computational biology methods. The “importance” of the heterogeneous molecules driving tumor metastasis is ranked to determine the theoretical intervention order of target molecules. The prediction system can accurately identify heterogeneous molecules driving tumor metastasis, providing precise intervention strategies for effectively suppressing tumor drug resistance and metastasis driven by heterogeneous molecules.


In a class of this embodiment, the first quantitative analysis result of proteins and the second quantitative analysis result of proteins are acquired through high-throughput protein sequencing; and the high-throughput protein sequencing comprises protein mass spectrometry analysis.


Through the technical solution, the specific details of obtaining quantitative analysis results of proteins can be further optimized.


In a class of this embodiment, during the preliminary screening analysis, the proteins with expression changes within a predetermined range are proteins whose expression levels after drug intervention are 0.95-1.05 times higher than those before drug intervention.


In a class of this embodiment, during the preliminary screening analysis, the proteins with expression changes within the predetermined range are first screened and then included in the candidate protein set; the proteins that are screened meet the following conditions: a number of specific peptide segments is ≥2, and is not a prediction protein; a coverage of amino acids of identified peptide segments of a current protein is ≥10%; a quantification frequency of the current protein is ≥2, and a sequencing quality comprehensive score of the current protein is ≥200; evidence in a database shows the current protein is related to tumor progression; the specific peptide segments refer to a specific peptide segment used to identify the current protein; the prediction protein refers to an attribute that defines a reliability of the current protein in Uniport or NCBI databases as a prediction protein; the identified peptide segments refer to a specific peptide segment that can be used to indicate the current protein being detected; the quantification frequency of the current protein refers to a number of repetitions of the current protein being quantitatively detected.


Through the technical solution, the specific details of the preliminary screening analysis of the primary analysis submodule can be further optimized.


In a class of this embodiment, during the secondary screening analysis, the protein interaction network analysis tool comprises a STRING network tool; the signal pathway database comprises KEGG, SMPDB, WikiPathways, Cell Signaling Technology, BioCarta Pathway, Pathway Commons, and PID; the protein interaction network prediction tool comprises a Genemania network tool, Unihi network tool, and Hitpredict network tool.


In a class of this embodiment, a Cytoscape tool is employed to construct and supplement the protein interaction networks, and to construct the protein interaction prediction network.


In a class of this embodiment, during the secondary screening analysis, the most important proteins in each protein interaction network are identified through characteristic analysis; the characteristic analysis comprises analyzing the importance, degree, and radiancy of each protein in the protein interaction network; and proteins with the highest importance, degree, and radiancy are the most important proteins in the protein interaction network.


Through the technical solution, the specific details of the secondary analysis submodule can be further optimized for further screening and analysis.


In a class of this embodiment, during the calculating and sorting, the HR value of each heterogeneous molecule is calculated as follows: obtaining sequencing data of tumors from a public database, obtaining a standardized counting matrix of genes corresponding to the heterogeneous molecules from the sequencing data, and performing batch single factor COX regression on the genes to obtain the HR value.


In a class of this embodiment, the public database comprises TCGA, ICGC, and GEO.


In a class of this embodiment, during the calculating and sorting, a sorting rule is: the heterogeneous molecules with HR values greater than or equal to 1 are directly sorted in reverse order according to their HR values, and the heterogeneous molecules with HR values less than 1 are sorted in reverse order according to their 1/HR values.


Through the technical solution, the calculation and sorting of the calculation and sorting submodule are optimized.


In another aspect, the disclosure also provides a sequential intervention system for executing a sequential intervention order of heterogeneous molecules driving tumor metastasis. The sequential intervention system comprises the abovementioned prediction system for identifying key heterogeneous molecules driving tumor metastasis, and an execution module. The execution module is used to determine the sequential intervention order based on the sorting list of heterogeneous molecules output by the prediction system and a predetermined rule. The predetermined rule refers to the selection and ranking rules determined by users based on actual situations (such as certain heterogeneous molecules without inhibitors or agonists, or unreachable drugs due to high prices), making the sequential intervention sequence more in line with the actual situation.


The following advantages are associated with the prediction system for identifying key heterogeneous molecules driving tumor metastasis of the disclosure:

    • (1) Uniqueness: compared to other identification and analysis methods, the prediction system of the disclosure focuses on targets with mutations, variants, and abnormal expression, the prediction system is able to identify and screen heterogeneous molecules with stable protein expression in tumor metastases before and after drug intervention, and with the potential to drive tumor metastasis.
    • (2) Practicality: sequential intervention validation studies of the heterogeneous molecules predicted and screened by the disclosure are carried out in gastric cancer and melanoma respectively, and the expected results are obtained, indicating that the heterogeneous molecules and the sequential intervention strategy of the disclosure are correct and feasible.
    • (3) Effectiveness: the results of the validation studies on the heterogeneous molecules in gastric cancer and melanoma show that each additional intervention of a heterogeneous molecule significantly improves the outcome or prolongs the survival time of the model mice, which indicates that the heterogeneous molecules identified and predicted by the disclosure are indeed able to independently drive the metastasis of gastric cancer or melanoma; the implementation of sequential interventions in accordance with the order of heterogeneous molecules identified by the method of the disclosure combined with the actual situation can effectively overcome the tumor drug resistance and recurrence of tumor metastasis caused by the heterogeneous molecules.


In summary, the prediction system can reasonably and accurately identify the key heterogeneous molecules driving tumor metastasis and rank the key heterogeneous molecules, and then carry out sequential interventions according to the results in combination with the actual situation, and the therapeutic efficacy is certain. The findings and evidences provide innovative ideas, specific algorithms and practical guidance for the clinical treatment of drug-resistant and metastatic tumors due to tumor heterogeneity.





BRIEF DESCRIPTION OF THE DRAWINGS


FIGS. 1A-1B, 2A-2D, 3A-3D, and 4A-4B show a relationship between the tumor suppressor gene JWA and the prognosis and metastasis characteristics of gastric cancer in operations 1.1 of Example 1 of the disclosure;



FIGS. 5A-5B, 6A-6C, 7A-7B, and 8A-8E show a relationship between the oncogene MDM2 and the prognosis and metastasis characteristics of gastric cancer in operations 1.2 of Example 1 of the disclosure;



FIGS. 9A-9C, 10A-10B, and 11A-11D show JWA and MDM2 are functionally independent heterogeneous molecules that can synergistically inhibit in vivo and in vitro metastasis of gastric cancer in operations 1.3 of Example 1 of the disclosure;



FIGS. 12-18 show results of identifying other heterogeneous molecules driving gastric cancer metastasis after joint intervention on JWA and MDM2 in operations 1.4 of Example 1 of the disclosure;



FIGS. 19A-19B, 20A-20D, and 21A-21C show results of sequential intervention on JWA, MDM2, and FASN in operations 1.5 of Example 1 of the disclosure, which can effectively inhibit in vivo metastasis of gastric cancer;



FIGS. 22 and 23A-23B show how the sequential interventions on JWA, MDM2, and FASN can further identify other heterogeneous molecules in gastric cancer driving metastasis in operations 1.6 of Example 1 of the disclosure;



FIGS. 24A-24C, 25A-25C, and 26A-26D show sequential intervention on JWA, MDM2, FASN, USP9X, and PFN1 can effectively inhibit in vivo metastasis of gastric cancer in operations 1.7 of Example 1 of the disclosure, and each addition intervention on a heterogeneous molecule significantly delays the occurrence of gastric cancer recurrence and metastasis;



FIGS. 27A-27D and 28A-28D show JWA mimetic peptide JP1 inhibits in vivo metastasis of melanoma in operations 2.1 of Example 2 of the disclosure;



FIGS. 29A-29D shows MDM2 inhibitor SP141 inhibits in vivo metastasis of melanoma in operations 2.2 of Example 2 of the disclosure;



FIGS. 30A-30C and 31A-31F show JWA and MDM2 are functionally independent heterogeneous molecules that can synergistically inhibit the in vitro and in vivo metastasis of melanoma in operations 2.3 of Example 2 of the disclosure;



FIGS. 32, 33A-33C, 34A-34D, 35A-35B, 36, 37, 38A-38H, and 39A-39B show how to identify heterogeneous molecules driving melanoma metastasis after joint intervention on JWA and MDM2 in operations 2.4 of Example 2 of the disclosure;



FIGS. 40 and 41A-41F show sequential intervention on JWA, MDM2, SMAD3, IDH1/2, and mTOR can effectively inhibit in vivo metastasis of melanoma in operations 2.5 of Example 2 of the disclosure; and



FIGS. 42A-42C shows the abbreviations and corresponding English names of the main genes involved in the disclosure.





DETAILED DESCRIPTION

Specifically, the disclosure provides a prediction system for identifying key heterogeneous molecules driving tumor metastasis, the system comprising an input module, an analysis module, and an output module.


The input module is configured to input a first quantitative analysis result of proteins and a second quantitative analysis result of proteins; the first quantitative analysis result of proteins is a collection of quantitative analysis results of various protein expression levels in tumor metastases of a target patient before drug intervention, and the second quantitative analysis result of proteins is a collection of quantitative analysis results of various protein expression levels in residual tumor metastases of the target patient after drug intervention.


The first quantitative analysis result of proteins and the second quantitative analysis result of proteins are acquired through high-throughput protein sequencing; and the high-throughput protein sequencing comprises protein mass spectrometry analysis.


The analysis module comprises a primary analysis submodule, a secondary analysis submodule, and a calculation and sorting submodule.


The primary analysis submodule is used for preliminary screening analysis; the preliminary screening analysis comprises comparing the first quantitative analysis result of proteins and the second quantitative analysis result of proteins, and including proteins with expression changes within a predetermined range in a candidate protein set.


During the preliminary screening analysis, the proteins with expression changes within a predetermined range are proteins whose expression levels after drug intervention are 0.95-1.05 times higher than those before drug intervention.


During the preliminary screening analysis, the proteins with expression changes within the predetermined range are first screened and then included in the candidate protein set; the proteins that are screened meet the following conditions: a number of specific peptide segments is ≥2, and is not a prediction protein; a coverage of amino acids of identified peptide segments of a current protein is ≥10%; a quantification frequency of the current protein is ≥2, and a sequencing quality comprehensive score of the current protein is ≥200; evidence in a database shows the current protein is related to tumor progression; the specific peptide segments refer to a specific peptide segment used to identify the current protein; the prediction protein refers to an attribute that defines a reliability of the current protein in Uniport or NCBI databases as a prediction protein; the identified peptide segments refer to a specific peptide segment that can be used to indicate the current protein being detected; the quantification frequency of the current protein refers to a number of repetitions of the current protein being quantitatively detected.


The secondary analysis submodule is used for secondary screening analysis; the secondary screening analysis comprises constructing, using a protein interaction network analysis tool, experimentally validated protein interaction networks based on the candidate protein set, supplementing the protein interaction networks by utilizing signal pathways involved in the candidate protein set in a signal pathway database; including most important proteins in each protein interaction network as independent heterogeneous molecules in an independent molecular set; constructing, using a protein interaction network prediction tool and based on the candidate protein set, a protein interaction prediction network according to a protein interaction law; and including proteins in the candidate protein set that do not participate in any protein interaction prediction network as independent heterogeneous molecules in the independent molecular set.


The protein interaction network analysis tool comprises a STRING network tool; the signal pathway database comprises KEGG, SMPDB, WikiPathways, Cell Signaling Technology, BioCarta Pathway, Pathway Commons, and PID; the protein interaction network prediction tool comprises a Genemania network tool, Unihi network tool, and Hitpredict network tool. A Cytoscape tool is employed to construct and supplement the protein interaction networks, and to construct the protein interaction prediction network.


During the secondary screening analysis, the most important proteins in each protein interaction network are identified through characteristic analysis; the characteristic analysis comprises analyzing the importance, degree, and radiancy of each protein in the protein interaction network; and proteins with the highest importance, degree, and radiancy are the most important proteins in the protein interaction network.


The calculation and sorting submodule is used for calculating and sorting; the calculating and sorting comprises calculating a hazard ratio (HR) value of each heterogeneous molecule in the independent molecular set, sorting the heterogeneous molecule based on the HR value, to form a sorting list of each heterogeneous molecule. HR is hazard ratio in short.


During the calculating and sorting, the HR value of each heterogeneous molecule is calculated as follows: obtaining sequencing data of tumors from a public database, obtaining a standardized counting matrix of genes corresponding to the heterogeneous molecules from the sequencing data, and performing batch single factor COX regression on the genes to obtain the HR value. The public database comprises TCGA, ICGC, and GEO.


During the calculating and sorting, a sorting rule is: the heterogeneous molecules with HR values greater than or equal to 1 are directly sorted in reverse order according to their HR values, and the heterogeneous molecules with HR values less than 1 are sorted in reverse order according to their 1/HR values.


The output module is used to output a sorted list of heterogeneous molecules. The above modules are all runnable on computer-readable media.


In another aspect, the disclosure also provides a sequential intervention system for executing a sequential intervention order of heterogeneous molecules driving tumor metastasis. The sequential intervention system comprises the abovementioned prediction system for identifying key heterogeneous molecules driving tumor metastasis, and an execution module. The execution module is used to determine the sequential intervention order based on the sorting list of heterogeneous molecules output by the prediction system.


To validate the effectiveness of each of the above systems, the following validation methods are used:


Validation method of sequential interventions against predicted heterogeneous molecules, that is, the inhibition of tumor metastasis and prolonged survival time through interventions on heterogeneous molecules, the method is performed as follows (refers to examples):

    • 1) based on the principle of mutual non-regulation of tumor heterogeneous molecules, selecting two molecules such as cancer suppressor gene JWA and oncogene MDM2 for the first round of joint intervention;
    • 2) extracting residual tumor metastasis tissue after joint intervention on JWA and MDM2 for high-throughput protein sequencing;
    • 3) based on the results of high-throughput protein sequencing, using the above prediction system, screening and sorting the heterogeneous molecules driving tumor metastasis according to HR value, as candidate targets for sequential intervention;
    • 4) developing and implementing a sequential intervention program to inhibit tumor metastasis based on the candidate targets;
    • 5) extracting tumor metastasis residual foci resulting from the sequential intervention program in 4), performing high-throughput protein sequencing, utilizing the above prediction system again to screen for heterogeneous molecules driving tumor metastasis, and developing and implementing a next round of sequential intervention program to inhibit tumor metastasis; and
    • 6) repeating operations in 5) for several rounds of sequential interventions until the tumor metastasis is completely and effectively inhibited.


The disclosure is described in further detail below with reference to the accompanying drawings and in connection with examples. However, the invention is not limited to the examples given below. Unless otherwise stated, the experimental methods used are conventional experimental methods.


In the examples of the disclosure, the focus is on the systematic algorithms on how to identify heterogeneous molecules that drive tumor metastasis, and the method is applied to different species and different tumor types. The method is preferentially applicable to humans, but also to other mammals; the tumor types include, but are not limited to, gastric cancer and melanoma.


In the examples of the disclosure, the sequencing method for identifying the heterogeneous molecules driving the tumor metastasis is high-throughput protein sequencing, and the promotion and application of the achievements of the disclosure include but are not limited to being based on the high-throughput protein sequencing.


In the examples of the disclosure, the sequential intervention for gastric cancer/melanoma comprises 5 independent heterogeneous molecular targets, respectively, while the promotion and application of the achievements of the disclosure include, but are not limited to, the 5 molecule targets listed in the disclosure.


Example 1

The example provides a method of identifying heterogeneous molecules driving metastasis of gastric cancer and implementing sequential intervention on the heterogeneous molecules.


The main purpose of the example is to apply the technical solution of the disclosure to identify heterogeneous molecules driving metastasis of gastric cancer after the joint intervention of JWA and MDM2, and further to implement a sequential intervention strategy to verify the effectiveness of the technical solution of the disclosure.


1.1 Relationship Between Cancer Suppressor Gene JWA and the Prognostic and Metastatic Characteristics of Gastric Cancer

The relationship between JWA and prognosis of the gastric cancer population was analyzed using TCGA database. FIGS. 1A-1B show a comparison of JWA mRNA levels in paired and unpaired cancerous and paracancerous tissues, which indicates that the JWA mRNA levels in cancerous tissues are lower than that in the paracancerous tissues (FIG. 1A); patients with low JWA expression have a poor prognosis FIG. 1B).


Gastric cancer cells BGC823 and SGC7901 were transfected with si-JWA to interfere with JWA mRNA and its protein expression level. The results of a transwell assay showed that low expression of JWA in the gastric cancer cells led to the increase of the number of tumor cells passing through the cells in the same period of time (FIG. 2A and FIG. 2B); the results of a scratch assay showed that low expression of JWA in the gastric cancer cells accelerated the speed of healing (FIG. 2C and FIG. 2D). Thus, as shown in FIGS. 2A-2D, reducing the expression level of JWA could enhance the migration ability of gastric cancer cells.


Similarly, Flag-JWA was transfected into gastric cancer cells BGC823 and MFC to increase JWA protein expression levels. The results of a transwell assay showed that JWA were highly expressed in the gastric cancer cells, and the number of tumor cells passing through the small chamber decreased at the same time (FIG. 3A and FIG. 3B); the results of a scratch assay showed that the healing speed of gastric cancer cells significantly slowed down after high expression of JWA (FIG. 3C and FIG. 3D). As shown in FIGS. 3A-3D, increasing the expression level of JWA can weaken the migration ability of gastric cancer cells.


BGC823 cells were selected and injected into the tail vein of nude mice to construct a passive metastasis model. On the 2nd day after cell inoculation, mice were grouped and injected intraperitoneally with peptides, JP3 or Ctrl-P-H. As shown in FIGS. 4A-4B, compared with the control peptide, Ctrl-P-H, the lung metastasis foci of mice in the JP3 group were significantly smaller. Note: JP3 and Ctrl-P-H peptides are described in Chen J et al. J Exp Clin Cancer Res. 2020; 39: 118.


The above results indicate that the use of a simulated peptide JP3 based on the tumor suppressor gene JWA can effectively inhibit the in vivo and in vitro metastasis of gastric cancer.


1.2 Relationship Between Oncogene MDM2 and the Prognosis and Metastasis Characteristics of Gastric Cancer

The relationship between MDM2 and prognosis of the gastric cancer population was analyzed using TCGA database. FIGS. 5A-5B show a comparison of MDM2 mRNA levels in paired and unpaired cancerous and paracancerous tissues, which indicates that the MDM2 mRNA levels in cancerous tissues are higher than that in the paracancerous tissues (FIG. 5A); patients with low MDM2 expression have a better prognosis than those with high MDM2 expression (FIG. 5B).


As shown in FIGS. 6A-6C, the CCK-8 method was used to detect the half inhibitory concentration (IC50) of MDM2 inhibitor SP141 (existing product) on gastric cancer cells MGC803, AGS, MFC, SGC7901, and BGC823, with IC50 values of 0.616 μM, 0.335 μM, 0.345 μM, 0.687 μM, and 0.599 μM, respectively (FIG. 6A). The results of a scratch assay showed that when gastric cancer cells BGC823, SGC7901, and MFC were treated with different doses of SP141 for 24 hours, the number of perforated cells gradually decreased with increasing dose of SP141 in a dose-response relationship (FIG. 6B and FIG. 6C).


Similar results were obtained from a cell scratch assay. After the gastric cancer cells BGC823, SGC7901, and MFC were treated with SP141, the migration rate of the cells significantly slowed down, as shown in FIGS. 7A-7B.


The mouse-derived gastric cancer cell line MFC were selected to construct an abdominal implant metastasis model. On the second day of cell inoculation, different doses of SP141 were administered intraperitoneally for intervention. The modeling was completed and the number of abdominal metastatic nodules was recorded. As shown in FIGS. 8A-8E, SP141 could inhibit the number of gastric cancer abdominal implantation metastases, presenting a dose-response relationship. Compared with the control group, the data of the 40 mg/kg group was statistically significant. Whole protein immunoblotting analysis was performed on the residual tumor tissues from the control group and the 40 mg/kg group, and the results showed a significant decrease in MDM2 protein levels in the SP141 intervention group.


The above results indicate that reducing the expression of the oncogene MDM2 can effectively inhibit the in vivo and in vitro metastasis of gastric cancer.


1.3 JWA and MDM2 are Functionally Independent Heterogeneous Molecules that can Synergistically Inhibit Gastric Cancer Metastasis


The correlation between JWA and MDM2 mRNA in 413 gastric cancer tissue samples was analyzed using the TCGA database, and the results showed no correlation between the two genes (FIG. 9A). With a cell model, the mutual regulation between JWA and MDM2 proteins was examined. Different doses of Flag-JWA plasmids were transfected into BGC823 cells, and the gradient increase of JWA protein levels did not result in significant changes in MDM2 protein (FIG. 9B). Thereafter, 0, 0.25, 0.5 μM SP141 were used to inhibit the expression level of MDM2 in gastric cancer cells BGC823 for 24 hours. The results showed that the protein expression level of MDM2 decreased with increasing SP141 dose, while there was no significant change in the protein expression level of JWA (FIG. 9C). Thus, it can be seen that as shown in FIGS. 9A-9C, in gastric cancer, JWA and MDM2 have no mutually regulatory relationship at the protein and mRNA levels, and their respective biological functions on gastric cancer cells are independent.


The results of the transwell assay showed that transfection of Flag-JWA plasmid into AGS cells followed by SP141 treatment resulted in a decrease in the number of cell perforations in both the single and combined treatment groups compared to the control group, while the combined treatment group had the least number of cells, and the difference was statistically significant. Thus, as shown in FIGS. 10A-10B, the combined intervention of JWA and MDM2 has a synergistic inhibitory effect on gastric cancer cell migration.


In an in vivo model, fluorescent labeled gastric cancer cells BGC823-luc were injected into the tail vein of nude mice to construct a passive metastasis model of gastric cancer. Drug intervention began on the second day after cell inoculation (FIG. 11A). The observation results showed that in the early stage of the model, the weight of the mice steadily increased, and after 33 days, the weight of the control group mice began to decrease. The weight of the mice of the drug intervention group was relatively stable and in good condition (FIG. 11B). The results of in vivo fluorescence imaging showed that the fluorescence intensity of lung metastases in the saline group and solvent control group was relatively similar. The fluorescence intensity of the individual SP141 and JP3 intervention groups was weaker than that of the control group, and the difference was statistically significant. The joint intervention group had the weakest lung fluorescence intensity, and there was a statistically significant difference in fluorescence intensity compared to the individual intervention group (FIG. 11C and FIG. 11D). Thus, as shown in FIGS. 11A-11D, the joint intervention of JWA and MDM2 has a synergistic inhibitory effect on gastric cancer metastasis in vivo.


The above results indicate that the joint intervention on the heterogeneous molecules JWA and MDM2 driving gastric cancer metastasis have a synergistic inhibitory effect on gastric cancer metastasis.


1.4 Identifying Other Heterogeneous Molecules Driving Gastric Cancer Metastasis after Joint Intervention on JWA and MDM2


This section proposes a theory for identifying heterogeneous molecules driving tumor metastasis and implements sequential interventions for validation. Tumor heterogeneous molecules are partially independent in function and can drive various malignant phenotypes including metastasis. Even if the best intervention measures are taken for a gene (whether it is an oncogene or a tumor suppressor gene), the expression and function of other heterogeneous molecules are not affected. Therefore, under appropriate conditions, these heterogeneous molecules will play a dominant role in tumor cell fate through adaptation, driving tumor recurrence and metastasis. In the disclosure, the traditional concept of searching for and discovering differentially expressed molecules for targeted interventions has been replaced with a concept to search for heterogeneous molecules that remain unaffected in expression after targeted drug intervention but may drive tumor metastasis. Assuming that sequential interventions targeting these molecules can effectively prolong the survival of cancer patients, as shown in FIG. 12.


As described in 1.3, the joint intervention on JWA and MDM2 significantly enhanced the inhibitory effect on gastric cancer metastasis. However, even with continued joint intervention, after a period of time, lung metastases and drug resistance in mice still develop due to adaptation to the microenvironment, indicating the presence of heterogeneous molecules that are not regulated by JWA and MDM2 after JP3 and SP141 intervention. To predict and identify unknown heterogeneous molecules driving the subsequent progression of gastric cancer, protein mass spectrometry analysis was performed on the metastatic lesions that still existed after the joint intervention of JP3 and SP141. Protein molecules with stable expression levels compared to the control group were screened from a total of more than 7000 protein quantitative analyses (specific indicators were: the expression level of target molecules in residual metastatic lesions after JP3 combined with SP141 intervention/the range of changes in residual metastatic lesions in the control group was 0.95-1.05).


Furthermore, based on sequencing related quality control data and the correlation between molecules and tumors, candidate heterogeneous molecules meeting the following conditions were preliminarily screened: (1) protein molecules with a specific peptide number ≥2; (2) remove predicted protein molecules; (3) remove molecules with identified peptide segments covering less than 10% of the entire protein; (4) remove molecules with protein (129C/129N peptide) quantification less than twice; (5) remove molecules with a comprehensive score of less than 200 in protein sequencing; (6) retrieve keywords, including proteins and their corresponding gene names, and cancer in relevant databases (such as PubMed, Web of Science, CNKI, Wanfang Database, and VIP) to determine their correlation with tumor progression, and remove irrelevant molecules. Through the screening, 2066 protein molecules with a specific peptide segment (referring to the specific peptide segment used to identify the current protein) of ≥2 were identified; 1581 remained after predicted proteins (based on the reliability, the attribute of the current protein is defined as predicted proteins in Uniport or NCBI databases) were excluded; 1187 peptide segment molecules with an amino acid coverage rate of over 10% in the entire protein have been identified (referring to specific peptide segments that can be used to indicate the current protein being detected); 1169 molecules with 129C/129N (mass spectrometry report ion ratio) peptide quantification times (referring to the number of repetitions of the current protein being quantitatively detected) exceeding two times; 117 molecules with a sequencing quality comprehensive score exceeding 200; database analysis confirmed a total of 38 protein molecules associated with tumor progression, as shown in FIG. 13. FIGS. 14 and 42A-42C provide relevant information on candidate heterogeneous molecules.


The preliminarily screened heterogeneous molecules may comprise completely independent individual heterogeneous molecules and some heterogeneous molecules with mutual regulatory effects. To screen for independent heterogeneous molecules, a currently known protein interaction network was constructed using STRING website, with the basis for determining protein interactions being experimental validation and database knowledge. Afterwards, using the 38 heterogeneous molecules as inputs, based on KEGG, SMPDB, WikiPathways, Cell Signaling Technology, BioCarta Pathway, Pathway Commons, and PID databases, the signal pathway information involved in the heterogeneous molecules was analyzed to see if there were protein molecules present in the same signal pathway. Then, new protein points and protein interaction relationships were added through Cytoscape software for further modification and correction (FIG. 15). Finally, based on the molecular point relationships in the interaction network, the network's attributes (connectivity, clustering coefficient, compactness, and topology coefficient) was calculated using a script, and the importance, degree, radiance, and other characteristics of the network were analyzed using the network analyzer module in Cytoscape software (FIGS. 16-18). Thereafter, based on the independent interaction network, the heterogeneous molecules with the highest importance, degree, and radiance were selected to determine which molecules can form an independent network (containing more than 2 proteins) and the most important heterogeneous molecules in the network, namely HSP90AB1, ENO1/PKM, HNRNPK/PCBP2, and RAB1A/SPTBN1. For the heterogeneous molecules that made up the regulatory network, the cumulative HR values of each molecule in the network were calculated and assigned to the representative heterogeneous molecules in the network, as shown in FIGS. 15-18. On the other hand, using the Genemania website, Unihi website, and Hitpredict website, based on algorithmic ideas such as text mining, coexpression, structural domain and gene fusion, it was found that among the 38 molecules, there were independent heterogeneous molecules that did not form an interaction network with other molecules, namely FASN, SLC3A2, ANXA4, VIM, USP9X, and AGRN. The HR cumulative values of these molecules were also calculated. Afterwards, the intervention order was sorted by reference to the HR values of the overall survival corresponding to independent heterogeneous molecules (sorted in reverse order according to their HR values when HR values were greater than 1, and sorted in reverse order according to their 1/HR values when HR values were less than 1). The HR sequence of independent heterogeneous molecules driving gastric cancer metastasis after joint intervention on JWA and MDM2 was determined as follows: FASN (2.21 (1.82-2.86), p=1.6e-16); HNRNPK (0.46 (0.37-0.56), p=3.9e-14); AGRN (2.07 (1.75-2.46), p<1e-16); HSP90AB1 (1.73 (1.46-2.05), p=1.7e-10). RAB1A (0.64 (0.53-0.76), p=5.2e-07); ANXA4 (0.65 (0.55-0.78), p=1.4e-06). SLC3A2 (1.47 (1.27-1.74), p=1.1e-05); ENO1 (1.32 (1.11-1.56), p=0.0014); VIM (1.3 (1.09-1.53), p=0.0027) and USP9X (1.25 (0.99-1.59), p=0.061). In this way, whether heterogeneous molecules with regulatory relationships or completely independent individual heterogeneous molecules, corresponding HR values have been obtained through the analysis, calculation, and assignment. The intervention order of all heterogeneous molecules can be determined based on the size of the HR value. FIG. 15 shows the molecular screening and HR value ranking of partial gastric cancer heterogeneity.


1.5 Sequential Intervention on JWA, MDM2, and FASN can Effectively Inhibit In Vivo Metastasis of Gastric Cancer.

Based on the results of 1.4, FASN was determined as the next intervention target.


The relationship between FASN and the prognosis of gastric cancer population was analyzed using TCGA database, as shown in FIGS. 19A-19B. The mRNA levels of FASN in paired and unpaired cancerous tissues were higher than that in the paracancerous tissues, and the difference was statistically significant (FIG. 19A). Patients with low FASN expression had better prognosis than those with higher FASN expression (FIG. 19B).


The detection of FASN protein levels in residual metastatic foci in lung tissue of mice in the solvent control group and JWA+MDM2 joint intervention group in animal models showed stable FASN expression in both groups, consistent with the protein mass spectrometry results (FIG. 20A). Further validation of the relationship between FASN and gastric cancer metastasis using transwell and scratch experiments: BGC823 cells and BGC823-LPC1 (primary cells of residual lung metastases in the animal model SP141 and JP3 combined intervention group) were treated with different concentrations of FASN inhibitor EGCG for 24 hours, followed by transwell experiments. The results showed that the number of perforation cells in the EGCG treated group was significantly reduced, and there was a dose-response relationship. The inhibition rate of perforation cells was similar in both types of cells (FIG. 20B). Similar results were obtained in the scratch experiment (FIG. 20C). The results of the immunoblotting experiment (FIG. 20D) showed that after EGCG treatment of cells, the expression level of FASN protein was reduced, and the transfer related protein FAK was also correspondingly reduced, both of which showed a dose-response relationship. As shown in FIGS. 20A-20D, intervention on FASN can inhibit the migration of gastric cancer cells.


A passive metastasis model was constructed by extracting primary cells BGC823-LPC1 from residual lung metastases after joint intervention on JWA and MDM2. On the second day after injecting cells into the tail vein of the model mice, they were grouped and administered. During the medication period, they were weighed on time. At the end of the modeling, the lung wet weight of each group of mice was measured, and the situation of lung metastases was observed. The results showed (as shown in FIGS. 21A-21C) that the reduction of the lung wet weight of the mice in the intervention group continued to be treated with SP141 and JP3 was not significant compared to the solvent control group, and the reduction in the number of metastases was not significant, either. However, the FASN inhibitor EGCG group and another FASN inhibitor Orlistat group of mice showed a significant decrease in lung wet weight and a significant decrease in the number of metastatic foci, with statistical significance compared to the control group.


The results indicate that BGC823-LPC1 primary gastric cancer cells extracted from lung metastases in the SP141 and JP3 combined treatment group are re-inoculated into nude mice for modeling, followed by intervention with SP141 and JP3, however, there is little effect on inhibiting lung metastasis of gastric cancer. This indicates that the mice have developed drug resistance in the second round of model mice after receiving the first round of SP141+JP3 intervention. However, after intervention with inhibitors EGCG or Orlistat (both existing technology products) targeting the new heterogeneous molecule FASN, the lung metastasis of gastric cancer cells is significantly inhibited. This indicates that the intervention based on the algorithm proposed in the disclosure for identifying heterogeneous molecules driving tumor metastasis (i.e. the result of 1.4) is effective.


1.6 Further Screening of Heterogeneous Molecules Driving Gastric Cancer Metastasis after Sequential Intervention on JWA, MDM2, and FASN


On the basis of the above model, primary cells were extracted from the residual gastric cancer cells metastatic foci of the lungs of EGCG-treated mice, named BGC823-LPC2. Firstly, protein mass spectrometry analysis was continued on the residual metastatic lesions in the lungs of the EGCG group and the combined treatment group with SP141 and JP3, with a focus on observing the protein expression changes of each independent molecule identified in 1.4 above. The results showed that FASN was inhibited by 22% in the EGCG group, while the rest of the proteins were inhibited by less than 20%. As shown in FIG. 22, the heterogeneous molecules driving tumor metastasis are functionally independent from each other, and the expression level of other heterogeneous molecules remains stable after intervening one heterogeneous target molecule.


According to the strategy of identifying the heterogeneous molecular driving gastric cancer metastasis as described in section 1.4, and based on the results of the protein mass spectrometry analysis, taking into account factors such as the accessibility of inhibitors used for intervention (although many heterogeneous molecules have been screened out and theoretically intervention can be targeted at each molecule, in reality, some molecules do not have inhibitors or agonists, or are inaccessible due to high drug prices, so it is necessary to select candidate heterogeneous molecules based on the actual situation), USP9X and PFN1 oncogenes were selected as the next intervention targets. Using the TCGA database to analyze the relationship between USP9X and PFN1 and the prognosis of gastric cancer population, as shown in FIGS. 23A-23B, patients with low expression of USP9X or PFN1 have better prognosis than those with high expression.


1.7 Sequential Intervention on JWA, MDM2, FASN, USP9X, PFN1 Effectively Inhibits In Vivo Metastasis of Gastric Cancer.

To verify the relationship between USP9X and gastric cancer metastasis, cultured BGC823 LPC2 cells were first treated with USP9X inhibitor WP-1130 (existing product) for 8 hours before a transwell experiment. The results show (as shown in FIGS. 24A-24C) that WP-1130 can significantly inhibit the number of cell perforations, and there is a dose-response relationship (FIG. 24A). The immunoblotting results showed that the expression of USP9X protein in cells treated with WP-1130 was reduced, and the transfer related protein p-AKT was also correspondingly reduced, with a dose-response relationship (FIG. 24B). The results of the scratch experiment are consistent with those of the small chamber perforation experiment (FIG. 24C). Thus, it can be seen that inhibiting the expression of USP9X can effectively reduce the migration of gastric cancer cells.


For the heterogeneous molecule PFN1, a low expression lentiviral plasmid sh-PFN1 was constructed due to the lack of commercialized inhibitor compounds. Transwell experiments were performed after transfection of sh-con and sh-PFN1 in BGC823-LPC2 for 48 hours. The results showed (FIGS. 25A-25C) that the number of perforation in gastric cancer cells with low expression of PFN1 was significantly reduced. Thus, it can be seen that interfering with PFN1 expression can effectively inhibit the migration of gastric cancer cells.


On the basis of the positive results obtained in the cell model, the animal model was then validated. A passive metastasis model was constructed using BGC823-LPC2 cells injected through the tail vein of nude mice. The mice were randomized on the second day and given control solvent, EGCG, WP-1130 alone, and WP-1130 plus sh-PFN1 lentivirus in a sequential manner. The lung metastasis of the mice was observed at the end of the modeling (FIG. 26A). Body weight observation showed that the mice gained weight steadily in the early stage and entered a plateau period on the 24th day, and the weight of the sequential group (lentiviral plasmid intervention group) increased steadily on the 40th day, while the weight of the rest of the groups began to decrease (FIG. 26B). Compared with the solvent control group, the lung metastatic foci of the mice in the continuous EGCG intervention group did not decrease significantly, and the reduction of lung wet weight was not obvious; while the lung metastatic foci of the WP-1130 group and the sequential group decreased significantly, and the lung wet weight decreased significantly (FIGS. 26C and 26D). The sequential intervention was more effective in inhibiting tumor metastasis. Thus, as shown in FIGS. 26A-26D, the sequential intervention on JWA, MDM2, FASN, USP9X and PFN1 effectively inhibited the metastasis of gastric cancer in vivo and in vitro.


Conclusion of Example 1

The example focuses on the animal model validation for the inhibition of gastric cancer drug resistance and metastasis, and the core of the research protocol is to carry out a combined intervention with a targeted drug against two independent tumor heterogeneity molecules, and then sequence (protein mass spectrographic analysis) the residual metastatic tumor foci tissues, to screen for heterogeneous molecules driving the tumor metastasis and rank them according to the HR value, to find out the next heterogeneous molecule to be intervened on; after intervening on a new heterogeneous molecule, the residual metastatic tumor foci tissues are further sequenced and screened, and so on, until all the desired heterogeneous molecules are found out and the interventions are implemented.


To verify the sequential intervention scheme for inhibiting gastric cancer metastasis, the first round candidate molecules were JWA and MDM2. After joint intervention, protein sequencing analysis was performed on residual metastatic lesions in the lungs of experimental animals, and the third heterogeneous molecule was the fatty acid synthase FASN. A passive metastasis mouse model of gastric cancer was constructed using primary cells extracted from residual tumor metastases after joint intervention on JWA and MDM2, and sequential intervention therapy was performed with FASN inhibitors. After intervention, resequencing was performed on the residual metastatic tissue of gastric cancer in the lungs of experimental animals to find subsequent heterogeneous molecules. According to the scheme, a series of new heterogeneous molecules would be discovered. Corresponding inhibitors or agonists were selected for verification of sequential intervention based on factors such as the accessibility of heterogeneous molecules and market prices. The example focuses on gastric cancer metastasis, and subsequently, two heterogeneous molecules, USP9X and PFN1, were selected for intervention verification. Therefore, the example sequentially intervened with a total of 5 heterogeneous molecules. In a confirmatory study of sequential intervention to verify the regulation of gastric cancer metastasis by heterogeneous molecules, the effectiveness of each round of sequential intervention was evaluated based on the number and size of lung metastatic nodules.


It should be noted that in the implementation of the confirmatory model, except for JWA and MDM2, which were known to be important independent molecules for direct joint intervention, FASN, USP9X, and PFN1 selected based on factors such as the availability of intervention drugs targeting the target molecule were all independent heterogeneous molecules, and different software predicted their interaction intensity scores to be 0 (FIG. 16).


Example 2

The example relates to a systematic algorithm and verification of sequential intervention for identifying heterogeneous molecules driving melanoma metastasis.


One objective of the example is to apply the method of predicting and identifying heterogeneous molecules driving tumor metastasis of the disclosure to verify whether the heterogeneous molecules identified through the algorithm of the disclosure have independent driving functions for melanoma lung metastasis in a mouse lung metastasis model constructed using melanoma cell B16F10. Meanwhile, combining the conclusion 1.6 of the identification of heterogeneous molecules driving gastric cancer metastasis and the implementation of sequential intervention in Example 1: the heterogeneous molecules driving tumor metastasis are functionally independent of each other; intervention of one heterogeneous target molecule has no effect on the stable expression levels of other heterogeneous molecules, the example further optimize the identifying and screening scheme of heterogeneous molecules driving tumor metastasis, and validate the effectiveness of the optimization scheme in the model.


2.1 Mimetic peptide JP1 of JWA inhibits in vivo metastasis of melanoma


A passive metastasis model was constructed by injecting B16F10 melanoma cells into the tail vein of mice. On the second day of cell inoculation, the mice were grouped, and Ctrl-R or JP1 peptides were injected intraperitoneally for intervention. As shown in FIGS. 27A-27D, compared to the Ctrl-R group, the JP1 intervention group significantly inhibited melanoma lung metastasis, and the difference was statistically significant. Note: Ctrl-R peptide and JP1 peptide can be found in Cui J, et al. Theranotics 2020; 10 (18): Recorded in 8036-8050.


To verify the effect of JP1 intervention on mouse survival, a B16F10 cell tail vein injection lung metastasis survival model was constructed. On the second day after tail vein injection of B16F10 cells, mice were randomly divided according to body weight and given daily intraperitoneal injection of Ctrl-R or JP1 intervention. As shown in FIGS. 28A-28D, compared to the Ctrl-R group, JP1 intervention significantly prolonged the survival period of mice with lung metastasis. The average survival days of mice in the Ctrl-R group and JP1 intervention group were 25 days and 32 days, respectively, with statistically significant differences. Conclusion: Intervention with JWA mimetic peptide JP1 can effectively inhibit in vivo metastasis of melanoma.


2.2 MDM2 Inhibitor SP141 Inhibits In Vivo Metastasis of Melanoma

A passive metastatic mouse model was constructed using B16F10 cells. On the second day of cell inoculation, different doses of SP141 were used for intervention. At the end of the modeling, the number of metastatic nodules in the lungs was recorded. As shown in FIGS. 29A-29D, SP141 can inhibit the lung metastasis of melanoma in mice, and there is a dose-response relationship. Compared with the control group, the difference data of the 80 mg/kg group was statistically significant. Conclusion: inhibition of MDM2 by SP141 can effectively reduce in vivo metastasis of melanoma in mice.


2.3 Based on the Characteristic that JWA and MDM2 were Two Functionally Independent Heterogeneous Molecules, a Synergistic Inhibition Model for In Vitro and In Vivo Metastasis of Melanoma was Constructed.


The relationship between the expression levels of JWA and MDM2 genes and the prognosis of melanoma population was analyzed using the TCGA database. The results showed that patients with high expression of JWA had significantly longer survival time than those with low expression. The survival time of MDM2 low expression patients was significantly longer than that of high expression patients. To analyze the relationship between the expression levels of the two molecular proteins and the malignant phenotype of B16F10 cells, different amounts of Flag JWA plasmids were transfected into B16F10 cells, and the JWA protein level was gradually increased, but there was no significant change in the MDM2 protein expression level. Meanwhile, melanoma cells B16F10 were treated respectively with 0, 0.25, 0.5 μM SP141 for 24 hours, and the protein expression levels of MDM2 and JWA were measured. It was found that the protein expression level of MDM2 decreased with increasing SP141 dose, but there was no significant change in JWA protein expression level. From the results, it can be seen that as shown in FIGS. 30A-30C, as far as melanoma is concerned, JWA and MDM2 do not have a mutually regulatory relationship at the protein level and are functionally independent heterogeneous molecules.


The mouse model validation was further carried out on the basis of cell model validation. A lung metastasis model with B16F10 cell tail vein injection was constructed, and on the second day, the mice were randomly grouped according to their body weight for drug intervention. As shown in FIGS. 31A-31F, the joint intervention of JWA and MDM2 has a synergistic inhibitory effect on the in vivo metastasis of melanoma. Conclusion: the joint intervention of heterogeneous molecules JWA and MDM2 driving melanoma metastasis has a synergistic effect in inhibiting melanoma lung metastasis.


2.4 after Joint Intervention on JWA and MDM2, Although the Melanoma Lung Metastasis in Mice was Significantly Inhibited, the Mice Ultimately Died from Lung Metastasis. In this Regard, Other Heterogeneous Molecules Driving Melanoma Metastasis in the Lungs were Further Screened and Identified after Joint Intervention on the Residual Metastatic Lesions of Melanoma.


The joint intervention of JWA and MDM2 enhanced the inhibitory effect on melanoma metastasis, but due to the presence of heterogeneous molecules in melanoma cells that were not regulated by JWA and MDM2, there were still molecules driving the progression of melanoma after the joint intervention. Protein mass spectrometry analysis was performed on residual metastatic lesions in the lungs of mice after joint intervention with JP1 and SP141, and a total of over 7000 proteins were quantitatively analyzed, and protein molecules with stable expression levels (within the range of 0.95-1.05) were selected compared to the control group. The heterogeneous molecules were further screened according to the following principles: (1) specific peptide segments ≥2, totaling 2651 molecules; (2) remove predicted protein molecules from mass spectrometry analysis results; (3) remove molecules with less than 10% coverage of identified peptide segments in the protein; (4) retain molecules with protein quantification exceeding twice; (5) retain numerators with a comprehensive score exceeding 200; (6) confirmed 54 protein molecules related to tumor progression, as shown in FIG. 32. FIGS. 33A-33C and FIGS. 42A-42C list the relevant information of each candidate protein molecule.


A series of bioinformatics related analyses were conducted on the 54 molecules, similar to the process described in 1.4 of Example 1. Initial interaction networks were constructed based on experimental data and protein interaction patterns, respectively, to obtain the modifiable heterogeneous molecules in melanoma: SMAD3, IDH1/IDH2, mTOR, KHSRP/SRSF3, PGD, POLB, TOP1, L2HGDH, MOB2, PDIA4, CD2AP, FADS1, PGP, GSTP1, MAPAK3, MAPAK4, MEAF6, RNF181, D54, ITGA4, TYR, UFC1, NDRG1, OAT (FIGS. 34A-38H).


Calculate the HR value of each molecule and determine the order in which heterogeneous molecules can intervene in sequential treatment of melanoma as follows: RNF181 (0.47 (0.29-0.77), p=0.003), mTOR (1.75 (1.12-2.74), p=0.015), SRSF3 (1.68 (0.86-3.27), p=0.127) TOP1 (0.62 (0.40-0.97), p=0.035), L2HGDH (0.63 (0.43-0.92), p=0.017), MOB2 (0.68 (0.45-1.05), p=0.079), UFC1 (1.45 (1.01-2.08), p=0.043), IDH1/2 (1.42 (1.07-1.89), p=0.014), PGP (1.41 (0.87-2.29), p=0.16), CD2AP (0.73 (0.54-0.99), p=0.04), PGD (0.75 (0.55-1.01), p=0.06), ITGA4 (0.78, p=0.066), FADS1 (0.81 (0.66-1.00), p=0.047), POLB (1.23 (0.93-1.62), p=0.138), SMAD3 (1.22 (0.94-1.60), p=0.139), MEAF6 (1.13 (0.66-1.93), p=0.652), OAT (1.11 (0.83-1.47), p=0.486) GSTP1 (1.10 (0.95-1.27), p=0.188), NDRG1 (0.96 (0.85-1.09), p=0.503), TYR (0.98 (0.90-1.07), p=0.706), PDIA4 (1.00 (0.70-1.43), p=0.991), MAP2K3 (1.00, p=0.74), MAP2K4 (1.00, p=0.87) and D54 (1.00, p=0.95)(FIGS. 34A-38H).


However, it is unknown whether the screened heterogeneous molecules have corresponding antagonists or agonists that can be used to intervene in the model. The validation comprehensively considered the possibility of using these heterogeneous molecules as target interventions. In addition to JWA and MDM2, SMAD3, IDH1/2, and mTOR were also selected as heterogeneous molecules for intervention.


It should be noted that for the weight of heterogeneous molecules, the disclosure proposes to sort them based on the HR values, and the intervention measures based on decreasing HR values may theoretically be the best. However, the actual contribution of molecules with different HR values to tumor progression may be influenced by the dynamic changes in big data. As data accumulates, the HR values of the same molecule may change. Therefore, the HR value is a relative reference value. Since the intervenable molecules obtained by the prediction system are independent of each other, it can be considered that intervening in any one of the molecules (with interaction or independent molecules) is effective. In practical work, factors such as the availability and price of heterogeneous molecular drugs can be comprehensively considered and decided based on the order of HR values. For example, in the confirmatory model, the target molecules for sequential intervention in melanoma metastasis are JWA, MDM2, SMAD3, IDH1/2, mTOR in sequence. Although the decreasing order of HR values is not fully followed, the intervention results are effective.


Subsequently, pathway analysis was performed on the 5 molecules, as shown in FIGS. 39A-39B. There is indeed a significant difference in the signal networks where the 5 molecules function in tumors, and they are functionally independent heterogeneous molecules.


2.5 Sequential Intervention on JWA, MDM2, SMAD3, IDH1/2, mTOR Effectively Inhibits In Vivo Metastasis of Melanoma


Based on the screened 5 target molecules, a 4-stage sequential intervention plan was developed for melanoma metastasis model mice, as shown in FIG. 40. In the first stage, JWA and MDM2 (JP1 and SP141) were targeted for intervention from day 1 to day 14; in the second phase, targeted intervention was performed on SMAD3 (SIS3) from day 15 to day 21; in the third phase, targeted intervention was performed on IDH1/2 (AG881) from day 22 to day 28; in the fourth stage, from day 29 to the end of the model, targeted intervention was performed on mTOR (Rapamycin). Each stage of intervention was evaluated by using the previous stage model mice as a control group with continued drug use. Note: JP1 and SP141 are the same as before, while SIS3, AG881, and Rapamycin are all existing products.


B16F10 cells were injected into the tail vein of lung metastasis mice to construct a survival model, and the mice were randomly grouped according to their body weight after tail vein injection. The drug was administered from the 2nd day. The results showed (as shown in FIGS. 41A-41F): the average survival days of mice in the control group, JP1+SP141 sequential intervention group, JP1+SP141+SIS3 sequential intervention group, JP1+SP141+SIS3+AG881 sequential intervention group, and P1+SP141+SIS3+AG881+Rapamycin sequential intervention group were 22.5, 25.55, 27.05, 29.35, and 30.7 days, respectively (FIG. 41D). The effects of the 3rd intervention of JP1+SP141+SIS3 versus the 4th intervention of JP1+SP141+SIS3+AG881 and the 4th intervention of JP1+SP141+SIS3+AG881 versus the 5th intervention of JP1+SP141+SIS3+AG881+Rapamycin on the survival of the mice were compared individually as shown in FIGS. 41E-41F, the survival time of mice increased significantly with each additional round of intervention compared with the control group, and the difference was statistically significant.


Conclusion of Example 2

To enhance the feasibility of the achievements of the disclosure in future drug development and clinical application, the strategy for screening heterogeneous molecules was further optimized in the sequential intervention confirmatory studies to inhibit melanoma metastasis. Specifically, in the gastric cancer confirmatory studies, the strategy for screening heterogeneous molecules was optimized from resequencing predicted heterogeneous molecules after each intervention to predicting all heterogeneous molecules in residual metastatic lesions with the first round of intervention as the basis for subsequent interventions. If the method is feasible, it greatly simplifies the process of identifying and predicting heterogeneous molecules, and will inevitably improve the feasibility of promotion and application. For melanoma, previous studies have shown that the expression level of JWA is significantly lower than that of para-carcinoma tissues, while the expression level of MDM2 is significantly higher than that of para-carcinoma tissues. Moreover, JWA and MDM2 are two independent heterogeneous molecules. In a confirmatory experimental model, after a joint intervention on two heterogeneous molecules, JWA and MDM2, residual metastatic foci from mouse lungs was extracted for high-throughput protein sequencing, and the heterogeneous molecules SMAD3, IDH1/2, and mTOR were identified and used for subsequent three rounds of sequential intervention, respectively.


In validating the sequential interventions for heterogeneous molecules driving melanoma metastasis, survival rates associated with lung metastases were used to judge the effectiveness of each round of sequential interventions.


It should be noted that in the confirmatory model for inhibiting melanoma metastasis, JWA and MDM2 are known independent molecules and are directly used for joint intervention, and the other three molecules, SMAD3, IDH1/2, and mTOR, predicted by different software for their interaction strength, are molecules with high interaction strength scores that exist in independent interaction networks (FIGS. 35A-35B).


It will be obvious to those skilled in the art that changes and modifications may be made, and therefore, the aim in the appended claims is to cover all such changes and modifications.

Claims
  • 1. A prediction system for identifying key heterogeneous molecules driving tumor metastasis, the system comprising: an input module, an analysis module, and an output module;
  • 2. The system of claim 1, wherein the first quantitative analysis result of proteins and the second quantitative analysis result of proteins are acquired through high-throughput protein sequencing; and the high-throughput protein sequencing comprises protein mass spectrometry analysis.
  • 3. The system of claim 1, wherein during the preliminary screening analysis, the proteins with expression changes within a predetermined range are proteins whose expression levels after drug intervention are 0.95-1.05 times higher than those before drug intervention.
  • 4. The system of claim 1, wherein during the preliminary screening analysis, the proteins with expression changes within the predetermined range are first screened and then included in the candidate protein set; the proteins that are screened meet the following conditions: a number of specific peptide segments is ≥2, and is not a prediction protein; a coverage of amino acids of identified peptide segments of a current protein is ≥10%; a quantification frequency of the current protein is ≥2, and a sequencing quality comprehensive score of the current protein is ≥200; evidence in a database shows the current protein is related to tumor progression; the specific peptide segments refer to a specific peptide segment used to identify the current protein; the prediction protein refers to an attribute that defines a reliability of the current protein in Uniport or NCBI databases as a prediction protein; the identified peptide segments refer to a specific peptide segment that can be used to indicate the current protein being detected; the quantification frequency of the current protein refers to a number of repetitions of the current protein being quantitatively detected.
  • 5. The system of claim 1, wherein during the secondary screening analysis, the protein interaction network analysis tool comprises a STRING network tool; the signal pathway database comprises KEGG, SMPDB, WikiPathways, Cell Signaling Technology, BioCarta Pathway, Pathway Commons, and PID; the protein interaction network prediction tool comprises a Genemania network tool, Unihi network tool, and Hitpredict network tool.
  • 6. The system of claim 5, wherein a Cytoscape tool is employed to construct and supplement the protein interaction networks, and to construct the protein interaction prediction network.
  • 7. The system of claim 1, wherein during the secondary screening analysis, the most important proteins in each protein interaction network are identified through characteristic analysis; the characteristic analysis comprises analyzing the importance, degree, and radiancy of each protein in the protein interaction network; and proteins with the highest importance, degree, and radiancy are the most important proteins in the protein interaction network.
  • 8. The system of claim 1, wherein during the calculating and sorting, the HR value of each heterogeneous molecule is calculated as follows: obtaining sequencing data of tumors from a public database, obtaining a standardized counting matrix of genes corresponding to the heterogeneous molecules from the sequencing data, and performing batch single factor COX regression on the genes to obtain the HR value.
  • 9. The system of claim 8, wherein the public database comprises TCGA, ICGC, and GEO.
  • 10. The system of claim 1, wherein during the calculating and sorting, a sorting rule is: the heterogeneous molecules with HR values greater than or equal to 1 are directly sorted in reverse order according to their HR values, and the heterogeneous molecules with HR values less than 1 are sorted in reverse order according to their 1/HR values.
Priority Claims (1)
Number Date Country Kind
202111444532.0 Nov 2021 CN national
CROSS-REFERENCE TO RELATED APPLICATIONS

This application is a continuation-in-part of International Patent Application No. PCT/CN2022/080847 with an international filing date of Mar. 15, 2022, designating the United States, now pending, and further claims foreign priority benefits to Chinese Patent Application No. 202111444532.0 filed Nov. 30, 2021. The contents of all of the aforementioned applications, including any intervening amendments thereto, are incorporated herein by reference. Inquiries from the public to applicants or assignees concerning this document or the related applications should be directed to: Matthias Scholl P.C., Attn.: Dr. Matthias Scholl Esq., 245 First Street, 18th Floor, Cambridge, MA 02142.

Continuation in Parts (1)
Number Date Country
Parent PCT/CN22/80847 Mar 2022 WO
Child 18608874 US