The invention relates to metabolic biomarkers for inflammatory bowel diseases and methods of their detection and use in the treatment of Crohn’s disease or ulcerative colitis.
Inflammatory bowel disease (IBD), including Crohn’s disease (CD) and ulcerative colitis (UC) are becoming major public health concerns, due to their dramatic expansion worldwide (Ng et al. Lancet 2017;390:2769-2778.), lack of cure and challenges in disease management. These diseases exhibit, however, heterogeneity in terms of disease onset, behavior, location, progression and response to therapy. Currently, there is an unmet need to identify biomarkers that would capture both host and environmental contributions to and thereby improve diagnosis, patient stratification, as well as treatment monitoring.
In general, the invention provides lipid biomarkers for inflammatory bowel disease and methods of their use. The biomarkers may be used, for example, to generate quantitative data in a subject, treat inflammatory bowel disease and/or assign treatment regimens to a subject.
In an aspect, the invention provides a method for generating quantitative data for a subject, including determining a level of a biomarker in a blood sample from the subject, wherein the biomarker is selected from 1) a phosphatidylcholine ether, 2) a phosphatidylethanolamine ether, 3) a phospholipid, 4) a sphingomyelin, 5) a cholesterol ester, 6) a very long-chain dicarboxylic acid (VLCDCA), 7) sitosterol sulfate, 8) cholesta-4,6-dien-3-one, 9) a triglyceride, 10) a diglyceride, and 11) a ceramide, or a combination thereof, and wherein the subject is suffering from an inflammatory bowel disease or is suspected of suffering from an inflammatory bowel disease.
In another aspect, the invention provides a method of assigning a treatment regimen to a subject diagnosed with an inflammatory bowel disease, the method including determining a level in a blood sample of at least one biomarker relative to a control, wherein the biomarker is selected from 1) a phosphatidylcholine ether, 2) a phosphatidylethanolamine ether, 3) a phospholipid, 4) a sphingomyelin, 5) a cholesterol ester, 6) a very long-chain dicarboxylic acid (VLCDCA), 7) sitosterol sulfate, 8) cholesta-4,6-dien-3-one, 9) a triglyceride, 10) a diglyceride, and 11) a ceramide, or a combination thereof, and wherein the treatment regimen is assigned to the subject based on the determination of the level of the biomarker.
In some embodiments of the foregoing aspects, the phosphatidylethanolamine ether is PE(O-16:0/20:4), PE(O-18:0/20:4), PE(O-16:0/22:6), or PE(O-18:0/22:6), the phosphatidylcholine ether is PC(O-20:0/22:6) or PC(O-18:0/22:6), the phospholipid is PC(18:1_22:5), PC(18:2_20:5), PC(19:0_18:2), PE(18:1_20:4)-1, PE(18:1_20:4)-2, PE(16:0_18:1), or PE(18:0_22:5), the sphingomyelin is SM(d18:1/21:0), SM(d17:1/24:1), SM(d18:2/16:0), SM(d18:2/23:0), SM(d18:2/24:0), SM(d17:1/24:1), SM(d18:1/25:1)-2, SM(d18:1/25:1)-1, SM(d18:1/24:0), SM(d19:1/24:0), SM(d18:1/21:0), or SM(d16:1/22:0), the cholesterol ester is CE(14:1) or CE(14:0), the ceramide is Cer(d16:1/23:0), the triglyceride is TG(16:0_18:1_20:4), TG(18:1_18:2_22:5), TG(16:0_18:1_22:5), TG(18:1_18:2_20:4), TG(56:4), TG(18:1_18:2_22:4), or TG(18:2_16_0_20:4), the diglyceride is DG(16:0_18:2), and/or the very long-chain dicarboxylic acid is 28:1(OH) or 28:4.
In some embodiments, the determination of the level of the biomarker includes performing an in vitro assay. In some embodiments, the in vitro assay includes spectrometry. In some embodiments, the spectrometry is mass spectrometry (MS) or liquid chromatography-mass spectrometry (LC-MS). In some embodiments, the MS or LC-MS includes multiple reaction monitoring, selected reaction monitoring or full scan monitoring.
In some embodiments, the blood sample is serum or plasma.
In some embodiments, the inflammatory bowel disease is Crohn’s disease. In some embodiments, the inflammatory bowel disease is Ulcerative Colitis. In some embodiments, the inflammatory bowel disease is indeterminate colitis.
In some embodiments, a decrease in the level of PE(O-16:0/20:4), a decrease in the level of VLCDCA 28:1 (OH), an increase in the level of sitosterol sulfate, or an increase in the level of cholesta-4,6-dien-3-one relative to the control is indicative of the subject suffering from stricturing or penetrating (B2/B3) Crohn’s Disease.
In some embodiments, a decrease in the level of CE(14:1), an increase in the level of sitosterol sulfate, or an increase in the level of cholesta-4,6-dien-3-one relative to the control is indicative of the subject suffering from ileal disease.
In some embodiments, a level of the phosphatidylethanolamine ether, the sphingomyelin, the cholesterol ester, or the very long-chain dicarboxylic acid lower relative to the control is indicative of the subject suffering from Crohn’s Disease.
In some embodiments, the subject is assigned a treatment regimen including the administration of a biologic drug. In some embodiments, the biologic drug is a molecularly targeted therapy. In some embodiments, the biologic drug is an antibody. In some embodiments, the biologic drug binds α4β7 integrin. In some embodiments, the biologic drug is vedolizumab.
In another aspect, the invention provides a method of treating an inflammatory bowel disease in a subject, the method including:
In some embodiments, the phosphatidylethanolamine ether is PE(O-16:0/20:4), PE(O-18:0/20:4), PE(O-16:0/22:6), or PE(O-18:0/22:6), the phosphatidylcholine ether is PC(O-20:0/22:6) or PC(O-18:0/22:6), the phospholipid is PC(18:1_22:5), PC(18:2_20:5), PC(19:0_18:2), PE(18:1_20:4)-1, PE(18:1_20:4)-2, PE(16:0_18:1), or PE(18:0_22:5), the sphingomyelin is SM(d18:1/21:0), SM(d17:1/24:1), SM(d18:2/16:0), SM(d18:2/23:0), SM(d18:2/24:0), SM(d17:1/24:1), SM(d18:1/25:1)-2, SM(d18:1/25:1)-1, SM(d18:1/24:0), SM(d19:1/24:0), SM(d18:1/21:0), or SM(d16:1/22:0), the cholesterol ester is CE(14:1) or CE(14:0), the ceramide is Cer(d16:1/23:0), the triglyceride is TG(16:0_18:1_20:4), TG(18:1_18:2_22:5), TG(16:0_18:1_22:5), TG(18:1_18:2_20:4), TG(56:4), TG(18:1_18:2_22:4), or TG(18:2_16_0_20:4), the diglyceride is DG(16:0_18:2), and/or the very long-chain dicarboxylic acid is 28:1(OH) or 28:4.
In some embodiments, the inflammatory bowel disease is Crohn’s disease. In some embodiments, the inflammatory bowel disease is Ulcerative Colitis. In some embodiments, the inflammatory bowel disease is indeterminate colitis.
In some embodiments, the therapeutic agent is a biologic drug. In some embodiments, the biologic drug is a molecularly targeted therapy. In some embodiments, the biologic drug is an antibody. In some embodiments, the biologic drug binds α4β7 integrin. In some embodiments, the biologic drug is vedolizumab.
In some embodiments of any of the foregoing aspects, the subject is a human.
AUC is area under the curve; C4,6D3O is cholesta-4,6-dien-3-one; CD is Crohn’s Disease; CE is cholesterol ester; FA is fatty acid; HPLC is high-performance liquid chromatography; IBD is inflammatory bowel diseases; LC is liquid chromatography; MS is mass spectrometry; NIDDK is National Institute of Diabetes and Digestive and Kidney Diseases; IBDGC is NIDDK IBD Genetics Consortium; OCFA is odd-chain fatty acid; PC is phosphatidylcholine; PCO- is PC ether; PE is phosphatidylethanolamine; PEO- is PE ether; QTOF is quadrupole time-of-flight; ROC is receiver operating characteristic; RT is retention time; Sit is sitosterol; SitS is sitosterol sulfate; SM is sphingomyelin; SP is sphingolipid; UC is ulcerative colitis; VLCDCA is very long-chain dicarboxylic acid; VLCFA is very long-chain fatty acid.
The term “antibody” herein is used in the broadest sense and encompasses various antibody structures, including but not limited to monoclonal antibodies, polyclonal antibodies, multispecific antibodies (e.g., bispecific antibodies), and antibody fragments so long as they exhibit the desired antigen-binding activity.
As used herein, a “biologic drug” or “biologic” refers to a product that is produced from living organisms or contains components of living organisms. Biologic drugs include a wide variety of products derived from human, animal, or microorganisms by using biotechnology.
The term “ileal disease,” as used herein, refers to a subtype of Crohn’s disease in which inflammation occurs in the ileum, the last division of the small intestine extending between the jejunum and large intestine.
The term “inflammatory bowel disease,” as used herein, refers to any disease that is characterized by chronic inflammation of the digestive tract. Exemplary inflammatory bowel diseases include Crohn’s disease and ulcerative colitis. The Montreal Disease Classification of inflammatory bowel disease is used herein to define the subtypes of Crohn’s disease and ulcerative colitis. This system of classification is described, for example, in Satsangi et al. Gut 2006;55:749-753. These classifications include Crohn’s disease location (L), including ileal (L1), colorectal (L2) and ileocolonic (L3) subtypes; Crohn’s disease behavior (B), including non-stricturing and non-penetrating (B1, also referred as inflammatory), stricturing (B2), and penetrating (B3) subtypes; and ulcerative colitis disease extent (E), including ulcerative proctitis (E1), left sided UC (E2) and extensive UC (E3).
The term “subject,” as used herein, refers to a human suffering from or at risk for an inflammatory bowel disease. The inflammatory bowel disease may be, for example, Crohn’s disease or ulcerative colitis. A subject may be diagnosed as having an inflammatory bowel disease or may be one experiencing one or more symptoms of an inflammatory bowel disease. Non-limiting examples of symptoms of an inflammatory bowel disease include fever, nausea, diarrhea, fatigue, abdominal pain, blood in stool, reduced appetite, mouth sores, and unexpected weight loss.
“Treatment” and “treating,” as used herein, refer to the medical management of a subject with the intent to improve, ameliorate, stabilize, prevent, or cure a disease, disorder, or condition. This term includes active treatment (treatment directed to improve the disease, disorder, or condition); palliative treatment (treatment designed for the relief of symptoms of the disease, disorder, or condition); and supportive treatment (treatment employed to supplement another therapy). “Treatment” and “treating,” as used herein, also refers to disease modification, meaning, that the expression of the disease is modified towards a less severe expression of the symptoms.
The patent or application file contains at least one drawing executed in color. Copies of this patent or patent application publication with color drawing(s) will be provided by the Office upon request and payment of the necessary fee.
In general, the invention provides metabolic biomarkers for inflammatory bowel disease and their use in treating the inflammatory bowel disease. Circulating lipids play crucial roles in many biological mechanisms relevant to IBD pathophysiology (Chiurchiu et al. Front Immunol 2018;9:38) and originate from host metabolic processes occurring in specific tissues/cells or from the incorporation/modification of dietary or gut microbiota metabolites. Lipids represent 75% of all circulating metabolites (Quehenberger et al. N Engl J Med 2011;365:1812-23). The application of mass spectrometry (MS)-based lipidomics to large number of blood samples from clinical cohorts remains, however, challenging (Burla B et al. J Lipid Res 2018;59:2001-2017; Gallagher K et al. J Crohns Colitis 2021;15:813-826). Notably the ability to adequately resolve and identify the multiple lipid isomers is still limiting biological data interpretation. For example, specifically in IBD, identification of lipid sub-classes with their acyl side chains is likely crucial because specific groups of fatty acids (FAs) could possibly discriminate between host vs microbiome metabolism (odd-chain FAs; OCFAs) (Jenkins B et al. Molecules 2015;20:2425-44), determine intestinal barrier integrity (very-long chain FAs; VLCFA)(Kim YR, et al. J Cell Mol Med 2017;21:3565-3578; Oertel et al. Cell Mol Life Sci 2017;74:3039-3055) or distinguish between reservoirs of pro- and anti-inflammatory precursors, namely omega-6 vs. omega-3 polyunsaturated fatty acids (PUFAs)(Alhouayek M et al. Biochim Biophys Acta Mol Cell Biol Lipids 2021;1866:158854), whose synthesis requires desaturase encoding enzymes (FADS1 and FADS2) which are known IBD risk genes (Jostins et al. Nature 2012;491:119-24; de Lange KM et al. Nat Genet 2017;49:256-261).
A validated LC-MS-based lipidomic workflow which is optimized for wide coverage of polar and non-polar lipids as well as resolution of their isomers was applied. It enables measurement of >1,500 high-quality MS signals, from 100 µL of plasma or serum with a single instrument, of which 509 structurally unique lipid species covering 25 lipid sub-classes, including their acyl side chains, have been annotated to date by MS/MS analyses. Median inter- and intra-assay coefficients of variations (or relative standard deviation (RSD)) for the resulting annotated lipids meet criteria of the Food and Drug Administration (i.e. >85% with RSD<20%), thereby enabling robust semi-quantification. First, this workflow was applied to 300 CD and 300 control subjects, matched for sex, age and ethnicity, collected by the National Institute of Diabetes and Digestive and Kidney Disease (NIDDK) IBD Genetics Consortium (IBDGC), in two independent phases. Second, it was tested if quantitative variations in circulating lipids were associated with response to therapy by analyzing serum samples from an independent cohort of 92 CD and UC patients commencing Vedolizumab, an anti-α4β7 integrin blocker antibody that is approved for moderate to severe IBD in patients who are refractory to conventional and/or anti-TNF therapy. The analyses identified candidate circulating lipid biomarkers that are strong CD disease classifiers, as well as being differently associated with location and/or behavior and response to therapy, thereby supporting their potential for capturing patient heterogeneity and improving disease management.
Inflammatory bowel disease (IBD) refers to a group of conditions characterized by inflammation of the large and/or small intestine. The most common subtypes of IBD are Crohn’s disease (CD) and ulcerative colitis (UC). Other non-limiting forms of IBD include indeterminate colitis, collagenous colitis, lymphocytic colitis, ischemic colitis, diversion colitis, Behcet’s syndrome, infective colitis.
Crohn’s disease is a disease characterized by chronic inflammation of the gastrointestinal tract, often the ileum and the cecum. It can also be confined to the small intestine, colon, anorectal region. It may further involve the duodenum, stomach, esophagus, or oral cavity. As a result of varying anatomic localization of the disease, the symptoms may differ among subjects. The most common symptoms are abdominal pain, diarrhea, and fever. It is also commonly associated with intestinal obstruction or fistula. It may lead to complications including inflammation of the eye, joints, and skin, liver disease, kidney stones, and amyloidosis.
Several features are characteristic of the pathology of CD. The inflammation associated with CD, known as transmural inflammation, involves all layers of the bowel wall. Thickening and edema, for example, typically also appear throughout the bowel wall, with fibrosis present in long-standing forms of the disease. The inflammation characteristic of CD is discontinuous in that segments of inflamed tissue, known as “skip lesions,” are separated by apparently normal intestine. Furthermore, linear ulcerations, edema, and inflammation of the intervening tissue lead to a “cobblestone” appearance of the intestinal mucosa, which is distinctive of CD.
A hallmark of CD is the presence of discrete aggregations of inflammatory cells, known as granulomas, which are generally found in the submucosa. Some CD cases display typical discrete granulomas, while others show a diffuse granulomatous reaction or a nonspecific transmural inflammation. As a result, the presence of discrete granulomas is indicative of CD, although the absence of granulomas is also consistent with the disease. Thus, transmural or discontinuous inflammation, rather than the presence of granulomas, is a preferred diagnostic indicator of CD (Rubin and Farber, Pathology (Second Edition), Philadelphia, J.B. Lippincott Company (1994)).
There are three categories of disease presentation in Crohn’s disease: (B1) inflammatory, (B2) stricturing, and (B3) penetrating. Stricturing disease causes narrowing of the bowel which may lead to bowel obstruction or changes in the caliber of the feces. Penetrating disease creates abnormal passageways (fistulae) between the bowel and other structures such as the skin. Inflammatory disease (also known as non-stricturing, non-penetrating disease) causes inflammation without causing strictures or fistulae.
Subjects with CD can be classified as having complicated CD, which is a clinical subtype characterized by stricturing or penetrating phenotypes. In certain other instances, subjects with CD can be classified as having a form of CD characterized by one or more of the following complications: fibrostenosis, internal perforating disease, and the need for small bowel surgery. Subjects with CD can be also classified as having an aggressive form of fibrostenotic disease requiring small bowel surgery. Criteria relating to these subtypes have been described, for example, in Gasche et al., Inflamm. Bowel Dis., 6:8-15 (2000); Abreu et al., Gastroenterology, 123:679-688 (2002); Vasiliauskas et al., Gut, 47:487-496 (2000); Vasiliauskas et al., Gastroenterology, 110:1810-1819 (1996); and Greenstein et al., Gut, 29:588-592 (1988).
One skilled in the art understands that overlap can exist between clinical subtypes of CD and that a subject having CD can have more than one clinical subtype of CD. For example, a subject having CD can have the fibrostenotic subtype of CD and can also meet clinical criteria for a clinical subtype characterized by the need for small bowel surgery or the internal perforating disease subtype. Similarly, the metabolic biomarkers described herein can be associated with more than one clinical subtype of CD.
Ulcerative colitis (UC) is a disease of the large intestine characterized by chronic diarrhea with cramping, abdominal pain, rectal bleeding, loose discharges of blood, pus, and mucus. UC may present widely different clinical manifestations. A pattern of exacerbations and remissions typifies the clinical course for about 70% of UC patients, although continuous symptoms without remission are present in some patients with UC. Local and systemic complications of UC include arthritis, eye inflammation such as uveitis, skin ulcers, and liver disease. In addition, UC, and especially the long-standing, extensive form of the disease is associated with an increased risk of colon carcinoma.
UC usually extends from the most distal part of the rectum for a variable distance proximally. Left-sided colitis describes an inflammation that involves the distal portion of the colon, extending as far as the splenic flexure. Sparing of the rectum or involvement of the right side (proximal portion) of the colon alone is unusual in UC. The inflammatory process of UC is limited to the colon and does not involve, for example, the small intestine, stomach, or esophagus. In addition, UC is distinguished by a superficial inflammation of the mucosa that generally spares the deeper layers of the bowel wall. Crypt abscesses, in which degenerated intestinal crypts are filled with neutrophils, are also typical of UC.
In certain instances, with respect to UC, the variability of symptoms reflect differences in the extent of disease (i.e., the amount of the colon and rectum that are inflamed) and the intensity of inflammation. Disease starts at the rectum and moves “up” the colon to involve more of the organ. UC can be categorized by the amount of colon involved. Typically, patients with inflammation confined to the rectum and a short segment of the colon adjacent to the rectum have milder symptoms and a better prognosis than patients with more widespread inflammation of the colon.
In comparison with CD, which is a patchy disease with frequent sparing of the rectum, UC is characterized by a continuous inflammation of the colon that usually is more severe distally than proximally. The inflammation in UC is superficial in that it is usually limited to the mucosal layer and is characterized by an acute inflammatory infiltrate with neutrophils and crypt abscesses. In contrast, CD affects the entire thickness of the bowel wall with granulomas often, although not always, present. Disease that terminates at the ileocecal valve, or in the colon distal to it, is indicative of UC, while involvement of the terminal ileum, a cobblestone-like appearance, discrete ulcers, or fistulas suggests CD.
The different clinical subtypes of ulcerative colitis are classified according to the location and the extent of inflammation. As non-limiting examples, subjects with UC can be classified as having ulcerative proctitis, proctosigmoiditis, left-sided colitis, pancolitis, fulminant colitis, and combinations thereof. Criteria relating to these subtypes have been described, for example, in Kornbluth et al., Am. J. Gastroenterol., 99: 1371-85 (2004).
Ulcerative proctitis is a clinical subtype of UC defined by inflammation that is limited to the rectum. Proctosigmoiditis is a clinical subtype of UC which affects the rectum and the sigmoid colon. Left-sided colitis is a clinical subtype of UC which affects the entire left side of the colon, from the rectum to the place where the colon bends near the spleen and begins to run across the upper abdomen (the splenic flexure). Pancolitis is a clinical subtype of UC which affects the entire colon. Fulminant colitis is a rare, but severe form of pancolitis. Patients with fulminant colitis are extremely ill with dehydration, severe abdominal pain, protracted diarrhea with bleeding, and even shock.
In some embodiments, classification of the clinical subtype of UC is important in planning an effective course of treatment. While ulcerative proctitis, proctosigmoiditis, and left-sided colitis can be treated with local agents introduced through the anus, including steroid-based or other enemas and foams, pancolitis must be treated with oral medication so that active ingredients can reach all of the affected portions of the colon.
One skilled in the art understands that overlap can exist between clinical subtypes of UC and that a subject having UC can have more than one clinical subtype of UC. Similarly, the metabolic biomarkers described herein can be associated with more than one clinical subtype of UC.
Indeterminate colitis (IC) is a clinical subtype of IBD that includes features of CD and UC. The overlap of symptoms can occur temporarily (e.g., in the early stages of the disease) or persistently (e.g., throughout the progression of the disease) in patients with IC. Clinically, IC is characterized by abdominal pain and diarrhea with or without rectal bleeding. For example, colitis with intermittent multiple ulcerations separated by normal mucosa is found in patients with the disease. Histologically, there is a pattern of severe ulceration with transmural inflammation. The rectum is typically free of the disease and the lymphoid inflammatory cells do not show aggregation. Although deep slit-like fissures are observed with foci of myocytolysis, the intervening mucosa is typically minimally congested with the preservation of goblet cells in patients with IC.
Inflammatory bowel disease is currently not curable. However, there are a number of treatments for IBD. The goal of such treatments is multifold, and includes improvement of the subject’s clinical condition, reducing side effects, improving quality of life, reducing drug poisoning, nutritional support, and reducing the need for hospital admission or surgery. Diagnosis and treatment of IBD is discussed in Seyedian et al., J. Med. Life., 12:113-122 (2019), the disclosure of which is incorporated herein by reference. Typically, treatment occurs in stages, with drugs with fewer expected side effects being prescribed first and advancing to other drugs if the initial therapy does not provide relief. The biomarkers disclosed herein may be used in determining a treatment regimen that includes any of the drugs described herein.
The first line treatment for IBD is often anti-inflammatory drugs of the aminosalicylate family. Non-limiting examples of these drugs include balsalazide, mesalamine, olsalazine, and sulfasalazine, the structure of which are shown below.
Other examples of anti-inflammatory drugs used to treat IBD are corticosteroids. Non-limiting examples of corticosteroids include budenoside, hydrocortisone, methylprednisolone, and prednisone, the structure of which are shown below.
Immunosuppressive drugs are also used to treat IBD when the above anti-inflammatory drugs are unsuccessful. Immunomodulators include, but are not limited to, 6-mercaptopurine, azathioprine, cyclosporine, and methotrexate.
Antibiotic medications may also be used in addition to other medications or when there is concern for infection, such as in perianal Crohn’s disease. Commonly used antibiotics to treat IBD include ciprofloxacin and metronidazole.
Biologic therapies can also be used to treat IBD. Anti-TNF (tumor necrosis factor) agents, such as infliximab, adalimumab, and certolizumab, are a class of biologics used to treat IBD. Alternatively, biologic therapies may be anti-integrin agents, such as natalizumab and vedolizumab. Specifically, vedolizumab binds α4β7 integrin. As a further alternative, anti-IL-12 and IL-23 antibodies, such as ustekinumab, may be used to treat IBD.
Integrins play a role in leukocyte trafficking and extravasation from the vascular compartment into target tissues. They are heterodimers having an α and a β subunit. Anti-integrin therapies modulate inflammation by binding to integrins that contribute to leukocyte trafficking, thereby preventing leukocyte migration into GI mucosa and the development of inflammation within these tissues. Natalizumab targets the α4 subunit, and therefore blocks both α4β1 and α4β7 integrin. Vedolizumab blocks α4β7 without binding to α4β1. By only binding α4β7, the ability of α4β1 to bind its ligand, vascular cell adhesion protein 1 (VCAM-1) is preserved, permitting continued immune surveillance within the CNS and theoretically eliminating the risk of progressive multifocal leukoencephalopathy associated with natalizumab treatment.
The use of biologic drugs in the context treatment of IBD is known as molecularly targeted therapy (MTT). Molecularly targeted therapies used an agent (or combination of agents) that acts with a high degree of specificity on a well-defined target or biologic pathway.
The methods disclosed herein may include the detection in serum of one or more lipid biomarkers of inflammatory bowel disease. These lipid biomarkers may include, but are not limited to, the classes of lipids described below.
The lipid biomarker used in the methods described herein may be a sphingolipid. Sphingolipids are structural molecules of cell membranes that play a role in maintaining barrier function and fluidity. They also regulate various biological processes (e.g., growth, proliferation, migration, invasion and/or metastasis) by controlling signaling functions within the cell signal transduction network.
In some embodiments, the sphingolipid is a sphingomyelin (SM). The sphingomyelin may be SM(d18:1/21:0), SM(d17:1/24:1), SM(d18:2/16:0), SM(d18:2/23:0), SM(d18:2/24:0), SM(d17:1/24:1), SM(d18:1/25:1)-2, SM(d18:1/25:1)-1, SM(d18:1/24:0), SM(d19:1/24:0), SM(d18:1/21:0), or SM(d16:1/22:0).
SM(d18:1/21:0) is described in the Human Metabolome Database (identification number HMDB0240611). It is a type of sphingolipid found in animal cell membranes, especially in the membranous myelin sheath which surrounds some nerve cell axons. It usually contains phosphorylcholine and ceramide. SM(d18:1/21:0) has a sphingosine backbone and a heneicosanoic acid chain. In humans, sphingomyelin is the only membrane phospholipid not derived from glycerol. Like all sphingolipids, SM has a ceramide core (sphingosine bonded to a fatty acid via an amide linkage). In addition, it contains one polar head group, which is either phosphocholine or phosphoethanolamine. The plasma membrane of cells is highly enriched in sphingomyelin and is considered largely to be found in the exoplasmic leaflet of the cell membrane. However, there is some evidence that there may also be a sphingomyelin pool in the inner leaflet of the membrane. Moreover, neutral sphingomyelinase-2, an enzyme that breaks down sphingomyelin into ceramide, has been found to localize exclusively to the inner leaflet further suggesting that there may be sphingomyelin present there. Sphingomyelin can accumulate in a rare hereditary disease called Niemann-Pick Disease, types A and B. Niemann-Pick disease is a genetically-inherited disease caused by a deficiency in the enzyme sphingomyelinase, which causes the accumulation of sphingomyelin in spleen, liver, lungs, bone marrow, and the brain, causing irreversible neurological damage. SMs play a role in signal transduction. Sphingomyelins are synthesized by the transfer of phosphorylcholine from phosphatidylcholine to a ceramide in a reaction catalyzed by sphingomyelin synthase. The structure of SM(d18:1/21:0) is below.
Alternatively, the sphingolipid may be a ceramide. Ceramides are structurally similar to sphingomyelins but lack the phosphatidylcholine group. An exemplary ceramide used herein is Cer(d16:1/23:0), the structure of which is shown below.
The lipid biomarker used in the methods described herein may be a phosphatidylcholine. Phosphatidylcholines are a class of phospholipids that incorporate choline as a head group. They are glycerophosphocholines in which the two free —OH are attached to one fatty acid each through an ester linkage. They are a component of biological membranes.
The phosphatidylcholine may be a phosphatidylcholine ether. For example, the lipid biomarker may be the phosphatidylcholine ether PC(19:0_18:2), PC(18:2_20:5), PC(O-20:0/22:6), PC(19:0_18:2), or PC(O-18:0/22:6), PC(18:1_22:5). The structure of PC(19:0_18:2) is shown below.
The lipid biomarker used in the methods described herein may be a phosphatidylethanolamine. Phosphatidylethanolamines are glycerophospholipids in which a phosphorylethanolamine moiety occupies a glycerol substitution site. They are found in biological membranes.
The phosphatidylethanolamine may be a phosphatidylethanolamine ether such as PE(O-16:0/20:4), PE(O-18:0/20:4), PE(O-16:0/22:6), PE(18:1_20:4)-1, PE(18:1_20:4)-2, PE(16:0_18:1), PE(18:0_22:5), or PE(O-18:0/22:6). The structure of PE(O-16:0/20:4) is shown below.
The lipid biomarker used in the methods described herein may be a carboxylic acid. For example, it may be a fatty acid, such as caprylic acid, capric acid, lauric acid, myristic acid, palmitic acid, stearic acid, arachidic acid, behenic acid, lignoceric acid, cerotic acid, myristoleic acid, palmitoleic acid, sapienic acid, oleic acid, elaidic acid, vaccenic acid, linoleic acid, linoelaidic acid, and arachidonic acid. The carboxylic acid may also be a dicarboxylic acid. In preferred embodiments, the dicarboxylic acid is a very long-chain dicarboxylic acid. For example, the dicarboxylic acid may be a VLCDCA 28:1 (OH) or 28:4.
The biomarker used in the methods described herein may be a sterol. For example, the sterol may be cholesterol or an analog thereof. In preferred embodiments, the cholesterol analog is cholesta-4,6-dien-3-one. The biomarker could be an ester of cholesterol, where the alcohol in cholesterol forms an ester with a fatty acid. For example, the cholesterol ester can be CE(14:1) or CE(14:0). The sterol may also be sitosterol or an analog thereof. In preferred embodiments, the sterol is sitosterol sulfate (SitS).
The biomarker used in the methods described herein may be a triglycerides. Triglycerides, also referred to as triacylglycerols, are esters derived from glycerol and three fatty acids. Triglycerides are the major dietary fat, and they are hydrolyzed in the guy by lipases to fatty acids and monoglycerides. Endogenous triglycerides are, to a large extent, synthesized in the liver and adipose tissue.
Exemplary triglycerides used herein include TG(16:0_18:1_20:4), TG(18:1_18:2_22:5), TG(16:0_18:1_22:5), TG(18:1_18:2_20:4), TG(56:4), TG(18:1_18:2_22:4), and TG(18:2_16_0_20:4). The structure of TG(16:0_18:1_20:4) is shown below.
Alternatively, the biomarker may be a diglyceride, also referred to as a diacylglycerol. Diglycerides contain a glycerol with two fatty acids attached through ester linkages. Diglycerides may be 1,2-aiacylglycerols or 1,3-diacylglycerols. Diacylglycerols may be intermediates in lipid biosynthetic pathways and may act as signaling lipids. An exemplary diglyceride used herein is DG(16:0_18:2), the structure of which is shown below.
The following examples are meant to illustrate the invention. They are not meant to limit the invention in any way.
CD and UC subtypes were defined according to the Montreal classification as follows: (i) CD disease location (L), including ileal (L1), colorectal (L2) and ileocolonic (L3) subtypes, (ii) CD disease behavior (B), including non-stricturing and non-penetrating (B1, also referred as inflammatory), stricturing (B2) and penetrating (B3) subtypes, with B2 and B3 being severe forms of CD frequently resulting in the requirement for surgery, and (iii) UC disease extent (E), including ulcerative proctitis (E1), left sided UC (E2) and extensive UC (E3).
LC-MS grade methanol, acetonitrile, isopropanol, ethyl acetate and high-performance liquid chromatography (HPLC) grade methyl tert-butyl ether (MTBE) were purchased from J.T. Baker (USA), HPLC grade hexane from Laboratoire Mat (Quebec, QC, Canada) and HPLC grade chloroform and formic acid from Fisher Scientific (Ottawa, ON, Canada). Ultra-pure water was produced using a Milli-Q system (Millipore, Billerica, MA, USA). Ammonium formate was obtained from Sigma-Aldrich (St. Louis, MO, USA). Lipid standards: LPC(13:0), PC(19:0/19:0), PC(14:0/14:0), PS(12:0/12:0), PG(15:0/15:0) and PE(17:0/17:0) were purchased from Avanti Polar Lipids Inc, (Alabaster, USA).
First, serum samples from non-fasting CD patients and healthy donors matched for sex, age and ethnicity, collected by the six Genetic Research Centers of the IBDGC between 2004 and 2011, were selected from the IBDGC repository and processed in two separate phases, referred to as IBDGC-1 and IBDGC-2. For the discovery phase (IBDGC-1), 100 CD patients with a more complicated disease behavior (stricturing or penetrating according to the Montreal classification) were selected, whereas 200 CD patients with a more representative proportion of disease behavior and location subtypes were selected for the second phase (IBDGC-2), as described in Boucher et al., Inflamm Bowel Dis 2021. Second, serum samples from 92 IBD patients commencing Vedolizumab (42 CD and 50 UC) were obtained from the Massachusetts General Hospital, from a previously described multicenter cohort. Two samples per patient were collected at weeks 0 and 14 (i.e., immediately prior to commencing therapy and at first assessment visit). Clinical response and remission were defined as described in Shelton et al., Inflamm Bowel Dis 2015;21:2879-85 and considered together to assess the lipidome rearrangements associated with response to Vedolizumab. A study flow diagram is shown in
All samples were analyzed using a previously validated untargeted LC-MS-based lipidomic workflow (Forest A et al. J Proteome Res 2018;17:3657-3670). Following recommended guidelines, stratified randomization of samples was achieved according to potential confounding factors, namely age, sex, disease status, as well as treatment whenever applicable, in order to minimize batch-dependent bias. MS data were acquired in positive and negative modes. MS quality controls (QCs) were performed by (i) injecting “in-house” plasma pool QC sample at the beginning, the end and every 20 runs, (ii) injecting blanks every 20 runs and (iii) monitoring six internal standards spiked in samples for signal intensity, mass mass-to-charge ratios (m/z) and retention time (RT) accuracies.
Raw MS data were processed as previously described (Forest A et al. J Proteome Res 2018;17:3657-3670) using Mass Hunter Qualitative Analysis (Agilent, Santa Clara, USA) for peak picking and using an in-house bioinformatic script encoded in both Perl and R languages for (i) MS peak alignment; (ii) filter of presence (features retained must be present in 80% of samples from at least one group and have coefficient of variation < 80% in healthy donors samples); (iii) normalization of signal intensities using cyclic loess algorithm; (iv) imputation of missing values using k-Nearest Neighbor (KNN) on scaled data; (v) batch and collection center effect correction using Combat. The resulting final datasets listed high-quality MS signals, thereafter referred to as features, defined by their m/z, RT and signal intensity.
All features associated with stricturing or penetrating (B2/B3) CD vs control with p-value<0.05 in IBDGC-1 or 2 were analyzed by MS/MS for annotation to lipid species as previously described (Forest A et al. J Proteome Res 2018;17:3657-3670). Cholesta-4,6-dien-3-one (C4,6D3O) and cholesterol sulfate IDs were validated using analytical standards purchased from BOC Sciences (Shirley, NY, USA) and Sigma-Aldrich (St. Louis, MO, USA), respectively. Sitosterol sulfate (SitS) ID was validated using a standard kindly provided by Pr Hubert Schaller. All lipid IDs are mentioned herein as annotated lipid molecular species, irrespective of validation with an analytical standard. Features without ID assumption were classified as unidentified and mentioned using their feature ID, defined as ionization mode:mass@RT.
MS data retained in the final IBDGC-1 and 2 datasets were analyzed independently, following untargeted data mining approaches. (i) Individual testing identified features associated with B2/B3 vs control. Results were compared between the IBDGC phases to assess replicability. (ii) The larger IBDGC-2 dataset was selected for network analysis of correlations existing between B2/B3-associated features. (iii) Individual testing was used to test the association of lipid features with CD subtypes. (iv) Classification models validated the potential of circulating lipids to discriminate between CD and control phenotypes. Similarly, the final VEDO dataset was analyzed using individual testing to identify changes in lipid features associated with response to Vedolizumab.
After log2-transformation of the signal intensity values, independent testing was done for each individual feature using regression analysis corrected for sex. For annotated features detected in both IBDGC datasets, the evidence of association with B2/B3 vs control was compared and then pooled using Z-score statistics. Association with disease behavior was performed as B2/B3 vs inflammatory (B1), conditional on sex and age. Analyses of disease location was performed conditional on sex and age and considered ileocolonic (L3) as intermediate between colorectal (L2) and ileal (L1). Multiple testing was accounted for by evaluating false discovery rate, using q-values (R package qvalue).
For the IBDGC datasets, threshold for selecting associated features using individual testing was p-value<1 E-4. For the pilot VEDO study, p-value<0.05 threshold was used.
Positive correlation (r>0.4) between B2/B3-associated features (p-value<1E-4 and | log2(FC) | >0.3) was projected onto a 2-dimensional display using Fruchterman-Reingold layout algorithm (R package igraph) (Csardi G, Nepusz T. InterJournal 2006;Complex Systems:1695). A starting point was given to the algorithm as the first two principal component analysis coordinates.
A classification model was built from the IBDGC-2 dataset using 135 features associated with B2/B3 vs control with p-value<0.05 and | log2(FC) | >0.3, based on the Bayesian Information Criterion. Annotated lipid molecular species were prioritized to be included in the model. Evaluation of the model performances were represented using receiver operating characteristic (ROC) curves and summarized with area under the curve (AUC). Confidence intervals (95% CI) were computed using bootstrap (R package pROC) (Robin X et al. BMC Bioinformatics 2011;12:77). Given these performances are expected to be overestimated when the model is built (trained) and validated within the same samples, five-fold cross validation, as well as an alternate model built in IBDGC-2 and tested independently in IBDGC-1, were used to evaluate a lower-bound for AUC.
Serum samples from a total of 300 CD patients and 300 healthy donors matched for sex, age and ethnicity were analyzed in two independent phases. The demographic and clinical information for these subjects is presented in Table 1.
In the first phase (IBDGC-1), we conducted an untargeted LC-MS-based lipidomic profiling of 100 CD patients with stricturing or penetrating disease behavior (B2/B3) and 100 control samples. Raw data consisted of 1068 MS signals that were processed for peak alignment, filters of presence normalization of signal intensities, imputation of missing values as well as corrections for batch and collection center effects (see
Given the untargeted nature of this lipidomic screen, the next step was to determine the identity of these lipid features. This was performed by MS/MS analyses that resulted in the annotation of 46 of these features (46/72, 64%), corresponding to 37 structurally unique lipid molecular species once duplicate ions had been removed (Tables 2A and 2B). Interestingly, these can be grouped into four distinct lipid categories and 12 sub-classes (Table 2C). Among these lipid sub-classes, sphingomyelins (SMs; decreased in CD vs control) are known to modulate inflammatory processes and the intestinal epithelium barrier function. Tables 2A and 2B list the 72 B2/B3-associated lipid features with p-value<1E-4 and | log2(FC) | >0.3 in IBDGC-1. ID and characteristics of features (m/z, RT, ionization mode, detected adducts and MS/MS fragments considered for annotation) as well as lipid ID and category are shown. Association with B2/B3 vs control (p- and q-values), as well as effect size (log2(FC)) and standard error (SE) are shown for each feature. Bold lipid IDs correspond to the 15 most significant structurally unique B2/B3-associated lipids, that are also replaced in
Given these very positive results, this lipidomic profiling was extended to a larger set of samples consisting of 200 CD patients and 200 matched controls (IBDGC-2), that didn’t overlap with IBDGC-1. IBDGC-2 had roughly equivalent numbers of cases in the inflammatory (B1) and stricturing (B2) or penetrating (B3) behavior categories (Table 1). Processed MS data retained 1894 features in the final IBDGC-2 dataset. The larger number of detected features, in comparison to IBDGC-1, is attributed to the difference in sensitivity performances between the LC-QTOF instruments used for the two phases. To be comparable to IBDGC-1, the initial analysis of IBDGC-2 focused on CD patients with either B2 or B3 phenotype. Using the same criteria as above (p-value<1E-4, corresponding to q-value<5E-4, and | log2(FC) | >0.3), 73 features were significantly associated with B2/B3 vs control (
Replicability of results between the two IBDGC studies was also assessed by a manual inter-study alignment based on feature characteristics (i.e. MS/MS spectra and RT matching to avoid misinterpretation related to potential isomers), primarily focusing on annotated unique lipid molecular species associated with B2/B3 vs control with p-value<1 E-4 and | log2(FC) | >0.3 in either dataset and detected in both datasets (Tables 4A and 4B). High replicability was found between findings obtained through independent analyses of the two IBDGC studies using two LC-QTOF instruments in a year and a half-interval (Tables 4A and 4B). Nearly 90% (28 of 32) of these were associated with B2/B3 with p-value<0.05 in both datasets. It should be noted, however, given the greater sensitivity performance in IBDGC-2, we not only identified more ether lipids of both phosphatidylcholine (i.e. PCO-) and phosphatidylethanolamine (i.e. PEO-) that were lower in CD patients, but we also identified two additional lipids that were associated to CD, but that were not detectable in the first phase. These were: (i) C4,6D3O (higher in CD patients) and (ii) cholesterol ester (CE 14:1); lower).
Tables 4A and 4B lists 27 and 36 annotated lipid molecular species associated with B2/B3 vs control with p-value<1E-4 and | log2(FC) | >0.3 in IBDGC-1 and 2, respectively, that were included in the inter-study alignment. Feature IDs and adducts are shown, as well as the corresponding lipid IDs. Association for B2/B3 vs control (p- and q-values) as well as effect size (log2(FC)) are shown for each feature, in both datasets. Symbols ø indicate that the feature was not detected in the dataset. Bold feature IDs correspond to features associated with p-value<0.05 in the other dataset. Grey feature IDs correspond to features with p-value>1E-4 and/or | log2(FC) | <0.3 in the corresponding dataset. Bold lipid IDs correspond to 16 B2/B3-associated features in IBDGC-2, also detected in IBDGC-1, and used to build the alternate classification model (
For inter-study alignment, Neg features were considered for ceramides. Whereas the Cer(d16:1/23:0)-corresponding feature, namely Pos:607.5886@30.05, was associated with B2/B3 vs control in IBDGC-2 (Tables 3A and 3B), the corresponding Neg feature was not detected in either of the two datasets. For similar reasons PC(19:0_18:2) is missing in the list, and 36 instead of 38 B2/B3-associated lipids were included in the list for IBDGC-2. Similarly, 27 instead of 37 B2/B3-associated lipids were included in the list for IBDGC-1. Accordingly, 96% (26/27) of the B2/B3-associated lipids with p-value<1E-4 and | log2(FC) | >0.3 in IBDGC-1 were also detected in IBDGC-2, and 89% (24/27) were also associated with p-value<0.05 in IBDGC-2. Conversely, of the 36 B2/B3-associated lipids with p-value<1E-4 and | log2(FC) | >0.3 in IBDGC-2, 21 could not be detected in IBDGC-1, with 93% (14/15) of the remaining being associated with p-value<0.05 in IBDGC-1.
For the 31 B2/B3-associated lipids with p-value<1 E-4 and | log2(FC) | >0.3 in at least one dataset and detected in both datasets, association (p-value) was calculated for the combined datasets (combined-IBDGC). Crossed-out boxes are for features not detected in one or the other dataset. Symbols * indicate 19 unique lipid molecular species associated with B2/B3 vs control with p-value<1E-4 in both datasets.
Since high dimensional lipidomic data is expected to have a correlation structure, this was assessed in the 73 B2/B3-associated features from IBDGC-2. Six correlation clusters and four individual features were found (
>70 structurally unique lipids were identified with strong association to CD (p-value<1E-4 and q-value<5E-4) and effect size (|log2(FC)|>0.3) in both datasets described herein, of which over 60% were annotated at the molecular species level by MS/MS. Changes encompass four lipid categories and 12 sub-classes including the lipid species VLCDCA 28:1(OH), SitS and C4,6D3O. Most associated lipid features fell into five major correlation clusters that included lipids predominantly of similar (sub-)classes and/or sharing similar acyl side chains. A few other lipid features were, however, independent of these major clusters. This suggests that a relatively smaller number of biological pathways are involved in, and/or impacted by, disease mechanisms than the number of associated lipid features.
Given the relatively strong effect sizes observed for the associated lipid metabolites, a model could be built with only nine lipid features that had very high-performance characteristics with the ROC curve having an AUC of 0.97. Of note, the nine lipid features were from six different correlation clusters plus 2 individual features, highlighting the non-redundant information provided by these different clusters of lipids and likely distinct biological pathways. The identity of five of these classifiers has been determined, namely PE(O-16:0/20:4), VLCDCA 28:1(OH), SM(d18:1/21:0), SitS and CE(14:1). Cross validation of this model demonstrated that this classifier was robust, and alternate models with as few as five lipid features were also of high quality (ROC curves with AUC ranging from 0.84 to 0.9). The strong performance characteristics suggest that a small set of circulating lipids are candidate biomarkers to assist disease classification in CD.
Given that a majority of the lipid entities detected have been identified, the markers selected could be analyzed by the classifier models and the lipids present in their correlation clusters, with the objective of identifying structural commonalities within each cluster to inform about the important biological properties of each. Changes in serum levels of identified lipid classifiers or their correlated counterparts as well as their potential impact on the (patho)physiology of IBD are illustrated in
Cluster A includes PE(O-16:0/20:4) and five other correlated phospholipids identified by MS/MS analyses. Correlated lipids include the ether phospholipids PE(O-18:0/20:4), PE(O-16:0/22:6), PE(O-18:0/22:6), PC(O-20:0/22:6), and PC(O-18:0/22:6), and the phospholipid PC(18:2_20:5). PE(O-16:0/20:4) was selected by the predictive models, was lower in the serum of patients with CD as compared to controls, and was significantly associated with the B2/B3 phenotype, but not disease location. Ether lipids are known to be unusually abundant in neutrophil membranes where they are essential for cell viability. Additional lipids within this cluster share a structural feature with ether lipids; they have a PUFA moiety in sn-2 position, predominantly the omega-6 arachidonic acid (AA; C20:4) or omega-3 docosahaexanoic acid (C22:6; DHA), which are susceptible to oxidative stress and likely contribute to their biological role via their well-known metabolism to pro/anti-inflammatory as well as pro-resolving mediators. Given these ether lipids are plasmanyls, which are intermediates in plasmalogen synthesis, these findings suggest also a host dysmetabolism in CD of ether lipids in peroxisomes, which are specialized cellular organelle recently shown to function as hubs that coordinate responses to stress, metabolism and immune signaling to maintain enteric health and the functionality of the gut-microbe interface. Hence, the lower circulating levels in ether lipids containing PUFAs may reflect an enhanced oxidative stress and/or dysregulated peroxisomal lipid metabolism and likely compromise inflammation resolution in CD.
Cluster B includes VLCDCA 28:1(OH) and four correlated lipid features, with one of the latter also being classified as very-long-chain dicarboxylic acids (VLCDCA). A correlated lipid includes VLCDCA 28:4. VLCDCA 28:1(OH) was lower in the serum of patients with CD, associated with the B2/B3 phenotypes, but not disease location. Of particular relevance, it has been reported that VLCDCAs have anti-proliferative and anti-inflammatory properties in vitro. While originally named Gastrointestinal Tract Acids, VLCDCAs are known to be reduced in patients with colorectal cancer. Given that IBD patients are at risk for colitis-associated cancer, likely due to mechanisms associated with chronic inflammation and repeated events of inflammatory relapse, one may consider that VLCDCAs belong to the endogenous chemopreventive/anti-inflammatory genius. If so, VLCDCAs would fall into the sphere of the bioactive lipids, where they would join their structurally and functionally related cousins, namely the specialized pro-resolution mediators (SPMs), which are hydroxylated ω-3 or ω-6 PUFA metabolites actively produced to trigger the resolution phase of inflammation via signaling through G Protein-Coupled Receptor-dependent signaling. Disclosed herein is the finding that of VLCDCA 28:1(OH) and its non-hydroxylated and polyunsaturated relative VLCDCA 28:4, exhibit lower circulating levels in serum in CD. This may reflect a dysregulation of the metabolism of VLCDCAs in CD patients, which may result from (i) reduced VLCFA metabolism via cytochrome P450 of the CYP4 family (CYP4F) of enzymes in liver, which was associated with IBD, and/or (ii) their enhanced catabolism in peroxisomes. Irrespective of the mechanism, this is, however, likely impacting inflammatory processes possibly via SPM-like effects which have not been explored to date in IBD.
Cluster C contains SM(d18:1/21:0) and 11 correlated lipids that were lower in the sera of CD patients as compared to controls. Correlated lipids include SM(d17:1/24:1), SM(d18:2/16:0), SM(d18:2/23:0), SM(d18:2/24:0), SM(d17:1/24:1), SM(d18:1/25:1)-2, SM(d18:1/25:1)-1, SM(d18:1/24:0), SM(d19:1/24:0), SM(d18:1/21:0), SM(d16:1/22:0), Cer(d16:1/23:0), and PC(19:0_18:2). Examination of the structural features of these lipids found that they bear OCFA and/or VLCFA moieties. It is likely that OCFA levels reflect gut microbiota-mediated synthesis of its short-chain FA precursor, namely propionic acid, and/or branched-chain amino acid by the host. Given a dysbiosis-related reduction of short-chain FA-producing bacterial species has been described in CD and reduced levels of branched-chain amino acids were also reported, the lower levels of circulating biomarkers bearing OCFAs and correlated with SM(d18:1/21:0), which is strongly associated with disease behavior but independent of location, are particularly consistent. Since incorporation of VLCFAs into sphingolipid species, which involves intestinal ceramide synthase 2 activity, has also been connected to intestinal defense in mice, lower serum SMs bearing OCFA/VLCFA likely reflect dysbiosis and/or a defective synthesis and may impact on intestinal epithelial barrier function.
Cluster D includes SitS and 15 other MS/MS-identified lipids, including C4,6D3O, which were all found to be elevated in the serum of patients with CD vs. control individuals, and associated with disease behavior and location. Additional lipids in this cluster include PE(18:1_20:4)-1, PE(18:1_20:4)-2, PE(16:0_18:1), PE(18:0_22:5), PC(18:1_22:5), TG(16:0_18:1_20:4), TG(18:1_18:2_22:5), TG(16:0_18:1_22:5), TG(18:1_18:2_20:4), TG(56:4), TG(18:1_18:2_22:4), TG(18:2_16_0_20:4), TG(56:4), and DG(16:0_18:2). The SitS precursor, sitosterol, is a dietary phytosterol with known anti-proliferative and anti-inflammatory properties. By analogy with bile acids, dysbiosis may favor elevated levels of its putative inactive 3-OH sulfated form (i.e. SitS). Given non-cholesterol sterols are absorbed in the small intestine but their transfer into the lymph fluid is far less efficient, the elevated levels of SitS, which are more pronounced in B2/B3 and L1 presentation, may therefore reflect dysbiosis and/or intestinal epithelial barrier dysfunction. As for C4,6D3O, little is known about this specific oxysterol, except for its reported accumulation in patients with cerebrotendinous xanthomatosis, of which more than 50% suffer from chronic diarrhea. This disease is attributed to mutations in the mitochondrial enzyme sterol 27-hydroxylase, a cytochrome P450 oxidase encoded in humans by the CYP27A1 gene, involved in cholesterol metabolism. Several oxysterols play important roles in CD pathophysiology, acting through immune cells receptors such as GPR183 and liver X receptors (LXR), which are IBD risk genes identified by GWAS. While the specific role of C4,6D3O in CD remains to be clarified, its higher serum levels concur with the lower CYP27A1 mRNA levels recently documented in colon biopsies of CD patients and also add to our knowledge about the dysregulated metabolism of cholesterol and bile acids in CD.
Cluster E includes the cholesterol ester CE(14:1) and two other correlated lipids. An additional lipid in this cluster includes CE(14:0). CE(14:1) was found to be lower in the serum of patients with CD when compared to controls and was also found to be associated with the ileal disease location. As with SitS, CE(14:1) belongs to the sterol lipid class but is likely to represent distinct biological information given that they are in different clusters. CE(14:1) may result from the acylation of cholesterol by acyl-CoA cholesterol acyltransferase activity in the intestine or liver, as well as lecithin-cholesterol acyltransferase activity in high density lipoproteins HDL. While it could be expected that CE(14:1) would follow a similar pattern to SitS as it is also a dietary lipid and could simply correlate with epithelial barrier dysfunction, its lower level in the serum of CD patients suggests a different mechanism. One possible explanation is that it is linked to a cellular deficiency in plasmalogens, which is known to impact several steps of cholesterol homeostasis. A recent study showing that gut microbial species can influence host cholesterol species provides, however, another interesting potential explanation. Specifically, metagenomic analyses discovered a microbial cholesterol dehydrogenase named ismA, and that ismA+ species decreased fecal and serum cholesterol in humans. Lower CE(14:1) in the serum of CD patients may reflect changes in the gut microbiome that increase the proportion of ismA+ species.
Taken together, these five annotated classifiers, as well as many of the correlated lipids within their clusters, appear to capture multiple different biologic mechanisms, which is reasonable to assume, are associated with the etiology and/or pathology of CD.
To explore the relationship between circulating lipids and CD subtypes, the entire IBDGC-2 dataset, which was larger and more representative in terms of disease behavior and location than IBDGC-1, was further analyzed. As a first step, it was tested whether the inclusion of B1 impacted on the previous results that focused on the B2 and B3 phenotypes. It was found that, compared to the previous results obtained with B2/B3, four new features showed evidence of association with CD (all patients) vs controls (p-value<1E-4 (q-value<8E-4) and | log2(FC) | >0.3), likely gained by the increase in power due to increased sample size (Table 5). Table 5 lists the 37 lipid features associated with CD (all patients including B1, B2 and B3) vs control with p-value<1E-4 and | log2(FC) | >0.3 in IBDGC-2. Feature and lipid IDs, association with CD vs control (p- and q-values) as well as effect size of change (log2(FC)) and standard error (SE) are shown. Bold feature IDs correspond to features with p-value>1E-4 and/or | log(FC) | <0.3 for the analysis focused on B2/B3 in IBDGC-2.
In contrast, half of the B2/B3-associated features failed to reach the significance threshold in “all CD vs controls” analysis, indeed suggesting an impact of disease subtype on the association to some lipid features. To assess this directly, we tested each feature for association to a specific disease location or behavior and found 182 associations with p-value<1E-4 for at least one subtype (behavior (138), location (94); Tables 6A-6J). Next, to better interpret the relationship between these features and disease subtypes, their effect sizes were plotted with respect to disease status on separate axes (
Given the relatively strong effect sizes for many of the features, we decided to assess their ability in building a disease classifier. Specifically, we built a model using the B2/B3-associated features with p-value<0.05 and | log2(FC) | >0.3 obtained in IBDGC-2, trained and tested with all of the B2/B3 and control samples in the IBDGC-2 dataset. The purpose of this model was to use a minimal number of features while reaching maximal classification performance. The ROC curve for this model showed a very high performance with an AUC of 0.97 (
Since performance is always overestimated by using the same samples for training and validation, we addressed this by using two different approaches. We first performed cross-validations using distinct subparts of the IBDGC-2 dataset for training and validation. In doing so, the ROC curves yielded an average AUC>0.84, implying low over-estimation due to over-fitting in our setting (
With the assumption that quantitative changes in serum lipids from patients with CD are at least in part a result of pathogenic mechanisms, the relationships between circulating lipids and response to therapy was examined. To do so, we analyzed an independent cohort of 92 IBD patients that were treated with Vedolizumab, with serum samples taken just prior to initiation of therapy (baseline) and at first assessment visit (week 14), using the same MS instrument than the IBDGC-1 samples. The demographic and clinical information for these subjects is presented in Table 8. Table 8 presents the demographic and clinical phenotypes of the IBD subjects retained into the final dataset. CD and UC were independently analyzed in the VEDO-CD and VEDO-UC subsets of the dataset, respectively.
These MS data were processed as described above, with 1111 lipid features retained in the final dataset. Statistical analyses were focused on serum samples from CD (VEDO-CD subset) and UC patients (VEDO-UC subset) with active disease at baseline, for whom therapeutic response data at week 14 was available (i.e. clinical response and/or remission) and with serum samples collected at both timepoints (i.e. weeks 0 and 14). The results suggest that Vedolizumab treatment alters the level of circulating lipid features from baseline to week 14 in CD and UC patients. Specifically, 101 and 27 features had variation associated with Vedolizumab response vs non-response with p-value<0.05 in CD and UC (
Table 9 lists 101 lipid features with variation occurring from baseline to week 14 associated with Vedolizumab response status at p-value<0.05 in CD patients of the VEDO dataset (VEDO-CD). Feature and lipid IDs, association (p- and q-values) as well as effect size of change (log2(FC)) and standard error (SE) are shown. Bold features have variation also associated with Vedolizumab response vs non-response in UC patients. Bold lipids were associated with B2/B3 vs control p-value<1E-4 and | log2(FC) | >0.3 in IBDGC- 2.
Table 10 lists 27 lipid features with changes (w0-w14) associated with Vedolizumab response vs non-response with p-value<0.05 in UC patients of the VEDO dataset (VEDO-UC). Feature and lipid IDs, association (p- and q-values) as well as effect size (log2(FC)) and standard error (SE) are shown. Bold features have changes also associated with Vedolizumab response vs non-response in CD patients.
When comparing these 35 lipids associated with Vedolizumab response (p-value<0.05) in CD patients with the list of metabolites significantly associated (p-value<10E-4, FC>0.3) with B2/B3 in IBDGC-2, four were in common (
Specifically, C4,6D3O (Cluster D), which had a greater than two-fold higher concentration in patients with CD than in controls, decreased significantly between weeks 0 and 14 in patients who responded to treatment and increased slightly in non-responders (
In summary, serum samples taken just before initiation of treatment with vedolizumab as well as samples taken at first assessment visit ~16 weeks later were analyzed. Given vedolizumab selectively inhibits leukocyte extravasation into the gut, it is expected to induce mucosal healing while intestinal tissue injury occurs in areas where activated lymphocytes produce inflammatory mediators. In responders, levels of three metabolites significantly associated with CD, namely SM(d18:1/24:0), PC(19:0_18:2) and C4,6D3O tended to normalize, and were independent of the known clinical markers ESR and CRP. As mentioned earlier, these three lipid metabolites are part of Clusters C and D, suggesting that these variations reflect a return to a more normal epithelial barrier function.
Various modifications and variations of the described invention will be apparent to those skilled in the art without departing from the scope and spirit of the invention. Although the invention has been described in connection with specific embodiments, it should be understood that the invention as claimed should not be unduly limited to such specific embodiments. Indeed, various modifications of the described modes for carrying out the invention that are obvious to those skilled in the art are intended to be within the scope of the invention.
All publications, patents, and patent applications are herein incorporated by reference in their entirety to the same extent as if each individual publication, patent, or patent application was specifically and individually indicated to be incorporated by reference in its entirety.
Other embodiments are in the claims.
Number | Date | Country | |
---|---|---|---|
63335844 | Apr 2022 | US |