N.A
The present invention relates to idiopathic scoliosis. More specifically, the present invention is concerned with molecular markers associated with IS and their use in the diagnosis, genotyping, classification and treatment of IS.
This application contains a Sequence Listing in computer readable form (as an ASCII compliant text file) entitled “Seq_List_14033_161_ST25”, created on Aug. 21, 2017 having a size of 196 kilobytes. The content of the aforementioned file named Seq_List_14033_161_ST25 is hereby incorporated by reference in its entirety.
Primary cilia are antenna-like organelles that transmit chemical and mechanical signals from the pericellular environment.10,11 They are found in the cells of all human tissues (except blood), including bone, cartilage, tendons, and skeletal muscle (a comprehensive list of tissue types and cell lines with primary cilia can be found at: http://www.bowserlab.org/primarycilia/ciliumpage2.htm). In addition to functions linked to olfaction, photo and chemical sensation, recent studies have established a mechanosensory role for primary cilia in tissues, such as the kidney, liver, embryonic node, and bone structure (the mechanosensory role of cilia in bone is reviewed by Nguyen, et al., 2013).12 As the most recent established role for cilia, mechanisms for mechanosensation are not yet entirely understood. For example, the involvement of calcium channels, in response to cilia bending following a fluid movement, is yet a matter of debate and might vary depending on the tissue examined.13,14
As a mechanosensor in bone, the primary cilium can transduce fluid flow induced shear stress occurring within the canaliculi that interconnect osteocytes as well as strain-related mechanical stimuli in pre-osteoblasts.15 The load-induced fluid flow in bone canaliculi is recognized to play a role in maintaining bone homeostasis through bone resorption and formation cycles (i.e. bone tissue remodeling).16 Cilia mediate the transduction of this fluid flow to mesenchymal stem cells (MSCs), and is implicated in osteogenic gene expression and lineage commitment.17 Mechanical loading modulates the incidence and length of primary cilia in cells, such as chondrocytes, in which cilia direction affects the direction of growth in growth plates.18 Mechanical loading has also been shown to induce bone cell proliferation through a cilia-dependent mechanism.15 Interestingly, skeletal disorders are a common feature in several human ciliopathies, such as Jeune syndrome and short rib-polydactyly.19.
Idiopathic scoliosis (IS) is a complex pediatric syndrome that manifests primarily as an abnormal three-dimensional curvature of the spine. Eighty percent of all spinal curvatures are idiopathic, (MIM 181800) making IS the most prevalent form of spinal deformity. With a global incidence of 0.15% to 10% (depending on curve severity),1 IS contributes significantly to the burden of musculoskeletal diseases on healthcare (http://www.boneandjointburden.org). Children with IS are born with a normal spine, and the abnormal curvature may begin at different points during growth, though adolescent onset is the most prevalent.2 Idiopathic scoliosis is diagnosed by ruling out congenital defects and other causes of abnormal curvature, such as muscular dystrophies, tumors, or other syndromes.
The etiology of idiopathic scoliosis is unknown largely because of phenotypic and genetic heterogeneity. Curve magnitude is highly variable and the risk for severe curvature is not understood beyond the observed female bias. Although a genetic basis is accepted, genetic heterogeneity has been implicated in several familial studies,3,4 and numerous genome-wide association studies (GWAS) have detected different loci with small effects.5 Despite such genetic correlations, no clear biological mechanism for IS has emerged. It is likely that IS phenotypic heterogeneity is a consequence of genetic variations combined with biomechanical factors that are influenced by individual behavioral patterns. As a musculoskeletal syndrome, biomechanics are thought to have an important role in the IS deformity. Pathological stressors applied to a normal spine or normal forces on an already deformed spine have been studied for a role in curve predisposition and progression.6 For example, factors that contribute to spinal flexibility, sagittal balance, shear loading on the spine, and compressive or tension forces may contribute to the ‘column buckling’ phenotype associated with IS.7-9 Furthermore, therapeutic options available for IS, bracing and corrective surgery, approach the disease from a mechanical perspective, and successful outcomes depend on understanding the complex biomechanics of the spine.
Thus, there remains a need for the identification of new molecular markers associated with IS. There also remains a need for new ways to characterize, classify, diagnose and treat subjects suffering from IS or at risk of suffering from IS.
The present description refers to a number of documents, the content of which is herein incorporated by reference in their entirety.
The present invention discloses evidence supporting an association between IS and mechanotransduction through the non-motile microtubule-based signaling organelle known as cilium. Applicant has found that numerous ciliary genes present a spinal curvature phenotype when knocked down in animal models, and that scoliosis is associated with many human ciliopathy syndromes.20,21 Additionally, the majority of confirmed IS associated genes are connected to cilia structure or function. Confocal images of primary osteoblast cultures, derived from bone fragments obtained intraoperatively at the time of the spine surgery, revealed that IS subjects have longer primary cilia and an increased density of cells with elongated cilia. Also, further studies have demonstrated the presence of an altered cellular response to mechanostimulation in cells from IS subjects. Furthermore, SKAT-O analysis of whole exomes has allowed to identify rare gene variants with a role in mechanotransduction in IS subjects.
Accordingly, there are provided novel molecular markers and alternative methods of identifying subjects at risk of developing IS or suffering from IS and of genotyping and classifying IS subjects into genetic and functional groups. Methods of the present invention can be used alone or with one or more previous methods to increase the specificity and/or sensitivity of risk prediction to improve subject's classification and to facilitate and improve the application of preventive treatment measures. Once a subject has been classified into one or more IS group, treatment and preventive measures can be adapted to his/her specific endophenotype/genotype.
More specifically, in an aspect, the present invention provides a method of determining the risk of or predisposition to developing a scoliosis comprising determining a cellular response to mechanostimulation in a cell sample from a subject, wherein an altered cellular response in said sample as compared to that in a control sample is indicative of an increased risk of developing a scoliosis.
In a further aspect, the present invention provides a method of determining the risk of or predisposition to developing a scoliosis comprising (i) determining the average length of cilia on the surface of cells in a cell sample from the subject; (ii) determining the number of cells with elongated cilia in a cell sample from the subject; (iii) determining the number of ciliated cells in a cell sample from the subject; or (iv) any combination of one of (i), (ii) and (iii), wherein an increase in the average length of cilia, an increase in the number of cells having elongated cilia or a decrease in the number of ciliated cells in the cell sample from the subject as compared to that in a control sample is indicative of an increased risk of or predisposition to developing a scoliosis.
In a further aspect, the present invention provides a method of determining the risk of or predisposition to developing a scoliosis comprising determining a cellular response to mechanostimulation of cells in a cell sample from a subject, wherein the determining comprises: (i) applying mechanostimulation to cells in a cell sample from the subject; and (ii) measuring the expression level of at least one mechanoresponsive gene, wherein the at least one mechanoresponsive gene is ITGB1; ITGB3, CTNNB1; POC5, BMP2, COX-2, RUNX2, CTNNB1 or any combination thereof; (iii) comparing the expression level measured in (b)(ii) to that of a control sample, wherein an altered expression level in said mechanoresponsive gene as compared to that of the control sample is indicative of an increased risk of or predisposition to developing a scoliosis. In embodiments, the above method is performed on cells having elongated cilia.
In embodiments, (i) determining the average length of cilia on the surface of cells in a cell sample from the subject; (ii) determining the number of cells with elongated cilia in a cell sample from the subject; (iii) determining the number of ciliated cells in a biological sample from the subject; (iv) determining a cellular response to mechanostimulation of cells in a cell sample from a subject; or (v) any combination of (i), (ii), (iii) and (iv), is assessed over time.
In a further aspect, the present invention provides a method of determining the risk of developing a scoliosis in a cell sample from a subject, the method comprising detecting the presence or absence of a polymorphic marker in at least one allele of at least one gene listed in Table 4 or substitute marker in linkage disequilibrium with the polymorphic marker. In embodiments, the polymorphic marker is a polynucleotide variant set forth in Table 6.
In another aspect, the present invention provides a method of genotyping a subject suffering from Idiopathic scoliosis or at risk of developing a scoliosis comprising determining in a cell sample from the subject the presence or absence of a polymorphic marker in at least one allele of at least one gene listed in Table 4 or a substitute marker in linkage disequilibrium with the polymorphic marker. In embodiments, the polymorphic marker is a polynucleotide variant set forth in Table 6.
In a further aspect, the present invention provides a method of classifying a subject (e.g., suffering from a scoliosis or at risk of developing a scoliosis) comprising performing one or more of the above-described methods and classifying the subject into an IS group.
In embodiments, the above-described methods comprise determining the presence or absence of at least two polymorphic markers. In embodiments, the methods comprise determining the presence or absence of at least two polymorphic markers in at least two genes. In embodiments, the above-described methods comprise determining the presence or absence of at least 3, 4, 5, 6, 7, 8, 9, 10, 20, 30, 40, 50, 60, 70, 80, 90, 100, 110, 120 or more polymorphic markers. In embodiments, the methods comprise determining the presence or absence of at least 3, 4, 5, 6, 7, 8, 9, 10, 20, 30, 40, 50, 60, 70, 80, 90, 100, 110, 120 or more polymorphic markers in at least 3, 4, 5, 6, 7, 8, 9, 10, 20, 30, 40, 50, 60, 70, 80, 90, 100, 110, 120 or more genes.
The above-described methods may be used alone or in combination with each other. Methods of the present invention may also be used in combination with known methods useful for determining the risk of or predisposition to developing a scoliosis; for genotyping a subject; and/or for classifying a subject into an IS group.
In a further aspect, the present invention provides a composition or kit comprising one or more reagent for detecting (a) the length of cilia at the surface of cells; (b) the number of cells with elongated cilia; (c) the number of ciliated cells; (d) the level of expression of at least one mechanoresponsive gene; and/or (e) the presence or absence of a polymorphic marker in at least one gene listed in Table 4 or a substitute marker in linkage disequilibrium therewith in a cell sample from a subject. In embodiments, the composition or kit further comprises the cell sample from the subject. In embodiments, the cell sample comprises cells which have been submitted to a mechanostimulation. In embodiments, the composition or kit comprises at least one oligonucleotide probe or primer for the specific detection of a polymorphic marker in a gene listed in Table 4. In embodiments, the polymorphic marker is a polynucleotide variant set forth in Table 6.
In a further aspect, the present invention provides a DNA chip comprising at least one oligonucleotide for detecting the presence or absence of a polymorphic marker in at least one gene listed in Table 4 and a substrate on which the oligonucleotide is immobilized. In embodiments, the polymorphic marker is a polynucleotide variant set forth in Table 6.
In a further aspect, the present invention provides oligonucleotide primers or probes for use in the above-described methods. In embodiments, the oligonucleotide is for the specific detection of a polymorphic marker of the present invention and comprises or consists of a nucleotide sequence having a variant at a position corresponding to that defined in Table 6. In embodiments, the variant is a risk variant defined in Table 6. In embodiments, the oligonucleotide primer or probe hybridizes to a reference or a variant polynucleotide sequence set forth in Table 6 or to its complementary sequence. In embodiments, the oligonucleotide primer or probe further comprises a label. In embodiments, the oligonucleotide primer or probe comprises or consists of a polynucleotide sequence set forth in Table 6 or the complement thereof. In embodiments, the oligonucleotide primer or probe consists of 10 to 100 nucleotides, preferably 10 to 60 nucleotides. In embodiments, the oligonucleotide primer or probe consists of at least 12 nucleotides.
In a further aspect, the present invention relates to the use of methods, compositions, oligonucleotide primers or probes, kits or DNA chips of the present invention for (i) determining the risk of or predisposition to developing a scoliosis; (ii) genotyping a subject; and (iii) classifying a subject into an IS group.
In embodiments, the above-mentioned mechanostimulation is fluid sheer stress. In embodiments, the level of sheer stress corresponds to a Womersley number of between about 5 and 18. In embodiments, the level of sheer stress corresponds to a Womersley number of about 8. In embodiments, mechanostimulation corresponds to an average sheer stress of about 1 Pa. In embodiments, mechanostimulation is applied at a frequency of between about 1 and about 3 Hz.
In embodiments, the at least one gene comprising a polymorphic marker (gene variant) comprises FEZF1, CDH13, FBXL2, TRIM13, CD1B, VAX1, CLASP1, SUGT1, MIPEP, FAM188A, TAF6, WHSC1, GPR158, HNRNPD, RUNX1T1, GRIK3, FUZ, LYN, DDX5, PODXL, ATP5B, PIGK, AL159977.1, BTN1A1, CDK11A, HIVEP1, HSD17B14, KCNMA1, PXDN, RAB31, RBM5, RNF149, SOD2, TOPBP1, ZCCHC14, ZNF323, or any combination thereof. In embodiments, the at least one gene comprises FEZF1, CDH13, FBXL2, TRIM13, CD1B, VAX1, CLASP1, SUGT1, MIPEP, FAM188A, TAF6, WHSC1, GPR158, HNRNPD, RUNX1T1, GRIK3, FUZ, LYN, DDX5, PODXL, ATP5B, PIGK, AL159977.1, or any combination thereof. In embodiments, the at least one gene comprises ATP5B, BTN1A1, CD1B, CDK11A, CLASP1, DDX5, FBXL2, HIVEP1, HSD17B14, KCNMA1, PXDN, RAB31, RBM5, RNF149, SOD2, SUGT1, TOPBP1, ZCCHC14, ZNF323 or any combination thereof. In embodiments the at least one gene comprises ATP5B, BTN1A1, CD1B, CDK11A, CLASP1, DDX5, FBXL2, HIVEP1, HSD17B14, KCNMA1, PXDN, RAB31, RBM5, RNF149, SOD2, SUGT1, TOPBP1, ZCCHC14 or ZNF323 or any combination thereof. In embodiments, the at least one gene comprises CDB1, CLASP1 and SUGT1.
In embodiments, the above-mentioned polymorphic marker is a polymorphic marker defined in Table 6. In embodiments, the polymorphic marker is a risk variant defined in Table 6.
In embodiments, the above-mention subject is a female. In embodiments, the subject is prediagnosed with a scoliosis (e.g., iodiopathic scoliosis). In embodiments, the subject has a family member which has been diagnosed with a scoliosis. In embodiments, the subject is a likely candidate for developing a scoliosis or for developing a more severe scoliosis.
In embodiments, the cell sample comprises bone cells. In embodiments, the cell sample comprises mesenchymal stem cells, myoblasts, preosteoblasts, osteoblasts, osteocytes and/or chondrocytes. In embodiments, the cell sample is a blood sample. In embodiments, the cell sample is a blood sample comprising PBMCs. In embodiments, the cell sample is a nucleic acid sample. In embodiments, the cell sample is a protein sample.
Other objects, advantages and features of the present invention will become more apparent upon reading of the following non-restrictive description of specific embodiments thereof, given by way of example only with reference to the accompanying drawings.
In the appended drawings:
The primary cilium is an outward projecting antenna-like organelle with an important role in bone mechanotransduction. The capacity to sense mechanical stimuli can affect important cellular and molecular aspects of bone tissue. Idiopathic scoliosis (IS) is a complex pediatric disease of unknown cause, defined by abnormal spinal curvatures. As shown herein, a significant elongation of primary cilia in IS patient bone cells was established. Furthermore, IS subjects have an increase number of cells with elongated cilia. In response to mechanical stimulation, these cells differentially express osteogenic factors, mechanosensitive genes, and beta-catenin. Considering that numerous ciliary genes are associated with a scoliosis phenotype, among ciliopathies and knockout animal models, IS patients were expected to have an accumulation of rare variants (risk variants) in ciliary genes. Instead, the SKAT-O analysis of whole exomes presented herein showed enrichment among IS patients for rare variants in genes with a role in cellular mechanotransduction. Applicant's data indicates defective cilia in IS bone cells, which is likely linked to heterogeneous gene variants pertaining to cellular mechanotransduction.
The present invention is thus based on the identification of functional defects in cells from IS subjects and on the identification of novel molecular markers, and in particular novel SNPs in various genes involved in mechanotransduction. The present invention thus provides novel methods of determining the risk of developing IS (or of detecting a predisposition to or the presence of), of genotyping subjects (e.g., IS subjects or subjects at risk of developing a scoliosis) and of classifying subjects (e.g., IS subject or subjects at risk of developing a scoliosis). The present invention further provides methods for identifying novel therapeutic targets and means for improving the application of treatment and preventive measures.
For clarity, definitions of the following terms in the context of the present invention are provided.
The articles “a,” “an” and “the” are used herein to refer to one or to more than one (i.e., to at least one) of the grammatical object of the article.
The term “including” and “comprising” are used herein to mean, and are used interchangeably with, the phrases “including but not limited to” and “comprising but not limited to”.
The terms “such as” are used herein to mean, and is used interchangeably with, the phrase “such as but not limited to”.
Polymorphism. The genomic sequence within populations is not identical when individuals are compared. Rather, the genome exhibits sequence variability between individuals at many locations in the genome. Such variations in sequence are commonly referred to as polymorphisms, and there are many such sites within each genome. For example, the human genome exhibits sequence variations which occur on average every 500 base pairs. Thus, as used herein, a “polymorphism” refers to a variation in the sequence of nucleic acid (e.g., a gene sequence). Such variation include insertion, deletion, and substitutions in one or more nucleotides.
The most common sequence variation (or polymorphism) consist of base variations at a single base position in the genome, and such sequence variants, or polymorphisms, are commonly called Single Nucleotide Polymorphisms (“SNPs”). There are usually two possibilities (or two alleles) at each SNP site; the original allele and the mutated allele (although there may 3 or 4 possibilities for each SNP site). Due to natural genetic drift and possibly also selective pressure, the original mutation has resulted in a polymorphism characterized by a particular frequency of its alleles in any given population. There may also exists SNPs that vary between paired chromosomes in an individual. Each individual is in this instance either homozygous for one allele of the polymorphism (i.e. both chromosomal copies of the individual have the same nucleotide at the SNP location), or the individual is heterozygous (i.e. the two sister chromosomes of the individual contain different nucleotides). As used herein an SNP thus refers to a variation at a single nucleotide in a given nucleic acid sequence.
As noted above, many other types of sequence variants are found in the human genome, including microsatellites, insertions, deletions, inversions and copy number variations. A polymorphic microsatellite has multiple small repeats of bases (such as CA repeats, TG on the complimentary strand) at a particular site in which the number of repeat lengths varies in the general population.
In general terms, each version of the sequence with respect to the polymorphic site represents a specific allele of the polymorphic site. These sequence variants can all be referred to as polymorphisms, occurring at specific polymorphic sites characteristic of the sequence variant in question. In general terms, polymorphisms can comprise any number of specific alleles.
In some instances, reference is made to different alleles at a variant/polymorphic site without choosing a reference allele. Alternatively, a reference sequence can be referred to for a particular polymorphic site. The reference allele is sometimes referred to as the “wild-type” allele and refers herein to the allele from a “non-affected” or control/reference individual (e.g., an individual that does not display a trait or disease phenotype i.e., which does not suffer from a scoliosis or which has a lower risk of (or predisposition to) developing a scoliosis).
A “polymorphic marker”, also referred to as a “genetic marker” or “gene variant”, as described herein, refers to a variation (mutation or alteration) in a gene sequence that occurs in a given population. Each polymorphic marker/gene variant has at least two sequence variations (e.g., 2, 3, 4, 5, 6, 7, 8, or more sequence variations) characteristic of particular alleles at the polymorphic site. The marker/gene variant can comprise any allele of any variant type found in the genome, including variations in a single nucleotide (SNPs, microsatellites, insertions, deletions, duplications and translocations. The polymorphic marker/gene variant, if found in a transcribed region of the genome can be detected not only in genomic DNA but also in RNA. Polymorphic markers or gene variants of the present invention and identified in Table 6 are found in transcribed regions of the genome (were identified following exome sequencing). In addition, when the polymorphism/variant is found in the gene portion that is translated into a polypeptide or protein, the polymorphic marker/gene variant can be detected at the protein/polypeptide level.
The polymorphic marker/gene variant of the present invention and its specific sequence variation can be detected by various means such as by sequencing the nucleic acid or protein. Alternatively, when the polymorphism/variation affects the function of the gene or of its translated protein/polypeptide, the biological activity can be evaluated in order to identify which allele is present in the subject's sample. For example, if a particular risk allele (comprising a risk variant or combination of risk variants) affects the enzymatic activity of the protein, then, the presence of the allele or variant(s) can be assessed by performing an enzymatic test. Alternatively, if the risk allele (comprising a gene variant or combination of variants) affects the expression level of a polypeptide or nucleic acid, then, the presence of the variants(s) can be determined by assessing the expression level (e.g., Immunoassays, amplification assays, etc.) of such protein or nucleic acid and comparing it to a reference level in a control sample (e.g., sample from a subject not suffering from a scoliosis or at risk of developing a scoliosis).
An “allele” refers to the nucleotide sequence of a given locus (position) on a chromosome. A polymorphic marker allele thus refers to the composition (i.e., sequence) of the marker on a chromosome. Genomic DNA from an individual contains two alleles for any given polymorphic marker, representative of each copy of the marker on each chromosome. A “risk allele”, a “susceptibility allele” or a “predisposition allele” or a “risk variant” is nucleic acid sequence variation that is associated with an increased risk of (i.e. compared to a control/reference) or predisposition to suffering from a scoliosis. Conversely, a “protective allele” or “protective variant” is a sequence variation of a polymorphic marker that is associated with a lower risk of (i.e., compared to a control/reference) or predisposition to suffering from a scoliosis.
A “nucleic acid or gene associated with idiopathic scoliosis” as used herein is a nucleic acid (e.g., genomic DNA or RNA) that comprises a polymorphic marker/gene variant of the present invention, or any substitute marker in linkage disequilibrium therewith, which affects the risk of developing a scoliosis (e.g., a risk variant defined in Table 6). This nucleic acid may be of any length as long as it comprises the polymorphic region that is relevant to the determination of the presence or absence of susceptibility to scoliosis (e.g., the polymorphic markers or genes listed in Tables 4-6). For example, it can consist of a whole gene sequence (including the promoter or any other regulatory sequence) or of a fragment thereof. Similarly, a “polypeptide associated with idiopathic scoliosis” or a “protein associated with idiopathic scoliosis” refers to a protein or polypeptide which is encoded by a nucleic acid comprising a polymorphic marker of the present invention, or any marker in linkage disequilibrium therewith, which is associated with idiopathic scoliosis (e.g., comprising a risk variant defined in Table 6).
The term “sample” as used herein is any type of biological sample which may be used under the methods of the present invention. The term “cell sample” refers to a sample which originally comprised cells from the subject. For example, in certain embodiments, the “cell sample” is a sample in which it is possible to determine the average lengths of cilia on the surface of cells. Such cell sample thus comprises cells which normally have cilia. In embodiments, the cell sample allows for the detecting the presence or absence of a polymorphic marker (or gene variant) of the present invention (at the nucleic acid level or at the protein level) including but not limited to blood (including plasma and serum), urine, saliva, amniotic fluid, tissue biopsy etc. The sample may be a crude sample or a purified sample, it may be processed to a nucleic acid sample or a protein sample. In embodiments, the cell sample comprises bone cells. In embodiments, the cell sample comprises mesenchymal stem cells (MSC), chondrocytes, preosteoblasts, osteoblasts and/or osteocytes. In other embodiments, the cell sample is a blood sample (plasma or serum).
As used herein the terms “at risk of developing a scoliosis”, “predisposition to developing a scoliosis”, “at risk of developing IS”, and “predisposition to developing IS” refer to a genetic or metabolic predisposition of a subject to develop a scoliosis (i.e. spinal deformity) and/or a more severe scoliosis at a future time (i.e., curve progression of the spine). For instance, an increase of the Cobb's angle of a subject (e.g., from 40° to 50° or from 18° to 25°) is a “development” of a scoliosis. The terminology “a subject at risk of developing a scoliosis” includes asymptomatic subjects which are more likely than the general population to suffer in a future time of a scoliosis (i.e., a likely candidate for developing or suffering from a scoliosis) such as subjects (e.g., children) having at least one parent, sibling or family member suffering from a scoliosis. Among others, age (adolescence), gender and other family antecedent are factors that are known to contribute to the risk of developing a scoliosis and are used to evaluate the risk of developing a scoliosis. Also included in the terminology “a subject at risk of developing a scoliosis” are subjects already diagnosed with IS but which are at risk to develop a more severe scoliosis (i.e. curve progression).
As used herein the term “subject” is meant to refer to any mammal including human, mouse, rat, dog, chicken, cat, pig, monkey, horse, etc. In particular embodiments, it refers to a human (e.g., a child, adolescent (teenager) or adult which may benefit from any of the methods, compositions and kits of the present invention). In embodiments, the subject is a female. In embodiments, the subject has at least one family member which has been diagnosed with IS. In embodiments, the family member is a sibling.
As used herein the terminology “control sample” is meant to refer to a sample from which it is possible to make a suitable comparison under the methods of the present invention (e.g., to determine the risk of developing a scoliosis, to genotype subjects, classify/stratify subjects into a specific genetic or functional group, etc.). In embodiments, a “control sample” is a sample that does not originate from a subject known to have scoliosis or known to be a likely candidate for developing a scoliosis (e.g., idiopathic scoliosis (e.g., Infantile Idiopathic Scoliosis, Juvenile Idiopathic Scoliosis or Adolescent Idiopathic Scoliosis (AIS))). In the context of the present invention, “a control sample” also includes a “control value” or “reference signal” derived from one or more control samples from one or more subjects. In methods for classifying subjects or for determining the risk of developing a scoliosis in a subject that is pre-diagnosed with scoliosis, the sample may also come from the subject under scrutiny at an earlier stage of the disease or disorder. Preferably, the control sample is a cell of the same type (e.g., both the test sample and the reference sample(s) are e.g., lymphocytes, osteoblasts, myoblasts or chondrocytes) as that from the subject. Of course multiple control samples derived from different subjects can be used in the methods of the present invention. A control sample (or reference signal or control value) may correspond to a single control subject ((i.e., a normal healthy subject or a subject already classified in a given functional or genetic group) or may be derived from a group of control subjects (i.e., equivalent to the reference signal in control subjects).
In one aspect, the present invention provides a method of determining the risk of or predisposition to developing a scoliosis comprising determining the biomechanical profile of cells in a cell sample from a subject, wherein an altered biomechanical profile in the cell sample as compared to that in a control sample is indicative of an increased risk of or predisposition to developing a scoliosis.
In embodiments, determining the biochemical profile comprises (i) determining (e.g., measuring, detecting) the average length of cilia on the surface of cells in a cell sample from the subject; (ii) determining the number of cells with elongated cilia in a cell sample from the subject (iii) determining the number of ciliated cells in a cell sample from the subject; or (iv) any combination of (i), (ii) and (iii). An increase in the average length of cilia, an increase in the number of cells with elongated cilia or a decrease in the number of ciliated cells in the cell sample from the subject as compared to that in a control sample is indicative of an increased risk of or predisposition to developing a scoliosis.
In embodiments, determining the biochemical profile comprises measuring a response to mechanostimulation of cells in a cell sample from a subject, comprising: (i) applying mechanostimulation to cells from the cell sample from the subject; (ii) measuring the expression level of at least one mechanoresponsive gene. An altered expression level in said mechanoresponsive gene as compared to that of the control sample is indicative of an increased risk of or predisposition to developing a scoliosis.
Cellular mechanostimulation is performed by methods well known in the art (reviewed for example in Thomas D. brown: Techniques for cell and tissue culture mechanostimulation: historical and contemporary design considerations”, Iowa Orthop. J. 1995; 15: 112-117; Cha, B., Geng, X., Mahamud, M. R., Fu, J., Mukherjee, A., Kim, Y & Dixon, J. B. (2016). Mechanotransduction activates canonical Wnt/β-catenin signaling to promote lymphatic vascular patterning and the development of lymphatic and lymphovenous valves. Genes & Development, 30(12), 1454-1469; Zhou X, Liu D, You L, Wang L. Quantifying Fluid Shear Stress in a Rocking Culture Dish. Journal of biomechanics. 2010; 43(8):1598-1602). Such stimulation may be performed in various ways and may include the use of known mechanical devices designed to deliver proper loading, distention, or other mechanical stimuli. In particular embodiments, the mechanostimulation involves the application of fluid sheer stress to the cells. Fluid sheer stress may be defined in terms of the well-known Womersley number (see Example 1 for more details on the calculation of the Womersely number).
Preferably, the mechanical stimulation that is applied in accordance with the present invention is similar to that normally encountered by cells under physiological conditions (i.e., “a physiological mechanostimulation”). For example, in the case of the application of fluid sheer stress, the value of the Womersley number ranges from 5 to 18 in fluid motion of cerebrospinal fluid in the spinal cavity.
In certain embodiments of the methods of the present invention, the mechanostimulation is fluid sheer stress and the level of sheer stress applied is within the range of fluid sheer stress that may be encountered by cells, preferably human cells, under normal conditions. In embodiments, the level of fluid sheer stress applied corresponds to a Womersley number between about 5 and about 18. In particular embodiments the level of fluid sheer stress applied corresponds to a Womersley number of about 8. In embodiments, the frequency of mechanostimulation is between about 1 and 3 hz, preferably, 1 hz.
In another specific embodiment, said mechanostimulation is a pulsative compressive pressure. In another specific embodiment, said pulsative compressive pressure is applied using an inflatable strap. In another specific embodiment, said pulsative compressive pressure is applied using an inflatable cuff. In another specific embodiment, said mechanical stimulus or force is applied for a period of at least about 15 minutes. In another specific embodiment, said mechanical stimulus or force is applied for a period of between about 30 to about 90 minutes. In another specific embodiment, said mechanical stimulus or force is applied for a period of about 90 minutes. In another specific embodiment, the subject is a likely candidate for developing adolescent idiopathic scoliosis. In embodiments, the biological sample is taken from the subject after the end of mechanostimulation at a time which is sufficient for detecting variations in the expression (at the mRNA or protein level) of mechanoresponsive genes (e.g., 15, 20, 30, 45, 60, 90, 120 minutes from end of mechanostimulation). The time necessary to detect variations in gene expression may vary depending on the gene(s) of interest and on whether the variation in expression level is detected at the protein or nucleic acid (mRNA) level. For example, for some genes, a delay of 15 min. from the start of mechanostimulation may be sufficient to detect variations in gene expression. Thus in some embodiments the biological sample is taken from the subject 15, 20, 30, 45, 60, 90, 120 minutes from start of mechanostimulation). As used herein, a “mechanoresponsive gene” is a gene which expression varies in response to mechanostimulation. Non-limiting examples of such gene includes ITGB1, ITGB3, CTNNB1, POC5, BMP2, COX-2 (PTGS2) and RUNX2.
Thus, in embodiments, the methods of the present invention comprise measuring the expression level of at least one of ITGB1, ITGB3, CTNNB1; POC4, BMP2, COX-2 and RUNX2, preferably at least one of ITGB1; CTNNB1; BMP2, COX-2 and RUNX2, and more preferably at least one of ITGB1; CTNNB1; BMP2 and COX-2. An altered expression in at least one of the above mechanoresponsive genes in the cell sample from the subject as compared to that in the control sample is indicative of an increased risk of developing a scoliosis (or predisposition to IS or presence of IS). For example, (i) a decrease in BMP2, POC5, COX-2, ITGB1 (e.g., at 4 h post mechanostimulation) expression; or (ii) an increase in CTNNB1 expression or ITGB3 (e.g., at 8 or 6 h post mechanostimulation) expression in the cell sample from the subject as compared to that of a control sample is indicative of an increased risk of developing IS (or predisposition to IS or presence of IS).
In embodiments, the above method comprises determining in a cell sample from a subject (i) the average length of cilia on the surface of cells; (ii) the number of cells with elongated cilia; (iii) the number of ciliated cells; or (iv) the expression level of mechanoresponsive genes, over time. In embodiments, the determination is made prior to and after applying a mechanical stimuli to the cells (e.g., 1, 2, 4, 6, 8, 10, 12, 16, 24, 48 and/or 72 h following the application of a mechanical stimulation). An altered average length of cilia, an increase in the number of cells with elongated cilia, a reduced number of ciliated cells or an altered expression level in at least one mechanoresponsive gene over at least one point in time is indicative of an increased risk of (or predisposition to) developing a scoliosis.
In a particular aspect, the present invention provides a method of determining the risk of developing a scoliosis (or of detecting a predisposition to IS or the presence of IS) in a subject comprising (i) applying a physiological level of fluid sheer stress to a cell sample from the subject; and (ii) determining the expression level of a mechanoresponsive gene in the cell sample; (iii) comparing the expression level of the mechanoresponsive gene to that in a control sample; and (iv) determining the risk of developing a scoliosis (or predisposition to IS or the presence of IS) based on the results in step (iii), wherein the mechanoresponsive gene is BMP2, COX-2, RUNX2, ITGB1, ITGB3, CTNNB1, POC5 or any combination of at least two of these genes. An altered gene expression in the mechanoresponsive gene in the cell sample from the subject as compared to that in the control sample is indicative of an increased risk of developing a scoliosis (or predisposition to IS or presence of IS). For example, (i) a decrease in BMP2, COX-2, POC5, ITGB3 (e.g., at 4 h post mechanostimulation) or ITGB1 expression; or (ii) an increase in CTNNB1 expression or ITGB3 expression (e.g., at 8 or 16 h post mechanostimulation) in the cell sample from the subject as compared to that of a control sample is indicative of an increased risk of developing IS (or predisposition to IS or presence of IS).
In a related aspect of the present invention, there is provided a method (e.g., an in vitro method) for determining the risk of developing a scoliosis (e.g., an Idiopathic Scoliosis (IS) such as Infantile Idiopathic Scoliosis, Juvenile Idiopathic Scoliosis or Adolescent Idiopathic Scoliosis (AIS)), in a subject said method comprising: (a) measuring a first level of at least one mechanoresponsive gene in a biological sample from said subject; (b) applying a mechanical stimulus or force to one or more members from said subject (e.g., compressive pressure); (c) measuring a second level of the at least one mechanoresponsive gene in a corresponding biological sample from the subject after the application of said mechanostimulation (biomechanical stimulus); (d) determining a variation between the first level of the at least one mechanoresponsive gene and the second level of the at least one mechanoresponsive gene; (e) comparing the variation to a control variation value; and (f) determining whether the subject has a scoliosis or is predisposed to developing a scoliosis based on the comparison.
Any of the above methods may also be used to classify subjects into specific IS groups. Thus, in a further aspect, the present invention provides a method of classifying a subject (e.g., a subject suffering from IS or at risk of developing IS) comprising determining the biomechanical profile of cells in a cell sample from a subject, wherein an altered biomechanical profile in the cell sample as compared to that in a control sample allows classifying the subject into a specific IS group.
In embodiments, determining the biochemical profile comprises (i) determining the average length of cilia on the surface of cells in a cell sample from the subject; (ii) determining the number of cells with elongated cilia in a cell sample from the subject (iii) determining the number of ciliated cells in a cell sample from the subject; or (iv) any combination of (i), (ii) and (iii). Scoliotic subjects may then be classified into specific groups based on for example, the average length of cilia on the surface of their cells, their number of cells having elongated cilia or based on the number of ciliated cells in their cell sample.
In embodiments, determining the biochemical profile comprises measuring a response to mechanostimulation of cells in a cell sample from a subject suffering from a scoliosis, comprising: (i) applying mechanostimulation to cells from the cell sample from the subject; (ii) measuring the expression level of at least one mechanoresponsive gene. An altered expression level in the at least one mechanoresponsive gene as compared to that of a control sample allows classifying the subject into a specific IS group.
Thus, in embodiments, the above classification method comprises measuring the expression level of at least one of the following mechanoresponsive gene: ITGB1, ITGB3, CTNNB1; POC5, BMP2, COX-2, RUNX2 and CTNNB1, preferably of at least one of ITGB1; CTNNB1; BMP2, COX-2 and RUNX2, and more preferably ITGB1; CTNNB1; BMP2 and COX-2. Subjects may be classified for example according to the specific gene or genes which expression is altered. In addition (or alternatively), subjects may be classified according to the level of variation in gene expression detected (overtime or at a single point in time) and/or based on the absence or presence of a variation in gene expression following mechanostimulation (overtime or at a single point in time). Scoliotic subjects may be compared to control non-scoliotic subjects or to each other and classified accordingly.
As noted above, the present invention discloses the presence of certain gene variants (polymorphic markers) in cells from IS subjects (see Table 4). In particular, rare gene variants (e.g., polymorphisms such as SNPs) have been identified in the following genes: FEZF1, CDH13, FBXL2, TRIM13, CD1B, VAX1, CLASP1, SUGT1, MIPEP, FAM188A, TAF6, WHSC1, GPR158, HNRNPD, RUNX1T1, GRIK3, FUZ, LYN, DDX5, PODXL, ATP5B, PIGK, AL159977.1, SEPT9, TMEM87A, CDYL, SPINT3, SERTM1, FOLR3, FCER2, MAEA, PXT1, UVRAG, SPPL3, IGHV3-50, HIVEP1, SMAD5, PPP1R21, SEC62, TOPBP1, HIPK3, KRTAP12-2, FYB, PXDN, CDV3, RP3-344J20.2, RP11-405L18.2, MRPL18, SOD2, FOXP2, REEP1, C1orf106, DNASE1L1, BTN1A1, MLST8, HMP19, OR8B4, AC105901.1, OR5F1, GLE1, OR5P3, SCFD1, CDK11A, HSD17B14, NFU1, GTF2H3, RAB7A, HOXA3, ZC3H4, DDX55, FBXW10, OSBPL2, POLR1A, NOP58, RAB31, EFNB2, ZCCHC14, GLP1R, RNF149, OR1J2, WI2-81516E3.1, GAPDHP27, SFTA3, ACSF3, POU2F2, MIR345, SNPH, MATR3, RP11-73B8.2, SNORA48, PATZ1, RBM5, HMGA1, ATP1A3, ACTG1P1, PAIP1, KCNMA1, PALB2, PLEKHG5, C11orf2, MT1DP, CYC1, DTD1, CREB3L3, RPL23A, CD164L2, PCCB, GIMAP7, AHCYL1, TNNT2, ZNF134, AC079612.1, MTA2, RP11-672F9.1, CLEC5A, C1orf222, CD96, PPFIBP1, ZNF323 and SUPT3H.
Accordingly, in a further aspect, the present invention provides a method of determining the risk of developing a scoliosis (or a predisposition to IS or the presence of IS) in a cell sample from a subject, the method comprising detecting the presence or absence of at least one polymorphic marker (gene variant) in at least one gene allele of FEZF1, CDH13, FBXL2, TRIM13, CD1B, VAX1, CLASP1, SUGT1, MIPEP, FAM188A, TAF6, WHSC1, GPR158, HNRNPD, RUNX1T1, GRIK3, FUZ, LYN, DDX5, PODXL, ATP5B, PIGK, AL159977.1, SEPT9, TMEM87A, CDYL, SPINT3, SERTM1, FOLR3, FCER2, MAEA, PXT1, UVRAG, SPPL3, IGHV3-50, HIVEP1, SMAD5, PPP1R21, SEC62, TOPBP1, HIPK3, KRTAP12-2, FYB, PXDN, CDV3, RP3-344J20.2, RP11-405L18.2, MRPL18, SOD2, FOXP2, REEP1, C1orf106, DNASE1L1, BTN1A1, MLST8, HMP19, OR8B4, AC105901.1, OR5F1, GLE1, OR5P3, SCFD1, CDK11A, HSD17B14, NFU1, GTF2H3, RAB7A, HOXA3, ZC3H4, DDX55, FBXW10, OSBPL2, POLR1A, NOP58, RAB31, EFNB2, ZCCHC14, GLP1R, RNF149, OR1J2, WI2-81516E3.1, GAPDHP27, SFTA3, ACSF3, POU2F2, MIR345, SNPH, MATR3, RP11-73B8.2, SNORA48, PATZ1, RBM5, HMGA1, ATP1A3, ACTG1P1, PAIP1, KCNMA1, PALB2, PLEKHG5, C11orf2, MT1DP, CYC1, DTD1, CREB3L3, RPL23A, CD164L2, PCCB, GIMAP7, AHCYL1, TNNT2, ZNF134, AC079612.1, MTA2, RP11-672F9.1, CLECSA, C1orf222, CD96, PPFIBP1, ZNF323 or SUPT3H.
In another aspect, the present invention provides a method of genotyping a subject (e.g., a subject suffering from a scoliosis or at risk of developing a scoliosis (e.g., Idiopathic scoliosis)) comprising determining in a cell sample from the subject the presence or absence of at least one polymorphic marker in an allele of at least one of FEZF1, CDH13, FBXL2, TRIM13, CD1B, VAX1, CLASP1, SUGT1, MIPEP, FAM188A, TAF6, WHSC1, GPR158, HNRNPD, RUNX1T1, GRIK3, FUZ, LYN, DDX5, PODXL, ATP5B, PIGK, AL159977.1, SEPT9, TMEM87A, CDYL, SPINT3, SERTM1, FOLR3, FCER2, MAEA, PXT1, UVRAG, SPPL3, IGHV3-50, HIVEP1, SMAD5, PPP1R21, SEC62, TOPBP1, HIPK3, KRTAP12-2, FYB, PXDN, CDV3, RP3-344J20.2, RP11-405L18.2, MRPL18, SOD2, FOXP2, REEP1, C1orf106, DNASE1L1, BTN1A1, MLST8, HMP19, OR8B4, AC105901.1, OR5F1, GLE1, OR5P3, SCFD1, CDK11A, HSD17B14, NFU1, GTF2H3, RAB7A, HOXA3, ZC3H4, DDX55, FBXW10, OSBPL2, POLR1A, NOP58, RAB31, EFNB2, ZCCHC14, GLP1R, RNF149, OR1J2, WI2-81516E3.1, GAPDHP27, SFTA3, ACSF3, POU2F2, MIR345, SNPH, MATR3, RP11-73B8.2, SNORA48, PATZ1, RBM5, HMGA1, ATP1A3, ACTG1P1, PAIP1, KCNMAI, PALB2, PLEKHG5, C11orf2, MTIDP, CYC1, DTD1, CREB3L3, RPL23A, CD164L2, PCCB, GIMAP7, AHCYL1, TNNT2, ZNF134, AC079612.1, MTA2, RP11-672F9.1, CLEC5A, C1orf222, CD96, PPFIBP1, ZNF323 or SUPT3H. Such method allows classifying subjects into specific IS groups. Such classification may allow the application of personalized prevention and treatment regimens, based on the specific genotype the subject.
In embodiments, the above methods comprise detecting the presence or absence of at least one polymorphic marker (gene variant) in at least one gene allele of FEZF1, CDH13, FBXL2, TRIM13, CD1B, VAX1, CLASP1, SUGT1, MIPEP, FAM188A, TAF6, WHSC1, GPR158, HNRNPD, RUNX1T1, GRIK3, FUZ, LYN, DDX5, PODXL, ATP5B, PIGK, AL159977.1, BTN1A1, CDK11A, HIVEP1, HSD17B14, KCNMA1, PXDN, RAB31, RBM5, RNF149, SOD2, TOPBP1, ZCCHC14 and ZNF323.
In embodiments, the above methods comprise detecting the presence or absence of at least one polymorphic marker (gene variant) in at least one gene allele of FEZF1, CDH13, FBXL2, TRIM13, CD1B, VAX1, CLASP1, SUGT1, MIPEP, FAM188A, TAF6, WHSC1, GPR158, HNRNPD, RUNX1T1, GRIK3, FUZ, LYN, DDX5, PODXL, ATP5B, PIGK and AL159977.1.
In embodiments, the above methods comprise detecting the presence or absence of at least one polymorphic marker (gene variant) in at least one gene allele of ATP5B, BTN1A1, CD1B, CDK11A, CLASP1, DDX5, FBXL2, HIVEP1, HSD17B14, KCNMA1, PXDN, RAB31, RBM5, RNF149, SOD2, SUGT1, TOPBP1, ZCCHC14 and ZNF323.
In embodiments, the above methods comprise detecting the presence or absence of at least one polymorphic marker (gene variant) in at least one gene allele of HNRNPD, ATP5B, LYN, CD1B, CLASP1, SUGT1 and AL159977.1.
Preferably, the above methods comprise detecting the presence or absence of at least one polymorphic marker (gene variant) in at least one gene allele of CD1B, CDK11A, CLASP1, RNF149 and SUGT1, more preferably, in at least one of CDB1, CLASP1 and SUGT1. In embodiments, the methods comprise detecting the presence or absence of a polymorphic marker in CDB1, CLASP1 and SUGT1.
In a particular aspect, the above-mentioned polymorphic marker is a gene variant (e.g., SNP) as defined in Table 6. In embodiments, the polymorphic marker is a risk variant defined in Table 6.
Methods of the present invention may further comprise detecting the presence or absence of at least polymorphic marker (e.g., risk variant, SNP) in at least one gene listed in Table 2.
In the above methods, detecting the presence of a risk allele (risk variant(s)) in polymorphic markers of one or more of the above genes is indicative of a risk of developing a scoliosis (or predisposition to IS). The level of risk or the likelihood of developing a scoliosis is determined depending on the number of risk-associated variants that are present in cells from a subject. The level of risk is determined by calculating a genetic score (ODD ratio), as well known in the art.
Accordingly, the present invention encompasses detecting the presence or absence of a polymorphic marker (e.g., SNP) in multiple genes listed in Tables 2 and 4-6 (e.g., a combination of 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 15, 16, 17, 18, 20, 21, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 35, 40, 46, 50, 55, 60, 65, 70, 75, 80, 85, 90, 95, 100, 105, 110, 115 and 120 genes).
Alleles for SNP markers as referred to herein refer to the bases A, C, G or T as they occur at the polymorphic site in the SNP assay employed. The person skilled in the art will realize that by assaying or reading the opposite DNA strand, the complementary allele can in each case be measured. Thus, for a polymorphic site (polymorphic marker) characterized by an A/G polymorphism, the assay employed may be designed to specifically detect the presence of one or both of the two bases possible, i.e. A and G. Alternatively, by designing an assay that is designed to detect the opposite strand on the DNA template, the presence of the complementary bases T and C can be measured. Quantitatively (for example, in terms of relative risk), identical results would be obtained from measurement of DNA strands (+ strand or − strand).
Detecting specific gene variants or polymorphic markers and/or haplotypes of the present invention can be accomplished by methods known in the art. Such detection can be made at the nucleic acid or amino acid (protein) level.
For example, standard techniques for genotyping for the presence of gene variants (e.g., SNPs and/or microsatellite markers) can be used, such as sequencing, fluorescence-based techniques (Chen, X. et al., Genome Res. 9(5): 492-98 (1999)), methods utilizing PCR, LCR, Nested PCR and other methods for nucleic acid amplification. Specific methodologies available for SNP genotyping include, but are not limited to, TaqMan™ genotyping assays and SNPlex™ platforms (Applied Biosystems), mass spectrometry (e.g., MassARRAY™ system from Sequenom), minisequencing methods, real-time PCR, Bio-Plex™ system (BioRad), CEQ and SNPstream™ systems (Beckman), Molecular Inversion Probe™ array technology (e.g., Affymetrix GeneChip), and BeadArray™ Technologies (e.g., Illumine GoldenGate and Infinium assays). By these or other methods available to the person skilled in the art, one or more alleles at polymorphic markers, including microsatellites, SNPs or other types of polymorphic markers, can be identified.
In accordance with another aspect of the present invention, there is provided a method of selecting a preventive measure, treatment or follow-up schedule for a subject suffering from IS or at risk of developing IS comprising classifying or genotyping the subject using one or more of the above-noted methods:
In order to determine the risk of developing a scoliosis (or predisposition to IS) it is also possible to assess the presence of a gene variant (such as a SNP) in linkage disequilibrium with any of the gene variants identified herein (e.g., SNPs/variants listed in Table 6).
Once a first SNP has been identified in a genomic region of interest, the practitioner of ordinary skill in the art can easily identify additional SNPs in linkage disequilibrium with this first SNP. In the context of the invention, the additional SNPs in linkage disequilibrium with a first SNP are within the same gene of said first SNP. Linkage disequilibrium (LD) is defined as the non-random association of alleles at different loci across the genome. Alleles at two or more loci are in LD if their combination occurs more or less frequently than expected by chance in the population.
For example, if a particular genetic element (e.g., an allele of a polymorphic marker, or a haplotype) occurs in a population at a frequency of 0.50 (50%) and another element occurs at a frequency of 0.50 (50%), then the predicted occurrence of a person's having both elements is 0.25 (25%), assuming a random distribution of the elements. However, if it is discovered that the two elements occur together at a frequency higher than 0.25, then the elements are said to be in linkage disequilibrium, since they tend to be inherited together at a higher rate than what their independent frequencies of occurrence (e.g., allele or haplotype frequencies) would predict.
When there is a causal locus in a DNA region, due to LD, one or more SNPs nearby are likely associated with the trait too. Therefore, any SNPs in LD with a first SNP associated with autism or an associated disorder will be associated with this trait. Identification of additional SNPs in linkage disequilibrium with a given SNP involves: (a) amplifying a fragment from the gene comprising a first SNP from a plurality of individuals; (b) identifying of second SNPs in the gene comprising said first SNP; (c) conducting a linkage disequilibrium analysis between said first SNP and second SNPs; and (d) selecting said second SNPs as being in linkage disequilibrium with said first marker. Subcombinations comprising steps (b) and (c) are also contemplated.
Methods to identify SNPs and to conduct linkage disequilibrium analysis can be carried out by the skilled person without undue experimentation by using well-known methods.
Thus, the practitioner of ordinary skill in the art can easily identify SNPs or combination of SNPs within haplotypes in linkage disequilibrium with the at risk gene variant (e.g. risk SNP).
Such markers are mapped and listed in public databases like HapMap as well known to the skilled person. Genomic LD maps have been generated across the genome, and such LD maps have been proposed to serve as framework for mapping disease-genes (Risch et ai, 1996; Maniatis et ai, 2002; Reich et ai, 2001). If all polymorphisms in the genome were independent at the population level (i.e., no LD), then every single one of them would need to be investigated in association studies, to assess all the different polymorphic states. However, due to linkage disequilibrium between polymorphisms, tightly linked polymorphisms are strongly correlated, which reduces the number of polymorphisms that need to be investigated in an association study to observe a significant association. Another consequence of LD is that many polymorphisms may give an association signal due to the fact that these polymorphisms are strongly correlated.
The two metrics most commonly used to measure LD are D′ and r2 and can be written in terms of each other and allele frequencies. Both measures range from 0 (the two alleles are independent or in equilibrium) to 1 (the two allele are completely dependent or in complete disequilibrium), but with different interpretation. D′ is equal to 1 if at most two or three of the possible haplotypes defined by two markers are present, and <1 if all four possible haplotypes are present. r2 measures the statistical correlation between two markers and is equal to 1 if only two haplotypes are present.
Most SNPs in humans probably arose by single base modifying events that took place within chromosomes many times ago. A single newly created allele, at its time of origin, would have been surrounded by a series of alleles at other polymorphic loci like SNPs establishing a unique grouping of alleles (i.e. haplotype). If this specific haplotype is transmitted intact to next generations, complete LD exists between the new allele and each of the nearby polymorphisms meaning that these alleles would be 100% predictive of the new allele. Thus, because of complete LD (D′=1 or r2=1) an allele of one polymorphic marker can be used as a surrogate for a specific allele of another. Event like recombination may decrease LD between markers. But, moderate (i.e. 0.5≤r; r2<0.8) to high (i.e. 0.8≤; r2<1) LD conserve the “surrogate” properties of markers. In LD based association studies, when LD exist between markers and an unknown pathogenic allele, then all markers show a similar association with the disease.
It is well known that many SNPs have alleles that show strong LD (or high LD, defined as r2≥0.80) with other nearby SNP alleles and in regions of the genome with strong LD, a selection of evenly spaced SNPs, or those chosen on the basis of their LD with other SNPs (proxy SNPs or Tag SNPs), can capture most of the genetic information of SNPs, which are not genotyped with only slight loss of statistical power. In association studies, this region of LD are adequately covered using few SNPs (Tag SNPs) and a statistical association between a SNP and the phenotype under study means that the SNP is a causal variant or is in LD with a causal variant. It is a general consensus that a proxy (or Tag SNP) is defined as a SNP in LD (r2≥0.8) with one or more other SNPs. The genotype of the proxy SNP could predict the genotype of the other SNP via LD and inversely. In particular, any SNP in LD with one of the SNPs used herein may be replaced by one or more proxy SNPs defined according to their LD as r2≥0.8.
These SNPs in linkage disequilibrium can also be used in the methods according to the present invention, and more particularly in the diagnostic methods according to the present invention. In particular, the presence of SNPs in linkage disequilibrium (LD) with the above identified SNPs may be genotyped, in place of, or in addition to, said identified SNPs. In the context of the present invention, the SNPs in linkage disequilibrium with the above identified SNP are within the same gene of the above identified SNP. Therefore, in the present invention, the presence of SNPs in linkage disequilibrium (LD) with a SNP of interest and located within the same gene as the SNP of interest may be genotyped, in place of, or in addition to, said SNP of interest. Preferably, such an SNP and the SNP of interest have r2≥0.70, preferably r2≥0.75, more preferably r2≥0.80, and/or have D′≥0.60, preferably D′≥0.65, D′≥0.7, D′≥0.75, more preferably D+≥0.80. Most preferably, such an SNP and the SNP of interest have r2≥0.80, which is used as reference value to define “LD” between SNPs.
Compositions and kits for use in the methods of the present invention (i.e., for determining the risk of developing a scoliosis; for genotyping a subject and for classifying a subject suffering from a scoliosis or at risk of developing a scoliosis) may include for example (i) one or more reagents for detecting (a) the length of cilia at the surface of cells; (b) the number of ciliated cells; (c) the level of expression of at least one mechanoresponsive gene; and/or (d) the presence or absence of a variant (polymorphic marker) in a gene listed in Table 4 or 6 or a substitute marker in linkage disequilibrium therewith.
Compositions and kits can comprise oligonucleotide primers and hybridization probes (e.g., allele-specific oligonucleotide primers and hybridization probes for determining the level of a mechanoresponsive gene or variant in a gene listed in Tables 4-6), restriction enzymes (e.g., for RFLP analysis) and/or antibodies that bind to a mutated polypeptide (polymorphic polypeptide) which is encoded by a nucleic acid comprising a polymorphic marker (e.g., gene variant) of the present invention (e.g., a nucleic acid comprising a variant (polymorphic marker) as defined in Table 6).
The kit (or composition) may also include any necessary buffers, enzymes (e.g., DNA polymerase) and/or reagents necessary for performing the methods of the present invention. The kit may comprise one or more labeled nucleic acids (or labeled antibody) capable of specific detection of one or more polymorphic markers of the present invention (e.g., gene variants defined in Table 6) or any markers in linkage disequilibrium therewith as well as reagents for the detection of the label. The kit may also comprise a device for applying a mechanical stimulus or force on one or more members of the subject (e.g., an inflatable strap or arm cuff).
Reagents may be provided in separate containers or premixed depending on the requirements of the method. Suitable labels are well known in the art and will be chosen according to the specific method used. Non-limiting examples of suitable labels (including non-naturally occurring labels/synthetic labels) include a radioisotope, a fluorescent label, a magnetic label, an enzyme, etc.
In a preferred embodiment, the detection of a polymorphic marker (e.g., gene variant defined in Table 6) in a gene associated with IS in accordance with the present invention is determined by DNA Chip analysis. Such DNA chip or nucleic acid microarray consists of different nucleic acid probes that are chemically attached to a substrate, which can be a microchip, a glass slide or a microsphere-sized bead. A microchip may be constituted of polymers, plastics, resins, polysaccharides, silica or silica-based materials, carbon, metals, inorganic glasses, or nitrocellulose. Probes comprise nucleic acids such as cDNAs or oligonucleotides that may be about 10 to about 60 base pairs. To determine the alteration of the genes, a sample from a test subject is labelled and contacted with the microarray in hybridization conditions, leading to the formation of complexes between target nucleic acids that are complementary to probe sequences attached to the microarray surface. The presence of labelled hybridized complexes is then detected. Many variants of the microarray hybridization technology are available to the man skilled in the art.
In embodiments, there is provided a composition (e.g., a diagnostic composition) which is generated following one or more steps of the methods describe herein and which include a biological sample (e.g., cell sample, blood sample, etc.) from the subject to be tested. The preparation of such composition occurs while testing a subject's biological sample for the risk of developing a scoliosis (including the risk of developing a more severe scoliosis); for aiding in the prevention and treatment of scoliosis including for determining the best treatment regimen; for adapting an undergoing treatment regimen; for selecting a new treatment regimen or for determining the frequency of a specific treatment regimen or follow-up schedule. Such compositions may be prepared using as kits described herein.
In embodiments, compositions and kits of the present invention may thus comprise one or more oligonucleotides probe or amplification primer for the detection (e.g., amplification or hybridization) of a mechanoresponsive gene (e.g., ITGB1, ITGB3, CTNNB1; POC5, BMP2, COX-2, RUNX2 and CTNNB1) or for the detection of a polymorphic marker of the present invention (e.g., a variant or reference sequence defined in Table 6). In embodiments, oligonucleotide probes are provided in the form of a microarray or DNA chip. The kit may further include instructions to use the kit in accordance with the methods of the present invention (e.g., for determining the risk of (or predisposition to) developing a scoliosis; for genotyping a subject or for classifying a subject suffering from a scoliosis or at risk of developing a scoliosis in a specific genetic or functional group).
The present invention is illustrated in further details by the following non-limiting examples.
In certain subjects, scoliosis develops rapidly over a short period of time to the point of requiring a corrective surgery (often when the deformity reaches a Cobb's angle≥45°). Current courses of action available from the moment a scoliosis such as IS is diagnosed (when scoliosis is apparent) include observation (including periodic x-rays, when Cobb's angle is around 10-25°), orthopedic devices (such as bracing, when Cobb's angle is around 25-30°), and surgery (Cobb's angle over 45°). Thus, a more reliable determination of the risk assessment (through using one or more methods of the present invention) could enable to 1) select an appropriate diet to remove certain food products identified as contributors to scoliosis in certain subjects; 2) select the best therapeutic agent or treatment or preventive measure (e.g., neutralizing antibody specific to OPN, long term brace treatment, melatonin, selenium, PROTANDIM; HA supplements or HA-rich diet, antibody against CD44 etc.); 3) select the least invasive available treatment such as postural exercises (e.g., massages, or low intensity pulsed ultrasound (LIPUS), orthopedic device (brace) or other treatment or preventive measure (e.g., accupoint heat sensitive moxibustion, heat therapy with pad, thermal bath, electroacupuncture); or less invasive surgeries or surgeries without fusions (a surgery that does not fuse vertebra and preserves column mobility) and/or 4) the best follow-up schedule (e.g., increasing or decreasing the number of follow-up visit to the doctor during for example a 3, 6 or 12 month period or increasing or decreasing the number of x-rays during for example a 3, 6 or 12 month period).
Study cohorts. This study was approved by the institutional review boards of The Sainte-Justine University Hospital, The Montreal Children's Hospital, The Shriners Hospital for Children in Montreal and McGill University, as well as the Affluent and Montreal English School Boards. Parents or legal guardians of all the participants gave written informed consent, and minors gave their assent. All the experiments were performed in accordance with the approved guidelines and regulations. All subjects are residents of Quebec, Canada and are of European descent. Each IS patient was clinically assessed by an orthopedic surgeon at the Sainte-Justine Children's Hospital. The inclusion criteria for this study were a minimum Cobb angle of 10 degrees and a diagnosis of idiopathic scoliosis. Cobb angle is the clinical parameter used for quantification of curve magnitude where a larger angle denotes a greater magnitude. For cellular experiments, we used bone samples from a subset of patients who required corrective surgery. Control bone samples were from surgical non-scoliotic trauma patients recruited at the Sainte-Justine University Hospital. All patients used for cellular studies were adolescent females (Table 1). The medical files of controls were reviewed to exclude the possibility of scoliosis. For exome sequencing, control blood samples were collected from non-IS participants that were first screened by an orthopedic surgeon using the forward-bending test (Adam's test).67 Any children with apparent spinal curvature were not included in the control cohort.
Cell culture. Primary osteoblasts were derived from bone specimens obtained from IS patients and trauma patients (as controls), intraoperatively. For all IS cases, bone specimens were surgically removed from affected vertebrae (the sampled vertebrae varied from T3 to L4) as a part of correctional surgery. For non-scoliotic control cases, bone specimens were obtained from other anatomic sites (tibia or femur) during trauma surgery. Using a cutter, bone fragments were manually reduced to small pieces under sterile conditions. The small bone pieces were incubated in aMEM medium containing 10% fetal bovine serum (FBS; certified FBS, Invitrogen Life Technologies, ON, Canada) and 1% penicillin/streptomycin (Invitrogen) at 37° C. in 5% CO2, in a 10-cm2 culture dish. After one month, osteoblasts emerging from the bone pieces were separated from the remaining bone fragments by trypsinization. Bone cells were characterized using alizarin red (
Immunofluorescence. Cells were seeded in 8-well chamber slides (Falcon, Corning Incorporated, AZ, USA) at a density of 9×105 cells per well. Upon reaching 80% confluence, the cells were washed with PBS and starved to induce cilia differentiation. At each time point during starvation, the cells were washed with PBS, fixed with 4% PFA and 3% sucrose in PBS buffer for 10 minutes at room temperature, washed (1% BSA in PBS), and then permeabilized with 0.1% Triton™ X-100 in PBS for 10 min at room temperature. After two washes, the cells were blocked in 5% BSA in PBS for 1 h at room temperature. Mouse anti-acetylated α-tubulin (Invitrogen; 32-2700) diluted (1:1000) in 3% BSA-PBS was the primary antibody to detect cilia. Cells were incubated with this primary antibody overnight at 4° C. The following day, after 3 washes, the cells were incubated for 1 h at room temperature with Alexa Fluor® 488 conjugated goat anti-mouse secondary antibody (Invitrogen; A11029). After 3 washes, 1 μg/ml dilution of Hoechst (Sigma-Aldrich, ON, Canada; 94403) in 1% BSA-PBS was used to stain the nucleus at room temperature for 10 min. Alexa Fluor® 555 Phalloidin dilution (1:40) in 1% BSA-PBS incubation at room temperature for 20 min was used to stain the cytoskeleton. The images were captured on a Leica Confocal TCS-SP8 or Zeiss Confocal 880 using ×63 (oil) objective with 1,024×1,024 pixel resolution. Each sample has been examined in stitched 5×5 tile images, in duplicate (50 fields of view). Maximum projections of the Z-stacks were used for primary cilium measurement and counting was done in Image J (NIH). Two separate double staining with anti-Ninein antibody (Millipore, CA, USA, ABN1720) as the cilia base marker and anti-IFT88 (Proteintech, IL, USA, 13967-1-AP) were performed to co-stain the cilia alongside anti-acetylated α-tubulin to confirm the method.
Proliferation assay. Bone cells acquired from IS patients were cultured, as previously described above for cell culture. Upon reaching 90% confluence, cells were harvested by adding Trypsin-EDTA (0.25%) and phenol red (Thermo Fisher Scientific Inc., NY, USA 25200-072) for subculture (P3 to P5). Cells were washed and counted (Trypan Blue staining of viable cells) using the Vi-cell® XR (Beckman Coulter, Inc., CA, USA) automated cell counter and then seeded in 12 well plates (100,000 per well in triplicate for each sample). Cells were allowed to grow at 37° C. in 5% CO2 and each well was counted at different time points (24 h, 48 h, 72 h, 96 h and 120 h post culture). Bone cells from age- and gender-matched trauma surgical patients were used as controls.
In vitro fluid flow stimulation. For each sample, the cells were divided equally between 4 vented 75 cm2 tissue culture flasks and cultured in 21 ml medium (αMEM+10% FBS+1% penicillin/streptomycin). Upon reaching 80% confluence, culture medium was removed, the cells were washed with warm PBS and then transferred to starvation medium for 48 h. After 48 h, cells were washed again and transferred back to 20 ml regular medium immediately before they were subjected to oscillatory fluid flow using a double-tier rocking platform, with some modification of the rocker method described by Robin M. Delaine Smith et al.16 with a maximum tilt angle of ±20 degrees, and a speed of 1 tilt per second (equal to 1 Hz). The entire unit was housed in a cell culture incubator held at 37° C. and 5% CO2 for the duration of the flow experiments (0 h, 4 h, 8 h, and 16 h). Control, no flow cells were housed in the same incubator and harvested at 8 h.
Fluid shear stress patterns were applied to cells in a predictable, controlled, and physiologically relevant manner through the whole experiment. From a biomechanical point of view, one expects that the cilia-related gene expression to be a function of time elapsed, t, and the shear stress exerted on the cells, which in turn depends on the fluid viscosity, v, the frequency of flow oscillations, f, and the thickness of the fluid film, h. Designing an experiment in which all these parameters match the physiological conditions can be prohibitively challenging, and in fact unnecessary. Fortunately, the design of the experiment can be simplified using the π-Buckingham theorem68 that is widely used in engineering and physics. The π-Buckingham theorem states that the dynamics of a problem (e.g. fluid flow) can be completely described and measured by a set of nondimensional quantities.
Here, ϕ is defined as the ratio of gene expression measured at a given time, t, to its expression at 0 h. Noting that ϕ is a nondimensional quantity, one can use the π-Buckingham theorem to show that ϕ is a function of two other nondimensional numbers:
The Womersley number takes into account the effect of viscosity and shear stress exerted on the cell and is widely used in biomechanical studies involving pulsating fluid flow.69 The value of Womersley number ranges from 5 to 18 in fluid motion of cerebrospinal fluid in the spinal cavity.25 We designed our experiment such that the Womersley number experienced by the cells is equal to 8, which is well within the expected range in vivo. This value corresponds to an average shear stress at the center of the dish with a magnitude of 1 [Pa] in our experiment (see Zhou et al. 201070 for details of calculation).
RNA extraction. All RNA was extracted using Trizol™ (Invitrogen-Thermo Fisher Scientific, 15596-026), according to the manufacturer's instructions. Briefly, cell culture dishes containing adherent bone cells (passage 2 or 3) were washed with PBS before trypsinization, then transferred to a 15 ml tube and after centrifugation, the cell pellet was stored immediately at −80° C. All the cells went through RNA extraction the following day and were lysed in 1 ml Trizol™. RNeasy MinElute™ Cleanup Kit (Qiagen Inc., ON, Canada; 74204) was used to purifiy RNA, according to the manufacturer's instructions.
Quantitative RT-PCR. Reverse transcriptase quantitative PCR (RT-qPCR) was used to assay gene expression levels. All primer design, validation, and gene expression were performed at the Genomics core facility of Institut de Recherche en Immunologie et Cancérologie (IRIC), University of Montreal, Quebec. All RNA was run on a bioanalyzer using a Nano RNA chip to verify its integrity. Total RNA was treated with DNase and reverse transcribed using the Maxima™ First Strand cDNA synthesis kit with ds DNase (Thermo Fisher Scientific). Before use, RT samples were diluted 1:5. Gene expression was determined using assays designed with the Universal Probe Library from Roche (www.universalprobelibrary.com). For each qPCR assay, a standard curve was performed to ensure that the efficacy of the assay is between 90% and 110%. Quantitative PCR (qPCR) reactions were performed in triplicate with 2 internal controls (GAPDH and HPRT) using PERFECTA qPCR FASTMIX II™ (Quanta Biosciences, Inc., MD, USA), 2 μM of each primer, and 1 μM of the corresponding UPL probe. The Viia7 qPCR instrument (Thermo Fisher Scientific) was used to detect the amplification level and was programmed with an initial step of 20 sec at 95° C., followed by 40 cycles of: 1 sec at 95° C. and 20 sec at 60° C. Relative expression (RQ=2−ΔΔCT) was calculated using the Expression Suite software (Thermo Fisher Scientific), and normalization was done using both GAPDH and HPRT. The baseline expression level at 0 h (before treatment) of every sample was defined as its own calibrator. The calibrator has a RQ value of 1 because it does not vary compared to itself. For each gene, the two groups (control and IS) were compared at each time point using a pairwise t-test. In this way, we asked one question per gene and we did three comparisons to answer (three comparisons per family of test). After looking at the results of these tests, we asked another question for three of the genes that seemed to show an overall expression profile that was different between IS and controls (ITGB1, CTNNB1 and POC5). For these genes we added three more comparisons: the expression at 0 h for IS was compared to each of the other time points (4 h, 8 h and 16 h) using 3 separate pairwise t-tests. We also examined the base line of gene expression in IS versus control, by comparing the delta Ct mean (expression level normalized with endogenous controls) at 0 h of all samples using t-test.
Exome and Sanger sequencing. Genomic DNA was extracted from the whole blood of 73 IS patients and 70 controls using the PureLink® Genomic DNA extraction kit (Thermo Fisher Scientific). Library preparation and exome sequencing was performed at GENESE (Génomique de la Santé de l'Enfant, Sainte-Justine University Hospital Research Center). Selected variants were confirmed using Sanger sequencing technologies at the Genome Quebec Innovation Centre. Samples were barcoded, and captured using libraries of synthetic biotinylated RNA oligonucleotides (baits) targeting 50 Mb of genome (Agilent SureSelect Human All Exon 50 Mb v3), and sequenced on the 5500 SOLiD™ Sequencing System (Thermo Fisher Scientific). Trimmed FASTQ formatted sequences were aligned to the exome target sequence using Bfast+bwa (version 0.7.0a) in the paired-end alignment mode.71 Mapped reads were refined using GATK and Picard program suites34 to improve mapped reads near indels (GATK indel realigner) and improve quality scores (GATK base recalibration) and to remove duplicate reads with the same paired start sites (Picard mark Duplicates). Variants were called using SAMTOOLS batch calling procedure referenced against the UCSC assembly hg19 (NCBI build 37). Variants were additionally filtered to remove variants that are present with minor allele frequencies (MAF)>0.05 (dbSNP, 1000 genomes, ExAC and/or Exome variant server (ESP). Variants were annotated using the GEMINI framework72 that provides quality metrics and extensive metadata (e.g. OMIM, clinVar, etc.) to help further prioritize variants. To optimize the querying criteria for the GEMINI database, we performed bidirectional Sanger sequencing for more than 100 different variants. Using an optimized threshold (Coverage DP>10x, Genotype quality GQ>80, Call rate>90%, Alternate quality (QUAL)>50, Map quality>20), the results show 85% genotype correlation between the sequencing methods. This threshold was used to filter our data prior to analysis.
Statistical analyses. To test for accumulation of rare variants in genes associated with IS, we used the Sequence Kernel Association Optimal unified test algorithm SKAT-O.35 SKAT-O is a region-based omnibus test that increases a study's power to detect rare variants. Because there is no model for the genetic basis underlying IS, SKAT-O is optimal over SKAT or burden testing alone, since it is a robust technique to detect variable effect rare polymorphisms.22 Variants that passed our filtering criteria, with a dataset minor allele frequency 5% were analyzed in two different sets. Additionally, high quality variants with membership to the Illumina Human Exome Chip were extracted from the Gemini database for population structure analyses using R package SNPRelate. The top two components were used as covariates in the SKAT-O analysis. The first set used the manual recommended settings for rare variants: SKATBinary with SNP weighting based on Madsen and Browning weights (i.e. less frequent are more impactful) (B1=0.5, B2=0.5). The second set weighted-SNPs are based on Combined Annotation Dependent Depletion (CADD) scores (i.e. functional, deleterious, and disease-causing variants have greater impact).73 For both sets, we generated a null model of no association between genetic variables and outcome phenotype adjusting for covariates (see Tables 4 and 5 in Example 6). Covariate analysis confirmed that there was not population stratification in our dataset. The gene-level significant thresholds were determined by the efficient resampling (ER) method and the conservative minor allele count (MAC) threshold of≤40.74 To examine the number of ciliary genes in our datasets, we used two reference lists that define a ciliary function based on experimental evidence: 1) the SYSCILIA™ gold standard list of genes containing 303 ciliary genes verified by independent publications36; 2) a list of 52 genes (51 novel genes because one is in the SYSCILIA™ list) from a functional genomic screen that used RNA interference to identify genes involved in the regulation of ciliogenesis and cilia length.37
Review of ciliary genes associated with spinal curvature. If idiopathic scoliosis is a genetically heterogeneous ciliopathy-like condition, then we expect a large number of known ciliopathies have spinal curvature as a comorbidity. We reviewed the SYSCILIA™ gold standard gene list and the Kim et al., 2010 list36,37 to ascertain how many ciliary genes were associated with spinal curvature phenotypes in either a human syndrome or an animal model. For each of the 303 genes, the search terms in Google included the gene name with “spinal curvature” and “scoliosis”. Additionally, if the gene was known to be associated with a syndrome in the OMIM database, we also searched the syndrome name with “scoliosis”.
Ciliopathies comprise a large number of human genetic disorders that are defined by the causative or predisposing gene being related to cilia structure, function, sensory pathways, or localization. To examine whether the IS cilia phenotype is linked to ciliary genes, the inventors reviewed established cilia gene lists for associations to spinal curvature and surveyed well defined IS genes in human and animal studies for a functional link to cilia. From the review using the SYSCILIA gold standard list of 303 verified ciliary genes, 55 genes are associated with a human syndrome having clinical reports of scoliosis were found. Two of these genes, SUFU and Adherens junctions associated protein 1 (AJAP1) are in loci that are associated with IS through linkage studies,55,56 and 19 have both clinical (human) and experimental (animal model) associations with scoliosis. Furthermore, an additional 13 published animal model studies were found in which manipulation of the ciliary gene caused spinal curvature. In summary, 22% of these well-established cilia genes are associated with spinal curvature. In addition, from the study by Kim et al. 2010,37 3 other genes that modulate ciliogenesis or cilia length and feature a clinical syndrome with reported scoliosis were identified. Table 2 below provides a list of ciliary genes that are associated with a scoliosis phenotype.
To assess whether there is an observable defect associated with primary cilia in IS patients, osteoblast cells derived from bone specimens obtained during surgery were examined. All samples were from age matched adolescent female subjects. FBS deprivation was used to promote ciliogenesis and differentiation. Cilia morphology was examined using anti-acetylated α-tubulin immunofluorescence staining prior to and after 24, 48, and 72 hours (h) starvation in primary osteoblasts from 4 IS patients and 4 non-scoliotic trauma patients used as controls (
The average length of cilia in μm±variance at four starvation time points (0, 24 h, 48 h, 72 h) is shown in Table 3. To compare the length of cilia between IS and non-IS controls, the inventors combined measurements of up to 1000 cilia for each sample (25 fields per sample, in duplicate), at each time point, then used a t-test to compare the mean lengths of cilia in IS vs. control pools. The results show that cells from IS subjects have significantly longer cilia. The difference in length was significant across all the time points for IS patients (P value≤0.005), n=8 (4 IS vs. 4 Controls).
Cilia assembly, disassembly, and length have been associated with cell cycle and control of cell proliferation.23 To investigate if there is a correlation between longer cilia in IS patients and a differential growth rate, cell proliferation was assayed by counting viable cell (Trypan Blue stained) number as a function of time. Cell proliferation rate varied in all the samples as visible in the error bars (
As shown in Example 3, cilia in IS cells are significantly longer across all measured time points, but the most conspicuous length differences are visible before starvation and at 24 hours after starvation (
The functional response of IS cells having long cilia was evaluated by monitoring changes in expression for several mechanoresponsive genes under fluid flow, at four time points (0, 4 h, 8 h, 16 h). A 1 [Pa] shear stress (the magnitude at the center of the dish) in 1 [Hz] frequency was applied, which corresponds to a Womersley number of 8. The biomechanical parameters were chosen to be physiologically relevant based on the reported frequency spectra of forces affecting the human hip during walking, (1-3 Hz),24 and the Womersley number estimated for cerebrospinal fluid motion in the spinal cavity (5-18).25
Differential gene expression was compared for IS vs. controls at each time point, and then the whole response profile of each gene was examined. For each gene, after normalization to two endogenous controls (GAPDH and HPRT), the baseline expression level at 0 h (before treatment) of every sample was defined as its own calibrator. The gene expression for all time points for each sample was compared to its own 0 h (which has a RQ value of 1). The results have been shown as fold changes compared to the calibrator. One question per gene was asked: is there any difference between IS and control at each time point? For three genes (ITGB1, CTNNB1 and POC5) that showed a different overall expression pattern in IS vs. controls, a second question was asked: is there a significant difference in gene expression before and after flow? Each gene has been analysed independently using a pair wise t-test for each question followed by a post hoc Bonferroni. Concordant with previous findings regarding biomechanical induction and the expression of osteogenic factors,12,17,26 our assay showed a dramatic increase in Bone morphogenetic protein 2 (BMP2) and Cyclooxygenase-2 (COX2) expression in IS and controls, following 4 and 8 hours of fluid flow induction. However, for both genes, the IS response was significantly less than controls (
Finally, Fuzzy planar cell polarity (FUZ), Protein Of Centriole 5 (POC5), and Ladybird homeobox 1 (LBX1) genes were added to the experiment following our exome analysis, and recent published scoliosis genetic studies.3,32,33 For FUZ, we did not see any significant differential expression between IS and controls at any time point following flow application. POC5 expression decreased almost by half at the 4 hour point in both IS and controls (p<0.05), suggesting a role in early stages of mechanotransduction response (
Whole exome sequencing was performed to test the hypothesis that rare variants in ciliary genes might be causal for IS. Exome sequencing was done on peripheral blood DNA samples from 73 IS and 70 matched controls using the Agilent SureSelect™ Human All Exon 50 Mb v3 capture kit and the Life Technologies 5500 SOLiD™ Sequencing System. Variants were called and annotated using a customized bioinformatics pipeline including SAMTOOLS™, GATK™ and Picard™ program suites.34 To reduce the number of likely variants, we subsequently filtered the total variant set to remove those with a minor allele frequency greater than 5%, as well as variants not in or adjacent to protein-coding exons. After filtering, our dataset included 73 IS patients, 70 controls, 8544 genes, and 16,384 variants. We used SKAT-O to survey our exome data under two different weighting parameters: in favor of lower frequency variants (Madsen Browning weighting, Set I), and in favor of variants with projected deleterious effects and pathogenicity (Combined Annotation Dependent Depletion: CADD weighting, Set II). Since the underlying biology of idiopathic scoliosis is not understood, an omnibus test such as SKAT-O is considered more powerful than a burden test because it does not make assumptions regarding direction or size of variant effect.35 Analysis using Madsen Browning weighting (Set I) identified 259 genes and analysis using CADD weighting (Set II) identified 240 genes that are significant (p≤0.01) after correction for multiple testing. The Sets were compared and genes that were significant in both (n=120; Table 4) were considered candidates for idiopathic scoliosis (i.e., polymorphic/genetic markers comprising risk variants). This list was examined for ciliary genes using the SYSCILIA gold standard list36 and the Kim et al. 201037 list as references, along with inquiries using Google search engine. Fuzzy planar cell polarity protein (FUZ) is the only known ciliary gene in both data lists. However, there is a greater number of variants in controls compared with cases (12 controls with at least one variant vs 1 patient). Of the candidate genes, the 23 most significant (p<0.001) were further examined to determine the number of patients and controls having at least one variant (Table 5). Seven of these genes have greater variant enrichment among patients: CD1B, CLASP1, SUGT1, HNRNPD, LYN, ATPSB, AL159977.1. Table 6, provides a list of rare variants identified in the 120 genes linked to IS (risk variants) and listed in Tables 4 and 5. The polynucleotide sequence used as a reference for each of these genes was from Ensembl version 70.
The variant profile for each of the four IS patients used in the cellular analyses (see Examples 3-5) was also analyzed, to see if there are shared genes with variant enrichment. Controls could not be examined because the cohort used for molecular work differs from the genotyped control cohort. Control bone tissue were obtained intraoperatively from non-scoliotic trauma patients whereas the genotyped controls were from a non-surgical cohort. None of the genes identified in our combined SKAT-O table were shared among all the four patients, but all the patients have variants in either CD1B, CLASP1, or SUGT1. The CDK11A gene is represented among three patients, but in the exome cohort, nearly all patients and controls have at least one variant for this gene (
After analyses by SKAT-O the genes statistically significant (at p<0.01) in Set I and Set II were compared. One hundred and twenty (120) genes were identified by this approach.
The scope of the claims should not be limited by the preferred embodiments set forth in the examples, but should be given the broadest interpretation consistent with the description as a whole.
This application is a PCT application Serial No PCT/CA2017/* filed on Aug. 23, 2017 and published in English under PCT Article 21(2), which itself claims benefit of U.S. provisional application Ser. No. 62/378,297, filed on Aug. 23, 2016, which is incorporated herein in its entirety by reference.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/CA2017/050992 | 8/23/2017 | WO | 00 |
Number | Date | Country | |
---|---|---|---|
62378297 | Aug 2016 | US |