The present invention relates to genetic markers for Autism Spectrum Disorders (ASD), and methods of determining risk of ASD in an individual.
Autism (MIM 209850) is a severe, lifelong neurodevelopmental disorder characterized by impairments in communication and socialization, and by repetitive behavior. Autism is not a distinct categorical disorder but is the prototype of a group of conditions defined as Pervasive Developmental Disorders (PDDs) or Autism Spectrum Disorders (ASD), which include Asperger's Disorder, Childhood Disintegrative Disorder, Pervasive developmental disorder-not otherwise specified (PDD-NOS) and Rett Syndrome. ASD is diagnosed in families of all racial, ethnic and social-economic backgrounds with incidence roughly four times higher in males compared to females. Data from several epidemiological twin and family studies provide substantial evidence that autism has a significant and complex genetic etiology. The concordance rate in monozygotic twins is 60-90%, and the recurrence rate in siblings of affected probands has been reported to be between 5-10% representing a 50 fold increase in risk compared to the general population. Although autism spectrum disorders are among the most heritable complex disorders, the genetic risk is clearly not conferred in simple Mendelian fashion.
Recent studies of sub-microscopic genomic copy number variation (CNV) have identified several loci associated with Autism Spectrum Disorder (ASD; MIM 209850). De novo CNVs associated with ASD have been reported in ˜7% of simplex families and ˜2% of multiplex families. CNV studies have also led to the identification of autism candidate genes such as SHANK3 (MIM 606230) and NRXN1 (MIM 600565). Intellectual disability (ID) is frequently associated with autism (in up to ˜30% of cases for ASD, and ˜67% for autism). Moreover, mutations in several X-linked ID (XLID) genes (e.g. NLGN4 and IL1RAPL1) have been shown to result in an autistic phenotype, which suggests that autism and ID may often share a common genetic etiology. Currently available data suggest substantial genetic heterogeneity, with the most likely cause of non-syndromic idiopathic ASD involving multiple epistatically-interacting loci. The identification of large scale copy number variants (CNVs) represents a considerable source of genetic variation in the human genome that contributes to phenotypic variation and disease susceptibility found in small inherited deletions in autistic kindreds, suggesting possible susceptibility loci.
It would thus be desirable to characterize putative susceptibility loci to identify genetic markers of ASD, as well as to understand the role of candidate genes for ASD in order to facilitate determination of the risk of ASD in an individual, and to assist in the diagnosis of ASD.
Systematic screening at PTCHD1 and 5′-flanking regions, suggests involvement of this locus in ˜1% of autism spectrum disorder (ASD) and intellectual disability (ID) individuals. Provided herein are mutations in the X-chromosome PTCHD1 (patched-related) locus, which are useful in assessing the risk of ASD and/or the risk of ID in an individual, as well as being useful to diagnose carrier status of an individual, or other condition(s). Provided markers are useful both individually and in the form of a microarray to screen individuals for risk of ASD and/or ID or for carrier status for risk of ASD and/or ID.
Thus, in one aspect of the present invention, a method of determining the risk of ASD in an individual is provided, comprising analyzing a nucleic acid-containing sample obtained from the individual for the presence or absence of a genomic sequence mutation at the PTCHD1 locus, wherein the mutation comprises a deletion of a region upstream to the PTCHD1 gene (e.g., a deletion as set forth in Table 2), a disruption of a non-coding RNA (ncRNA) selected from PTCHD1AS1, PTCHD1AS2, or PTCHD1AS3, or splice variants of these ncRNAs, or a disruption of other regulatory elements upstream of the PTCHD1 coding region. Presence of the mutations has been found to be indicative of ASD.
These and other aspects of the present invention are described by reference to the following figures.
The patent or application file contains at least one drawing executed in color. Copies of this patent or patent application publication with color drawings will be provided by the Office upon request and payment of the necessary fee.
A method of determining the risk of an autism spectrum disorder (ASD) in an individual, or carrier status of an individual, is provided comprising screening a biological sample obtained from the individual for a mutation that may modulate the expression of PTCHD1.
The term “an autism spectrum disorder” or “an ASD” is used herein to refer to at least one condition that results in developmental delay of an individual such as autism, Asperger's Disorder, Childhood Disintegrative Disorder, Pervasive Developmental Disorder-Not Otherwise Specified (PDD-NOS) and Rett Syndrome (APA DSM-IV 2000).
The term “intellectual disability” or “ID” refers to a disability originating before age 18, characterized by significant limitations in both intellectual functioning and adaptive behavior as expressed in conceptual, social, and practical adaptive skills.
Microdeletions that directly disrupt the PTCHD1 gene have been identified in males in families affected with ASD, ID or learning disability. Identified deletions are maternally inherited and were not observed in more than 10,000 controls, indicating that these alterations are associated with ASD and ID. Maternally inherited missense mutations in PTCHD1 in male probands have also been reported.
PTCHD1 encodes a Patched-related protein with 12 transmembrane domains and a sterol-sensing domain, structurally similar to the Hh receptors PTCH1 and PTCH2, as well as the Niemann-Pick Type C1 protein (NPC1) and several others. Many Patched-related genes have been found in various organisms, from nematodes to humans, and they appear to play diverse biological functions, including cytokinesis, growth and pattern formation (Zugasti, O. et al., Genome Res. 15, 1402-1410 (2005)). For instance, there are just seven patched-related genes in humans (PTCH1, PTCH2, PTCHD1, PTCHD2, PTCHD3, NPC1 and c6orf138, whereas in C. elegans there are at least 26 patched-related genes, with diverse roles in development in addition to Hh signaling, including cytokinesis, growth and pattern formation (Zugasti, O. et al., Genome Res. 15, 1402-1410 (2005)). We have found in 10T1/2 cells, an inhibitory effect of PTCHD1 was demonstrated on Gli-dependent transcription. Although these results suggest that PTCHD1 exhibits biochemical activity in Hh-dependent processes similar to that of PTCH1 and 2, other functions or roles for PTCHD1 cannot be excluded at this point.
We have further characterized the PTCHD1 locus and found variants identified in PTCHD1 were not seen in more than 500 controls, further supporting a role of PTCHD1 in autism and ID. As used herein, the term “PTCHD1 locus” refers to the region in the X chromosome which extends from about the distal-most exon of mRNA clone DA355362 at the distal end to a proximal boundary which at least includes the coordinate according to the UCSC 2006/hg18 build ChrX:23,329,120 and which may extend to BX115199 as illustrated in
Furthermore, 10 deletions were found that map to regions upstream of the coding region of PTCHD1. The region 5′ and distal to PTCHD1 is relatively gene poor. Within this upstream region, a coding gene, DDX53, encoding DEAD Box 53, lies ˜335 Kb 5′ to PTCHD1. Five of the 10 upstream deletions span DDX53. However, based on the function of the DDX53 protein and the expression pattern of this gene (which is restricted mainly to testis and tumor cells (Cho, B. et al., Biochem. Biophys. Res. Commun. 292, 715-726 (2002)), it is unlikely to contribute to the ASD or ID phenotype. Additionally, within the gene-poor region between PTCHD1 and DDX53, there is a putative pseudogene of FAM3C, FAM3C2, which is disrupted by five of the 10 upstream deletions. FAM3C, a cytokine-like gene on 7q31.31, consists of 10 exons (Zhu, Y. et al., Genomics 80, 144-150 (2002)) whereas FAM3C2, although 99% identical, has no intron/exon structure and is interrupted by a short interspersed nuclear element (SINE). It appears to have inserted on Xp22 after human/chimp evolutionary divergence. Since no mRNA or EST matches exactly to FAM3C2, it is most likely an untranscribed processed pseudogene.
The region just distal to PTCHD1 was examined in detail and a number of putative enhancer and promoter sequences were identified, as well as conserved (and putative regulatory) elements (
Although the ncRNAs do not appear to encode protein, they may serve as regulators for other coding genes, particularly for PTCHD1, since the 5′ exons are adjacent on opposite strands. Such ncRNAs may regulate expression of a coding transcript on the opposite strand through a number of mechanisms, including modification of chromatin, transcriptional regulation and post-transcriptional modification (Mercer, T. R. et al., Nat. Rev. Genet. 10, 155-159 (2009); Kleinjan, D. A et al., Am. J. Hum. Genet. 76, 8-32 (2005)).
All of the upstream deletions identified, as well as PTCHD1 deletions (e.g., Family 1) disrupt conserved (and putative regulatory) sequences and/or exons of ncRNAs (see
Thus, in ASD individuals there is evidence for the possible involvement of more than one locus in the disease, and these findings may support the threshold model of relative contribution in ASD and polygenic inheritance in autism. As such, some de novo CNVs may be highly penetrant in causing ASD susceptibility (e.g. disruption of PTCHD1 in Family 1). Conversely, other de novo CNVs (e.g. DPP6 and DPYD deletions) may have more subtle effects, requiring contributions of additional loci (e.g. PTCHD1 missense mutations in the case of Families 9 & 10) for ASD to be phenotypically evident. This scenario may also apply to the ID families with PTCHD1 mutations.
Cerebellar abnormalities have frequently been linked to autism, including recent magnetic resonance imaging (MRI) studies showing significant decrease in cerebellar grey matter (Courchesne, E. et al., Neurology 57, 245-254 (2001); Toal, F. et al., Br. J. Psychiatry 194, 418-425 (2009)), and decreased cerebellar connectivity and activity (Mostofsky, S. H. et al., Brain 132, 2413-2425 (2009)).
In the present methods, it is possible to determine ASD risk in an individual, as well as to determine carrier status of an individual (e.g., testing of females for the presence of mutations associated with ASD, to determine whether they are carriers). In the methods, a biological sample obtained from the individual is utilized. A suitable biological sample may include, for example, a nucleic acid-containing sample or a protein-containing sample. Examples of suitable biological samples include saliva, urine, semen, other bodily fluids or secretions, epithelial cells, cheek cells, hair and the like. Although such non-invasively obtained biological samples are preferred for use in the present method, one of skill in the art will appreciate that invasively-obtained biological samples, may also be used in the method, including for example, blood, serum, bone marrow, cerebrospinal fluid (CSF) and tissue biopsies such as tissue from the cerebellum, spinal cord, prostate, stomach, uterus, small intestine and mammary gland samples. Techniques for the invasive process of obtaining such samples are known to those of skill in the art. The present method may also be utilized in prenatal testing for the risk of ASD using an appropriate biological sample such as amniotic fluid and chorionic villus.
In one aspect, the biological sample is screened for nucleic acid encoding selected genes in order to detect mutations associated with an ASD. It may be necessary, or preferable, to extract the nucleic acid from the biological sample prior to screening the sample. Methods of nucleic acid extraction are well-known to those of skill in the art and include chemical extraction techniques utilizing phenol-chloroform (Sambrook et al., 1989), guanidine-containing solutions, or CTAB-containing buffers. As well, as a matter of convenience, commercial DNA extraction kits are also widely available from laboratory reagent supply companies, including for example, the QIAamp DNA Blood Minikit available from QIAGEN (Chatsworth, Calif.), or the Extract-N-Amp blood kit available from Sigma (St. Louis, Mo.).
Once an appropriate nucleic acid sample is obtained, it is subjected to well-established methods of screening, such as those described in the specific examples that follow, to detect genetic mutations indicative of ASD, i.e. ASD-linked mutations. Representative methods of screening include straight sequencing; use of arrays as described herein; as well as quantitative PCR (qPCR) and multiplex ligation-dependent probe amplification (MLPA). For example, various platforms can be used: affymetrix 500 k SNP arrays; Illumina 1M BeadChips; NimbleGen 385K arrays; Affymetrix 6.0 arrays; Illumina 550× arrays; and other platforms.
Mutations, including sequence mutations in coding and/or regulatory regions of a gene, as well as in flanking regions of a gene, have been found to be indicative of ASC. Representative mutations include, for example, genomic copy number variations (CNVs), which include gains and deletions of segments of DNA (e.g., segments of DNA greater than about 1 kb, such as DNA segments over about 50 kb, such as between 50 and 300 kb, or between about 300 and 500 kb); as well as base pair mutations such as nonsense, missense and splice site mutations.
Genomic sequence variations of various types in different genes have been identified as indicative of ASD. As described herein, deletions in the 5′ flanking region of PTCHD1 that disrupted a complex non-coding RNA (e.g., PTCHD1AS1, PTCHD1AS2, PTCHD1AS3), and potential regulatory element(s) in the PTCHD1 locus have been associated with ASD. In one embodiment, genomic sequence variations that alter the expression of PTCHD1 have been linked to ASD. The terminology “alter expression” refers broadly to sequence variations that may alter (e.g., inhibit, or at least reduce) any one of transcription and/or translation of the coding nucleic acid sequence of PTCHD1, as well as the activity of the PTCHD1 protein.
Genomic sequence variations other than CNVs have also been found to be indicative of ASD, including, for example, missense mutations which result in amino acid changes in a protein that may also affect protein expression. In one embodiment, missense mutations in the PTCHD1 gene have been identified which are indicative of ASD. In certain embodiments, a missense change is associated with a further genetic mutation and the presence of the combination of the missense change and the deletion is associated with ASD.
In another embodiment, sequence variations associated with ASD include deletions in the region that is within the 5′ region upstream of the PTCHD1 gene (e.g., in whole or in part, or a portion or more of the upstream region thereof). In certain embodiments, mutations include deletions (e.g., deletions described in Table 2). The term “upstream region,” as used herein, refers to a region that is distal to the PTCHD1 gene within approximately 1.2 mbp. For example, in one embodiment, the region comprises cDNA clone BRSTN2000219 (DA355362) (see
To determine risk of ASD in an individual, it may be advantageous to screen for multiple genomic mutations, including CNVs and/or mutations as indicated above applying array technology. In this regard, genomic sequencing and profiling, using well-established techniques as exemplified herein in the specific examples, may be conducted for an individual to be assessed with respect to ASD risk/diagnosis using a suitable biological sample obtained from the individual. Identification of one or more mutations associated with ASD would be indicative of a risk of ASD, or may be indicative of a diagnosis of ASD. This analysis may be conducted in combination with an evaluation of other characteristics of the individual being assessed, including for example, phenotypic characteristics.
In view of the determination of gene mutations which are linked to ASD, a method for determining risk of ASD in an individual is also provided in which the expression or activity of a product of an ASD-linked gene mutation is determined in a biological protein-containing sample obtained from the individual. Abnormal levels of the gene product or abnormal levels of the activity thereof, i.e. reduced or elevated levels, in comparison with levels that exist in healthy non-ASD individuals, are indicative of a risk of ASD, or may be indicative of ASD. Thus, a determination of the level and/or activity of the gene product of PTCHD1, may be used to determine the risk of ASD in an individual, or to diagnose ASD. Further, a determination of the level and/or activity of the gene product of PTCHD1AS1, PTCHD1AS2, and/or PTCHD1AS3 or splice variants thereof, may be used to determine the risk of ASD in an individual, or to diagnose ASD. As one of skill in the art will appreciate, standard assays may be used to identify and quantify the presence and/or activity of a selected gene product.
Embodiments of the invention are described by reference to the following specific exemplification which is not to be construed as limiting.
Subjects: CNVs at the PTCHD1 locus were initially assessed in 427 ASD patients as described (Marshall, C. R. et al., Am. J. Hum. Genet. 82, 477-488 (2008)). DNA samples from 900 individuals diagnosed with ASD were sequenced for PTCHD1 mutations, and compared to a reference nucleic acid sequence to identify mutations. In this regard,
Among the samples assessed, 400 samples were collected at three sites, namely The Hospital for Sick Children (HSC) in Toronto and child diagnostic centers in Hamilton, Ontario and St, John's, Newfoundland. Details of these samples are published elsewhere (Moessner, R. et al., Am. J. Hum. Genet. 81, 1289-1297 (2007)). 420 ASD cases were recruited at Montreal, details of these samples are published elsewhere (Gauthier, J. et al., Mol. Psychiatry 11, 206-213 (2006)). Another 80 ASD probands from the Autism Genetic Resource Exchange (AGRE) were also included. The second cohort of 996 autism probands was recruited at different sites as a part of the Autism Genome Project (AGP); ascertainment is described elsewhere (Pinto, D. et al., Nature 466, 368-372 (2010)). 246 male patients with intellectual disability were recruited from the UK, United States, Australia, Europe and South Africa as the IGOLD study. A subset of 225 from this cohort were also used for sequence analysis of PTCHD1. Details of these samples are published elsewhere (Tarpey, P. S. et al., Nat. Genet. 41, 535-543 (2009)). 167 unrelated patients diagnosed with ADHD were recruited through the Department of Psychiatry at the Hospital for Sick Children, Toronto. Microarray data from controls included 1,123 (M=623, F=500) controls recruited from northern Germany as a part of the PopGen project, 1,234 (M=586, F=648) healthy controls of European origin recruited from the province of Ontario, Canada, 1,287 (M=383, F=904) controls from the Study of Addiction: Genetics and Environment (SAGE), 1,320 (M=589, F=1320) controls from Children's Hospital of Philadelphia (CHOP), 4783 (M=2460, F=2323) controls were recruited by the Wellcome Trust Case Control Consortium, 440 (M=158, F=282) controls were recruited by The Centre of Addiction and Mental Health (CAMH) and GlaxoSmithKline (GSK), and 59 (M=30, F=29) from the Centre d'Etude Polymorphisme Humaine (CEPH) HapMap controls (total N=5,023). More than 650 Ontario controls were obtained from The Centre for Applied Genomics (TCAG) and The Centre for Addiction and Mental Health (CAMH) and sequenced. Institutional ethical review board approval (CAMH, HSC, CHOP and all other collaborating institutions) was obtained for the study, and informed written consent was obtained for each family. Details of the clinical findings in families with PTCHD1 mutations or CNVs are summarized in Table 1.
Copy Number Variation Analysis: Affymetrix 500K SNP arrays were used to assess CNVs in a cohort of 427 ASD cases. Details on the methods of copy number analysis and complete results are published elsewhere (Marshall, C. R. et al., Am. J. Hum. Genet. 82, 477-488 (2008)). Only the CNV result at PTCHD1 is described here. Another cohort of 996 autism probands was analyzed on 1M BeadChips (Illumina) (Pinto, D. et al., Nature 466, 368-372 (2010)). 246 male patients with ID were analyzed on a custom designed NimbleGen 385K array. Genomic DNA samples were sent to NimbleGen for the hybridizations to be performed. Each patient sample (Cy5-labelled) was co-hybridised with DNA from the reference sample NA10851 (Cy3-labelled; obtained from Coriell Cell Repository). After data normalisation, the ADM-1 algorithm (CGH Analytics 3.4, Agilent) was used for CNV discovery. The ADHD cohort was analyzed on Affymetrix 6.0 arrays. Three algorithms (Birdsuite, iPattern and Affymetrix Genotyping console (GTC)) were used to infer CNVs. The CEPH, PopGen and Ontario controls were analyzed on Affymetrix 6.0 arrays, SAGE controls were analyzed using 1M BeadChips (Illumina), and Illumina 550K arrays were used for the CHOP and CAMH\GSK controls. Similar methods were used to infer CNVs in controls. Fisher's Exact Test was used to calculate the two-tailed p value.
DNA Sequencing and Mutation Screening: PCR primers were designed with Primer 3 (v. 0.3.0) to amplify all three exons and intron-exon boundaries. PCR were performed under standard conditions, and products were purified and sequenced directly with the BigDye Terminator v3.1 Cycle Sequencing Ready Reaction Kit (Applied Biosystems).
X-Inactivation Studies: X Chromosome Inactivation assays were performed on genomic DNA extracted from peripheral blood as described (Allen, R. C. et al., Am. J. Hum. Genet. 51, 1229-1239 (1992)). Briefly, X Chromosome Inactivation was measured by the analysis of the (CAG)n repeat in the androgen receptor gene at Xq11-q12 before and after digestion with methylation sensitive restriction enzymes HhaI and HpaII. Quantitative PCR amplification of androgen receptor gene repeat alleles was compared, with and without restriction digestion, to determine the ratio of X-active/inactive alleles.
Expression Analysis and Protein Localization: Expression analysis and tissue distribution for PTCHD1, PTCHD1AS1 and PTCHD1AS2 was performed by RT-PCR, with a multiple tissue panel of first strand cDNA. The housekeeping gene G3PDH was used as a control. Origene human adult brain tissue panel was used to check the expression of PTCHD mRNA in different regions of the brain. qRT-PCR was performed with TaqMan Gene Expression assay Hs00288486, and samples were pre-normalized to GAPDH expression. Northern blot analysis was performed with a six tissue mRNA blot (BioChain). The BioChain FastHyb solution was used to hybridize the probe according to manufacturer's instructions. RNA in situ hybridization was performed on paraffin sections and whole-mounted fetal mouse and adult mouse brain using a 411 bp (chrX:152,008,934-152,009,344, UCSC Mouse July, 2007 (UCSC Genome Browser)) digoxigenin-labeled mouse antisense probe (and sense probe as negative control), using standard methods. To examine cellular localization of PTCHD1 protein, full-length human fetal brain PTCHD1 cDNA was PCR amplified and cloned into the pcDNA3.1/CT-GFP-TOPO expression vector (Invitrogen). After confirming sequence and orientation of the insert, COS-7 and SK-N-SH cells were transiently infected with 2 μg of purified construct DNA with SuperFect (Qiagen). 24 hours after transfection, the PTCHD1-GFP fusion protein was visualized in transfected cells using a Zeiss Axioplan 2 imaging microscope, equipped with the LSM510 array confocal laser scanning system, and the Zeiss LSM510 version 3.2 SP2 software package.
Luciferase Assays: A luciferase assay was performed to compare the effect of PTCH1, PTCH2 and PTCHD1 on Gli-dependent transcription with a previously described method (Nieuwenhuis, E. et al., Mol. Cell Biol. 26, 6609-6622 (2006)). Briefly, the 10T1/2 cells were transiently transfected with mixtures containing 0.1 μg β-galactosidase to normalize for transfection efficiency, 1 μg reporter plasmid (8× Glipro) encoding multimerized Gli binding sites fused to the luciferase gene and up to 1 μg of Gli2, PTCH1 or PTCH2 or PTCHD1. Gli-dependent transcription was measured and normalized by β-galactosidase. Data were replicated in independent experiments performed in triplicates. In another assay, 10T1/2 cells were transiently transfected with mixtures containing 0.1 μg β-galactosidase, 1 μg 8× Glipro reporter plasmid and purmorphamine, PTCH1 or PTCH2 or PTCHD1. The effect of PTCH1, PTCH2 and PTCHD1 on the endogenous Gli-dependent transcription was measured. Statistical significance was calculated asp below 0.05, using the Student's t-test.
Cytogenetic and CNV analysis of proband from Family 9: Localization of translocation breakpoints was performed by fluorescence in situ hybridization (FISH; performed in accordance with standard procedures) initially using bacterial artificial chromosome (BAC) clones across the suspected breakpoint regions, and then narrowing the search using fosmid clones. BAC clones were obtained from the RP11 human genomic library, and fosmid clones from the Whitehead fosmid library WIBR2. For the chromosome 19 locus, the clone G248P85500F11 was translocated, and thus distal to the breakpoint, while clone G248P85559B4 was not translocated, and thus proximal to the breakpoint. The breakpoint therefore lies within a 32 Kb region between these two clones (UCSC March 2006: Chr19: 7,843,511-7,874,724. This region encompasses just two genes: FLJ22184, LRRC8E. At the chromosome 21 translocation site, fosmid clone G248P87249E2 was translocated, and G248P89542E9 was not translocated, and the breakpoint thus lies within a ˜14.5 Kb region between these two clones, within an intron of the RUNX1 gene.
Whole-genome SNP analysis was performed using the Affymetrix 260K NspI SNP microarray. Analysis using the dCHIP and CNAG programs indicated a loss of heterozygosity from SNPs rs10875047 at Chr1:97,367,581 and rs822559 at Chr1:98,424,675 (inclusive; UCSC March 2006). This apparent deletion spans from intron 20 of the gene DPYD to include the first 20 DPYD exons, as well as two proximal putative genes, AK094607 and AX747691.
CNV Analysis of PTCHD1: Precise breakpoints of the 167 Kb deletion at PTCHD1 identified in the male proband from Family 1 were characterized. This CNV also disrupts long, spliced non-coding RNAs (ncRNAs) on the opposite strand that codes for PTCHD1, however, no other coding genes were interrupted. See
The 167 kb deletion was validated in the family using both PCR and SYBR-Green I-based real-time quantitative PCR (qPCR) and was found to be transmitted from a heterozygous unaffected mother to two affected dizygotic twin sons, also to an unaffected daughter (
Mutation Screening of PTCHD1: In order to identify additional cases with PTCHD1 mutations, the coding regions in 900 (M=723; F=177) unrelated ASD cases and 225 unrelated male ID cases were sequenced. Missense changes were identified in unrelated ASD probands and ID probands (
All of these variants, which resulted in the substitution of highly conserved amino acids, were inherited from unaffected carrier mothers (
The entire coding region of PTCHD1 was sequenced in 700 control individuals (M=531 F=169), and none of the missense changes identified from among the ASD and ID patient cohorts has been detected. Only two missense changes have been identified: P252L from amongst the controls, and N497K reported in the SNP database (rs35880456, in 1 out of 39 screened; NCBI), both in females who were heterozygotes. Altogether, absence of PTCHD1 missense variants indicates that these variants are significantly enriched in the males with ASD (6/723 male ASD versus 0/531 male control: Fisher's exact test: p=0.042) and may contribute to the phenotype.
Additional controls were sequenced for the exons in which missense mutations were identified. Control chromosomes were tested for the sequence underlying the I173V and V1951 mutations (N=1101 chromosomes), the ML336—337II mutation (N=1193), and the L73F and E479G mutations (N=869) and detected none of these variants.
CNVs upstream of PTCHD1 (PTCHD1AS1/PTCHD1AS2 locus): Copy number variations were also identified upstream of the coding region for PTCHD1. A study of 996 ASD families examined with the Illumina 1M BeadChip (Pinto, D. et al., Nature 466, 368-372 (2010)) identified deletions in probands or affected siblings, and in a father with a diagnosis of Broad Autism Phenotype (BAP) (Hurley R. S. et al., J. Autism Dev. Disord. 37, 1679-1690 (2007); Constantino, J. N. et al., Biol. Psychiatry 57, 655-660 (2005)). All of the upstream CNVs occurred 5′ of PTCHD1, and overlapping with an anti-sense non-coding RNA, PTCHD1AS1/PTCHD1AS2. A tenth deletion at this upstream locus was identified in a patient from a CNV study of 167 unrelated attention deficit-hyperactivity disorder (ADHD) patients. The ADHD proband with the deletion also has a BAP diagnosis. See
In
The breakpoints of the deletions for all families that are reported here were mapped by sequencing the junction. Breakpoints for all CNVs in controls were mapped by using the physical positions of microarray probe fragments. Deletions were validated with qPCR and exact breakpoints at the PTCHD1 locus were mapped (See Table 2). Additional CNV data for the individuals in other regions is included in Table 3.
SNP microarray data was analyzed from 10,246 control individuals (4,829 male; 5,417 female), for CNVs at PTCHD1 and the upstream region. In a 1.4-Mb region spanning from PTCHD1 to adjacent genes PRDX4 (proximal) and ZNF645 (proximal), 15 CNVs were identified (7 duplications and 8 deletions); however, it is notable that only 1 male control with a deletion was identified, which was 20.6 Kb in length and did not disrupt any known exons of any genes or non-coding RNAs, or any of the identified conserved or putative regulatory sequences. The remaining 7 deletions were all identified among female controls, consistent with the X-linked recessive inheritance observed for the PTCHD1 mutations. Thus, PTCHD1 and upstream deletions were not observed in 4,829 male controls, or in the Database of Genomic Variants (Iafrate, A. J. et al., Nat. Genet. 36, 949-951 (2004)), which suggests that the CNV directly disrupting PTCHD1 and the 6 CNVs located just upstream in unrelated ASD probands are associated with autism (male ASD cases N=7, out of 1,185; male controls N=0 out of 4,829; Fisher's exact test: p=1.2×10−5).
Expression and Functional Studies of PTCHD1: Expression analysis for the PTCHD1 and the ncRNA transcripts suggests that they are transcribed in brain regions, notably the cerebellum, as well as in other tissues (data not shown). RNA in situ hybridization of Ptchd1 in mouse showed widespread expression in the developing brain from E9.5/10.5 to P1 (data not shown), as well as broad expression in the adult mouse brain (6 months), with highest density in the cerebellum (see Allen brain atlas online (Allen Institute mouse brain atlas in situ hybridization data for Ptchd1: http://mouse.brain-map.org/brain/Ptchd1.html)).
Gene expression and genes co-expressed with PTCHD1 were also analyzed, from gene Affymetrix gene expression microarray analysis from BioGPS (Gene Atlas U133A, gcrma; http://biogps.gnf.org); UCLA Gene Expression Tool (UGET: http://genome.ucla.edu/˜jdong/GeneCorr.html; using human HG-U133_Plus—2 microarrays (2), and correlation with mouse Ptchd1 using UGET and Mouse430—2 microarrays. These algorithms correlate expression based on banked Affymetrix gene microarray data, and is not tissue specific. Ranking counts multiple probes as single hits, and excludes hypothetical proteins.PTCHD1 gene expression showed high correlation with expression of other cerebellar genes such as ZIC1, CADPS2, EN2, CBLN1, and with synaptic genes such as PCLO, NRXN3, SNAP25, SYT2, DPP6 and DPP10 (see Table 4).
To investigate its function, the sub-cellular localization of PTCHD1 was studied. It was found that a PTCHD1-GFP fusion protein predominantly localizes to the cell membrane (data not shown). It was further hypothesized that PTCHD1 may function in the Hh-signaling pathway and have similar functional attributes as PTCH1 and PTCH2. A Gli-dependent transcription assay was performed in Hh-responsive 10T1/2 cells to test whether PTCHD1 could interfere with Hh signaling. In 10T1/2 cells, overexpression of PTCH1 or PTCH2 inhibits transcription from a Gli-luciferase reporter containing multiple copies of the Gli protein-binding site in the presence of Smoothened agonist purmorphamine (Sinha, S. and J. K. Chen, Nat. Chem. Biol. 2, 29-30 (2006)) or Gli2 (data not shown). Similar to PTCH proteins, PTCHD1 also exerted a statistically significant inhibitory effect in these assays suggesting that PTCHD1 functions in the Hedgehog signalling pathway.
RT-PCR failed to find evidence for a shortened 3′ PTCHD1 transcript from individual with PTCHD1 exon 1 deletion: It was speculated that the difference in phenotype between the PTCHD1 deletion families, could be explained by residual PTCHD1 protein function in relevant brain regions in Family 1 due to downstream transcription and translation of a shorter isoform, possibly driven by a secondary promoter just upstream of exon 2, resulting in the milder ASD symptoms, rather than the severer ID with the full deletion. However, RT-PCR did not detect any evidence of shorter downstream transcripts.
RT-PCR and 5′ RACE (Rapid Amplification of cDNA Ends) analysis of the ncRNAs, PTCHD1AS1 and PTCHD1AS2 and the PTCHD1 gene: By RT-PCR, the annotated exons of PTCHD1AS1 and PTCHD1AS2 were amplified from human cerebellum cDNA. Sequencing of RT-PCR product confirmed the current annotation of the ncRNAs. Additionally, the annotation of PTCHD1AS1 was verified by re-sequencing of the IMAGE clone 1560626.
It was attempted to identify additional 5′ sequence of the ncRNAs and PTCHD1 by 5′ RACE analysis using the Clontech Marathon-Ready™ fetal brain cDNA (Cat. No. 639300). According to the manufacturer instructions the gene specific primers were designed for PTCHD1AS1, PTCHD1AS2 and PTCHD1 and RT-PCR was performed. The PCR products were cloned into the Promega pGEM®-T Easy Vector and the clones were sequenced using standard methods. No additional upstream sequence for PTCHD1 could be found; however, for the PTCHD1AS1 at least two additional exons were identified. One of these exons completely overlaps with the PTCHD1AS2 exon 2 (chrX:23,198,089-23,198,215), while the second exon mapped further upstream at chrX:23,261,313-23,261,767 (UCSC 2006). RT-PCR also identified another splice variant with an initial exon at ChrX:23,262,967-23,262,009, which skips to exon 2 in the current annotation of PTCHD1AS1. It is possible that the extremely GC-rich nature of the 5′ region of PTCHD1 prevented the finding of additional upstream sequence.
Alternative 5′ exons for PTCHD1AS1, identified by 5′RACE, are shown in Table 5 below.
The relevant sequences are as follows:
Putative promoter and enhancer sequences in intergenic region between DDX53 and PTCHD1: The identification of predicted promoter sequences may indicate the presence of an alternative upstream transcription start site for PTCHD1 (or possibly another unknown gene), that may be disrupted by the CNVs identified upstream of PTCHD1 in ASD families (see supra). The Genomatix ElDorado suite was used to predict promoter sequences. The promoter sequence for DDX53 is (hg18/UCSC March 2006 build):
In addition to promoter sequences at the 5′ ends of DDX53 and PTCHD1, on the plus strand a putative promoter sequence was identified in the intergenic region, from ChrX:22,927,508-22,928,108 (hg18/UCSC March 2006 build):
This putative promoter lies ahead of ENSEMBL predicted non-coding transcript ENST00000407873. On the minus strand a putative promoter sequence was identified in the intergenic region, from ChrX: chrX:23,022,123-23,022,723, which lies just ahead of ENSEMBL predicted non-coding transcript ENST00000356867 and an EST clone (AU118198) (hg18/UCSC March 2006 build):
The ElDorado Suite from Genomatix, as well as the FPROM algorithm from the Softberry suite, was also used to predict promoter/enhancer sequences just upstream of the FAM3C2 predicted pseudogene.
Comparative sequence analysis indicated a number of regions located in the gene desert upstream of PTCHD1 and between DDX53 where nucleotide sequence conservation is relatively high through vertebrate evolution or through mammalian evolution. Such conserved regions may represent functional regions, possibly cis-regulatory sequences for PTCHD1. Regions were selected through the Vertebrate Multiz Alignment & PhastCons Conservation (28 Species) track on the UCSC (March 2006 build) browser. Results are shown in Table 1 and indicate which conserved elements overlap with CNV losses upstream of PTCHD1.
eQTL at PTCHD1 locus: The SNP rs7878766, located within PTCHD1 intron 1, has been reported as a quantitative trait locus for expression of mRNA levels of MAP8KIP2 in control brain cortex (http//eqtl.uchicago.edu), with a QTL score of 5.3. RefSeq Summary reports this to encode a scaffold protein involved in the c-Jun N-terminal kinase signaling pathway, and is thus thought to act as a regulator of signal transduction. Using mRNA by SNP Browser 1.0.1, other SNPs at the PTCHD1 locus that showed as suggestive QTLs for mRNAs included rs5925800 (ACSM2A; LOD=5.039, p=1.5×10−6; GALNT4, LOD=5.095, p=1.3×10−6; PIK3C2G, LOD=5.27, p=8.4×10−7), rs868659 (DLEU2, LOD=5.427, p=5.8×10−7), and rs6526278 (SGCG, LOD=5.248, p=8.8×10−7).
In summary, the data indicate that mutations at the PTCHD1 locus are highly penetrant and strongly associated with ASD (including BAP) and ID in ˜1.1% and ˜1.3% of the individuals analyzed, respectively (based on probands for whom comprehensive mutation screening, for both CNVs and sequence variants, has been performed (4 out of 353 ASD, and 3 out of 225 ID). As one of skill in the art will appreciate, mutations indicative of ASD and ID may vary from the exact CNVs identified (e.g. in Table 2 or other mutations), but will include at least a portion of one or more of the identified CNVs.
Overall, the findings are reminiscent of genetic findings for several other X chromosome genes, including NLGN4 (Jamain, S. et al., Nat. Genet. 34, 27-29 (2003); Laumonnier, F. et al., Am. J. Hum. Genet. 74, 552-557 (2004)) and IL1RAPL1 (Bhat, S. S. et al., Clin. Genet. 73, 94-96 (2008); Piton, A. et al., Hum. Mol. Genet. 17, 3965-3974 (2008); Carrie, A. et al., Nat. Genet. 23, 25-31 (1999)), in that mutations can apparently cause either ASD or ID (or both), and thus PTCHD1 may be a gene for both. IL1RAPL1, for example, was initially reported as a gene for non-syndromic X-linked ID (Carrie, A. et al., Nat. Genet. 23, 25-31 (1999)), and then subsequently was also found to harbor mutations in ASD pedigrees (Bhat, S. S. et al., Clin. Genet. 73, 94-96 (2008); Piton, A. et al., Hum. Mol. Genet. 17, 3965-3974 (2008)). Families have also been identified in whom at least two loci may be contributing to the pathogenesis of ASD, and other families bearing upstream microdeletions that disrupt a complex non-coding RNA, providing possible genetic explanations for the clinical heterogeneity of these disorders. Finally, the results raise the possibility that Hh signaling may be perturbed in these conditions.
Number | Date | Country | Kind |
---|---|---|---|
2,744,424 | Jun 2011 | CA | national |
This application claims the benefit of U.S. Provisional Application No. 61/382,834, filed on Sep. 14, 2010. This application claims priority under 35 U.S.C. §119 or 365 to Canadian Application No. 2,744,424, filed Jun. 9, 2011. The entire teachings of the above applications are incorporated herein by reference.
Number | Date | Country | |
---|---|---|---|
61382834 | Sep 2010 | US |