Male fertility over the past century has been observed to have dramatically declined, with recent analysis of data for the past 50 years noting a 50% reduction in male sperm counts. The primary causal factors suggested are environmental exposures influencing testis biology and sperm production. In rodent models, a number of defined toxicants and other exposures promote testis effects associated with a reduction in sperm number. The current estimated infertility range is approximately 15-20% of the human male population. A common strategy for medically assisted reproduction when male factor infertility is identified involves in vitro fertilization and intracytoplasmic sperm injection (ICSI), which are invasive and expensive procedures. In addition to low sperm counts associated with infertility, there is also an increase in idiopathic infertility, which can have normal sperm cohorts and motility. While seminal parameters are commonly used to screen for male factor infertility, the sperm number, motility and shape cannot fully explain the infertility. The development of a clinical diagnostic analysis based on molecular alterations in the sperm would help address this clinical problem.
In an aspect, the present disclosure provides methods for providing a likelihood of fertility in a subject, comprising assaying a nucleic acid sequence from at least a portion of a sperm sample from a subject; detecting a methylation alteration of a portion of said nucleic acid sequence comprised in a differential DNA methylation region (DMR) listed in Table 2, thereby generating an epigenetic profile; and analyzing said epigenetic profile using a computer processor to compare said epigenetic profile with a reference epigenetic profile of a methylation level of at least a portion of a corresponding nucleic acid sequence comprised in said DMR listed in Table 2, wherein when said DMR is DMRMT:1, at least a portion of a nucleic acid sequence comprised in a second DMR, optionally listed in Table 2, is detected and analyzed.
In some embodiments, the methods further comprise determining a likelihood of fertility in said subject at least based in part on said analyzing. In some embodiments, the subject is infertile or has a reduced fertility relative to a normal subject.
In some embodiments, the methods further comprise administering a treatment to said subject. In some embodiments, the treatment comprises performing in vitro fertilization (IVF).
In some embodiments, the treatment comprises performing intracytoplasmic sperm injection (ICSI). In some embodiments, the treatment comprises administering a therapeutic effective amount of follicle stimulating hormone (FSH), or an analog thereof to said subject. In some embodiments, the treatment comprises administering a therapeutic effective amount of human menopausal gonadotropin (hMG), or an analog thereof to said subject.
In some embodiments, the reference epigenetic profile comprises a methylation level of a nucleotide sequence of a fertile subject.
In some embodiments, the detecting comprises measuring an epigenetic alteration of: six or more, ten or more, fifteen or more, twenty or more, thirty or more, forty or more, fifty or more, sixty or more, seventy or more, eighty or more, ninety or more, or one hundred or more DMRs listed in Table 2. In some embodiments, the detecting comprises measuring a methylation alteration of 1-217 DMRs listed in Table 2. In some embodiments, the detecting comprises measuring a methylation alteration of 1-50 DMRs listed in Table 2. In some embodiments, the detecting comprises measuring a methylation alteration of 100-217 DMRs listed in Table 2. In some embodiments, the detecting comprises measuring a methylation alteration of 50-150 DMRs listed in Table 2.
In another aspect, the present disclosure provides methods, comprising: assaying a nucleic acid sequence from at least a portion of a sperm sample from a subject; detecting a methylation alteration of at least a portion of said nucleic acid sequence comprised in a differential DNA methylation region (DMR) listed in Table 3, thereby generating an epigenetic profile; and analyzing said epigenetic profile using a computer processor by comparing to a reference epigenetic profile of methylation level of at least a portion of corresponding nucleic acid sequence comprised in said DMR listed in Table 3.
In some embodiments, when administering a treatment, the methods further comprise determining whether said subject responds to a treatment. In some embodiments, the treatment comprises administering a therapeutic effective amount of follicle stimulating hormone (FSH), or an analog thereof to said subject. In some embodiments, the treatment comprises administering a therapeutic effective amount of human menopausal gonadotropin (hMG), or an analog thereof to said subject.
In some embodiments, wherein when said subject does not respond to said treatment, the methods further comprise performing IVF. In some embodiments, wherein when said subject does not respond to said treatment, the methods further comprise performing ICSI.
In some embodiments, the reference epigenetic profile comprises a methylation level of a nucleotide sequence of a subject that responds to said treatment. In some embodiments, the subject has increased sperm number or sperm motility after receiving said treatment.
In some embodiments, the detecting comprises measuring an epigenetic alteration of: six or more, ten or more, fifteen or more, twenty or more, thirty or more, forty or more, fifty or more DMRs listed in Table 3.
The method of any of claims 15-22, wherein said detecting comprises measuring an epigenetic alteration of 1-56 DMRs listed in Table 3. In some embodiments, the detecting comprises measuring an epigenetic alteration of 1-20 DMRs listed in Table 3. In some embodiments, the detecting comprises measuring an epigenetic alteration of 30-56 DMRs listed in Table 3. In some embodiments, the detecting comprises measuring an epigenetic alteration of 1-35 DMRs listed in Table 3.
In some embodiments, the assaying comprises performing a sequencing analysis, a pyrosequencing analysis, a microarray analysis, or any combination thereof. In some embodiments, the sequencing analysis comprises a methylated DNA immunoprecipitation (MeDIP) sequencing. In some embodiments, the MeDIP comprises using an antibody that binds to a methylated base (mB). In some embodiments, the hmB is 5-methylated base (5-mB). In some embodiments, the 5-hmB is a 5-methylated cytosine (5-mC).
In some embodiments, the epigenetic profile comprises an increased methylation level. In some embodiments, the epigenetic profile comprises a decreased methylation level. In some embodiments, the nucleotide sequence comprises a cytosine phosphate guanine (CpG) region. In some embodiments, the DMRs listed either in Table 2 or Table 3 comprise a CpG density that is less than 10 CpG regions per 100 bp nucleotides.
In some embodiments, the DMRs listed either in Table 2 or Table 3 are produced from about 95% of a genome. In some embodiments, the DMR listed in Table 2 has a range of about 1000 bp to about 50,000 bp nucleotide sequence. In some embodiments, the DMR listed in Table 2 has a range of about 1000 bp to about 4000 bp nucleotide sequence. In some embodiments, the DMR listed in Table 3 has a range of about 1000 bp to about 5000 bp nucleotide sequence. In some embodiments, the DMR listed in Table 3 has a range of about 1000 bp to about 2000 bp nucleotide sequence.
In some embodiments, Table 2 does not overlap with Table 3.
In some embodiments, the methods further comprise obtaining said sperm sample from said subject. In some embodiments, the methods further comprise contacting said nucleic acid sequence with a 5-mC specific antibody. In some embodiments, the methods further comprise contacting said nucleic acid sequence with a bisulfite.
In some embodiments, the subject is a human subject.
In some embodiments, the methods further comprise transmitting a result via a communication medium. In some embodiments, the result comprises an epigenetic profile, a reference epigenetic profile, or both.
In another aspect, the present disclosure provides a kit, comprising: bisulfite; a plurality of primers configured to detect a differential DNA methylation region (DMR) listed in Table 2 or Table 3; and a microarray chip or a DNA sequencing kit.
In another aspect, the present disclosure provides a computer-readable medium comprising machine-executable code that, upon execution by a computer processor, implements a method for determining a likelihood of fertility in a subject, comprising: assaying a nucleic acid sequence from at least a portion of a sperm sample from a subject; detecting a methylation alteration of a portion of said nucleic acid sequence comprised in a differential DNA methylation region (DMR) listed in Table 2, thereby generating an epigenetic profile; and analyzing said epigenetic profile using a computer processor to compare said epigenetic profile with a reference epigenetic profile of a methylation level of at least a portion of corresponding nucleic acid sequence comprised in said DMR listed in Table 2, wherein when said DMR is DMRMT:1, at least a portion of a nucleic acid sequence comprised in a second DMR, optionally listed in Table 2, is optionally detected, and is analyzed.
In another aspect, the present disclosure provides a computer-readable medium comprising machine-executable code that, upon execution by a computer processor, implements a method for determining whether a subject responds to a treatment, comprising: assaying a nucleic acid sequence from at least a portion of a sperm sample from a subject; detecting a methylation alteration of at least a portion of said nucleic acid sequence comprised in a differential DNA methylation region (DMR) listed in Table 3, thereby generating an epigenetic profile; and analyzing said epigenetic profile using a computer processor by comparing to a reference epigenetic profile of methylation status of at least a portion of corresponding nucleic acid sequence comprised in said DMR listed in Table 3.
Additional aspects and advantages of the present disclosure will become readily apparent to those skilled in this art from the following detailed description, wherein only illustrative embodiments of the present disclosure are shown and described. As will be realized, the present disclosure is capable of other and different embodiments, and its several details are capable of modifications in various obvious respects, all without departing from the disclosure. Accordingly, the drawings and description are to be regarded as illustrative in nature, and not as restrictive.
All publications, patents, and patent applications mentioned in this specification are herein incorporated by references to the same extent as if each individual publication, patent, or patent application was specifically and individually indicated to be incorporated by reference. To the extent publications or patents or patent applications incorporated by reference contradict the disclosure contained in the specification, the specification is intended to supersede and/or take precedence over any such contradictory material.
The following drawings form part of the present specification and are included to further demonstrate certain aspects of the present disclosure. The disclosure may be better understood by reference to one or more of these drawings in combination with the detailed description of specific embodiments presented herein.
The primary source of increased male factor infertility and decline in seminal parameters appear to be environmental exposures. This includes a variety of toxicants, endocrine disruptors, abnormal nutrition, smoking and alcohol, and stress. Animal models have demonstrated the direct actions of a number of environmental toxicants to reduce sperm numbers and promote testis disease and male infertility. Various human male exposures also have been shown to associate with poor sperm parameters and male infertility. The primary molecular actions considered involve environmental epigenetics.
Epigenetics is defined as “molecular factors or processes around DNA that regulate germline activity independent of DNA sequence and are mitotically stable”. One of the principal epigenetic processes involved in sperm abnormalities is DNA methylation. Cytosine methylation at CpG sites can alter gene expression, and within sperm these sites are associated with reduced fertility and promotion of disease in offspring. Altered sperm methylation has been shown to be a biomarker for environmental exposures that associate with various pathologies later in life. Although altered histone retention following protamine replacement in sperm and non-coding RNAs have also been shown to associate with male infertility, the primary epigenetic biomarker investigated in the current study involves DNA methylation.
Animal models initially demonstrated a correlation with sperm DNA methylation and male infertility. Human studies have also demonstrated a decreased fecundity associated with sperm DNA methylation alterations. A sperm DNA methylation biomarker assay has been developed and validated, which uses a microarray approach to assess CpG islands within the genome. Although this analysis only investigates approximately 1% of the genome, it has been shown to be useful in analysis of sperm DNA methylation in a clinical setting. Subsequently, studies with in vitro fertilization (IVF) applications have used measurement of DNA methylation with this biomarker analysis to assess male infertility prior to assisted reproduction. Since this previous analysis only examined a limited amount of the genome (i.e. <1%), the current study was designed to investigate a more genome-wide approach using low density CpG regions (i.e. 95% genome) to examine alterations in sperm DNA methylation.
A promising approach for the clinical therapy of male infertility is the use of endocrine therapeutics, similar to what is used in the female. For example, observations suggest a beneficial effect of FSH treatment on spontaneous pregnancy and live birth rate in men with idiopathic male factor infertility. Therapy with exogenous follicle stimulating hormone (FSH) is achieved by administration of urinary or recombinant FSH preparations or human menopausal gonadotropin (hMG) preparations, with the latter providing both FSH activity and luteinizing hormone (LH) activity. In women, FSH therapy is successfully used to stimulate oogenesis, and a similar approach would be expected to induce spermatogenesis. Due to the variable response within the infertile population, a diagnostic test to assess a responder versus non-responder individuals would be expected to significantly enhance the utility of FSH therapeutics.
All clinical therapeutic studies have identified responder and non-responder subpopulations. Those that are efficacious for the majority of the population are generally not as concerned with the non-responder population. When the majority of the disease population does not respond, such as immune therapy for arthritis, the advancement of a molecular diagnostic for the responder versus the non-responder population would be very useful in the management of the disease. Although a number of disease biomarkers or diagnostics have been identified for disease, few have been observed for specific responder versus non-responder subpopulations.
The following are definitions of terms that may be used in the present specification. The initial definition provided for a group or term herein applies to that group or term throughout the present specification individually or as part of another group, unless otherwise indicated.
Additionally, it will be understood that any list of such candidates or alternatives is merely illustrative, not limiting, unless implicitly or explicitly understood or stated otherwise.
As used in this specification and the appended claims, the singular forms “a”, “an” and “the” include plural referents unless the content clearly dictates otherwise.
The use of the word “a” or “an” when used in conjunction with the term “comprising” in the claims and/or the specification may mean “one,” but it is also consistent with the meaning of “one or more,” “at least one,” and “one or more than one.”
Throughout this application, the term “about” is used to indicate that a value includes the standard deviation of error for the device or method being employed to determine the value.
The use of the term “or” in the claims is used to mean “and/or” unless explicitly indicated to refer to alternatives only or the alternatives are mutually exclusive, although the disclosure supports a definition that refers to only alternatives and “and/or.”
As used in this specification and claim(s), the words “comprising” (and any form of comprising, such as “comprise” and “comprises”), “having” (and any form of having, such as “have” and “has”), “including” (and any form of including, such as “includes” and “include”) or “containing” (and any form of containing, such as “contains” and “contain”) are inclusive or open-ended and do not exclude additional, unrecited elements or method steps.
All scientific and technical terms used in this application have meanings commonly used in the art unless otherwise specified. As used in this application, the following words or phrases have the meanings specified.
The term “subject,” as used herein, may be any animal or living organism. Animals can be mammals, such as humans, non-human primates, rodents such as mice and rats, dogs, cats, pigs, sheep, rabbits, and others. Animals can be fish, reptiles, or others. Animals can be neonatal, infant, adolescent or adult animals. Humans can be more than about: 1, 2, 5, 10, 20, 30, 40, 50, 60, 65, 70, 75, or about 80 years of age. The subject may have or be suspected of having a condition or a disease, such as infertility or idiopathic infertility. The subject may be a patient, such as a patient being treated for a condition or a disease, such as an infertility patient. The subject may be predisposed to a risk of developing a condition or a disease such as infertility. The subject may be in remission from a condition or a disease, such as an infertility patient. The subject may be healthy or normal without any infertility issues.
The term sensitivity, or true positive rate, can refer to a test's ability to identify a condition correctly. For example, in a diagnostic test, the sensitivity of a test is the proportion of patients known to have the disease or condition, who will test positive for it. In some cases, this is calculated by determining the proportion of true positives (i.e. patients who test positive who have the disease) to the total number of individuals in the population with the condition (i.e., the sum of patients who test positive and have the condition and patients who test negative and have the condition).
“Infertility” generally refers to the inability of a sexually active, non-contracepting couple to achieve pregnancy in at least one year. The methods of the present disclosure relate to infertility that is attributable to the male subject. “Infertility” as used herein may also include subfertility which relates to reduced fertility compared to a normal subject that has no fertility issues for any period of time. Causes of infertility or reduced fertility in male subjects may include, for example, abnormal sperm production or function, problems with the delivery of sperm, overexposure to certain environmental factors such as pesticides, radiation, medication, cigarette smoke, etc., and damage related to cancer and its treatment.
“Epimutation,” “epigenetic modification,” as used herein generally refer to modifications of cellular DNA that affect gene expression without altering the DNA sequence. The epigenetic modifications are both mitotically and meiotically stable, i.e. after the DNA in a cell (or cells) of an organism has been epigenetically modified, the pattern of modification persists throughout the lifetime of the cell and is passed to progeny cells via both mitosis and meiosis. Therefore, with the organism's lifetime, the pattern of DNA modification and consequences thereof, remain consistent in all cells derived from the parental cell that was originally modified. Further, if the epigenetically modified cell undergoes meiosis to generate gametes (e.g. sperm), the pattern of epigenetic modification is retained in the gametes and thus inherited by offspring. In other words, the patterns of epigenetic DNA modification are transgenerationally transmissible or inheritable, even though the DNA nucleotide sequence per se has not been altered or mutated. Without being bound by theory, it is believed that enzymes known as methyltransferases shepherd or guide the DNA through the various phases of mitosis or meiosis, reproducing epigenetic modification patterns on new DNA strands as the DNA is replicated. Exemplary epigenetic modifications include, but are not limited, to DNA methylation, histone modifications, chromatin structure modifications, and non-coding RNA modifications, etc.
Further, the term “epigenetic modification” as used herein, may be any covalent modification of a nucleic acid base. In some cases, a covalent modification may comprise (i) adding a methyl group, a hydroxymethyl group, a carbon atom, an oxygen atom, or any combination thereof to one or more bases of a nucleic acid sequence, (ii) changing an oxidation state of a molecule associated with a nucleic acid sequence, such as an oxygen atom, or (iii) a combination thereof. A covalent modification may occur at any base, such as a cytosine, a thymine, a uracil, an adenine, a guanine, or any combination thereof. In some cases, an epigenetic modification may comprise an oxidation or a reduction. A nucleic acid sequence may comprise one or more epigenetically modified bases. An epigenetically modified base may comprise any base, such as a cytosine, a uracil, a thymine, adenine, or a guanine. An epigenetically modified base may comprise a methylated base, a hydroxymethylated base, a formylated base, or a carboxylic acid containing base or a salt thereof. An epigenetically modified base may comprise a 5-methylated base, such as a 5-methylated cytosine (5-mC). An epigenetically modified base may comprise a 5-hydroxymethylated base, such as a 5-hydroxymethylated cytosine (5-hmC). An epigenetically modified base may comprise a 5-formylated base, such as a 5-formylated cytosine (5-fC). An epigenetically modified base may comprise a 5-carboxylated base or a salt thereof, such as a 5-carboxylated cytosine (5-caC). In some cases, an epigenetically modified base may comprise a methyltransferase-directed transfer of an activated group (mTAG).
Epigenetic modifications may be caused by exposure to any of a variety of factors, examples of which include but are not limited to: chemical compounds e.g. endocrine disruptors such as vinclozolin; chemicals such as those used in the manufacture of plastics e.g. bispheol A (BPA); bis(2-ethylhexyl)phthalate (DEHP); dibutyl phthalate (DBP); insect repellants such as N, N-diethyl-meta-toluamide (DEET); pyrethroids such as permethrin; various polychlorinated dibenzodioxins, known as PCDDs or dioxins e.g. 2,3,7,8-tetrachlorodibenzo-p-dioxin (TCDD); extreme conditions such as abnormal nutrition, starvation, chemotherapeutic agents which include alkylating agents such as ifosfamide and cyclophosphamide, anthracyclines such as daunorubicin and doxorubicine, taxanes such as paclitaxel and docetaxel, epothilones, histone deacetylase inhibitors, topoisomerase inhibitors, kinase inhibitors such as gefitinib, platinum-based agents such as cisplatin, retinoids, and vinca alkaloids, etc.
Methylation level, as used herein, generally refers to a percentage of nucleotides of a nucleotide sequence that are methylated. DNA methylation is an epigenetic mechanism that occurs when a methyl group is added onto the C5 position of cytosine, thereby modifying gene function and affecting gene expression. Most DNA methylation occurs at cytosine residues that precede guanine residues, called CpG dinucleotides, which tend to cluster in DNA domains known as CpG islands. Methylation level may be measured on a genome wide basis of any CpG containing regions. Methylation alteration, as used herein, generally refers to an increase or decrease of a percentage of nucleotides of a nucleotide sequence that are methylated.
The term “nucleic acid sequence” as used herein may comprise DNA or RNA. In some cases, a nucleic acid sequence may comprise a plurality of nucleotides. In some cases, a nucleic acid sequence may comprise an artificial nucleic acid analogue. In some cases, a nucleic acid sequence comprising DNA, may comprise cell-free DNA, cDNA, fetal DNA, or maternal DNA. In some cases, a nucleic acid sequence may comprise miRNA, shRNA, or siRNA.
The term “fragment,” as used herein, may be a portion of a sequence, a subset that may be shorter than a full length sequence. A fragment may be a portion of a gene. A fragment may be a portion of a peptide or protein. A fragment may be a portion of an amino acid sequence. A fragment may be a portion of an oligonucleotide sequence. A fragment may be less than about: 20, 30, 40, 50 amino acids in length. A fragment may be less than about: 20, 30, 40, 50 oligonucleotides in length.
The term “biological sample” generally refers to any fluid or cellular sample or mixture thereof obtained from a living organism. The biological sample may be a reproductive sample, such as an egg or a sperm. Exemplary biological samples may include tissue biopsy, serum, plasma, and buccal cells.
The term “sequencing” as used herein, may comprise bisulfite-free sequencing, bisulfite sequencing, TET-assisted bisulfite (TAB) sequencing, ACE-sequencing, high-throughput sequencing, Maxam-Gilbert sequencing, massively parallel signature sequencing, Polony sequencing, 454 pyrosequencing, Sanger sequencing, Illumina sequencing, SOLiD sequencing, Ion Torrent semiconductor sequencing, DNA nanoball sequencing, Heliscope single molecule sequencing, single molecule real time (SMRT) sequencing, nanopore DNA sequencing, shot gun sequencing, RNA sequencing, Enigma sequencing, or any combination thereof.
A “plurality” as used herein generally refers to two or more DMRs, for example, two, three, four, five, six, and every integer up to and including all of the DMRs listed in table 2 or 3. A plurality may also refer to two or more DMRs listed in table 2 or 3 and every integer up to and including all DMRs listed in table 2 or 3.
The term “responder” as used herein, generally relates to patients for which the predicted response to the treatment/biological drug is positive, i.e., increasing in sperm numbers (sperm concentration), sperm motility, or both. Sperm numbers or sperm concentration may be measured by any suitable known methods. Further, sperm motility may be measured any suitable known methods. Similarly, the term “non-responder” as used herein, generally relates to patients for which the predicted response to the treatment/biological drug is negative.
The term “predicted response” or similar, as used herein refers to the determination of the likelihood that the patient will respond either favorably or unfavorably to a given therapy/biological drug. Especially, the term “prediction”, as used herein, relates to an individual assessment of any parameter that can be useful in determining the evolution of a patient. As it will be understood by those skilled in the art, the prediction of the clinical response to the treatment with a biological drug, although preferred to be, need not be correct for 100% of the subjects to be diagnosed or evaluated. The term, however, requires that a statistically significant portion of subjects can be identified as having an increased probability of having a positive response. Whether a subject is statistically significant can be determined without further effort by the person skilled in the art using various well known statistic evaluation tools, e.g., determination of confidence intervals, p-value determination, Student's t-test, Mann-Whitney test, etc. Preferred confidence intervals are at least 50%, at least 60%, at least 70%, at least 80%, at least 90% at least 95%. The p-values are, preferably, 0.2, 0.1 or 0.05.
Patients achieving complete or partial response are considered “responders”, and all other patients are considered “non-responders”.
Provided herein are differential DNA methylation regions (DMRs) which are useful for the identification of male subjects are infertile (Table 2). Also provided are DMRs which are useful for the identification of infertile male subjects who are responders to FSH therapy (Table 3). The tables provide the DMR name, chromosome location, start and stop base pair location, length in base pair (bp), number of significant windows (100 bp), p-value, number of CpG sites, CpG sites per 100 bp, and DMR associated gene symbol (annotation). Each start site corresponds to the GRCh38 reference genome (as originally released in December 2013) that is well known in the art. It is also known in the art that any “patches” to the GRCh38 genome that were subsequently released do not change the chromosomal coordinates of the reference genome. In the context of the present disclosure, the specific sequence where a DMR is located is not critical as methylation does not affect the underlying sequence. Disclosed herein are specific locations within the genome that contain differential methylation (irrespective of the underlying genome sequence) that is indicative of infertility or responsiveness to FSH therapy. Thus, the DMR may be located in within a sequence that is about 50% identical to any of the sequences listed in Table 2 or Table 3, e.g. at least about 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, or 99% identical.
In some embodiments, the level of methylation at a DMR is increased or decreased by at least about 10% as compared to a control or a reference sample collected from a normal subject without infertility. In some embodiments, the methylation level is determined by a cytosine. In some embodiments, the DMRs are associated with certain genes in an individual. In some embodiments, the DMRs are associated with certain CpG loci. The CpG loci may be located in the promoter region of a gene, in an intron or exon of a gene or located near the gene in a patient's genomic DNA. In an alternate embodiment, the CpG may not be associated with any known gene or may be located in an intergenic region of a chromosome. In some embodiments, the CpG loci may be associated with one or more than one gene.
In some instances, the DMRs described herein are found in CpG desert regions of the genome, e.g. a CpG density of about 10% or less or a mean around two CpG per 100 base pairs. Due to the evolutionary conservations of CpG clusters in a CpG desert, these are likely epigenetic regulatory sites. Additional genomic features of characteristic of ECRs are described in U.S. Patent Publication 2013/0226468 incorporated herein by reference. Those of skill in the art will recognize that the “%” of a sequence of interest (e.g. CpG) means that the sequence occurs the indicated number of times per 100 base pairs analyzed, e.g. 15% or less CpG means that the dinucleotide sequence C followed by G occurs at most 15 times per 100 base pairs within a DNA segment that is analyzed. Analyses are usually carried out by iterative analysis of consecutively overlapping sequences within a large DNA molecule of interest, e.g. a chromosome, a section of a chromosome, etc.
The DMRs provided herein allow for determining if a male subject is infertile comprising detecting a methylation alteration of a portion of said nucleic acid sequence comprised in a differential DNA methylation region (DMR) listed in Table 2, thereby generating an epigenetic profile; and analyzing said epigenetic profile using a computer processor to compare said epigenetic profile with a reference epigenetic profile of a methylation level of at least a portion of corresponding nucleic acid sequence comprised in said DMR listed in Table 2, wherein when said DMR is DMRMT:1, a second DMR is detected and analyzed. For example, if a DMR from Table 2 comprises 2000 bp nucleotides, in some embodiments, at least about 50%, about 60%, about 70%, about 80%, about 90%, or about 100% of the 2000 bp nucleotides may be measured to determine the methylation level. The DMR named DMRMT:1 as listed in Table 2 is associated with genes, such as RNR1 as listed in Table 2. In some embodiments, the second DMR is selected from Table 2. In some embodiments, the second DMR is not selected from Table 2.
In some embodiments, the methylation level of each DMR contained in a subject's epigenetic profile is measured by methods disclosed herein, such as methylated DNA immunoprecipitation (MeDIP) sequencing. In some embodiments, the methylation level of a corresponding DMR contained in a reference epigenetic profile from a healthy normal subject is measure by methods disclosed herein or obtained from public information. Then the methylation levels of DMRs are compared between the epigenetic profile and the reference epigenetic profile using any suitable methods including suitable computer programs.
In some embodiments, the epigenetic profile comprises a plurality of DMRs selected from the group listed in table 2. In other embodiments, the epigenetic modification comprises all of the DMRs listed in table 2. In some embodiments, the epigenetic profile comprises at least two DMRs listed in Table 2. In some embodiments, the epigenetic profile comprises six or more, ten or more, fifteen or more, twenty or more, thirty or more, forty or more, fifty or more, sixty or more, seventy or more, eighty or more, ninety or more, or one hundred or more DMRs listed in Table 2.
In some embodiments, the subject may be infertile. In some embodiments, the subject may have a reduced fertility compared to a normal subject that has no fertility issues. In some embodiments, the present disclosure provides administering a treatment to the subject that may be infertile or is at risk of being infertile. In some embodiments, the present disclosure provides administering a treatment to the subject that may have reduced fertility compared to a normal subject. In some embodiments, the treatment may be any hormone therapies that may increase sperm number and motility. The hormone therapy may be administering a therapeutically effective amount of follicle stimulating hormone (FSH), or an analog thereof to a subject. In some embodiments, the hormone therapy may be administering a therapeutically effective amount of human menopausal gonadotropin (hMG), or an analog thereof to the subject. In some embodiments, the treatment may comprise performing in vitro fertilization (IVF), intracytoplasmic sperm injection (ICSI), or any other suitable procedures that may result in successful pregnancy.
Further, the present disclosure provides methods for detecting a methylation alteration of a portion of said nucleic acid sequence comprised in a differential DNA methylation region (DMR) listed in Table 3, thereby generating an epigenetic profile; and analyzing said epigenetic profile using a computer processor by comparing to a reference epigenetic profile of methylation status of at least a portion of corresponding nucleic acid sequence comprised in said DMR listed in Table 3. For example, if a DMR from Table 3 comprises 1000 bp nucleotides, in some embodiments, at least about 50%, about 60%, about 70%, about 80%, about 90%, or about 100% of the 1000 bp nucleotides may be measured to determine the methylation level.
In some embodiments, the methods further comprise determining whether a subject responds to a treatment. This may be used in a clinical test setting to quickly identify subjects that may respond to a certain treatment. In some embodiments, the treatment may be any hormone therapies that may increase sperm number and motility. The hormone therapy may be administering a therapeutically effective amount of follicle stimulating hormone (FSH), or an analog thereof to a subject. In some embodiments, the hormone therapy may be administering a therapeutically effective amount of human menopausal gonadotropin (hMG), or an analog thereof to the subject. In some embodiments, the treatment may comprise performing in vitro fertilization (IVF), intracytoplasmic sperm injection (ICSI), or any other suitable procedures that may result in successful pregnancy.
Methods of measuring methylation levels of DMRs in genomic DNA are well known to one skilled in the art. For example, microarray based methylome profiling and bioinformatics data analysis may be used to analyze DNA methylation profiles. In some embodiments, the microarray chip is a tiling array chip. In some embodiments, Methylated DNA immunoprecipitation (MeDIP) followed by next generation sequencing (NGS) is used. In some embodiments, MeDIP-Chip is used. Additional methods for detecting methylation levels can involve genomic sequencing before and after treatment of the DNA with bisulfite. When sodium bisulfite is contacted to DNA, unmethylated cytosine is converted to uracil, while methylated cytosine is not modified. Bisulfite methods may also be used in conjunction with pyrosequencing and PCR. Computer executable algorithms and software programs for implementing the same are also encompassed by the disclosure. Such software programs generally contain instructions for causing a computer to carry out the steps of the methods disclosed herein. The computer program will be embedded in a non-transient medium such as a hard drive, DVD, CD, thumb drive, etc.
Selection and identification of a subject for analysis may be predicated on and/or influenced by any number of factors. For example, the subject or subjects may be known or suspected to be afflicted with a disease or condition associated with infertility; or who have been or are suspected of having been exposed to an agent that causes, or is suspected of causing, infertility; or who have inexplicably inherited a disease or disease condition from a parent for which no DNA sequence mutation has been identified, etc. Subjects whose DNA is analyzed may be of any age, and in any stage of development, so long as cells containing a DNA sequence of interest can be obtained from the subject. For example, the subject may be an adult, an adolescent, a laboratory animal, etc. The cells from which the DNA is obtained may be any suitable cell, including but not limited to gametes, cells from swabs such as buccal swabs, cells sloughed into amniotic fluid, etc.
The different DMRs disclosed herein may be used as biomarkers for at least two related applications in fertility assessment. The panel of DMRs disclosed herein may serve as a sensitive and non-intrusive testing to diagnose whether a subject is infertile and screen for subjects that are responsive to any fertility or hormone treatment disclosed herein. In some embodiments, the panel of DMRs for indicating an infertility risk comprises at least 10, 20, 30, 40, 50, 60, 70, 80, 90, 100, or 200 DMRs listed in Table 2. The predictive value of a subject having infertility may increase as more DMRs listed in Table 2 are included in the panel. Diagnostic methods described herein for indicating a likelihood of infertility in a subject has a sensitivity that is greater than 75%, greater than 80%, greater than 85%, greater than 90%, greater than 95%, greater than 96%, greater than 97%, greater than 98%, greater than 99%, or about 100%. In some embodiments, the sensitivity is at least about 97%, 98%, 99%, or 99.5%.
In some embodiments, the panel of DMRs for indicating an infertility risk comprises at least 10, 20, 30, 40, or 50 DMRs listed in Table 3. The predictive value of a subject responding to a hormone treatment or fertility treatment may increase as more DMRs listed in Table 3 are included in the panel. Diagnostic methods described herein for determining responsiveness to a hormone treatment or fertility treatment in a subject has a sensitivity that is greater than 75%, greater than 80%, greater than 85%, greater than 90%, greater than 95%, greater than 96%, greater than 97%, greater than 98%, greater than 99%, or about 100%. In some embodiments, the sensitivity is at least about 97%, 98%, 99%, or 99.5%.
The genomic features described herein may be used in a variety of applications. For example, the DMRs of the disclosure can be indicative of having, the risk of having, or the risk of developing infertility or a condition that could lead to pregnancy complications and/or passage of heritable mutations to an infant. Thus the methods of the disclosure may be used, for example, in an in vitro fertilization clinic setting to test sperm for epimutations and for the potential to pass epigenetic information to offspring. The methods of the disclosure are also useful for screening potential sperm donors at a donation center. Further applications include screening applicants for health insurance coverage.
The detection of epigenetic modifications at the regions described herein (i.e. a positive diagnostic result) will suggest or confirm that the subject is infertile and treatments suitable for infertility can be instituted. For example, an appropriate infertility treatment, such as surgical extraction of sperm or FSH therapy, may be implemented. In other instances, a male subject may decide to utilize a sperm donor due to the subject's infertility or to prevent the possibility of pregnancy complications and/or the passage of heritable mutations to an infant attributable to the male subject.
Information concerning the type and extent of epigenetic modification in a subject may be used in a variety of decision making processes undertaken by a subject that is tested. For example, depending on the severity of the symptoms caused by an epigenetic modification that is identified, a subject may decide to forego having children or to terminate a pregnancy in order to prevent transmission of the modification to offspring. Diagnostic tests based on the present disclosure can be included in prenatal testing.
Thus, an aspect of the disclosure provides a method for treating a male subject who is infertile, comprising detecting the presence or absence of an epigenetic modification at one or more regions of at least one genomic DNA sequence or site obtained from a biological sample from said male subject, wherein said epigenetic modification comprises at least one differential DNA methylation region (DMR) listed in table 2; determining that said subject is infertile if said epigenetic modification is identified to be present in said at least one genomic DNA sequence or site; and administering an appropriate treatment protocol to said subject determined to be infertile.
In contrast, a negative result (no significant methylation level change at the site) suggests that the subject is not infertile and does not require an infertility treatment. Ongoing monitoring of the extent of epigenetic modification and methylation level of a site can provide valuable information regarding the outcome of the administration of agents (e.g. drugs or other therapies) which are intended to treat or prevent a condition caused by epimutation, i.e. the therapeutic responsiveness of a patient. Those of skill in the art will recognize that such analyses are generally carried out by comparing the results obtained using an unknown or experimental sample with results obtained a using suitable negative or positive controls, or both.
Subjects whose DNA is analyzed may be suffering from any of a variety of disorders (diseases, conditions, etc.) including but not limited to: various known late or adult onset conditions, such as low sperm production, infertility, abnormalities of sexual organs, kidney abnormalities, prostate disease, immune abnormalities, behavioral effects, etc. In other embodiments, no symptoms are present but screening using the diagnostics is employed to rule out the presence of “silent” epigenetic mutations which could cause disease symptoms in the future, or which could be inherited and cause deleterious effects in offspring.
The DMRs described herein may also be used to identify therapeutic modalities for the treatment of epigenetic mutations. Those of skill in the art will recognize that such methods of screening are typically carried out in vitro, e.g. using a DNA sequence that is immobilized in a vessel, or that is present in a cell. However, such tests may also be carried out in model laboratory animals. In one embodiment, candidate agents which reverse epigenetic modification are screened by analyzing the regions. In another embodiment, candidate agents which prevent epigenetic modifications are screened by analyzing the regions. In this way, the epigenetic biomarkers described herein can be used to facilitate, e.g. drug development and clinical trials patient stratification (i.e. pharmacoepigenomics).
Secondly, the DMRs described herein may also be used to identify responders and non-responders to FSH therapy. In men, FSH acts on the Sertoli cells of the testes to stimulate sperm production (spermatogenesis).
An embodiment of the disclosure provides a method of determining whether a male subject is a responder to FSH treatment comprising detecting the presence or absence of an epigenetic modification at one or more regions of at least one genomic DNA sequence obtained from a biological sample from said male subject, wherein said epigenetic modification comprises at least one differential DNA methylation region (DMR) listed in table 3; and determining that said subject is a responder to FSH treatment if said epigenetic modification is identified to be present in said at least one genomic DNA sequence or site; or determining that said subject is a non-responder to FSH treatment if said epigenetic modification is not identified to be present in said at least one genomic DNA sequence or site.
In some embodiments, FSH treatment is administered to the subject determined to be a responder to FSH treatment. In some embodiments, an infertility treatment other than FSH therapy is administered to the subject determined to be a non-responder, e.g. surgical extraction of sperm.
In some embodiments, a kit is described. The kit comprises at least one polynucleotide that hybrid-izes to one of the DMR loci identified in table 2 or table 3 (or a nucleic acid sequence at least 90% identical to the DMR loci of table 2 or table 3), or that hybridizes to a region of DNA flanking one of the DMR loci identified in table 2 or table 3, and at least one reagent for detection of gene meth-ylation. Reagents for detection of methylation include, e.g., sodium bisulfite, polynucleotides de-signed to hybridize to sequence at or near the DMR loci of the disclosure if the sequence is not methylated, and/or a methylation-sensitive or methylation-dependent restriction enzyme. The kit may comprise bisulfite. The kits can provide solid supports in the form of an assay apparatus that is adapted to use in the assay. The kit may comprise a microarray chip or a DNA sequencing kit for sequencing any DMRs. The kit may further comprise detectable labels, optionally linked to a poly-nucleotide, e.g., a probe, in the kit. Other materials useful in the performance of the assays can also be included in the kit, including test tubes, transfer pipettes, and the like. The kit can also include written instructions for the use of one or more of these reagents in any of the assays described herein.
The present disclosure provides computer systems that are programmed to implement methods of the disclosure.
The computer system 201 includes a central processing unit (CPU, also “processor” and “computer processor” herein) 205, which can be a single core or multi core processor, or a plurality of processors for parallel processing. The computer system 201 also includes memory or memory location 210 (e.g., random-access memory, read-only memory, flash memory), electronic storage unit 215 (e.g., hard disk), communication interface 220 (e.g., network adapter) for communicating with one or more other systems, and peripheral devices 225, such as cache, other memory, data storage and/or electronic display adapters. The memory 210, storage unit 215, interface 220 and peripheral devices 225 are in communication with the CPU 205 through a communication bus (solid lines), such as a motherboard. The storage unit 215 can be a data storage unit (or data repository) for storing data. The computer system 201 can be operatively coupled to a computer network (“network”) 230 with the aid of the communication interface 220. The network 230 can be the Internet, an internet and/or extranet, or an intranet and/or extranet that is in communication with the Internet. The network 230 in some cases is a telecommunication and/or data network. The network 230 can include one or more computer servers, which can enable distributed computing, such as cloud computing. The network 230, in some cases with the aid of the computer system 201, can implement a peer-to-peer network, which may enable devices coupled to the computer system 201 to behave as a client or a server.
The CPU 205 can execute a sequence of machine-readable instructions, which can be embodied in a program or software. The instructions may be stored in a memory location, such as the memory 210. The instructions can be directed to the CPU 205, which can subsequently program or otherwise configure the CPU 205 to implement methods of the present disclosure. Examples of operations performed by the CPU 205 can include fetch, decode, execute, and writeback.
The CPU 205 can be part of a circuit, such as an integrated circuit. One or more other components of the system 201 can be included in the circuit. In some cases, the circuit is an application specific integrated circuit (ASIC).
The storage unit 215 can store files, such as drivers, libraries and saved programs. The storage unit 215 can store user data, e.g., user preferences and user programs. The computer system 201 in some cases can include one or more additional data storage units that are external to the computer system 201, such as located on a remote server that is in communication with the computer system 201 through an intranet or the Internet.
The computer system 201 can communicate with one or more remote computer systems through the network 230. For instance, the computer system 201 can communicate with a remote computer system of a user. Examples of remote computer systems include personal computers (e.g., portable PC), slate or tablet PC's (e.g., Apple® iPad, Samsung® Galaxy Tab), telephones, Smart phones (e.g., Apple® iPhone, Android-enabled device, Blackberry®), or personal digital assistants. The user can access the computer system 201 via the network 230.
Methods as described herein can be implemented by way of machine (e.g., computer processor) executable code stored on an electronic storage location of the computer system 201, such as, for example, on the memory 210 or electronic storage unit 215. The machine executable or machine-readable code can be provided in the form of software. During use, the code can be executed by the processor 205. In some cases, the code can be retrieved from the storage unit 215 and stored on the memory 210 for ready access by the processor 205. In some situations, the electronic storage unit 215 can be precluded, and machine-executable instructions are stored on memory 210.
The code can be pre-compiled and configured for use with a machine having a processer adapted to execute the code or can be compiled during runtime. The code can be supplied in a programming language that can be selected to enable the code to execute in a pre-compiled or as-compiled fashion.
Aspects of the systems and methods provided herein, such as the computer system 201, can be embodied in programming. Various aspects of the technology may be thought of as “products” or “articles of manufacture” typically in the form of machine (or processor) executable code and/or associated data that is carried on or embodied in a type of machine readable medium. Machine-executable code can be stored on an electronic storage unit, such as memory (e.g., read-only memory, random-access memory, flash memory) or a hard disk. “Storage” type media can include any or all of the tangible memory of the computers, processors or the like, or associated modules thereof, such as various semiconductor memories, tape drives, disk drives and the like, which may provide non-transitory storage at any time for the software programming. All or portions of the software may at times be communicated through the Internet or various other telecommunication networks. Such communications, for example, may enable loading of the software from one computer or processor into another, for example, from a management server or host computer into the computer platform of an application server. Thus, another type of media that may bear the software elements includes optical, electrical and electromagnetic waves, such as used across physical interfaces between local devices, through wired and optical landline networks and over various air-links. The physical elements that carry such waves, such as wired or wireless links, optical links or the like, also may be considered as media bearing the software. As used herein, unless restricted to non-transitory, tangible “storage” media, terms such as computer or machine “readable medium” refer to any medium that participates in providing instructions to a processor for execution.
Hence, a machine readable medium, such as computer-executable code, may take many forms, including but not limited to, a tangible storage medium, a carrier wave medium or physical transmission medium. Non-volatile storage media include, for example, optical or magnetic disks, such as any of the storage devices in any computer(s) or the like, such as may be used to implement the databases, etc. shown in the drawings. Volatile storage media include dynamic memory, such as main memory of such a computer platform. Tangible transmission media include coaxial cables; copper wire and fiber optics, including the wires that comprise a bus within a computer system. Carrier-wave transmission media may take the form of electric or electromagnetic signals, or acoustic or light waves such as those generated during radio frequency (RF) and infrared (IR) data communications. Common forms of computer-readable media therefore include for example: a floppy disk, a flexible disk, hard disk, magnetic tape, any other magnetic medium, a CD-ROM, DVD or DVD-ROM, any other optical medium, punch cards paper tape, any other physical storage medium with patterns of holes, a RAM, a ROM, a PROM and EPROM, a FLASH-EPROM, any other memory chip or cartridge, a carrier wave transporting data or instructions, cables or links transporting such a carrier wave, or any other medium from which a computer may read programming code and/or data. Many of these forms of computer readable media may be involved in carrying one or more sequences of one or more instructions to a processor for execution.
The computer system 201 can include or be in communication with an electronic display 235 that comprises a user interface (UI) 240 for providing, for example, measurements of the reproductive hormone (e.g., DHEA, DHEA-S, estradiol, progesterone, testosterone, and/or AMH). Examples of UI's include, without limitation, a graphical user interface (GUI) and web-based user interface.
Methods and systems of the present disclosure can be implemented by way of one or more algorithms. An algorithm can be implemented by way of software upon execution by the central processing unit 205. The algorithm can, for example, determine the levels of DHEA, DHEA-S, estradiol, progesterone, testosterone, and/or AMH in a biological sample.
The computer processor may be further programmed to direct the assaying a nucleic acid sequence from at least a portion of a sperm sample from a subject. The computer processor may be further programmed to direct the detecting a methylation alteration of a portion of said nucleic acid sequence comprised in a differential DNA methylation region (DMR) listed in Table 2, thereby generating an epigenetic profile. The computer processor may be further programmed to direct the analyzing said epigenetic profile using a computer processor to compare said epigenetic profile with a reference epigenetic profile of a methylation level of corresponding portion(s) of said nucleic acid sequence comprised in said DMR listed in Table 2, wherein when said DMR is DMRMT:1, a second DMR is detected and analyzed.
The computer processor may be further programmed to transmit a result via a communication medium. In some embodiments, the result may comprise an epigenetic profile, a reference epigenetic profile, or both. In some embodiments, the result may comprise a likelihood whether the tested subject has a fertility issue, a likelihood whether the subject responds to a treatment disclosed herein, or both. In some embodiments, the result may comprise a recommendation for treating infertility.
Further, the computer processor may be further programmed to direct the assaying a nucleic acid sequence from at least a portion of a sperm sample from a subject. The computer processor may be further programmed to direct the detecting a methylation alteration of a portion of said nucleic acid sequence comprised in a differential DNA methylation region (DMR) listed in Table 3, thereby generating an epigenetic profile. The computer processor may be further programmed to direct the analyzing said epigenetic profile using a computer processor by comparing to a reference epigenetic profile of methylation status of corresponding portion(s) of said nucleic acid sequence comprised in said DMR listed in Table 3.
Where a range of values is provided, it is understood that each intervening value, to the tenth of the unit of the lower limit unless the context clearly dictates otherwise, between the upper and lower limit of that range and any other stated or intervening value in that stated range, is encompassed within the disclosure. The upper and lower limits of these smaller ranges may independently be included in the smaller ranges and are also encompassed within the disclosure, subject to any specifically excluded limit in the stated range. Where the stated range includes one or both of the limits, ranges excluding either or both of those included limits are also included in the disclosure.
In addition, unless otherwise indicated, numbers expressing quantities of ingredients, constituents, reaction conditions and so forth used in the specification and claims are to be understood as being modified by the term “about.” Accordingly, unless indicated to the contrary, the numerical parameters set forth in the specification and attached claims are approximations that may vary depending upon the desired properties sought to be obtained by the subject matter presented herein. At the very least, and not as an attempt to limit the application of the doctrine of equivalents to the scope of the claims, each numerical parameter should at least be construed in light of the number of reported significant digits and by applying ordinary rounding techniques. Notwithstanding that the numerical ranges and parameters setting forth the broad scope of the subject matter presented herein are approximations, the numerical values set forth in the specific examples are reported as precisely as possible. Any numerical values, however, inherently contain certain errors necessarily resulting from the standard deviation found in their respective testing measurements.
The present example identifies a molecular biomarker or diagnostic for male infertility and provides the proof of concept that an epigenetic analysis is useful. Previously, an analysis for DNA methylation using a microarray of CpG islands and methylation sites constituting a couple percent of the human genome to identify altered methylation in sperm from infertility patients was performed. Observations are expanded in the current study with a genome-wide analysis that constitutes 95% of the human genome and advanced molecular analysis.
As disclosed herein, a genome-wide analysis of DNA methylation identifies a male infertility signature of DMRs that are present in male infertility patients. There was an efficient separation between the fertile versus infertile patient population with minimal overlap. A validation with a test set of infertile and fertile patients, not used in the initial establishment of the infertility DMRs, also efficiently separated the infertile versus the fertile patients. The infertility signature of DMRs was found in all the infertile patients' sperm samples showing the efficiency of the molecular biomarkers. The majority of the DNA methylation changes involved an increase in DNA methylation (i.e. hypermethylation), which suggests during early gametogenesis and/or spermatogenesis development of the sperm a hypermethylation may be an aspect of the male infertility molecular disease etiology. The development of a male infertility diagnostic is useful for the clinical management of the male infertility patient. Due to the increasing male infertility in the human population over the past fifty years, a greater demand for such an analysis in an assisted reproduction setting such as an IVF clinic is anticipated.
Observations also demonstrate that an epigenetic DNA methylation biomarker can identify pharmaceutical responders versus non-responders to FSH treatment among male infertility patients. The infertility responder versus non-responder DMR signature identified efficiently distinguished the two populations, and in contrast to the infertility diagnostic, the responder DMR signature involved an equal distribution of hypermethylation (increase) and hypomethylation (decrease) changes. No overlap was observed between the infertility DMRs and responder DMRs, suggesting a distinct set of epigenetic alterations. The current FSH therapeutic preparations in combination with this responder diagnostic allows for more effective patient management for infertility.
An initial sperm sample was collected upon enrollment, a second at the start of treatment, and a third after three months of treatment. Twenty-one patients were enrolled which included nine patients in the fertile control group and twelve in the infertility treatment group. The differences (mean±SD) between the seminal sample and hormonal parameters of both groups are shown in Table 1. Results from the baseline variables from the group of fertile subjects and those with infertility showed that there is a statistically significant difference in sperm number (i.e. concentration) between the fertile group and the infertile group, with the latter having the lowest values (95% CI-83, −2.87), p<0.001. Infertility patient samples also have a lower percentage of sperm motility, 95% CI [−2.62, 1.58], and p<0.001. The control group (fertile) showed lower FSHI levels than the infertility group, 95% CI [0.20, 0.95], p=0.005. Although not statistically significant, basal estradiol levels are higher in the group of subjects with infertility, 95% CI [−0.03, 0.89], p=0.06. Regarding the results in the infertility group after three months of FSH treatment (150 IU dose of FSH therapeutic three times per week), there was an increase in FSHI levels after treatment, although not statistically significant, p=0.063. However, the estimated confidence interval 95% difference should not be underestimated, 95% CI [−0.02, 0.73]. There was no statistically significant difference in regards to the group mean±SD in the other variables analyzed before and after treatment. In terms of pregnancy rate, there were three pregnancies ( 3/10, 30%). Two occurred after ICSI procedures, one was spontaneous, and seven non-pregnancies ( 7/10, 70%). There are two patients pending of ICSI procedure with frozen samples.
Table 1. Hormone, semen and sperm parameters. The mean±SD values for age (years), seminal volume (mL), sperm concentration (million/mL), motility (%), immotility (%), FSHI (IU/mL), LHI (IU/mL), estradiol (pg/mL), and testosterone (ng/mL).
The fertile control and infertile treatment for baseline and after 3 months is presented with n-value indicated for each Individual patient information was used to identify the infertility patient responsiveness or non-responsiveness to FSH therapy. The infertility patients that show a 2-3 fold increase in sperm number (semen concentration) and/or motility following three month treatment are shown in
Individual patient samples from the initial sperm sample collected upon enrollment, the sample at the start of the FSH therapy treatment, and the sample after 3 months of treatment were prepared for epigenetic analysis. The DNA was extracted from the sperm then fragmented for a methylated DNA immunoprecipitation (MeDIP) analysis in order to identify differential DNA methylated regions (DMRs). The MeDIP is a genome-wide analysis examining 95% of the genome comprising low density CpG regions in comparison to the less than 5% of the genome of high density regions and CpG islands. The MeDIP DNA is then prepared for next generation DNA sequencing and bioinformatic analyses, as described in the Materials and Methods section. A comparison of the sequences derived from fertile versus infertile patient sperm identified DMRs for infertility assessment, as shown in
All the infertility patients had a sperm collection prior to a three-month FSH therapeutic treatment period after which another sperm sample was collected for analysis.
The genomic features of the infertility DMRs and FSH therapeutic responder DMRs were investigated. The chromosomal locations of the DMRs within the human genome are presented in
The statistical significance and associations of the DMRs for each comparison was investigated. A principal component analysis (PCA) of the infertility versus fertility DMR principal components is presented in
The current study was designed to identify a molecular biomarker or diagnostic for male infertility and provide that an epigenetic analysis will be useful. Previously, researchers utilized an analysis for DNA methylation using a microarray of CpG islands and methylation sites constituting a couple percent of the human genome to identify altered methylation in sperm from infertility patients. Observations are expanded in the current study with a genome wide analysis that constitutes 95% of the human genome and advanced molecular analysis.
Observations from the current study demonstrate a genome-wide analysis of DNA methylation identifies a male infertility signature of DMRs that are present in male infertility patients. There was an efficient separation between the fertile versus infertile patient population with minimal overlap. A validation with a test set of infertile and fertile patients, not used in the initial establishment of the infertility DMRs, also distinctively and efficiently separated the infertile versus the fertile patients. The infertility signature of DMRs was found in all the infertile patients' sperm samples showing the efficiency of the molecular biomarkers. The majority of the DNA methylation change involved an increase in DNA methylation (i.e. hypermethylation), which suggests during early gametogenesis and/or spermatogenesis development of the sperm a hypermethylation may be an aspect of the male infertility molecular disease etiology.
Observations also demonstrate that an epigenetic DNA methylation biomarker can be used to identify pharmaceutical responders versus non-responders to FSH treatment among male infertility patients. The infertility responder versus non-responder DMR signature identified efficiently distinguished the two populations, and in contrast to the infertility diagnostic, the responder DMR signature involved an equal distribution of hypermethylation (increase) and hypomethylation (decrease) changes. No overlap was observed between the infertility DMRs and responder DMRs, suggesting a distinct set of epigenetic alterations.
In conclusion, the current study identified a male infertility epigenetic DMR signature for use as a diagnostic, as well as an FSH therapy response diagnostic within this patient population. The advancement of such technology is anticipated to enhance the diagnosis and management of male infertility patients, as well as improve general therapeutic options and therapeutic development.
Materials and Methods
Clinical Sample Collection and Analysis
A single center (Urology Department at Hospital Universitari i Politècnic La Fe), prospective and open clinical study. The IRB approval code protocol 2015-002521-19. We included two groups (infertility vs fertility). The infertility men (inability of the couple to become pregnant after one year of sexual activity), included Caucasians between 25-45 years of age with a total sperm concentration (concentration in millions/mL×volume in mL) between 1-10 million (oligozoospermia) in at least 2 spermiograms obtained after a 2-4 day period of sexual abstinence and with a 7-day separation period between tests. The hormone profile used inclusion criteria of FSH 2-12 IU/mL, total testosterone>300 ng/mL and bioavailable testosterone (calculated with the Sexual Hormone Binding Globulin or SHBG albumin)>145 ng/dL. The fertile control group included Caucasians without vasectomy and had a child in the last five years with a sperm concentration and motility above the 50th percentile according to the parameters set forth in the 5th edition of the World Health Organization (WHO) guidelines in at least two spermiograms obtained after a 2-4 day period of sexual abstinence and with a 7-day period between tests. The hormones profiled used inclusion criteria of estradiol<50 pg/mL, FSH<4.5 IU/L, total testosterone>300 ng/dL and bioavailable testosterone>145 ng/dL.
Initial semen analysis and basal hormone determination to assess eligibility criteria were performed. Sperm samples were processed and stored for the subsequent epigenetic analysis. The infertility group received 150 IU of urinary or recombinant FSH three times per week for 12 weeks and the fertile control group did not received treatment. After three months of treatment, semen analysis and hormone profiles were retested in both groups. The sperm samples of three months with treatment for infertility and three months after for control group were processed and stored for the epigenetic test.
DNA Preparation
Frozen human sperm samples were stored at −20 C and thawed for analysis. Genomic DNA from sperm was prepared as follows: A minimum of a one hundred μl of sperm suspension was used then 820 μl DNA extraction buffer (50 mM Tris pH 8, 10 mM EDTA pH 8, 0.5% SDS) and 80 μl 0.1 M Dithiothreitol (DTT) was added and the sample incubated at 65 C for 15 minutes. 80 μl Proteinase K (20 mg/ml) was added and the sample incubated on a rotator at 55 C for at least 2 hours. After incubation, 300 μl of protein precipitation solution (Promega, A795A, Madison, Wis.) was added, the sample was mixed and incubated on ice for 15 minutes, then spun at 4 C at 13,000 rpm for 30 minutes. The supernatant was transferred to a fresh tube, then precipitated over night at −20 C with the same volume 100% isopropanol and 2 μl glycoblue. The sample was then centrifuged and the pellet was washed with 75% ethanol, then air-dried and resuspended in 100 μl H2O. DNA concentration was measured using the Nanodrop (Thermo Fisher, Waltham, Mass.). The freeze-thaw will destroy any contaminating somatic cells within the sperm collection.
Methylated DNA Immunoprecipitation (MeDIP)
Methylated DNA Immunoprecipitation (MeDIP) with genomic DNA was performed as follows: individual sperm DNA samples were diluted to 130 μl with 1× Tris-EDTA (TE, 10 mM Tris, 1 mM EDTA) and sonicated with a COVARIS® M220 ultrasonicator using the 300 bp setting. Fragment size was verified on a 2% E-gel agarose gel. The sonicated DNA was transferred from the tube to a 1.7 ml microfuge tube and the volume was measured. The sonicated DNA was then diluted with TE buffer (10 mM Tris HCl, pH7.5; 1 mM EDTA) to 400 μl, heat-denatured for 10 min at 95 C, then immediately cooled on ice for 10 min. Then 100 μl of 5× IP buffer and 5 μg of antibody (monoclonal mouse anti 5-methyl cytidine; Diagenode #C15200006) were added to the denatured sonicated DNA. The DNA-antibody mixture was incubated overnight on a rotator at 4 C. The following day magnetic beads (DYNABEADS® M-280 Sheep anti-Mouse IgG; 11201D) were pre-washed as follows: The beads were resuspended in the vial, then the appropriate volume (50 μl per sample) was transferred to a microfuge tube. The same volume of Washing Buffer (at least 1 mL 1×PBS with 0.1% BSA and 2 mM EDTA) was added and the bead sample was resuspended. The tube was then placed into a magnetic rack for 1-2 minutes and the supernatant was discarded. The tube was removed from the magnetic rack and the beads were washed once. The washed beads were resuspended in the same volume of 1×IP buffer (50 mM sodium phosphate ph7.0, 700 mM NaCl, 0.25% TritonX-100) as the initial volume of beads. 50 μl of beads were added to the 500 μl of DNA-antibody mixture from the overnight incubation, then incubated for 2 h on a rotator at 4 C. After the incubation the bead-antibody-DNA complex was washed three times with 1×IP buffer as follows: The tube was placed into a magnetic rack for 1-2 minutes and the supernatant was discarded, then washed with 1×IP buffer 3 times. The washed bead-DNA solution was then resuspended in 250 μl digestion buffer with 3.5 μl Proteinase K (20 mg/ml). The sample was then incubated for 2-3 hours on a rotator at 55 C and then 250 gi of buffered Phenol-Chloroform-Isoamylalcohol solution was added to the sample and the tube was vortexed for 30 sec and then centrifuged at 14,000 rpm for 5 min at room temperature. The aqueous supernatant was carefully removed and transferred to a fresh microfuge tube. Then 250 μl chloroform were added to the supernatant from the previous step, vortexed for 30 sec and centrifuged at 14,000 rpm for 5 min at room temperature. The aqueous supernatant was removed and transferred to a fresh microfuge tube. To the supernatant 21 of glycoblue (20 mg/ml), 20 μl of 5M NaCl and 500 μl ethanol were added and mixed well, then precipitated in −20 C freezer for 1 hour to overnight. The precipitate was centrifuged at 14,000 rpm for 20 min at 4° C. and the supernatant was removed, while not disturbing the pellet. The pellet was washed with 500 μl cold 70% ethanol in −20 C freezer for 15 min. then centrifuged again at 14,000 rpm for 5 min at 4 C and the supernatant was discarded. The tube was spun again briefly to collect residual ethanol to the bottom of the tube and as much liquid as possible was removed with gel loading tip. The pellet was air-dried at RT until it looked dry (about 5 minutes) then resuspended in 20 μl H2O or TE. DNA concentration was measured in a QUBIT® fluorometer (Life Technologies) with ssDNA kit (Molecular Probes Q10212).
MeDIP-Seq Analysis
The MeDIP DNA samples were used to create libraries for next generation sequencing (NGS) using the NEBNEXT® ULTRATM RNA Library Prep Kit for ILLUMINA® (San Diego, Calif.) starting at step 1.4 of the manufacturer's protocol to generate double stranded DNA. After this step the manufacturer's protocol was followed. Each sample received a separate index primer. NGS was performed at WSU Spokane Genomics Core using the ILLUMINA HISEQ® 2500 high-throughput sequencing system with a PE50 application, with a read size of approximately 50 bp and approximately 20-25 million reads per sample and 9-10 sample libraries each were run in one lane.
Bioinformatics and Statistics
Basic read quality was verified using summaries produced by the FastQC program. Reads were filtered and trimmed to remove low quality base pairs using Trimmomatic. Samples with elevated read depths were randomly subsampled to obtain more consistent read depths across all samples. The reads for each sample were mapped to the GRCh38 human genome using Bowtie2 with default parameter options. The mapped read files were then converted to sorted BAM files using SAMtools. To identify DMR, the reference genome was broken into 1000 bp windows. The MEDIPS R package was used to calculate differential coverage between control and exposure sample groups. The edgeR p-value was used to determine the relative difference between the two groups for each genomic window. Windows with an edgeR p-value less than 10 were considered DMRs. The DMR edges were extended until no genomic window with an edgeR p-value less than 0.1 remained within 1000 bp of the DMR. CpG density and other information was then calculated for the DMR based on the reference genome. DMR were annotated using the biomaRt R package 31 to access the Ensembl database. The genes that overlapped with DMR were then input into the KEGG pathway search to identify associated pathways. The DMR associated genes were then sorted into functional groups using information provided by the DAVID and Panther databases incorporated into an internal curated database. All MeDIP-Seq genomic data obtained in the current study have been deposited in the NCBI public GEO database.
A permutation analysis to determine the significance of the number of DMR identified for each comparison was performed. For this analysis, samples from the two treatment groups were randomly assigned group membership. The number of samples in each treatment group was held constant. Twenty random permutations of each analysis were performed to obtain a null distribution for the expected number of DMR.
Statistical Analysis
In order to characterize clinical parameters of both groups (control and treatment group), a numerical descriptive analysis has been made using the mean with standard deviation (SD) and the median (1st and 3rd quartile). The baseline differences between the treatment group and the control group were then compared, as well as the effect of FSH between the before and after treatment in the treated group, in all variables collected. For this, we have used mixed linear regression models in case we had several measures per patient (semen volume and sperm concentration), and in the case of motility a beta logistic regression model was performed given its percentage character. The mixed models control the non-independence of data given that there are several measures per patient.
In the fertile group both baseline and 3-month measures were considered, because no difference was expected. On the other hand, in the infertile treatment group, two samples were extracted from these three variables (volume, concentration and motility). In this way, the power increases and there is a greater probability that we detect differences. In all other cases, associations between variables have been studied using linear regression models. The statistical analyses were performed with the statistical software R (version 3.4.1) and the packages nlme (version 3.1-131), lme4 (1.1-13), glmmADMB (0.8.3.3) and betareg (version 3.1-0). A p-value of less than 0.05 was considered statistically significant.
While preferred embodiments of the present disclosure have been shown and described herein, it will be obvious to those skilled in the art that such embodiments are provided by way of example only. It is not intended that the disclosure be limited by the specific examples provided within the specification. While the disclosure has been described with reference to the aforementioned specification, the descriptions and illustrations of the embodiments herein are not meant to be construed in a limiting sense. Numerous variations, changes, and substitutions will now occur to those skilled in the art without departing from the disclosure. Furthermore, it shall be understood that all aspects of the disclosure are not limited to the specific depictions, configurations or relative proportions set forth herein which depend upon a variety of conditions and variables. It should be understood that various alternatives to the embodiments of the disclosure described herein may be employed in practicing the disclosure. It is therefore contemplated that the disclosure shall also cover any such alternatives, modifications, variations or equivalents. It is intended that the following claims define the scope of the disclosure and that methods and structures within the scope of these claims and their equivalents be covered thereby.
This application claims the benefit of U.S. Provisional Patent Application No. 62/887,000, filed Aug. 15, 2019, which is entirely incorporated herein by reference.
Filing Document | Filing Date | Country | Kind |
---|---|---|---|
PCT/US2020/046455 | 8/14/2020 | WO |
Number | Date | Country | |
---|---|---|---|
62887000 | Aug 2019 | US |